Search | arXiv e-print repository

Using Contact to Increase Robot Performance for Glovebox D&D Tasks

Authors: Aykut Onol, Philip Long, Taskin Padir

Abstract: Glovebox decommissioning tasks usually require manipulating relatively heavy objects in a highly constrained environment. Thus, contact with the surroundings becomes inevitable. In order to allow the robot to interact with the environment in a natural way, we present a contact-implicit motion planning framework. This framework enables the system, without the specification in advance of a contact p… ▽ More Glovebox decommissioning tasks usually require manipulating relatively heavy objects in a highly constrained environment. Thus, contact with the surroundings becomes inevitable. In order to allow the robot to interact with the environment in a natural way, we present a contact-implicit motion planning framework. This framework enables the system, without the specification in advance of a contact plan, to make and break contacts to maintain stability while performing a manipulation task. In this method, we use linear complementarity constraints to model rigid body contacts and find a locally optimal solution for joint displacements and magnitudes of support forces. Then, joint torques are calculated such that the support forces have the highest priority. We evaluate our framework in a 2.5D, quasi-static simulation in which a humanoid robot with planar arms manipulates a heavy object. Our results suggest that the proposed method provides the robot with the ability to balance itself by generating support forces on the environment while simultaneously performing the manipulation task. △ Less

Submitted 11 July, 2018; originally announced July 2018.

Comments: 11 pages, 5 figures; Accepted for publication in Waste Management Symposia 2018

arXiv:1806.01425 [pdf, other]

doi 10.1109/IROS.2018.8594284

A Comparative Analysis of Contact Models in Trajectory Optimization for Manipulation

Authors: Aykut Ozgun Onol, Philip Long, Taskin Padir

Abstract: In this paper, we analyze the effects of contact models on contact-implicit trajectory optimization for manipulation. We consider three different approaches: (1) a contact model that is based on complementarity constraints, (2) a smooth contact model, and our proposed method (3) a variable smooth contact model. We compare these models in simulation in terms of physical accuracy, quality of motions… ▽ More In this paper, we analyze the effects of contact models on contact-implicit trajectory optimization for manipulation. We consider three different approaches: (1) a contact model that is based on complementarity constraints, (2) a smooth contact model, and our proposed method (3) a variable smooth contact model. We compare these models in simulation in terms of physical accuracy, quality of motions, and computation time. In each case, the optimization process is initialized by setting all torque variables to zero, namely, without a meaningful initial guess. For simulations, we consider a pushing task with varying complexity for a 7 degrees-of-freedom robot arm. Our results demonstrate that the optimization based on the proposed variable smooth contact model provides a good trade-off between the physical fidelity and quality of motions at the cost of increased computation time. △ Less

Submitted 30 July, 2018; v1 submitted 4 June, 2018; originally announced June 2018.

Comments: 6 pages, 7 figures, 4 tables, IROS 2018 camera-ready version

arXiv:1805.10408 [pdf, other]

The Singular Values of Convolutional Layers

Authors: Hanie Sedghi, Vineet Gupta, Philip M. Long

Abstract: We characterize the singular values of the linear transformation associated with a standard 2D multi-channel convolutional layer, enabling their efficient computation. This characterization also leads to an algorithm for projecting a convolutional layer onto an operator-norm ball. We show that this is an effective regularizer; for example, it improves the test error of a deep residual network usin… ▽ More We characterize the singular values of the linear transformation associated with a standard 2D multi-channel convolutional layer, enabling their efficient computation. This characterization also leads to an algorithm for projecting a convolutional layer onto an operator-norm ball. We show that this is an effective regularizer; for example, it improves the test error of a deep residual network using batch normalization on CIFAR-10 from 6.2\% to 5.3\%. △ Less

Submitted 5 March, 2019; v1 submitted 25 May, 2018; originally announced May 2018.

Comments: Published as a conference paper at ICLR 2019

arXiv:1804.05012 [pdf, ps, other]

Representing smooth functions as compositions of near-identity functions with implications for deep network optimization

Authors: Peter L. Bartlett, Steven N. Evans, Philip M. Long

Abstract: We show that any smooth bi-Lipschitz $h$ can be represented exactly as a composition $h_m \circ ... \circ h_1$ of functions $h_1,...,h_m$ that are close to the identity in the sense that each $\left(h_i-\mathrm{Id}\right)$ is Lipschitz, and the Lipschitz constant decreases inversely with the number $m$ of functions composed. This implies that $h$ can be represented to any accuracy by a deep residu… ▽ More We show that any smooth bi-Lipschitz $h$ can be represented exactly as a composition $h_m \circ ... \circ h_1$ of functions $h_1,...,h_m$ that are close to the identity in the sense that each $\left(h_i-\mathrm{Id}\right)$ is Lipschitz, and the Lipschitz constant decreases inversely with the number $m$ of functions composed. This implies that $h$ can be represented to any accuracy by a deep residual network whose nonlinear layers compute functions with a small Lipschitz constant. Next, we consider nonlinear regression with a composition of near-identity nonlinear maps. We show that, regarding Fréchet derivatives with respect to the $h_1,...,h_m$, any critical point of a quadratic criterion in this near-identity region must be a global minimizer. In contrast, if we consider derivatives with respect to parameters of a fixed-size residual network with sigmoid activation functions, we show that there are near-identity critical points that are suboptimal, even in the realizable case. Informally, this means that functional gradient methods for residual networks cannot get stuck at suboptimal critical points corresponding to near-identity layers, whereas parametric gradient methods for sigmoidal residual networks suffer from suboptimal critical points in the near-identity region. △ Less

Submitted 16 April, 2018; v1 submitted 13 April, 2018; originally announced April 2018.

arXiv:1802.06093 [pdf, ps, other]

Gradient descent with identity initialization efficiently learns positive definite linear transformations by deep residual networks

Authors: Peter L. Bartlett, David P. Helmbold, Philip M. Long

Abstract: We analyze algorithms for approximating a function $f(x) = Φx$ map** $\Re^d$ to $\Re^d$ using deep linear neural networks, i.e. that learn a function $h$ parameterized by matrices $Θ_1,...,Θ_L$ and defined by $h(x) = Θ_L Θ_{L-1} ... Θ_1 x$. We focus on algorithms that learn through gradient descent on the population quadratic loss in the case that the distribution over the inputs is isotropic.… ▽ More We analyze algorithms for approximating a function $f(x) = Φx$ map** $\Re^d$ to $\Re^d$ using deep linear neural networks, i.e. that learn a function $h$ parameterized by matrices $Θ_1,...,Θ_L$ and defined by $h(x) = Θ_L Θ_{L-1} ... Θ_1 x$. We focus on algorithms that learn through gradient descent on the population quadratic loss in the case that the distribution over the inputs is isotropic. We provide polynomial bounds on the number of iterations for gradient descent to approximate the least squares matrix $Φ$, in the case where the initial hypothesis $Θ_1 = ... = Θ_L = I$ has excess loss bounded by a small enough constant. On the other hand, we show that gradient descent fails to converge for $Φ$ whose distance from the identity is a larger constant, and we show that some forms of regularization toward the identity in each layer do not help. If $Φ$ is symmetric positive definite, we show that an algorithm that initializes $Θ_i = I$ learns an $ε$-approximation of $f$ using a number of updates polynomial in $L$, the condition number of $Φ$, and $\log(d/ε)$. In contrast, we show that if the least squares matrix $Φ$ is symmetric and has a negative eigenvalue, then all members of a class of algorithms that perform gradient descent with identity initialization, and optionally regularize toward the identity in each layer, fail to converge. We analyze an algorithm for the case that $Φ$ satisfies $u^{\top} Φu > 0$ for all $u$, but may not be symmetric. This algorithm uses two regularizers: one that maintains the invariant $u^{\top} Θ_L Θ_{L-1} ... Θ_1 u > 0$ for all $u$, and another that "balances" $Θ_1, ..., Θ_L$ so that they have the same singular values. △ Less

Submitted 18 June, 2018; v1 submitted 16 February, 2018; originally announced February 2018.

arXiv:1801.09834 [pdf, ps, other]

A Flexible Procedure for Mixture Proportion Estimation in Positive-Unlabeled Learning

Authors: Zhenfeng Lin, James P. Long

Abstract: Positive--unlabeled (PU) learning considers two samples, a positive set P with observations from only one class and an unlabeled set U with observations from two classes. The goal is to classify observations in U. Class mixture proportion estimation (MPE) in U is a key step in PU learning. Blanchard et al. [2010] showed that MPE in PU learning is a generalization of the problem of estimating the p… ▽ More Positive--unlabeled (PU) learning considers two samples, a positive set P with observations from only one class and an unlabeled set U with observations from two classes. The goal is to classify observations in U. Class mixture proportion estimation (MPE) in U is a key step in PU learning. Blanchard et al. [2010] showed that MPE in PU learning is a generalization of the problem of estimating the proportion of true null hypotheses in multiple testing problems. Motivated by this idea, we propose reducing the problem to one dimension via construction of a probabilistic classifier trained on the P and U data sets followed by application of a one--dimensional mixture proportion method from the multiple testing literature to the observation class probabilities. The flexibility of this framework lies in the freedom to choose the classifier and the one--dimensional MPE method. We prove consistency of two mixture proportion estimators using bounds from empirical process theory, develop tuning parameter free implementations, and demonstrate that they have competitive performance on simulated waveform data and a protein signaling problem. △ Less

Submitted 9 January, 2020; v1 submitted 29 January, 2018; originally announced January 2018.

Comments: 28 pages (including 9 pages of Technical Notes), 4 figures, 1 table

arXiv:1709.10082 [pdf, other]

Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning

Authors: Pinxin Long, Tingxiang Fan, Xinyi Liao, Wenxi Liu, Hao Zhang, Jia Pan

Abstract: Develo** a safe and efficient collision avoidance policy for multiple robots is challenging in the decentralized scenarios where each robot generate its paths without observing other robots' states and intents. While other distributed multi-robot collision avoidance systems exist, they often require extracting agent-level features to plan a local collision-free action, which can be computational… ▽ More Develo** a safe and efficient collision avoidance policy for multiple robots is challenging in the decentralized scenarios where each robot generate its paths without observing other robots' states and intents. While other distributed multi-robot collision avoidance systems exist, they often require extracting agent-level features to plan a local collision-free action, which can be computationally prohibitive and not robust. More importantly, in practice the performance of these methods are much lower than their centralized counterparts. We present a decentralized sensor-level collision avoidance policy for multi-robot systems, which directly maps raw sensor measurements to an agent's steering commands in terms of movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we present a multi-scenario multi-stage training framework to find an optimal policy which is trained over a large number of robots on rich, complex environments simultaneously using a policy gradient based reinforcement learning algorithm. We validate the learned sensor-level collision avoidance policy in a variety of simulated scenarios with thorough performance evaluations and show that the final learned policy is able to find time efficient, collision-free paths for a large-scale robot system. We also demonstrate that the learned policy can be well generalized to new scenarios that do not appear in the entire training period, including navigating a heterogeneous group of robots and a large-scale scenario with 100 robots. Videos are available at https://sites.google.com/view/drlmaca △ Less

Submitted 20 May, 2018; v1 submitted 28 September, 2017; originally announced September 2017.

arXiv:1709.09574 [pdf, ps, other]

Fillable arrays with constant time operations and a single bit of redundancy

Authors: Jacob Teo Por Loong, Jelani Nelson, Huacheng Yu

Abstract: In the fillable array problem one must maintain an array A[1..n] of $w$-bit entries subject to random access reads and writes, and also a $\texttt{fill}(Δ)$ operation which sets every entry of to some $Δ\in\{0,\ldots,2^w-1\}$. We show that with just one bit of redundancy, i.e. a data structure using $nw+1$ bits of memory, $\texttt{read}/\texttt{fill}$ can be implemented in worst case constant time… ▽ More In the fillable array problem one must maintain an array A[1..n] of $w$-bit entries subject to random access reads and writes, and also a $\texttt{fill}(Δ)$ operation which sets every entry of to some $Δ\in\{0,\ldots,2^w-1\}$. We show that with just one bit of redundancy, i.e. a data structure using $nw+1$ bits of memory, $\texttt{read}/\texttt{fill}$ can be implemented in worst case constant time, and $\texttt{write}$ can be implemented in either amortized constant time (deterministically) or worst case expected constant (randomized). In the latter case, we need to store an additional $O(\log n)$ random bits to specify a permutation drawn from an $1/n^2$-almost pairwise independent family. △ Less

Submitted 27 September, 2017; originally announced September 2017.

arXiv:1707.05834 [pdf, other]

Statistical methods in astronomy

Authors: James P. Long, Rafael S. de Souza

Abstract: We present a review of data types and statistical methods often encountered in astronomy. The aim is to provide an introduction to statistical applications in astronomy for statisticians and computer scientists. We highlight the complex, often hierarchical, nature of many astronomy inference problems and advocate for cross-disciplinary collaborations to address these challenges. We present a review of data types and statistical methods often encountered in astronomy. The aim is to provide an introduction to statistical applications in astronomy for statisticians and computer scientists. We highlight the complex, often hierarchical, nature of many astronomy inference problems and advocate for cross-disciplinary collaborations to address these challenges. △ Less

Submitted 19 October, 2017; v1 submitted 16 July, 2017; originally announced July 2017.

Comments: 9 pages, 5 figures

arXiv:1705.05980 [pdf]

doi 10.1038/s41566-017-0069-0

Active Tuning of Surface Phonon Polariton Resonances via Carrier Photoinjection

Authors: Adam D. Dunkelberger, Chase T. Ellis, Daniel C. Ratchford, Alexander J. Giles, Mi** Kim, Chul Soo Kim, Bryan T. Spann, Igor Vurgaftman, Joseph G. Tischler, James P. Long, Orest J. Glembocki, Jeffrey C. Owrutsky, Joshua D. Caldwell

Abstract: Surface-phonon polaritons (SPhPs) are attractive alternatives to far-infrared plasmonics for sub-diffractional confinement of light. Localized SPhP resonances in semiconductor nanoresonators are very narrow, but that linewidth and the limited extent of the Reststrahlen band inherently limit spectral coverage. To address this limitation, we report active tuning of SPhP resonances in InP and 4H-SiC… ▽ More Surface-phonon polaritons (SPhPs) are attractive alternatives to far-infrared plasmonics for sub-diffractional confinement of light. Localized SPhP resonances in semiconductor nanoresonators are very narrow, but that linewidth and the limited extent of the Reststrahlen band inherently limit spectral coverage. To address this limitation, we report active tuning of SPhP resonances in InP and 4H-SiC by photoinjecting free carriers into the nanoresonators, taking advantage of the coupling between the carrier plasma and optical phonons to blue-shift SPhP resonances. We demonstrate state-of-the-art tuning figures of merit upon continuous-wave (CW) excitation (in InP) or pulsed excitation (in 4H-SiC). Lifetime effects cause the tuning to saturate in InP, and carrier-redistribution leads to rapid (<50 ps) recovery of the tuning in 4H-SiC. This work opens the path toward actively tuned nanophotonic devices, such as modulators and beacons, in the infrared and identifies important implications of coupling between electronic and photonic excitations. △ Less

Submitted 16 May, 2017; originally announced May 2017.

arXiv:1701.00480 [pdf, ps, other]

doi 10.1109/TED.2017.2690669

A Multiscale Modeling of Triple-Heterojunction Tunneling FETs

Authors: Jun Z. Huang, Pengyu Long, Michael Povolotskyi, Hesameddin Ilatikhameneh, Tarek Ameen, Rajib Rahman, Mark J. W. Rodwell, Gerhard Klimeck

Abstract: A high performance triple-heterojunction (3HJ) design has been previously proposed for tunneling FETs (TFETs). Compared with single heterojunction (HJ) TFETs, the 3HJ TFETs have both shorter tunneling distance and two transmission resonances that significantly improve the ON-state current ($I_{\rm{ON}}$). Coherent quantum transport simulation predicts, that $I_{\rm{ON}}=460\rm{μA/μm}$ can be achie… ▽ More A high performance triple-heterojunction (3HJ) design has been previously proposed for tunneling FETs (TFETs). Compared with single heterojunction (HJ) TFETs, the 3HJ TFETs have both shorter tunneling distance and two transmission resonances that significantly improve the ON-state current ($I_{\rm{ON}}$). Coherent quantum transport simulation predicts, that $I_{\rm{ON}}=460\rm{μA/μm}$ can be achieved at gate length $Lg=15\rm{nm}$, supply voltage $V_{\rm{DD}}=0.3\rm{V}$, and OFF-state current $I_{\rm{OFF}}=1\rm{nA/μm}$. However, strong electron-phonon and electron-electron scattering in the heavily doped leads implies, that the 3HJ devices operate far from the ideal coherent limit. In this study, such scattering effects are assessed by a newly developed multiscale transport model, which combines the ballistic non-equilibrium Green's function method for the channel and the drift-diffusion scattering method for the leads. Simulation results show that the thermalizing scattering in the leads both degrades the 3HJ TFET's subthreshold swing through scattering induced leakage and reduces the turn-on current through the access resistance. Assuming bulk scattering rates and carrier mobilities, the $I_{\rm{ON}}$ is dropped from $460\rm{μA/μm}$ down to $254\rm{μA/μm}$, which is still much larger than the single HJ TFET case. △ Less

Submitted 3 April, 2017; v1 submitted 2 January, 2017; originally announced January 2017.

Journal ref: IEEE Transactions on Electron Devices, 2017

arXiv:1609.07203 [pdf]

doi 10.1063/1.4971341

Performance degradation of superlattice MOSFETs due to scattering in the contacts

Authors: Pengyu Long, Jun Huang, Zheng** Jiang, Gerhard Klimeck, Mark J. W. Rodwell, Michael Povolotskyi

Abstract: Ideal, completely coherent quantum transport calculations had predicted that superlattice MOSFETs may offer steep subthreshold swing performance below 60mV/dec to around 39mV/dec. However, the high carrier density in the superlattice source suggest that scattering may significantly degrade the ideal device performance. Such effects of electron scattering and decoherence in the contacts of superlat… ▽ More Ideal, completely coherent quantum transport calculations had predicted that superlattice MOSFETs may offer steep subthreshold swing performance below 60mV/dec to around 39mV/dec. However, the high carrier density in the superlattice source suggest that scattering may significantly degrade the ideal device performance. Such effects of electron scattering and decoherence in the contacts of superlattice MOSFETs are examined through a multiscale quantum transport model developed in NEMO5. This model couples NEGF-based quantum ballistic transport in the channel to a quantum mechanical density of states dominated reservoir, which is thermalized through strong scattering with local quasi-Fermi levels determined by drift-diffusion transport. The simulations show that scattering increases the electron transmission in the nominally forbidden minigap therefore degrading the subthreshold swing (S.S.) and the ON/OFF DC current ratio. This degradation varies with both the scattering rate and the length of the scattering dominated regions. Different superlattice MOSFET designs are explored to mitigate the effects of such deleterious scattering. Specifically, shortening the spacer region between the superlattice and the channel from 3.5 nm to 0 nm improves the simulated S.S. from 51mV/dec. to 40mV/dec. I. INTRODUCTION △ Less

Submitted 22 September, 2016; originally announced September 2016.

Comments: 16 pages, 8 figures

arXiv:1609.06838 [pdf, other]

Deep-Learned Collision Avoidance Policy for Distributed Multi-Agent Navigation

Authors: Pinxin Long, Wenxi Liu, Jia Pan

Abstract: High-speed, low-latency obstacle avoidance that is insensitive to sensor noise is essential for enabling multiple decentralized robots to function reliably in cluttered and dynamic environments. While other distributed multi-agent collision avoidance systems exist, these systems require online geometric optimization where tedious parameter tuning and perfect sensing are necessary. We present a n… ▽ More High-speed, low-latency obstacle avoidance that is insensitive to sensor noise is essential for enabling multiple decentralized robots to function reliably in cluttered and dynamic environments. While other distributed multi-agent collision avoidance systems exist, these systems require online geometric optimization where tedious parameter tuning and perfect sensing are necessary. We present a novel end-to-end framework to generate reactive collision avoidance policy for efficient distributed multi-agent navigation. Our method formulates an agent's navigation strategy as a deep neural network map** from the observed noisy sensor measurements to the agent's steering commands in terms of movement velocity. We train the network on a large number of frames of collision avoidance data collected by repeatedly running a multi-agent simulator with different parameter settings. We validate the learned deep neural network policy in a set of simulated and real scenarios with noisy measurements and demonstrate that our method is able to generate a robust navigation strategy that is insensitive to imperfect sensing and works reliably in all situations. We also show that our method can be well generalized to scenarios that do not appear in our training data, including scenes with static obstacles and agents with different sizes. Videos are available at https://sites.google.com/view/deepmaca. △ Less

Submitted 6 July, 2017; v1 submitted 22 September, 2016; originally announced September 2016.

Journal ref: IEEE Robotics and Automation Letters 2(2): 656-663 (2017)

arXiv:1607.04896 [pdf, ps, other]

doi 10.1109/TED.2016.2624744

Scalable GaSb/InAs tunnel FETs with non-uniform body thickness

Authors: Jun Z. Huang, Pengyu Long, Michael Povolotskyi, Gerhard Klimeck, Mark J. W. Rodwell

Abstract: GaSb/InAs heterojunction tunnel field-effect transistors are strong candidates in building future low-power integrated circuits, as they could provide both steep subthreshold swing and large ON-state current ($I_{\rm{ON}}$). However, at short channel lengths they suffer from large tunneling leakage originating from the small band gap and small effective masses of the InAs channel. As proposed in t… ▽ More GaSb/InAs heterojunction tunnel field-effect transistors are strong candidates in building future low-power integrated circuits, as they could provide both steep subthreshold swing and large ON-state current ($I_{\rm{ON}}$). However, at short channel lengths they suffer from large tunneling leakage originating from the small band gap and small effective masses of the InAs channel. As proposed in this article, this problem can be significantly mitigated by reducing the channel thickness meanwhile retaining a thick source-channel tunnel junction, thus forming a design with a non-uniform body thickness. Because of the quantum confinement, the thin InAs channel offers a large band gap and large effective masses, reducing the ambipolar and source-to-drain tunneling leakage at OFF state. The thick GaSb/InAs tunnel junction, instead, offers a low tunnel barrier and small effective masses, allowing a large tunnel probability at ON state. In addition, the confinement induced band discontinuity enhances the tunnel electric field and creates a resonant state, further improving $I_{\rm{ON}}$. Atomistic quantum transport simulations show that ballistic $I_{\rm{ON}}=284$A/m is obtained at 15nm channel length, $I_{\rm{OFF}}=1\times10^{-3}$A/m, and $V_{\rm{DD}}=0.3$V. While with uniform body thickness, the largest achievable $I_{\rm{ON}}$ is only 25A/m. Simulations also indicate that this design is scalable to sub-10nm channel length. △ Less

Submitted 17 July, 2016; originally announced July 2016.

Comments: 4 pages, 8 figures

Journal ref: IEEE Transactions on Electron Devices, 2016

arXiv:1605.07166 [pdf, ps, other]

doi 10.1109/JEDS.2016.2614915

P-Type Tunnel FETs With Triple Heterojunctions

Authors: Jun Z. Huang, Pengyu Long, Michael Povolotskyi, Gerhard Klimeck, Mark J. W. Rodwell

Abstract: A triple-heterojunction (3HJ) design is employed to improve p-type InAs/GaSb heterojunction (HJ) tunnel FETs. The added two HJs (AlInAsSb/InAs in the source and GaSb/AlSb in the channel) significantly shorten the tunnel distance and create two resonant states, greatly improving the ON state tunneling probability. Moreover, the source Fermi degeneracy is reduced by the increased source (AlInAsSb) d… ▽ More A triple-heterojunction (3HJ) design is employed to improve p-type InAs/GaSb heterojunction (HJ) tunnel FETs. The added two HJs (AlInAsSb/InAs in the source and GaSb/AlSb in the channel) significantly shorten the tunnel distance and create two resonant states, greatly improving the ON state tunneling probability. Moreover, the source Fermi degeneracy is reduced by the increased source (AlInAsSb) density of states and the OFF state leakage is reduced by the heavier channel (AlSb) hole effective masses. Quantum ballistic transport simulations show, that with V_{DD} = 0.3V and I_{OFF} = 10^{-3}A/m, I_{ON} of 582A=m (488A=m) is obtained at 30nm (15nm) channel length, which is comparable to n-type 3HJ counterpart and significantly exceeding p-type silicon MOSFET. Simultaneously, the nonlinear turn on and delayed saturation in the output characteristics are also greatly improved. △ Less

Submitted 23 May, 2016; originally announced May 2016.

Journal ref: IEEE Journal of the Electron Devices Society, vol. 4, no. 6, Nov. 2016

arXiv:1605.00955 [pdf, ps, other]

High-Performance Complementary III-V Tunnel FETs with Strain Engineering

Authors: Jun Z. Huang, Yu Wang, Pengyu Long, Yaohua Tan, Michael Povolotskyi, Gerhard Klimeck

Abstract: Strain engineering has recently been explored to improve tunnel field-effect transistors (TFETs). Here, we report design and performance of strained ultra-thin-body (UTB) III-V TFETs by quantum transport simulations. It is found that for an InAs UTB confined in [001] orientation, uniaxial compressive strain in [100] or [110] orientation shrinks the band gap meanwhile reduces (increases) transport… ▽ More Strain engineering has recently been explored to improve tunnel field-effect transistors (TFETs). Here, we report design and performance of strained ultra-thin-body (UTB) III-V TFETs by quantum transport simulations. It is found that for an InAs UTB confined in [001] orientation, uniaxial compressive strain in [100] or [110] orientation shrinks the band gap meanwhile reduces (increases) transport (transverse) effective masses. Thus it improves the ON state current of both n-type and p-type UTB InAs TFETs without lowering the source density of states. Applying the strain locally in the source region makes further improvements by suppressing the OFF state leakage. For p-type TFETs, the locally strained area can be extended into the channel to form a quantum well, giving rise to even larger ON state current that is comparable to the n-type ones. Therefore strain engineering is a promising option for improving complementary circuits based on UTB III-V TFETs. △ Less

Submitted 3 May, 2016; originally announced May 2016.

Comments: 6 pages, 11 figures

arXiv:1603.06317 [pdf, other]

DoraPicker: An Autonomous Picking System for General Objects

Authors: Hao Zhang, Pinxin Long, Dandan Zhou, Zhongfeng Qian, Zheng Wang, Weiwei Wan, Dinesh Manocha, Chonhyon Park, Tommy Hu, Chao Cao, Yibo Chen, Marco Chow, Jia Pan

Abstract: Robots that autonomously manipulate objects within warehouses have the potential to shorten the package delivery time and improve the efficiency of the e-commerce industry. In this paper, we present a robotic system that is capable of both picking and placing general objects in warehouse scenarios. Given a target object, the robot autonomously detects it from a shelf or a table and estimates its f… ▽ More Robots that autonomously manipulate objects within warehouses have the potential to shorten the package delivery time and improve the efficiency of the e-commerce industry. In this paper, we present a robotic system that is capable of both picking and placing general objects in warehouse scenarios. Given a target object, the robot autonomously detects it from a shelf or a table and estimates its full 6D pose. With this pose information, the robot picks the object using its gripper, and then places it into a container or at a specified location. We describe our pick-and-place system in detail while highlighting our design principles for the warehouse settings, including the perception method that leverages knowledge about its workspace, three grippers designed to handle a large variety of different objects in terms of shape, weight and material, and grasp planning in cluttered scenarios. We also present extensive experiments to evaluate the performance of our picking system and demonstrate that the robot is competent to accomplish various tasks in warehouse settings, such as picking a target item from a tight space, gras** different objects from the shelf, and performing pick-and-place tasks on the table. △ Less

Submitted 20 March, 2016; originally announced March 2016.

Comments: 10 pages, 10 figures

arXiv:1602.04484 [pdf, ps, other]

Surprising properties of dropout in deep networks

Authors: David P. Helmbold, Philip M. Long

Abstract: We analyze dropout in deep networks with rectified linear units and the quadratic loss. Our results expose surprising differences between the behavior of dropout and more traditional regularizers like weight decay. For example, on some simple data sets dropout training produces negative weights even though the output is the sum of the inputs. This provides a counterpoint to the suggestion that dro… ▽ More We analyze dropout in deep networks with rectified linear units and the quadratic loss. Our results expose surprising differences between the behavior of dropout and more traditional regularizers like weight decay. For example, on some simple data sets dropout training produces negative weights even though the output is the sum of the inputs. This provides a counterpoint to the suggestion that dropout discourages co-adaptation of weights. We also show that the dropout penalty can grow exponentially in the depth of the network while the weight-decay penalty remains essentially linear, and that dropout is insensitive to various re-scalings of the input features, outputs, and network weights. This last insensitivity implies that there are no isolated local minima of the dropout training criterion. Our work uncovers new properties of dropout, extends our understanding of why dropout succeeds, and lays the foundation for further progress. △ Less

Submitted 19 April, 2017; v1 submitted 14 February, 2016; originally announced February 2016.

arXiv:1511.09428 [pdf]

doi 10.1103/PhysRevB.93.085205

Photoinduced tunability of the Reststrahlen band in 4H-SiC

Authors: Bryan T. Spann, Ryan Compton, Daniel Ratchford, James P. Long, Adam D. Dunkelberger, Paul B. Klein, Alexander J. Giles, Joshua D. Caldwell, Jeffrey C. Owrutsky

Abstract: Materials with a negative dielectric permittivity (e.g. metals) display high reflectance and can be shaped into nanoscale optical-resonators exhibiting extreme mode confinement, a central theme of nanophotonics. However, the ability to $actively$ tune these effects remains elusive. By photoexciting free carriers in 4H-SiC, we induce dramatic changes in reflectance near the "Reststrahlen band" wher… ▽ More Materials with a negative dielectric permittivity (e.g. metals) display high reflectance and can be shaped into nanoscale optical-resonators exhibiting extreme mode confinement, a central theme of nanophotonics. However, the ability to $actively$ tune these effects remains elusive. By photoexciting free carriers in 4H-SiC, we induce dramatic changes in reflectance near the "Reststrahlen band" where the permittivity is negative due to charge oscillations of the polar optical phonons in the mid-infrared. We infer carrier-induced changes in the permittivity required for useful tunability (~ 40 cm$^{-1}$) in nanoscale resonators, providing a direct avenue towards the realization of actively tunable nanophotonic devices in the mid-infrared to terahertz spectral range. △ Less

Submitted 30 November, 2015; originally announced November 2015.

Journal ref: Phys. Rev. B 93, 085205 (2016)

arXiv:1511.02516 [pdf, ps, other]

doi 10.1007/978-3-319-31653-6_6

Quantum Transport Simulation of III-V TFETs with Reduced-Order K.P Method

Authors: Jun Z. Huang, Lining Zhang, Pengyu Long, Michael Povolotskyi, Gerhard Klimeck

Abstract: III-V tunneling field-effect transistors (TFETs) offer great potentials in future low-power electronics application due to their steep subthreshold slope and large "on" current. Their 3D quantum transport study using non-equilibrium Green's function method is computationally very intensive, in particular when combined with multiband approaches such as the eight-band K.P method. To reduce the numer… ▽ More III-V tunneling field-effect transistors (TFETs) offer great potentials in future low-power electronics application due to their steep subthreshold slope and large "on" current. Their 3D quantum transport study using non-equilibrium Green's function method is computationally very intensive, in particular when combined with multiband approaches such as the eight-band K.P method. To reduce the numerical cost, an efficient reduced-order method is developed in this article and applied to study homojunction InAs and heterojunction GaSb-InAs nanowire TFETs. Device performances are obtained for various channel widths, channel lengths, crystal orientations, do** densities, source pocket lengths, and strain conditions. △ Less

Submitted 8 November, 2015; originally announced November 2015.

arXiv:1509.05810 [pdf, other]

A Note on Parameter Estimation for Misspecified Regression Models with Heteroskedastic Errors

Authors: James P. Long

Abstract: Misspecified models often provide useful information about the true data generating distribution. For example, if $y$ is a non-linear function of $x$ the least squares estimator $\hatβ$ is an estimate of $β$, the slope of the best linear approximation to the non-linear function. Motivated by problems in astronomy, we study how to incorporate observation measurement error variances into fitting par… ▽ More Misspecified models often provide useful information about the true data generating distribution. For example, if $y$ is a non-linear function of $x$ the least squares estimator $\hatβ$ is an estimate of $β$, the slope of the best linear approximation to the non-linear function. Motivated by problems in astronomy, we study how to incorporate observation measurement error variances into fitting parameters of misspecified models. Our asymptotic theory focuses on the particular case of linear regression where often weighted least squares procedures are used to account for heteroskedasticity. We find that when the response is a non-linear function of the independent variable, the standard procedure of weighting by the inverse of the observation variances can be counter-productive. In particular, ordinary least squares may have lower asymptotic variance. We construct an adaptive estimator which has lower asymptotic variance than either OLS or standard WLS. We demonstrate our theory in a small simulation and apply these ideas to the problem of estimating the period of a periodic function using a sinusoidal model. △ Less

Submitted 15 May, 2017; v1 submitted 18 September, 2015; originally announced September 2015.

Comments: 28 pages, 4 figures, 2 tables

Journal ref: Electronic Journal of Statistics Vol. 11 (2017) 1464-1490

arXiv:1508.04772 [pdf, ps, other]

doi 10.1088/2041-8205/811/2/L34

A Multiband Generalization of the Analysis of Variance Period Estimation Algorithm and the Effect of Inter-band Observing Cadence on Period Recovery Rate

Authors: Nicholas Mondrik, James P. Long, Jennifer L. Marshall

Abstract: We present a new method of extending the single band Analysis of Variance period estimation algorithm to multiple bands. We use SDSS Stripe 82 RR Lyrae to show that in the case of low number of observations per band and non-simultaneous observations, improvements in period recovery rates of up to $\approx$60\% are observed. We also investigate the effect of inter-band observing cadence on period r… ▽ More We present a new method of extending the single band Analysis of Variance period estimation algorithm to multiple bands. We use SDSS Stripe 82 RR Lyrae to show that in the case of low number of observations per band and non-simultaneous observations, improvements in period recovery rates of up to $\approx$60\% are observed. We also investigate the effect of inter-band observing cadence on period recovery rates. We find that using non-simultaneous observation times between bands is ideal for the multiband method, and using simultaneous multiband data is only marginally better than using single band data. These results will be particularly useful in planning observing cadences for wide-field astronomical imaging surveys such as LSST. They also have the potential to improve the extraction of transient data from surveys with few ($\lesssim 30$) observations per band across several bands, such as the Dark Energy Survey. △ Less

Submitted 19 August, 2015; originally announced August 2015.

Comments: Submitted to ApJL, comments welcome at [email protected]

arXiv:1506.01332 [pdf, other]

A Study of Functional Depths

Authors: James P. Long, Jianhua Z. Huang

Abstract: Functional depth is used for ranking functional observations from most outlying to most typical. The ranks produced by functional depth have been proposed as the basis for functional classifiers, rank tests, and data visualization procedures. Many of the proposed functional depths are invariant to domain permutation, an unusual property for a functional data analysis procedure. Essentially these d… ▽ More Functional depth is used for ranking functional observations from most outlying to most typical. The ranks produced by functional depth have been proposed as the basis for functional classifiers, rank tests, and data visualization procedures. Many of the proposed functional depths are invariant to domain permutation, an unusual property for a functional data analysis procedure. Essentially these depths treat functional data as if it were multivariate data. In this work, we compare the performance of several existing functional depths to a simple adaptation of an existing multivariate depth notion, $L^\infty$ depth ($L^{\infty}D$). On simulated and real data, we show $L^{\infty}D$ has performance comparable or superior to several existing notions of functional depth. In addition, we review how depth functions are evaluated and propose some improvements. In particular, we show that empirical depth function asymptotics can be mis--leading and instead propose a new method, the rank--rank plot, for evaluating empirical depth rank stability. △ Less

Submitted 1 November, 2016; v1 submitted 3 June, 2015; originally announced June 2015.

Comments: 25 pages, 13 figures

arXiv:1412.6520 [pdf, other]

doi 10.1214/15-AOAS885

Estimating a Common Period for a Set of Irregularly Sampled Functions with Applications to Periodic Variable Star Data

Authors: James P. Long, Eric C. Chi, Richard G. Baraniuk

Abstract: We consider the estimation of a common period for a set of functions sampled at irregular intervals. The problem arises in astronomy, where the functions represent a star's brightness observed over time through different photometric filters. While current methods can estimate periods accurately provided that the brightness is well--sampled in at least one filter, there are no existing methods that… ▽ More We consider the estimation of a common period for a set of functions sampled at irregular intervals. The problem arises in astronomy, where the functions represent a star's brightness observed over time through different photometric filters. While current methods can estimate periods accurately provided that the brightness is well--sampled in at least one filter, there are no existing methods that can provide accurate estimates when no brightness function is well--sampled. In this paper we introduce two new methods for period estimation when brightnesses are poorly--sampled in all filters. The first, multiband generalized Lomb-Scargle (MGLS), extends the frequently used Lomb-Scargle method in a way that naïvely combines information across filters. The second, penalized generalized Lomb-Scargle (PGLS), builds on the first by more intelligently borrowing strength across filters. Specifically, we incorporate constraints on the phases and amplitudes across the different functions using a non--convex penalized likelihood function. We develop a fast algorithm to optimize the penalized likelihood by combining block coordinate descent with the majorization-minimization (MM) principle. We illustrate our methods on synthetic and real astronomy data. Both advance the state-of-the-art in period estimation; however, PGLS significantly outperforms MGLS when all functions are extremely poorly--sampled. △ Less

Submitted 19 December, 2014; originally announced December 2014.

Comments: 25 pages

Journal ref: Annals of Applied Statistics, 10(1):165-197, 2016

arXiv:1412.4736 [pdf, other]

On the Inductive Bias of Dropout

Authors: David P. Helmbold, Philip M. Long

Abstract: Dropout is a simple but effective technique for learning in neural networks and other settings. A sound theoretical understanding of dropout is needed to determine when dropout should be applied and how to use it most effectively. In this paper we continue the exploration of dropout as a regularizer pioneered by Wager, et.al. We focus on linear classification where a convex proxy to the misclassif… ▽ More Dropout is a simple but effective technique for learning in neural networks and other settings. A sound theoretical understanding of dropout is needed to determine when dropout should be applied and how to use it most effectively. In this paper we continue the exploration of dropout as a regularizer pioneered by Wager, et.al. We focus on linear classification where a convex proxy to the misclassification loss (i.e. the logistic loss used in logistic regression) is minimized. We show: (a) when the dropout-regularized criterion has a unique minimizer, (b) when the dropout-regularization penalty goes to infinity with the weights, and when it remains bounded, (c) that the dropout regularization can be non-monotonic as individual weights increase from 0, and (d) that the dropout regularization penalty may not be convex. This last point is particularly surprising because the combination of dropout regularization with any convex loss proxy is always a convex function. In order to contrast dropout regularization with $L_2$ regularization, we formalize the notion of when different sources are more compatible with different regularizers. We then exhibit distributions that are provably more compatible with dropout regularization than $L_2$ regularization, and vice versa. These sources provide additional insight into how the inductive biases of dropout and $L_2$ regularization differ. We provide some similar results for $L_1$ regularization. △ Less

Submitted 17 February, 2015; v1 submitted 15 December, 2014; originally announced December 2014.

Journal ref: Journal of Machine Learning Research, 16, 3403-3454 (2015). (See http://jmlr.org/papers/volume16/helmbold15a/helmbold15a.pdf.)

arXiv:1410.8282 [pdf]

Specific Absorbed Fractions of Electrons and Photons for Rad-HUMAN Phantom Using Monte Carlo Method

Authors: Wen Wang, Meng-yun Cheng, Peng-cheng Long, Li-qin Hu

Abstract: The specific absorbed fractions (SAF) for self- and cross-irradiation are effective tools for the internal dose estimation of inhalation and ingestion intakes of radionuclides. A set of SAFs of photon and electron were calculated using the Rad-HUMAN phantom, a computational voxel phantom of Chinese adult female and created using the color photographic image of the Chinese Visible Human (CVH) data… ▽ More The specific absorbed fractions (SAF) for self- and cross-irradiation are effective tools for the internal dose estimation of inhalation and ingestion intakes of radionuclides. A set of SAFs of photon and electron were calculated using the Rad-HUMAN phantom, a computational voxel phantom of Chinese adult female and created using the color photographic image of the Chinese Visible Human (CVH) data set. The model can represent most of Chinese adult female anatomical characteristics and can be taken as an individual phantom to investigate the difference of internal dose with Caucasians. In this study, the emission of mono-energetic photons and electrons of 10keV to 4MeV energy were calculated using the Monte Carlo particle transport calculation code MCNP. Results were compared with the values from ICRP reference and ORNL models. The results showed that SAF from Rad-HUMAN have the similar trends but larger than those from the other two models. The differences were due to the racial and anatomical differences in organ mass and inter-organ distance. The SAFs based on the Rad-HUMAN phantom provide an accurate and reliable data for internal radiation dose calculations for Chinese female. △ Less

Submitted 30 October, 2014; originally announced October 2014.

Comments: 9 pages,8 figures,Submitted to Chinese Physics C

arXiv:1401.3362 [pdf, other]

Kernel Density Estimation with Berkson Error

Authors: James P. Long, Noureddine El Karoui, John A. Rice

Abstract: Given a sample $\{X_i\}_{i=1}^n$ from $f_X$, we construct kernel density estimators for $f_Y$, the convolution of $f_X$ with a known error density $f_ε$. This problem is known as density estimation with Berkson error and has applications in epidemiology and astronomy. Little is understood about bandwidth selection for Berkson density estimation. We compare three approaches to selecting the bandwid… ▽ More Given a sample $\{X_i\}_{i=1}^n$ from $f_X$, we construct kernel density estimators for $f_Y$, the convolution of $f_X$ with a known error density $f_ε$. This problem is known as density estimation with Berkson error and has applications in epidemiology and astronomy. Little is understood about bandwidth selection for Berkson density estimation. We compare three approaches to selecting the bandwidth both asymptotically, using large sample approximations to the MISE, and at finite samples, using simulations. Our results highlight the relationship between the structure of the error $f_ε$ and the optimal bandwidth. In particular, the results demonstrate the importance of smoothing when the error term $f_ε$ is concentrated near 0. We propose a data--driven bandwidth estimator and test its performance on NO$_2$ exposure data. △ Less

Submitted 29 July, 2014; v1 submitted 14 January, 2014; originally announced January 2014.

Comments: 36 pages, 5 figures

arXiv:1307.8371 [pdf, ps, other]

The Power of Localization for Efficiently Learning Linear Separators with Noise

Authors: Pranjal Awasthi, Maria Florina Balcan, Philip M. Long

Abstract: We introduce a new approach for designing computationally efficient learning algorithms that are tolerant to noise, and demonstrate its effectiveness by designing algorithms with improved noise tolerance guarantees for learning linear separators. We consider both the malicious noise model and the adversarial label noise model. For malicious noise, where the adversary can corrupt both the label a… ▽ More We introduce a new approach for designing computationally efficient learning algorithms that are tolerant to noise, and demonstrate its effectiveness by designing algorithms with improved noise tolerance guarantees for learning linear separators. We consider both the malicious noise model and the adversarial label noise model. For malicious noise, where the adversary can corrupt both the label and the features, we provide a polynomial-time algorithm for learning linear separators in $\Re^d$ under isotropic log-concave distributions that can tolerate a nearly information-theoretically optimal noise rate of $η= Ω(ε)$. For the adversarial label noise model, where the distribution over the feature vectors is unchanged, and the overall probability of a noisy label is constrained to be at most $η$, we also give a polynomial-time algorithm for learning linear separators in $\Re^d$ under isotropic log-concave distributions that can handle a noise rate of $η= Ω\left(ε\right)$. We show that, in the active learning model, our algorithms achieve a label complexity whose dependence on the error parameter $ε$ is polylogarithmic. This provides the first polynomial-time active learning algorithm for learning linear separators in the presence of malicious noise or adversarial label noise. △ Less

Submitted 3 June, 2018; v1 submitted 31 July, 2013; originally announced July 2013.

Comments: Contains improved label complexity analysis communicated to us by Steve Hanneke

ACM Class: F.2

arXiv:1301.0246 [pdf]

doi 10.1021/nn304834p

Electronic Hybridization of Large-Area Stacked Graphene Films

Authors: Jeremy T. Robinson, Scott W. Schmucker, C. Bogdan Diaconescu, James P. Long, James C. Culbertson, Taisuke Ohta, Adam L. Friedman, Thomas E. Beechem

Abstract: Direct, tunable coupling between individually assembled graphene layers is a next step towards designer two-dimensional (2D) crystal systems, with relevance for fundamental studies and technological applications. Here we describe the fabrication and characterization of large-area (> cm^2), coupled bilayer graphene on SiO2/Si substrates. Stacking two graphene films leads to direct electronic intera… ▽ More Direct, tunable coupling between individually assembled graphene layers is a next step towards designer two-dimensional (2D) crystal systems, with relevance for fundamental studies and technological applications. Here we describe the fabrication and characterization of large-area (> cm^2), coupled bilayer graphene on SiO2/Si substrates. Stacking two graphene films leads to direct electronic interactions between layers, where the resulting film properties are determined by the local twist angle. Polycrystalline bilayer films have a "stained-glass window" appearance explained by the emergence of a narrow absorption band in the visible spectrum that depends on twist angle. Direct measurement of layer orientation via electron diffraction, together with Raman and optical spectroscopy, confirms the persistence of clean interfaces over large areas. Finally, we demonstrate that interlayer coupling can be reversibly turned off through chemical modification, enabling optical-based chemical detection schemes. Together, these results suggest that individual 2D crystals can be individually assembled to form electronically coupled systems suitable for large-scale applications. △ Less

Submitted 2 January, 2013; originally announced January 2013.

Comments: 16 pages; ACS Nano, ASAP Article (2012)

arXiv:1211.1082 [pdf, ps, other]

Active and passive learning of linear separators under log-concave distributions

Authors: Maria Florina Balcan, Philip M. Long

Abstract: We provide new results concerning label efficient, polynomial time, passive and active learning of linear separators. We prove that active learning provides an exponential improvement over PAC (passive) learning of homogeneous linear separators under nearly log-concave distributions. Building on this, we provide a computationally efficient PAC algorithm with optimal (up to a constant factor) sampl… ▽ More We provide new results concerning label efficient, polynomial time, passive and active learning of linear separators. We prove that active learning provides an exponential improvement over PAC (passive) learning of homogeneous linear separators under nearly log-concave distributions. Building on this, we provide a computationally efficient PAC algorithm with optimal (up to a constant factor) sample complexity for such problems. This resolves an open question concerning the sample complexity of efficient PAC algorithms under the uniform distribution in the unit ball. Moreover, it provides the first bound for a polynomial-time PAC algorithm that is tight for an interesting infinite class of hypothesis functions under a general and natural class of data-distributions, providing significant progress towards a longstanding open question. We also provide new bounds for active and passive learning in the case that the data might not be linearly separable, both in the agnostic case and and under the Tsybakov low-noise condition. To derive our results, we provide new structural results for (nearly) log-concave distributions, which might be of independent interest as well. △ Less

Submitted 26 April, 2013; v1 submitted 5 November, 2012; originally announced November 2012.

arXiv:1205.6081 [pdf, ps, other]

Criterions of Wiener type for minimally thin sets and rarefied sets associated with the stationary Schrödinger operator in a cone

Authors: Pinhong Long, Zhiqiang Gao, Guantie Deng

Abstract: In the paper we give some criterions for a-minimally thin sets and a-rarefied sets associated with the stationary Schrödinger operator at a fixed Martin boundary point or {\infty} with respect to a cone. Moreover, we show that a positive superfunction on a cone behaves regularly outside a-rarefied set. Finally we illustrate the relation between a-minimally thin set and a-rarefied set in a cone. In the paper we give some criterions for a-minimally thin sets and a-rarefied sets associated with the stationary Schrödinger operator at a fixed Martin boundary point or {\infty} with respect to a cone. Moreover, we show that a positive superfunction on a cone behaves regularly outside a-rarefied set. Finally we illustrate the relation between a-minimally thin set and a-rarefied set in a cone. △ Less

Submitted 28 May, 2012; originally announced May 2012.

MSC Class: 31B05; 31B25; 31C35

arXiv:1203.2557 [pdf, other]

On the Necessity of Irrelevant Variables

Authors: David P. Helmbold, Philip M. Long

Abstract: This work explores the effects of relevant and irrelevant boolean variables on the accuracy of classifiers. The analysis uses the assumption that the variables are conditionally independent given the class, and focuses on a natural family of learning algorithms for such sources when the relevant variables have a small advantage over random guessing. The main result is that algorithms relying predo… ▽ More This work explores the effects of relevant and irrelevant boolean variables on the accuracy of classifiers. The analysis uses the assumption that the variables are conditionally independent given the class, and focuses on a natural family of learning algorithms for such sources when the relevant variables have a small advantage over random guessing. The main result is that algorithms relying predominately on irrelevant variables have error probabilities that quickly go to 0 in situations where algorithms that limit the use of irrelevant variables have errors bounded below by a positive constant. We also show that accurate learning is possible even when there are so few examples that one cannot determine with high confidence whether or not any individual variable is relevant. △ Less

Submitted 8 June, 2012; v1 submitted 12 March, 2012; originally announced March 2012.

Comments: A preliminary version of this paper appeared in the proceedings of ICML'11

arXiv:1201.4863 [pdf, ps, other]

doi 10.1086/664960

Optimizing Automated Classification of Periodic Variable Stars in New Synoptic Surveys

Authors: James P. Long, Noureddine El Karoui, John A. Rice, Joseph W. Richards, Joshua S. Bloom

Abstract: Efficient and automated classification of periodic variable stars is becoming increasingly important as the scale of astronomical surveys grows. Several recent papers have used methods from machine learning and statistics to construct classifiers on databases of labeled, multi--epoch sources with the intention of using these classifiers to automatically infer the classes of unlabeled sources from… ▽ More Efficient and automated classification of periodic variable stars is becoming increasingly important as the scale of astronomical surveys grows. Several recent papers have used methods from machine learning and statistics to construct classifiers on databases of labeled, multi--epoch sources with the intention of using these classifiers to automatically infer the classes of unlabeled sources from new surveys. However, the same source observed with two different synoptic surveys will generally yield different derived metrics (features) from the light curve. Since such features are used in classifiers, this survey-dependent mismatch in feature space will typically lead to degraded classifier performance. In this paper we show how and why feature distributions change using OGLE and \textit{Hipparcos} light curves. To overcome survey systematics, we apply a method, \textit{noisification}, which attempts to empirically match distributions of features between the labeled sources used to construct the classifier and the unlabeled sources we wish to classify. Results from simulated and real--world light curves show that noisification can significantly improve classifier performance. In a three--class problem using light curves from \textit{Hipparcos} and OGLE, noisification reduces the classifier error rate from 27.0% to 7.0%. We recommend that noisification be used for upcoming surveys such as Gaia and LSST and describe some of the promises and challenges of applying noisification to these surveys. △ Less

Submitted 23 February, 2012; v1 submitted 23 January, 2012; originally announced January 2012.

Comments: 30 pages, 25 figures

arXiv:1106.2832 [pdf, other]

doi 10.1088/0004-637X/744/2/192

Active Learning to Overcome Sample Selection Bias: Application to Photometric Variable Star Classification

Authors: Joseph W. Richards, Dan L. Starr, Henrik Brink, Adam A. Miller, Joshua S. Bloom, Nathaniel R. Butler, J. Berian James, James P. Long, John Rice

Abstract: Despite the great promise of machine-learning algorithms to classify and predict astrophysical parameters for the vast numbers of astrophysical sources and transients observed in large-scale surveys, the peculiarities of the training data often manifest as strongly biased predictions on the data of interest. Typically, training sets are derived from historical surveys of brighter, more nearby obje… ▽ More Despite the great promise of machine-learning algorithms to classify and predict astrophysical parameters for the vast numbers of astrophysical sources and transients observed in large-scale surveys, the peculiarities of the training data often manifest as strongly biased predictions on the data of interest. Typically, training sets are derived from historical surveys of brighter, more nearby objects than those from more extensive, deeper surveys (testing data). This sample selection bias can cause catastrophic errors in predictions on the testing data because a) standard assumptions for machine-learned model selection procedures break down and b) dense regions of testing space might be completely devoid of training data. We explore possible remedies to sample selection bias, including importance weighting (IW), co-training (CT), and active learning (AL). We argue that AL---where the data whose inclusion in the training set would most improve predictions on the testing set are queried for manual follow-up---is an effective approach and is appropriate for many astronomical applications. For a variable star classification problem on a well-studied set of stars from Hipparcos and OGLE, AL is the optimal method in terms of error rate on the testing data, beating the off-the-shelf classifier by 3.4% and the other proposed methods by at least 3.0%. To aid with manual labeling of variable stars, we developed a web interface which allows for easy light curve visualization and querying of external databases. Finally, we apply active learning to classify variable stars in the ASAS survey, finding dramatic improvement in our agreement with the ACVS catalog, from 65.5% to 79.5%, and a significant increase in the classifier's average confidence for the testing set, from 14.6% to 42.9%, after a few AL iterations. △ Less

Submitted 17 June, 2011; v1 submitted 14 June, 2011; originally announced June 2011.

Comments: 43 pages, 11 figures, submitted to ApJ

arXiv:1008.4725 [pdf]

Electron-Doped Sr2IrO4-delta (0 <= delta <= 0.04): Evolution of a Disordered Jeff = 1/2 Mott Insulator into an Exotic Metallic State

Authors: O. B. Korneta, Tongfei Qi, S. Chikara, S. Parkin L. E. De Long, P. Schlottmann, G. Cao

Abstract: Stoichiometric Sr2IrO4 is a ferromagnetic Jeff = 1/2 Mott insulator driven by strong spin-orbit coupling. Introduction of very dilute oxygen vacancies into single-crystal Sr2IrO4-delta with delta < 0.04 leads to significant changes in lattice parameters and an insulator-to-metal transition at TMI = 105 K. The highly anisotropic electrical resistivity of the low-temperature metallic state for delta… ▽ More Stoichiometric Sr2IrO4 is a ferromagnetic Jeff = 1/2 Mott insulator driven by strong spin-orbit coupling. Introduction of very dilute oxygen vacancies into single-crystal Sr2IrO4-delta with delta < 0.04 leads to significant changes in lattice parameters and an insulator-to-metal transition at TMI = 105 K. The highly anisotropic electrical resistivity of the low-temperature metallic state for delta ~ 0.04 exhibits anomalous properties characterized by non-Ohmic behavior and an abrupt current-induced transition in the resistivity at T* = 52 K, which separates two regimes of resisitive switching in the nonlinear I-V characteristics. The novel behavior illustrates an exotic ground state and constitutes a new paradigm for devices structures in which electrical resistivity is manipulated via low-level current densities ~ 10 mA/cm2 (compared to higher spin-torque currents ~ 107-108 A/cm2) or magnetic inductions ~ 0.1-1.0 T. △ Less

Submitted 27 August, 2010; originally announced August 2010.

arXiv:nucl-ex/0604004 [pdf, ps, other]

doi 10.1016/j.nima.2006.10.184

Arguments for a "U.S. Kamioka": SNOLab and its Implications for North American Underground Science Planning

Authors: W. C. Haxton, K. A. Philpott, Robert Holtz, Philip Long, J. F. Wilkerson

Abstract: We argue for a cost-effective, long-term North American underground science strategy based on partnership with Canada and initial construction of a modest U.S. Stage I laboratory designed to complement SNOLab. We show, by reviewing the requirements of detectors now in the R&D phase, that SNOLab and a properly designed U.S. Stage I facility would be capable of meeting the needs of North America's… ▽ More We argue for a cost-effective, long-term North American underground science strategy based on partnership with Canada and initial construction of a modest U.S. Stage I laboratory designed to complement SNOLab. We show, by reviewing the requirements of detectors now in the R&D phase, that SNOLab and a properly designed U.S. Stage I facility would be capable of meeting the needs of North America's next wave of underground experiments. We discuss one opportunity for creating a Stage I laboratory, the Pioneer tunnel in Washington State, a site that could be developed to provide dedicated, clean, horizontal access. This unused tunnel, part of the deepest (1040 m) tunnel system in the U.S., would allow the U.S. to establish, at low risk and low cost, a laboratory at a depth (2.12 km.w.e., or kilometers of water equivalent) quite similar to that of the Japanese laboratory Kamioka (2.04 km.w.e.). We describe studies of cosmic ray attenuation important to properly locating such a laboratory, and the tunnel improvements that would be required to produce an optimal Stage I facility. We also discuss possibilities for far-future Stage II (3.62 km.w.e.) and Stage III (5.00 km.w.e.) developments at the Pioneer tunnel, should future North American needs for deep space exceed that available at SNOLab. △ Less

Submitted 6 October, 2006; v1 submitted 10 April, 2006; originally announced April 2006.

Comments: 23 pages, 10 figures; revised version includes discusion about neutrino-factory magic baselines

Showing 51–86 of 86 results for author: Long, P