Search | arXiv e-print repository

arXiv:2405.19658 [pdf, other]

Transport signatures of phase fluctuations in superconducting qubits

Authors: Maxwell Wisne, Yanpei Deng, Hilal Cansizoglu, Cameron Kopas, Josh Mutus, Venkat Chandrasekhar

Abstract: Josephson junctions supply the nonlinear inductance element in superconducting qubits. In the widely used transmon configuration, where the junction is shunted by a large capacitor, the low charging energy minimizes the sensitivity of the qubit to charge noise while maintaining the necessary anharmonicity to qubit states. We report here low-frequency transport measurements on small standalone junc… ▽ More Josephson junctions supply the nonlinear inductance element in superconducting qubits. In the widely used transmon configuration, where the junction is shunted by a large capacitor, the low charging energy minimizes the sensitivity of the qubit to charge noise while maintaining the necessary anharmonicity to qubit states. We report here low-frequency transport measurements on small standalone junctions and identically fabricated capacitively-shunted junctions that show two distinct features normally attributed to small capacitance junctions near zero bias: reduced switching currents and prominent finite resistance associated with phase diffusion in the current-voltage characteristic. Our transport data reveals the existence of phase fluctuations in transmons arising from intrinsic junction capacitance. △ Less

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2404.05890 [pdf, other]

doi 10.1016/j.physleta.2024.129493

Current dependence of the low bias resistance of small capacitance Josephson junctions

Authors: Venkat Chandrasekhar

Abstract: The dc current-voltage characteristics of small Josephson junctions reveal features that are not observed in larger junctions, in particular, a switch to the finite voltage state at current values much less than the expected critical current of the junction and a finite resistance in the nominally superconducting regime. Both phenomena are due to the increased sensitivity to noise associated with… ▽ More The dc current-voltage characteristics of small Josephson junctions reveal features that are not observed in larger junctions, in particular, a switch to the finite voltage state at current values much less than the expected critical current of the junction and a finite resistance in the nominally superconducting regime. Both phenomena are due to the increased sensitivity to noise associated with the small capacitance of the Josephson junction and have been extensively studied a few decades ago. Here I focus on the current bias dependence of the differential resistance of the junction at low current bias in the nominally superconducting regime, using a quantum Langevin equation approach that enables a physically transparent incorporation of the noise environment of the junction. A similar approach might be useful in modeling the sensitivity of superconducting qubits to noise in the microwave regime. △ Less

Submitted 8 April, 2024; originally announced April 2024.

Journal ref: Physics Letters A, 129493 (2024)

arXiv:2403.06022 [pdf, other]

doi 10.1063/5.0189956

Intrinsic magnetism in KTaO$_3$ heterostructures

Authors: Patrick W. Krantz, Alexander Tyner, Pallab Goswami, Venkat Chandrasekhar

Abstract: There has been intense recent interest in the two-dimensional electron gases (2DEGs) that form at the surfaces and interfaces of KTaO$_3$ (KTO), with the discovery of superconductivity at temperatures significantly higher than those of similar 2DEGs based on SrTiO$_3$ (STO). Like STO heterostructures, these KTO 2DEGs are formed by depositing an overlayer on top of appropriately prepared KTO surfac… ▽ More There has been intense recent interest in the two-dimensional electron gases (2DEGs) that form at the surfaces and interfaces of KTaO$_3$ (KTO), with the discovery of superconductivity at temperatures significantly higher than those of similar 2DEGs based on SrTiO$_3$ (STO). Like STO heterostructures, these KTO 2DEGs are formed by depositing an overlayer on top of appropriately prepared KTO surfaces. Some of these overlayers are magnetic, and the resulting 2DEGs show signatures of this magnetism, including hysteresis in the magnetoresistance (MR). Here we show that KTO 2DEGs fabricated by depositing AlO$_x$ on top of KTO also show hysteretic MR, indicative of long range magnetic order, even though the samples nominally contain no intrinsic magnetic elements. The hysteresis appears in both the transverse and longitudinal resistance in magnetic fields both perpendicular to and in the plane of the 2DEG. The hysteretic MR has different characteristic fields and shapes for surfaces of different crystal orientations, and vanishes above a few Kelvin. Density functional theory (DFT) calculations indicate that the magnetism likely arises from Ta$^{4+}$ local moments created in the presence of oxygen vacancies. △ Less

Submitted 9 March, 2024; originally announced March 2024.

Comments: 5 pages, 4 figures, supplemental materials. arXiv admin note: text overlap with arXiv:2209.10534

Journal ref: Appl. Phys. Lett. 26 February 2024; 124 (9): 093102

arXiv:2210.12146 [pdf, other]

Nonlocal Differential Resistance in AlO$_x$/KTaO$_3$ Heterostructures

Authors: Patrick W Krantz, Venkat Chandrasekhar

Abstract: Local and nonlocal differential resistance measurements on Hall bars defined in AlO$_x$/KTaO$_3$ heterostructures show anomalous behavior that depends on the crystal orientation and the applied back gate voltage. The local differential resistance is asymmetric in the dc bias current, with an antisymmetric component that grows with decreasing gate voltage. More surprisingly, a large nonlocal differ… ▽ More Local and nonlocal differential resistance measurements on Hall bars defined in AlO$_x$/KTaO$_3$ heterostructures show anomalous behavior that depends on the crystal orientation and the applied back gate voltage. The local differential resistance is asymmetric in the dc bias current, with an antisymmetric component that grows with decreasing gate voltage. More surprisingly, a large nonlocal differential resistance is observed that extends between measurement probes that are separated by 100s of microns. The potential source of this anomalous behavior is discussed. △ Less

Submitted 21 October, 2022; originally announced October 2022.

arXiv:2209.10534 [pdf, other]

Colossal Spontaneous Hall Effect and Emergent Magnetism in KTaO$_3$ Two-Dimensional Electron Gases

Authors: Patrick Krantz, Alex Tyner, Pallab Goswami, Venkat Chandrasekhar

Abstract: There has been intense recent interest in the two-dimensional electron gases (2DEGs) that form at the surfaces and interfaces of KTaO$_3$ (KTO), with the discovery of superconductivity at temperatures significantly higher than those of similar 2DEGs based on SrTiO$_3$ (STO). Here we demonstrate that KTO 2DEGs fabricated under conditions that suppress the superconductivity show a large spontaneous… ▽ More There has been intense recent interest in the two-dimensional electron gases (2DEGs) that form at the surfaces and interfaces of KTaO$_3$ (KTO), with the discovery of superconductivity at temperatures significantly higher than those of similar 2DEGs based on SrTiO$_3$ (STO). Here we demonstrate that KTO 2DEGs fabricated under conditions that suppress the superconductivity show a large spontaneous Hall effect at low temperatures. The transverse response is asymmetric in an applied perpendicular magnetic field and becomes hysteretic at millikelvin temperatures. The hysteresis is due to long range magnetic order arising from local Ta$^{4+}$ moments. However, the most striking features of the data are the asymmetry of the transverse response and the large spontaneous transverse resistance at zero field, which can be a significant fraction of the longitudinal resistance and depends on crystal orientation. Both effects are due to the presence of a dominant contribution to the transverse response that is symmetric in perpendicular field, suggesting that its origin is topological in nature. We argue that this contribution arises from Berry curvature dipoles coupled with nonequilibrium conditions induced by the measuring current. △ Less

Submitted 2 February, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

Comments: 6 Pages of main text with 6 figures, includes supplementary materials

arXiv:2209.04743 [pdf, other]

doi 10.1063/5.0125708

Probing the topological band structure of diffusive multiterminal Josephson junction devices with conductance measurements

Authors: Venkat Chandrasekhar

Abstract: The energy of an Andreev bound state in a clean normal metal in contact with two superconductors disperses with the difference $Δφ$ in the superconducting phase between the superconductors in much the same way as the energies of electrons in a one-dimensional crystal disperse with the crystal momentum $k$ of the electrons. A normal metal with $n$ superconductors maps on to a $n-1$ dimensional crys… ▽ More The energy of an Andreev bound state in a clean normal metal in contact with two superconductors disperses with the difference $Δφ$ in the superconducting phase between the superconductors in much the same way as the energies of electrons in a one-dimensional crystal disperse with the crystal momentum $k$ of the electrons. A normal metal with $n$ superconductors maps on to a $n-1$ dimensional crystal, each dimension corresponding to the phase difference $φ_i$ between a specific pair of superconductors. The resulting band structure as a function of the phase differences $\{Δφ_i\}$ has been proposed to have a topological nature, with gapped regions characterized by different Chern numbers separated by regions where the gap in the quasiparticle spectrum closes. A similar complex evolution of the quasiparticle spectrum with $\{Δφ_i\}$ has also been predicted for diffusive normal metals in contact with multiple superconductors. Here we show that the variation of the density of states at the Fermi energy of such a system can be directly probed by relatively simple conductance measurements, allowing rapid characterization of the energy spectrum. △ Less

Submitted 10 September, 2022; originally announced September 2022.

Comments: 4 pages, 4 figures

Journal ref: Appl. Phys. Lett. 121, 222601 (2022)

arXiv:2207.13125 [pdf, other]

doi 10.1063/5.0119932

Characterization of Nb films for superconducting qubits using phase boundary measurements

Authors: Kevin M. Ryan, Carlos G. Torres-Castanedo, Dominic P. Goronzy, David A. Garcia Wetter, Matthew J Reagor, Mark Field, Cameron J Kopas, Jayss Marshall, Michael J. Bedzyk, Mark C. Hersam, Venkat Chandrasekhar

Abstract: Continued advances in superconducting qubit performance require more detailed understandings of the many sources of decoherence. Within these devices, two-level systems arise due to defects, interfaces, and grain boundaries, and are thought to be a major source of qubit decoherence at millikelvin temperatures. In addition to Al, Nb is a commonly used metalization layer for superconducting qubits.… ▽ More Continued advances in superconducting qubit performance require more detailed understandings of the many sources of decoherence. Within these devices, two-level systems arise due to defects, interfaces, and grain boundaries, and are thought to be a major source of qubit decoherence at millikelvin temperatures. In addition to Al, Nb is a commonly used metalization layer for superconducting qubits. Consequently, a significant effort is required to develop and qualify processes that mitigate defects in Nb films. As the fabrication of complete superconducting qubits and their characterization at millikelvin temperatures is a time and resource intensive process, it is desirable to have measurement tools that can rapidly characterize the properties of films and evaluate different treatments. Here we show that measurements of the variation of the superconducting critical temperature $T_c$ with an applied external magnetic field $H$ (of the phase boundary $T_c - H$) performed with very high resolution show features that are directly correlated with the structure of the Nb films. In combination with x-ray diffraction measurements, we show that one can even distinguish variations quality and crystal orientation of the grains in a Nb film by small but reproducible changes in the measured superconducting phase boundary. △ Less

Submitted 8 August, 2022; v1 submitted 26 July, 2022; originally announced July 2022.

arXiv:2101.04859 [pdf]

A*HAR: A New Benchmark towards Semi-supervised learning for Class-imbalanced Human Activity Recognition

Authors: Govind Narasimman, Kangkang Lu, Arun Raja, Chuan Sheng Foo, Mohamed Sabry Aly, Jie Lin, Vijay Chandrasekhar

Abstract: Despite the vast literature on Human Activity Recognition (HAR) with wearable inertial sensor data, it is perhaps surprising that there are few studies investigating semisupervised learning for HAR, particularly in a challenging scenario with class imbalance problem. In this work, we present a new benchmark, called A*HAR, towards semisupervised learning for class-imbalanced HAR. We evaluate state-… ▽ More Despite the vast literature on Human Activity Recognition (HAR) with wearable inertial sensor data, it is perhaps surprising that there are few studies investigating semisupervised learning for HAR, particularly in a challenging scenario with class imbalance problem. In this work, we present a new benchmark, called A*HAR, towards semisupervised learning for class-imbalanced HAR. We evaluate state-of-the-art semi-supervised learning method on A*HAR, by combining Mean Teacher and Convolutional Neural Network. Interestingly, we find that Mean Teacher boosts the overall performance when training the classifier with fewer labelled samples and a large amount of unlabeled samples, but the classifier falls short in handling unbalanced activities. These findings lead to an interesting open problem, i.e., development of semi-supervised HAR algorithms that are class-imbalance aware without any prior knowledge on the class distribution for unlabeled samples. The dataset and benchmark evaluation are released at https://github.com/I2RDL2/ASTAR-HAR for future research. △ Less

Submitted 12 January, 2021; originally announced January 2021.

Comments: 5 pages, 3 figures

arXiv:2011.06667 [pdf, other]

doi 10.1103/PhysRevB.104.064503

Nonlocal superconducting quantum interference device

Authors: Taewan Noh, Andrew Kindseth, Venkat Chandrasekhar

Abstract: Superconducting quantum interference devices (SQUIDs) that incorporate two superconductor/insulator/superconductor (SIS) Josephson junctions in a closed loop form the core of some of the most sensitive detectors of magnetic and electric fields currently available. SQUIDs in these applications are typically operated with a finite voltage which generates microwave radiation through the ac Josephson… ▽ More Superconducting quantum interference devices (SQUIDs) that incorporate two superconductor/insulator/superconductor (SIS) Josephson junctions in a closed loop form the core of some of the most sensitive detectors of magnetic and electric fields currently available. SQUIDs in these applications are typically operated with a finite voltage which generates microwave radiation through the ac Josephson effect. This radiation may impact the system being measured. We describe here a SQUID in which the Josephson junctions are formed from strips of normal metal (N) in good electrical contact with the superconductor (S). Such SNS SQUIDs can be operated under a finite voltage bias with performance comparable or potentially better than conventional SIS SQUIDs. However, they also permit a novel mode of operation that is based on the unusual interplay of quasiparticle currents and supercurrents in the normal metal of the Josephson junction. The new method allows measurements of the flux dependence of the critical current of the SNS SQUID without applying a finite voltage bias across the SNS junction, enabling sensitive flux detection without generating microwave radiation. △ Less

Submitted 4 August, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

Journal ref: Phys. Rev. B 104, 064503 (2021)

arXiv:2011.03860 [pdf, other]

doi 10.1103/PhysRevLett.127.036801

Observation of zero-field transverse resistance in AlO$_x$/SrTiO$_3$ interface devices

Authors: P. W. Krantz, V. Chandrasekhar

Abstract: Domain walls in AlO$_x$/SrTiO$_3$ (ALO/STO) interface devices at low temperatures give a rise to a new signature in the electrical transport of two-dimensional carrier gases formed at the surfaces or interfaces of STO-based heterostructures: a finite transverse resistance observed in Hall bars in zero external magnetic field. This transverse resistance depends on the local domain wall configuratio… ▽ More Domain walls in AlO$_x$/SrTiO$_3$ (ALO/STO) interface devices at low temperatures give a rise to a new signature in the electrical transport of two-dimensional carrier gases formed at the surfaces or interfaces of STO-based heterostructures: a finite transverse resistance observed in Hall bars in zero external magnetic field. This transverse resistance depends on the local domain wall configuration and hence changes with temperature, gate voltage, thermal cycling and position along the sample, and can even change sign as a function of these parameters. The transverse resistance is observed below $\simeq$ 70 K but grows and changes significantly below $\simeq$40 K, the temperature at which the domain walls become increasingly polar. Surprisingly, the transverse resistance is much larger in (111) oriented heterostructures in comparison to (001) oriented heterostructures. Measurements of the capacitance between the conducting interface and an electrode applied to the substrate, which reflect the dielectric constant of the STO, indicate that this difference may be related to the greater variation of the temperature dependent dielectric constant with electric field when the electric field is applied in the [111] direction. The finite transverse resistance can be explained inhomogeneous current flow due to the preferential transport of current along domain walls that are not collinear with the nominal direction of the injected current. △ Less

Submitted 7 November, 2020; originally announced November 2020.

Comments: 6 pages, 4 figures, supplementary

Journal ref: Phys. Rev. Lett. 127, 036801 (2021)

arXiv:2007.05888 [pdf]

A Mach-Zehnder interferometer based tuning fork microwave impedance microscope

Authors: Z. Liu, P. W. Krantz, V. Chandrasekhar

Abstract: We describe here the implementation of an interferometer-based microwave impedance microscope on a home-built tuning-fork based scanning probe microscope (SPM). Tuning-fork based SPMs, requiring only two electrical contacts for self-actuation and self-detection of the tuning fork oscillation, are especially well suited to operation in extreme environments such as low temperatures, high magnetic fi… ▽ More We describe here the implementation of an interferometer-based microwave impedance microscope on a home-built tuning-fork based scanning probe microscope (SPM). Tuning-fork based SPMs, requiring only two electrical contacts for self-actuation and self-detection of the tuning fork oscillation, are especially well suited to operation in extreme environments such as low temperatures, high magnetic fields or restricted geometries where the optical components required for conventional detection of cantilever deflection would be difficult to introduce. Most existing and commercially available systems rely on optical detection of the deflection of specially designed microwave cantilevers, limiting their application. A tuning-fork based microwave impedance microscope with a resonant cavity near the tip was recently implemented: we report here an enhancement that incorporates a microwave interferometer, which affords better signal to noise as well as wider tunability in terms of microwave frequency. △ Less

Submitted 11 July, 2020; originally announced July 2020.

arXiv:2007.04756 [pdf, other]

Learning to Prune Deep Neural Networks via Reinforcement Learning

Authors: Manas Gupta, Siddharth Aravindan, Aleksandra Kalisz, Vijay Chandrasekhar, Lin Jie

Abstract: This paper proposes PuRL - a deep reinforcement learning (RL) based algorithm for pruning neural networks. Unlike current RL based model compression approaches where feedback is given only at the end of each episode to the agent, PuRL provides rewards at every pruning step. This enables PuRL to achieve sparsity and accuracy comparable to current state-of-the-art methods, while having a much shorte… ▽ More This paper proposes PuRL - a deep reinforcement learning (RL) based algorithm for pruning neural networks. Unlike current RL based model compression approaches where feedback is given only at the end of each episode to the agent, PuRL provides rewards at every pruning step. This enables PuRL to achieve sparsity and accuracy comparable to current state-of-the-art methods, while having a much shorter training cycle. PuRL achieves more than 80% sparsity on the ResNet-50 model while retaining a Top-1 accuracy of 75.37% on the ImageNet dataset. Through our experiments we show that PuRL is also able to sparsify already efficient architectures like MobileNet-V2. In addition to performance characterisation experiments, we also provide a discussion and analysis of the various RL design choices that went into the tuning of the Markov Decision Process underlying PuRL. Lastly, we point out that PuRL is simple to use and can be easily adapted for various architectures. △ Less

Submitted 9 July, 2020; originally announced July 2020.

Comments: Accepted at the ICML 2020 Workshop on Automated Machine Learning (AutoML 2020)

arXiv:2006.14265 [pdf, other]

Empirical Analysis of Overfitting and Mode Drop in GAN Training

Authors: Yasin Yazici, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Vijay Chandrasekhar

Abstract: We examine two key questions in GAN training, namely overfitting and mode drop, from an empirical perspective. We show that when stochasticity is removed from the training procedure, GANs can overfit and exhibit almost no mode drop. Our results shed light on important characteristics of the GAN training procedure. They also provide evidence against prevailing intuitions that GANs do not memorize t… ▽ More We examine two key questions in GAN training, namely overfitting and mode drop, from an empirical perspective. We show that when stochasticity is removed from the training procedure, GANs can overfit and exhibit almost no mode drop. Our results shed light on important characteristics of the GAN training procedure. They also provide evidence against prevailing intuitions that GANs do not memorize the training set, and that mode drop** is mainly due to properties of the GAN objective rather than how it is optimized during training. △ Less

Submitted 25 June, 2020; originally announced June 2020.

Comments: To appear in ICIP2020

arXiv:2004.07543 [pdf, other]

doi 10.1016/j.neucom.2021.10.090

Classify and Generate: Using Classification Latent Space Representations for Image Generations

Authors: Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh, Yasin Yazici, Chuan-Sheng Foo, Vijay Chandrasekhar, ArulMurugan Ambikapathi

Abstract: Utilization of classification latent space information for downstream reconstruction and generation is an intriguing and a relatively unexplored area. In general, discriminative representations are rich in class-specific features but are too sparse for reconstruction, whereas, in autoencoders the representations are dense but have limited indistinguishable class-specific features, making them less… ▽ More Utilization of classification latent space information for downstream reconstruction and generation is an intriguing and a relatively unexplored area. In general, discriminative representations are rich in class-specific features but are too sparse for reconstruction, whereas, in autoencoders the representations are dense but have limited indistinguishable class-specific features, making them less suitable for classification. In this work, we propose a discriminative modeling framework that employs manipulated supervised latent representations to reconstruct and generate new samples belonging to a given class. Unlike generative modeling approaches such as GANs and VAEs that aim to model the data manifold distribution, Representation based Generations (ReGene) directly represent the given data manifold in the classification space. Such supervised representations, under certain constraints, allow for reconstructions and controlled generations using an appropriate decoder without enforcing any prior distribution. Theoretically, given a class, we show that these representations when smartly manipulated using convex combinations retain the same class label. Furthermore, they also lead to the novel generation of visually realistic images. Extensive experiments on datasets of varying resolutions demonstrate that ReGene has higher classification accuracy than existing conditional generative models while being competitive in terms of FID. △ Less

Submitted 14 December, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

Journal ref: Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh et. al., Classify and generate: Using classification latent space representations for image generations, Neurocomputing, Volume 471, 2022, Pages 296-334, ISSN 0925-2312

arXiv:1912.04219 [pdf, other]

FaultNet: Faulty Rail-Valves Detection using Deep Learning and Computer Vision

Authors: Ramanpreet Singh Pahwa, ** Chao, Jestine Paul, Yiqun Li, Ma Tin Lay Nwe, Shudong Xie, Ashish James, Arulmurugan Ambikapathi, Zeng Zeng, Vijay Ramaseshan Chandrasekhar

Abstract: Regular inspection of rail valves and engines is an important task to ensure the safety and efficiency of railway networks around the globe. Over the past decade, computer vision and pattern recognition based techniques have gained traction for such inspection and defect detection tasks. An automated end-to-end trained system can potentially provide a low-cost, high throughput, and cheap alternati… ▽ More Regular inspection of rail valves and engines is an important task to ensure the safety and efficiency of railway networks around the globe. Over the past decade, computer vision and pattern recognition based techniques have gained traction for such inspection and defect detection tasks. An automated end-to-end trained system can potentially provide a low-cost, high throughput, and cheap alternative to manual visual inspection of these components. However, such systems require a huge amount of defective images for networks to understand complex defects. In this paper, a multi-phase deep learning based technique is proposed to perform accurate fault detection of rail-valves. Our approach uses a two-step method to perform high precision image segmentation of rail-valves resulting in pixel-wise accurate segmentation. Thereafter, a computer vision technique is used to identify faulty valves. We demonstrate that the proposed approach results in improved detection performance when compared to current state-of-theart techniques used in fault detection. △ Less

Submitted 8 November, 2019; originally announced December 2019.

Comments: 8 pages, 8 figures, ITSC 2019

Journal ref: IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE - ITSC 2019

arXiv:1909.07541 [pdf, other]

A*3D Dataset: Towards Autonomous Driving in Challenging Environments

Authors: Quang-Hieu Pham, Pierre Sevestre, Ramanpreet Singh Pahwa, Hui**g Zhan, Chun Ho Pang, Yuda Chen, Armin Mustafa, Vijay Chandrasekhar, Jie Lin

Abstract: With the increasing global popularity of self-driving cars, there is an immediate need for challenging real-world datasets for benchmarking and training various computer vision tasks such as 3D object detection. Existing datasets either represent simple scenarios or provide only day-time data. In this paper, we introduce a new challenging A*3D dataset which consists of RGB images and LiDAR data wi… ▽ More With the increasing global popularity of self-driving cars, there is an immediate need for challenging real-world datasets for benchmarking and training various computer vision tasks such as 3D object detection. Existing datasets either represent simple scenarios or provide only day-time data. In this paper, we introduce a new challenging A*3D dataset which consists of RGB images and LiDAR data with significant diversity of scene, time, and weather. The dataset consists of high-density images ($\approx~10$ times more than the pioneering KITTI dataset), heavy occlusions, a large number of night-time frames ($\approx~3$ times the nuScenes dataset), addressing the gaps in the existing datasets to push the boundaries of tasks in autonomous driving research to more challenging highly diverse environments. The dataset contains $39\text{K}$ frames, $7$ classes, and $230\text{K}$ 3D object annotations. An extensive 3D object detection benchmark evaluation on the A*3D dataset for various attributes such as high density, day-time/night-time, gives interesting insights into the advantages and limitations of training and testing 3D object detection in real-world setting. △ Less

Submitted 16 September, 2019; originally announced September 2019.

Comments: A new 3D dataset by I2R, A*STAR for autonomous driving

arXiv:1907.07862 [pdf, other]

Artificial Intelligence-Enabled Cellular Networks: A Critical Path to Beyond-5G and 6G

Authors: Rubayet Shafin, Lingjia Liu, Vikram Chandrasekhar, Hao Chen, Jeffrey Reed, Jianzhong, Zhang

Abstract: Mobile Network Operators (MNOs) are in process of overlaying their conventional macro cellular networks with shorter range cells such as outdoor pico cells. The resultant increase in network complexity creates substantial overhead in terms of operating expenses, time, and labor for their planning and management. Artificial intelligence (AI) offers the potential for MNOs to operate their networks i… ▽ More Mobile Network Operators (MNOs) are in process of overlaying their conventional macro cellular networks with shorter range cells such as outdoor pico cells. The resultant increase in network complexity creates substantial overhead in terms of operating expenses, time, and labor for their planning and management. Artificial intelligence (AI) offers the potential for MNOs to operate their networks in a more organic and cost-efficient manner. We argue that deploying AI in 5G and Beyond will require surmounting significant technical barriers in terms of robustness, performance, and complexity. We outline future research directions, identify top 5 challenges, and present a possible roadmap to realize the vision of AI-enabled cellular networks for Beyond-5G and 6G. △ Less

Submitted 17 July, 2019; originally announced July 2019.

Comments: 7 pages, 3 figures, 1 table

arXiv:1902.03444 [pdf, other]

Venn GAN: Discovering Commonalities and Particularities of Multiple Distributions

Authors: Yasin Yazıcı, Bruno Lecouat, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

Abstract: We propose a GAN design which models multiple distributions effectively and discovers their commonalities and particularities. Each data distribution is modeled with a mixture of $K$ generator distributions. As the generators are partially shared between the modeling of different true data distributions, shared ones captures the commonality of the distributions, while non-shared ones capture uniqu… ▽ More We propose a GAN design which models multiple distributions effectively and discovers their commonalities and particularities. Each data distribution is modeled with a mixture of $K$ generator distributions. As the generators are partially shared between the modeling of different true data distributions, shared ones captures the commonality of the distributions, while non-shared ones capture unique aspects of them. We show the effectiveness of our method on various datasets (MNIST, Fashion MNIST, CIFAR-10, Omniglot, CelebA) with compelling results. △ Less

Submitted 9 February, 2019; originally announced February 2019.

arXiv:1901.10074 [pdf, other]

CaRENets: Compact and Resource-Efficient CNN for Homomorphic Inference on Encrypted Medical Images

Authors: ** Chao, Ahmad Al Badawi, Balagopal Unnikrishnan, Jie Lin, Chan Fook Mun, James M. Brown, J. Peter Campbell, Michael Chiang, Jayashree Kalpathy-Cramer, Vijay Ramaseshan Chandrasekhar, Pavitra Krishnaswamy, Khin Mi Mi Aung

Abstract: Convolutional neural networks (CNNs) have enabled significant performance leaps in medical image classification tasks. However, translating neural network models for clinical applications remains challenging due to data privacy issues. Fully Homomorphic Encryption (FHE) has the potential to address this challenge as it enables the use of CNNs on encrypted images. However, current HE technology pos… ▽ More Convolutional neural networks (CNNs) have enabled significant performance leaps in medical image classification tasks. However, translating neural network models for clinical applications remains challenging due to data privacy issues. Fully Homomorphic Encryption (FHE) has the potential to address this challenge as it enables the use of CNNs on encrypted images. However, current HE technology poses immense computational and memory overheads, particularly for high-resolution images such as those seen in the clinical context. We present CaRENets: Compact and Resource-Efficient CNNs for high performance and resource-efficient inference on high-resolution encrypted images in practical applications. At the core, CaRENets comprises a new FHE compact packing scheme that is tightly integrated with CNN functions. CaRENets offers dual advantages of memory efficiency (due to compact packing of images and CNN activations) and inference speed (due to the reduction in the number of ciphertexts created and the associated mathematical operations) over standard interleaved packing schemes. We apply CaRENets to perform homomorphic abnormality detection with 80-bit security level in two clinical conditions - Retinopathy of Prematurity (ROP) and Diabetic Retinopathy (DR). The ROP dataset comprises 96 x 96 grayscale images, while the DR dataset comprises 256 x 256 RGB images. We demonstrate over 45x improvement in memory efficiency and 4-5x speedup in inference over the interleaved packing schemes. As our approach enables memory-efficient low-latency HE inference without imposing additional communication burden, it has implications for practical and secure deep learning inference in clinical imaging. △ Less

Submitted 28 January, 2019; originally announced January 2019.

arXiv:1901.02064 [pdf, other]

Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks

Authors: Xue Geng, Jie Fu, Bin Zhao, Jie Lin, Mohamed M. Sabry Aly, Christopher Pal, Vijay Chandrasekhar

Abstract: This paper addresses a challenging problem - how to reduce energy consumption without incurring performance drop when deploying deep neural networks (DNNs) at the inference stage. In order to alleviate the computation and storage burdens, we propose a novel dataflow-based joint quantization approach with the hypothesis that a fewer number of quantization operations would incur less information los… ▽ More This paper addresses a challenging problem - how to reduce energy consumption without incurring performance drop when deploying deep neural networks (DNNs) at the inference stage. In order to alleviate the computation and storage burdens, we propose a novel dataflow-based joint quantization approach with the hypothesis that a fewer number of quantization operations would incur less information loss and thus improve the final performance. It first introduces a quantization scheme with efficient bit-shifting and rounding operations to represent network parameters and activations in low precision. Then it restructures the network architectures to form unified modules for optimization on the quantized model. Extensive experiments on ImageNet and KITTI validate the effectiveness of our model, demonstrating that state-of-the-art results for various tasks can be achieved by this quantized model. Besides, we designed and synthesized an RTL model to measure the hardware costs among various quantization methods. For each quantization operation, it reduces area cost by about 15 times and energy consumption by about 9 times, compared to a strong baseline. △ Less

Submitted 4 January, 2019; originally announced January 2019.

Journal ref: Data Compression Conference 2019

arXiv:1812.07832 [pdf, other]

Semi-Supervised Deep Learning for Abnormality Classification in Retinal Images

Authors: Bruno Lecouat, Ken Chang, Chuan-Sheng Foo, Balagopal Unnikrishnan, James M. Brown, Houssam Zenati, Andrew Beers, Vijay Chandrasekhar, Jayashree Kalpathy-Cramer, Pavitra Krishnaswamy

Abstract: Supervised deep learning algorithms have enabled significant performance gains in medical image classification tasks. But these methods rely on large labeled datasets that require resource-intensive expert annotation. Semi-supervised generative adversarial network (GAN) approaches offer a means to learn from limited labeled data alongside larger unlabeled datasets, but have not been applied to dis… ▽ More Supervised deep learning algorithms have enabled significant performance gains in medical image classification tasks. But these methods rely on large labeled datasets that require resource-intensive expert annotation. Semi-supervised generative adversarial network (GAN) approaches offer a means to learn from limited labeled data alongside larger unlabeled datasets, but have not been applied to discern fine-scale, sparse or localized features that define medical abnormalities. To overcome these limitations, we propose a patch-based semi-supervised learning approach and evaluate performance on classification of diabetic retinopathy from funduscopic images. Our semi-supervised approach achieves high AUC with just 10-20 labeled training images, and outperforms the supervised baselines by upto 15% when less than 30% of the training dataset is labeled. Further, our method implicitly enables interpretation of the SSL predictions. As this approach enables good accuracy, resolution and interpretability with lower annotation burden, it sets the pathway for scalable applications of deep learning in clinical imaging. △ Less

Submitted 19 December, 2018; originally announced December 2018.

Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

Report number: ML4H/2018/227

arXiv:1812.02288 [pdf, other]

Adversarially Learned Anomaly Detection

Authors: Houssam Zenati, Manon Romain, Chuan Sheng Foo, Bruno Lecouat, Vijay Ramaseshan Chandrasekhar

Abstract: Anomaly detection is a significant and hence well-studied problem. However, develo** effective anomaly detection methods for complex and high-dimensional data remains a challenge. As Generative Adversarial Networks (GANs) are able to model the complex high-dimensional distributions of real-world data, they offer a promising approach to address this challenge. In this work, we propose an anomaly… ▽ More Anomaly detection is a significant and hence well-studied problem. However, develo** effective anomaly detection methods for complex and high-dimensional data remains a challenge. As Generative Adversarial Networks (GANs) are able to model the complex high-dimensional distributions of real-world data, they offer a promising approach to address this challenge. In this work, we propose an anomaly detection method, Adversarially Learned Anomaly Detection (ALAD) based on bi-directional GANs, that derives adversarially learned features for the anomaly detection task. ALAD then uses reconstruction errors based on these adversarially learned features to determine if a data sample is anomalous. ALAD builds on recent advances to ensure data-space and latent-space cycle-consistencies and stabilize GAN training, which results in significantly improved anomaly detection performance. ALAD achieves state-of-the-art performance on a range of image and tabular datasets while being several hundred-fold faster at test time than the only published GAN-based method. △ Less

Submitted 5 December, 2018; originally announced December 2018.

Comments: In the Proceedings of the 20th IEEE International Conference on Data Mining (ICDM), 2018

arXiv:1811.12065 [pdf, other]

TEA-DNN: the Quest for Time-Energy-Accuracy Co-optimized Deep Neural Networks

Authors: Lile Cai, Anne-Maelle Barneche, Arthur Herbout, Chuan Sheng Foo, Jie Lin, Vijay Ramaseshan Chandrasekhar, Mohamed M. Sabry

Abstract: Embedded deep learning platforms have witnessed two simultaneous improvements. First, the accuracy of convolutional neural networks (CNNs) has been significantly improved through the use of automated neural-architecture search (NAS) algorithms to determine CNN structure. Second, there has been increasing interest in develo** hardware accelerators for CNNs that provide improved inference performa… ▽ More Embedded deep learning platforms have witnessed two simultaneous improvements. First, the accuracy of convolutional neural networks (CNNs) has been significantly improved through the use of automated neural-architecture search (NAS) algorithms to determine CNN structure. Second, there has been increasing interest in develo** hardware accelerators for CNNs that provide improved inference performance and energy consumption compared to GPUs. Such embedded deep learning platforms differ in the amount of compute resources and memory-access bandwidth, which would affect performance and energy consumption of CNNs. It is therefore critical to consider the available hardware resources in the network architecture search. To this end, we introduce TEA-DNN, a NAS algorithm targeting multi-objective optimization of execution time, energy consumption, and classification accuracy of CNN workloads on embedded architectures. TEA-DNN leverages energy and execution time measurements on embedded hardware when exploring the Pareto-optimal curves across accuracy, execution time, and energy consumption and does not require additional effort to model the underlying hardware. We apply TEA-DNN for image classification on actual embedded platforms (NVIDIA Jetson TX2 and Intel Movidius Neural Compute Stick). We highlight the Pareto-optimal operating points that emphasize the necessity to explicitly consider hardware characteristics in the search process. To the best of our knowledge, this is the most comprehensive study of Pareto-optimal models across a range of hardware platforms using actual measurements on hardware to obtain objective values. △ Less

Submitted 21 October, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

Comments: Accepted by ISLPED2019

arXiv:1811.06231 [pdf, other]

Graph Convolutional Neural Networks for Polymers Property Prediction

Authors: Minggang Zeng, Jatin Nitin Kumar, Zeng Zeng, Ramasamy Savitha, Vijay Ramaseshan Chandrasekhar, Kedar Hippalgaonkar

Abstract: A fast and accurate predictive tool for polymer properties is demanding and will pave the way to iterative inverse design. In this work, we apply graph convolutional neural networks (GCNN) to predict the dielectric constant and energy bandgap of polymers. Using density functional theory (DFT) calculated properties as the ground truth, GCNN can achieve remarkable agreement with DFT results. Moreove… ▽ More A fast and accurate predictive tool for polymer properties is demanding and will pave the way to iterative inverse design. In this work, we apply graph convolutional neural networks (GCNN) to predict the dielectric constant and energy bandgap of polymers. Using density functional theory (DFT) calculated properties as the ground truth, GCNN can achieve remarkable agreement with DFT results. Moreover, we show that GCNN outperforms other machine learning algorithms. Our work proves that GCNN relies only on morphological data of polymers and removes the requirement for complicated hand-crafted descriptors, while still offering accuracy in fast predictions. △ Less

Submitted 15 November, 2018; originally announced November 2018.

Comments: Accepted for NIPS 2018 Workshop on Machine Learning for Molecules and Materials

arXiv:1811.06219 [pdf, other]

Predicting thermoelectric properties from crystal graphs and material descriptors - first application for functional materials

Authors: Leo Laugier, Daniil Bash, Jose Recatala, Hong Kuan Ng, Savitha Ramasamy, Chuan-Sheng Foo, Vijay R Chandrasekhar, Kedar Hippalgaonkar

Abstract: We introduce the use of Crystal Graph Convolutional Neural Networks (CGCNN), Fully Connected Neural Networks (FCNN) and XGBoost to predict thermoelectric properties. The dataset for the CGCNN is independent of Density Functional Theory (DFT) and only relies on the crystal and atomic information, while that for the FCNN is based on a rich attribute list mined from Materialsproject.org. The results… ▽ More We introduce the use of Crystal Graph Convolutional Neural Networks (CGCNN), Fully Connected Neural Networks (FCNN) and XGBoost to predict thermoelectric properties. The dataset for the CGCNN is independent of Density Functional Theory (DFT) and only relies on the crystal and atomic information, while that for the FCNN is based on a rich attribute list mined from Materialsproject.org. The results show that the optimized FCNN is three layer deep and is able to predict the scattering-time independent thermoelectric powerfactor much better than the CGCNN (or XGBoost), suggesting that bonding and density of states descriptors informed from materials science knowledge obtained partially from DFT are vital to predict functional properties. △ Less

Submitted 15 November, 2018; originally announced November 2018.

arXiv:1811.04595 [pdf, other]

Holistic Multi-modal Memory Network for Movie Question Answering

Authors: Anran Wang, Anh Tuan Luu, Chuan-Sheng Foo, Hongyuan Zhu, Yi Tay, Vijay Chandrasekhar

Abstract: Answering questions according to multi-modal context is a challenging problem as it requires a deep integration of different data sources. Existing approaches only employ partial interactions among data sources in one attention hop. In this paper, we present the Holistic Multi-modal Memory Network (HMMN) framework which fully considers the interactions between different input sources (multi-modal… ▽ More Answering questions according to multi-modal context is a challenging problem as it requires a deep integration of different data sources. Existing approaches only employ partial interactions among data sources in one attention hop. In this paper, we present the Holistic Multi-modal Memory Network (HMMN) framework which fully considers the interactions between different input sources (multi-modal context, question) in each hop. In addition, it takes answer choices into consideration during the context retrieval stage. Therefore, the proposed framework effectively integrates multi-modal context, question, and answer information, which leads to more informative context retrieved for question answering. Our HMMN framework achieves state-of-the-art accuracy on MovieQA dataset. Extensive ablation studies show the importance of holistic reasoning and contributions of different attention strategies. △ Less

Submitted 12 November, 2018; originally announced November 2018.

arXiv:1811.00778 [pdf, other]

Towards the AlexNet Moment for Homomorphic Encryption: HCNN, theFirst Homomorphic CNN on Encrypted Data with GPUs

Authors: Ahmad Al Badawi, ** Chao, Jie Lin, Chan Fook Mun, Jun Jie Sim, Benjamin Hong Meng Tan, Xiao Nan, Khin Mi Mi Aung, Vijay Ramaseshan Chandrasekhar

Abstract: Deep Learning as a Service (DLaaS) stands as a promising solution for cloud-based inference applications. In this setting, the cloud has a pre-learned model whereas the user has samples on which she wants to run the model. The biggest concern with DLaaS is user privacy if the input samples are sensitive data. We provide here an efficient privacy-preserving system by employing high-end technologies… ▽ More Deep Learning as a Service (DLaaS) stands as a promising solution for cloud-based inference applications. In this setting, the cloud has a pre-learned model whereas the user has samples on which she wants to run the model. The biggest concern with DLaaS is user privacy if the input samples are sensitive data. We provide here an efficient privacy-preserving system by employing high-end technologies such as Fully Homomorphic Encryption (FHE), Convolutional Neural Networks (CNNs) and Graphics Processing Units (GPUs). FHE, with its widely-known feature of computing on encrypted data, empowers a wide range of privacy-concerned applications. This comes at high cost as it requires enormous computing power. In this paper, we show how to accelerate the performance of running CNNs on encrypted data with GPUs. We evaluated two CNNs to classify homomorphically the MNIST and CIFAR-10 datasets. Our solution achieved a sufficient security level (> 80 bit) and reasonable classification accuracy (99%) and (77.55%) for MNIST and CIFAR-10, respectively. In terms of latency, we could classify an image in 5.16 seconds and 304.43 seconds for MNIST and CIFAR-10, respectively. Our system can also classify a batch of images (> 8,000) without extra overhead. △ Less

Submitted 18 August, 2020; v1 submitted 2 November, 2018; originally announced November 2018.

arXiv:1808.07272 [pdf, other]

doi 10.1145/3240508.3240713

Deep Adaptive Temporal Pooling for Activity Recognition

Authors: Sibo Song, Ngai-Man Cheung, Vijay Chandrasekhar, Bappaditya Mandal

Abstract: Deep neural networks have recently achieved competitive accuracy for human activity recognition. However, there is room for improvement, especially in modeling long-term temporal importance and determining the activity relevance of different temporal segments in a video. To address this problem, we propose a learnable and differentiable module: Deep Adaptive Temporal Pooling (DATP). DATP applies a… ▽ More Deep neural networks have recently achieved competitive accuracy for human activity recognition. However, there is room for improvement, especially in modeling long-term temporal importance and determining the activity relevance of different temporal segments in a video. To address this problem, we propose a learnable and differentiable module: Deep Adaptive Temporal Pooling (DATP). DATP applies a self-attention mechanism to adaptively pool the classification scores of different video segments. Specifically, using frame-level features, DATP regresses importance of different temporal segments and generates weights for them. Remarkably, DATP is trained using only the video-level label. There is no need of additional supervision except video-level activity class label. We conduct extensive experiments to investigate various input features and different weight models. Experimental results show that DATP can learn to assign large weights to key video segments. More importantly, DATP can improve training of frame-level feature extractor. This is because relevant temporal segments are assigned large weights during back-propagation. Overall, we achieve state-of-the-art performance on UCF101, HMDB51 and Kinetics datasets. △ Less

Submitted 22 August, 2018; originally announced August 2018.

Comments: Accepted by ACM Multimedia 2018

arXiv:1807.04307 [pdf, other]

Manifold regularization with GANs for semi-supervised learning

Authors: Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay Chandrasekhar

Abstract: Generative Adversarial Networks are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating a variant of the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the semi-supervised feature-matching GAN we achieve state-of-the-art results… ▽ More Generative Adversarial Networks are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating a variant of the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the semi-supervised feature-matching GAN we achieve state-of-the-art results for GAN-based semi-supervised learning on CIFAR-10 and SVHN benchmarks, with a method that is significantly easier to implement than competing methods. We also find that manifold regularization improves the quality of generated images, and is affected by the quality of the GAN used to approximate the regularizer. △ Less

Submitted 11 July, 2018; originally announced July 2018.

arXiv:1807.02629 [pdf, other]

Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile

Authors: Panayotis Mertikopoulos, Bruno Lecouat, Houssam Zenati, Chuan-Sheng Foo, Vijay Chandrasekhar, Georgios Piliouras

Abstract: Owing to their connection with generative adversarial networks (GANs), saddle-point problems have recently attracted considerable interest in machine learning and beyond. By necessity, most theoretical guarantees revolve around convex-concave (or even linear) problems; however, making theoretical inroads towards efficient GAN training depends crucially on moving beyond this classic framework. To m… ▽ More Owing to their connection with generative adversarial networks (GANs), saddle-point problems have recently attracted considerable interest in machine learning and beyond. By necessity, most theoretical guarantees revolve around convex-concave (or even linear) problems; however, making theoretical inroads towards efficient GAN training depends crucially on moving beyond this classic framework. To make piecemeal progress along these lines, we analyze the behavior of mirror descent (MD) in a class of non-monotone problems whose solutions coincide with those of a naturally associated variational inequality - a property which we call coherence. We first show that ordinary, "vanilla" MD converges under a strict version of this condition, but not otherwise; in particular, it may fail to converge even in bilinear models with a unique solution. We then show that this deficiency is mitigated by optimism: by taking an "extra-gradient" step, optimistic mirror descent (OMD) converges in all coherent problems. Our analysis generalizes and extends the results of Daskalakis et al. (2018) for optimistic gradient descent (OGD) in bilinear problems, and makes concrete headway for establishing convergence beyond convex-concave games. We also provide stochastic analogues of these results, and we validate our analysis by numerical experiments in a wide array of GAN models (including Gaussian mixture models, as well as the CelebA and CIFAR-10 datasets). △ Less

Submitted 1 October, 2018; v1 submitted 7 July, 2018; originally announced July 2018.

Comments: 26 pages, 14 figures

arXiv:1806.04498 [pdf, other]

The Unusual Effectiveness of Averaging in GAN Training

Authors: Yasin Yazıcı, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

Abstract: We examine two different techniques for parameter averaging in GAN training. Moving Average (MA) computes the time-average of parameters, whereas Exponential Moving Average (EMA) computes an exponentially discounted sum. Whilst MA is known to lead to convergence in bilinear settings, we provide the -- to our knowledge -- first theoretical arguments in support of EMA. We show that EMA converges to… ▽ More We examine two different techniques for parameter averaging in GAN training. Moving Average (MA) computes the time-average of parameters, whereas Exponential Moving Average (EMA) computes an exponentially discounted sum. Whilst MA is known to lead to convergence in bilinear settings, we provide the -- to our knowledge -- first theoretical arguments in support of EMA. We show that EMA converges to limit cycles around the equilibrium with vanishing amplitude as the discount parameter approaches one for simple bilinear games and also enhances the stability of general GAN training. We establish experimentally that both techniques are strikingly effective in the non-convex-concave GAN setting as well. Both improve inception and FID scores on different architectures and for different GAN objectives. We provide comprehensive experimental results across a range of datasets -- mixture of Gaussians, CIFAR-10, STL-10, CelebA and ImageNet -- to demonstrate its effectiveness. We achieve state-of-the-art results on CIFAR-10 and produce clean CelebA face images.\footnote{~The code is available at \url{https://github.com/yasinyazici/EMA_GAN}} △ Less

Submitted 26 February, 2019; v1 submitted 12 June, 2018; originally announced June 2018.

Comments: Published as a conference paper at ICLR 2019

arXiv:1805.08957 [pdf, other]

Semi-Supervised Learning with GANs: Revisiting Manifold Regularization

Authors: Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay R. Chandrasekhar

Abstract: GANS are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the feature-matching GAN of Improved GAN, we achieve state-of-the-art results for GAN-based semi-supervised learning… ▽ More GANS are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the feature-matching GAN of Improved GAN, we achieve state-of-the-art results for GAN-based semi-supervised learning on the CIFAR-10 dataset, with a method that is significantly easier to implement than competing methods. △ Less

Submitted 23 May, 2018; originally announced May 2018.

Comments: Accepted paper

Journal ref: Workshop track - ICLR 2018

arXiv:1803.11246 [pdf]

doi 10.1016/j.joule.2018.05.009

Accelerating Materials Development via Automation, Machine Learning, and High-Performance Computing

Authors: Juan Pablo Correa-Baena, Kedar Hippalgaonkar, Jeroen van Duren, Shaffiq Jaffer, Vijay R. Chandrasekhar, Vladan Stevanovic, Cyrus Wadia, Supratik Guha, Tonio Buonassisi

Abstract: Successful materials innovations can transform society. However, materials research often involves long timelines and low success probabilities, dissuading investors who have expectations of shorter times from bench to business. A combination of emergent technologies could accelerate the pace of novel materials development by 10x or more, aligning the timelines of stakeholders (investors and resea… ▽ More Successful materials innovations can transform society. However, materials research often involves long timelines and low success probabilities, dissuading investors who have expectations of shorter times from bench to business. A combination of emergent technologies could accelerate the pace of novel materials development by 10x or more, aligning the timelines of stakeholders (investors and researchers), markets, and the environment, while increasing return-on-investment. First, tool automation enables rapid experimental testing of candidate materials. Second, high-throughput computing (HPC) concentrates experimental bandwidth on promising compounds by predicting and inferring bulk, interface, and defect-related properties. Third, machine learning connects the former two, where experimental outputs automatically refine theory and help define next experiments. We describe state-of-the-art attempts to realize this vision and identify resource gaps. We posit that over the coming decade, this combination of tools will transform the way we perform materials research. There are considerable first-mover advantages at stake, especially for grand challenges in energy and related fields, including computing, healthcare, urbanization, water, food, and the environment. △ Less

Submitted 20 March, 2018; originally announced March 2018.

Comments: 22 pages, 3 figures

Journal ref: Joule 2 (2018) 1410-1420

arXiv:1803.02043 [pdf, other]

Online Deep Learning: Growing RBM on the fly

Authors: Savitha Ramasamy, Kanagasabai Rajaraman, Pavitra Krishnaswamy, Vijay Chandrasekhar

Abstract: We propose a novel online learning algorithm for Restricted Boltzmann Machines (RBM), namely, the Online Generative Discriminative Restricted Boltzmann Machine (OGD-RBM), that provides the ability to build and adapt the network architecture of RBM according to the statistics of streaming data. The OGD-RBM is trained in two phases: (1) an online generative phase for unsupervised feature representat… ▽ More We propose a novel online learning algorithm for Restricted Boltzmann Machines (RBM), namely, the Online Generative Discriminative Restricted Boltzmann Machine (OGD-RBM), that provides the ability to build and adapt the network architecture of RBM according to the statistics of streaming data. The OGD-RBM is trained in two phases: (1) an online generative phase for unsupervised feature representation at the hidden layer and (2) a discriminative phase for classification. The online generative training begins with zero neurons in the hidden layer, adds and updates the neurons to adapt to statistics of streaming data in a single pass unsupervised manner, resulting in a feature representation best suited to the data. The discriminative phase is based on stochastic gradient descent and associates the represented features to the class labels. We demonstrate the OGD-RBM on a set of multi-category and binary classification problems for data sets having varying degrees of class-imbalance. We first apply the OGD-RBM algorithm on the multi-class MNIST dataset to characterize the network evolution. We demonstrate that the online generative phase converges to a stable, concise network architecture, wherein individual neurons are inherently discriminative to the class labels despite unsupervised training. We then benchmark OGD-RBM performance to other machine learning, neural network and ClassRBM techniques for credit scoring applications using 3 public non-stationary two-class credit datasets with varying degrees of class-imbalance. We report that OGD-RBM improves accuracy by 2.5-3% over batch learning techniques while requiring at least 24%-70% fewer neurons and fewer training samples. This online generative training approach can be extended greedily to multiple layers for training Deep Belief Networks in non-stationary data mining applications without the need for a priori fixed architectures. △ Less

Submitted 6 March, 2018; originally announced March 2018.

Comments: 14 pages, 4 figures, 2 tables

arXiv:1803.00553 [pdf, other]

doi 10.1103/PhysRevB.99.035408

Low temperature magnetoresistance of (111) (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35}$)/SrTiO$_3$

Authors: V. V. Bal, Z. Huang, K. Han, Ariando, T. Venkatesan, V. Chandrasekhar

Abstract: The two dimensional conducting interfaces in SrTiO$_3$-based systems are known to show a variety of coexisting and competing phenomena in a complex phase space. Magnetoresistance measurements, which are typically used to extract information about the various interactions in these systems, must be interpreted with care, since multiple interactions can contribute to the resistivity in a given range… ▽ More The two dimensional conducting interfaces in SrTiO$_3$-based systems are known to show a variety of coexisting and competing phenomena in a complex phase space. Magnetoresistance measurements, which are typically used to extract information about the various interactions in these systems, must be interpreted with care, since multiple interactions can contribute to the resistivity in a given range of magnetic field and temperature. Here we review all the phenomena that can contribute to transport in SrTiO$_3$-based conducting interfaces at low temperatures, and discuss possible ways to distinguish between various phenomena. We apply this analysis to the magnetoresistance data of (111) oriented (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35}$)/STO (LSAT/STO) heterostructures in perpendicular field, and find an excess negative magnetoresistance contribution which cannot be explained by weak localization alone. We argue that contributions from magnetic scattering as well as electron-electron interactions can provide a possible explanation for the observed magnetoresistance. △ Less

Submitted 1 March, 2018; originally announced March 2018.

Comments: 10 pages, 4 figures

Journal ref: Phys. Rev. B 99, 035408 (2019)

arXiv:1802.06222 [pdf, ps, other]

Efficient GAN-Based Anomaly Detection

Authors: Houssam Zenati, Chuan Sheng Foo, Bruno Lecouat, Gaurav Manek, Vijay Ramaseshan Chandrasekhar

Abstract: Generative adversarial networks (GANs) are able to model the complex highdimensional distributions of real-world data, which suggests they could be effective for anomaly detection. However, few works have explored the use of GANs for the anomaly detection task. We leverage recently developed GAN models for anomaly detection, and achieve state-of-the-art performance on image and network intrusion d… ▽ More Generative adversarial networks (GANs) are able to model the complex highdimensional distributions of real-world data, which suggests they could be effective for anomaly detection. However, few works have explored the use of GANs for the anomaly detection task. We leverage recently developed GAN models for anomaly detection, and achieve state-of-the-art performance on image and network intrusion datasets, while being several hundred-fold faster at test time than the only published GAN-based method. △ Less

Submitted 1 May, 2019; v1 submitted 17 February, 2018; originally announced February 2018.

Comments: Updated version of this work is published at ICDM 2018, see arXiv:1812.02288 . Submitted to the ICLR Workshop 2018

arXiv:1711.05322 [pdf, other]

doi 10.1103/PhysRevB.98.085416

Strong spin-orbit coupling and magnetism in (111) (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35})$/SrTiO$_3$

Authors: V. V. Bal, Z. Huang, K. Han, Ariando, T. Venkatesan, V. Chandrasekhar

Abstract: Strong correlations, multiple lattice degrees of freedom, and the ease of do** make complex oxides a source of great research interest. Complex oxide heterointerfaces break inversion symmetry and can host a two dimensional carrier gas, which can display a variety of coexisting and competing phenomena. In the case of heterointerfaces based on SrTiO$_3$, many of these phenomena can be effectively… ▽ More Strong correlations, multiple lattice degrees of freedom, and the ease of do** make complex oxides a source of great research interest. Complex oxide heterointerfaces break inversion symmetry and can host a two dimensional carrier gas, which can display a variety of coexisting and competing phenomena. In the case of heterointerfaces based on SrTiO$_3$, many of these phenomena can be effectively tuned by using an electric gate, due to the large dielectric constant of SrTiO$_3$. Most studies so far have focused on (001) oriented heterostructures; however, (111) oriented heterostructures have recently gained attention due to the possibility of finding exotic physics in these systems due their hexagonal surface crystal symmetry. In this work, we use magnetoresistance to study the evolution of spin-orbit interaction and magnetism in a new system, (111) oriented (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35}$)/SrTiO$_3$. At more positive values of the gate voltage, which correspond to high carrier densities, we find that transport is multiband, and dominated by high mobility carriers with a tendency towards weak localization. At more negative gate voltages, the carrier density is reduced, the high mobility bands are depopulated, and weak antilocalization effects begin to dominate, indicating that spin-orbit interaction becomes stronger. At millikelvin temperatures, and gate voltages corresponding to the strong spin-orbit regime, we observe hysteresis in magnetoresistance, indicative of ferromagnetism in the system. Our results suggest that in the (111) (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35}$)/SrTiO$_3$ system, low mobility carriers which experience strong spin-orbit interactions participate in creating magnetic order in the system. △ Less

Submitted 11 March, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

Comments: 15 pages, 3 figures

Journal ref: Phys. Rev. B 98, 085416 (2018)

arXiv:1711.01714 [pdf, other]

End-to-End Video Classification with Knowledge Graphs

Authors: Fang Yuan, Zhe Wang, Jie Lin, Luis Fernando D'Haro, Kim Jung Jae, Zeng Zeng, Vijay Chandrasekhar

Abstract: Video understanding has attracted much research attention especially since the recent availability of large-scale video benchmarks. In this paper, we address the problem of multi-label video classification. We first observe that there exists a significant knowledge gap between how machines and humans learn. That is, while current machine learning approaches including deep neural networks largely f… ▽ More Video understanding has attracted much research attention especially since the recent availability of large-scale video benchmarks. In this paper, we address the problem of multi-label video classification. We first observe that there exists a significant knowledge gap between how machines and humans learn. That is, while current machine learning approaches including deep neural networks largely focus on the representations of the given data, humans often look beyond the data at hand and leverage external knowledge to make better decisions. Towards narrowing the gap, we propose to incorporate external knowledge graphs into video classification. In particular, we unify traditional "knowledgeless" machine learning models and knowledge graphs in a novel end-to-end framework. The framework is flexible to work with most existing video classification algorithms including state-of-the-art deep models. Finally, we conduct extensive experiments on the largest public video dataset YouTube-8M. The results are promising across the board, improving mean average precision by up to 2.9%. △ Less

Submitted 5 November, 2017; originally announced November 2017.

Comments: 9 pages, 5 figures

arXiv:1708.04809 [pdf, other]

doi 10.1103/PhysRevB.97.041408

Signatures of Electronic Nematicity in (111) LaAlO$_3$/SrTiO$_3$ Interfaces

Authors: S. Davis, Z. Huang, K. Han, Ariando, T. Venkatesan, V. Chandrasekhar

Abstract: Symmetry breaking is a fundamental concept in condensed matter physics whose presence often heralds new phases of matter. For instance, the breaking of time reversal symmetry is traditionally linked to magnetic phases in a material, while the breaking of gauge symmetry can lead to superfluidity/superconductivity. Nematic phases are phases in which rotational symmetry is broken while maintaining tr… ▽ More Symmetry breaking is a fundamental concept in condensed matter physics whose presence often heralds new phases of matter. For instance, the breaking of time reversal symmetry is traditionally linked to magnetic phases in a material, while the breaking of gauge symmetry can lead to superfluidity/superconductivity. Nematic phases are phases in which rotational symmetry is broken while maintaining translational symme- try, and are traditionally associated with liquid crystals. Electronic nematic states where the or- thogonal in-plane crystal directions have different electronic properties have garnered a great deal of attention after their discovery in Sr$_3$Ru$_2$O$_7$, multiple iron based superconductors, and in the superconducting state of CuBiSe. Here we demonstrate the existence of an electronic ne- matic phase in the two-dimensional carrier gas that forms at the (111) LaAlO$_3$ (LAO)/SrTiO$_3$ (STO) interface that onsets at low temperatures, and is tunable by an electric field. △ Less

Submitted 28 August, 2017; v1 submitted 16 August, 2017; originally announced August 2017.

Comments: 6 pages 3 figures

Journal ref: Phys. Rev. B 97, 041408 (2018)

arXiv:1707.05455 [pdf, ps, other]

Pruning Convolutional Neural Networks for Image Instance Retrieval

Authors: Gaurav Manek, Jie Lin, Vijay Chandrasekhar, Lingyu Duan, Sateesh Giduthuri, Xiaoli Li, Tomaso Poggio

Abstract: In this work, we focus on the problem of image instance retrieval with deep descriptors extracted from pruned Convolutional Neural Networks (CNN). The objective is to heavily prune convolutional edges while maintaining retrieval performance. To this end, we introduce both data-independent and data-dependent heuristics to prune convolutional edges, and evaluate their performance across various comp… ▽ More In this work, we focus on the problem of image instance retrieval with deep descriptors extracted from pruned Convolutional Neural Networks (CNN). The objective is to heavily prune convolutional edges while maintaining retrieval performance. To this end, we introduce both data-independent and data-dependent heuristics to prune convolutional edges, and evaluate their performance across various compression rates with different deep descriptors over several benchmark datasets. Further, we present an end-to-end framework to fine-tune the pruned network, with a triplet loss function specially designed for the retrieval task. We show that the combination of heuristic pruning and fine-tuning offers 5x compression rate without considerable loss in retrieval performance. △ Less

Submitted 17 July, 2017; originally announced July 2017.

Comments: 5 pages

arXiv:1707.03029 [pdf, other]

doi 10.1103/PhysRevB.96.134502

Magnetoresistance in the superconducting state at the (111) LaAlO$_3$/SrTiO$_3$ interface

Authors: S. Davis, Z. Huang, K. Han, Ariando, T. Venkatesan, V. Chandrasekhar

Abstract: Condensed matter systems that simultaneously exhibit superconductivity and ferromagnetism are rare due the antagonistic relationship between conventional spin-singlet superconductivity and ferromagnetic order. In materials in which superconductivity and magnetic order is known to coexist (such as some heavy-fermion materials), the superconductivity is thought to be of an unconventional nature. Rec… ▽ More Condensed matter systems that simultaneously exhibit superconductivity and ferromagnetism are rare due the antagonistic relationship between conventional spin-singlet superconductivity and ferromagnetic order. In materials in which superconductivity and magnetic order is known to coexist (such as some heavy-fermion materials), the superconductivity is thought to be of an unconventional nature. Recently, the conducting gas that lives at the interface between the perovskite band insulators LaAlO$_3$ (LAO) and SrTiO$_3$ (STO) has also been shown to host both superconductivity and magnetism. Most previous research has focused on LAO/STO samples in which the interface is in the (001) crystal plane. Relatively little work has focused on the (111) crystal orientation, which has hexagonal symmetry at the interface, and has been predicted to have potentially interesting topological properties, including unconventional superconducting pairing states. Here we report measurements of the magnetoresistance of (111) LAO/STO heterostructures at temperatures at which they are also superconducting. As with the (001) structures, the magnetoresistance is hysteretic, indicating the coexistence of magnetism and superconductivity, but in addition, we find that this magnetoresistance is anisotropic. Such an anisotropic response is completely unexpected in the superconducting state, and suggests that (111) LAO/STO heterostructures may support unconventional superconductivity. △ Less

Submitted 10 July, 2017; originally announced July 2017.

Comments: 6 Pages 4 figures

Journal ref: Phys. Rev. B 96, 134502 (2017)

arXiv:1706.05461 [pdf, other]

Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text

Authors: Zhe Wang, Kingsley Kuan, Mathieu Ravaut, Gaurav Manek, Sibo Song, Yuan Fang, Seokhwan Kim, Nancy Chen, Luis Fernando D'Haro, Luu Anh Tuan, Hongyuan Zhu, Zeng Zeng, Ngai Man Cheung, Georgios Piliouras, Jie Lin, Vijay Chandrasekhar

Abstract: The YouTube-8M video classification challenge requires teams to classify 0.7 million videos into one or more of 4,716 classes. In this Kaggle competition, we placed in the top 3% out of 650 participants using released video and audio features. Beyond that, we extend the original competition by including text information in the classification, making this a truly multi-modal approach with vision, a… ▽ More The YouTube-8M video classification challenge requires teams to classify 0.7 million videos into one or more of 4,716 classes. In this Kaggle competition, we placed in the top 3% out of 650 participants using released video and audio features. Beyond that, we extend the original competition by including text information in the classification, making this a truly multi-modal approach with vision, audio and text. The newly introduced text data is termed as YouTube-8M-Text. We present a classification framework for the joint use of text, visual and audio features, and conduct an extensive set of experiments to quantify the benefit that this additional mode brings. The inclusion of text yields state-of-the-art results, e.g. 86.7% GAP on the YouTube-8M-Text validation dataset. △ Less

Submitted 9 July, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

Comments: 8 pages, Accepted to CVPR'17 Workshop on YouTube-8M Large-Scale Video Understanding

arXiv:1706.00848 [pdf, other]

doi 10.1063/1.4986912

Electrostatic tuning of magnetism at the conducting (111) (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35}$)/SrTiO$_3$ interface

Authors: V. V. Bal, Z. Huang, K. Han, Ariando, T. Venkatesan, V. Chandrasekhar

Abstract: We present measurements of the low temperature electrical transport properties of the two dimensional carrier gas that forms at the interface of $(111)$ (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35}$)/SrTiO$_3$ (LSAT/STO) as a function of applied back gate voltage, $V_g$. As is found in (111) LaAlO$_3$/SrTiO$_3$ interfaces, the low-field Hall coefficient is electron-like, but shows a sharp reductio… ▽ More We present measurements of the low temperature electrical transport properties of the two dimensional carrier gas that forms at the interface of $(111)$ (La$_{0.3}$Sr$_{0.7}$)(Al$_{0.65}$Ta$_{0.35}$)/SrTiO$_3$ (LSAT/STO) as a function of applied back gate voltage, $V_g$. As is found in (111) LaAlO$_3$/SrTiO$_3$ interfaces, the low-field Hall coefficient is electron-like, but shows a sharp reduction in magnitude below $V_g \sim$ 20 V, indicating the presence of hole-like carriers in the system. This same value of $V_g$ correlates approximately with the gate voltage below which the magnetoresistance evolves from nonhysteretic to hysteretic behavior at millikelvin temperatures, signaling the onset of magnetic order in the system. We believe our results can provide insight into the mechanism of magnetism in SrTiO$_3$ based systems. △ Less

Submitted 2 June, 2017; originally announced June 2017.

Comments: 5 pages, 3 figures

Journal ref: Appl. Phys. Lett. 111, 081604 (2017)

arXiv:1705.09435 [pdf, other]

Deep Learning for Lung Cancer Detection: Tackling the Kaggle Data Science Bowl 2017 Challenge

Authors: Kingsley Kuan, Mathieu Ravaut, Gaurav Manek, Huiling Chen, Jie Lin, Babar Nazir, Cen Chen, Tse Chiang Howe, Zeng Zeng, Vijay Chandrasekhar

Abstract: We present a deep learning framework for computer-aided lung cancer diagnosis. Our multi-stage framework detects nodules in 3D lung CAT scans, determines if each nodule is malignant, and finally assigns a cancer probability based on these results. We discuss the challenges and advantages of our framework. In the Kaggle Data Science Bowl 2017, our framework ranked 41st out of 1972 teams. We present a deep learning framework for computer-aided lung cancer diagnosis. Our multi-stage framework detects nodules in 3D lung CAT scans, determines if each nodule is malignant, and finally assigns a cancer probability based on these results. We discuss the challenges and advantages of our framework. In the Kaggle Data Science Bowl 2017, our framework ranked 41st out of 1972 teams. △ Less

Submitted 26 May, 2017; originally announced May 2017.

arXiv:1704.08141 [pdf, other]

Compact Descriptors for Video Analysis: the Emerging MPEG Standard

Authors: Ling-Yu Duan, Vijay Chandrasekhar, Shiqi Wang, Yihang Lou, Jie Lin, Yan Bai, Tiejun Huang, Alex Chichung Kot, Wen Gao

Abstract: This paper provides an overview of the on-going compact descriptors for video analysis standard (CDVA) from the ISO/IEC moving pictures experts group (MPEG). MPEG-CDVA targets at defining a standardized bitstream syntax to enable interoperability in the context of video analysis applications. During the developments of MPEGCDVA, a series of techniques aiming to reduce the descriptor size and impro… ▽ More This paper provides an overview of the on-going compact descriptors for video analysis standard (CDVA) from the ISO/IEC moving pictures experts group (MPEG). MPEG-CDVA targets at defining a standardized bitstream syntax to enable interoperability in the context of video analysis applications. During the developments of MPEGCDVA, a series of techniques aiming to reduce the descriptor size and improve the video representation ability have been proposed. This article describes the new standard that is being developed and reports the performance of these key technical contributions. △ Less

Submitted 26 April, 2017; originally announced April 2017.

Comments: 4 figures, 4 tables

arXiv:1704.01203 [pdf, other]

doi 10.1103/PhysRevB.98.024504

Superconductivity and Frozen Electronic States at the (111) LaAlO$_3$/SrTiO$_3$ Interface

Authors: S. Davis, Z. Huang, K. Han, Ariando, T. Venkatesan, V. Chandrasekhar

Abstract: In spite of Anderson's theorem, disorder is known to affect superconductivity in conventional s-wave superconductors. In most superconductors, the degree of disorder is fixed during sample preparation. Here we report measurements of the superconducting properties of the two-dimensional gas that forms at the interface between LaAlO$_3$ (LAO) and SrTiO$_3$ (STO) in the (111) crystal orientation, a s… ▽ More In spite of Anderson's theorem, disorder is known to affect superconductivity in conventional s-wave superconductors. In most superconductors, the degree of disorder is fixed during sample preparation. Here we report measurements of the superconducting properties of the two-dimensional gas that forms at the interface between LaAlO$_3$ (LAO) and SrTiO$_3$ (STO) in the (111) crystal orientation, a system that permits \emph{in situ} tuning of carrier density and disorder by means of a back gate voltage $V_g$. Like the (001) oriented LAO/STO interface, superconductivity at the (111) LAO/STO interface can be tuned by $V_g$. In contrast to the (001) interface, superconductivity in these (111) samples is anisotropic, being different along different interface crystal directions, consistent with the strong anisotropy already observed other transport properties at the (111) LAO/STO interface. In addition, we find that the (111) interface samples "remember" the backgate voltage $V_F$ at which they are cooled at temperatures near the superconducting transition temperature $T_c$, even if $V_g$ is subsequently changed at lower temperatures. The low energy scale and other characteristics of this memory effect ($<1$ K) distinguish it from charge-trap** effects previously observed in (001) interface samples. △ Less

Submitted 4 April, 2017; originally announced April 2017.

Comments: 6 pages, 5 Figures

Journal ref: Phys. Rev. B 98, 024504 (2018)

arXiv:1701.04923 [pdf, other]

Compression of Deep Neural Networks for Image Instance Retrieval

Authors: Vijay Chandrasekhar, Jie Lin, Qianli Liao, Olivier Morère, Antoine Veillard, Lingyu Duan, Tomaso Poggio

Abstract: Image instance retrieval is the problem of retrieving images from a database which contain the same object. Convolutional Neural Network (CNN) based descriptors are becoming the dominant approach for generating {\it global image descriptors} for the instance retrieval problem. One major drawback of CNN-based {\it global descriptors} is that uncompressed deep neural network models require hundreds… ▽ More Image instance retrieval is the problem of retrieving images from a database which contain the same object. Convolutional Neural Network (CNN) based descriptors are becoming the dominant approach for generating {\it global image descriptors} for the instance retrieval problem. One major drawback of CNN-based {\it global descriptors} is that uncompressed deep neural network models require hundreds of megabytes of storage making them inconvenient to deploy in mobile applications or in custom hardware. In this work, we study the problem of neural network model compression focusing on the image instance retrieval task. We study quantization, coding, pruning and weight sharing techniques for reducing model size for the instance retrieval problem. We provide extensive experimental results on the trade-off between retrieval performance and model size for different types of networks on several data sets providing the most comprehensive study on this topic. We compress models to the order of a few MBs: two orders of magnitude smaller than the uncompressed models while achieving negligible loss in retrieval performance. △ Less

Submitted 17 January, 2017; originally announced January 2017.

Comments: 10 pages, accepted by DCC 2017

arXiv:1611.03040 [pdf]

Transduction between electrical energy and the heat in a carbon nanotube using a voltage-controlled do**

Authors: T. Gupta, I. P. Nevirkovets, V. Chandrasekhar, S. Shafranjuk

Abstract: High electric conductivity ~100 MegaSiemens/m and Seebeck coefficient >200 mkV/K of carbon nanotubes (CNT) make them attractive for a variety of applications. Unfortunately, a high thermal conductivity ~ 3000 W/(m*K) due to the phonon transport limits their capability for transforming energy between the heat and electricity. Here we show that increasing the charge carrier concentrations not only l… ▽ More High electric conductivity ~100 MegaSiemens/m and Seebeck coefficient >200 mkV/K of carbon nanotubes (CNT) make them attractive for a variety of applications. Unfortunately, a high thermal conductivity ~ 3000 W/(m*K) due to the phonon transport limits their capability for transforming energy between the heat and electricity. Here we show that increasing the charge carrier concentrations not only leads to an increase of both electric conductivity and Seebeck coeffcient, but also causes a substantial suppression of the thermal conductivity due to intensifying the phonon-electron collisions. A strong transduction effect corresponding to an effective electron temperature change ~115 K was observed in a CNT device, where the local gate electrodes have controlled the charge do** in the opposite ends. Transduction between the heat and the energy of the electron subsystem corresponds to an impressive figure of merit cold ZT ~ 6 and the transduced power density P ~ 80kW/cm2. △ Less

Submitted 9 November, 2016; originally announced November 2016.

Comments: 27 pages, 5 figures

arXiv:1603.04595 [pdf, other]

Nested Invariance Pooling and RBM Hashing for Image Instance Retrieval

Authors: Olivier Morère, Jie Lin, Antoine Veillard, Vijay Chandrasekhar, Tomaso Poggio

Abstract: The goal of this work is the computation of very compact binary hashes for image instance retrieval. Our approach has two novel contributions. The first one is Nested Invariance Pooling (NIP), a method inspired from i-theory, a mathematical theory for computing group invariant transformations with feed-forward neural networks. NIP is able to produce compact and well-performing descriptors with vis… ▽ More The goal of this work is the computation of very compact binary hashes for image instance retrieval. Our approach has two novel contributions. The first one is Nested Invariance Pooling (NIP), a method inspired from i-theory, a mathematical theory for computing group invariant transformations with feed-forward neural networks. NIP is able to produce compact and well-performing descriptors with visual representations extracted from convolutional neural networks. We specifically incorporate scale, translation and rotation invariances but the scheme can be extended to any arbitrary sets of transformations. We also show that using moments of increasing order throughout nesting is important. The NIP descriptors are then hashed to the target code size (32-256 bits) with a Restricted Boltzmann Machine with a novel batch-level regularization scheme specifically designed for the purpose of hashing (RBMH). A thorough empirical evaluation with state-of-the-art shows that the results obtained both with the NIP descriptors and the NIP+RBMH hashes are consistently outstanding across a wide range of datasets. △ Less

Submitted 14 April, 2016; v1 submitted 15 March, 2016; originally announced March 2016.

Comments: Image Instance Retrieval, CNN, Invariant Representation, Hashing, Unsupervised Learning, Regularization. arXiv admin note: text overlap with arXiv:1601.02093

arXiv:1603.04538 [pdf, other]

doi 10.1103/PhysRevB.95.035127

Anisotropic, multi-carrier transport at the (111) LaAlO$_3$/SrTiO$_3$ interface

Authors: Samuel Davis, V. Chandrasekhar, Z. Huang, K. Han, Ariando, T. Venkatesan

Abstract: The conducting gas that forms at the interface between LaAlO$_3$ and SrTiO$_3$ has proven to be a fertile playground for a wide variety of physical phenomena. The bulk of previous research has focused on the (001) and (110) crystal orientations. Here we report detailed measurements of the low-temperature electrical properties of (111) LAO/STO interface samples. We find that the low-temperature ele… ▽ More The conducting gas that forms at the interface between LaAlO$_3$ and SrTiO$_3$ has proven to be a fertile playground for a wide variety of physical phenomena. The bulk of previous research has focused on the (001) and (110) crystal orientations. Here we report detailed measurements of the low-temperature electrical properties of (111) LAO/STO interface samples. We find that the low-temperature electrical transport properties are highly anisotropic, in that they differ significantly along two mutually orthogonal crystal orientations at the interface. While anisotropy in the resistivity has been reported in some (001) samples and in (110) samples, the anisotropy in the (111) samples reported here is much stronger, and also manifests itself in the Hall coefficient as well as the capacitance. In addition, the anisotropy is not present at room temperature and at liquid nitrogen temperatures, but only at liquid helium temperatures and below. The anisotropy is accentuated by exposure to ultraviolet light, which disproportionately affects transport along one surface crystal direction. Furthermore, analysis of the low-temperature Hall coefficient and the capacitance as a function of back gate voltage indicates that in addition to electrons, holes contribute to the electrical transport. △ Less

Submitted 22 August, 2016; v1 submitted 14 March, 2016; originally announced March 2016.

Comments: 11 pages, 9 figures

Journal ref: Phys. Rev. B 95, 035127 (2017)

Showing 1–50 of 107 results for author: Chandrasekhar, V