Search | arXiv e-print repository

Room Temperature Spin Filtering and Quantum Transport with Transition Metal-Doped Silicon Quantum Dot

Abstract: Spin filtering is a fundamental operation in spintronics, enabling the generation and detection of spin-polarized carriers. Here, we proposed and theoretically demonstrated that a 3d transition metal (TM) doped silicon quantum dot (SiQD) is a suitable candidate for spin filter device at room temperature. Using density functional theory (DFT), we investigate the structure, electronic properties, an… ▽ More Spin filtering is a fundamental operation in spintronics, enabling the generation and detection of spin-polarized carriers. Here, we proposed and theoretically demonstrated that a 3d transition metal (TM) doped silicon quantum dot (SiQD) is a suitable candidate for spin filter device at room temperature. Using density functional theory (DFT), we investigate the structure, electronic properties, and magnetic behavior of TM-SiQD. Our calculations demonstrate that Mn-doped SiQD exhibits the highest stability. The designed spin-filter device using Mn-doped SiQD shows a spin-filtering efficiency of 99.9% at 300K electrode temperature along with very high conductance. This remarkable efficiency positions it as a promising candidate for room-temperature spintronic devices. △ Less

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.01714 [pdf, other]

doi 10.1109/TASLP.2024.3353574

TrICy: Trigger-guided Data-to-text Generation with Intent aware Attention-Copy

Authors: Vibhav Agarwal, Sourav Ghosh, Harichandana BSS, Himanshu Arora, Barath Raj Kandur Raja

Abstract: Data-to-text (D2T) generation is a crucial task in many natural language understanding (NLU) applications and forms the foundation of task-oriented dialog systems. In the context of conversational AI solutions that can work directly with local data on the user's device, architectures utilizing large pre-trained language models (PLMs) are impractical for on-device deployment due to a high memory fo… ▽ More Data-to-text (D2T) generation is a crucial task in many natural language understanding (NLU) applications and forms the foundation of task-oriented dialog systems. In the context of conversational AI solutions that can work directly with local data on the user's device, architectures utilizing large pre-trained language models (PLMs) are impractical for on-device deployment due to a high memory footprint. To this end, we propose TrICy, a novel lightweight framework for an enhanced D2T task that generates text sequences based on the intent in context and may further be guided by user-provided triggers. We leverage an attention-copy mechanism to predict out-of-vocabulary (OOV) words accurately. Performance analyses on E2E NLG dataset (BLEU: 66.43%, ROUGE-L: 70.14%), WebNLG dataset (BLEU: Seen 64.08%, Unseen 52.35%), and our Custom dataset related to text messaging applications, showcase our architecture's effectiveness. Moreover, we show that by leveraging an optional trigger input, data-to-text generation quality increases significantly and achieves the new SOTA score of 69.29% BLEU for E2E NLG. Furthermore, our analyses show that TrICy achieves at least 24% and 3% improvement in BLEU and METEOR respectively over LLMs like GPT-3, ChatGPT, and Llama 2. We also demonstrate that in some scenarios, performance improvement due to triggers is observed even when they are absent in training. △ Less

Submitted 25 January, 2024; originally announced February 2024.

Comments: Published in the IEEE/ACM Transactions on Audio, Speech, and Language Processing. (Sourav Ghosh and Vibhav Agarwal contributed equally to this work.)

Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1173-1184, 2024

arXiv:2312.00766 [pdf, other]

Automated Material Properties Extraction For Enhanced Beauty Product Discovery and Makeup Virtual Try-on

Authors: Fatemeh Taheri Dezaki, Himanshu Arora, Rahul Suresh, Amin Banitalebi-Dehkordi

Abstract: The multitude of makeup products available can make it challenging to find the ideal match for desired attributes. An intelligent approach for product discovery is required to enhance the makeup shop** experience to make it more convenient and satisfying. However, enabling accurate and efficient product discovery requires extracting detailed attributes like color and finish type. Our work introd… ▽ More The multitude of makeup products available can make it challenging to find the ideal match for desired attributes. An intelligent approach for product discovery is required to enhance the makeup shop** experience to make it more convenient and satisfying. However, enabling accurate and efficient product discovery requires extracting detailed attributes like color and finish type. Our work introduces an automated pipeline that utilizes multiple customized machine learning models to extract essential material attributes from makeup product images. Our pipeline is versatile and capable of handling various makeup products. To showcase the efficacy of our pipeline, we conduct extensive experiments on eyeshadow products (both single and multi-shade ones), a challenging makeup product known for its diverse range of shapes, colors, and finish types. Furthermore, we demonstrate the applicability of our approach by successfully extending it to other makeup categories like lipstick and foundation, showcasing its adaptability and effectiveness across different beauty products. Additionally, we conduct ablation experiments to demonstrate the superiority of our machine learning pipeline over human labeling methods in terms of reliability. Our proposed method showcases its effectiveness in cross-category product discovery, specifically in recommending makeup products that perfectly match a specified outfit. Lastly, we also demonstrate the application of these material attributes in enabling virtual-try-on experiences which makes makeup shop** experience significantly more engaging. △ Less

Submitted 1 December, 2023; originally announced December 2023.

Comments: Presented in Fifth Workshop on Recommender Systems in Fashion(fashionxrecsys) of ACM Conference on Recommender Systems

arXiv:2301.11655 [pdf]

doi 10.1063/5.0136182

Nitrogen in Silicon for Room Temperature Single Electron Tunneling Devices

Authors: Pooja Yadav, Hemant Arora, Arup Samanta

Abstract: Single electron transistor (SET) is an advanced tool to exploit in quantum devices. Working of such devices at room-temperature is essential for practical utilization. Dopant based single-electron devices are well studied at low-temperature although a few devices are developed for high-temperature operation with certain limitations. Here, we propose and theoretically exhibit that nitrogen (N) dono… ▽ More Single electron transistor (SET) is an advanced tool to exploit in quantum devices. Working of such devices at room-temperature is essential for practical utilization. Dopant based single-electron devices are well studied at low-temperature although a few devices are developed for high-temperature operation with certain limitations. Here, we propose and theoretically exhibit that nitrogen (N) donor in silicon is an important candidate for effective designing of such devices. Theoretical calculation of density-of-states using semi-empirical DFT method indicates that N-donor in silicon has deep ground state compared to a phosphorus (P) donor. N-donor spectrum is explored in nano-silicon along with the P-donor. Comparative data of Bohr radius of N-donor and P-donor is also reported. The simulated current-voltage characteristics confirm that N-doped device is better suited for SET operation at room-temperature. △ Less

Submitted 27 January, 2023; originally announced January 2023.

arXiv:2209.02834 [pdf, other]

Unsupervised Scene Sketch to Photo Synthesis

Authors: Jiayun Wang, Sangryul Jeon, Stella X. Yu, Xi Zhang, Himanshu Arora, Yu Lou

Abstract: Sketches make an intuitive and powerful visual expression as they are fast executed freehand drawings. We present a method for synthesizing realistic photos from scene sketches. Without the need for sketch and photo pairs, our framework directly learns from readily available large-scale photo datasets in an unsupervised manner. To this end, we introduce a standardization module that provides pseud… ▽ More Sketches make an intuitive and powerful visual expression as they are fast executed freehand drawings. We present a method for synthesizing realistic photos from scene sketches. Without the need for sketch and photo pairs, our framework directly learns from readily available large-scale photo datasets in an unsupervised manner. To this end, we introduce a standardization module that provides pseudo sketch-photo pairs during training by converting photos and sketches to a standardized domain, i.e. the edge map. The reduced domain gap between sketch and photo also allows us to disentangle them into two components: holistic scene structures and low-level visual styles such as color and texture. Taking this advantage, we synthesize a photo-realistic image by combining the structure of a sketch and the visual style of a reference photo. Extensive experimental results on perceptual similarity metrics and human perceptual studies show the proposed method could generate realistic photos with high fidelity from scene sketches and outperform state-of-the-art photo synthesis baselines. We also demonstrate that our framework facilitates a controllable manipulation of photo synthesis by editing strokes of corresponding sketches, delivering more fine-grained details than previous approaches that rely on region-level editing. △ Less

Submitted 6 September, 2022; originally announced September 2022.

Journal ref: ECCVW 2022

arXiv:2208.13379 [pdf, other]

doi 10.1021/acs.nanolett.2c04791

Hot Carrier Thermalization and Josephson Inductance Thermometry in a Graphene-based Microwave Circuit

Authors: Raj Katti, Harpreet Arora, Olli-Pentti Saira, Kenji Watanabe, Takashi Taniguchi, Keith C. Schwab, Michael Roukes, Stevan Nadj-Perge

Abstract: Due to its exceptional electronic and thermal properties, graphene is a key material for bolometry, calorimetry, and photon detection. However, despite graphene's relatively simple electronic structure, the physical processes responsible for the transport of heat from the electrons to the lattice are experimentally still elusive. Here, we measure the thermal response of low-disorder graphene encap… ▽ More Due to its exceptional electronic and thermal properties, graphene is a key material for bolometry, calorimetry, and photon detection. However, despite graphene's relatively simple electronic structure, the physical processes responsible for the transport of heat from the electrons to the lattice are experimentally still elusive. Here, we measure the thermal response of low-disorder graphene encapsulated in hexagonal boron nitride (hBN) by integrating it within a multi-terminal superconducting device coupled to a microwave resonator. This technique allows us to simultaneously apply Joule heat power to the graphene flake while performing calibrated readout of the electron temperature. We probe the thermalization rates of both electrons and holes with high precision and observe a thermalization scaling exponent consistent with cooling dominated by resonant electron-phonon coupling processes occurring at the interface between graphene and superconducting leads. The technique utilized here is applicable for wide range of semiconducting-superconducting interface heterostructures and provides new insights into the thermalization pathways essential for the next-generation thermal detectors. △ Less

Submitted 29 August, 2022; originally announced August 2022.

Comments: main text and supplementary information

arXiv:2208.10911 [pdf, other]

Electrochemical investigation of MoSeTe as an anode for sodium-ion batteries

Authors: Priya Mudgal, Himani Arora, Jayashree Pati, Manish K. Singh, Mahantesh Khetri, Rajendra S. Dhaka

Abstract: Sodium ion batteries (SIBs) are considered as an efficient alternative for lithium-ion batteries (LIBs) owing to the natural abundance and low cost of sodium than lithium. In this context, the anode materials play a vital role in rechargeable batteries to acquire high energy and power density. In order to demonstrate transition metal dichalcogenide (TMD) as potential anode materials, we have synth… ▽ More Sodium ion batteries (SIBs) are considered as an efficient alternative for lithium-ion batteries (LIBs) owing to the natural abundance and low cost of sodium than lithium. In this context, the anode materials play a vital role in rechargeable batteries to acquire high energy and power density. In order to demonstrate transition metal dichalcogenide (TMD) as potential anode materials, we have synthesized MoSeTe sample by conventional flux method, and the structure and morphology are characterized using x-ray diffraction (XRD), field-emission scanning electron microscopy (FESEM), transmission electron microscopy (TEM), and Raman spectroscopy. These characterisations confirm the hexagonal crystal symmetry with p63/mmc space group and layered morphology of MoSeTe. We investigate the electrochemical performance of a MoSeTe as a negative electrode (anode) for SIBs in the working potential range of 0.01 to 3.0~V. In a half-cell configuration, the MoSeTe as an anode and Na metal as counter/reference electrode exhibits significant initial specific discharge capacities of around 475 and 355 mAhg$^{-1}$ at current densities of 50 and 100 mAg$^{-1}$, respectively. However, the capacity degraded significantly like $\approx$200~mAhg$^{-1}$ in 2nd cycle, but having $\approx$100\% Coulombic efficiency, which suggest for further modification in this material to improve its stability. The cyclic voltammetry (CV) study reveals the reversibility of the material after 1st cycle, resulting no change in the initial peak positions. The electrochemical impedance spectroscopy (EIS) measurements affirms the smaller charge transfer resistance of fresh cells than the cells after 10th cycle. Moreover, the extracted diffusion coefficient is found to be of the order of 10$^{-14}$ cm$^2$s$^{-1}$. △ Less

Submitted 23 August, 2022; originally announced August 2022.

Comments: to be published in PINSA (Springer)

arXiv:2205.05225 [pdf, other]

Hierarchy of Symmetry Breaking Correlated Phases in Twisted Bilayer Graphene

Authors: Robert Polski, Yiran Zhang, Yang Peng, Harpreet Singh Arora, Youngjoon Choi, Hyun** Kim, Kenji Watanabe, Takashi Taniguchi, Gil Refael, Felix von Oppen, Stevan Nadj-Perge

Abstract: Twisted bilayer graphene (TBG) near the magic twist angle of $\sim1.1^{o}$ exhibits a rich phase diagram. However, the interplay between different phases and their dependence on twist angle is still elusive. Here, we explore the stability of various TBG phases and demonstrate that superconductivity near filling of two electrons per moiré unit cell alongside Fermi surface reconstructions, as well a… ▽ More Twisted bilayer graphene (TBG) near the magic twist angle of $\sim1.1^{o}$ exhibits a rich phase diagram. However, the interplay between different phases and their dependence on twist angle is still elusive. Here, we explore the stability of various TBG phases and demonstrate that superconductivity near filling of two electrons per moiré unit cell alongside Fermi surface reconstructions, as well as entropy-driven high-temperature phase transitions and linear-in-T resistance occur over a range of twist angles which extends far beyond those exhibiting correlated insulating phases. In the vicinity of the magic angle, we also find a metallic phase that displays a hysteretic anomalous Hall effect and incipient Chern insulating behaviour. Such a metallic phase can be rationalized in terms of the interplay between interaction-driven deformations of TBG bands leading to Berry curvature redistribution and Fermi surface reconstruction. Our results provide an extensive perspective on the hierarchy of correlated phases in TBG as classified by their robustness against deviations from the magic angle or, equivalently, their electronic interaction requirements. △ Less

Submitted 10 May, 2022; originally announced May 2022.

Comments: main text + supplementary information

arXiv:2204.04867 [pdf, other]

Structured Graph Variational Autoencoders for Indoor Furniture layout Generation

Authors: Aditya Chattopadhyay, Xi Zhang, David Paul Wipf, Himanshu Arora, Rene Vidal

Abstract: We present a structured graph variational autoencoder for generating the layout of indoor 3D scenes. Given the room type (e.g., living room or library) and the room layout (e.g., room elements such as floor and walls), our architecture generates a collection of objects (e.g., furniture items such as sofa, table and chairs) that is consistent with the room type and layout. This is a challenging pro… ▽ More We present a structured graph variational autoencoder for generating the layout of indoor 3D scenes. Given the room type (e.g., living room or library) and the room layout (e.g., room elements such as floor and walls), our architecture generates a collection of objects (e.g., furniture items such as sofa, table and chairs) that is consistent with the room type and layout. This is a challenging problem because the generated scene should satisfy multiple constrains, e.g., each object must lie inside the room and two objects cannot occupy the same volume. To address these challenges, we propose a deep generative model that encodes these relationships as soft constraints on an attributed graph (e.g., the nodes capture attributes of room and furniture elements, such as class, pose and size, and the edges capture geometric relationships such as relative orientation). The architecture consists of a graph encoder that maps the input graph to a structured latent space, and a graph decoder that generates a furniture graph, given a latent code and the room graph. The latent space is modeled with auto-regressive priors, which facilitates the generation of highly structured scenes. We also propose an efficient training procedure that combines matching and constrained learning. Experiments on the 3D-FRONT dataset show that our method produces scenes that are diverse and are adapted to the room layout. △ Less

Submitted 22 July, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

arXiv:2202.09156 [pdf, ps, other]

doi 10.1063/5.0080784

Terahertz control of photoluminescence emission in few-layer InSe

Authors: Tommaso Venanzi, Malte Selig, Alexej Pashkin, Stephan Winnerl, Manuel Katzer, Himani Arora, Artur Erbe, Amalia Patanè, Zakhar R. Kudrynskyi, Zakhar D. Kovalyuk, Leonetta Baldassarre, Andreas Knorr, Manfred Helm, Harald Schneider

Abstract: A promising route for the development of opto-elelctronic technology is to use terahertz radiation to modulate the optical properties of semiconductors. Here we demonstrate the dynamical control of photoluminescence (PL) emission in few-layer InSe using picosecond terahertz pulses. We observe a strong PL quenching (up to 50%) after the arrival of the terahertz pulse followed by a reversible recove… ▽ More A promising route for the development of opto-elelctronic technology is to use terahertz radiation to modulate the optical properties of semiconductors. Here we demonstrate the dynamical control of photoluminescence (PL) emission in few-layer InSe using picosecond terahertz pulses. We observe a strong PL quenching (up to 50%) after the arrival of the terahertz pulse followed by a reversible recovery of the emission on the time scale of 50ps at T =10K. Microscopic calculations reveal that the origin of the photoluminescence quenching is the terahertz absorption by photo-excited carriers: this leads to a heating of the carriers and a broadening of their distribution, which reduces the probability of bimolecular electron-hole recombination and, therefore, the luminescence. By numerically evaluating the Boltzmann equation, we are able to clarify the individual roles of optical and acoustic phonons in the subsequent cooling process. The same PL quenchingmechanismis expected in other van derWaals semiconductors and the effectwill be particularly strong for materials with low carrier masses and long carrier relaxation time, which is the case for InSe. This work gives a solid background for the development of opto-electronic applications based on InSe, such as THz detectors and optical modulators. △ Less

Submitted 18 February, 2022; originally announced February 2022.

Comments: The following article has been accepted by Applied Physics Letters. After it is published, it will be found at https://publishing.aip.org/resources/librarians/products/journals/

arXiv:2112.12028 [pdf, other]

doi 10.1109/INDICON52576.2021.9691564

VoiceMoji: A Novel On-Device Pipeline for Seamless Emoji Insertion in Dictation

Authors: Sumit Kumar, Harichandana B S S, Himanshu Arora

Abstract: Most of the speech recognition systems recover only words in the speech and fail to capture emotions. Users have to manually add emoji(s) in text for adding tone and making communication fun. Though there is much work done on punctuation addition on transcribed speech, the area of emotion addition is untouched. In this paper, we propose a novel on-device pipeline to enrich the voice input experien… ▽ More Most of the speech recognition systems recover only words in the speech and fail to capture emotions. Users have to manually add emoji(s) in text for adding tone and making communication fun. Though there is much work done on punctuation addition on transcribed speech, the area of emotion addition is untouched. In this paper, we propose a novel on-device pipeline to enrich the voice input experience. It involves, given a blob of transcribed text, intelligently processing and identifying structure where emoji insertion makes sense. Moreover, it includes semantic text analysis to predict emoji for each of the sub-parts for which we propose a novel architecture Attention-based Char Aware (ACA) LSTM which handles Out-Of-Vocabulary (OOV) words as well. All these tasks are executed completely on-device and hence can aid on-device dictation systems. To the best of our knowledge, this is the first work that shows how to add emoji(s) in the transcribed text. We demonstrate that our components achieve comparable results to previous neural approaches for punctuation addition and emoji prediction with 80% fewer parameters. Overall, our proposed model has a very small memory footprint of a mere 4MB to suit on-device deployment. △ Less

Submitted 22 December, 2021; originally announced December 2021.

Comments: Accepted at IEEE INDICON 2021, 19-21 December, 2021, India

arXiv:2110.15717 [pdf, other]

doi 10.1109/ICMLA52953.2021.00182

LIDSNet: A Lightweight on-device Intent Detection model using Deep Siamese Network

Authors: Vibhav Agarwal, Sudeep Deepak Shivnikar, Sourav Ghosh, Himanshu Arora, Yashwant Saini

Abstract: Intent detection is a crucial task in any Natural Language Understanding (NLU) system and forms the foundation of a task-oriented dialogue system. To build high-quality real-world conversational solutions for edge devices, there is a need for deploying intent detection model on device. This necessitates a light-weight, fast, and accurate model that can perform efficiently in a resource-constrained… ▽ More Intent detection is a crucial task in any Natural Language Understanding (NLU) system and forms the foundation of a task-oriented dialogue system. To build high-quality real-world conversational solutions for edge devices, there is a need for deploying intent detection model on device. This necessitates a light-weight, fast, and accurate model that can perform efficiently in a resource-constrained environment. To this end, we propose LIDSNet, a novel lightweight on-device intent detection model, which accurately predicts the message intent by utilizing a Deep Siamese Network for learning better sentence representations. We use character-level features to enrich the sentence-level representations and empirically demonstrate the advantage of transfer learning by utilizing pre-trained embeddings. Furthermore, to investigate the efficacy of the modules in our architecture, we conduct an ablation study and arrive at our optimal model. Experimental results prove that LIDSNet achieves state-of-the-art competitive accuracy of 98.00% and 95.97% on SNIPS and ATIS public datasets respectively, with under 0.59M parameters. We further benchmark LIDSNet against fine-tuned BERTs and show that our model is at least 41x lighter and 30x faster during inference than MobileBERT on Samsung Galaxy S20 device, justifying its efficiency on resource-constrained edge devices. △ Less

Submitted 6 October, 2021; originally announced October 2021.

Comments: Accepted for publication in 2021 IEEE 20th International Conference on Machine Learning and Applications (ICMLA)

Journal ref: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA), Pasadena, CA, USA, 2021, pp. 1112-1117

arXiv:2110.06199 [pdf, other]

ABO: Dataset and Benchmarks for Real-World 3D Object Understanding

Authors: Jasmine Collins, Shubham Goel, Kenan Deng, Achleshwar Luthra, Leon Xu, Erhan Gundogdu, Xi Zhang, Tomas F. Yago Vicente, Thomas Dideriksen, Himanshu Arora, Matthieu Guillaumin, Jitendra Malik

Abstract: We introduce Amazon Berkeley Objects (ABO), a new large-scale dataset designed to help bridge the gap between real and virtual 3D worlds. ABO contains product catalog images, metadata, and artist-created 3D models with complex geometries and physically-based materials that correspond to real, household objects. We derive challenging benchmarks that exploit the unique properties of ABO and measure… ▽ More We introduce Amazon Berkeley Objects (ABO), a new large-scale dataset designed to help bridge the gap between real and virtual 3D worlds. ABO contains product catalog images, metadata, and artist-created 3D models with complex geometries and physically-based materials that correspond to real, household objects. We derive challenging benchmarks that exploit the unique properties of ABO and measure the current limits of the state-of-the-art on three open problems for real-world 3D object understanding: single-view 3D reconstruction, material estimation, and cross-domain multi-view object retrieval. △ Less

Submitted 24 June, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

arXiv:2110.00644 [pdf, other]

RoomStructNet: Learning to Rank Non-Cuboidal Room Layouts From Single View

Authors: Xi Zhang, Chun-Kai Wang, Kenan Deng, Tomas Yago-Vicente, Himanshu Arora

Abstract: In this paper, we present a new approach to estimate the layout of a room from its single image. While recent approaches for this task use robust features learnt from data, they resort to optimization for detecting the final layout. In addition to using learnt robust features, our approach learns an additional ranking function to estimate the final layout instead of using optimization. To learn th… ▽ More In this paper, we present a new approach to estimate the layout of a room from its single image. While recent approaches for this task use robust features learnt from data, they resort to optimization for detecting the final layout. In addition to using learnt robust features, our approach learns an additional ranking function to estimate the final layout instead of using optimization. To learn this ranking function, we propose a framework to train a CNN using max-margin structure cost. Also, while most approaches aim at detecting cuboidal layouts, our approach detects non-cuboidal layouts for which we explicitly estimates layout complexity parameters. We use these parameters to propose layout candidates in a novel way. Our approach shows state-of-the-art results on standard datasets with mostly cuboidal layouts and also performs well on a dataset containing rooms with non-cuboidal layouts. △ Less

Submitted 1 October, 2021; originally announced October 2021.

Comments: 10 pages

arXiv:2106.16237 [pdf, other]

Multimodal Shape Completion via IMLE

Authors: Himanshu Arora, Saurabh Mishra, Shichong Peng, Ke Li, Ali Mahdavi-Amiri

Abstract: Shape completion is the problem of completing partial input shapes such as partial scans. This problem finds important applications in computer vision and robotics due to issues such as occlusion or sparsity in real-world data. However, most of the existing research related to shape completion has been focused on completing shapes by learning a one-to-one map** which limits the diversity and cre… ▽ More Shape completion is the problem of completing partial input shapes such as partial scans. This problem finds important applications in computer vision and robotics due to issues such as occlusion or sparsity in real-world data. However, most of the existing research related to shape completion has been focused on completing shapes by learning a one-to-one map** which limits the diversity and creativity of the produced results. We propose a novel multimodal shape completion technique that is effectively able to learn a one-to-many map** and generates diverse complete shapes. Our approach is based on the conditional Implicit MaximumLikelihood Estimation (IMLE) technique wherein we condition our inputs on partial 3D point clouds. We extensively evaluate our approach by comparing it to various baselines both quantitatively and qualitatively. We show that our method is superior to alternatives in terms of completeness and diversity of shapes. △ Less

Submitted 7 July, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

Comments: Project Website: https://sites.google.com/site/alimahdaviamiri/projects/shape-completion

arXiv:2106.03690 [pdf]

Accelerated Corrosion of High Entropy Alloys under Tensile Stress

Authors: Aditya Ayyagari, Riyadh Salloom, Harpreet Singh Arora, Sundeep Mukherjee

Abstract: High entropy alloys are finding significant scientific interest due to their exotic microstructures and exceptional properties resulting thereof. These alloys have excellent corrosion resistance and may find broad range of applications from bio-implants, aerospace components and nuclear industry. A critical performance metric that determines the application worthiness of the alloys is the resilien… ▽ More High entropy alloys are finding significant scientific interest due to their exotic microstructures and exceptional properties resulting thereof. These alloys have excellent corrosion resistance and may find broad range of applications from bio-implants, aerospace components and nuclear industry. A critical performance metric that determines the application worthiness of the alloys is the resilience of stressed structural members in a corrosive environment. This study reports the results from a novel experimental setup to quantify the corrosion rate under uniaxial tensile stress in a single phase fcc Al0.1CoCrFeNi high entropy alloy rods. Under a uniform uniaxial applied stress of 600 MPa, the corrosion current density was observed to increase by three orders of magnitude and ~150 mV drop in corrosion potential. The mechanism of accelerated corrosion is identified as surface passivation layer breakdown, pit initiation on un-passivated surface and rapid pit-propagation along the loading direction. △ Less

Submitted 7 June, 2021; originally announced June 2021.

arXiv:2101.05970 [pdf, other]

Affordance-based Reinforcement Learning for Urban Driving

Authors: Tanmay Agarwal, Hitesh Arora, Jeff Schneider

Abstract: Traditional autonomous vehicle pipelines that follow a modular approach have been very successful in the past both in academia and industry, which has led to autonomy deployed on road. Though this approach provides ease of interpretation, its generalizability to unseen environments is limited and hand-engineering of numerous parameters is required, especially in the prediction and planning systems… ▽ More Traditional autonomous vehicle pipelines that follow a modular approach have been very successful in the past both in academia and industry, which has led to autonomy deployed on road. Though this approach provides ease of interpretation, its generalizability to unseen environments is limited and hand-engineering of numerous parameters is required, especially in the prediction and planning systems. Recently, deep reinforcement learning has been shown to learn complex strategic games and perform challenging robotic tasks, which provides an appealing framework for learning to drive. In this work, we propose a deep reinforcement learning framework to learn optimal control policy using waypoints and low-dimensional visual representations, also known as affordances. We demonstrate that our agents when trained from scratch learn the tasks of lane-following, driving around inter-sections as well as stop** in front of other actors or traffic lights even in the dense traffic setting. We note that our method achieves comparable or better performance than the baseline methods on the original and NoCrash benchmarks on the CARLA simulator. △ Less

Submitted 15 January, 2021; originally announced January 2021.

arXiv:2101.04456 [pdf]

A character representation enhanced on-device Intent Classification

Authors: Sudeep Deepak Shivnikar, Himanshu Arora, Harichandana B S S

Abstract: Intent classification is an important task in natural language understanding systems. Existing approaches have achieved perfect scores on the benchmark datasets. However they are not suitable for deployment on low-resource devices like mobiles, tablets, etc. due to their massive model size. Therefore, in this paper, we present a novel light-weight architecture for intent classification that can ru… ▽ More Intent classification is an important task in natural language understanding systems. Existing approaches have achieved perfect scores on the benchmark datasets. However they are not suitable for deployment on low-resource devices like mobiles, tablets, etc. due to their massive model size. Therefore, in this paper, we present a novel light-weight architecture for intent classification that can run efficiently on a device. We use character features to enrich the word representation. Our experiments prove that our proposed model outperforms existing approaches and achieves state-of-the-art results on benchmark datasets. We also report that our model has tiny memory footprint of ~5 MB and low inference time of ~2 milliseconds, which proves its efficiency in a resource-constrained environment. △ Less

Submitted 12 January, 2021; originally announced January 2021.

Comments: Accepted for publication in ICON 2020: 17th International Conference on Natural Language Processing

arXiv:2009.10673 [pdf, other]

doi 10.1093/mnras/stab2041

Deep Forest: Neural Network reconstruction of the Lyman-alpha forest

Authors: Lawrence Huang, Rupert A. C. Croft, Hitesh Arora

Abstract: We explore the use of Deep Learning to infer physical quantities from the observable transmitted flux in the Lyman-alpha forest. We train a Neural Network using redshift z=3 outputs from cosmological hydrodynamic simulations and mock datasets constructed from them. We evaluate how well the trained network is able to reconstruct the optical depth for Lyman-alpha forest absorption from noisy and oft… ▽ More We explore the use of Deep Learning to infer physical quantities from the observable transmitted flux in the Lyman-alpha forest. We train a Neural Network using redshift z=3 outputs from cosmological hydrodynamic simulations and mock datasets constructed from them. We evaluate how well the trained network is able to reconstruct the optical depth for Lyman-alpha forest absorption from noisy and often saturated transmitted flux data. The Neural Network outperforms an alternative reconstruction method involving log inversion and spline interpolation by approximately a factor of 2 in the optical depth root mean square error. We find no significant dependence in the improvement on input data signal to noise, although the gain is greatest in high optical depth regions. The Lyman-alpha forest optical depth studied here serves as a simple, one dimensional, example but the use of Deep Learning and simulations to approach the inverse problem in cosmology could be extended to other physical quantities and higher dimensional data. △ Less

Submitted 5 September, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

Comments: 10 pages, 7 figures, submitted to MNRAS. Code and data used at https://github.com/lhuangCMU/deep-learning-intergalactic-medium Changes: 11 pages, 7 figures. Further described how we chose our architecture, why the NN has difficulty predicting high values of optical depth, added more references, added additional panels to figs 3-6, and corrected fig 1 and mean optical depth value

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 506, Issue 4, October 2021, Pages 5212-5222

arXiv:2008.11746 [pdf, other]

Tracing out Correlated Chern Insulators in Magic Angle Twisted Bilayer Graphene

Authors: Youngjoon Choi, Hyun** Kim, Yang Peng, Alex Thomson, Cyprian Lewandowski, Robert Polski, Yiran Zhang, Harpreet Singh Arora, Kenji Watanabe, Takashi Taniguchi, Jason Alicea, Stevan Nadj-Perge

Abstract: Magic-angle twisted bilayer graphene (MATBG) exhibits a range of correlated phenomena that originate from strong electron-electron interactions. These interactions make the Fermi surface highly susceptible to reconstruction when $ \pm 1, \pm 2, \pm 3$ electrons occupy each moir\' e unit cell and lead to the formation of correlated insulating, superconducting and ferromagnetic phases. While some ph… ▽ More Magic-angle twisted bilayer graphene (MATBG) exhibits a range of correlated phenomena that originate from strong electron-electron interactions. These interactions make the Fermi surface highly susceptible to reconstruction when $ \pm 1, \pm 2, \pm 3$ electrons occupy each moir\' e unit cell and lead to the formation of correlated insulating, superconducting and ferromagnetic phases. While some phases have been shown to carry a non-zero Chern number, the local microscopic properties and topological character of many other phases remain elusive. Here we introduce a set of novel techniques hinging on scanning tunneling microscopy (STM) to map out topological phases in MATBG that emerge in finite magnetic field. By following the evolution of the local density of states (LDOS) at the Fermi level with electrostatic do** and magnetic field, we visualize a local Landau fan diagram that enables us to directly assign Chern numbers to all observed phases. We uncover the existence of six topological phases emanating from integer fillings in finite fields and whose origin relates to a cascade of symmetry-breaking transitions driven by correlations. The spatially resolved and electron-density-tuned LDOS maps further reveal that these topological phases can form only in a small range of twist angles around the magic-angle value. Both the microscopic origin and extreme sensitivity to twist angle differentiate these topological phases from the Landau levels observed near charge neutrality. Moreover, we observe that even the charge-neutrality Landau spectrum taken at low fields is considerably modified by interactions and exhibits an unexpected splitting between zero Landau levels that can be as large as ${\sim }\,3-5$ meV. Our results show how strong electronic interactions affect the band structure of MATBG and lead to the formation of correlation-enabled topological phases. △ Less

Submitted 26 August, 2020; originally announced August 2020.

arXiv:2008.05723 [pdf, other]

Contextual Diversity for Active Learning

Authors: Sharat Agarwal, Himanshu Arora, Saket Anand, Chetan Arora

Abstract: Requirement of large annotated datasets restrict the use of deep convolutional neural networks (CNNs) for many practical applications. The problem can be mitigated by using active learning (AL) techniques which, under a given annotation budget, allow to select a subset of data that yields maximum accuracy upon fine tuning. State of the art AL approaches typically rely on measures of visual diversi… ▽ More Requirement of large annotated datasets restrict the use of deep convolutional neural networks (CNNs) for many practical applications. The problem can be mitigated by using active learning (AL) techniques which, under a given annotation budget, allow to select a subset of data that yields maximum accuracy upon fine tuning. State of the art AL approaches typically rely on measures of visual diversity or prediction uncertainty, which are unable to effectively capture the variations in spatial context. On the other hand, modern CNN architectures make heavy use of spatial context for achieving highly accurate predictions. Since the context is difficult to evaluate in the absence of ground-truth labels, we introduce the notion of contextual diversity that captures the confusion associated with spatially co-occurring classes. Contextual Diversity (CD) hinges on a crucial observation that the probability vector predicted by a CNN for a region of interest typically contains information from a larger receptive field. Exploiting this observation, we use the proposed CD measure within two AL frameworks: (1) a core-set based strategy and (2) a reinforcement learning based policy, for active frame selection. Our extensive empirical evaluation establish state of the art results for active learning on benchmark datasets of Semantic Segmentation, Object Detection and Image Classification. Our ablation studies show clear advantages of using contextual diversity for active learning. The source code and additional results are available at https://github.com/sharat29ag/CDAL. △ Less

Submitted 13 August, 2020; originally announced August 2020.

Comments: A variant of this report is accepted in ECCV 2020

arXiv:2006.13855 [pdf, other]

doi 10.1103/PhysRevResearch.2.043360

Autocorrected Off-axis Holography of 2D Materials

Authors: Felix Kern, Martin Linck, Daniel Wolf, Nasim Alem, Himani Arora, Sibylle Gemming, Artur Erbe, Alex Zettl, Bernd Büchner, Axel Lubk

Abstract: The reduced dimensionality in two-dimensional materials leads a wealth of unusual properties, which are currently explored for both fundamental and applied sciences. In order to study the crystal structure, edge states, the formation of defects and grain boundaries, or the impact of adsorbates, high resolution microscopy techniques are indispensible. Here we report on the development of an electro… ▽ More The reduced dimensionality in two-dimensional materials leads a wealth of unusual properties, which are currently explored for both fundamental and applied sciences. In order to study the crystal structure, edge states, the formation of defects and grain boundaries, or the impact of adsorbates, high resolution microscopy techniques are indispensible. Here we report on the development of an electron holography (EH) transmission electron microscopy (TEM) technique, which facilitates high spatial resolution by an automatic correction of geometric aberrations. Distinguished features of EH beyond conventional TEM imaging are the gap-free spatial information signal transfer and higher dose efficiency for certain spatial frequency bands as well as direct access to the projected electrostatic potential of the 2D material. We demonstrate these features at the example of h-BN, at which we measure the electrostatic potential as a function of layer number down to the monolayer limit and obtain evidence for a systematic increase of the potential at the zig-zag edges. △ Less

Submitted 25 June, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

Comments: 8 pages, 5 figures

Journal ref: Phys. Rev. Research 2, 043360 (2020)

arXiv:2004.04146 [pdf, other]

Complex Network Analysis of Indian Railway Zones

Authors: Nikhil Kumar Rajput, Piyush Badola, Harshit Arora, Bhavya Ahuja Grover

Abstract: Indian Railway Network has been analyzed on the basis of number of trains directly linking two railway zones. The network has been displayed as a weighted graph where the weights denote the number of trains between the zones. It may be pointed out that each zone is a complex network in itself and may depict different characteristic features. The zonal network therefore can be considered as a netwo… ▽ More Indian Railway Network has been analyzed on the basis of number of trains directly linking two railway zones. The network has been displayed as a weighted graph where the weights denote the number of trains between the zones. It may be pointed out that each zone is a complex network in itself and may depict different characteristic features. The zonal network therefore can be considered as a network of complex networks. In this paper, self links, in-degree and out-degree of each zone have been computed which provides information about the inter and intra zonal connectivity. Degree passenger correlation which gives an idea about number of trains and passengers originating from a particular zone which might play a role in policy making decisions has also been studied. Some other complex network parameters like betweenness, clustering coefficient and cliques have been obtained to get more insight about the complex Indian zonal network. △ Less

Submitted 8 April, 2020; originally announced April 2020.

arXiv:2003.12304 [pdf, ps, other]

Photoluminescence dynamics in few-layer InSe

Authors: Tommaso Venanzi, Himani Arora, Stephan Winnerl, Alexej Pashkin, Phanish Chava, Amalia Patanè, Zakhar D. Kovalyuk, Zalhar R. Kudrynskyi, Kenji Watanabe, Takashi Taniguchi, Artur Erbe, Manfred Helm, Harald Schneider

Abstract: We study the optical properties of thin flakes of InSe encapsulated in hBN. More specifically, we investigate the photoluminescence (PL) emission and its dependence on sample thickness and temperature. Through the analysis of the PL lineshape, we discuss the relative weights of the exciton and electron-hole contributions. Thereafter we investigate the PL dynamics. Two contributions are distinguish… ▽ More We study the optical properties of thin flakes of InSe encapsulated in hBN. More specifically, we investigate the photoluminescence (PL) emission and its dependence on sample thickness and temperature. Through the analysis of the PL lineshape, we discuss the relative weights of the exciton and electron-hole contributions. Thereafter we investigate the PL dynamics. Two contributions are distinguishable at low temperature: direct bandgap electron-hole and defect-assisted recombination. The two recombination processes have lifetime of $τ_1 \sim 8\;$ns and $τ_2 \sim 100\;$ns, respectively. The relative weights of the direct bandgap and defect-assisted contributions show a strong layer dependence due to the direct-to-indirect bandgap crossover. Electron-hole PL lifetime is limited by population transfer to lower-energy states and no dependence on the number of layers was observed. The lifetime of the defect-assisted recombination gets longer for thinner samples. Finally, we show that the PL lifetime decreases at high temperatures as a consequence of more efficient non-radiative recombinations. △ Less

Submitted 31 March, 2020; v1 submitted 27 March, 2020; originally announced March 2020.

arXiv:2002.03003 [pdf, other]

doi 10.1038/s41586-020-2473-8

Superconductivity without insulating states in twisted bilayer graphene stabilized by monolayer WSe$_2$

Authors: Harpreet Singh Arora, Robert Polski, Yiran Zhang, Alex Thomson, Youngjoon Choi, Hyun** Kim, Zhong Lin, Ilham Zaky Wilson, Xiaodong Xu, Jiun-Haw Chu, Kenji Watanabe, Takashi Taniguchi, Jason Alicea, Stevan Nadj-Perge

Abstract: Magic-angle twisted bilayer graphene (TBG), with rotational misalignment close to 1.1$^\circ$, features isolated flat electronic bands that host a rich phase diagram of correlated insulating, superconducting, ferromagnetic, and topological phases. The origins of the correlated insulators and superconductivity, and the interplay between them, are particularly elusive. Both states have been previous… ▽ More Magic-angle twisted bilayer graphene (TBG), with rotational misalignment close to 1.1$^\circ$, features isolated flat electronic bands that host a rich phase diagram of correlated insulating, superconducting, ferromagnetic, and topological phases. The origins of the correlated insulators and superconductivity, and the interplay between them, are particularly elusive. Both states have been previously observed only for angles within $\pm0.1^\circ$ from the magic-angle value and occur in adjacent or overlap** electron density ranges; nevertheless, it is still unclear how the two states are related. Beyond the twist angle and strain, the dependence of the TBG phase diagram on the alignment and thickness of insulating hexagonal boron nitride (hBN) used to encapsulate the graphene sheets indicates the importance of the microscopic dielectric environment. Here we show that adding an insulating tungsten-diselenide (WSe$_2$) monolayer between hBN and TBG stabilizes superconductivity at twist angles much smaller than the established magic-angle value. For the smallest angle of $θ$ = 0.79$^\circ$, we still observe clear superconducting signatures, despite the complete absence of the correlated insulating states and vanishing gaps between the dispersive and flat bands. These observations demonstrate that, even though electron correlations may be important, superconductivity in TBG can exist even when TBG exhibits metallic behaviour across the whole range of electron density. Finite-magnetic-field measurements further reveal breaking of the four-fold spin-valley symmetry in the system, consistent with large spin-orbit coupling induced in TBG via proximity to WSe$_2$. Our results highlight the importance of symmetry breaking effects in stabilizing electronic states in TBG and open new avenues for engineering quantum phases in moiré systems. △ Less

Submitted 7 February, 2020; originally announced February 2020.

Comments: 12 pages, 4 figures; main text;

Journal ref: Nature 583, 379 - 384 (2020)

arXiv:1902.09685 [pdf, ps, other]

doi 10.4204/EPTCS.299.7

Iteratively Composing Statically Verified Traits

Authors: Isaac Oscar Gariano, Marco Servetto, Alex Potanin, Hrshikesh Arora

Abstract: Static verification relying on an automated theorem prover can be very slow and brittle: since static verification is undecidable, correct code may not pass a particular static verifier. In this work we use metaprogramming to generate code that is correct by construction. A theorem prover is used only to verify initial "traits": units of code that can be used to compose bigger programs. In our w… ▽ More Static verification relying on an automated theorem prover can be very slow and brittle: since static verification is undecidable, correct code may not pass a particular static verifier. In this work we use metaprogramming to generate code that is correct by construction. A theorem prover is used only to verify initial "traits": units of code that can be used to compose bigger programs. In our work, meta-programming is done by trait composition, which starting from correct code, is guaranteed to produce correct code. We do this by extending conventional traits with pre- and post-conditions for the methods; we also extend the traditional trait composition (+) operator to check the compatibility of contracts. In this way, there is no need to re-verify the produced code. We show how our approach can be applied to the standard "power" function example, where metaprogramming generates optimised, and correct, versions when the exponent is known in advance. △ Less

Submitted 20 August, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

Comments: In Proceedings VPT 2019, arXiv:1908.06723

Journal ref: EPTCS 299, 2019, pp. 49-55

arXiv:1902.05436 [pdf, ps, other]

Checking Observational Purity of Procedures

Authors: Himanshu Arora, Raghavan Komondoor, G. Ramalingam

Abstract: Verifying whether a procedure is observationally pure is useful in many software engineering scenarios. An observationally pure procedure always returns the same value for the same argument, and thus mimics a mathematical function. The problem is challenging when procedures use private mutable global variables, e.g., for memoization of frequently returned answers, and when they involve recursion.… ▽ More Verifying whether a procedure is observationally pure is useful in many software engineering scenarios. An observationally pure procedure always returns the same value for the same argument, and thus mimics a mathematical function. The problem is challenging when procedures use private mutable global variables, e.g., for memoization of frequently returned answers, and when they involve recursion. We present a novel verification approach for this problem. Our approach involves encoding the procedure's code as a formula that is a disjunction of path constraints, with the recursive calls being replaced in the formula with references to a mathematical function symbol. Then, a theorem prover is invoked to check whether the formula that has been constructed agrees with the function symbol referred to above in terms of input-output behavior for all arguments. We evaluate our approach on a set of realistic examples, using the Boogie intermediate language and theorem prover. Our evaluation shows that the invariants are easy to construct manually, and that our approach is effective at verifying observationally pure procedures. △ Less

Submitted 14 February, 2019; originally announced February 2019.

Comments: FASE 2019

arXiv:1902.00546 [pdf]

doi 10.22152/programming-journal.org/2019/3/12

Separating Use and Reuse to Improve Both

Authors: Hrshikesh Arora, Marco Servetto, Bruno C. D. S. Oliveira

Abstract: Context: Trait composition has inspired new research in the area of code reuse for object oriented (OO) languages. One of the main advantages of this kind of composition is that it makes possible to separate subty** from subclassing; which is good for code-reuse, design and reasoning. However, handling of state within traits is difficult, verbose or inelegant. Inquiry: We identify the this-leaki… ▽ More Context: Trait composition has inspired new research in the area of code reuse for object oriented (OO) languages. One of the main advantages of this kind of composition is that it makes possible to separate subty** from subclassing; which is good for code-reuse, design and reasoning. However, handling of state within traits is difficult, verbose or inelegant. Inquiry: We identify the this-leaking problem as the fundamental limitation that prevents the separation of subty** from subclassing in conventional OO languages. We explain that the concept of trait composition addresses this problem, by distinguishing code designed for use (as a type) from code designed for reuse (i.e. inherited). We are aware of at least 3 concrete independently designed research languages following this methodology: TraitRecordJ, Package Templates and DeepFJig. Approach: In this paper, we design $42_μ$ a new language, where we improve use and reuse and support the This type and family polymorphism by distinguishing code designed for use from code designed for reuse. In this way $42_μ$ synthesise the 3 approaches above, and improves them with abstract state operations: a new elegant way to handle state composition in trait based languages. Knowledge and Grounding: Using case studies, we show that $42_μ$'s model of traits with abstract state operations is more usable and compact than prior work. We formalise our work and prove that type errors cannot arise from composing well typed code. Importance: This work is the logical core of the programming language 42. This shows that the ideas presented in this paper can be applicable to a full general purpose language. This form of composition is very flexible and could be used in many new languages. △ Less

Submitted 1 February, 2019; originally announced February 2019.

Journal ref: The Art, Science, and Engineering of Programming, 2019, Vol. 3, Issue 3, Article 12

arXiv:1901.02997 [pdf]

doi 10.1038/s41567-019-0606-5

Imaging Electronic Correlations in Twisted Bilayer Graphene near the Magic Angle

Authors: Youngjoon Choi, Jeannette Kemmer, Yang Peng, Alex Thomson, Harpreet Arora, Robert Polski, Yiran Zhang, Hechen Ren, Jason Alicea, Gil Refael, Felix von Oppen, Kenji Watanabe, Takashi Taniguchi, Stevan Nadj-Perge

Abstract: Twisted bilayer graphene with a twist angle of around 1.1° features a pair of isolated flat electronic bands and forms a strongly correlated electronic platform. Here, we use scanning tunneling microscopy to probe local properties of highly tunable twisted bilayer graphene devices and show that the flat bands strongly deform when aligned with the Fermi level. At half filling of the bands, we obser… ▽ More Twisted bilayer graphene with a twist angle of around 1.1° features a pair of isolated flat electronic bands and forms a strongly correlated electronic platform. Here, we use scanning tunneling microscopy to probe local properties of highly tunable twisted bilayer graphene devices and show that the flat bands strongly deform when aligned with the Fermi level. At half filling of the bands, we observe the development of gaps originating from correlated insulating states. Near charge neutrality, we find a previously unidentified correlated regime featuring a substantially enhanced flat band splitting that we describe within a microscopic model predicting a strong tendency towards nematic ordering. Our results provide insights into symmetry breaking correlation effects and highlight the importance of electronic interactions for all filling factors in twisted bilayer graphene. △ Less

Submitted 9 January, 2019; originally announced January 2019.

Comments: Main text 9 pages, 4 figures; Supplementary Information 25 pages

Journal ref: Nature Physics 2019

arXiv:1802.01034 [pdf, other]

Multi-task Learning for Continuous Control

Authors: Himani Arora, Rajath Kumar, Jason Krone, Chong Li

Abstract: Reliable and effective multi-task learning is a prerequisite for the development of robotic agents that can quickly learn to accomplish related, everyday tasks. However, in the reinforcement learning domain, multi-task learning has not exhibited the same level of success as in other domains, such as computer vision. In addition, most reinforcement learning research on multi-task learning has been… ▽ More Reliable and effective multi-task learning is a prerequisite for the development of robotic agents that can quickly learn to accomplish related, everyday tasks. However, in the reinforcement learning domain, multi-task learning has not exhibited the same level of success as in other domains, such as computer vision. In addition, most reinforcement learning research on multi-task learning has been focused on discrete action spaces, which are not used for robotic control in the real-world. In this work, we apply multi-task learning methods to continuous action spaces and benchmark their performance on a series of simulated continuous control tasks. Most notably, we show that multi-task learning outperforms our baselines and alternative knowledge sharing methods. △ Less

Submitted 3 February, 2018; originally announced February 2018.

arXiv:1710.09798 [pdf, other]

Lip2AudSpec: Speech reconstruction from silent lip movements video

Authors: Hassan Akbari, Himani Arora, Liangliang Cao, Nima Mesgarani

Abstract: In this study, we propose a deep neural network for reconstructing intelligible speech from silent lip movement videos. We use auditory spectrogram as spectral representation of speech and its corresponding sound generation method resulting in a more natural sounding reconstructed speech. Our proposed network consists of an autoencoder to extract bottleneck features from the auditory spectrogram w… ▽ More In this study, we propose a deep neural network for reconstructing intelligible speech from silent lip movement videos. We use auditory spectrogram as spectral representation of speech and its corresponding sound generation method resulting in a more natural sounding reconstructed speech. Our proposed network consists of an autoencoder to extract bottleneck features from the auditory spectrogram which is then used as target to our main lip reading network comprising of CNN, LSTM and fully connected layers. Our experiments show that the autoencoder is able to reconstruct the original auditory spectrogram with a 98% correlation and also improves the quality of reconstructed speech from the main lip reading network. Our model, trained jointly on different speakers is able to extract individual speaker characteristics and gives promising results of reconstructing intelligible speech with superior word recognition accuracy. △ Less

Submitted 26 October, 2017; originally announced October 2017.

arXiv:1707.07701 [pdf, ps, other]

Interpolation on Gauss hypergeometric functions with an application

Authors: Hina Manoj Arora, Swadesh Kumar Sahoo

Abstract: In this paper, we use some standard numerical techniques to approximate the hypergeometric function $$ {}_2F_1[a,b;c;x]=1+\frac{ab}{c}x+\frac{a(a+1)b(b+1)}{c(c+1)}\frac{x^2}{2!}+\cdots $$ for a range of parameter triples $(a,b,c)$ on the interval $0<x<1$. Some of the familiar hypergeometric functional identities and asymptotic behavior of the hypergeometric function at $x=1$ play crucial roles in… ▽ More In this paper, we use some standard numerical techniques to approximate the hypergeometric function $$ {}_2F_1[a,b;c;x]=1+\frac{ab}{c}x+\frac{a(a+1)b(b+1)}{c(c+1)}\frac{x^2}{2!}+\cdots $$ for a range of parameter triples $(a,b,c)$ on the interval $0<x<1$. Some of the familiar hypergeometric functional identities and asymptotic behavior of the hypergeometric function at $x=1$ play crucial roles in deriving the formula for such approximations. We also focus on error analysis of the numerical approximations leading to monotone properties of quotient of gamma functions in parameter triples $(a,b,c)$. Finally, an application to continued fractions of Gauss is discussed followed by concluding remarks consisting of recent works on related problems. △ Less

Submitted 24 July, 2017; originally announced July 2017.

Comments: To appear in Involve-A Journal of Mathematics, 16 pages

MSC Class: 65D05; 33B15; 33B20; 33C05; 33F05

arXiv:1701.04743 [pdf, other]

doi 10.1109/WACV.2017.57

Computing Egomotion with Local Loop Closures for Egocentric Videos

Authors: Suvam Patra, Himanshu Aggarwal, Himani Arora, Chetan Arora, Subhashis Banerjee

Abstract: Finding the camera pose is an important step in many egocentric video applications. It has been widely reported that, state of the art SLAM algorithms fail on egocentric videos. In this paper, we propose a robust method for camera pose estimation, designed specifically for egocentric videos. In an egocentric video, the camera views the same scene point multiple times as the wearer's head sweeps ba… ▽ More Finding the camera pose is an important step in many egocentric video applications. It has been widely reported that, state of the art SLAM algorithms fail on egocentric videos. In this paper, we propose a robust method for camera pose estimation, designed specifically for egocentric videos. In an egocentric video, the camera views the same scene point multiple times as the wearer's head sweeps back and forth. We use this specific motion profile to perform short loop closures aligned with wearer's footsteps. For egocentric videos, depth estimation is usually noisy. In an important departure, we use 2D computations for rotation averaging which do not rely upon depth estimates. The two modification results in much more stable algorithm as is evident from our experiments on various egocentric video datasets for different egocentric applications. The proposed algorithm resolves a long standing problem in egocentric vision and unlocks new usage scenarios for future applications. △ Less

Submitted 17 January, 2017; originally announced January 2017.

Comments: Accepted in WACV 2017

arXiv:1606.05370 [pdf]

doi 10.1021/acs.nanolett.6b02345

Template-Assisted Direct Growth of 1Td/in$^2$ Bit Patterned Media

Authors: En Yang, Zuwei Liu, Hitesh Arora, Tsai-wei Wu, Vipin Ayanoor-Vitikkate, Detlef Spoddig, Daniel Bedau, Michael Grobis, Bruce A. Gurney, Thomas R. Albrecht, Bruce Terris

Abstract: We present a method for growing bit patterned magnetic recording media using directed growth of sputtered granular perpendicular magnetic recording media. The grain nucleation is templated using an epitaxial seed layer which contains Pt pillars separated by amorphous metal oxide. The scheme enables the creation of both templated data and servo regions suitable for high density hard disk drive oper… ▽ More We present a method for growing bit patterned magnetic recording media using directed growth of sputtered granular perpendicular magnetic recording media. The grain nucleation is templated using an epitaxial seed layer which contains Pt pillars separated by amorphous metal oxide. The scheme enables the creation of both templated data and servo regions suitable for high density hard disk drive operation. We illustrate the importance of using a process that is both topographically and chemically driven to achieve high quality media. △ Less

Submitted 16 June, 2016; originally announced June 2016.

Journal ref: Nano Letters (2016)

arXiv:1503.06664 [pdf]

doi 10.1109/TMAG.2015.2397880

Bit Patterned Magnetic Recording: Theory, Media Fabrication, and Recording Performance

Authors: Thomas R. Albrecht, Hitesh Arora, Vipin Ayanoor-Vitikkate, Jean-Marc Beaujour, Daniel Bedau, David Berman, Alexei L. Bogdanov, Yves-Andre Chapuis, Julia Cushen, Elizabeth E. Dobisz, Gregory Doerk, He Gao, Michael Grobis, Bruce Gurney, Weldon Hanson, Olav Hellwig, Toshiki Hirano, Pierre-Olivier Jubert, Dan Kercher, Jeffrey Lille, Zuwei Liu, C. Mathew Mate, Yuri Obukhov, Kanaiyalal C. Patel, Kurt Rubin , et al. (6 additional authors not shown)

Abstract: Bit Patterned Media (BPM) for magnetic recording provide a route to densities $>1 Tb/in^2$ and circumvents many of the challenges associated with conventional granular media technology. Instead of recording a bit on an ensemble of random grains, BPM uses an array of lithographically defined isolated magnetic islands, each of which stores one bit. Fabrication of BPM is viewed as the greatest challe… ▽ More Bit Patterned Media (BPM) for magnetic recording provide a route to densities $>1 Tb/in^2$ and circumvents many of the challenges associated with conventional granular media technology. Instead of recording a bit on an ensemble of random grains, BPM uses an array of lithographically defined isolated magnetic islands, each of which stores one bit. Fabrication of BPM is viewed as the greatest challenge for its commercialization. In this article we describe a BPM fabrication method which combines e-beam lithography, directed self-assembly of block copolymers, self-aligned double patterning, nanoimprint lithography, and ion milling to generate BPM based on CoCrPt alloys. This combination of fabrication technologies achieves feature sizes of $<10 nm$, significantly smaller than what conventional semiconductor nanofabrication methods can achieve. In contrast to earlier work which used hexagonal close-packed arrays of round islands, our latest approach creates BPM with rectangular bitcells, which are advantageous for integration with existing hard disk drive technology. The advantages of rectangular bits are analyzed from a theoretical and modeling point of view, and system integration requirements such as servo patterns, implementation of write synchronization, and providing for a stable head-disk interface are addressed in the context of experimental results. Optimization of magnetic alloy materials for thermal stability, writeability, and switching field distribution is discussed, and a new method for growing BPM islands on a patterned template is presented. New recording results at $1.6 Td/in^2$ (teradot/inch${}^2$, roughly equivalent to $1.3 Tb/in^2$) demonstrate a raw error rate $<10^{-2}$, which is consistent with the recording system requirements of modern hard drives. Extendibility of BPM to higher densities, and its eventual combination with energy assisted recording are explored. △ Less

Submitted 19 March, 2015; originally announced March 2015.

Comments: 44 pages

ACM Class: B.3.2; B.4.2

Showing 1–35 of 35 results for author: Arora, H