Showing 1–19 of 19 results for author: Arora, H

Searching in archive cs.
  1. TrICy: Trigger-guided Data-to-text Generation with Intent aware Attention-Copy

    Authors: Vibhav Agarwal, Sourav Ghosh, Harichandana BSS, Himanshu Arora, Barath Raj Kandur Raja

    Abstract: Data-to-text (D2T) generation is a crucial task in many natural language understanding (NLU) applications and forms the foundation of task-oriented dialog systems. In the context of conversational AI solutions that can work directly with local data on the user's device, architectures utilizing large pre-trained language models (PLMs) are impractical for on-device deployment due to a high memory fo…

    Submitted 25 January, 2024; originally announced February 2024.

    Comments: Published in the IEEE/ACM Transactions on Audio, Speech, and Language Processing. (Sourav Ghosh and Vibhav Agarwal contributed equally to this work.)

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1173-1184, 2024

  2. arXiv:2312.00766  [pdf, other]

    cs.CV cs.AI

    Automated Material Properties Extraction For Enhanced Beauty Product Discovery and Makeup Virtual Try-on

    Authors: Fatemeh Taheri Dezaki, Himanshu Arora, Rahul Suresh, Amin Banitalebi-Dehkordi

    Abstract: The multitude of makeup products available can make it challenging to find the ideal match for desired attributes. An intelligent approach for product discovery is required to enhance the makeup shopping experience to make it more convenient and satisfying. However, enabling accurate and efficient product discovery requires extracting detailed attributes like color and finish type. Our work introd…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Presented in Fifth Workshop on Recommender Systems in Fashion (fashionxrecsys) of ACM Conference on Recommender Systems

  3. arXiv:2209.02834  [pdf, other]

    cs.CV

    Unsupervised Scene Sketch to Photo Synthesis

    Authors: Jiayun Wang, Sangryul Jeon, Stella X. Yu, Xi Zhang, Himanshu Arora, Yu Lou

    Abstract: Sketches make an intuitive and powerful visual expression as they are fast executed freehand drawings. We present a method for synthesizing realistic photos from scene sketches. Without the need for sketch and photo pairs, our framework directly learns from readily available large-scale photo datasets in an unsupervised manner. To this end, we introduce a standardization module that provides pseud…

    Submitted 6 September, 2022; originally announced September 2022.

    Journal ref: ECCVW 2022

  4. arXiv:2204.04867  [pdf, other]

    cs.CV

    Structured Graph Variational Autoencoders for Indoor Furniture layout Generation

    Authors: Aditya Chattopadhyay, Xi Zhang, David Paul Wipf, Himanshu Arora, Rene Vidal

    Abstract: We present a structured graph variational autoencoder for generating the layout of indoor 3D scenes. Given the room type (e.g., living room or library) and the room layout (e.g., room elements such as floor and walls), our architecture generates a collection of objects (e.g., furniture items such as sofa, table and chairs) that is consistent with the room type and layout. This is a challenging pro…

    Submitted 22 July, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  5. VoiceMoji: A Novel On-Device Pipeline for Seamless Emoji Insertion in Dictation

    Authors: Sumit Kumar, Harichandana B S S, Himanshu Arora

    Abstract: Most of the speech recognition systems recover only words in the speech and fail to capture emotions. Users have to manually add emoji(s) in text for adding tone and making communication fun. Though there is much work done on punctuation addition on transcribed speech, the area of emotion addition is untouched. In this paper, we propose a novel on-device pipeline to enrich the voice input experien…

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: Accepted at IEEE INDICON 2021, 19-21 December, 2021, India

  6. LIDSNet: A Lightweight on-device Intent Detection model using Deep Siamese Network

    Authors: Vibhav Agarwal, Sudeep Deepak Shivnikar, Sourav Ghosh, Himanshu Arora, Yashwant Saini

    Abstract: Intent detection is a crucial task in any Natural Language Understanding (NLU) system and forms the foundation of a task-oriented dialogue system. To build high-quality real-world conversational solutions for edge devices, there is a need for deploying intent detection model on device. This necessitates a light-weight, fast, and accurate model that can perform efficiently in a resource-constrained…

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in 2021 IEEE 20th International Conference on Machine Learning and Applications (ICMLA)

    Journal ref: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA), Pasadena, CA, USA, 2021, pp. 1112-1117

  7. arXiv:2110.06199  [pdf, other]

    cs.CV cs.AI cs.GR

    ABO: Dataset and Benchmarks for Real-World 3D Object Understanding

    Authors: Jasmine Collins, Shubham Goel, Kenan Deng, Achleshwar Luthra, Leon Xu, Erhan Gundogdu, Xi Zhang, Tomas F. Yago Vicente, Thomas Dideriksen, Himanshu Arora, Matthieu Guillaumin, Jitendra Malik

    Abstract: We introduce Amazon Berkeley Objects (ABO), a new large-scale dataset designed to help bridge the gap between real and virtual 3D worlds. ABO contains product catalog images, metadata, and artist-created 3D models with complex geometries and physically-based materials that correspond to real, household objects. We derive challenging benchmarks that exploit the unique properties of ABO and measure…

    Submitted 24 June, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  8. arXiv:2110.00644  [pdf, other]

    cs.CV

    RoomStructNet: Learning to Rank Non-Cuboidal Room Layouts From Single View

    Authors: Xi Zhang, Chun-Kai Wang, Kenan Deng, Tomas Yago-Vicente, Himanshu Arora

    Abstract: In this paper, we present a new approach to estimate the layout of a room from its single image. While recent approaches for this task use robust features learnt from data, they resort to optimization for detecting the final layout. In addition to using learnt robust features, our approach learns an additional ranking function to estimate the final layout instead of using optimization. To learn th…

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: 10 pages

  9. arXiv:2106.16237  [pdf, other]

    cs.CV

    Multimodal Shape Completion via IMLE

    Authors: Himanshu Arora, Saurabh Mishra, Shichong Peng, Ke Li, Ali Mahdavi-Amiri

    Abstract: Shape completion is the problem of completing partial input shapes such as partial scans. This problem finds important applications in computer vision and robotics due to issues such as occlusion or sparsity in real-world data. However, most of the existing research related to shape completion has been focused on completing shapes by learning a one-to-one mapping which limits the diversity and cre…

    Submitted 7 July, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: Project Website: https://sites.google.com/site/alimahdaviamiri/projects/shape-completion

  10. arXiv:2101.05970  [pdf, other]

    cs.LG cs.AI cs.RO

    Affordance-based Reinforcement Learning for Urban Driving

    Authors: Tanmay Agarwal, Hitesh Arora, Jeff Schneider

    Abstract: Traditional autonomous vehicle pipelines that follow a modular approach have been very successful in the past both in academia and industry, which has led to autonomy deployed on road. Though this approach provides ease of interpretation, its generalizability to unseen environments is limited and hand-engineering of numerous parameters is required, especially in the prediction and planning systems…

    Submitted 15 January, 2021; originally announced January 2021.

  11. arXiv:2101.04456  [pdf]

    cs.CL

    A character representation enhanced on-device Intent Classification

    Authors: Sudeep Deepak Shivnikar, Himanshu Arora, Harichandana B S S

    Abstract: Intent classification is an important task in natural language understanding systems. Existing approaches have achieved perfect scores on the benchmark datasets. However they are not suitable for deployment on low-resource devices like mobiles, tablets, etc. due to their massive model size. Therefore, in this paper, we present a novel light-weight architecture for intent classification that can ru…

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: Accepted for publication in ICON 2020: 17th International Conference on Natural Language Processing

  12. arXiv:2008.05723  [pdf, other]

    cs.CV

    Contextual Diversity for Active Learning

    Authors: Sharat Agarwal, Himanshu Arora, Saket Anand, Chetan Arora

    Abstract: Requirement of large annotated datasets restrict the use of deep convolutional neural networks (CNNs) for many practical applications. The problem can be mitigated by using active learning (AL) techniques which, under a given annotation budget, allow to select a subset of data that yields maximum accuracy upon fine tuning. State of the art AL approaches typically rely on measures of visual diversi…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: A variant of this report is accepted in ECCV 2020

  13. arXiv:2004.04146  [pdf, other]

    physics.soc-ph cs.SI

    Complex Network Analysis of Indian Railway Zones

    Authors: Nikhil Kumar Rajput, Piyush Badola, Harshit Arora, Bhavya Ahuja Grover

    Abstract: Indian Railway Network has been analyzed on the basis of number of trains directly linking two railway zones. The network has been displayed as a weighted graph where the weights denote the number of trains between the zones. It may be pointed out that each zone is a complex network in itself and may depict different characteristic features. The zonal network therefore can be considered as a netwo…

    Submitted 8 April, 2020; originally announced April 2020.

  14. Iteratively Composing Statically Verified Traits

    Authors: Isaac Oscar Gariano, Marco Servetto, Alex Potanin, Hrshikesh Arora

    Abstract: Static verification relying on an automated theorem prover can be very slow and brittle: since static verification is undecidable, correct code may not pass a particular static verifier. In this work we use metaprogramming to generate code that is correct by construction. A theorem prover is used only to verify initial "traits": units of code that can be used to compose bigger programs. In our w…

    Submitted 20 August, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: In Proceedings VPT 2019, arXiv:1908.06723

    Journal ref: EPTCS 299, 2019, pp. 49-55

  15. arXiv:1902.05436  [pdf, ps, other]

    cs.SE cs.PL

    Checking Observational Purity of Procedures

    Authors: Himanshu Arora, Raghavan Komondoor, G. Ramalingam

    Abstract: Verifying whether a procedure is observationally pure is useful in many software engineering scenarios. An observationally pure procedure always returns the same value for the same argument, and thus mimics a mathematical function. The problem is challenging when procedures use private mutable global variables, e.g., for memoization of frequently returned answers, and when they involve recursion.…

    Submitted 14 February, 2019; originally announced February 2019.

    Comments: FASE 2019

  16. Separating Use and Reuse to Improve Both

    Authors: Hrshikesh Arora, Marco Servetto, Bruno C. D. S. Oliveira

    Abstract: Context: Trait composition has inspired new research in the area of code reuse for object oriented (OO) languages. One of the main advantages of this kind of composition is that it makes possible to separate subtyping from subclassing; which is good for code-reuse, design and reasoning. However, handling of state within traits is difficult, verbose or inelegant. Inquiry: We identify the this-leaki…

    Submitted 1 February, 2019; originally announced February 2019.

    Journal ref: The Art, Science, and Engineering of Programming, 2019, Vol. 3, Issue 3, Article 12

  17. arXiv:1802.01034  [pdf, other]

    cs.LG stat.ML

    Multi-task Learning for Continuous Control

    Authors: Himani Arora, Rajath Kumar, Jason Krone, Chong Li

    Abstract: Reliable and effective multi-task learning is a prerequisite for the development of robotic agents that can quickly learn to accomplish related, everyday tasks. However, in the reinforcement learning domain, multi-task learning has not exhibited the same level of success as in other domains, such as computer vision. In addition, most reinforcement learning research on multi-task learning has been…

    Submitted 3 February, 2018; originally announced February 2018.

  18. arXiv:1710.09798  [pdf, other]

    cs.CV eess.AS eess.IV

    Lip2AudSpec: Speech reconstruction from silent lip movements video

    Authors: Hassan Akbari, Himani Arora, Liangliang Cao, Nima Mesgarani

    Abstract: In this study, we propose a deep neural network for reconstructing intelligible speech from silent lip movement videos. We use auditory spectrogram as spectral representation of speech and its corresponding sound generation method resulting in a more natural sounding reconstructed speech. Our proposed network consists of an autoencoder to extract bottleneck features from the auditory spectrogram w…

    Submitted 26 October, 2017; originally announced October 2017.

  19. Computing Egomotion with Local Loop Closures for Egocentric Videos

    Authors: Suvam Patra, Himanshu Aggarwal, Himani Arora, Chetan Arora, Subhashis Banerjee

    Abstract: Finding the camera pose is an important step in many egocentric video applications. It has been widely reported that, state of the art SLAM algorithms fail on egocentric videos. In this paper, we propose a robust method for camera pose estimation, designed specifically for egocentric videos. In an egocentric video, the camera views the same scene point multiple times as the wearer's head sweeps ba…

    Submitted 17 January, 2017; originally announced January 2017.

    Comments: Accepted in WACV 2017