Showing 1–13 of 13 results for author: Garera, N

  1. arXiv:2406.19112  [pdf, other]

    cs.LG

    A Teacher Is Worth A Million Instructions

    Authors: Nikhil Kothari, Ravindra Nayak, Shreyas Shetty, Amey Patil, Nikesh Garera

    Abstract: Large Language Models (LLMs) have shown exceptional abilities, yet training these models can be quite challenging. There is a strong dependence on the quality of data and finding the best instruction tuning set. Further, the inherent limitations in training methods create substantial difficulties in training relatively smaller models with 7B and 13B parameters. In our research, we suggest an improved…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 7 pages, 4 figures

  2. arXiv:2406.10886  [pdf, other]

    cs.CL cs.LG

    Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM

    Authors: Sri Raghava Muddu, Rupasai Rangaraju, Tejpalsingh Siledar, Swaroop Nath, Pushpak Bhattacharyya, Swaprava Nath, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Sudhanshu Shekhar Singh, Nikesh Garera

    Abstract: Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limi…

    Submitted 16 June, 2024; originally announced June 2024.

  3. arXiv:2404.05243  [pdf, other]

    cs.CL cs.AI

    Product Description and QA Assisted Self-Supervised Opinion Summarization

    Authors: Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

    Abstract: In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) s…

    Submitted 8 April, 2024; originally announced April 2024.

  4. arXiv:2402.15473  [pdf, other]

    cs.CL cs.LG

    Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization

    Authors: Swaroop Nath, Tejpalsingh Siledar, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Harshad Khadilkar, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a dominant strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of ten…

    Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 19 pages, 6 figures, 21 tables

  5. arXiv:2402.11683  [pdf, other]

    cs.CL

    One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

    Authors: Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation; however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluat…

    Submitted 9 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  6. arXiv:2312.01107  [pdf, other]

    cs.LG

    Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning

    Authors: Raviraj Joshi, Nikesh Garera

    Abstract: Text-to-speech (TTS) systems are being built using end-to-end deep learning approaches. However, these systems require huge amounts of training data. We present our approach to build production quality TTS and perform speaker adaptation in extremely low resource settings. We propose a transfer learning approach using high-resource language data and synthetically generated data. We transfer the lea…

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at PACLIC 2023

  7. Code-Mixed Text to Speech Synthesis under Low-Resource Constraints

    Authors: Raviraj Joshi, Nikesh Garera

    Abstract: Text-to-speech (TTS) systems are an important component in voice-based e-commerce applications. These applications include end-to-end voice assistants and customer experience (CX) voice bots. Code-mixed TTS is also relevant in these applications since the product names are commonly described in English while the surrounding text is in a regional language. In this work, we describe our approaches for…

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at SPECOM 2023

  8. arXiv:2310.15259  [pdf, other]

    cs.CL cs.AI cs.LG

    Reference Free Domain Adaptation for Translation of Noisy Questions with Question Specific Rewards

    Authors: Baban Gain, Ramakrishna Appicharla, Soumya Chennabasavaraj, Nikesh Garera, Asif Ekbal, Muthusamy Chelliah

    Abstract: Community Question-Answering (CQA) portals serve as a valuable tool for helping users within an organization. However, making them accessible to non-English-speaking users continues to be a challenge. Translating questions can broaden the community's reach, benefiting individuals with similar inquiries in various languages. Translating questions using Neural Machine Translation (NMT) poses more ch…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Published at: Findings of EMNLP 2023

  9. arXiv:2305.18596  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Building Accurate Low Latency ASR for Streaming Voice Search

    Authors: Abhinav Goyal, Nikesh Garera

    Abstract: Automatic Speech Recognition (ASR) plays a crucial role in voice-based applications. For applications requiring real-time feedback like Voice Search, streaming capability becomes vital. While LSTM/RNN and CTC based ASR systems are commonly employed for low-latency streaming applications, they often exhibit lower accuracy compared to state-of-the-art models due to a lack of future audio frames. In…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 Industry Track

  10. arXiv:2211.07710  [pdf, other]

    cs.CL cs.HC cs.LG cs.SD eess.AS

    End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English

    Authors: Abhinav Goyal, Anupam Singh, Nikesh Garera

    Abstract: Automation of on-call customer support relies heavily on accurate and efficient speech-to-intent (S2I) systems. Building such systems using multi-component pipelines can pose various challenges because they require large annotated datasets, have higher latency, and have complex deployment. These pipelines are also prone to compounding errors. To overcome these challenges, we discuss an end-to-end…

    Submitted 26 October, 2022; originally announced November 2022.

    Comments: Accepted at EMNLP 2022 Industry Track

    Journal ref: https://aclanthology.org/2022.emnlp-industry.59

  11. arXiv:2208.03713  [pdf, other]

    cs.CL

    Study of Encoder-Decoder Architectures for Code-Mix Search Query Translation

    Authors: Mandar Kulkarni, Soumya Chennabasavaraj, Nikesh Garera

    Abstract: With the broad reach of the internet and smartphones, e-commerce platforms have an increasingly diversified user base. Since native language users are not conversant in English, their preferred browsing mode is their regional language or a combination of their regional language and English. From our recent study on the query data, we noticed that many of the queries we receive are code-mix, specif…

    Submitted 7 August, 2022; originally announced August 2022.

  12. arXiv:2208.03711  [pdf, other]

    cs.CL

    Vernacular Search Query Translation with Unsupervised Domain Adaptation

    Authors: Mandar Kulkarni, Nikesh Garera

    Abstract: With the democratization of e-commerce platforms, an increasingly diversified user base is opting to shop online. To provide a comfortable and reliable shopping experience, it's important to enable users to interact with the platform in the language of their choice. An accurate query translation is essential for Cross-Lingual Information Retrieval (CLIR) with vernacular queries. Due to internet-sc…

    Submitted 7 August, 2022; originally announced August 2022.

  13. arXiv:2111.14003  [pdf, other]

    cs.CL cs.LG

    Answer Generation for Questions With Multiple Information Sources in E-Commerce

    Authors: Anand A. Rajasekar, Nikesh Garera

    Abstract: Automatic question answering is an important yet challenging task in E-commerce given the millions of questions posted by users about the product that they are interested in purchasing. Hence, there is a great demand for automatic answer generation systems that provide quick responses using related information about the product. There are three sources of knowledge available for answering a user p…

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: 7 pages, 9 tables, 1 figure