Showing 1–50 of 74 results for author: Patil, A

Searching in archive cs.
  1. arXiv:2406.19112  [pdf, other]

    cs.LG

    A Teacher Is Worth A Million Instructions

    Authors: Nikhil Kothari, Ravindra Nayak, Shreyas Shetty, Amey Patil, Nikesh Garera

    Abstract: Large Language Models (LLMs) have shown exceptional abilities, yet training these models can be quite challenging. There is a strong dependence on the quality of data and finding the best instruction tuning set. Further, the inherent limitations in training methods create substantial difficulties to train relatively smaller models with 7B and 13B parameters. In our research, we suggest an improved…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 7 pages, 4 figures

  2. arXiv:2406.10886  [pdf, other]

    cs.CL cs.LG

    Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM

    Authors: Sri Raghava Muddu, Rupasai Rangaraju, Tejpalsingh Siledar, Swaroop Nath, Pushpak Bhattacharyya, Swaprava Nath, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Sudhanshu Shekhar Singh, Nikesh Garera

    Abstract: Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limi…

    Submitted 16 June, 2024; originally announced June 2024.

  3. arXiv:2405.15750  [pdf, other]

    cs.CL cs.AI cs.LG

    Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence

    Authors: Abhinav Patil, Jaap Jumelet, Yu Ying Chiu, Andy Lapastora, Peter Shen, Lexie Wang, Clevis Willrich, Shane Steinert-Threlkeld

    Abstract: This paper introduces Filtered Corpus Training, a method that trains language models (LMs) on corpora with certain linguistic constructions filtered out from the training data, and uses it to measure the ability of LMs to perform linguistic generalization on the basis of indirect evidence. We apply the method to both LSTM and Transformer LMs (of roughly comparable size), developing filtered corpor…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 10 pages + 7 pages of references/appendices. For code and trained models, see http://github.com/CLMBRs/corpus-filtering

  4. arXiv:2404.05243  [pdf, other]

    cs.CL cs.AI

    Product Description and QA Assisted Self-Supervised Opinion Summarization

    Authors: Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

    Abstract: In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) s…

    Submitted 8 April, 2024; originally announced April 2024.

  5. arXiv:2404.02868  [pdf, other]

    cs.ET

    UDON: A case for offloading to general purpose compute on CXL memory

    Authors: Jon Hermes, Josh Minor, Minjun Wu, Adarsh Patil, Eric Van Hensbergen

    Abstract: Upcoming CXL-based disaggregated memory devices feature special purpose units to offload compute to near-memory. In this paper, we explore opportunities for offloading compute to general purpose cores on CXL memory devices, thereby enabling a greater utility and diversity of offload. We study two classes of popular memory intensive applications: ML inference and vector database as candidates for…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Presented at the 3rd Workshop on Heterogeneous Composable and Disaggregated Systems (HCDS 2024)

  6. AI coach for badminton

    Authors: Dhruv Toshniwal, Arpit Patil, Nancy Vachhani

    Abstract: In the competitive realm of sports, optimal performance necessitates rigorous management of nutrition and physical conditioning. Specifically, in badminton, the agility and precision required make it an ideal candidate for motion analysis through video analytics. This study leverages advanced neural network methodologies to dissect video footage of badminton matches, aiming to extract detailed ins…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 7 pages, 11 figures. https://ieeexplore.ieee.org/document/9825164

    Journal ref: 2022 3rd International Conference for Emerging Technology (INCET), Belgaum, India, 2022, pp. 1-7

  7. arXiv:2402.15473  [pdf, other]

    cs.CL cs.LG

    Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization

    Authors: Swaroop Nath, Tejpalsingh Siledar, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Harshad Khadilkar, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a dominating strategy in aligning Language Models (LMs) with human values/goals. The key to the strategy is learning a reward model ($\varphi$), which can reflect the latent reward model of humans. While this strategy has proven effective, the training methodology requires a lot of human preference annotation (usually in the order of ten…

    Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 19 pages, 6 figures, 21 tables

  8. arXiv:2402.15081  [pdf, other]

    cs.SE

    How to Sustain a Scientific Open-Source Software Ecosystem: Learning from the Astropy Project

    Authors: Jiayi Sun, Aarya Patil, Youhai Li, Jin L. C. Guo, Shurui Zhou

    Abstract: Scientific open-source software (OSS) has greatly benefited research communities through its transparent and collaborative nature. Given its critical role in scientific research, ensuring the sustainability of such software has become vital. Earlier studies have proposed sustainability strategies for conventional scientific software and open-source communities. However, it remains unclear whether…

    Submitted 22 February, 2024; originally announced February 2024.

  9. arXiv:2402.11683  [pdf, other]

    cs.CL

    One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

    Authors: Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera

    Abstract: Evaluation of opinion summaries using conventional reference-based metrics rarely provides a holistic evaluation and has been shown to have a relatively low correlation with human judgments. Recent studies suggest using Large Language Models (LLMs) as reference-free metrics for NLG evaluation, however, they remain unexplored for opinion summary evaluation. Moreover, limited opinion summary evaluat…

    Submitted 9 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  10. Streaming Bilingual End-to-End ASR model using Attention over Multiple Softmax

    Authors: Aditya Patil, Vikas Joshi, Purvi Agrawal, Rupesh Mehta

    Abstract: Even with several advancements in multilingual modeling, it is challenging to recognize multiple languages using a single neural model, without knowing the input language and most multilingual models assume the availability of the input language. In this work, we propose a novel bilingual end-to-end (E2E) modeling approach, where a single neural model can recognize both languages and also support…

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: Published in IEEE's Spoken Language Technology (SLT) 2022, 8 pages (6 + 2 for references), 5 figures

    Journal ref: 2022 IEEE Spoken Language Technology Workshop (SLT), Doha, Qatar, 2023, pp. 252-259

  11. L3Cube-MahaSocialNER: A Social Media based Marathi NER Dataset and BERT models

    Authors: Harsh Chaudhari, Anuja Patil, Dhanashree Lavekar, Pranav Khairnar, Raviraj Joshi

    Abstract: This work introduces the L3Cube-MahaSocialNER dataset, the first and largest social media dataset specifically designed for Named Entity Recognition (NER) in the Marathi language. The dataset comprises 18,000 manually labeled sentences covering eight entity classes, addressing challenges posed by social media data, including non-standard language and informal idioms. Deep learning models, includin…

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted at Forum for Information Retrieval Evaluation (FIRE 2023)

  12. arXiv:2312.16015  [pdf, ps, other]

    cs.IR cs.AI cs.LG

    A Comprehensive Survey of Evaluation Techniques for Recommendation Systems

    Authors: Aryan Jadon, Avinash Patil

    Abstract: The effectiveness of recommendation systems is pivotal to user engagement and satisfaction in online platforms. As these recommendation systems increasingly influence user choices, their evaluation transcends mere technical performance and becomes central to business success. This paper addresses the multifaceted nature of recommendations system evaluation by introducing a comprehensive suite of m…

    Submitted 12 January, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 25 Pages

  13. On Significance of Subword tokenization for Low Resource and Efficient Named Entity Recognition: A case study in Marathi

    Authors: Harsh Chaudhari, Anuja Patil, Dhanashree Lavekar, Pranav Khairnar, Raviraj Joshi, Sachin Pande

    Abstract: Named Entity Recognition (NER) systems play a vital role in NLP applications such as machine translation, summarization, and question-answering. These systems identify named entities, which encompass real-world concepts like locations, persons, and organizations. Despite extensive research on NER systems for the English language, they have not received adequate attention in the context of low reso…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted at ICDAM 2023

  14. arXiv:2311.02216  [pdf, other]

    cs.CL cs.LG

    Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data

    Authors: Mubashara Akhtar, Abhilash Shankarampeta, Vivek Gupta, Arpit Patil, Oana Cocarascu, Elena Simperl

    Abstract: Numbers are crucial for various real-world domains such as finance, economics, and science. Thus, understanding and reasoning with numbers are essential skills for language models to solve different tasks. While different numerical benchmarks have been introduced in recent years, they are limited to specific numerical aspects mostly. In this paper, we propose a hierarchical taxonomy for numerical…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023 (Findings)

  15. arXiv:2310.09277  [pdf, other]

    cs.LG

    A Hybrid Approach for Depression Classification: Random Forest-ANN Ensemble on Motor Activity Signals

    Authors: Anket Patil, Dhairya Shah, Abhishek Shah, Mokshit Gala

    Abstract: Regarding the rising number of people suffering from mental health illnesses in today's society, the importance of mental health cannot be overstated. Wearable sensors, which are increasingly widely available, provide a potential way to track and comprehend mental health issues. These gadgets not only monitor everyday activities but also continuously record vital signs like heart rate, perhaps pro…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 8 pages

    MSC Class: 68T05

  16. arXiv:2308.09193  [pdf, other]

    cs.SE cs.CL cs.LG

    A Comparative Study of Text Embedding Models for Semantic Text Similarity in Bug Reports

    Authors: Avinash Patil, Kihwan Han, Aryan Jadon

    Abstract: Bug reports are an essential aspect of software development, and it is crucial to identify and resolve them quickly to ensure the consistent functioning of software systems. Retrieving similar bug reports from an existing database can help reduce the time and effort required to resolve bugs. In this paper, we compared the effectiveness of semantic textual similarity methods for retrieving similar…

    Submitted 30 November, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: 7 Pages

  17. arXiv:2308.04552  [pdf, other]

    cs.DB

    WhaleVis: Visualizing the History of Commercial Whaling

    Authors: Ameya Patil, Zoe Rand, Trevor Branch, Leilani Battle

    Abstract: Whales are an important part of the oceanic ecosystem. Although historic commercial whale hunting a.k.a. whaling has severely threatened whale populations, whale researchers are looking at historical whaling data to inform current whale status and future conservation efforts. To facilitate this, we worked with experts in aquatic and fishery sciences to create WhaleVis -- an interactive dashboard f…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 5 pages including references, 2 figures. Dashboard served live at https://observablehq.com/@whales/whale-vis-dashboard-expedition-routes. To be published in the October issue of TVCG 2023

    ACM Class: J.2; J.3; H.5; I.m

  18. arXiv:2306.04964  [pdf, other]

    cs.CL cs.LG

    Leveraging Language Identification to Enhance Code-Mixed Text Classification

    Authors: Gauri Takawane, Abhishek Phaltankar, Varad Patwardhan, Aryan Patil, Raviraj Joshi, Mukta S. Takalikar

    Abstract: The usage of more than one language in the same text is referred to as Code Mixed. It is evident that there is a growing degree of adaption of the use of code-mixed data, especially English with a regional language, on social media platforms. Existing deep-learning models do not take advantage of the implicit language information in the code-mixed text. Our study aims to improve BERT-based models…

    Submitted 8 June, 2023; originally announced June 2023.

  19. arXiv:2306.04699  [pdf, other]

    cs.CV

    DiViNeT: 3D Reconstruction from Disparate Views via Neural Template Regularization

    Authors: Aditya Vora, Akshay Gadi Patil, Hao Zhang

    Abstract: We present a volume rendering-based neural surface reconstruction method that takes as few as three disparate RGB images as input. Our key idea is to regularize the reconstruction, which is severely ill-posed and leaving significant gaps between the sparse views, by learning a set of neural templates to act as surface priors. Our method, coined DiViNet, operates in two stages. It first learns the…

    Submitted 1 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: To be presented at NeurIPS, 2023

  20. The ACROBAT 2022 Challenge: Automatic Registration Of Breast Cancer Tissue

    Authors: Philippe Weitz, Masi Valkonen, Leslie Solorzano, Circe Carr, Kimmo Kartasalo, Constance Boissin, Sonja Koivukoski, Aino Kuusela, Dusan Rasic, Yanbo Feng, Sandra Sinius Pouplier, Abhinav Sharma, Kajsa Ledesma Eriksson, Stephanie Robertson, Christian Marzahl, Chandler D. Gatenbee, Alexander R. A. Anderson, Marek Wodzinski, Artur Jurgas, Niccolò Marini, Manfredo Atzori, Henning Müller, Daniel Budelmann, Nick Weiss, Stefan Heldmann, et al. (16 additional authors not shown)

    Abstract: The alignment of tissue between histopathological whole-slide-images (WSI) is crucial for research and clinical applications. Advances in computing, deep learning, and availability of large WSI datasets have revolutionised WSI analysis. Therefore, the current state-of-the-art in WSI registration is unclear. To address this, we conducted the ACROBAT challenge, based on the largest WSI registration…

    Submitted 29 May, 2023; originally announced May 2023.

  21. Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data

    Authors: Aryan Patil, Varad Patwardhan, Abhishek Phaltankar, Gauri Takawane, Raviraj Joshi

    Abstract: The term "Code Mixed" refers to the use of more than one language in the same text. This phenomenon is predominantly observed on social media platforms, with an increasing amount of adaptation as time goes on. It is critical to detect foreign elements in a language and process them correctly, as a considerable number of individuals are using code-mixed languages that could not be comprehended by u…

    Submitted 26 May, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted at IEEE 8th International Conference for Convergence in Technology

  22. arXiv:2304.06342  [pdf, other]

    cs.CV cs.GR

    RoSI: Recovering 3D Shape Interiors from Few Articulation Images

    Authors: Akshay Gadi Patil, Yiming Qian, Shan Yang, Brian Jackson, Eric Bennett, Hao Zhang

    Abstract: The dominant majority of 3D models that appear in gaming, VR/AR, and those we use to train geometric deep learning algorithms are incomplete, since they are modeled as surface meshes and missing their interior structures. We present a learning framework to recover the shape interiors (RoSI) of existing 3D models with only their exteriors from multi-view and multi-articulation images. Given a set o…

    Submitted 13 April, 2023; originally announced April 2023.

  23. arXiv:2304.03188  [pdf, other]

    cs.GR

    Advances in Data-Driven Analysis and Synthesis of 3D Indoor Scenes

    Authors: Akshay Gadi Patil, Supriya Gadi Patil, Manyi Li, Matthew Fisher, Manolis Savva, Hao Zhang

    Abstract: This report surveys advances in deep learning-based modeling techniques that address four different 3D indoor scene analysis tasks, as well as synthesis of 3D indoor scenes. We describe different kinds of representations for indoor scenes, various indoor scene datasets available for research in the aforementioned areas, and discuss notable works employing machine learning models for such scene mod…

    Submitted 21 August, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: Published in Computer Graphics Forum, Aug 2023

  24. arXiv:2303.12308  [pdf, other]

    cs.CL

    XWikiGen: Cross-lingual Summarization for Encyclopedic Text Generation in Low Resource Languages

    Authors: Dhaval Taunk, Shivprasad Sagare, Anupam Patil, Shivansh Subramanian, Manish Gupta, Vasudeva Varma

    Abstract: Lack of encyclopedic text contributors, especially on Wikipedia, makes automated text generation for low resource (LR) languages a critical problem. Existing work on Wikipedia text generation has focused on English only where English reference articles are summarized to generate English Wikipedia pages. But, for low-resource languages, the scarcity of reference articles makes monolingual summariza…

    Submitted 18 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  25. arXiv:2303.11530  [pdf, other]

    cs.CV

    Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images

    Authors: Ruiqi Wang, Akshay Gadi Patil, Fenggen Yu, Hao Zhang

    Abstract: We introduce the first active learning (AL) framework for high-accuracy instance segmentation of moveable parts from RGB images of real indoor scenes. As with most human-in-the-loop approaches, the key criterion for success in AL is to minimize human effort while still attaining high performance. To this end, we employ a transformer that utilizes a masked-attention mechanism to supervise the activ…

    Submitted 27 November, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  26. arXiv:2303.09930  [pdf, other]

    cs.CV

    Robust Semi-Supervised Learning for Histopathology Images through Self-Supervision Guided Out-of-Distribution Scoring

    Authors: Nikhil Cherian Kurian, Varsha S, Abhijit Patil, Shashikant Khade, Amit Sethi

    Abstract: Semi-supervised learning (semi-SL) is a promising alternative to supervised learning for medical image analysis when obtaining good quality supervision for medical imaging is difficult. However, semi-SL assumes that the underlying distribution of unaudited data matches that of the few labeled samples, which is often violated in practical settings, particularly in medical images. The presence of ou…

    Submitted 17 March, 2023; originally announced March 2023.

  27. arXiv:2301.06928  [pdf, other]

    cs.LG cs.AI

    Towards Estimating Transferability using Hard Subsets

    Authors: Tarun Ram Menta, Surgan Jandial, Akash Patil, Vimal KB, Saketh Bachu, Balaji Krishnamurthy, Vineeth N. Balasubramanian, Chirag Agarwal, Mausoom Sarkar

    Abstract: As transfer learning techniques are increasingly used to transfer knowledge from the source model to the target task, it becomes important to quantify which source models are suitable for a given target task without performing computationally expensive fine tuning. In this work, we propose HASTE (HArd Subset TransfErability), a new strategy to estimate the transferability of a source model to a pa…

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: First three authors contributed equally

  28. Auto-labelling of Bug Report using Natural Language Processing

    Authors: Avinash Patil, Aryan Jadon

    Abstract: The exercise of detecting similar bug reports in bug tracking systems is known as duplicate bug report detection. Having prior knowledge of a bug report's existence reduces efforts put into debugging problems and identifying the root cause. Rule and Query-based solutions recommend a long list of potential similar bug reports with no clear ranking. In addition, triage engineers are less motivated t…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: 7 Pages, 11 Figures

    Journal ref: 2023 IEEE 8th International Conference for Convergence in Technology (I2CT)

  29. arXiv:2211.08956  [pdf]

    cs.NI

    A Comprehensive Survey on Spectrum Sharing Techniques for 5G/B5G Intelligent Wireless Networks: Opportunities, Challenges and Future Research Directions

    Authors: Anita Patil, Sridhar Iyer, Onel L. A. Lopez, Rahul J Pandya, Krishna Pai, Anshuman Kalla, Rakhee Kallimani

    Abstract: The increasing popularity of Internet of Everything and small-cell devices has enormously accelerated traffic loads. Consequently, increased bandwidth and high data rate requirements stimulate the operation at the millimeter wave and the Tera-Hertz spectrum bands in the fifth generation (5G) and beyond 5G (B5G) wireless networks. Furthermore, efficient spectrum allocation, maximizing the spectrum…

    Submitted 17 November, 2022; v1 submitted 16 November, 2022; originally announced November 2022.

  30. arXiv:2211.02989  [pdf, other]

    cs.LG cs.AI

    A Comprehensive Survey of Regression Based Loss Functions for Time Series Forecasting

    Authors: Aryan Jadon, Avinash Patil, Shruti Jadon

    Abstract: Time Series Forecasting has been an active area of research due to its many applications ranging from network usage prediction, resource allocation, anomaly detection, and predictive maintenance. Numerous publications published in the last five years have proposed diverse sets of objective loss functions to address cases such as biased data, long-term forecasting, multicollinear features, etc. In…

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: 13 pages, 23 figures

  31. arXiv:2207.11917  [pdf, other]

    cs.LG cs.AI cs.IR

    Boolean and $\mathbb{F}_p$-Matrix Factorization: From Theory to Practice

    Authors: Fedor Fomin, Fahad Panolan, Anurag Patil, Adil Tanveer

    Abstract: Boolean Matrix Factorization (BMF) aims to find an approximation of a given binary matrix as the Boolean product of two low-rank binary matrices. Binary data is ubiquitous in many fields, and representing data by binary matrices is common in medicine, natural language processing, bioinformatics, computer graphics, among many others. Unfortunately, BMF is computationally hard and heuristic algorith…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Appeared in IJCNN 2022

  32. arXiv:2205.08583  [pdf, other]

    eess.SY cs.RO

    Upper Bounds for Continuous-Time End-to-End Risks in Stochastic Robot Navigation

    Authors: Apurva Patil, Takashi Tanaka

    Abstract: We present an analytical method to estimate the continuous-time collision probability of motion plans for autonomous agents with linear controlled Ito dynamics. Motion plans generated by planning algorithms cannot be perfectly executed by autonomous agents in reality due to the inherent uncertainties in the real world. Estimating end-to-end risk is crucial to characterize the safety of trajectorie…

    Submitted 17 May, 2022; originally announced May 2022.

  33. arXiv:2205.00628  [pdf, other]

    math.OC cs.RO eess.SY

    Chance-Constrained Stochastic Optimal Control via Path Integral and Finite Difference Methods

    Authors: Apurva Patil, Alfredo Duarte, Aislinn Smith, Takashi Tanaka, Fabrizio Bisetti

    Abstract: This paper addresses a continuous-time continuous-space chance-constrained stochastic optimal control (SOC) problem via a Hamilton-Jacobi-Bellman (HJB) partial differential equation (PDE). Through Lagrangian relaxation, we convert the chance-constrained (risk-constrained) SOC problem to a risk-minimizing SOC problem, the cost function of which possesses the time-additive Bellman structure. We show…

    Submitted 1 May, 2022; originally announced May 2022.

  34. arXiv:2203.08429  [pdf]

    cs.NI

    A Survey of Machine Learning Algorithms for 6G Wireless Networks

    Authors: Anita Patil, Sridhar Iyer, Rahul Jashvantbhai Pandya

    Abstract: The primary focus of Artificial Intelligence/Machine Learning (AI/ML) integration within the wireless technology is to reduce capital expenditures, optimize network performance, and build new revenue streams. Replacing traditional algorithms with deep learning AI techniques have dramatically reduced the power consumption and improved the system performance. Further, implementation of ML algorithms…

    Submitted 16 March, 2022; originally announced March 2022.

  35. A Survey on Technological Trends to Enhance Spectrum Efficiency in 6G Communications

    Authors: Sridhar Iyer, Anita Patil, Shilpa Bhairanatti, Soumya Halagatti, Rahul Jashvantbhai Pandya

    Abstract: The research community has already identified that, by 2030, 5G networks will reach the capacity limits, and hence, will be inadequate to support next generation bandwidth-hungry, ubiquitous, intelligent services, and applications. Therefore, in view of sustaining the competitive edge of wireless technology and stratifying the next decade's communication requirements both, industry and research co…

    Submitted 23 February, 2022; originally announced February 2022.

    Journal ref: 2022

  36. arXiv:2112.13865  [pdf, other]

    eess.IV astro-ph.IM cs.CV cs.LG

    Astronomical Image Colorization and upscaling with Generative Adversarial Networks

    Authors: Shreyas Kalvankar, Hrushikesh Pandit, Pranav Parwate, Atharva Patil, Snehal Kamalapur

    Abstract: Automatic colorization of images without human intervention has been a subject of interest in the machine learning community for a brief period of time. Assigning color to an image is a highly ill-posed problem because of its innate nature of possessing very high degrees of freedom; given an image, there is often no single color-combination that is correct. Besides colorization, another problem in…

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: 14 pages, 10 figures, 7 tables

  37. arXiv:2110.15879  [pdf, other]

    cs.RO eess.SY

    Upper and Lower Bounds for End-to-End Risks in Stochastic Robot Navigation

    Authors: Apurva Patil, Takashi Tanaka

    Abstract: We present novel upper and lower bounds to estimate the collision probability of motion plans for autonomous agents with discrete-time linear Gaussian dynamics. Motion plans generated by planning algorithms cannot be perfectly executed by autonomous agents in reality due to the inherent uncertainties in the real world. Estimating collision probability is crucial to characterize the safety of traje…

    Submitted 29 October, 2021; originally announced October 2021.

  38. Using Natural Language Processing to Understand Reasons and Motivators Behind Customer Calls in Financial Domain

    Authors: Ankit Patil, Ankush Chopra, Sohom Ghosh, Vamshi Vadla

    Abstract: In this era of abundant digital information, customer satisfaction has become one of the prominent factors in the success of any business. Customers want a one-click solution for almost everything. They tend to get unsatisfied if they have to call about something which they could have done online. Moreover, incoming calls are a high-cost component for any business. Thus, it is essential to develop…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: Accepted at ICCMDE-2021. To be published in Springer - Lecture Notes on Data Engineering and Communications Technologies

  39. arXiv:2110.00669  [pdf, ps, other]

    cs.RO

    Expanding the Design Space for Electrically-Driven Soft Robots through Handed Shearing Auxetics

    Authors: Ian Good, Tosh Brown-Moore, Aditya Patil, Daniel Revier, Jeffrey Ian Lipton

    Abstract: Handed Shearing Auxetics (HSA) are a promising structure for making electrically driven robots with distributed compliance that convert a motor's rotation and torque into extension and force. We overcame past limitations on the range of actuation, blocked force, and stiffness by focusing on two key design parameters: the point of an HSA's auxetic trajectory that is energetically preferred, and the…

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: 6 pages+citations, 6 figures, submitted to ICRA 2022

  40. arXiv:2105.14710  [pdf, other]

    cs.LG stat.ML

    Robustifying $\ell_\infty$ Adversarial Training to the Union of Perturbation Models

    Authors: Ameya D. Patil, Michael Tuttle, Alexander G. Schwing, Naresh R. Shanbhag

    Abstract: Classical adversarial training (AT) frameworks are designed to achieve high adversarial accuracy against a single attack type, typically $\ell_\infty$ norm-bounded perturbations. Recent extensions in AT have focused on defending against the union of multiple perturbations but this benefit is obtained at the expense of a significant (up to $10\times$) increase in training complexity over single-att…

    Submitted 11 June, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

  41. arXiv:2104.14264  [pdf]

    eess.AS cs.NE cs.SD q-bio.NC

    Hardware-Friendly Synaptic Orders and Timescales in Liquid State Machines for Speech Classification

    Authors: Vivek Saraswat, Ajinkya Gorad, Anand Naik, Aakash Patil, Udayan Ganguly

    Abstract: Liquid State Machines are brain inspired spiking neural networks (SNNs) with random reservoir connectivity and bio-mimetic neuronal and synaptic models. Reservoir computing networks are proposed as an alternative to deep neural networks to solve temporal classification problems. Previous studies suggest 2nd order (double exponential) synaptic waveform to be crucial for achieving high accuracy for…

    Submitted 29 April, 2021; originally announced April 2021.

  42. arXiv:2012.06547  [pdf, other]

    cs.CV cs.IR

    LayoutGMN: Neural Graph Matching for Structural Layout Similarity

    Authors: Akshay Gadi Patil, Manyi Li, Matthew Fisher, Manolis Savva, Hao Zhang

    Abstract: We present a deep neural network to predict structural similarity between 2D layouts by leveraging Graph Matching Networks (GMN). Our network, coined LayoutGMN, learns the layout metric via neural graph matching, using an attention-based GMN designed under a triplet network setting. To train our network, we utilize weak labels obtained by pixel-wise Intersection-over-Union (IoUs) to define the tri…

    Submitted 5 April, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

  43. arXiv:2011.15000  [pdf, other]

    cs.CV cs.LG eess.IV

    Fast, Self Supervised, Fully Convolutional Color Normalization of H&E Stained Images

    Authors: Abhijeet Patil, Mohd. Talha, Aniket Bhatia, Nikhil Cherian Kurian, Sammed Mangale, Sunil Patel, Amit Sethi

    Abstract: Performance of deep learning algorithms decreases drastically if the data distributions of the training and testing sets are different. Due to variations in staining protocols, reagent brands, and habits of technicians, color variation in digital histopathology images is quite common. Color variation causes problems for the deployment of deep learning-based solutions for automatic diagnosis system…

    Submitted 30 November, 2020; originally announced November 2020.

  44. arXiv:2008.07788  [pdf, other]

    eess.AS cs.LG

    CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion

    Authors: Maitreya Patel, Mirali Purohit, Jui Shah, Hemant A. Patil

    Abstract: Recently, Generative Adversarial Networks (GAN)-based methods have shown remarkable performance for the Voice Conversion and WHiSPer-to-normal SPeeCH (WHSP2SPCH) conversion. One of the key challenges in WHSP2SPCH conversion is the prediction of fundamental frequency (F0). Recently, authors have proposed state-of-the-art method Cycle-Consistent Generative Adversarial Networks (CycleGAN) for WHSP2SP…

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: Accepted in 28th European Signal Processing Conference (EUSIPCO), 2020

  45. arXiv:2006.09464  [pdf, other]

    eess.IV cs.LG q-bio.QM

    Visualization for Histopathology Images using Graph Convolutional Neural Networks

    Authors: Mookund Sureka, Abhijeet Patil, Deepak Anand, Amit Sethi

    Abstract: With the increase in the use of deep learning for computer-aided diagnosis in medical images, the criticism of the black-box nature of the deep learning models is also on the rise. The medical community needs interpretable models for both due diligence and advancing the understanding of disease and treatment mechanisms. In histology, in particular, while there is rich detail available at the cellu…

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 5 pages, 3 Figures

  46. arXiv:2004.02498  [pdf, other]

    cs.CV q-bio.PE

    Image-based phenotyping of diverse Rice (Oryza Sativa L.) Genotypes

    Authors: Mukesh Kumar Vishal, Dipesh Tamboli, Abhijeet Patil, Rohit Saluja, Biplab Banerjee, Amit Sethi, Dhandapani Raju, Sudhir Kumar, R N Sahoo, Viswanathan Chinnusamy, J Adinarayana

    Abstract: Development of either drought-resistant or drought-tolerant varieties in rice (Oryza sativa L.), especially for high yield in the context of climate change, is a crucial task across the world. The need for high yielding rice varieties is a prime concern for developing nations like India, China, and other Asian-African countries where rice is a primary staple food. The present investigation is carr…

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: Paper presented at the ICLR 2020 Workshop on Computer Vision for Agriculture (CV4A)

  47. arXiv:2003.00823  [pdf, other]

    cs.CV cs.LG stat.ML

    Breast Cancer Histopathology Image Classification and Localization using Multiple Instance Learning

    Authors: Abhijeet Patil, Dipesh Tamboli, Swati Meena, Deepak Anand, Amit Sethi

    Abstract: Breast cancer has the highest mortality among cancers in women. Computer-aided pathology to analyze microscopic histopathology images for diagnosis with an increasing number of breast cancer patients can bring the cost and delays of diagnosis down. Deep learning in histopathology has attracted attention over the last decade of achieving state-of-the-art performance in classification and localizati…

    Submitted 16 February, 2020; originally announced March 2020.

    Comments: Accepted in 2019 5th IEEE International WIE Conference on Electrical and Computer Engineering (WIECON-ECE) and Awarded as best paper

  48. arXiv:2002.01664  [pdf, other]

    cs.CL cs.LG eess.AS

    Identification of Indian Languages using Ghost-VLAD pooling

    Authors: Krishna D N, Ankita Patil, M. S. P Raj, Sai Prasad H S, Prabhu Aashish Garapati

    Abstract: In this work, we propose a new pooling strategy for language identification by considering Indian languages. The idea is to obtain utterance level features for any variable length audio for robust language recognition. We use the GhostVLAD approach to generate an utterance level feature vector for any variable length input audio by aggregating the local frame level features across time. The genera…

    Submitted 5 February, 2020; originally announced February 2020.

    Journal ref: REJECTED ICASSP 2020

  49. arXiv:2002.01073  [pdf, other]

    cs.AR

    TLB and Pagewalk Performance in Multicore Architectures with Large Die-Stacked DRAM Cache

    Authors: Adarsh Patil

    Abstract: In this work we study the overheads of virtual-to-physical address translation in processor architectures, like x86-64, that implement paged virtual memory using a radix tree which are walked in hardware. Translation Lookaside Buffers are critical to system performance, particularly as applications demand larger memory footprints and with the adoption of virtualization; however the cost of a TLB m…

    Submitted 3 February, 2020; originally announced February 2020.

  50. arXiv:1912.01853  [pdf, other]

    cs.LG stat.ML

    ADEPOS: A Novel Approximate Computing Framework for Anomaly Detection Systems and its Implementation in 65nm CMOS

    Authors: Sumon Kumar Bose, Bapi Kar, Mohendra Roy, Pradeep Kumar Gopalakrishnan, Zhang Lei, Aakash Patil, Arindam Basu

    Abstract: To overcome the energy and bandwidth limitations of traditional IoT systems, edge computing or information extraction at the sensor node has become popular. However, now it is important to create very low energy information extraction or pattern recognition systems. In this paper, we present an approximate computing method to reduce the computation energy of a specific type of IoT system used for…

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: 14 pages

    Journal ref: Preprint TCAS-I 2019