
Showing 1–26 of 26 results for author: Patil, G

  1. Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval

    Authors: Rohan Chavan, Gaurav Patil, Vishal Madle, Raviraj Joshi

    Abstract: Stopwords are commonly used words in a language that are often considered to be of little value in determining the meaning or significance of a document. These words occur frequently in most texts and don't provide much useful information for tasks like sentiment analysis and text classification. English, which is a high-resource language, takes advantage of the availability of stopwords, whereas…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Accepted at I2CT 2024

  2. arXiv:2404.06921  [pdf, other]

    cs.CL cs.AI

    GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

    Authors: Shishir G. Patil, Tianjun Zhang, Vivian Fang, Noppapon C., Roy Huang, Aaron Hao, Martin Casado, Joseph E. Gonzalez, Raluca Ada Popa, Ion Stoica

    Abstract: Large Language Models (LLMs) are evolving beyond their classical role of providing information within dialogue systems to actively engaging with tools and performing actions on real-world applications and services. Today, humans verify the correctness and appropriateness of the LLM-generated outputs (e.g., code, functions, or actions) before putting them into real-world execution. This poses signi…

    Submitted 10 April, 2024; originally announced April 2024.

  3. arXiv:2403.10131  [pdf, other]

    cs.CL cs.AI

    RAFT: Adapting Language Model to Domain Specific RAG

    Authors: Tianjun Zhang, Shishir G. Patil, Naman Jain, Sheng Shen, Matei Zaharia, Ion Stoica, Joseph E. Gonzalez

    Abstract: Pretraining Large Language Models (LLMs) on large corpora of textual data is now a standard paradigm. When using these LLMs for many downstream applications, it is common to additionally bake in new knowledge (e.g., time-critical news, or private domain knowledge) into the pretrained model either through RAG-based-prompting, or fine-tuning. However, the optimal methodology for the model to gain su…

    Submitted 5 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  4. arXiv:2310.08560  [pdf, other]

    cs.AI

    MemGPT: Towards LLMs as Operating Systems

    Authors: Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, Joseph E. Gonzalez

    Abstract: Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis. To enable using context beyond limited context windows, we propose virtual context management, a technique drawing inspiration from hierarchical memory systems in traditional operating systems that provide the appea…

    Submitted 12 February, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Code and data available at https://research.memgpt.ai

  5. arXiv:2306.04699  [pdf, other]

    cs.CV

    DiViNeT: 3D Reconstruction from Disparate Views via Neural Template Regularization

    Authors: Aditya Vora, Akshay Gadi Patil, Hao Zhang

    Abstract: We present a volume rendering-based neural surface reconstruction method that takes as few as three disparate RGB images as input. Our key idea is to regularize the reconstruction, which is severely ill-posed and leaving significant gaps between the sparse views, by learning a set of neural templates to act as surface priors. Our method, coined DiViNet, operates in two stages. It first learns the…

    Submitted 1 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: To be presented at NeurIPS, 2023

  6. arXiv:2305.18557  [pdf, other]

    cs.CV

    Evaluating 3D Shape Analysis Methods for Robustness to Rotation Invariance

    Authors: Supriya Gadi Patil, Angel X. Chang, Manolis Savva

    Abstract: This paper analyzes the robustness of recent 3D shape descriptors to SO(3) rotations, something that is fundamental to shape modeling. Specifically, we formulate the task of rotated 3D object instance detection. To do so, we consider a database of 3D indoor scenes, where objects occur in different orientations. We benchmark different methods for feature extraction and classification in the context…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 20th Conference on Robots and Vision (CRV) 2023

  7. arXiv:2305.15334  [pdf, other]

    cs.CL cs.AI

    Gorilla: Large Language Model Connected with Massive APIs

    Authors: Shishir G. Patil, Tianjun Zhang, Xin Wang, Joseph E. Gonzalez

    Abstract: Large Language Models (LLMs) have seen an impressive wave of advances recently, with models now excelling in a variety of tasks, such as mathematical reasoning and program synthesis. However, their potential to effectively use tools via API calls remains unfulfilled. This is a challenging task even for today's state-of-the-art LLMs such as GPT-4, largely due to their inability to generate accurate…

    Submitted 24 May, 2023; originally announced May 2023.

  8. arXiv:2304.06342  [pdf, other]

    cs.CV cs.GR

    RoSI: Recovering 3D Shape Interiors from Few Articulation Images

    Authors: Akshay Gadi Patil, Yiming Qian, Shan Yang, Brian Jackson, Eric Bennett, Hao Zhang

    Abstract: The dominant majority of 3D models that appear in gaming, VR/AR, and those we use to train geometric deep learning algorithms are incomplete, since they are modeled as surface meshes and missing their interior structures. We present a learning framework to recover the shape interiors (RoSI) of existing 3D models with only their exteriors from multi-view and multi-articulation images. Given a set o…

    Submitted 13 April, 2023; originally announced April 2023.

  9. arXiv:2304.03188  [pdf, other]

    cs.GR

    Advances in Data-Driven Analysis and Synthesis of 3D Indoor Scenes

    Authors: Akshay Gadi Patil, Supriya Gadi Patil, Manyi Li, Matthew Fisher, Manolis Savva, Hao Zhang

    Abstract: This report surveys advances in deep learning-based modeling techniques that address four different 3D indoor scene analysis tasks, as well as synthesis of 3D indoor scenes. We describe different kinds of representations for indoor scenes, various indoor scene datasets available for research in the aforementioned areas, and discuss notable works employing machine learning models for such scene mod…

    Submitted 21 August, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: Published in Computer Graphics Forum, Aug 2023

  10. arXiv:2303.11530  [pdf, other]

    cs.CV

    Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images

    Authors: Ruiqi Wang, Akshay Gadi Patil, Fenggen Yu, Hao Zhang

    Abstract: We introduce the first active learning (AL) model for high-accuracy instance segmentation of moveable parts from RGB images of real indoor scenes. Specifically, our goal is to obtain fully validated segmentation results by humans while minimizing manual effort. To this end, we employ a transformer that utilizes a masked-attention mechanism to supervise the active segmentation. To enhance the netwo…

    Submitted 7 July, 2024; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted to ECCV 2024. Project page: https://suikei-wang.github.io/mvp-seg/

  11. arXiv:2303.03286  [pdf]

    cond-mat.mtrl-sci

    Giant electromechanical response from defective non-ferroelectric epitaxial BaTiO3 integrated on Si 100

    Authors: Sandeep Vura, Shubham Kumar Parate, Subhajit Pal, Upanya Khandelwal, Rajeev Kumar Rai, Sri Harsha Molleti, Vishnu Kumar, Rama Satya Sandilya Ventrapragada, Girish Patil, Mudit Jain, Ambresh Mallya, Majid Ahmadi, Bart Kooi, Sushobhan Avasthi, Rajeev Ranjan, Srinivasan Raghavan, Saurabh Chandorkar, Pavan Nukala

    Abstract: Lead free, silicon compatible materials showing large electromechanical responses comparable to, or better than conventional relaxor ferroelectrics, are desirable for various nanoelectromechanical devices and applications. Defect-engineered electrostriction has recently been gaining popularity to obtain enhanced electromechanical responses at sub 100 Hz frequencies. Here, we report record values o…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 26 pages, 4 figures, 8 supplementary figures

  12. arXiv:2211.03011  [pdf, other]

    cs.LG eess.SY stat.ML

    On learning history based policies for controlling Markov decision processes

    Authors: Gandharv Patil, Aditya Mahajan, Doina Precup

    Abstract: Reinforcement learning (RL) folklore suggests that history-based function approximation methods, such as recurrent neural nets or history-based state abstraction, perform better than their memory-less counterparts, due to the fact that function approximation in Markov decision processes (MDP) can be viewed as inducing a Partially observable MDP. However, there has been little formal analysis of such history-…

    Submitted 5 November, 2022; originally announced November 2022.

  13. arXiv:2210.07259  [pdf, other]

    cs.NI cs.DC

    Skyplane: Optimizing Transfer Cost and Throughput Using Cloud-Aware Overlays

    Authors: Paras Jain, Sam Kumar, Sarah Wooders, Shishir G. Patil, Joseph E. Gonzalez, Ion Stoica

    Abstract: Cloud applications are increasingly distributing data across multiple regions and cloud providers. Unfortunately, wide-area bulk data transfers are often slow, bottlenecking applications. We demonstrate that it is possible to significantly improve inter-region cloud bulk transfer throughput by adapting network overlays to the cloud setting -- that is, by routing data through indirect paths at the…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: To appear at NSDI 2023

  14. arXiv:2210.05918  [pdf, ps, other]

    cs.LG cs.AI eess.SY stat.ML

    Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation

    Authors: Gandharv Patil, Prashanth L. A., Dheeraj Nagaraj, Doina Precup

    Abstract: We study the finite-time behaviour of the popular temporal difference (TD) learning algorithm when combined with tail-averaging. We derive finite time bounds on the parameter error of the tail-averaged TD iterate under a step-size choice that does not require information about the eigenvalues of the matrix underlying the projected TD fixed point. Our analysis shows that tail-averaged TD converges…

    Submitted 11 September, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Journal ref: Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, Proceedings of Machine Learning Research, 2023

  15. arXiv:2207.07697  [pdf, other]

    cs.LG cs.CV cs.DC stat.ML

    POET: Training Neural Networks on Tiny Devices with Integrated Rematerialization and Paging

    Authors: Shishir G. Patil, Paras Jain, Prabal Dutta, Ion Stoica, Joseph E. Gonzalez

    Abstract: Fine-tuning models on edge devices like mobile phones would enable privacy-preserving personalization over sensitive data. However, edge training has historically been limited to relatively small models with simple architectures because training is both memory and energy intensive. We present POET, an algorithm to enable training large neural networks on memory-scarce battery-operated edge devices…

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: Proceedings of the 39th International Conference on Machine Learning 2022 (ICML 2022)

  16. Using Thermal Ratchet Mechanism to Achieve Net Motility in Magnetic Microswimmers

    Authors: Gouri Patil, Pranay Mandal, Ambarish Ghosh

    Abstract: Thermal ratchets can extract useful work from random fluctuations. This is common in the molecular scale, such as motor proteins, and has also been used to achieve directional transport in microfluidic devices. In this work, we use the ratchet principle to induce net motility in an externally powered magnetic colloid, which otherwise shows reciprocal (back and forth) motion. The experimental syste…

    Submitted 10 May, 2022; originally announced May 2022.

    Journal ref: Phys. Rev. Lett 129 (2022) 198002

  17. arXiv:2102.01985  [pdf, other]

    cs.LG cs.AI

    Variance Penalized On-Policy and Off-Policy Actor-Critic

    Authors: Arushi Jain, Gandharv Patil, Ayush Jain, Khimya Khetarpal, Doina Precup

    Abstract: Reinforcement learning algorithms are typically geared towards optimizing the expected return of an agent. However, in many practical applications, low variance in the return is desired to ensure the reliability of an algorithm. In this paper, we propose on-policy and off-policy actor-critic algorithms that optimize a performance criterion involving both mean and variance in the return. Previous w…

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: Accepted to the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21), 2021

  18. arXiv:2012.06547  [pdf, other]

    cs.CV cs.IR

    LayoutGMN: Neural Graph Matching for Structural Layout Similarity

    Authors: Akshay Gadi Patil, Manyi Li, Matthew Fisher, Manolis Savva, Hao Zhang

    Abstract: We present a deep neural network to predict structural similarity between 2D layouts by leveraging Graph Matching Networks (GMN). Our network, coined LayoutGMN, learns the layout metric via neural graph matching, using an attention-based GMN designed under a triplet network setting. To train our network, we utilize weak labels obtained by pixel-wise Intersection-over-Union (IoUs) to define the tri…

    Submitted 5 April, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

  19. arXiv:1911.09204  [pdf, other]

    cs.GR cs.CV

    DR-KFS: A Differentiable Visual Similarity Metric for 3D Shape Reconstruction

    Authors: Jiongchao **, Akshay Gadi Patil, Zhang Xiong, Hao Zhang

    Abstract: We introduce a differential visual similarity metric to train deep neural networks for 3D reconstruction, aimed at improving reconstruction quality. The metric compares two 3D shapes by measuring distances between multi-view images differentiably rendered from the shapes. Importantly, the image-space distance is also differentiable and measures visual similarity, rather than pixel-wise distortion.…

    Submitted 31 March, 2020; v1 submitted 20 November, 2019; originally announced November 2019.

  20. arXiv:1909.00302  [pdf, other]

    cs.CV cs.GR cs.IR

    READ: Recursive Autoencoders for Document Layout Generation

    Authors: Akshay Gadi Patil, Omri Ben-Eliezer, Or Perel, Hadar Averbuch-Elor

    Abstract: Layout is a fundamental component of any graphic design. Creating large varieties of plausible document layouts can be a tedious task, requiring numerous constraints to be satisfied, including local ones relating different semantic elements and global constraints on the general appearance and spacing. In this paper, we present a novel framework, coined READ, for REcursive Autoencoders for Document…

    Submitted 16 April, 2020; v1 submitted 31 August, 2019; originally announced September 2019.

  21. arXiv:1807.09193  [pdf, other]

    cs.GR

    GRAINS: Generative Recursive Autoencoders for INdoor Scenes

    Authors: Manyi Li, Akshay Gadi Patil, Kai Xu, Siddhartha Chaudhuri, Owais Khan, Ariel Shamir, Changhe Tu, Baoquan Chen, Daniel Cohen-Or, Hao Zhang

    Abstract: We present a generative neural network which enables us to generate plausible 3D indoor scenes in large quantities and varieties, easily and highly efficiently. Our key observation is that indoor scene structures are inherently hierarchical. Hence, our network is not convolutional; it is a recursive neural network or RvNN. Using a dataset of annotated scene hierarchies, we train a variational recu…

    Submitted 8 May, 2019; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: 21 pages, 26 figures

  22. arXiv:1707.01697  [pdf]

    cs.AR math.OC

    Pipelined Parallel FFT Architecture

    Authors: Tanaji U. Kamble, B. G. Patil, Rakhee S. Bhojakar

    Abstract: In this paper, an optimized, efficient VLSI architecture of a pipelined Fast Fourier transform (FFT) processor capable of producing the reverse output order sequence is presented. The paper presents a Radix-2 multipath delay architecture for FFT calculation. The implementation of FFT in hardware is very critical because for calculation of FFT number of butterfly operations i.e. number of multipliers requi…

    Submitted 6 July, 2017; originally announced July 2017.

  23. arXiv:1604.01962  [pdf, other]

    cs.CV

    Automatic Content-aware Non-Photorealistic Rendering of Images

    Authors: Akshay Gadi Patil, Shanmuganathan Raman

    Abstract: Non-photorealistic rendering techniques work on image features and often manipulate a set of characteristics such as edges and texture to achieve a desired depiction of the scene. Most computational photography methods decompose an image using edge preserving filters and work on the resulting base and detail layers independently to achieve desired visual effects. We propose a new approach for cont…

    Submitted 19 April, 2016; v1 submitted 7 April, 2016; originally announced April 2016.

    Comments: 6 pages

  24. arXiv:1602.03494  [pdf]

    cs.ET

    Time and Frequency Domain Investigation of Selected Memristor based Analog Circuits

    Authors: G. S. Patil, S. R. Ghatage, P. K. Gaikwad, R. K. Kamat, T. D. Dongale

    Abstract: In this paper, we investigate a few memristor-based analog circuits, namely the phase shift oscillator, integrator, and differentiator, which have been explored extensively using traditional lumped components. We use the LTspice-IV platform to simulate these circuits. The investigation resorts to the nonlinear dopant drift model of memristor and the window function portrayed in the liter…

    Submitted 30 March, 2017; v1 submitted 6 February, 2016; originally announced February 2016.

    Comments: 11 Pages, 9 Figures

    MSC Class: 94C05; 00A72 ACM Class: J.2

  25. arXiv:1204.2062  [pdf]

    cs.CV

    SVD-EBP Algorithm for Iris Pattern Recognition

    Authors: Babasaheb G. Patil, Shaila Subbaraman

    Abstract: This paper proposes a neural network approach based on Error Back Propagation (EBP) for classification of different eye images. To reduce the complexity of the layered neural network, the dimensions of the input vectors are optimized using Singular Value Decomposition (SVD). The main aim of this work is to provide the best method for feature extraction and classification. The details of this combined system na…

    Submitted 10 April, 2012; originally announced April 2012.

    Comments: Dec2011-volume2.Issue 12 (IJACSA)

  26. Variable-temperature scanning optical and force microscope

    Authors: P. S. Fodor, H. Zhu, N. G. Patil, J. Levy

    Abstract: The implementation of a scanning microscope capable of working in confocal, atomic force and apertureless near field configurations is presented. The microscope is designed to operate in the temperature range 4–300 K, using conventional helium flow cryostats. In AFM mode, the distance between the sample and an etched tungsten tip is controlled by a self-sensing piezoelectric tuning fork. The…

    Submitted 6 March, 2004; originally announced March 2004.

    Comments: 24 pages, 9 figures, submitted to the journal "Review of Scientific Instruments"