Showing 1–27 of 27 results for author: Kanade, A

  1. arXiv:2406.11638 [pdf, other]

    cs.AI cs.SE

    MASAI: Modular Architecture for Software-engineering AI Agents

    Authors: Daman Arora, Atharv Sonwane, Nalin Wadhwa, Abhav Mehrotra, Saiteja Utpala, Ramakrishna Bairi, Aditya Kanade, Nagarajan Natarajan

    Abstract: A common method to solve complex problems in software engineering is to divide the problem into multiple sub-problems. Inspired by this, we propose a Modular Architecture for Software-engineering AI (MASAI) agents, where different LLM-powered sub-agents are instantiated with well-defined objectives and strategies tuned to achieve those objectives. Our modular architecture offers several advantage…

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2405.01573 [pdf, other]

    cs.SE cs.AI

    Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

    Authors: Ajinkya Deshpande, Anmol Agarwal, Shashank Shet, Arun Iyer, Aditya Kanade, Ramakrishna Bairi, Suresh Parthasarathy

    Abstract: LLMs have demonstrated significant potential in code generation tasks, achieving promising results at the function or statement level across various benchmarks. However, the complexities associated with creating code artifacts like classes, particularly within the context of real-world software repositories, remain underexplored. Prior research treats class-level generation as an isolated task, ne…

    Submitted 5 June, 2024; v1 submitted 21 April, 2024; originally announced May 2024.

    Comments: Preprint with additional experiments

  3. arXiv:2401.15963 [pdf, other]

    cs.SE cs.AI cs.CL cs.LG

    NoFunEval: Funny How Code LMs Falter on Requirements Beyond Functional Correctness

    Authors: Manav Singhal, Tushar Aggarwal, Abhijeet Awasthi, Nagarajan Natarajan, Aditya Kanade

    Abstract: Existing evaluation benchmarks of language models of code (code LMs) focus almost exclusively on whether the LMs can generate functionally-correct code. In real-world software engineering, developers think beyond functional correctness. They have requirements on "how" a functionality should be implemented to meet overall system design objectives like efficiency, security, and maintainability. They…

    Submitted 2 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Preprint

  4. arXiv:2309.12938 [pdf, other]

    cs.AI cs.SE

    Frustrated with Code Quality Issues? LLMs can Help!

    Authors: Nalin Wadhwa, Jui Pradhan, Atharv Sonwane, Surya Prakash Sahu, Nagarajan Natarajan, Aditya Kanade, Suresh Parthasarathy, Sriram Rajamani

    Abstract: As software projects progress, quality of code assumes paramount importance as it affects reliability, maintainability and security of software. For this reason, static analysis tools are used in developer workflows to flag code quality issues. However, developers need to spend extra effort to revise their code to improve code quality based on the tool findings. In this work, we investigate the u…

    Submitted 22 September, 2023; originally announced September 2023.

  5. arXiv:2309.12499 [pdf, other]

    cs.SE

    CodePlan: Repository-level Coding using LLMs and Planning

    Authors: Ramakrishna Bairi, Atharv Sonwane, Aditya Kanade, Vageesh D C, Arun Iyer, Suresh Parthasarathy, Sriram Rajamani, B. Ashok, Shashank Shet

    Abstract: Software engineering activities such as package migration, fixing error reports from static analysis or testing, and adding type annotations or other specifications to a codebase, involve pervasively editing the entire repository of code. We formulate these activities as repository-level coding tasks. Recent tools like GitHub Copilot, which are powered by Large Language Models (LLMs), have succ…

    Submitted 21 September, 2023; originally announced September 2023.

  6. arXiv:2307.12465 [pdf, other]

    cs.SE

    StaticFixer: From Static Analysis to Static Repair

    Authors: Naman Jain, Shubham Gandhi, Atharv Sonwane, Aditya Kanade, Nagarajan Natarajan, Suresh Parthasarathy, Sriram Rajamani, Rahul Sharma

    Abstract: Static analysis tools are traditionally used to detect and flag programs that violate properties. We show that static analysis tools can also be used to perturb programs that satisfy a property to construct variants that violate the property. Using this insight we can construct paired data sets of unsafe-safe program pairs, and learn strategies to automatically repair property violations. We prese…

    Submitted 23 July, 2023; originally announced July 2023.

  7. arXiv:2306.10763 [pdf, other]

    cs.CL cs.AI cs.LG cs.PL cs.SE

    Guiding Language Models of Code with Global Context using Monitors

    Authors: Lakshya A Agrawal, Aditya Kanade, Navin Goyal, Shuvendu K. Lahiri, Sriram K. Rajamani

    Abstract: Language models of code (LMs) work well when the surrounding code provides sufficient context. This is not true when it becomes necessary to use types, functionality or APIs defined elsewhere in the repository or a linked library, especially those not seen during training. LMs suffer from limited awareness of such global context and end up hallucinating. Integrated development environments (IDEs…

    Submitted 3 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023 and to appear as "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context" at https://neurips.cc/virtual/2023/poster/70362 . Contents: 11 pages, 15 additional pages of appendix, 13 figures, 3 tables

    ACM Class: I.2.2; I.2.7; I.2.5

  8. arXiv:2305.14129 [pdf, other]

    cs.SE cs.LG

    GrACE: Generation using Associated Code Edits

    Authors: Priyanshu Gupta, Avishree Khare, Yasharth Bajpai, Saikat Chakraborty, Sumit Gulwani, Aditya Kanade, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

    Abstract: Developers expend a significant amount of time in editing code for a variety of reasons such as bug fixing or adding new features. Designing effective methods to predict code edits has been an active yet challenging area of research due to the diversity of code edits and the difficulty of capturing the developer intent. In this work, we address these challenges by endowing pre-trained large langua…

    Submitted 20 September, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  9. arXiv:2301.13779 [pdf, other]

    cs.PL cs.AI cs.SE

    FLAME: A small language model for spreadsheet formulas

    Authors: Harshit Joshi, Abishai Ebenezer, José Cambronero, Sumit Gulwani, Aditya Kanade, Vu Le, Ivan Radiček, Gust Verbruggen

    Abstract: Spreadsheets are a vital tool for end-user data management. Using large language models for formula authoring assistance in these environments can be difficult, as these models are expensive to train and challenging to deploy due to their size (up to billions of parameters). We present FLAME, a transformer-based model trained exclusively on Excel formulas that leverages domain insights to achieve…

    Submitted 19 December, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI 2024

  10. arXiv:2212.08170 [pdf, other]

    cs.AI cs.LG cs.LO cs.SC

    BNSynth: Bounded Boolean Functional Synthesis

    Authors: Ravi Raja, Stanly Samuel, Chiranjib Bhattacharyya, Deepak D'Souza, Aditya Kanade

    Abstract: The automated synthesis of correct-by-construction Boolean functions from logical specifications is known as the Boolean Functional Synthesis (BFS) problem. BFS has many application areas that range from software engineering to circuit design. In this paper, we introduce a tool, BNSynth, that is the first to solve the BFS problem under a given bound on the solution space. Bounding the solution spac…

    Submitted 15 December, 2022; originally announced December 2022.

    ACM Class: I.2.2; I.2.6; B.6.0

  11. arXiv:2209.08516 [pdf, other]

    cs.CV cs.LG

    VisTaNet: Attention Guided Deep Fusion for Surface Roughness Classification

    Authors: Prasanna Kumar Routray, Aditya Sanjiv Kanade, Jay Bhanushali, Manivannan Muniyandi

    Abstract: Human texture perception is a weighted average of multi-sensory inputs: visual and tactile. While the visual sensing mechanism extracts global features, the tactile mechanism complements it by extracting local features. The lack of coupled visuotactile datasets in the literature is a challenge for studying multimodal fusion strategies analogous to human texture perception. This paper presents a vi…

    Submitted 18 September, 2022; originally announced September 2022.

  12. arXiv:2209.08372 [pdf, other]

    cs.SE cs.CL

    CodeQueries: A Dataset of Semantic Queries over Code

    Authors: Surya Prakash Sahu, Madhurima Mandal, Shikhar Bharadwaj, Aditya Kanade, Petros Maniatis, Shirish Shevade

    Abstract: Developers often have questions about semantic aspects of code they are working on, e.g., "Is there a class whose parent classes declare a conflicting attribute?". Answering them requires understanding code semantics such as attributes and inheritance relation of classes. An answer to such a question should identify code spans constituting the answer (e.g., the declaration of the subclass) as well…

    Submitted 14 July, 2023; v1 submitted 17 September, 2022; originally announced September 2022.

  13. arXiv:2209.03750 [pdf, other]

    eess.SP cs.CV cs.LG

    Towards Multidimensional Textural Perception and Classification Through Whisker

    Authors: Prasanna Kumar Routray, Aditya Sanjiv Kanade, Pauline Pounds, Manivannan Muniyandi

    Abstract: Texture-based studies and designs have been in focus recently. Whisker-based multidimensional surface texture data is missing in the literature. This data is critical for robotics and machine perception algorithms in the classification and regression of textural surfaces. In this study, we present a novel sensor design to acquire multidimensional texture information. The surface texture's roughnes…

    Submitted 1 September, 2022; originally announced September 2022.

  14. arXiv:2204.07840 [pdf, other]

    cs.CV cs.AI

    A Robust and Scalable Attention Guided Deep Learning Framework for Movement Quality Assessment

    Authors: Aditya Kanade, Mansi Sharma, Manivannan Muniyandi

    Abstract: Physical rehabilitation programs frequently begin with a brief stay in the hospital and continue with home-based rehabilitation. Lack of feedback on exercise correctness is a significant issue in home-based rehabilitation. Automated movement quality assessment (MQA) using skeletal movement data (hereafter referred to as skeletal data) collected via depth imaging devices can assist with home-based…

    Submitted 16 April, 2022; originally announced April 2022.

  15. arXiv:2112.03168 [pdf, other]

    cs.AI

    Tele-EvalNet: A Low-cost, Teleconsultation System for Home based Rehabilitation of Stroke Survivors using Multiscale CNN-LSTM Architecture

    Authors: Aditya Kanade, Mansi Sharma, M. Manivannan

    Abstract: Technology has an important role to play in the field of Rehabilitation, improving patient outcomes and reducing healthcare costs. However, existing approaches lack clinical validation, robustness and ease of use. We propose Tele-EvalNet, a novel system consisting of two components: a live feedback model and an overall performance evaluation model. The live feedback model demonstrates feedback on…

    Submitted 6 December, 2021; originally announced December 2021.

  16. arXiv:2107.05166 [pdf, other]

    cs.LG cs.AI cs.CR stat.ML

    Stateful Detection of Model Extraction Attacks

    Authors: Soham Pal, Yash Gupta, Aditya Kanade, Shirish Shevade

    Abstract: Machine-Learning-as-a-Service providers expose machine learning (ML) models through application programming interfaces (APIs) to developers. Recent work has shown that attackers can exploit these APIs to extract good approximations of such ML models, by querying them with samples of their choosing. We propose VarDetect, a stateful monitor that tracks the distribution of queries made by users of su…

    Submitted 11 July, 2021; originally announced July 2021.

  17. arXiv:2001.00059 [pdf, other]

    cs.SE cs.CL cs.LG cs.PL

    Learning and Evaluating Contextual Embedding of Source Code

    Authors: Aditya Kanade, Petros Maniatis, Gogul Balakrishnan, Kensen Shi

    Abstract: Recent research has achieved impressive results on understanding and improving source code by building upon machine-learning techniques developed for natural languages. A significant advancement in natural-language understanding has come with the development of pre-trained contextual embeddings, such as BERT, which can be fine-tuned for downstream tasks with less labeled data and training budget,…

    Submitted 17 August, 2020; v1 submitted 21 December, 2019; originally announced January 2020.

    Comments: Published in ICML 2020. This version (v.3) is the final camera-ready version of the paper. It contains the re-computed results, based on the open-sourced datasets

  18. arXiv:1905.12454 [pdf, other]

    cs.SE cs.LG

    Deep Learning for Bug-Localization in Student Programs

    Authors: Rahul Gupta, Aditya Kanade, Shirish Shevade

    Abstract: Providing feedback is an integral part of teaching. Most open online courses on programming make use of automated grading systems to support programming assignments and give real-time feedback. These systems usually rely on test results to quantify the programs' functional correctness. They return failing tests to the students as feedback. However, students may find it difficult to debug their pro…

    Submitted 28 May, 2019; originally announced May 2019.

    ACM Class: I.2.5; D.2.5

  19. arXiv:1905.09165 [pdf, other]

    cs.LG cs.AI cs.CR stat.ML

    A framework for the extraction of Deep Neural Networks by leveraging public data

    Authors: Soham Pal, Yash Gupta, Aditya Shukla, Aditya Kanade, Shirish Shevade, Vinod Ganapathy

    Abstract: Machine learning models trained on confidential datasets are increasingly being deployed for profit. Machine Learning as a Service (MLaaS) has made such models easily accessible to end-users. Prior work has developed model extraction attacks, in which an adversary extracts an approximation of MLaaS models by making black-box queries to it. However, none of these works is able to satisfy all the th…

    Submitted 22 May, 2019; originally announced May 2019.

  20. arXiv:1904.01720 [pdf, other]

    cs.LG stat.ML

    Neural Program Repair by Jointly Learning to Localize and Repair

    Authors: Marko Vasic, Aditya Kanade, Petros Maniatis, David Bieber, Rishabh Singh

    Abstract: Due to its potential to improve programmer productivity and software quality, automated program repair has been an active topic of research. Newer techniques harness neural networks to learn directly from examples of buggy programs and their fixes. In this work, we consider a recently identified class of bugs called variable-misuse bugs. The state-of-the-art solution for variable misuse enumerates…

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: ICLR 2019

  21. arXiv:1806.03806 [pdf, other]

    cs.AI cs.PL

    Greybox fuzzing as a contextual bandits problem

    Authors: Ketan Patil, Aditya Kanade

    Abstract: Greybox fuzzing is one of the most useful and effective techniques for bug detection in large-scale application programs. It uses a minimal amount of instrumentation. American Fuzzy Lop (AFL) is a popular coverage-based evolutionary greybox fuzzing tool. AFL performs extremely well in fuzz testing large applications and finding critical vulnerabilities, but AFL involves a lot of heuristics while…

    Submitted 11 June, 2018; originally announced June 2018.

  22. arXiv:1804.05655 [pdf, ps, other]

    cs.SE cs.AI cs.PL

    Active Learning for Efficient Testing of Student Programs

    Authors: Ishan Rastogi, Aditya Kanade, Shirish Shevade

    Abstract: In this work, we propose an automated method to identify semantic bugs in student programs, called ATAS, which builds upon the recent advances in both symbolic execution and active learning. Symbolic execution is a program analysis technique which can generate test cases through symbolic constraint solving. Our method makes use of a reference implementation of the task as its sole input. We compar…

    Submitted 13 April, 2018; originally announced April 2018.

    Comments: 14 pages, 6 tables, 2 figures

  23. arXiv:1801.10467 [pdf, other]

    cs.AI cs.PL cs.SE

    Deep Reinforcement Learning for Programming Language Correction

    Authors: Rahul Gupta, Aditya Kanade, Shirish Shevade

    Abstract: Novice programmers often struggle with the formal syntax of programming languages. To assist them, we design a novel programming language correction framework amenable to reinforcement learning. The framework allows an agent to mimic human actions for text navigation and editing. We demonstrate that the agent can be trained through self-exploration directly from the raw input, that is, program tex…

    Submitted 31 January, 2018; originally announced January 2018.

  24. arXiv:1603.04584 [pdf, ps, other]

    cs.SE

    Semi-Supervised Verified Feedback Generation

    Authors: Shalini Kaleeswaran, Anirudh Santhiar, Aditya Kanade, Sumit Gulwani

    Abstract: Students have enthusiastically taken to online programming lessons and contests. Unfortunately, they tend to struggle due to lack of personalized feedback when they make mistakes. The overwhelming number of submissions precludes manual evaluation. There is an urgent need for program analysis and repair techniques capable of handling both the scale and variations in student submissions, while ensuri…

    Submitted 15 March, 2016; originally announced March 2016.

  25. arXiv:1511.03213 [pdf, other]

    cs.PL

    A Partial Order Reduction Technique for Event-driven Multi-threaded Programs

    Authors: Pallavi Maiya, Rahul Gupta, Aditya Kanade, Rupak Majumdar

    Abstract: Event-driven multi-threaded programming is fast becoming a preferred style of developing efficient and responsive applications. In this concurrency model, multiple threads execute concurrently, communicating through shared objects as well as by posting asynchronous events that are executed in their order of arrival. In this work, we consider partial order reduction (POR) for event-driven multi-thr…

    Submitted 16 October, 2017; v1 submitted 10 November, 2015; originally announced November 2015.

    Comments: 35 pages, 20 figures, 2 tables

  26. arXiv:1306.1286 [pdf, other]

    cs.SE

    MintHint: Automated Synthesis of Repair Hints

    Authors: Shalini Kaleeswaran, Varun Tulsian, Aditya Kanade, Alessandro Orso

    Abstract: Being able to automatically repair programs is an extremely challenging task. In this paper, we present MintHint, a novel technique for program repair that is a departure from most of today's approaches. Instead of trying to fully automate program repair, which is often an unachievable goal, MintHint performs statistical correlation analysis to identify expressions that are likely to occur in the…

    Submitted 16 September, 2013; v1 submitted 5 June, 2013; originally announced June 2013.

  27. arXiv:1209.5152 [pdf, ps, other]

    cs.LO cs.PL

    A Logic for Correlating Temporal Properties across Program Transformations

    Authors: Aditya Kanade, Amitabha Sanyal, Uday P. Khedker

    Abstract: Program transformations are widely used in synthesis, optimization, and maintenance of software. Correctness of program transformations depends on preservation of some important properties of the input program. By regarding programs as Kripke structures, many interesting properties of programs can be expressed in temporal logics. In temporal logic, a formula is interpreted on a single program. How…

    Submitted 24 September, 2012; originally announced September 2012.