Showing 1–50 of 129 results for author: Guha, A

  1. arXiv:2406.16207  [pdf, other]

    cs.CY

    Thinking beyond Bias: Analyzing Multifaceted Impacts and Implications of AI on Gendered Labour

    Authors: Satyam Mohla, Bishnupriya Bagh, Anupam Guha

    Abstract: Artificial Intelligence, with its multifaceted technologies and integral role in global production, significantly impacts gender dynamics, particularly in gendered labor. This paper emphasizes the need to explore AI's broader impacts on gendered labor beyond its current emphasis on the generation and perpetuation of epistemic biases. We draw attention to how the AI industry as an integral component of…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Under review. An unindexed peer-reviewed working draft was accepted for presentation at IJCAI 2021 Workshop on AI for Social Good organized by Harvard CRCS

  2. arXiv:2405.20179  [pdf, other]

    cs.CL cs.AI cs.RO

    Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning CodeLLMs

    Authors: Zichao Hu, Junyi Jessy Li, Arjun Guha, Joydeep Biswas

    Abstract: Large language models (LLMs) have shown great promise at generating robot programs from natural language given domain-specific robot application programming interfaces (APIs). However, the performance gap between proprietary LLMs and smaller open-weight LLMs remains wide. This raises a question: Can we fine-tune smaller open-weight LLMs for generating domain-specific robot programs to close the pe…

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2404.10009  [pdf, ps, other]

    physics.class-ph quant-ph

    Relating interfacial Rossby wave interaction in shear flows with Feynman's two-state coupled quantum system model for the Josephson junction

    Authors: Eyal Heifetz, Nimrod Bratspiess, Anirban Guha, Leo Maas

    Abstract: Here we show how Feynman's simplified model for the Josephson junction, as a macroscopic two-state coupled quantum system, has a one-to-one correspondence with the stable dynamics of two interfacial Rossby waves in piecewise linear shear flows. The conservation of electric charge and energy of the superconducting electron gas layers become respectively equivalent to the conservation of wave action…

    Submitted 11 April, 2024; originally announced April 2024.

  4. arXiv:2404.01903  [pdf, other]

    cs.CL cs.LG cs.PL

    Activation Steering for Robust Type Prediction in CodeLLMs

    Authors: Francesca Lucchetti, Arjun Guha

    Abstract: Contemporary LLMs pretrained on code are capable of succeeding at a wide variety of programming tasks. However, their performance is very sensitive to syntactic features, such as the names of variables and types, the structure of code, and presence of type hints. We contribute an inference-time technique to make CodeLLMs more robust to syntactic distractors that are semantically irrelevant. Our me…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 16 pages, 7 figures

  5. arXiv:2402.19173  [pdf, other]

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo, et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data…

    Submitted 29 February, 2024; originally announced February 2024.

  6. arXiv:2402.13795  [pdf, other]

    hep-ph astro-ph.HE nucl-th

    Estimating the dark matter halo velocity and surface temperature of some known pulsars due to dark matter capture

    Authors: Debashree Sen, Atanu Guha

    Abstract: Considering four known pulsars J1906+0746, J1933-6211, J2043+1711 and the Vela pulsar, we study the scenario of dark matter (DM) capture in neutron stars (NSs). For the purpose we choose four well-known relativistic mean field models to obtain the radius corresponding to the observed mass of these pulsars and consequently the scattering cross-section of DM with the different particles of the $β$ s…

    Submitted 21 February, 2024; originally announced February 2024.

  7. arXiv:2401.15232  [pdf, other]

    cs.HC

    How Beginning Programmers and Code LLMs (Mis)read Each Other

    Authors: Sydney Nguyen, Hannah McLean Babe, Yangtian Zi, Arjun Guha, Carolyn Jane Anderson, Molly Q Feldman

    Abstract: Generative AI models, specifically large language models (LLMs), have made strides towards the long-standing goal of text-to-code generation. This progress has invited numerous studies of user interaction. However, less is known about the struggles and strategies of non-experts, for whom each step of the text-to-code problem presents challenges: describing their intent in natural language, evaluat…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: Conditionally Accepted to CHI 2024

  8. arXiv:2401.14419  [pdf, other]

    astro-ph.HE hep-ph nucl-th

    Constraining the mass of fermionic dark matter from its feeble interaction with hadronic matter via dark mediators in neutron stars

    Authors: Atanu Guha, Debashree Sen

    Abstract: Considering ten well-known relativistic mean field models, we invoke feeble interaction between hadronic matter and fermionic dark matter (DM) $χ$ via new physics scalar ($φ$) and vector ($ξ$) mediators in neutron star core, thereby forming DM admixed neutron stars (DMANSs). The chosen masses of the DM fermion ($m_χ$) and the mediators ($m_φ$ and $m_ξ$) are consistent with the self-interaction con…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in Phys. Rev. D; 16 Pages, 10 figures

    Journal ref: Phys. Rev. D, Vol. 109, No. 4 (2024)

  9. arXiv:2401.07750  [pdf, other]

    hep-ph astro-ph.HE

    Constraints on cosmic-ray boosted dark matter with realistic cross section

    Authors: Atanu Guha, Jong-Chul Park

    Abstract: Sub-MeV cold dark-matter particles are unable to produce electronic recoil in conventional dark-matter direct detection experiments such as XENONnT and LUX-ZEPLIN above the detector threshold. The mechanism of boosted dark matter comes into picture to constrain the parameter space of such low mass dark matter from direct detection experiments. We consider the effect of the leading components of co…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 27 pages, 10 figures, 3 appendices

  10. arXiv:2312.12450  [pdf, other]

    cs.SE cs.AI cs.LG cs.PL

    Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions

    Authors: Federico Cassano, Luisa Li, Akul Sethi, Noah Shinn, Abby Brennan-Jones, Jacob Ginesin, Edward Berman, George Chakhnashvili, Anton Lozhkov, Carolyn Jane Anderson, Arjun Guha

    Abstract: A significant amount of research is focused on developing and evaluating large language models for a variety of code synthesis tasks. These include synthesizing code from natural language, synthesizing tests from code, and synthesizing explanations of code. In contrast, the behavior of instructional code editing with LLMs is understudied. These are tasks in which the model is provided a block of c…

    Submitted 19 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  11. Deploying and Evaluating LLMs to Program Service Mobile Robots

    Authors: Zichao Hu, Francesca Lucchetti, Claire Schlesinger, Yash Saxena, Anders Freeman, Sadanand Modak, Arjun Guha, Joydeep Biswas

    Abstract: Recent advancements in large language models (LLMs) have spurred interest in using them for generating robot programs from natural language, with promising initial results. We investigate the use of LLMs to generate programs for service mobile robots leveraging mobility, perception, and human interaction skills, and where accurate sequencing and ordering of actions is crucial for success. We contr…

    Submitted 21 February, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: 8 pages, Accepted at IEEE Robotics and Automation Letters (RA-L)

    Journal ref: IEEE Robotics and Automation Letters, vol. 9, no. 3, pp. 2853-2860, March 2024

  12. arXiv:2309.14054  [pdf, other]

    cs.LG cs.AI cs.CV

    Adapt then Unlearn: Exploiting Parameter Space Semantics for Unlearning in Generative Adversarial Networks

    Authors: Piyush Tiwary, Atri Guha, Subhodip Panda, Prathosh A. P

    Abstract: The increased attention to regulating the outputs of deep generative models, driven by growing concerns about privacy and regulatory compliance, has highlighted the need for effective control over these models. This necessity arises from instances where generative models produce outputs containing undesirable, offensive, or potentially harmful content. To tackle this challenge, the concept of mach…

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 15 pages, 12 figures

  13. arXiv:2308.12545  [pdf, other]

    cs.SE

    npm-follower: A Complete Dataset Tracking the NPM Ecosystem

    Authors: Donald Pinckney, Federico Cassano, Arjun Guha, Jonathan Bell

    Abstract: Software developers typically rely upon a large network of dependencies to build their applications. For instance, the NPM package repository contains over 3 million packages and serves tens of billions of downloads weekly. Understanding the structure and nature of packages, dependencies, and published code requires datasets that provide researchers with easy access to metadata and code of package…

    Submitted 24 August, 2023; originally announced August 2023.

  14. arXiv:2308.09895  [pdf, other]

    cs.PL cs.LG

    Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs

    Authors: Federico Cassano, John Gouwar, Francesca Lucchetti, Claire Schlesinger, Anders Freeman, Carolyn Jane Anderson, Molly Q Feldman, Michael Greenberg, Abhinav Jangda, Arjun Guha

    Abstract: Over the past few years, Large Language Models of Code (Code LLMs) have started to have a significant impact on programming practice. Code LLMs are also emerging as building blocks for research in programming languages and software engineering. However, Code LLMs produce impressive results on programming languages that are well represented in their training data (e.g., Java, Python, or JavaScript)…

    Submitted 10 February, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

  15. arXiv:2308.08347  [pdf, ps, other]

    cs.PL

    Continuing WebAssembly with Effect Handlers

    Authors: Luna Phipps-Costin, Andreas Rossberg, Arjun Guha, Daan Leijen, Daniel Hillerström, KC Sivaramakrishnan, Matija Pretnar, Sam Lindley

    Abstract: WebAssembly (Wasm) is a low-level portable code format offering near native performance. It is intended as a compilation target for a wide variety of source languages. However, Wasm provides no direct support for non-local control flow features such as async/await, generators/iterators, lightweight threads, first-class continuations, etc. This means that compilers for source languages with such fe…

    Submitted 13 September, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

  16. arXiv:2306.12354  [pdf]

    physics.med-ph cs.HC

    Seat pan angle optimization for vehicle ride comfort using finite element model of human spine

    Authors: Raj Desai, Ankit Vekaria, Anirban Guha, P. Seshu

    Abstract: Ride comfort of the driver/occupant of a vehicle has usually been analyzed by multibody biodynamic models of human beings. Accurate modeling of critical segments of the human body, e.g. the spine, requires these models to have a very high number of segments. The resultant increase in degrees of freedom makes these models difficult to analyze and not able to provide certain details such as seat pres…

    Submitted 21 June, 2023; originally announced June 2023.

  17. arXiv:2306.04556  [pdf, other]

    cs.LG cs.HC cs.SE

    StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code

    Authors: Hannah McLean Babe, Sydney Nguyen, Yangtian Zi, Arjun Guha, Molly Q Feldman, Carolyn Jane Anderson

    Abstract: Code LLMs are being rapidly deployed and there is evidence that they can make professional programmers more productive. Current benchmarks for code generation measure whether models generate correct programs given an expert prompt. In this paper, we present a new benchmark containing multiple prompts per problem, written by a specific population of non-expert prompters: beginning programmers. Stud…

    Submitted 7 June, 2023; originally announced June 2023.

  18. arXiv:2305.17145  [pdf, other]

    cs.SE cs.LG cs.PL

    Type Prediction With Program Decomposition and Fill-in-the-Type Training

    Authors: Federico Cassano, Ming-Ho Yee, Noah Shinn, Arjun Guha, Steven Holtzen

    Abstract: TypeScript and Python are two programming languages that support optional type annotations, which are useful but tedious to introduce and maintain. This has motivated automated type prediction: given an untyped program, produce a well-typed output program. Large language models (LLMs) are promising for type prediction, but there are challenges: fill-in-the-middle performs poorly, programs may not…

    Submitted 25 May, 2023; originally announced May 2023.

  19. arXiv:2305.06161  [pdf, other]

    cs.CL cs.AI cs.PL cs.SE

    StarCoder: may the source be with you!

    Authors: Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, et al. (42 additional authors not shown)

    Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle…

    Submitted 13 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  20. arXiv:2304.07301  [pdf, other]

    gr-qc hep-ph

    Inflation and the late time acceleration from Hossenfelder-Verlinde gravity

    Authors: Youngsub Yoon, Atanu Guha

    Abstract: We show that Hossenfelder's covariant formulation of Verlinde's emergent gravity predicts inflation and the late-time acceleration at the same time, without assuming a separate field such as inflaton, whose sole purpose is producing inflation. In particular, for the current deceleration parameter $q=-0.95$ to $-0.55$, we obtained $λ^2$, the mass of the imposter field, from $1.85\times 10^4$ to…

    Submitted 23 May, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Previous numerical mistakes fixed. Simulations for four different values of q. Connection with the fine structure constant suggested

  21. arXiv:2304.01651  [pdf, other]

    cs.CY cs.HC

    Socio-economic landscape of digital transformation & public NLP systems: A critical review

    Authors: Satyam Mohla, Anupam Guha

    Abstract: The current wave of digital transformation has spurred digitisation reforms and has led to prodigious development of AI & NLP systems, with several of them entering the public domain. There is a perception that these systems have a non-trivial impact on society, but there is a dearth of literature in critical AI exploring what kinds of systems exist and how they operate. This paper constructs a…

    Submitted 27 August, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Under review

  22. arXiv:2304.00394  [pdf, other]

    cs.SE

    A Large Scale Analysis of Semantic Versioning in NPM

    Authors: Donald Pinckney, Federico Cassano, Arjun Guha, Jonathan Bell

    Abstract: The NPM package repository contains over two million packages and serves tens of billions of downloads per week. Nearly every single JavaScript application uses the NPM package manager to install packages from the NPM repository. NPM relies on a "semantic versioning" ('semver') scheme to maintain a healthy ecosystem, where bug-fixes are reliably delivered to downstream packages as quickly as possi…

    Submitted 1 April, 2023; originally announced April 2023.

  23. arXiv:2303.14789  [pdf, other]

    physics.flu-dyn

    5 wave interactions in internal gravity waves

    Authors: Saranraj Gururaj, Anirban Guha

    Abstract: We use multiple-scale analysis to study a 5-wave system (5WS) composed of two different internal gravity wave triads. Each of these triads consists of a parent wave and two daughter waves, with one daughter wave common between the two triads. The parent waves are assumed to have the same frequency and wavevector norm co-existing in a region of constant background stratification. Such 5-wave system…

    Submitted 26 March, 2023; originally announced March 2023.

  24. arXiv:2303.07464  [pdf, other]

    physics.flu-dyn math-ph math.DS

    Understanding Stokes drift mechanism via crest and trough phase estimates

    Authors: Anirban Guha, Akanksha Gupta

    Abstract: By providing mathematical estimates, this paper answers a fundamental question -- "what leads to Stokes drift"? Although overwhelmingly understood for water waves, Stokes drift is a generic mechanism that stems from kinematics and occurs in any non-transverse wave in fluids. To showcase its generality, we undertake a comparative study of the pathline equation of sound (1D) and intermediate-depth w…

    Submitted 24 February, 2024; v1 submitted 13 March, 2023; originally announced March 2023.

  25. Do Machine Learning Models Produce TypeScript Types That Type Check?

    Authors: Ming-Ho Yee, Arjun Guha

    Abstract: Type migration is the process of adding types to untyped code to gain assurance at compile time. TypeScript and other gradual type systems facilitate type migration by allowing programmers to start with imprecise types and gradually strengthen them. However, adding types is a manual effort and several migrations on large, industry codebases have been reported to have taken several years. In the re…

    Submitted 11 July, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Published at the 37th European Conference on Object-Oriented Programming (ECOOP 2023)

  26. arXiv:2302.02092  [pdf, other]

    cs.LG stat.ML

    Interpolation for Robust Learning: Data Augmentation on Wasserstein Geodesics

    Authors: Jiacheng Zhu, Jielin Qiu, Aritra Guha, Zhuolin Yang, Xuanlong Nguyen, Bo Li, Ding Zhao

    Abstract: We propose to study and promote the robustness of a model as per its performance through the interpolation of training data distributions. Specifically, (1) we augment the data by finding the worst-case Wasserstein barycenter on the geodesic connecting subpopulation distributions of different categories. (2) We regularize the model for smoother performance on the continuous geodesic path connectin…

    Submitted 28 August, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: 34 pages, 3 figures, 18 tables

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:43129-43157, 2023

  27. arXiv:2301.11496  [pdf, other]

    math.ST

    On Excess Mass Behavior in Gaussian Mixture Models with Orlicz-Wasserstein Distances

    Authors: Aritra Guha, Nhat Ho, XuanLong Nguyen

    Abstract: Dirichlet Process mixture models (DPMM) in combination with Gaussian kernels have been an important modeling tool for numerous data domains arising from biological, physical, and social sciences. However, this versatility in applications does not extend to strong theoretical guarantees for the underlying parameter estimates, for which only a logarithmic rate is achieved. In this work, we (re)intro…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 2 figures

  28. arXiv:2301.03988  [pdf, other]

    cs.SE cs.AI cs.LG

    SantaCoder: don't reach for the stars!

    Authors: Loubna Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, Logesh Kumar Umapathi, Carolyn Jane Anderson, Yangtian Zi, Joel Lamy Poirier, Hailey Schoelkopf, Sergey Troshin, Dmitry Abulkhanov, Manuel Romero, Michael Lappert, Francesco De Toni, Bernardo García del Río, Qian Liu, Shamik Bose, Urvashi Bhattacharyya, Terry Yue Zhuo, et al. (16 additional authors not shown)

    Abstract: The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes the progress of the collaboration until December 2022, outlining the current state of the Personally Identifiable Information (PII) redaction pipeline, the experiments conducted to de-risk the model architecture, and the experiments investigat…

    Submitted 24 February, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

  29. arXiv:2211.04568  [pdf, ps, other]

    stat.AP cs.CY cs.LG

    Towards Algorithmic Fairness in Space-Time: Filling in Black Holes

    Authors: Cheryl Flynn, Aritra Guha, Subhabrata Majumdar, Divesh Srivastava, Zhengyi Zhou

    Abstract: New technologies and the availability of geospatial data have drawn attention to spatio-temporal biases present in society. For example: the COVID-19 pandemic highlighted disparities in the availability of broadband service and its role in the digital divide; the environmental justice movement in the United States has raised awareness of health implications for minority populations stemming from h…

    Submitted 8 November, 2022; originally announced November 2022.

  30. arXiv:2209.15577  [pdf, other]

    math.GT

    On knots that divide ribbon knotted surfaces

    Authors: Hans U. Boden, Ceyhun Elmacioglu, Anshul Guha, Homayun Karimi, William Rushworth, Yun-chi Tang, Bryan Wang Peng Jun

    Abstract: We define a knot to be half ribbon if it is the cross-section of a ribbon 2-knot, and observe that ribbon implies half ribbon implies slice. We introduce the half ribbon genus of a knot K, the minimum genus of a ribbon knotted surface of which K is a cross-section. We compute this genus for all prime knots up to 12 crossings, and many 13-crossing knots. The same approach yields new computations of…

    Submitted 2 November, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 13 pages, 1 figure, 1 table. Comments welcome. V2: typographical corrections

    MSC Class: 57K10; 57K45

  31. arXiv:2209.09021  [pdf, other]

    hep-ph astro-ph.CO astro-ph.HE nucl-th

    Vector dark boson mediated feeble interaction between fermionic dark matter and strange quark matter in quark stars

    Authors: Debashree Sen, Atanu Guha

    Abstract: We study the structural properties like the gravitational mass, radius and tidal deformability of dark matter (DM) admixed strange quark stars (SQSs). For the purpose we consider the vector MIT Bag model to describe the strange quark matter (SQM) and investigate the possible presence of accreted DM in the SQSs consequently forming DM admixed SQSs. We introduce feeble interaction between SQM and th…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted for Publication in Monthly Notices of the Royal Astronomical Society

    Journal ref: MNRAS 517, 518-525 (2022)

  32. Bounds on boosted dark matter from direct detection: The role of energy-dependent cross sections

    Authors: Debjyoti Bardhan, Supritha Bhowmick, Diptimoy Ghosh, Atanu Guha, Divya Sachdeva

    Abstract: The recoil threshold of Direct Detection experiments limits the mass range of Dark Matter (DM) particles that can be detected, with most DD experiments being blind to sub-MeV DM particles. However, these light DM particles can be boosted to very high energies via collisions with energetic Cosmic Ray electrons. This allows Dark Matter particles to induce detectable recoil in the target of Direct De…

    Submitted 13 January, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: 11 pages, 3 figures; Title modified

    Journal ref: Phys.Rev.D 107 (2023) 1, 015010

  33. arXiv:2208.08227  [pdf, other]

    cs.LG cs.PL

    MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation

    Authors: Federico Cassano, John Gouwar, Daniel Nguyen, Sydney Nguyen, Luna Phipps-Costin, Donald Pinckney, Ming-Ho Yee, Yangtian Zi, Carolyn Jane Anderson, Molly Q Feldman, Arjun Guha, Michael Greenberg, Abhinav Jangda

    Abstract: Large language models have demonstrated the ability to generate both natural language and programming language text. Such models open up the possibility of multi-language code generation: could code generation models generalize knowledge from one language to another? Although contemporary code generation models can generate semantically correct Python code, little is known about their abilities wi…

    Submitted 19 December, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  34. arXiv:2206.00807  [pdf]

    cs.LG

    Applied Federated Learning: Architectural Design for Robust and Efficient Learning in Privacy Aware Settings

    Authors: Branislav Stojkovic, Jonathan Woodbridge, Zhihan Fang, Jerry Cai, Andrey Petrov, Sathya Iyer, Daoyu Huang, Patrick Yau, Arvind Sastha Kumar, Hitesh Jawa, Anamita Guha

    Abstract: The classical machine learning paradigm requires the aggregation of user data in a central location where machine learning practitioners can preprocess data, calculate features, tune models and evaluate performance. The advantages of this approach include leveraging high performance hardware (such as GPUs) and the ability of machine learning practitioners to do in depth data analysis to improve mo…

    Submitted 7 June, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

  35. arXiv:2203.13737  [pdf, other]

    cs.SE

    Flexible and Optimal Dependency Management via Max-SMT

    Authors: Donald Pinckney, Federico Cassano, Arjun Guha, Jon Bell, Massimiliano Culpo, Todd Gamblin

    Abstract: Package managers such as NPM have become essential for software development. The NPM repository hosts over 2 million packages and serves over 43 billion downloads every week. Unfortunately, the NPM dependency solver has several shortcomings. 1) NPM is greedy and often fails to install the newest versions of dependencies; 2) NPM's algorithm leads to duplicated dependencies and bloated code, which i…

    Submitted 24 August, 2023; v1 submitted 25 March, 2022; originally announced March 2022.

  36. arXiv:2201.12991  [pdf, ps, other]

    cs.LG cs.IT

    Federated Learning with Erroneous Communication Links

    Authors: Mahyar Shirvanimoghaddam, Ayoob Salari, Yifeng Gao, Aradhika Guha

    Abstract: In this paper, we consider the federated learning (FL) problem in the presence of communication errors. We model the link between the devices and the central node (CN) by a packet erasure channel, where the local parameters from devices are either erased or received correctly by CN with probability $ε$ and $1-ε$, respectively. We proved that the FL algorithm in the presence of communication errors…

    Submitted 11 April, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: The paper is accepted for publication in IEEE Communications Letters

  37. arXiv:2201.02422  [pdf, other]

    physics.flu-dyn physics.ao-ph

    A new Lagrangian drift mechanism due to current-bathymetry interactions: applications in coastal cross-shelf transport

    Authors: Akanksha Gupta, Anirban Guha

    Abstract: We show that in free surface flows, a uniform, streamwise current over small-amplitude wavy bottom topography generates cross-stream drift velocity. This drift mechanism, referred to as the current-bathymetry interaction induced drift (CBIID), is specifically understood in the context of a simplified nearshore environment consisting of a uniform alongshore current, onshore propagating surface wave…

    Submitted 20 October, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  38. arXiv:2110.06903  [pdf, other]

    hep-ph astro-ph.CO hep-ex

    EFT analysis of leptophilic dark matter at future electron-positron colliders in the mono-photon and mono-$Z$ channels

    Authors: Saumyen Kundu, Atanu Guha, Prasanta Kumar Das, P. S. Bhupal Dev

    Abstract: We consider the possibility that dark matter (DM) only interacts with the Standard Model leptons, but not quarks at tree level, and analyze the future lepton collider prospects of such leptophilic DM in the monophoton and mono-$Z$ (both leptonic and hadronic) channels. Adopting a model-independent effective field theory framework, we consider all possible dimension-six operators of scalar-pseudosc…

    Submitted 29 December, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: 29 pages, 17 figures, 16 tables, version to appear in Phys. Rev. D

    Journal ref: Phys. Rev. D, 107:015003 (2023)

  39. Exclusion limits on Dark Matter-Neutrino Scattering Cross-section

    Authors: Diptimoy Ghosh, Atanu Guha, Divya Sachdeva

    Abstract: We derive new constraints on the combination of dark matter - electron cross-section ($σ_{χe}$) and dark matter - neutrino cross-section ($σ_{χν}$) utilising the gain in kinetic energy of the dark matter (DM) particles due to scattering with the cosmic ray electrons and the diffuse supernova neutrino background (DSNB). Since the flux of the DSNB neutrinos is comparable to the CR electron flux in the e…

    Submitted 27 May, 2022; v1 submitted 30 September, 2021; originally announced October 2021.

    Journal ref: Phys. Rev. D 105, 103029 (2022)

  40. arXiv:2109.05049  [pdf, other]

    cs.PL

    Solver-based Gradual Type Migration

    Authors: Luna Phipps-Costin, Carolyn Jane Anderson, Michael Greenberg, Arjun Guha

    Abstract: Gradually typed languages allow programmers to mix statically and dynamically typed code, enabling them to incrementally reap the benefits of static typing as they add type annotations to their code. However, this type migration process is typically a manual effort with limited tool support. This paper examines the problem of \emph{automated type migration}: given a dynamic program, infer addition…

    Submitted 10 September, 2021; originally announced September 2021.

  41. arXiv:2107.06407  [pdf]

    physics.bio-ph physics.optics q-bio.BM

    Watching Single Unmodified Enzymes at Work

    Authors: Cuifeng Ying, Edona Karakaci, Esteban Bermudez-Urena, Alessandro Ianiro, Ceri Foster, Saurabh Awasthi, Anirvan Guha, Louise Bryan, Jonathan List, Sandor Balog, Guillermo P. Acuna, Reuven Gordon, Michael Mayer

    Abstract: Many proteins undergo conformational changes during their activity. A full understanding of the function of these proteins can only be obtained if different conformations and transitions between them can be monitored in aqueous solution, with adequate temporal resolution and, ideally, on a single-molecule level. Interrogating conformational dynamics of single proteins remains, however, exquisitely…

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: 20 pages, 4 figures

  42. arXiv:2106.10353  [pdf, other]

    hep-ph astro-ph.CO astro-ph.HE nucl-th

    Feeble DM-SM Interaction via New Scalar and Vector Mediators in Rotating Neutron Stars

    Authors: Atanu Guha, Debashree Sen

    Abstract: We investigate the possible presence of dark matter (DM) in massive and rotating neutron stars (NSs). For the purpose we extend our previous work [1] to introduce a light new physics vector mediator besides a scalar one in order to ensure feeble interaction between fermionic DM and $β$ stable hadronic matter in NSs. The masses of DM fermion, the mediators and the couplings are chosen consistent wi…

    Submitted 29 August, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 20 Pages, 8 figures, Accepted for Publication in JCAP

    Report number: JCAP_070P_0621

    Journal ref: JCAP 09 (2021) 027

  43. arXiv:2106.03198  [pdf, other]

    physics.flu-dyn physics.geo-ph

    Resonant and near-resonant internal wave triads for non-uniform stratifications. Part 2: Vertically bounded domain with mild-slope bathymetry

    Authors: Saranraj Gururaj, Anirban Guha

    Abstract: Weakly nonlinear internal wave-wave interaction is a key mechanism that cascades energy from large to small scales, leading to ocean turbulence and mixing. Oceans typically have a non-uniform density stratification profile; moreover, submarine topography leads to a spatially varying ocean depth ($h$). Under these conditions and assuming mild-slope bathymetry, we employ multiple-scale analysis to d…

    Submitted 3 June, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted in the Journal of Fluid Mechanics

  44. arXiv:2105.06577  [pdf, other]

    cs.LG math.DS math.OC

    Online Algorithms and Policies Using Adaptive and Machine Learning Approaches

    Authors: Anuradha M. Annaswamy, Anubhav Guha, Yingnan Cui, Sunbochen Tang, Peter A. Fisher, Joseph E. Gaudio

    Abstract: This paper considers the problem of real-time control and learning in dynamic systems subjected to parametric uncertainties. We propose a combination of a Reinforcement Learning (RL) based policy in the outer loop suitably chosen to ensure stability and optimality for the nominal dynamics, together with Adaptive Control (AC) in the inner loop so that in real-time AC contracts the closed-loop dynam…

    Submitted 9 June, 2023; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: 38 pages

  45. arXiv:2104.06141  [pdf, other]

    hep-ph astro-ph.CO astro-ph.HE hep-th nucl-th

    Implications of Feebly Interacting Dark Sector on Neutron Star Properties and Constraints from GW170817

    Authors: Debashree Sen, Atanu Guha

    Abstract: We investigate the effect of feeble interaction of dark matter (DM) with hadronic matter on the equation of state (EoS) and structural properties of neutron stars (NSs) in static conditions. For the purpose we adopt the effective chiral model for the hadronic sector and for the first time in the context of possible existence of DM inside NSs, we introduce DM-SM interaction through light new physic…

    Submitted 7 June, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: 12 Pages, 13 figures, Minors typos are corrected in the latest version

    Journal ref: Mon.Not.Roy.Astron.Soc. 504 (2021) 3, 3354-3363

  46. arXiv:2103.16551  [pdf, other]

    eess.SY

    Online Policies for Real-Time Control Using MRAC-RL

    Authors: Anubhav Guha, Anuradha Annaswamy

    Abstract: In this paper, we propose the Model Reference Adaptive Control & Reinforcement Learning (MRAC-RL) approach to developing online policies for systems in which modeling errors occur in real-time. Although reinforcement learning (RL) algorithms have been successfully used to develop control policies for dynamical systems, discrepancies between simulated dynamics and the true target dynamics can cause…

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Submitted to CDC 2021

  47. arXiv:2103.04880  [pdf, other]

    cs.RO cs.PL

    Iterative Program Synthesis for Adaptable Social Navigation

    Authors: Jarrett Holtz, Simon Andrews, Arjun Guha, Joydeep Biswas

    Abstract: Robot social navigation is influenced by human preferences and environment-specific scenarios such as elevators and doors, thus necessitating end-user adaptability. State-of-the-art approaches to social navigation fall into two categories: model-based social constraints and learning-based approaches. While effective, these approaches have fundamental limitations -- model-based approaches require c…

    Submitted 30 August, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: IROS 2021

  48. Predicting post-operative right ventricular failure using video-based deep learning

    Authors: Rohan Shad, Nicolas Quach, Robyn Fong, Patpilai Kasinpila, Cayley Bowles, Miguel Castro, Ashrith Guha, Eddie Suarez, Stefan Jovinge, Sangjin Lee, Theodore Boeve, Myriam Amsallem, Xiu Tang, Francois Haddad, Yasuhiro Shudo, Y. Joseph Woo, Jeffrey Teuteberg, John P. Cunningham, Curt P. Langlotz, William Hiesinger

    Abstract: Non-invasive and cost effective in nature, the echocardiogram allows for a comprehensive assessment of the cardiac musculature and valves. Despite progressive improvements over the decades, the rich temporally resolved data in echocardiography videos remain underutilized. Human reads of echocardiograms reduce the complex patterns of cardiac wall motion, to a small list of measurements of heart fun…

    Submitted 27 February, 2021; originally announced March 2021.

    Comments: 12 pages, 3 figures

    Journal ref: Nat Commun 12, 5192 (2021)

  49. arXiv:2102.07695  [pdf, other]

    stat.ML cs.LG stat.ME

    Scalable nonparametric Bayesian learning for heterogeneous and dynamic velocity fields

    Authors: Sunrit Chakraborty, Aritra Guha, Rayleigh Lei, XuanLong Nguyen

    Abstract: Analysis of heterogeneous patterns in complex spatio-temporal data finds usage across various domains in applied science and engineering, including training autonomous vehicles to navigate in complex traffic scenarios. Motivated by applications arising in the transportation domain, in this paper we develop a model for learning heterogeneous and dynamic patterns of velocity field data. We draw from…

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: 5 tables, 8 figures

  50. arXiv:2102.03895  [pdf, other]

    stat.ML cs.LG stat.AP

    Functional optimal transport: map estimation and domain adaptation for functional data

    Authors: Jiacheng Zhu, Aritra Guha, Dat Do, Mengdi Xu, XuanLong Nguyen, Ding Zhao

    Abstract: We introduce a formulation of optimal transport problem for distributions on function spaces, where the stochastic map between functional domains can be partially represented in terms of an (infinite-dimensional) Hilbert-Schmidt operator mapping a Hilbert space of functions to another. For numerous machine learning tasks, data can be naturally viewed as samples drawn from spaces of functions, such…

    Submitted 28 August, 2023; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: 48 pages, 10 figures, 3 tables