Skip to main content

Showing 1–50 of 160 results for author: Shankar, S

.
  1. arXiv:2406.17910  [pdf

    cs.SE cs.AI

    Transforming Software Development: Evaluating the Efficiency and Challenges of GitHub Copilot in Real-World Projects

    Authors: Ruchika Pandey, Prabhat Singh, Raymond Wei, Shaila Shankar

    Abstract: Generative AI technologies promise to transform the product development lifecycle. This study evaluates the efficiency gains, areas for improvement, and emerging challenges of using GitHub Copilot, an AI-powered coding assistant. We identified 15 software development tasks and assessed Copilot's benefits through real-world projects on large proprietary code bases. Our findings indicate significant… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 13 pages, 8 figures

  2. arXiv:2406.05224  [pdf, other

    cs.NE

    ON-OFF Neuromorphic ISING Machines using Fowler-Nordheim Annealers

    Authors: Zihao Chen, Zhili Xiao, Mahmoud Akl, Johannes Leugring, Omowuyi Olajide, Adil Malik, Nik Dennler, Chad Harper, Subhankar Bose, Hector A. Gonzalez, Jason Eshraghian, Riccardo Pignari, Gianvito Urgese, Andreas G. Andreou, Sadasivan Shankar, Christian Mayr, Gert Cauwenberghs, Shantanu Chakrabartty

    Abstract: We introduce NeuroSA, a neuromorphic architecture specifically designed to ensure asymptotic convergence to the ground state of an Ising problem using an annealing process that is governed by the physics of quantum mechanical tunneling using Fowler-Nordheim (FN). The core component of NeuroSA consists of a pair of asynchronous ON-OFF neurons, which effectively map classical simulated annealing (SA… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 36 pages, 8 figures

  3. arXiv:2405.19297  [pdf, other

    astro-ph.GA astro-ph.HE

    Genuine Retrieval of the AGN Host Stellar Population (GRAHSP)

    Authors: Johannes Buchner, Hattie Starck, Mara Salvato, Hagai Netzer, Zsofi Igo, Brivael Laloux, Antonis Georgakakis, Isabelle Gauger, Anna Olechowska, Nicolas Lopez, Suraj D Shankar, Junyao Li, Kirpal Nandra, Andrea Merloni

    Abstract: The assembly and co-evolution of supermassive black holes (SMBH) and their host galaxy stellar population is a key open questions in galaxy evolution. Stellar mass ($M_\star$) and star formation rate (SFR), are inferred by modeling the spectral energy distribution (SED). For galaxies triggering SMBH activity, the active galactic nucleus (AGN) contaminates the light at all wavelengths, hampering th… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: version resubmitted to A&A after a first positive referee report

  4. arXiv:2405.04674  [pdf, other

    cs.DB

    Towards Accurate and Efficient Document Analytics with Large Language Models

    Authors: Yiming Lin, Madelon Hulsebos, Ruiying Ma, Shreya Shankar, Sepanta Zeigham, Aditya G. Parameswaran, Eugene Wu

    Abstract: Unstructured data formats account for over 80% of the data currently stored, and extracting value from such formats remains a considerable challenge. In particular, current approaches for managing unstructured documents do not support ad-hoc analytical queries on document collections. Moreover, Large Language Models (LLMs) directly applied to the documents themselves, or on portions of documents t… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  5. arXiv:2404.12272  [pdf, other

    cs.HC cs.AI

    Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

    Authors: Shreya Shankar, J. D. Zamfirescu-Pereira, Björn Hartmann, Aditya G. Parameswaran, Ian Arawjo

    Abstract: Due to the cumbersome nature of human evaluation and limitations of code-based evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in evaluating LLM outputs. Yet LLM-generated evaluators simply inherit all the problems of the LLMs they evaluate, requiring further human validation. We present a mixed-initiative approach to ``validate the validators'' -- aligning LL… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 16 pages, 4 figures, 2 tables

  6. arXiv:2404.10547  [pdf, other

    cs.LG

    A/B testing under Interference with Partial Network Information

    Authors: Shiv Shankar, Ritwik Sinha, Yash Chandak, Saayan Mitra, Madalina Fiterau

    Abstract: A/B tests are often required to be conducted on subjects that might have social connections. For e.g., experiments on social media, or medical and social interventions to control the spread of an epidemic. In such settings, the SUTVA assumption for randomized-controlled trials is violated due to network interference, or spill-over effects, as treatments to group A can potentially also affect the c… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: AISTATS 2024

  7. "We Have No Idea How Models will Behave in Production until Production": How Engineers Operationalize Machine Learning

    Authors: Shreya Shankar, Rolando Garcia, Joseph M Hellerstein, Aditya G Parameswaran

    Abstract: Organizations rely on machine learning engineers (MLEs) to deploy models and maintain ML pipelines in production. Due to models' extensive reliance on fresh data, the operationalization of machine learning, or MLOps, requires MLEs to have proficiency in data science and engineering. When considered holistically, the job seems staggering -- how do MLEs do MLOps, and what are their unaddressed chall… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2209.09125

    Journal ref: Proc. ACM Hum.-Comput. Interact. 8, CSCW1, Article 206 (April 2024)

  8. arXiv:2402.15968  [pdf, other

    cs.LG cs.AI

    CoDream: Exchanging dreams instead of models for federated aggregation with heterogeneous models

    Authors: Abhishek Singh, Gauri Gupta, Ritvik Kapila, Yichuan Shi, Alex Dang, Sheshank Shankar, Mohammed Ehab, Ramesh Raskar

    Abstract: Federated Learning (FL) enables collaborative optimization of machine learning models across decentralized data by aggregating model parameters. Our approach extends this concept by aggregating "knowledge" derived from models, instead of model parameters. We present a novel framework called CoDream, where clients collaboratively optimize randomly initialized data using federated optimization in th… ▽ More

    Submitted 27 February, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: 16 pages, 12 figures, 5 tables

  9. arXiv:2402.11085  [pdf, other

    quant-ph cond-mat.mes-hall

    Kerr nonlinearity and parametric amplification with an Al-InAs superconductor-semiconductor Josephson junction

    Authors: Z. Hao, T. Shaw, M. Hatefipour, W. M. Strickland, B. H. Elfeky, D. Langone, J. Shabani, S. Shankar

    Abstract: Nearly quantum limited Josephson parametric amplifiers (JPAs) are essential components in superconducting quantum circuits. However, higher order nonlinearities of the Josephson cosine potential are known to cause gain compression, therefore limiting scalability. In an effort to reduce the fourth order, or Kerr nonlinearity, we realize a parametric amplifier with an Al-InAs superconductor-semicond… ▽ More

    Submitted 22 February, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 7 pages, 5 figures, v2-added a reference

  10. arXiv:2401.03038  [pdf, other

    cs.DB cs.SE

    SPADE: Synthesizing Data Quality Assertions for Large Language Model Pipelines

    Authors: Shreya Shankar, Haotian Li, Parth Asawa, Madelon Hulsebos, Yiming Lin, J. D. Zamfirescu-Pereira, Harrison Chase, Will Fu-Hinthorn, Aditya G. Parameswaran, Eugene Wu

    Abstract: Large language models (LLMs) are being increasingly deployed as part of pipelines that repeatedly process or generate data of some sort. However, a common barrier to deployment are the frequent and often unpredictable errors that plague LLMs. Acknowledging the inevitability of these errors, we propose {\em data quality assertions} to identify when LLMs may be making mistakes. We present SPADE, a m… ▽ More

    Submitted 31 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 17 pages, 6 figures

  11. arXiv:2312.02438  [pdf, other

    cs.LG

    Adaptive Instrument Design for Indirect Experiments

    Authors: Yash Chandak, Shiv Shankar, Vasilis Syrgkanis, Emma Brunskill

    Abstract: Indirect experiments provide a valuable framework for estimating treatment effects in situations where conducting randomized control trials (RCTs) is impractical or unethical. Unlike RCTs, indirect experiments estimate treatment effects by leveraging (conditional) instrumental variables, enabling estimation through encouragement and recommendation rather than strict treatment assignment. However,… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  12. arXiv:2311.14641  [pdf, other

    cs.NE

    Neuromorphic Intermediate Representation: A Unified Instruction Set for Interoperable Brain-Inspired Computing

    Authors: Jens E. Pedersen, Steven Abreu, Matthias Jobst, Gregor Lenz, Vittorio Fra, Felix C. Bauer, Dylan R. Muir, Peng Zhou, Bernhard Vogginger, Kade Heckel, Gianvito Urgese, Sadasivan Shankar, Terrence C. Stewart, Jason K. Eshraghian, Sadique Sheik

    Abstract: Spiking neural networks and neuromorphic hardware platforms that emulate neural dynamics are slowly gaining momentum and entering main-stream usage. Despite a well-established mathematical foundation for neural dynamics, the implementation details vary greatly across different platforms. Correspondingly, there are a plethora of software and hardware implementations with their own unique technology… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: NIR is available at https://github.com/neuromorphs/NIR

  13. arXiv:2310.07516  [pdf

    cs.CY cs.AI

    Energy Estimates Across Layers of Computing: From Devices to Large-Scale Applications in Machine Learning for Natural Language Processing, Scientific Computing, and Cryptocurrency Mining

    Authors: Sadasivan Shankar

    Abstract: Estimates of energy usage in layers of computing from devices to algorithms have been determined and analyzed. Building on the previous analysis [3], energy needed from single devices and systems including three large-scale computing applications such as Artificial Intelligence (AI)/Machine Learning for Natural Language Processing, Scientific Simulations, and Cryptocurrency Mining have been estima… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 6 pages, 5 figures

    ACM Class: C.3; C.4; I.2; J.2

  14. arXiv:2310.07000  [pdf

    cs.LG eess.SP

    CarDS-Plus ECG Platform: Development and Feasibility Evaluation of a Multiplatform Artificial Intelligence Toolkit for Portable and Wearable Device Electrocardiograms

    Authors: Sumukh Vasisht Shankar, Evangelos K Oikonomou, Rohan Khera

    Abstract: In the rapidly evolving landscape of modern healthcare, the integration of wearable & portable technology provides a unique opportunity for personalized health monitoring in the community. Devices like the Apple Watch, FitBit, and AliveCor KardiaMobile have revolutionized the acquisition and processing of intricate health data streams. Amidst the variety of data collected by these gadgets, single-… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  15. arXiv:2310.00903  [pdf, ps, other

    math.CA eess.SY math.OC

    Symmetric Solutions to Symmetric Partial Difference Equations

    Authors: Shiva Shankar

    Abstract: This paper studies systems of linear difference equations on the lattice $\Z^n$ that are invariant under a finite group of symmetries, and shows that there exist solutions to such systems that are also invariant under this group of symmetries.

    Submitted 2 October, 2023; originally announced October 2023.

    MSC Class: 39A14; 93A30

  16. arXiv:2309.11943  [pdf

    physics.app-ph

    Multi-contrast x-ray identification of inhomogeneous materials and their discrimination through deep learning approaches

    Authors: Thomas Partridge, Sukrit S. Shankar, Ian Buchanan, Peter Modregger, Alberto Astolfo, David Bate, Alessandro Olivo

    Abstract: Recent innovations in x-ray technology (namely phase-based and energy-resolved imaging) offer unprecedented opportunities for material discrimination, however they are often used in isolation or in limited combinations. Here we show that the optimized combination of contrast channels (attenuation at three x-ray energies, ultra-small angle scattering at two, standard deviation of refraction) signif… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 23 pages - 13 main text, 10 supplementary 11 figures - 5 main text, 6 supplementary

  17. arXiv:2308.16229  [pdf, other

    quant-ph cond-mat.str-el

    Sequential quantum simulation of spin chains with a single circuit QED device

    Authors: Yuxuan Zhang, Shahin Jahanbani, Ameya Riswadkar, S. Shankar, Andrew C. Potter

    Abstract: Quantum simulation of many-body systems in materials science and chemistry are promising application areas for quantum computers. However, the limited scale and coherence of near-term quantum processors pose a significant obstacle to realizing this potential. Here, we theoretically outline how a single-circuit quantum electrodynamics (cQED) device, consisting of a transmon qubit coupled to a long-… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 9 pages, 4 figures

    Journal ref: Phys. Rev. A 109, 022606 (2024)

  18. arXiv:2308.03854  [pdf, ps, other

    cs.DB cs.AI cs.HC cs.LG

    Revisiting Prompt Engineering via Declarative Crowdsourcing

    Authors: Aditya G. Parameswaran, Shreya Shankar, Parth Asawa, Naman Jain, Yujie Wang

    Abstract: Large language models (LLMs) are incredibly powerful at comprehending and generating data in the form of text, but are brittle and error-prone. There has been an advent of toolkits and recipes centered around so-called prompt engineering-the process of asking an LLM to do something via a series of prompts. However, for LLM-powered data processing workflows, in particular, optimizing for quality, w… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  19. Fully Directional Quantum-limited Phase-Preserving Amplifier

    Authors: Gangqiang Liu, Andrew Lingenfelter, Vidul R. Joshi, Nicholas E. Frattini, Volodymyr V. Sivak, Shyam Shankar, Michel H. Devoret

    Abstract: We present a way to achieve fully directional, quantum-limited phase-preserving amplification in a four-port, four-mode superconducting Josephson circuit by utilizing interference between six parametric processes that couple all four modes. Full directionality, defined as the reverse isolation surpassing forward gain between the matched input and output ports of the amplifier, ensures its robustne… ▽ More

    Submitted 13 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Journal ref: Phys. Rev. Applied 21, 014021 (2024)

  20. arXiv:2303.06094  [pdf, other

    cs.DB

    Moving Fast With Broken Data

    Authors: Shreya Shankar, Labib Fawaz, Karl Gyllstrom, Aditya G. Parameswaran

    Abstract: Machine learning (ML) models in production pipelines are frequently retrained on the latest partitions of large, continually-growing datasets. Due to engineering bugs, partitions in such datasets almost always have some corrupted features; thus, it's critical to detect data issues and block retraining before downstream ML model accuracy decreases. However, it's difficult to identify when a partiti… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: 14 pages, 4 figures

  21. arXiv:2303.02492  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Kondo effect in twisted bilayer graphene

    Authors: A. S. Shankar, D. O. Oriekhov, Andrew K. Mitchell, L. Fritz

    Abstract: The emergence of flat bands in twisted bilayer graphene at the magic angle can be understood in terms of a vanishing Fermi velocity of the Dirac cone. This is associated with van Hove singularities approaching the Fermi energy and becoming higher-order. In the density of states this is reflected by flanking logarithmic van Hove divergences pinching off the central Dirac cone in energy space. The l… ▽ More

    Submitted 2 June, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: 16 pages, 8 figures

    Journal ref: Phys. Rev. B 107,245102 (2023)

  22. arXiv:2303.02374  [pdf, other

    cs.SI

    Social Media COVID-19 Contact Tracing Using Mobile Social Payments and Facebook Data

    Authors: Shrivu Shankar, Dhiraj Murthy, Hassan Dashtian

    Abstract: Many in the US were reluctant to report their COVID-19 cases at the height of the pandemic (e.g., for fear of missing work or other obligations due to quarantine mandates). Other methods such as using public social media data can therefore help augment current approaches to surveilling pandemics. This study evaluated the effectiveness of using social media data as a data source for tracking public… ▽ More

    Submitted 4 March, 2023; originally announced March 2023.

  23. arXiv:2302.08876  [pdf, other

    cond-mat.str-el hep-th quant-ph

    Lyapunov exponents in a Sachdev-Ye-Kitaev-type model with population imbalance in the conformal limit and beyond

    Authors: A. S. Shankar, M. Fremling, S. Plugge, L. Fritz

    Abstract: The Sachdev-Ye-Kitaev (SYK) model shows chaotic behavior with a maximal Lyapunov exponent. In this paper, we investigate the four-point function of a SYK-type model numerically, which gives us access to its Lyapunov exponent. The model consists of two sets of Majorana fermions, called A and B, and the interactions are restricted to being exclusively pairwise between the two sets, not within the se… ▽ More

    Submitted 13 October, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: 12 pages, 8 figures. Comments welcome

    Journal ref: Phys. Rev. D 108, 094039, (2023)

  24. arXiv:2302.03161  [pdf, other

    cs.LG

    Optimization using Parallel Gradient Evaluations on Multiple Parameters

    Authors: Yash Chandak, Shiv Shankar, Venkata Gandikota, Philip S. Thomas, Arya Mazumdar

    Abstract: We propose a first-order method for convex optimization, where instead of being restricted to the gradient from a single parameter, gradients from multiple parameters can be used during each step of gradient descent. This setup is particularly useful when a few processors are available that can be used in parallel for optimization. Our method uses gradients from multiple parameters in synergy to u… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted at OPT workshop @ Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  25. arXiv:2301.10330  [pdf, other

    cs.LG cs.AI

    Off-Policy Evaluation for Action-Dependent Non-Stationary Environments

    Authors: Yash Chandak, Shiv Shankar, Nathaniel D. Bastian, Bruno Castro da Silva, Emma Brunskil, Philip S. Thomas

    Abstract: Methods for sequential decision-making are often built upon a foundational assumption that the underlying decision process is stationary. This limits the application of such methods because real-world problems are often subject to changes due to external factors (passive non-stationarity), changes induced by interactions with the system itself (active non-stationarity), or both (hybrid non-station… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: Accepted at Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  26. arXiv:2212.00666  [pdf, other

    cond-mat.soft cond-mat.mtrl-sci physics.app-ph

    Design rules for controlling active topological defects

    Authors: Suraj Shankar, Luca V. D. Scharrer, Mark J. Bowick, M. Cristina Marchetti

    Abstract: Topological defects play a central role in the physics of many materials, including magnets, superconductors and liquid crystals. In active fluids, defects become autonomous particles that spontaneously propel from internal active stresses and drive chaotic flows stirring the fluid. The intimate connection between defect textures and active flow suggests that properties of active materials can be… ▽ More

    Submitted 2 May, 2024; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: 11 pages (including Methods), 5 figures. Changed title and format, final version

    Report number: 2212.00666

    Journal ref: Proc. Nat. Acad. Sci. 121 (21) e2400933121 (2024)

  27. arXiv:2211.11649  [pdf, other

    cs.LG cs.AI

    Implicit Training of Energy Model for Structure Prediction

    Authors: Shiv Shankar, Vihari Piratla

    Abstract: Most deep learning research has focused on develo** new model and training procedures. On the other hand the training objective has usually been restricted to combinations of standard losses. When the objective aligns well with the evaluation metric, this is not a major issue. However when dealing with complex structured outputs, the ideal objective can be hard to optimize and the efficacy of us… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: AAAI

  28. arXiv:2211.03758  [pdf, other

    stat.ME cs.AI cs.HC

    Privacy Aware Experiments without Cookies

    Authors: Shiv Shankar, Ritwik Sinha, Saayan Mitra, Viswanathan Swaminathan, Sridhar Mahadevan, Moumita Sinha

    Abstract: Consider two brands that want to jointly test alternate web experiences for their customers with an A/B test. Such collaborative tests are today enabled using \textit{third-party cookies}, where each brand has information on the identity of visitors to another website. With the imminent elimination of third-party cookies, such A/B tests will become untenable. We propose a two-stage experimental de… ▽ More

    Submitted 6 February, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: Technical report supplementing paper accepted to WSDM 23

  29. arXiv:2210.17509  [pdf, other

    astro-ph.IM gr-qc

    GRaM-X: A new GPU-accelerated dynamical spacetime GRMHD code for Exascale computing with the Einstein Toolkit

    Authors: Swapnil Shankar, Philipp Mösta, Steven R. Brandt, Roland Haas, Erik Schnetter, Yannick de Graaf

    Abstract: We present GRaM-X (General Relativistic accelerated Magnetohydrodynamics on AMReX), a new GPU-accelerated dynamical-spacetime general relativistic magnetohydrodynamics (GRMHD) code which extends the GRMHD capability of Einstein Toolkit to GPU-based exascale systems. GRaM-X supports 3D adaptive mesh refinement (AMR) on GPUs via a new AMR driver for the Einstein Toolkit called CarpetX which in turn… ▽ More

    Submitted 21 November, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 22 pages, 8 figures, to be submitted to Classical and Quantum Gravity

  30. Trends in Energy Estimates for Computing in AI/Machine Learning Accelerators, Supercomputers, and Compute-Intensive Applications

    Authors: Sadasivan Shankar, Albert Reuther

    Abstract: We examine the computational energy requirements of different systems driven by the geometrical scaling law, and increasing use of Artificial Intelligence or Machine Learning (AI-ML) over the last decade. With more scientific and technology applications based on data-driven discovery, machine learning methods, especially deep neural networks, have become widely used. In order to enable such applic… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 8 pages, 9 figures, Submitted to Proceedings of IEEE Conference on High Performance Extreme Computing (HPEC) 2022

    MSC Class: 68U01 ACM Class: C.4; I.2

  31. arXiv:2209.09125  [pdf, other

    cs.SE cs.HC cs.LG

    Operationalizing Machine Learning: An Interview Study

    Authors: Shreya Shankar, Rolando Garcia, Joseph M. Hellerstein, Aditya G. Parameswaran

    Abstract: Organizations rely on machine learning engineers (MLEs) to operationalize ML, i.e., deploy and maintain ML pipelines in production. The process of operationalizing ML, or MLOps, consists of a continual loop of (i) data collection and labeling, (ii) experimentation to improve ML performance, (iii) evaluation throughout a multi-staged deployment process, and (iv) monitoring of performance drops in p… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: 20 pages, 4 figures

  32. arXiv:2209.03056  [pdf

    cs.NE

    Parallel and Streaming Wavelet Neural Networks for Classification and Regression under Apache Spark

    Authors: Eduru Harindra Venkatesh, Yelleti Vivek, Vadlamani Ravi, Orsu Shiva Shankar

    Abstract: Wavelet neural networks (WNN) have been applied in many fields to solve regression as well as classification problems. After the advent of big data, as data gets generated at a brisk pace, it is imperative to analyze it as soon as it is generated owing to the fact that the nature of the data may change dramatically in short time intervals. This is necessitated by the fact that big data is all perv… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 25 pages; 2 Tables; 7 Figures

    MSC Class: 68T09; 68Txx ACM Class: I.2

  33. arXiv:2209.00302  [pdf, other

    cs.LG cs.MM

    Progressive Fusion for Multimodal Integration

    Authors: Shiv Shankar, Laure Thompson, Madalina Fiterau

    Abstract: Integration of multimodal information from various sources has been shown to boost the performance of machine learning models and thus has received increased attention in recent years. Often such models use deep modality-specific networks to obtain unimodal features which are combined to obtain "late-fusion" representations. However, these designs run the risk of information loss in the respective… ▽ More

    Submitted 20 November, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

  34. arXiv:2205.11473  [pdf, other

    cs.LG cs.AI stat.ML

    Rethinking Streaming Machine Learning Evaluation

    Authors: Shreya Shankar, Bernease Herman, Aditya G. Parameswaran

    Abstract: While most work on evaluating machine learning (ML) models focuses on computing accuracy on batches of data, tracking accuracy alone in a streaming setting (i.e., unbounded, timestamp-ordered datasets) fails to appropriately identify when models are performing unexpectedly. In this position paper, we discuss how the nature of streaming ML problems introduces new real-world challenges (e.g., delaye… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: ML Evaluation Standards Workshop (ICLR 2022)

  35. arXiv:2205.08636  [pdf, other

    cond-mat.soft physics.bio-ph

    Boundaries control active channel flows

    Authors: Paarth Gulati, Suraj Shankar, M. Cristina Marchetti

    Abstract: Boundary conditions dictate how fluids, including liquid crystals, flow when pumped through a channel. Can boundary conditions also be used to control internally driven active fluids that generate flows spontaneously? By using numerical simulations and stability analysis we explore how surface anchoring of active agents at the boundaries and substrate drag can be used to rectify coherent flow of a… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 12 pages, 9 figures

  36. arXiv:2201.00247  [pdf

    cond-mat.mtrl-sci physics.chem-ph physics.soc-ph

    Now is the time to build a national data ecosystem for materials science and chemistry research data

    Authors: E. M. Campo, S. Shankar, A. S. Szalay, R. J. Hanisch

    Abstract: A call for coordinated action from government, academia, and industry.

    Submitted 1 January, 2022; originally announced January 2022.

  37. arXiv:2112.09079  [pdf, other

    q-bio.PE cond-mat.soft cond-mat.stat-mech physics.flu-dyn

    Spatial population genetics with fluid flow

    Authors: Roberto Benzi, David R. Nelson, Suraj Shankar, Federico Toschi, Xiaojue Zhu

    Abstract: The growth and evolution of microbial populations is often subjected to advection by fluid flows in spatially extended environments, with immediate consequences for questions of spatial population genetics in marine ecology, planktonic diversity and origin of life scenarios. Here, we review recent progress made in understanding this rich problem in the simplified setting of two competing genetic m… ▽ More

    Submitted 30 June, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: 29 pages, 22 figures

    Journal ref: Rep. Prog. Phys. 85 096601, 2022

  38. arXiv:2112.05676  [pdf, ps, other

    cond-mat.soft eess.SY math.OC physics.flu-dyn

    Optimal transport and control of active drops

    Authors: Suraj Shankar, Vidya Raju, L. Mahadevan

    Abstract: Understanding the complex patterns in space-time exhibited by active systems has been the subject of much interest in recent times. Complementing this forward problem is the inverse problem of controlling active matter. Here we use optimal control theory to pose the problem of transporting a slender drop of an active fluid and determine the dynamical profile of the active stresses to move it with… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: 8 pages, 4 figures, SI available upon request

    Journal ref: PNAS 119 (35) e2121985119, 2022

  39. arXiv:2110.00385  [pdf, ps, other

    cs.NE cs.AI cs.LG

    Neural Dependency Coding inspired Multimodal Fusion

    Authors: Shiv Shankar

    Abstract: Information integration from different modalities is an active area of research. Human beings and, in general, biological neural systems are quite adept at using a multitude of signals from different sensory perceptive fields to interact with the environment and each other. Recent work in deep fusion models via neural networks has led to substantial improvements over unimodal approaches in areas l… ▽ More

    Submitted 4 October, 2021; v1 submitted 28 September, 2021; originally announced October 2021.

  40. arXiv:2108.13557  [pdf, other

    cs.SE cs.DB

    Towards Observability for Production Machine Learning Pipelines

    Authors: Shreya Shankar, Aditya Parameswaran

    Abstract: Software organizations are increasingly incorporating machine learning (ML) into their product offerings, driving a need for new data management tools. Many of these tools facilitate the initial development of ML applications, but sustaining these applications post-deployment is difficult due to lack of real-time feedback (i.e., labels) for predictions and silent failures that could occur at any c… ▽ More

    Submitted 15 July, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: 11 pages, 6 figures

  41. arXiv:2108.12982  [pdf, other

    cs.LG

    Adversarial Stein Training for Graph Energy Models

    Authors: Shiv Shankar

    Abstract: Learning distributions over graph-structured data is a challenging task with many applications in biology and chemistry. In this work we use an energy-based model (EBM) based on multi-channel graph neural networks (GNN) to learn permutation invariant unnormalized density functions on graphs. Unlike standard EBM training methods our approach is to learn the model via minimizing adversarial stein di… ▽ More

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: Appeared at Machine Learning for Molecules Workshop at NeurIPS 2020.https://ml4molecules.github.io

  42. arXiv:2108.10875  [pdf, other

    cond-mat.soft cond-mat.mes-hall cond-mat.mtrl-sci physics.app-ph

    Geometric control of topological dynamics in a singing saw

    Authors: Suraj Shankar, Petur Bryde, L. Mahadevan

    Abstract: The common handsaw can be converted into a bowed musical instrument capable of producing exquisitely sustained notes when its blade is appropriately bent. Acoustic modes localized at an inflection point are known to underlie the saw's sonorous quality, yet the origin of localization has remained mysterious. Here we uncover a topological basis for the existence of localized modes, that relies on an… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: 17 pages, 3 figures, SI available upon request

    Journal ref: PNAS, 119 (17) e2117241119, 2022

  43. Wireless Sensor Networks for Optimisation of Search and Rescue Management in Floods

    Authors: Harshil Bhatt, Pranesh G, Samarth Shankar, Shriyash Haralikar

    Abstract: We propose a novel search-and-rescue management method that relies on the aerial deployment of Wireless Sensor Network (WSN) for locating victims after floods. The sensor nodes will collect vital information such as heat signatures for detecting human presence and location, the flow of flood. The sensor modules are packed in a portable floating buoy with a user interface to convey emergency messag… ▽ More

    Submitted 21 August, 2021; originally announced August 2021.

  44. AutoLay: Benchmarking amodal layout estimation for autonomous driving

    Authors: Kaustubh Mani, N. Sai Shankar, Krishna Murthy Jatavallabhula, K. Madhava Krishna

    Abstract: Given an image or a video captured from a monocular camera, amodal layout estimation is the task of predicting semantics and occupancy in bird's eye view. The term amodal implies we also reason about entities in the scene that are occluded or truncated in image space. While several recent efforts have tackled this problem, there is a lack of standardization in task specification, datasets, and eva… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: published in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  45. arXiv:2107.14139  [pdf, other

    q-bio.PE

    Vaccination Worldwide: Strategies, Distribution and Challenges

    Authors: Chirag Samal, Kasia Jakimowicz, Krishnendu Dasgupta, Aniket Vashishtha, Francisco O., Arunakiry Natarajan, Haris Nazir, Alluri Siddhartha Varma, Tejal Dahake, Amitesh Anand Pandey, Ishaan Singh, John Sangyeob Kim, Mehrab Singh Gill, Saurish Srivastava, Orna Mukhopadhyay, Parth Patwa, Qamil Mirza, Sualeha Irshad, Sheshank Shankar, Rohan Iyer, Rohan Sukumaran, Ashley Mehra, Anshuman Sharma, Abhishek Singh, Maurizio Arseni , et al. (4 additional authors not shown)

    Abstract: The Coronavirus 2019 (Covid-19) pandemic caused by the SARS-CoV-2 virus represents an unprecedented crisis for our planet. It is a bane of the über connected world that we live in that this virus has affected almost all countries and caused mortality and economic upheaval at a scale whose effects are going to be felt for generations to come. While we can all be buoyed at the pace at which vaccines… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

  46. arXiv:2107.01368  [pdf, ps, other

    math.OC eess.SY

    The coarsest lattice that determines a discrete multidimensional system

    Authors: Debasattam Pal, Shiva Shankar

    Abstract: A discrete multidimensional system is the set of solutions to a system of linear partial difference equations defined on the lattice $\Z^n$. This paper shows that it is determined by a unique coarsest sublattice, in the sense that the solutions of the system on this sublattice determine the solutions on $\Z^n$; it is therefore the correct domain of definition of the discrete system. In turn, the d… ▽ More

    Submitted 23 January, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

    Comments: To appear in Mathematics of Control Signals and Systems

    MSC Class: 39A14; 93B25; 13B05

  47. arXiv:2107.01338  [pdf, other

    stat.ME cs.LG

    Sibling Regression for Generalized Linear Models

    Authors: Shiv Shankar, Daniel Sheldon

    Abstract: Field observations form the basis of many scientific studies, especially in ecological and social sciences. Despite efforts to conduct such surveys in a standardized way, observations can be prone to systematic measurement errors. The removal of systematic variability introduced by the observation process, if possible, can greatly increase the value of this data. Existing non-parametric techniques… ▽ More

    Submitted 7 July, 2021; v1 submitted 3 July, 2021; originally announced July 2021.

    Journal ref: ECMLPKDD-2021

  48. arXiv:2105.08321  [pdf, other

    cs.LG cs.CY

    Can Self Reported Symptoms Predict Daily COVID-19 Cases?

    Authors: Parth Patwa, Viswanatha Reddy, Rohan Sukumaran, Sethuraman TV, Eptehal Nashnoush, Sheshank Shankar, Rishemjit Kaur, Abhishek Singh, Ramesh Raskar

    Abstract: The COVID-19 pandemic has impacted lives and economies across the globe, leading to many deaths. While vaccination is an important intervention, its roll-out is slow and unequal across the globe. Therefore, extensive testing still remains one of the key methods to monitor and contain the virus. Testing on a large scale is expensive and arduous. Hence, we need alternate methods to estimate the numb… ▽ More

    Submitted 21 June, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: Accepted as a full-length oral presentation at the International Workshop on Artificial Intelligence for Social Good (AI4SG), IJCAI-21

  49. Proto-magnetar jets as central engines for broad-lined type Ic supernovae

    Authors: Swapnil Shankar, Philipp Mösta, Jennifer Barnes, Paul C. Duffell, Daniel Kasen

    Abstract: A subset of type Ic supernovae (SNe Ic), broad-lined SNe Ic (SNe Ic-bl), show unusually high kinetic energies ($\sim 10^{52}$ erg) which cannot be explained by the energy supplied by neutrinos alone. Many SNe Ic-bl have been observed in coincidence with long gamma-ray bursts (GRBs) which suggests a connection between SNe and GRBs. A small fraction of core-collapse supernovae (CCSNe) form a rapidly… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: 13 pages, 12 figures

  50. Monocular Multi-Layer Layout Estimation for Warehouse Racks

    Authors: Meher Shashwat Nigam, Avinash Prabhu, Anurag Sahu, Puru Gupta, Tanvi Karandikar, N. Sai Shankar, Ravi Kiran Sarvadevabhatla, K. Madhava Krishna

    Abstract: Given a monocular colour image of a warehouse rack, we aim to predict the bird's-eye view layout for each shelf in the rack, which we term as multi-layer layout prediction. To this end, we present RackLay, a deep neural network for real-time shelf layout estimation from a single image. Unlike previous layout estimation methods, which provide a single layout for the dominant ground plane alone, Rac… ▽ More

    Submitted 28 October, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Visit our project repository at https://github.com/Avinash2468/RackLay