Search | arXiv e-print repository

Synthetic Multimodal Question Generation

Authors: Ian Wu, Sravan Jayanthi, Vijay Viswanathan, Simon Rosenberg, Sina Pakazad, Tongshuang Wu, Graham Neubig

Abstract: Multimodal Retrieval Augmented Generation (MMRAG) is a powerful approach to question-answering over multimodal documents. A key challenge with evaluating MMRAG is the paucity of high-quality datasets matching the question styles and modalities of interest. In light of this, we propose SMMQG, a synthetic data generation framework. SMMQG leverages interplay between a retriever, large language model… ▽ More Multimodal Retrieval Augmented Generation (MMRAG) is a powerful approach to question-answering over multimodal documents. A key challenge with evaluating MMRAG is the paucity of high-quality datasets matching the question styles and modalities of interest. In light of this, we propose SMMQG, a synthetic data generation framework. SMMQG leverages interplay between a retriever, large language model (LLM) and large multimodal model (LMM) to generate question and answer pairs directly from multimodal documents, with the questions conforming to specified styles and modalities. We use SMMQG to generate an MMRAG dataset of 1024 questions over Wikipedia documents and evaluate state-of-the-art models using it, revealing insights into model performance that are attainable only through style- and modality-specific evaluation data. Next, we measure the quality of data produced by SMMQG via a human study. We find that the quality of our synthetic data is on par with the quality of the crowdsourced benchmark MMQA and that downstream evaluation results using both datasets strongly concur. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: Submitted to ARR June 2024

arXiv:2406.15491 [pdf, other]

Vibrational Entropy and Free Energy of Solid Lithium using Covariance of Atomic Displacements Enabled by Machine Learning

Authors: Mgcini Keith Phuthi, Yang Huang, Michael Widom, Venkatasubramanian Viswanathan

Abstract: Vibrational properties of solids are key to determining stability, response and functionality. However, they are challenging to computationally predict at Ab-Initio accuracy, even for elemental systems. Ab-Initio methods for modeling atomic interactions are limited in the system sizes and simulation times that can be achieved. Due to these limitations, Machine Learning Interatomic Potentials (MLIP… ▽ More Vibrational properties of solids are key to determining stability, response and functionality. However, they are challenging to computationally predict at Ab-Initio accuracy, even for elemental systems. Ab-Initio methods for modeling atomic interactions are limited in the system sizes and simulation times that can be achieved. Due to these limitations, Machine Learning Interatomic Potentials (MLIPs) are gaining popularity and success as a faster, more scalable approach for modeling atomic interactions, potentially at Ab-Initio accuracy. Even with faster potentials, methodologies for predicting entropy, free energy and vibrational properties vary in accuracy, cost and difficulty to implement. Using the Covariance of Atomic Displacements (CAD) to predict entropy, free energy and finite-temperature phonon dispersions is a promising approach but thorough benchmarking has been hampered by the cost of Ab-Initio methods for sampling. In this work, we use a MLIP and the CAD to characterize the convergence of the predicted properties and determine optimal sampling strategies. We focus on solid lithium at zero pressure, showing that the MLIP-CAD approach reproduces experimental entropy, phonon dispersions and the martensitic transition while also comparing to more established methods. △ Less

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2404.14361 [pdf, other]

Better Synthetic Data by Retrieving and Transforming Existing Datasets

Authors: Saumya Gandhi, Ritu Gala, Vijay Viswanathan, Tongshuang Wu, Graham Neubig

Abstract: Despite recent advances in large language models, building dependable and deployable NLP models typically requires abundant, high-quality training data. However, task-specific data is not available for many use cases, and manually curating task-specific data is labor-intensive. Recent work has studied prompt-driven synthetic data generation using large language models, but these generated datasets… ▽ More Despite recent advances in large language models, building dependable and deployable NLP models typically requires abundant, high-quality training data. However, task-specific data is not available for many use cases, and manually curating task-specific data is labor-intensive. Recent work has studied prompt-driven synthetic data generation using large language models, but these generated datasets tend to lack complexity and diversity. To address these limitations, we introduce a method, DataTune, to make better use of existing, publicly available datasets to improve automatic dataset generation. DataTune performs dataset transformation, enabling the repurposing of publicly available datasets into a format that is directly aligned with the specific requirements of target tasks. On a diverse set of language-based tasks from the BIG-Bench benchmark, we find that finetuning language models via DataTune improves over a few-shot prompting baseline by 49% and improves over existing methods that use synthetic or retrieved training data by 34%. We find that dataset transformation significantly increases the diversity and difficulty of generated data on many tasks. We integrate DataTune into an open-source repository to make this method accessible to the community: https://github.com/neulab/prompt2model. △ Less

Submitted 26 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

Comments: PDF fixed in v3

arXiv:2403.00943 [pdf, ps, other]

On the Hardness of Fair Allocation under Ternary Valuations

Authors: Zack Fitzsimmons, Vignesh Viswanathan, Yair Zick

Abstract: We study the problem of fair allocation of indivisible items when agents have ternary additive valuations -- each agent values each item at some fixed integer values $a$, $b$, or $c$ that are common to all agents. The notions of fairness we consider are max Nash welfare (MNW), when $a$, $b$, and $c$ are non-negative, and max egalitarian welfare (MEW). We show that for any distinct non-negative… ▽ More We study the problem of fair allocation of indivisible items when agents have ternary additive valuations -- each agent values each item at some fixed integer values $a$, $b$, or $c$ that are common to all agents. The notions of fairness we consider are max Nash welfare (MNW), when $a$, $b$, and $c$ are non-negative, and max egalitarian welfare (MEW). We show that for any distinct non-negative $a$, $b$, and $c$, maximizing Nash welfare is APX-hard -- i.e., the problem does not admit a PTAS unless P = NP. We also show that for any distinct $a$, $b$, and $c$, maximizing egalitarian welfare is APX-hard except for a few cases when $b = 0$ that admit efficient algorithms. These results make significant progress towards completely characterizing the complexity of computing exact MNW allocations and MEW allocations. En route, we resolve open questions left by prior work regarding the complexity of computing MNW allocations under bivalued valuations, and MEW allocations under ternary mixed manna. △ Less

Submitted 1 March, 2024; originally announced March 2024.

arXiv:2401.10287 [pdf, other]

Open-Source Fermionic Neural Networks with Ionic Charge Initialization

Authors: Shai Pranesh, Shang Zhu, Venkat Viswanathan, Bharath Ramsundar

Abstract: Finding accurate solutions to the electronic Schrödinger equation plays an important role in discovering important molecular and material energies and characteristics. Consequently, solving systems with large numbers of electrons has become increasingly important. Variational Monte Carlo (VMC) methods, especially those approximated through deep neural networks, are promising in this regard. In thi… ▽ More Finding accurate solutions to the electronic Schrödinger equation plays an important role in discovering important molecular and material energies and characteristics. Consequently, solving systems with large numbers of electrons has become increasingly important. Variational Monte Carlo (VMC) methods, especially those approximated through deep neural networks, are promising in this regard. In this paper, we aim to integrate one such model called the FermiNet, a post-Hartree-Fock (HF) Deep Neural Network (DNN) model, into a standard and widely used open source library, DeepChem. We also propose novel initialization techniques to overcome the difficulties associated with the assignment of excess or lack of electrons for ions. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Comments: Accepted at 3rd Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE)

arXiv:2312.10632 [pdf]

Dosimetric calibration of an anatomically specific ultra-high dose rate electron irradiation platform for preclinical FLASH radiobiology experiments

Authors: **ghui Wang, Stavros Melemenidis, Rakesh Manjappa, Vignesh Viswanathan, Ramish M. Ashraf, Karen Levy, Lawrie Skinner, Luis A. Soto, Stephanie Chow, Brianna Lau, Ryan B. Ko, Edward E. Graves, Amy S. Yu, Karl K. Bush, Murat Surucu, Erinn B. Rankin, Billy W. Loo Jr, Emil Schüler, Peter G. Maxim

Abstract: We characterized the dosimetric properties of a clinical linear accelerator configured to deliver ultra-high dose rate (UHDR) irradiation to mice and cell-culture FLASH radiobiology experiments. UHDR electron beams were controlled by a microcontroller and relay interfaced with the respiratory gating system. We produced beam collimators with indexed stereotactic mouse positioning devices to provide… ▽ More We characterized the dosimetric properties of a clinical linear accelerator configured to deliver ultra-high dose rate (UHDR) irradiation to mice and cell-culture FLASH radiobiology experiments. UHDR electron beams were controlled by a microcontroller and relay interfaced with the respiratory gating system. We produced beam collimators with indexed stereotactic mouse positioning devices to provide anatomically specific preclinical treatments. Treatment delivery was monitored directly with an ionization chamber, and charge measurements were correlated with radiochromic film at the entry surface of the mice. The setup for conventional (CONV) dose rate irradiation was similar but the source-to-surface distance was longer. Monte Carlo simulations and film dosimetry were used to characterize beam properties and dose distributions. The mean electron beam energies before the flattening filter were 18.8 MeV (UHDR) and 17.7 MeV (CONV), with corresponding values at the mouse surface of 17.2 MeV and 16.2 MeV. The charges measured with an external ion chamber were linearly correlated with the mouse entrance dose. Use of relay gating for pulse control initially led to a delivery failure rate of 20% ($+/-$ 1 pulse); adjustments to account for the linac latency improved this rate to <1/20. Beam field sizes for two anatomically specific mouse collimators (4x4 $cm^2$ for whole-abdomen and 1.5x1.5 $cm^2$ for unilateral lung irradiation) were accurate within <5% and had low radiation leakage (<4%). Normalizing the dose at the center of the mouse (~0.75 cm depth) produced UHDR and CONV doses to the irradiated volumes with >95% agreement. We successfully configured a clinical linear accelerator for increased output and developed a robust preclinical platform for anatomically specific irradiation, with highly accurate and precise temporal and spatial dose delivery, for both CONV and UHDR applications. △ Less

Submitted 17 December, 2023; originally announced December 2023.

Comments: **ghui Wang and Stavros Melemenidis are co-first authors, and Emil Schüler and Peter G. Maxim are co-senior/co-corresponding authors

arXiv:2311.03566 [pdf, other]

Measuring Adversarial Datasets

Authors: Yuanchen Bai, Raoyi Huang, Vijay Viswanathan, Tzu-Sheng Kuo, Tongshuang Wu

Abstract: In the era of widespread public use of AI systems across various domains, ensuring adversarial robustness has become increasingly vital to maintain safety and prevent undesirable errors. Researchers have curated various adversarial datasets (through perturbations) for capturing model deficiencies that cannot be revealed in standard benchmark datasets. However, little is known about how these adver… ▽ More In the era of widespread public use of AI systems across various domains, ensuring adversarial robustness has become increasingly vital to maintain safety and prevent undesirable errors. Researchers have curated various adversarial datasets (through perturbations) for capturing model deficiencies that cannot be revealed in standard benchmark datasets. However, little is known about how these adversarial examples differ from the original data points, and there is still no methodology to measure the intended and unintended consequences of those adversarial transformations. In this research, we conducted a systematic survey of existing quantifiable metrics that describe text instances in NLP tasks, among dimensions of difficulty, diversity, and disagreement. We selected several current adversarial effect datasets and compared the distributions between the original and their adversarial counterparts. The results provide valuable insights into what makes these datasets more challenging from a metrics perspective and whether they align with underlying assumptions. △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: ART of Safety workshop (AACL 2023)

arXiv:2310.03131 [pdf, ps, other]

Axiomatic Aggregations of Abductive Explanations

Authors: Gagan Biradar, Yacine Izza, Elita Lobo, Vignesh Viswanathan, Yair Zick

Abstract: The recent criticisms of the robustness of post hoc model approximation explanation methods (like LIME and SHAP) have led to the rise of model-precise abductive explanations. For each data point, abductive explanations provide a minimal subset of features that are sufficient to generate the outcome. While theoretically sound and rigorous, abductive explanations suffer from a major issue -- there c… ▽ More The recent criticisms of the robustness of post hoc model approximation explanation methods (like LIME and SHAP) have led to the rise of model-precise abductive explanations. For each data point, abductive explanations provide a minimal subset of features that are sufficient to generate the outcome. While theoretically sound and rigorous, abductive explanations suffer from a major issue -- there can be several valid abductive explanations for the same data point. In such cases, providing a single abductive explanation can be insufficient; on the other hand, providing all valid abductive explanations can be incomprehensible due to their size. In this work, we solve this issue by aggregating the many possible abductive explanations into feature importance scores. We propose three aggregation methods: two based on power indices from cooperative game theory and a third based on a well-known measure of causal strength. We characterize these three methods axiomatically, showing that each of them uniquely satisfies a set of desirable properties. We also evaluate them on multiple datasets and show that these explanations are robust to the attacks that fool SHAP and LIME. △ Less

Submitted 12 October, 2023; v1 submitted 29 September, 2023; originally announced October 2023.

arXiv:2310.03047 [pdf, other]

Differentiable Modeling and Optimization of Battery Electrolyte Mixtures Using Geometric Deep Learning

Authors: Shang Zhu, Bharath Ramsundar, Emil Annevelink, Hongyi Lin, Adarsh Dave, Pin-Wen Guan, Kevin Gering, Venkatasubramanian Viswanathan

Abstract: Electrolytes play a critical role in designing next-generation battery systems, by allowing efficient ion transfer, preventing charge transfer, and stabilizing electrode-electrolyte interfaces. In this work, we develop a differentiable geometric deep learning (GDL) model for chemical mixtures, DiffMix, which is applied in guiding robotic experimentation and optimization towards fast-charging batte… ▽ More Electrolytes play a critical role in designing next-generation battery systems, by allowing efficient ion transfer, preventing charge transfer, and stabilizing electrode-electrolyte interfaces. In this work, we develop a differentiable geometric deep learning (GDL) model for chemical mixtures, DiffMix, which is applied in guiding robotic experimentation and optimization towards fast-charging battery electrolytes. In particular, we extend mixture thermodynamic and transport laws by creating GDL-learnable physical coefficients. We evaluate our model with mixture thermodynamics and ion transport properties, where we show improved prediction accuracy and model robustness of DiffMix than its purely data-driven variants. Furthermore, with a robotic experimentation setup, Clio, we improve ionic conductivity of electrolytes by over 18.8% within 10 experimental steps, via differentiable optimization built on DiffMix gradients. By combining GDL, mixture physics laws, and robotic experimentation, DiffMix expands the predictive modeling methods for chemical mixtures and enables efficient optimization in large chemical spaces. △ Less

Submitted 1 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

arXiv:2309.15985 [pdf, other]

Open Source Infrastructure for Differentiable Density Functional Theory

Authors: Advika Vidhyadhiraja, Arun Pa Thiagarajan, Shang Zhu, Venkat Viswanathan, Bharath Ramsundar

Abstract: Learning exchange correlation functionals, used in quantum chemistry calculations, from data has become increasingly important in recent years, but training such a functional requires sophisticated software infrastructure. For this reason, we build open source infrastructure to train neural exchange correlation functionals. We aim to standardize the processing pipeline by adapting state-of-the-art… ▽ More Learning exchange correlation functionals, used in quantum chemistry calculations, from data has become increasingly important in recent years, but training such a functional requires sophisticated software infrastructure. For this reason, we build open source infrastructure to train neural exchange correlation functionals. We aim to standardize the processing pipeline by adapting state-of-the-art techniques from work done by multiple groups. We have open sourced the model in the DeepChem library to provide a platform for additional research on differentiable quantum chemistry methods. △ Less

Submitted 27 September, 2023; originally announced September 2023.

arXiv:2309.00535 [pdf, other]

Towards a Reduced Dependency Framework for Autonomous Unified Inspect-Explore Missions

Authors: Vignesh Kottayam Viswanathan, Sumeet Gajanan Satpute, Ali-akbar Agha-mohammadi, George Nikolakopoulos

Abstract: The task of establishing and maintaining situational awareness in an unknown environment is a critical step to fulfil in a mission related to the field of rescue robotics. Predominantly, the problem of visual inspection of urban structures is dealt with view-planning being addressed by map-based approaches. In this article, we propose a novel approach towards effective use of Micro Aerial Vehicles… ▽ More The task of establishing and maintaining situational awareness in an unknown environment is a critical step to fulfil in a mission related to the field of rescue robotics. Predominantly, the problem of visual inspection of urban structures is dealt with view-planning being addressed by map-based approaches. In this article, we propose a novel approach towards effective use of Micro Aerial Vehicles (MAVs) for obtaining a 3-D shape of an unknown structure of objects utilizing a map-independent planning framework. The problem is undertaken via a bifurcated approach to address the task of executing a closer inspection of detected structures with a wider exploration strategy to identify and locate nearby structures, while being equipped with limited sensing capability. The proposed framework is evaluated experimentally in a controlled indoor environment in presence of a mock-up environment validating the efficacy of the proposed inspect-explore policy. △ Less

Submitted 1 September, 2023; originally announced September 2023.

arXiv:2308.15653 [pdf, other]

Statistical methods for resolving poor uncertainty quantification in machine learning interatomic potentials

Authors: Emil Annevelink, Venkatasubramanian Viswanathan

Abstract: Machine learning interatomic potentials (MLIPs) are promising surrogates for quantum mechanics evaluations in ab-initio molecular dynamics simulations due to their ability to reproduce the energy and force landscape within chemical accuracy at four orders of magnitude less cost. While develo** uncertainty quantification (UQ) tools for MLIPs is critical to build production MLIP datasets using act… ▽ More Machine learning interatomic potentials (MLIPs) are promising surrogates for quantum mechanics evaluations in ab-initio molecular dynamics simulations due to their ability to reproduce the energy and force landscape within chemical accuracy at four orders of magnitude less cost. While develo** uncertainty quantification (UQ) tools for MLIPs is critical to build production MLIP datasets using active learning, only limited progress has been made and the most robust method, ensembling, still shows low correlation between high error and high uncertainty predictions. Here we develop a rigorous method rooted in statistics for determining an error cutoff that distinguishes regions of high and low UQ performance. The statistical cutoff illuminates that a main cause of the poor UQ performance is due to the machine learning model already describing the entire dataset and not having any datapoints with error greater than the statistical error distribution. Second, we extend the statistical analysis to create an interpretable connection between the error and uncertainty distributions to predict an uncertainty cutoff separating high and low errors. We showcase the statistical cutoff in active learning benchmarks on two datasets of varying chemical complexity for three common UQ methods: ensembling, sparse Gaussian processes, and latent distance metrics and compare them to the true error and random sampling, showing that the statistical cutoff is generalizable to a variety of different UQ methods and protocols and performs similarly to using the true error. Importantly, we conclude that utilizing this uncertainty cutoff enables using significantly lower cost uncertainty quantification tools such as sparse gaussian processes and latent distances compared to ensembling approaches for generating MLIP datasets at a fraction of the cost. △ Less

Submitted 29 August, 2023; originally announced August 2023.

arXiv:2308.12261 [pdf, other]

Prompt2Model: Generating Deployable Models from Natural Language Instructions

Authors: Vijay Viswanathan, Chenyang Zhao, Amanda Bertsch, Tongshuang Wu, Graham Neubig

Abstract: Large language models (LLMs) enable system builders today to create competent NLP systems through prompting, where they only need to describe the task in natural language and provide a few examples. However, in other ways, LLMs are a step backward from traditional special-purpose NLP models; they require extensive computational resources for deployment and can be gated behind APIs. In this paper,… ▽ More Large language models (LLMs) enable system builders today to create competent NLP systems through prompting, where they only need to describe the task in natural language and provide a few examples. However, in other ways, LLMs are a step backward from traditional special-purpose NLP models; they require extensive computational resources for deployment and can be gated behind APIs. In this paper, we propose Prompt2Model, a general-purpose method that takes a natural language task description like the prompts provided to LLMs, and uses it to train a special-purpose model that is conducive to deployment. This is done through a multi-step process of retrieval of existing datasets and pretrained models, dataset generation using LLMs, and supervised fine-tuning on these retrieved and generated datasets. Over three tasks, we demonstrate that given the same few-shot prompt as input, Prompt2Model trains models that outperform the results of a strong LLM, gpt-3.5-turbo, by an average of 20% while being up to 700 times smaller. We also show that this data can be used to obtain reliable performance estimates of model performance, enabling model developers to assess model reliability before deployment. Prompt2Model is available open-source at https://github.com/neulab/prompt2model. △ Less

Submitted 23 August, 2023; originally announced August 2023.

Comments: 8 pages

arXiv:2307.13533 [pdf, other]

Differentiable Turbulence II

Authors: Varun Shankar, Romit Maulik, Venkatasubramanian Viswanathan

Abstract: Differentiable fluid simulators are increasingly demonstrating value as useful tools for develo** data-driven models in computational fluid dynamics (CFD). Differentiable turbulence, or the end-to-end training of machine learning (ML) models embedded in CFD solution algorithms, captures both the generalization power and limited upfront cost of physics-based simulations, and the flexibility and a… ▽ More Differentiable fluid simulators are increasingly demonstrating value as useful tools for develo** data-driven models in computational fluid dynamics (CFD). Differentiable turbulence, or the end-to-end training of machine learning (ML) models embedded in CFD solution algorithms, captures both the generalization power and limited upfront cost of physics-based simulations, and the flexibility and automated training of deep learning methods. We develop a framework for integrating deep learning models into a generic finite element numerical scheme for solving the Navier-Stokes equations, applying the technique to learn a sub-grid scale closure using a multi-scale graph neural network. We demonstrate the method on several realizations of flow over a backwards-facing step, testing on both unseen Reynolds numbers and new geometry. We show that the learned closure can achieve accuracy comparable to traditional large eddy simulation on a finer grid that amounts to an equivalent speedup of 10x. As the desire and need for cheaper CFD simulations grows, we see hybrid physics-ML methods as a path forward to be exploited in the near future. △ Less

Submitted 25 July, 2023; originally announced July 2023.

arXiv:2307.12516 [pdf, ps, other]

The Good, the Bad and the Submodular: Fairly Allocating Mixed Manna Under Order-Neutral Submodular Preferences

Authors: Cyrus Cousins, Vignesh Viswanathan, Yair Zick

Abstract: We study the problem of fairly allocating indivisible goods (positively valued items) and chores (negatively valued items) among agents with decreasing marginal utilities over items. Our focus is on instances where all the agents have simple preferences; specifically, we assume the marginal value of an item can be either $-1$, $0$ or some positive integer $c$. Under this assumption, we present an… ▽ More We study the problem of fairly allocating indivisible goods (positively valued items) and chores (negatively valued items) among agents with decreasing marginal utilities over items. Our focus is on instances where all the agents have simple preferences; specifically, we assume the marginal value of an item can be either $-1$, $0$ or some positive integer $c$. Under this assumption, we present an efficient algorithm to compute leximin allocations for a broad class of valuation functions we call order-neutral submodular valuations. Order-neutral submodular valuations strictly contain the well-studied class of additive valuations but are a strict subset of the class of submodular valuations. We show that these leximin allocations are Lorenz dominating and approximately proportional. We also show that, under further restriction to additive valuations, these leximin allocations are approximately envy-free and guarantee each agent their maxmin share. We complement this algorithmic result with a lower bound showing that the problem of computing leximin allocations is NP-hard when $c$ is a rational number. △ Less

Submitted 24 July, 2023; originally announced July 2023.

arXiv:2307.12482 [pdf, ps, other]

Tight Approximations for Graphical House Allocation

Authors: Hadi Hosseini, Andrew McGregor, Rik Sengupta, Rohit Vaish, Vignesh Viswanathan

Abstract: The Graphical House Allocation problem asks: how can $n$ houses (each with a fixed non-negative value) be assigned to the vertices of an undirected graph $G$, so as to minimize the "aggregate local envy", i.e., the sum of absolute differences along the edges of $G$? This problem generalizes the classical Minimum Linear Arrangement problem, as well as the well-known House Allocation Problem from Ec… ▽ More The Graphical House Allocation problem asks: how can $n$ houses (each with a fixed non-negative value) be assigned to the vertices of an undirected graph $G$, so as to minimize the "aggregate local envy", i.e., the sum of absolute differences along the edges of $G$? This problem generalizes the classical Minimum Linear Arrangement problem, as well as the well-known House Allocation Problem from Economics, the latter of which has notable practical applications in organ exchanges. Recent work has studied the computational aspects of Graphical House Allocation and observed that the problem is NP-hard and inapproximable even on particularly simple classes of graphs, such as vertex disjoint unions of paths. However, the dependence of any approximations on the structural properties of the underlying graph had not been studied. In this work, we give a complete characterization of the approximability of the Graphical House Allocation problem. We present algorithms to approximate the optimal envy on general graphs, trees, planar graphs, bounded-degree graphs, bounded-degree planar graphs, and bounded-degree trees. For each of these graph classes, we then prove matching lower bounds, showing that in each case, no significant improvement can be attained unless P = NP. We also present general approximation ratios as a function of structural parameters of the underlying graph, such as treewidth; these match the aforementioned tight upper bounds in general, and are significantly better approximations for many natural subclasses of graphs. Finally, we present constant factor approximation schemes for the special classes of complete binary trees and random graphs. △ Less

Submitted 12 October, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

arXiv:2307.05486 [pdf, other]

Importance of equivariant and invariant symmetries for fluid flow modeling

Authors: Varun Shankar, Shivam Barwey, Zico Kolter, Romit Maulik, Venkatasubramanian Viswanathan

Abstract: Graph neural networks (GNNs) have shown promise in learning unstructured mesh-based simulations of physical systems, including fluid dynamics. In tandem, geometric deep learning principles have informed the development of equivariant architectures respecting underlying physical symmetries. However, the effect of rotational equivariance in modeling fluids remains unclear. We build a multi-scale equ… ▽ More Graph neural networks (GNNs) have shown promise in learning unstructured mesh-based simulations of physical systems, including fluid dynamics. In tandem, geometric deep learning principles have informed the development of equivariant architectures respecting underlying physical symmetries. However, the effect of rotational equivariance in modeling fluids remains unclear. We build a multi-scale equivariant GNN to forecast fluid flow and study the effect of modeling invariant and non-invariant representations of the flow state. We evaluate the model performance of several equivariant and non-equivariant architectures on predicting the evolution of two fluid flows, flow around a cylinder and buoyancy-driven shear flow, to understand the effect of equivariance and invariance on data-driven modeling approaches. Our results show that modeling invariant quantities produces more accurate long-term predictions and that these invariant quantities may be learned from the velocity field using a data-driven encoder. △ Less

Submitted 3 May, 2023; originally announced July 2023.

arXiv:2307.03683 [pdf, other]

Differentiable Turbulence: Closure as a partial differential equation constrained optimization

Authors: Varun Shankar, Dibyajyoti Chakraborty, Venkatasubramanian Viswanathan, Romit Maulik

Abstract: Deep learning is increasingly becoming a promising pathway to improving the accuracy of sub-grid scale (SGS) turbulence closure models for large eddy simulations (LES). We leverage the concept of differentiable turbulence, whereby an end-to-end differentiable solver is used in combination with physics-inspired choices of deep learning architectures to learn highly effective and versatile SGS model… ▽ More Deep learning is increasingly becoming a promising pathway to improving the accuracy of sub-grid scale (SGS) turbulence closure models for large eddy simulations (LES). We leverage the concept of differentiable turbulence, whereby an end-to-end differentiable solver is used in combination with physics-inspired choices of deep learning architectures to learn highly effective and versatile SGS models for two-dimensional turbulent flow. We perform an in-depth analysis of the inductive biases in the chosen architectures, finding that the inclusion of small-scale non-local features is most critical to effective SGS modeling, while large-scale features can improve pointwise accuracy of the \textit{a-posteriori} solution field. The velocity gradient tensor on the LES grid can be mapped directly to the SGS stress via decomposition of the inputs and outputs into isotropic, deviatoric, and anti-symmetric components. We see that the model can generalize to a variety of flow configurations, including higher and lower Reynolds numbers and different forcing conditions. We show that the differentiable physics paradigm is more successful than offline, \textit{a-priori} learning, and that hybrid solver-in-the-loop approaches to deep learning offer an ideal balance between computational efficiency, accuracy, and generalization. Our experiments provide physics-based recommendations for deep-learning based SGS modeling for generalizable closure modeling of turbulence. △ Less

Submitted 27 March, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

arXiv:2307.00524 [pdf, other]

Large Language Models Enable Few-Shot Clustering

Authors: Vijay Viswanathan, Kiril Gashteovski, Carolin Lawrence, Tongshuang Wu, Graham Neubig

Abstract: Unlike traditional unsupervised clustering, semi-supervised clustering allows users to provide meaningful structure to the data, which helps the clustering algorithm to match the user's intent. Existing approaches to semi-supervised clustering require a significant amount of feedback from an expert to improve the clusters. In this paper, we ask whether a large language model can amplify an expert'… ▽ More Unlike traditional unsupervised clustering, semi-supervised clustering allows users to provide meaningful structure to the data, which helps the clustering algorithm to match the user's intent. Existing approaches to semi-supervised clustering require a significant amount of feedback from an expert to improve the clusters. In this paper, we ask whether a large language model can amplify an expert's guidance to enable query-efficient, few-shot semi-supervised text clustering. We show that LLMs are surprisingly effective at improving clustering. We explore three stages where LLMs can be incorporated into clustering: before clustering (improving input features), during clustering (by providing constraints to the clusterer), and after clustering (using LLMs post-correction). We find incorporating LLMs in the first two stages can routinely provide significant improvements in cluster quality, and that LLMs enable a user to make trade-offs between cost and accuracy to produce desired clusters. We release our code and LLM prompts for the public to use. △ Less

Submitted 2 July, 2023; originally announced July 2023.

arXiv:2306.15557 [pdf, ps, other]

Simple Steps to Success: Axiomatics of Distance-Based Algorithmic Recourse

Authors: Jenny Hamer, Jake Valladares, Vignesh Viswanathan, Yair Zick

Abstract: We propose a novel data-driven framework for algorithmic recourse that offers users interventions to change their predicted outcome. Existing approaches to compute recourse find a set of points that satisfy some desiderata -- e.g. an intervention in the underlying causal graph, or minimizing a cost function. Satisfying these criteria, however, requires extensive knowledge of the underlying model s… ▽ More We propose a novel data-driven framework for algorithmic recourse that offers users interventions to change their predicted outcome. Existing approaches to compute recourse find a set of points that satisfy some desiderata -- e.g. an intervention in the underlying causal graph, or minimizing a cost function. Satisfying these criteria, however, requires extensive knowledge of the underlying model structure, often an unrealistic amount of information in several domains. We propose a data-driven, computationally efficient approach to computing algorithmic recourse. We do so by suggesting directions in the data manifold that users can take to change their predicted outcome. We present Stepwise Explainable Paths (StEP), an axiomatically justified framework to compute direction-based algorithmic recourse. We offer a thorough empirical and theoretical investigation of StEP. StEP offers provable privacy and robustness guarantees, and outperforms the state-of-the-art on several established recourse desiderata. △ Less

Submitted 1 August, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

arXiv:2306.00028 [pdf, other]

doi 10.1021/jacs.4c03464

Twisto-electrochemical activity volcanoes in Trilayer Graphene

Authors: Mohammad Babar, Ziyan Zhu, Rachel Kurchin, Efthimios Kaxiras, Venkatasubramanian Viswanathan

Abstract: In this work, we develop a twist-dependent electrochemical activity map, combining a tight-binding electronic structure model with modified Marcus-Hush-Chidsey kinetics in trilayer graphene. We identify a counterintuitive rate enhancement region spanning the magic angle curve and incommensurate twists of the system geometry. We find a broad activity peak with a ruthenium hexamine redox couple in r… ▽ More In this work, we develop a twist-dependent electrochemical activity map, combining a tight-binding electronic structure model with modified Marcus-Hush-Chidsey kinetics in trilayer graphene. We identify a counterintuitive rate enhancement region spanning the magic angle curve and incommensurate twists of the system geometry. We find a broad activity peak with a ruthenium hexamine redox couple in regions corresponding to both magic angles and incommensurate angles, a result qualitatively distinct from the twisted bilayer case. Flat bands and incommensurability offer new avenues for reaction rate enhancements in electrochemical transformations. △ Less

Submitted 31 May, 2023; originally announced June 2023.

Comments: 6 pages, 4 figures, Supporting Information

Journal ref: J. Am. Chem. Soc. 2024, 146, 23, 16105-16111

arXiv:2305.16636 [pdf, other]

DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions

Authors: Vijay Viswanathan, Luyu Gao, Tongshuang Wu, Pengfei Liu, Graham Neubig

Abstract: Modern machine learning relies on datasets to develop and validate research ideas. Given the growth of publicly available data, finding the right dataset to use is increasingly difficult. Any research question imposes explicit and implicit constraints on how well a given dataset will enable researchers to answer this question, such as dataset size, modality, and domain. We operationalize the task… ▽ More Modern machine learning relies on datasets to develop and validate research ideas. Given the growth of publicly available data, finding the right dataset to use is increasingly difficult. Any research question imposes explicit and implicit constraints on how well a given dataset will enable researchers to answer this question, such as dataset size, modality, and domain. We operationalize the task of recommending datasets given a short natural language description of a research idea, to help people find relevant datasets for their needs. Dataset recommendation poses unique challenges as an information retrieval problem; datasets are hard to directly index for search and there are no corpora readily available for this task. To facilitate this task, we build the DataFinder Dataset which consists of a larger automatically-constructed training set (17.5K queries) and a smaller expert-annotated evaluation set (392 queries). Using this data, we compare various information retrieval algorithms on our test set and present a superior bi-encoder retriever for text-based dataset recommendation. This system, trained on the DataFinder Dataset, finds more relevant search results than existing third-party dataset search engines. To encourage progress on dataset recommendation, we release our dataset and models to the public. △ Less

Submitted 6 June, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

Comments: To appear at ACL 2023. Code published at https://github.com/viswavi/datafinder

arXiv:2305.12010 [pdf, other]

Chemellia: An Ecosystem for Atomistic Scientific Machine Learning

Authors: Anant Thazhemadam, Dhairya Gandhi, Venkatasubramanian Viswanathan, Rachel C. Kurchin

Abstract: Chemellia is an open-source framework for atomistic machine learning in the Julia programming language. The framework takes advantage of Julia's high speed as well as the ability to share and reuse code and interfaces through the paradigm of multiple dispatch. Chemellia is designed to make use of existing interfaces and avoid ``reinventing the wheel'' wherever possible. A key aspect of the Chemell… ▽ More Chemellia is an open-source framework for atomistic machine learning in the Julia programming language. The framework takes advantage of Julia's high speed as well as the ability to share and reuse code and interfaces through the paradigm of multiple dispatch. Chemellia is designed to make use of existing interfaces and avoid ``reinventing the wheel'' wherever possible. A key aspect of the Chemellia ecosystem is the ChemistryFeaturization interface for defining and encoding features -- it is designed to maximize interoperability between featurization schemes and elements thereof, to maintain provenance of encoded features, and to ensure easy decodability and reconfigurability to enable feature engineering experiments. This embodies the overall design principles of the Chemellia ecosystem: separation of concerns, interoperability, and transparency. We illustrate these principles by discussing the implementation of crystal graph convolutional neural networks for material property prediction. △ Less

Submitted 19 May, 2023; originally announced May 2023.

arXiv:2305.06925 [pdf, other]

Accurate Surface and Finite Temperature Bulk Properties of Lithium Metal at Large Scales using Machine Learning Interaction Potentials

Authors: Mgcini Keith Phuthi, Archie Mingze Yao, Simon Batzner, Albert Musaelian, Boris Kozinsky, Ekin Dogus Cubuk, Venkatasubramanian Viswanathan

Abstract: The properties of lithium metal are key parameters in the design of lithium ion and lithium metal batteries. They are difficult to probe experimentally due to the high reactivity and low melting point of lithium as well as the microscopic scales at which lithium exists in batteries where it is found to have enhanced strength, with implications for dendrite suppression strategies. Computationally,… ▽ More The properties of lithium metal are key parameters in the design of lithium ion and lithium metal batteries. They are difficult to probe experimentally due to the high reactivity and low melting point of lithium as well as the microscopic scales at which lithium exists in batteries where it is found to have enhanced strength, with implications for dendrite suppression strategies. Computationally, there is a lack of empirical potentials that are consistently quantitatively accurate across all properties and ab-initio calculations are too costly. In this work, we train Machine Learning Interaction Potentials (MLIPs) on Density Functional Theory (DFT) data to state-of-the-art accuracy in reproducing experimental and ab-initio results across a wide range of simulations at large length and time scales. We accurately predict thermodynamic properties, phonon spectra, temperature dependence of elastic constants and various surface properties inaccessible using DFT. We establish that there exists a Bell-Evans-Polanyi relation correlating the self-adsorption energy and the minimum surface diffusion barrier for high Miller index facets. △ Less

Submitted 22 May, 2023; v1 submitted 24 April, 2023; originally announced May 2023.

Comments: 9 pages, 4 figures, 3 pages of Supporting Information

arXiv:2304.14520 [pdf, other]

doi 10.1109/MED59994.2023.10185906

Multimodal Dataset from Harsh Sub-Terranean Environment with Aerosol Particles for Frontier Exploration

Authors: Alexander Kyuroson, Niklas Dahlquist, Nikolaos Stathoulopoulos, Vignesh Kottayam Viswanathan, Anton Koval, George Nikolakopoulos

Abstract: Algorithms for autonomous navigation in environments without Global Navigation Satellite System (GNSS) coverage mainly rely on onboard perception systems. These systems commonly incorporate sensors like cameras and Light Detection and Rangings (LiDARs), the performance of which may degrade in the presence of aerosol particles. Thus, there is a need of fusing acquired data from these sensors with d… ▽ More Algorithms for autonomous navigation in environments without Global Navigation Satellite System (GNSS) coverage mainly rely on onboard perception systems. These systems commonly incorporate sensors like cameras and Light Detection and Rangings (LiDARs), the performance of which may degrade in the presence of aerosol particles. Thus, there is a need of fusing acquired data from these sensors with data from Radio Detection and Rangings (RADARs) which can penetrate through such particles. Overall, this will improve the performance of localization and collision avoidance algorithms under such environmental conditions. This paper introduces a multimodal dataset from the harsh and unstructured underground environment with aerosol particles. A detailed description of the onboard sensors and the environment, where the dataset is collected are presented to enable full evaluation of acquired data. Furthermore, the dataset contains synchronized raw data measurements from all onboard sensors in Robot Operating System (ROS) format to facilitate the evaluation of navigation, and localization algorithms in such environments. In contrast to the existing datasets, the focus of this paper is not only to capture both temporal and spatial data diversities but also to present the impact of harsh conditions on captured data. Therefore, to validate the dataset, a preliminary comparison of odometry from onboard LiDARs is presented. △ Less

Submitted 21 June, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

Comments: Accepted in the 31st Mediterranean Conference on Control and Automation [MED2023]

arXiv:2303.09619 [pdf, other]

Vision Based Docking of Multiple Satellites with an Uncooperative Target

Authors: Fragiskos Fourlas, Vignesh Kottayam Viswanathan, Sumeet Satpute, George Nikolakopoulos

Abstract: With the ever growing number of space debris in orbit, the need to prevent further space population is becoming more and more apparent. Refueling, servicing, inspection and deorbiting of spacecraft are some example missions that require precise navigation and docking in space. Having multiple, collaborating robots handling these tasks can greatly increase the efficiency of the mission in terms of… ▽ More With the ever growing number of space debris in orbit, the need to prevent further space population is becoming more and more apparent. Refueling, servicing, inspection and deorbiting of spacecraft are some example missions that require precise navigation and docking in space. Having multiple, collaborating robots handling these tasks can greatly increase the efficiency of the mission in terms of time and cost. This article will introduce a modern and efficient control architecture for satellites on collaborative docking missions. The proposed architecture uses a centralized scheme that combines state-of-the-art, ad-hoc implementations of algorithms and techniques to maximize robustness and flexibility. It is based on a Model Predictive Controller (MPC) for which efficient cost function and constraint sets are designed to ensure a safe and accurate docking. A simulation environment is also presented to validate and test the proposed control scheme. △ Less

Submitted 16 March, 2023; originally announced March 2023.

Comments: ©2023 Fragiskos Fourlas, Vignesh Kottayam Viswanathan, Sumeet Satpute and George Nikolakopoulos. This work has been accepted to IFAC for publication under a Creative Commons Licence CC-BY-NC-ND

arXiv:2303.09529 [pdf, other]

Anomalous interfacial electron transfer kinetics in twisted trilayer graphene caused by layer-specific localization

Authors: Kaidi Zhang, Yun Yu, Stephen Carr, Mohammad Babar, Ziyan Zhu, Bryan Kim, Catherine Groschner, Nikta Khaloo, Takashi Taniguchi, Kenji Watanabe, Venkatasubramanian Viswanathan, D. Kwabena Bediako

Abstract: Interfacial electron-transfer (ET) reactions underpin the interconversion of electrical and chemical energy. Pioneering experiments showed that the ET rate depends on the Fermi Dirac distribution of the electronic density of states (DOS) of the electrode, formalized in the Marcus Hush Chidsey (MHC) model. Here, by controlling interlayer twists in well-defined trilayergraphene moires, we show that… ▽ More Interfacial electron-transfer (ET) reactions underpin the interconversion of electrical and chemical energy. Pioneering experiments showed that the ET rate depends on the Fermi Dirac distribution of the electronic density of states (DOS) of the electrode, formalized in the Marcus Hush Chidsey (MHC) model. Here, by controlling interlayer twists in well-defined trilayergraphene moires, we show that ET rates are strikingly dependent on electronic localization in each atomic layer, and not the overall DOS. The large degree of tunability inherent to moire electrodes leads to local ET kinetics that range over three orders of magnitude across different constructions of only three atomic layers, even exceeding rates at bulk metals. Our results demonstrate that beyond the ensemble DOS, electronic localization is critical in facilitating interfacial ET, with implications for understanding the origin of high interfacial reactivity typically exhibited by defects at electrode electrolyte interfaces. △ Less

Submitted 16 March, 2023; originally announced March 2023.

Comments: 18 pages, 5 figures

arXiv:2303.06212 [pdf, ps, other]

Weighted Notions of Fairness with Binary Supermodular Chores

Authors: Vignesh Viswanathan, Yair Zick

Abstract: We study the problem of allocating indivisible chores among agents with binary supermodular cost functions. In other words, each chore has a marginal cost of $0$ or $1$ and chores exhibit increasing marginal costs (or decreasing marginal utilities). In this note, we combine the techniques of Viswanathan and Zick (2022) and Barman et al. (2023) to present a general framework for fair allocation wit… ▽ More We study the problem of allocating indivisible chores among agents with binary supermodular cost functions. In other words, each chore has a marginal cost of $0$ or $1$ and chores exhibit increasing marginal costs (or decreasing marginal utilities). In this note, we combine the techniques of Viswanathan and Zick (2022) and Barman et al. (2023) to present a general framework for fair allocation with this class of valuation functions. Our framework allows us to generalize the results of Barman et al. (2023) and efficiently compute allocations which satisfy weighted notions of fairness like weighted leximin or min weighted $p$-mean malfare for any $p \ge 1$. △ Less

Submitted 10 March, 2023; originally announced March 2023.

arXiv:2302.06186 [pdf, other]

Multiscale Graph Neural Network Autoencoders for Interpretable Scientific Machine Learning

Authors: Shivam Barwey, Varun Shankar, Venkatasubramanian Viswanathan, Romit Maulik

Abstract: The goal of this work is to address two limitations in autoencoder-based models: latent space interpretability and compatibility with unstructured meshes. This is accomplished here with the development of a novel graph neural network (GNN) autoencoding architecture with demonstrations on complex fluid flow applications. To address the first goal of interpretability, the GNN autoencoder achieves re… ▽ More The goal of this work is to address two limitations in autoencoder-based models: latent space interpretability and compatibility with unstructured meshes. This is accomplished here with the development of a novel graph neural network (GNN) autoencoding architecture with demonstrations on complex fluid flow applications. To address the first goal of interpretability, the GNN autoencoder achieves reduction in the number nodes in the encoding stage through an adaptive graph reduction procedure. This reduction procedure essentially amounts to flowfield-conditioned node sampling and sensor identification, and produces interpretable latent graph representations tailored to the flowfield reconstruction task in the form of so-called masked fields. These masked fields allow the user to (a) visualize where in physical space a given latent graph is active, and (b) interpret the time-evolution of the latent graph connectivity in accordance with the time-evolution of unsteady flow features (e.g. recirculation zones, shear layers) in the domain. To address the goal of unstructured mesh compatibility, the autoencoding architecture utilizes a series of multi-scale message passing (MMP) layers, each of which models information exchange among node neighborhoods at various lengthscales. The MMP layer, which augments standard single-scale message passing with learnable coarsening operations, allows the decoder to more efficiently reconstruct the flowfield from the identified regions in the masked fields. Analysis of latent graphs produced by the autoencoder for various model settings are conducted using using unstructured snapshot data sourced from large-eddy simulations in a backward-facing step (BFS) flow configuration with an OpenFOAM-based flow solver at high Reynolds numbers. △ Less

Submitted 16 February, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

Comments: 30 pages, 17 figures. Correction: Fixed authorship

arXiv:2302.03087 [pdf, ps, other]

Dividing Good and Better Items Among Agents with Bivalued Submodular Valuations

Authors: Cyrus Cousins, Vignesh Viswanathan, Yair Zick

Abstract: We study the problem of fairly allocating a set of indivisible goods among agents with {\em bivalued submodular valuations} -- each good provides a marginal gain of either $a$ or $b$ ($a < b$) and goods have decreasing marginal gains. This is a natural generalization of two well-studied valuation classes -- bivalued additive valuations and binary submodular valuations. We present a simple sequenti… ▽ More We study the problem of fairly allocating a set of indivisible goods among agents with {\em bivalued submodular valuations} -- each good provides a marginal gain of either $a$ or $b$ ($a < b$) and goods have decreasing marginal gains. This is a natural generalization of two well-studied valuation classes -- bivalued additive valuations and binary submodular valuations. We present a simple sequential algorithmic framework, based on the recently introduced Yankee Swap mechanism, that can be adapted to compute a variety of solution concepts, including max Nash welfare (MNW), leximin and $p$-mean welfare maximizing allocations when $a$ divides $b$. This result is complemented by an existing result on the computational intractability of MNW and leximin allocations when $a$ does not divide $b$. We show that MNW and leximin allocations guarantee each agent at least $\frac25$ and $\frac{a}{b+2a}$ of their maximin share, respectively, when $a$ divides $b$. We also show that neither the leximin nor the MNW allocation is guaranteed to be envy free up to one good (EF1). This is surprising since for the simpler classes of bivalued additive valuations and binary submodular valuations, MNW allocations are known to be envy free up to any good (EFX). △ Less

Submitted 19 July, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

arXiv:2301.09892 [pdf, other]

Learning Effective Strategies for Moving Target Defense with Switching Costs

Authors: Vignesh Viswanathan, Megha Bose, Praveen Paruchuri

Abstract: Moving Target Defense (MTD) has emerged as a key technique in various security applications as it takes away the attacker's ability to perform reconnaissance for exploiting a system's vulnerabilities. However, most of the existing research in the field assumes unrealistic access to information about the attacker's motivations and/or actions when develo** MTD strategies. Many of the existing appr… ▽ More Moving Target Defense (MTD) has emerged as a key technique in various security applications as it takes away the attacker's ability to perform reconnaissance for exploiting a system's vulnerabilities. However, most of the existing research in the field assumes unrealistic access to information about the attacker's motivations and/or actions when develo** MTD strategies. Many of the existing approaches also assume complete knowledge regarding the vulnerabilities of a system and how each of these vulnerabilities can be exploited by an attacker. In this work, we aim to create algorithms that generate effective Moving Target Defense strategies that do not rely on prior knowledge about the attackers. Our work assumes that the only way the defender receives information about its own reward is via interaction with the attacker in a repeated game setting. Depending on the amount of information that can be obtained from the interactions, we devise two different algorithms using multi-armed bandit formulation to identify efficient strategies. We then evaluate our algorithms using data mined from the National Vulnerability Database to showcase that they match the performance of the state-of-the-art techniques, despite using a lot less amount of information. △ Less

Submitted 24 January, 2023; originally announced January 2023.

arXiv:2301.04035 [pdf, other]

doi 10.1016/j.icarus.2023.115426

Constraints on the lunar core viscosity from tidal deformation

Authors: Arthur Briaud, Agnès Fienga, Daniele Melini, Nicolas Rambaux, Anthony Mémin, Giorgio Spada, Christelle Saliby, Hauke Hussmann, Alexander Stark, Vishnu Viswanathan, Daniel Baguet

Abstract: We use the tidal deformations of the Moon induced by the Earth and the Sun as a tool for studying the inner structure of our satellite. Based on measurements of the degree-two tidal Love numbers k2 and h2 and dissipation coefficients from the GRAIL mission, Lunar Laser Ranging and Laser Altimetry on board of the LRO spacecraft, we perform Monte Carlo samplings for 120,000 possible combinations of… ▽ More We use the tidal deformations of the Moon induced by the Earth and the Sun as a tool for studying the inner structure of our satellite. Based on measurements of the degree-two tidal Love numbers k2 and h2 and dissipation coefficients from the GRAIL mission, Lunar Laser Ranging and Laser Altimetry on board of the LRO spacecraft, we perform Monte Carlo samplings for 120,000 possible combinations of thicknesses and viscosities for two classes of the lunar models. The first one includes a uniform core, a low viscosity zone (LVZ) at the core-mantle boundary, a mantle and a crust. The second one has an additional inner core. All models are consistent with the lunar total mass as well as its moment of inertia. By comparing predicted and observed parameters for the tidal deformations we find that the existence of an inner core cannot be ruled out. Furthermore, by deducing temperature profiles for the LVZ and an Earth-like mantle, we obtain stringent constraints on the radius (500 +- 1) km, viscosity,21 (4.5 +- 0.8) x 10^16 Pa.s and the density (3400 +- 10) kg/m^3 of the LVZ. We also infer the first estimation for the outer core viscosity, (2.07 +- 1.03) x 10^17 Pa.s, for two different possible structures: a Moon with a 70 km thick outer core and a large inner core (290 km radius with a density of 6000 kg/m3), and a Moon with a thicker outer core (169 km thick) but a denser and smaller inner core (219 km radius for 8000 kg/m^3). △ Less

Submitted 10 January, 2023; originally announced January 2023.

arXiv:2301.01323 [pdf, other]

Graphical House Allocation

Authors: Hadi Hosseini, Justin Payan, Rik Sengupta, Rohit Vaish, Vignesh Viswanathan

Abstract: The classical house allocation problem involves assigning $n$ houses (or items) to $n$ agents according to their preferences. A key criterion in such problems is satisfying some fairness constraints such as envy-freeness. We consider a generalization of this problem wherein the agents are placed along the vertices of a graph (corresponding to a social network), and each agent can only experience e… ▽ More The classical house allocation problem involves assigning $n$ houses (or items) to $n$ agents according to their preferences. A key criterion in such problems is satisfying some fairness constraints such as envy-freeness. We consider a generalization of this problem wherein the agents are placed along the vertices of a graph (corresponding to a social network), and each agent can only experience envy towards its neighbors. Our goal is to minimize the aggregate envy among the agents as a natural fairness objective, i.e., the sum of all pairwise envy values over all edges in a social graph. When agents have identical and evenly-spaced valuations, our problem reduces to the well-studied problem of linear arrangements. For identical valuations with possibly uneven spacing, we show a number of deep and surprising ways in which our setting is a departure from this classical problem. More broadly, we contribute several structural and computational results for various classes of graphs, including NP-hardness results for disjoint unions of paths, cycles, stars, or cliques, and fixed-parameter tractable (and, in some cases, polynomial-time) algorithms for paths, cycles, stars, cliques, and their disjoint unions. Additionally, a conceptual contribution of our work is the formulation of a structural property for disconnected graphs that we call separability which results in efficient parameterized algorithms for finding optimal allocations. △ Less

Submitted 18 September, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

arXiv:2212.06952 [pdf, other]

Nonequilibrium Electrochemical Phase Maps: Beyond Butler-Volmer Kinetics

Authors: Rachel C. Kurchin, Dhairya Gandhi, Venkatasubramanian Viswanathan

Abstract: Electrochemical kinetics at electrode-electrolyte interfaces are crucial to understand high-rate behavior of energy storage devices. Phase transformation of electrodes is typically treated under equilibrium thermodynamic conditions, while realistic operation is at finite rates. Analyzing phase transformations under nonequilibrium conditions requires integrating nonlinear electrochemical kinetic mo… ▽ More Electrochemical kinetics at electrode-electrolyte interfaces are crucial to understand high-rate behavior of energy storage devices. Phase transformation of electrodes is typically treated under equilibrium thermodynamic conditions, while realistic operation is at finite rates. Analyzing phase transformations under nonequilibrium conditions requires integrating nonlinear electrochemical kinetic models with thermodynamic models. This had only previously been demonstrated for Butler-Volmer kinetics, where it can be done analytically. In this work, we develop a kinetic modeling package in the Julia language capable of efficient numerical inversion of rate relationships for general kinetic models using automatic differentiation. We demonstrate building nonequilibrium phase maps, including for models such as Marcus-Hush-Chidsey that require computation of an integral, and also discuss the impact of a variety of assumptions and model parameters (such as temperature, reorganization energy, activity, and ideal solution interaction energy), particularly on high-rate phase behavior. Even for a fixed set of parameters, the magnitude of the critical current can vary by in excess of a factor of two amongst kinetic models. △ Less

Submitted 13 December, 2022; originally announced December 2022.

Comments: main text + supplementary info (15+6 pages, 3+2 figures), as submitted to ACS Energy Letters

arXiv:2211.10533 [pdf, other]

By how much can closed-loop frameworks accelerate computational materials discovery?

Authors: Lance Kavalsky, Vinay I. Hegde, Eric Muckley, Matthew S. Johnson, Bryce Meredig, Venkatasubramanian Viswanathan

Abstract: The implementation of automation and machine learning surrogatization within closed-loop computational workflows is an increasingly popular approach to accelerate materials discovery. However, the scale of the speedup associated with this paradigm shift from traditional manual approaches remains an open question. In this work, we rigorously quantify the acceleration from each of the components wit… ▽ More The implementation of automation and machine learning surrogatization within closed-loop computational workflows is an increasingly popular approach to accelerate materials discovery. However, the scale of the speedup associated with this paradigm shift from traditional manual approaches remains an open question. In this work, we rigorously quantify the acceleration from each of the components within a closed-loop framework for material hypothesis evaluation by identifying four distinct sources of speedup: (1) task automation, (2) calculation runtime improvements, (3) sequential learning-driven design space search, and (4) surrogatization of expensive simulations with machine learning models. This is done using a time-kee** ledger to record runs of automated software and corresponding manual computational experiments within the context of electrocatalysis. From a combination of the first three sources of acceleration, we estimate that overall hypothesis evaluation time can be reduced by over 90%, i.e., achieving a speedup of $\sim$$10\times$. Further, by introducing surrogatization into the loop, we estimate that the design time can be reduced by over 95%, i.e., achieving a speedup of $\sim$$15$-$20\times$. Our findings present a clear value proposition for utilizing closed-loop approaches for accelerating materials discovery. △ Less

Submitted 23 November, 2022; v1 submitted 18 November, 2022; originally announced November 2022.

Comments: added Supplementary Information

arXiv:2209.11614 [pdf, other]

Differentiable physics-enabled closure modeling for Burgers' turbulence

Authors: Varun Shankar, Vedant Puri, Ramesh Balakrishnan, Romit Maulik, Venkatasubramanian Viswanathan

Abstract: Data-driven turbulence modeling is experiencing a surge in interest following algorithmic and hardware developments in the data sciences. We discuss an approach using the differentiable physics paradigm that combines known physics with machine learning to develop closure models for Burgers' turbulence. We consider the 1D Burgers system as a prototypical test problem for modeling the unresolved ter… ▽ More Data-driven turbulence modeling is experiencing a surge in interest following algorithmic and hardware developments in the data sciences. We discuss an approach using the differentiable physics paradigm that combines known physics with machine learning to develop closure models for Burgers' turbulence. We consider the 1D Burgers system as a prototypical test problem for modeling the unresolved terms in advection-dominated turbulence problems. We train a series of models that incorporate varying degrees of physical assumptions on an a posteriori loss function to test the efficacy of models across a range of system parameters, including viscosity, time, and grid resolution. We find that constraining models with inductive biases in the form of partial differential equations that contain known physics or existing closure approaches produces highly data-efficient, accurate, and generalizable models, outperforming state-of-the-art baselines. Addition of structure in the form of physics information also brings a level of interpretability to the models, potentially offering a step** stone to the future of closure modeling. △ Less

Submitted 23 September, 2022; originally announced September 2022.

arXiv:2208.07311 [pdf, ps, other]

A General Framework for Fair Allocation under Matroid Rank Valuations

Authors: Vignesh Viswanathan, Yair Zick

Abstract: We study the problem of fairly allocating a set of indivisible goods among agents with matroid rank valuations -- every good provides a marginal value of $0$ or $1$ when added to a bundle and valuations are submodular. We generalize the Yankee Swap algorithm to create a simple framework, called General Yankee Swap, that can efficiently compute allocations that maximize any justice criterion (or fa… ▽ More We study the problem of fairly allocating a set of indivisible goods among agents with matroid rank valuations -- every good provides a marginal value of $0$ or $1$ when added to a bundle and valuations are submodular. We generalize the Yankee Swap algorithm to create a simple framework, called General Yankee Swap, that can efficiently compute allocations that maximize any justice criterion (or fairness objective) satisfying some mild assumptions. Along with maximizing a justice criterion, General Yankee Swap is guaranteed to maximize utilitarian social welfare, ensure strategyproofness and use at most a quadratic number of valuation queries. We show how General Yankee Swap can be used to compute allocations for five different well-studied justice criteria: (a) Prioritized Lorenz dominance, (b) Maximin fairness, (c) Weighted leximin, (d) Max weighted Nash welfare, and (e) Max weighted $p$-mean welfare. In particular, our framework provides the first polynomial time algorithms to compute weighted leximin, max weighted Nash welfare and max weighted $p$-mean welfare allocations for agents with matroid rank valuations. △ Less

Submitted 19 May, 2023; v1 submitted 15 August, 2022; originally announced August 2022.

arXiv:2208.05152 [pdf, other]

TagRec++: Hierarchical Label Aware Attention Network for Question Categorization

Authors: Venktesh Viswanathan, Mukesh Mohania, Vikram Goyal

Abstract: Online learning systems have multiple data repositories in the form of transcripts, books and questions. To enable ease of access, such systems organize the content according to a well defined taxonomy of hierarchical nature (subject-chapter-topic). The task of categorizing inputs to the hierarchical labels is usually cast as a flat multi-class classification problem. Such approaches ignore the se… ▽ More Online learning systems have multiple data repositories in the form of transcripts, books and questions. To enable ease of access, such systems organize the content according to a well defined taxonomy of hierarchical nature (subject-chapter-topic). The task of categorizing inputs to the hierarchical labels is usually cast as a flat multi-class classification problem. Such approaches ignore the semantic relatedness between the terms in the input and the tokens in the hierarchical labels. Alternate approaches also suffer from class imbalance when they only consider leaf level nodes as labels. To tackle the issues, we formulate the task as a dense retrieval problem to retrieve the appropriate hierarchical labels for each content. In this paper, we deal with categorizing questions. We model the hierarchical labels as a composition of their tokens and use an efficient cross-attention mechanism to fuse the information with the term representations of the content. We also propose an adaptive in-batch hard negative sampling approach which samples better negatives as the training progresses. We demonstrate that the proposed approach \textit{TagRec++} outperforms existing state-of-the-art approaches on question datasets as measured by Recall@k. In addition, we demonstrate zero-shot capabilities of \textit{TagRec++} and ability to adapt to label changes. △ Less

Submitted 10 August, 2022; originally announced August 2022.

Comments: 12 pages, double column, Under review at IEEE Transactions on Knwoledge and Data Engineering. arXiv admin note: text overlap with arXiv:2107.10649

arXiv:2206.08495 [pdf, ps, other]

Yankee Swap: a Fast and Simple Fair Allocation Mechanism for Matroid Rank Valuations

Authors: Vignesh Viswanathan, Yair Zick

Abstract: We study fair allocation of indivisible goods when agents have matroid rank valuations. Our main contribution is a simple algorithm based on the colloquial Yankee Swap procedure that computes provably fair and efficient Lorenz dominating allocations. While there exist polynomial time algorithms to compute such allocations, our proposed method improves on them in two ways. (a) Our approach is easy… ▽ More We study fair allocation of indivisible goods when agents have matroid rank valuations. Our main contribution is a simple algorithm based on the colloquial Yankee Swap procedure that computes provably fair and efficient Lorenz dominating allocations. While there exist polynomial time algorithms to compute such allocations, our proposed method improves on them in two ways. (a) Our approach is easy to understand and does not use complex matroid optimization algorithms as subroutines. (b) Our approach is scalable; it is provably faster than all known algorithms to compute Lorenz dominating allocations. These two properties are key to the adoption of algorithms in any real fair allocation setting; our contribution brings us one step closer to this goal. △ Less

Submitted 3 April, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

arXiv:2205.03885 [pdf, other]

Effect of disorder and do** on electronic structure and diffusion properties of Li$_{3}$V$_{2}$O$_{5}$

Authors: Mohammad Babar, Hasnain Hafiz, Zeeshan Ahmad, Bernardo Barbiellini, Arun Bansil, Venkatasubramanian Viswanathan

Abstract: V$_{2}$O$_{5}$ in its $ω$ phase (Li$_{3}$V$_{2}$O$_{5}$) with excess lithium is a potential alternative to the graphite anode for lithium-ion batteries at low temperature and fast charging conditions due to its safer voltage (0.6 V vs Li$^{+}$/Li(s)) and high lithium transport rate. In-operando cationic disorder, as observed in most ordered materials, can produce significant changes in charge comp… ▽ More V$_{2}$O$_{5}$ in its $ω$ phase (Li$_{3}$V$_{2}$O$_{5}$) with excess lithium is a potential alternative to the graphite anode for lithium-ion batteries at low temperature and fast charging conditions due to its safer voltage (0.6 V vs Li$^{+}$/Li(s)) and high lithium transport rate. In-operando cationic disorder, as observed in most ordered materials, can produce significant changes in charge compensation mechanisms, anionic activity, lithium diffusion and operational voltages. In this work, we report the variation in structural distortion, electronic structure and migration barrier accompanied by disorder using first-principles calculations. Due to segregation of lithium atoms in the disordered state, we observe greater distortion, emergence of metallic behaviour and potential anionic activity from non-bonding oxygen states near the Fermi level. Redox capacity can be tuned by do** with 3d metals which can adjust the participating cationic states, and by fluorine substitution which can stabilize or suppress anionic states. Moreover, suppression of anionic activity is found to decrease structural distortion, crucial for mitigating voltage fade and hysteresis. Diffusion barrier calculations in the presence of disorder indicate the activation of the remaining 3D-paths for lithium hop** which are unavailable in the ordered configuration, explaining its fast-charging ability observed in experiments. △ Less

Submitted 8 May, 2022; originally announced May 2022.

Comments: 23 pages, 6 figures

arXiv:2205.02289 [pdf, other]

A Dataset for N-ary Relation Extraction of Drug Combinations

Authors: Aryeh Tiktinsky, Vijay Viswanathan, Danna Niezni, Dana Meron Azagury, Yosi Shamay, Hillel Taub-Tabib, Tom Hope, Yoav Goldberg

Abstract: Combination therapies have become the standard of care for diseases such as cancer, tuberculosis, malaria and HIV. However, the combinatorial set of available multi-drug treatments creates a challenge in identifying effective combination therapies available in a situation. To assist medical professionals in identifying beneficial drug-combinations, we construct an expert-annotated dataset for extr… ▽ More Combination therapies have become the standard of care for diseases such as cancer, tuberculosis, malaria and HIV. However, the combinatorial set of available multi-drug treatments creates a challenge in identifying effective combination therapies available in a situation. To assist medical professionals in identifying beneficial drug-combinations, we construct an expert-annotated dataset for extracting information about the efficacy of drug combinations from the scientific literature. Beyond its practical utility, the dataset also presents a unique NLP challenge, as the first relation extraction dataset consisting of variable-length relations. Furthermore, the relations in this dataset predominantly require language understanding beyond the sentence level, adding to the challenge of this task. We provide a promising baseline model and identify clear areas for further improvement. We release our dataset, code, and baseline models publicly to encourage the NLP community to participate in this task. △ Less

Submitted 4 May, 2022; originally announced May 2022.

Comments: To appear in NAACL 2022

arXiv:2204.03094 [pdf, other]

Super-linear Scaling Behavior for Electric Vehicle Chargers and Road Map to Addressing the Infrastructure Gap

Authors: Alexius Wadell, Matthew Guttenberg, Christopher P. Kempes, Venkatasubramanian Viswanathan

Abstract: Enabling widespread electric vehicle (EV) adoption requires substantial build-out of charging infrastructure in the coming decade. We formulate the charging infrastructure needs as a scaling analysis problem and use it to estimate the EV infrastructure needs of the US at a county-level resolution. Surprisingly, we find that the current EV infrastructure deployment scales super-linearly with popula… ▽ More Enabling widespread electric vehicle (EV) adoption requires substantial build-out of charging infrastructure in the coming decade. We formulate the charging infrastructure needs as a scaling analysis problem and use it to estimate the EV infrastructure needs of the US at a county-level resolution. Surprisingly, we find that the current EV infrastructure deployment scales super-linearly with population, deviating from the sub-linear scaling of gasoline stations and other infrastructure. We discuss how this demonstrates the infancy of EV station abundance compared to other mature transportation infrastructures. By considering the power delivery of existing gasoline stations, and appropriate EV efficiencies, we estimate the EV infrastructure gap at the county level, providing a road map for future EV infrastructure expansion. Our reliance on scaling analysis allows us to make a unique forecast in this domain. △ Less

Submitted 6 April, 2022; originally announced April 2022.

Comments: 3 pages, 3 figures, 1 table

arXiv:2203.04698 [pdf, other]

Score-Based Generative Models for Molecule Generation

Authors: Dwaraknath Gnaneshwar, Bharath Ramsundar, Dhairya Gandhi, Rachel Kurchin, Venkatasubramanian Viswanathan

Abstract: Recent advances in generative models have made exploring design spaces easier for de novo molecule generation. However, popular generative models like GANs and normalizing flows face challenges such as training instabilities due to adversarial training and architectural constraints, respectively. Score-based generative models sidestep these challenges by modelling the gradient of the log probabili… ▽ More Recent advances in generative models have made exploring design spaces easier for de novo molecule generation. However, popular generative models like GANs and normalizing flows face challenges such as training instabilities due to adversarial training and architectural constraints, respectively. Score-based generative models sidestep these challenges by modelling the gradient of the log probability density using a score function approximation, as opposed to modelling the density function directly, and sampling from it using annealed Langevin Dynamics. We believe that score-based generative models could open up new opportunities in molecule generation due to their architectural flexibility, such as replacing the score function with an SE(3) equivariant model. In this work, we lay the foundations by testing the efficacy of score-based models for molecule generation. We train a Transformer-based score function on Self-Referencing Embedded Strings (SELFIES) representations of 1.5 million samples from the ZINC dataset and use the Moses benchmarking framework to evaluate the generated samples on a suite of metrics. △ Less

Submitted 7 March, 2022; originally announced March 2022.

arXiv:2202.12875 [pdf, other]

DataLab: A Platform for Data Analysis and Intervention

Authors: Yang Xiao, **lan Fu, Weizhe Yuan, Vijay Viswanathan, Zhoumianze Liu, Yixin Liu, Graham Neubig, Pengfei Liu

Abstract: Despite data's crucial role in machine learning, most existing tools and research tend to focus on systems on top of existing data rather than how to interpret and manipulate data. In this paper, we propose DataLab, a unified data-oriented platform that not only allows users to interactively analyze the characteristics of data, but also provides a standardized interface for different data processi… ▽ More Despite data's crucial role in machine learning, most existing tools and research tend to focus on systems on top of existing data rather than how to interpret and manipulate data. In this paper, we propose DataLab, a unified data-oriented platform that not only allows users to interactively analyze the characteristics of data, but also provides a standardized interface for different data processing operations. Additionally, in view of the ongoing proliferation of datasets, \toolname has features for dataset recommendation and global vision analysis that help researchers form a better view of the data ecosystem. So far, DataLab covers 1,715 datasets and 3,583 of its transformed version (e.g., hyponyms replacement), where 728 datasets support various analyses (e.g., with respect to gender bias) with the help of 140M samples annotated by 318 feature functions. DataLab is under active development and will be supported going forward. We have released a web platform, web API, Python SDK, PyPI published package and online documentation, which hopefully, can meet the diverse needs of researchers. △ Less

Submitted 25 February, 2022; originally announced February 2022.

Comments: DataLab Web Platform: http://datalab.nlpedia.ai/

arXiv:2202.10946 [pdf, other]

Relaxations of Envy-Freeness Over Graphs

Authors: Justin Payan, Rik Sengupta, Vignesh Viswanathan

Abstract: When allocating a set of indivisible items among agents, the ideal condition of envy-freeness cannot always be achieved. Envy-freeness up to any good (EFX), and envy-freeness with $k$ hidden items (HEF-$k$) are two very compelling relaxations of envy-freeness, which remain elusive in many settings. We study a natural relaxation of these two fairness constraints, where we place the agents on the ve… ▽ More When allocating a set of indivisible items among agents, the ideal condition of envy-freeness cannot always be achieved. Envy-freeness up to any good (EFX), and envy-freeness with $k$ hidden items (HEF-$k$) are two very compelling relaxations of envy-freeness, which remain elusive in many settings. We study a natural relaxation of these two fairness constraints, where we place the agents on the vertices of an undirected graph, and only require that our allocations satisfy the EFX (resp. HEF) constraint on the edges of the graph. We refer to these allocations as graph-EFX (resp. graph-HEF) or simply $G$-EFX (resp. $G$-HEF) allocations. We show that for any graph $G$, there always exists a $G$-HEF-$k$ allocation of goods, where $k$ is the size of a minimum vertex cover of $G$, and that this is essentially tight. We show that $G$-EFX allocations of goods exist for three different classes of graphs -- two of them generalizing the star $K_{1, n-1}$ and the third generalizing the three-edge path $P_4$. Many of these results extend to allocations of chores as well. Overall, we show several natural settings in which the graph structure helps obtain strong fairness guarantees. Finally, we evaluate an algorithm using problem instances from Spliddit to show that $G$-EFX allocations appear to exist for paths $P_n$, pointing the way towards showing EFX for even broader families of graphs. △ Less

Submitted 3 January, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

arXiv:2111.14786 [pdf, other]

doi 10.1038/s41467-022-32938-1

Autonomous optimization of nonaqueous battery electrolytes via robotic experimentation and machine learning

Authors: Adarsh Dave, Jared Mitchell, Sven Burke, Hongyi Lin, Jay Whitacre, Venkatasubramanian Viswanathan

Abstract: In this work, we introduce a novel workflow that couples robotics to machine-learning for efficient optimization of a non-aqueous battery electrolyte. A custom-built automated experiment named "Clio" is coupled to Dragonfly - a Bayesian optimization-based experiment planner. Clio autonomously optimizes electrolyte conductivity over a single-salt, ternary solvent design space. Using this workflow,… ▽ More In this work, we introduce a novel workflow that couples robotics to machine-learning for efficient optimization of a non-aqueous battery electrolyte. A custom-built automated experiment named "Clio" is coupled to Dragonfly - a Bayesian optimization-based experiment planner. Clio autonomously optimizes electrolyte conductivity over a single-salt, ternary solvent design space. Using this workflow, we identify 6 fast-charging electrolytes in 2 work-days and 42 experiments (compared with 60 days using exhaustive search of the 1000 possible candidates, or 6 days assuming only 10% of candidates are evaluated). Our method finds the highest reported conductivity electrolyte in a design space heavily explored by previous literature, converging on a high-conductivity mixture that demonstrates subtle electrolyte chemical physics. △ Less

Submitted 22 November, 2021; originally announced November 2021.

Comments: 26 pages, 5 Figures, 7 Extended Data Figures

arXiv:2110.11528 [pdf, other]

doi 10.1063/5.0122115

Validation and parameterization of a novel physics-constrained neural dynamics model applied to turbulent fluid flow

Authors: Varun Shankar, Gavin D. Portwood, Arvind T. Mohan, Peetak P. Mitra, Dilip Krishnamurthy, Christopher Rackauckas, Lucas A. Wilson, David P. Schmidt, Venkatasubramanian Viswanathan

Abstract: In fluid physics, data-driven models to enhance or accelerate solution methods are becoming increasingly popular for many application domains, such as alternatives to turbulence closures, system surrogates, or for new physics discovery. In the context of reduced order models of high-dimensional time-dependent fluid systems, machine learning methods grant the benefit of automated learning from data… ▽ More In fluid physics, data-driven models to enhance or accelerate solution methods are becoming increasingly popular for many application domains, such as alternatives to turbulence closures, system surrogates, or for new physics discovery. In the context of reduced order models of high-dimensional time-dependent fluid systems, machine learning methods grant the benefit of automated learning from data, but the burden of a model lies on its reduced-order representation of both the fluid state and physical dynamics. In this work, we build a physics-constrained, data-driven reduced order model for the Navier-Stokes equations to approximate spatio-temporal turbulent fluid dynamics. The model design choices mimic numerical and physical constraints by, for example, implicitly enforcing the incompressibility constraint and utilizing continuous Neural Ordinary Differential Equations for tracking the evolution of the differential equation. We demonstrate this technique on three-dimensional, moderate Reynolds number turbulent fluid flow. In assessing the statistical quality and characteristics of the machine-learned model through rigorous diagnostic tests, we find that our model is capable of reconstructing the dynamics of the flow over large integral timescales, favoring accuracy at the larger length scales. More significantly, comprehensive diagnostics suggest that physically-interpretable model parameters, corresponding to the representations of the fluid state and dynamics, have attributable and quantifiable impact on the quality of the model predictions and computational complexity. △ Less

Submitted 21 October, 2021; originally announced October 2021.

Comments: Submitted to Physical Review Fluids

arXiv:2109.08332 [pdf, other]

Cell-Level State of Charge Estimation for Battery Packs Under Minimal Sensing

Authors: Dong Zhang, Luis D. Couto, Ross Drummond, Shashank Sripad, Venkatasubramanian Viswanathan

Abstract: This manuscript presents an algorithm for individual Lithium-ion (Li-ion) battery cell state of charge (SOC) estimation in a large-scale battery pack under minimal sensing, where only pack-level voltage and current are measured. For battery packs consisting of up to thousands of cells in electric vehicle or stationary energy storage applications, it is desirable to estimate individual cell SOCs wi… ▽ More This manuscript presents an algorithm for individual Lithium-ion (Li-ion) battery cell state of charge (SOC) estimation in a large-scale battery pack under minimal sensing, where only pack-level voltage and current are measured. For battery packs consisting of up to thousands of cells in electric vehicle or stationary energy storage applications, it is desirable to estimate individual cell SOCs without cell local measurements in order to reduce sensing costs. Mathematically, pure series connected cells yield dynamics given by ordinary differential equations under classical full voltage sensing. In contrast, parallel--series connected battery packs are evidently more challenging because the dynamics are governed by a nonlinear differential--algebraic equations (DAE) system. The majority of the conventional studies on SOC estimation for battery packs benefit from idealizing the pack as a lumped single cell which ultimately lose track of cell-level conditions and are blind to potential risks of cell-level over-charge and over-discharge. This work explicitly models a battery pack with high fidelity cell-by-cell resolution based on the interconnection of single cell models, and examines the observability of cell-level state with only pack-level measurements. A DAE-based state observer with linear output error injection is formulated, where the individual cell SOC and current can be reconstructed from minimal number of pack sensing. The mathematically guaranteed asymptotic convergence of differential and algebraic state estimates is established by considering local Lipschitz continuity property of system nonlinearities. Simulation results for Graphite/NMC cells illustrate convergence for cell SOCs, currents, and voltages. △ Less

Submitted 16 September, 2021; originally announced September 2021.

arXiv:2109.07573 [pdf, other]

Differentiable Physics: A Position Piece

Authors: Bharath Ramsundar, Dilip Krishnamurthy, Venkatasubramanian Viswanathan

Abstract: Differentiable physics provides a new approach for modeling and understanding the physical systems by pairing the new technology of differentiable programming with classical numerical methods for physical simulation. We survey the rapidly growing literature of differentiable physics techniques and highlight methods for parameter estimation, learning representations, solving differential equations,… ▽ More Differentiable physics provides a new approach for modeling and understanding the physical systems by pairing the new technology of differentiable programming with classical numerical methods for physical simulation. We survey the rapidly growing literature of differentiable physics techniques and highlight methods for parameter estimation, learning representations, solving differential equations, and develo** what we call scientific foundation models using data and inductive priors. We argue that differentiable physics offers a new paradigm for modeling physical phenomena by combining classical analytic solutions with numerical methodology using the bridge of differentiable programming. △ Less

Submitted 14 September, 2021; originally announced September 2021.

Comments: 12 pages, 1 figure

arXiv:2109.07278 [pdf]

Principles of the Battery Data Genome

Authors: Logan Ward, Susan Babinec, Eric J. Dufek, David A. Howey, Venkatasubramanian Viswanathan, Muratahan Aykol, David A. C. Beck, Ben Blaiszik, Bor-Rong Chen, George Crabtree, Valerio de Angelis, Philipp Dechent, Matthieu Dubarry, Erica E. Eggleton, Donal P. Finegan, Ian Foster, Chirranjeevi Gopal, Patrick Herring, Victor W. Hu, Noah H. Paulson, Yuliya Preger, Dirk Uwe Sauer, Kandler Smith, Seth Snyder, Shashank Sripad , et al. (2 additional authors not shown)

Abstract: Electrochemical energy storage is central to modern society -- from consumer electronics to electrified transportation and the power grid. It is no longer just a convenience but a critical enabler of the transition to a resilient, low-carbon economy. The large pluralistic battery research and development community serving these needs has evolved into diverse specialties spanning materials discover… ▽ More Electrochemical energy storage is central to modern society -- from consumer electronics to electrified transportation and the power grid. It is no longer just a convenience but a critical enabler of the transition to a resilient, low-carbon economy. The large pluralistic battery research and development community serving these needs has evolved into diverse specialties spanning materials discovery, battery chemistry, design innovation, scale-up, manufacturing and deployment. Despite the maturity and the impact of battery science and technology, the data and software practices among these disparate groups are far behind the state-of-the-art in other fields (e.g. drug discovery), which have enjoyed significant increases in the rate of innovation. Incremental performance gains and lost research productivity, which are the consequences, retard innovation and societal progress. Examples span every field of battery research , from the slow and iterative nature of materials discovery, to the repeated and time-consuming performance testing of cells and the mitigation of degradation and failures. The fundamental issue is that modern data science methods require large amounts of data and the battery community lacks the requisite scalable, standardized data hubs required for immediate use of these approaches. Lack of uniform data practices is a central barrier to the scale problem. In this perspective we identify the data- and software-sharing gaps and propose the unifying principles and tools needed to build a robust community of data hubs, which provide flexible sharing formats to address diverse needs. The Battery Data Genome is offered as a data-centric initiative that will enable the transformative acceleration of battery science and technology, and will ultimately serve as a catalyst to revolutionize our approach to innovation. △ Less

Submitted 3 December, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

Comments: corrected author list

Showing 1–50 of 115 results for author: Viswanathan, V