Showing 1–50 of 69 results for author: Ashesh

  1. arXiv:2406.03689  [pdf, other]

    cs.CL cs.AI

    Evaluating the World Model Implicit in a Generative Model

    Authors: Keyon Vafa, Justin Y. Chen, Jon Kleinberg, Sendhil Mullainathan, Ashesh Rambachan

    Abstract: Recent work suggests that large language models may implicitly learn world models. How should we assess this possibility? We formalize this question for the case where the underlying reality is governed by a deterministic finite automaton. This includes problems as diverse as simple logical reasoning, geographic navigation, game-playing, and chemistry. We propose new evaluation metrics for world m…

    Submitted 22 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2406.01382  [pdf, other]

    cs.CL cs.AI

    Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function

    Authors: Keyon Vafa, Ashesh Rambachan, Sendhil Mullainathan

    Abstract: What makes large language models (LLMs) impressive is also what makes them hard to evaluate: their diversity of uses. To evaluate these models, we must understand the purposes they will be used for. We consider a setting where these deployment decisions are made by people, and in particular, people's beliefs about where an LLM will perform well. We model such beliefs as the consequence of a human…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: To appear in ICML 2024

  3. arXiv:2405.16297  [pdf, other]

    cs.LG physics.ao-ph physics.comp-ph

    LUCIE: A Lightweight Uncoupled ClImate Emulator with long-term stability and physical consistency for O(1000)-member ensembles

    Authors: Haiwen Guan, Troy Arcomano, Ashesh Chattopadhyay, Romit Maulik

    Abstract: We present LUCIE, a $1000$-member ensemble data-driven atmospheric emulator that remains stable during autoregressive inference for thousands of years without a drifting climatology. LUCIE has been trained on $9.5$ years of coarse-resolution ERA5 data with $4$ prognostic variables on a single A100 GPU for $2.4$ h. Owing to the cheap computational cost of inference, $1000$ model ensembles are exec…

    Submitted 25 May, 2024; originally announced May 2024.

  4. arXiv:2404.19035  [pdf, other]

    physics.flu-dyn

    Improved pressure-gradient sensor for the prediction of separation onset in RANS models

    Authors: Kevin Patrick Griffin, Ganesh Vijayakumar, Ashesh Sharma, Michael A. Sprague

    Abstract: We improve upon two key aspects of the Menter shear stress transport (SST) turbulence model: (1) We propose a more robust adverse pressure gradient sensor based on the strength of the pressure gradient in the direction of the local mean flow; (2) We propose two alternative eddy viscosity models to be used in the adverse pressure gradient regions identified by our sensor. Direct numerical simulatio…

    Submitted 29 April, 2024; originally announced April 2024.

  5. arXiv:2404.10111  [pdf, other]

    econ.EM

    From Predictive Algorithms to Automatic Generation of Anomalies

    Authors: Sendhil Mullainathan, Ashesh Rambachan

    Abstract: Machine learning algorithms can find predictive signals that researchers fail to notice; yet they are notoriously hard-to-interpret. How can we extract theoretical insights from these black boxes? History provides a clue. Facing a similar problem -- how to extract theoretical insights from their intuitions -- researchers often turned to ``anomalies:'' constructed examples that highlight flaws in a…

    Submitted 15 April, 2024; originally announced April 2024.

  6. arXiv:2403.11854  [pdf, other]

    eess.IV cs.CV

    denoiSplit: a method for joint image splitting and unsupervised denoising

    Authors: Ashesh Ashesh, Florian Jug

    Abstract: In this work we present denoiSplit, a method to tackle a new analysis task, i.e. the challenge of joint semantic image splitting and unsupervised denoising. This dual approach has important applications in fluorescence microscopy, where semantic image splitting has important applications but noise does generally hinder the downstream analysis of image content. Image splitting involves dissecting a…

    Submitted 25 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  7. arXiv:2403.00271  [pdf]

    physics.med-ph

    Assessing Bilateral Neurovascular Bundles Function with Pulsed Wave Doppler Ultrasound: Implications for Reducing Erectile Dysfunction Following Prostate Radiotherapy

    Authors: Jing Wang, Xiaofeng Yang, Boran Zhou, James Sohn, Richard Qiu, Pretesh Patel, Ashesh B. Jani, Tian Liu

    Abstract: This study aims to evaluate the functional status of bilateral neurovascular bundles (NVBs) using pulsed wave Doppler ultrasound in patients undergoing prostate radiotherapy (RT). Sixty-two patients (mean age: 66.1 +/- 7.2 years) underwent transrectal ultrasound scan using a conventional ultrasound scanner, a 7.5 MHz bi-plane probe and a mechanical stepper. The ultrasound protocol comprised 3 step…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures

    MSC Class: 68U10

  8. arXiv:2401.17671  [pdf, other]

    cs.CL cs.AI q-bio.NC

    Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain

    Authors: Gavin Mischler, Yinghao Aaron Li, Stephan Bickel, Ashesh D. Mehta, Nima Mesgarani

    Abstract: Recent advancements in artificial intelligence have sparked interest in the parallels between large language models (LLMs) and human neural processing, particularly in language comprehension. While prior research has established similarities in the representation of LLMs and the brain, the underlying computational principles that cause this convergence, especially in the context of evolving LLMs,…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 19 pages, 5 figures and 4 supplementary figures

  9. arXiv:2311.17078  [pdf, other]

    physics.ao-ph

    Data Imbalance, Uncertainty Quantification, and Generalization via Transfer Learning in Data-driven Parameterizations: Lessons from the Emulation of Gravity Wave Momentum Transport in WACCM

    Authors: Y. Qiang Sun, Hamid A. Pahlavan, Ashesh Chattopadhyay, Pedram Hassanzadeh, Sandro W. Lubis, M. Joan Alexander, Edwin Gerber, Aditi Sheshadri, Yifei Guan

    Abstract: Neural networks (NNs) are increasingly used for data-driven subgrid-scale parameterization in weather and climate models. While NNs are powerful tools for learning complex nonlinear relationships from data, there are several challenges in using them for parameterizations. Three of these challenges are 1) data imbalance related to learning rare (often large-amplitude) samples; 2) uncertainty quanti…

    Submitted 27 November, 2023; originally announced November 2023.

  10. arXiv:2310.00813  [pdf, other]

    cs.LG cs.AI nlin.CD physics.ao-ph

    OceanNet: A principled neural operator-based digital twin for regional oceans

    Authors: Ashesh Chattopadhyay, Michael Gray, Tianning Wu, Anna B. Lowe, Ruoying He

    Abstract: While data-driven approaches demonstrate great potential in atmospheric modeling and weather forecasting, ocean modeling poses distinct challenges due to complex bathymetry, land, vertical structure, and flow non-linearity. This study introduces OceanNet, a principled neural operator-based digital twin for ocean circulation. OceanNet uses a Fourier neural operator and predictor-evaluate-corrector…

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: Supplementary information can be found in: https://drive.google.com/file/d/1NoxJLa967naJT787a5-IfZ7f_MmRuZMP/view?usp=sharing

  11. arXiv:2309.17434  [pdf, other]

    cond-mat.soft cond-mat.stat-mech physics.bio-ph physics.chem-ph

    Local Changes in Protein Filament Properties Drive Large-Scale Membrane Transformations Involved in Endosome Tethering and Fusion

    Authors: Ashesh Ghosh, Andrew J. Spakowitz

    Abstract: Large-scale cellular transformations are triggered by subtle physical and structural changes in individual biomacromolecular and membrane components. A prototypical example of such an event is the orchestrated fusion of membranes within an endosome that enables transport of cargo and processing of biochemical moieties. In this work, we demonstrate how protein filaments on the endosomal membrane su…

    Submitted 29 September, 2023; originally announced September 2023.

  12. arXiv:2309.13211  [pdf, other]

    physics.comp-ph physics.ao-ph physics.data-an stat.CO

    Interpretable structural model error discovery from sparse assimilation increments using spectral bias-reduced neural networks: A quasi-geostrophic turbulence test case

    Authors: Rambod Mojgani, Ashesh Chattopadhyay, Pedram Hassanzadeh

    Abstract: Earth system models suffer from various structural and parametric errors in their representation of nonlinear, multi-scale processes, leading to uncertainties in their long-term projections. The effects of many of these errors (particularly those due to fast physics) can be quantified in short-term simulations, e.g., as differences between the predicted and observed states (analysis increments). W…

    Submitted 15 February, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 26 pages, 5+1 figures

  13. arXiv:2306.05014  [pdf, other]

    physics.flu-dyn cs.LG physics.ao-ph

    Learning Closed-form Equations for Subgrid-scale Closures from High-fidelity Data: Promises and Challenges

    Authors: Karan Jakhar, Yifei Guan, Rambod Mojgani, Ashesh Chattopadhyay, Pedram Hassanzadeh

    Abstract: There is growing interest in discovering interpretable, closed-form equations for subgrid-scale (SGS) closures/parameterizations of complex processes in Earth systems. Here, we apply a common equation-discovery technique with expansive libraries to learn closures from filtered direct numerical simulations of 2D turbulence and Rayleigh-Bénard convection (RBC). Across common filters (e.g., Gaussian,…

    Submitted 7 July, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 40 pages, 4 figures. The code for 2D-FHIT solver "py2d" is available at https://github.com/envfluids/py2d. The code and data used for analysis in this work can be found at https://github.com/jakharkaran/EqsDiscovery_2D-FHIT_RBC and https://doi.org/10.5281/zenodo.7500647, respectively

    MSC Class: 76F65 (Primary) 86A08; 68T01; 76F05; 76F35 (Secondary) ACM Class: J.2; I.2.0; G.1.8

  14. arXiv:2305.00385  [pdf]

    eess.IV cs.CV

    Cross-Shaped Windows Transformer with Self-supervised Pretraining for Clinically Significant Prostate Cancer Detection in Bi-parametric MRI

    Authors: Yuheng Li, Jacob Wynne, Jing Wang, Richard L. J. Qiu, Justin Roper, Shaoyan Pan, Ashesh B. Jani, Tian Liu, Pretesh R. Patel, Hui Mao, Xiaofeng Yang

    Abstract: Biparametric magnetic resonance imaging (bpMRI) has demonstrated promising results in prostate cancer (PCa) detection using convolutional neural networks (CNNs). Recently, transformers have achieved competitive performance compared to CNNs in computer vision. Large scale transformers need abundant annotated data for training, which are difficult to obtain in medical imaging. Self-supervised learni…

    Submitted 17 March, 2024; v1 submitted 30 April, 2023; originally announced May 2023.

  15. arXiv:2304.07029  [pdf, other]

    physics.flu-dyn cs.AI cs.LG math.NA physics.ao-ph

    Long-term instabilities of deep learning-based digital twins of the climate system: The cause and a solution

    Authors: Ashesh Chattopadhyay, Pedram Hassanzadeh

    Abstract: Long-term stability is a critical property for deep learning-based data-driven digital twins of the Earth system. Such data-driven digital twins enable sub-seasonal and seasonal predictions of extreme environmental events, probabilistic forecasts, that require a large number of ensemble members, and computationally tractable high-resolution Earth system models where expensive components of the mod…

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Supplementary information is given at https://drive.google.com/file/d/1J0k20Qk___PbDQob0Z4vnSVWEpnDFlif/view?usp=share_link

  16. arXiv:2303.16969  [pdf, other]

    cond-mat.soft cond-mat.stat-mech

    Importance of Many Particle Correlations to the Collective Debye-Waller Factor in a Single-Particle Activated Dynamic Theory of the Glass Transition

    Authors: Ashesh Ghosh

    Abstract: We theoretically study the importance of many body correlations on the collective Debye Waller (DW) factor in the context of the Nonlinear Langevin Equation (NLE) single particle activated dynamics theory of glass transition and its extension to include collective elasticity (ECNLE theory). This microscopic force-based approach envisions structural alpha relaxation as a coupled local-nonlocal proc…

    Submitted 29 March, 2023; originally announced March 2023.

  17. arXiv:2212.09844  [pdf, other]

    econ.EM cs.CY cs.LG stat.ME

    Robust Design and Evaluation of Predictive Algorithms under Unobserved Confounding

    Authors: Ashesh Rambachan, Amanda Coston, Edward Kennedy

    Abstract: Predictive algorithms inform consequential decisions in settings where the outcome is selectively observed given choices made by human decision makers. We propose a unified framework for the robust design and evaluation of predictive algorithms in selectively observed data. We impose general assumptions on how much the outcome may vary on average between unselected and selected units conditional o…

    Submitted 19 May, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

  18. arXiv:2211.12872  [pdf, other]

    cs.CV cs.LG

    μSplit: efficient image decomposition for microscopy data

    Authors: Ashesh, Alexander Krull, Moises Di Sante, Francesco Silvio Pasqualini, Florian Jug

    Abstract: We present μSplit, a dedicated approach for trained image decomposition in the context of fluorescence microscopy images. We find that best results using regular deep architectures are achieved when large image patches are used during training, making memory consumption the limiting factor to further improving performance. We therefore introduce lateral contextualization (LC), a novel meta-archite…

    Submitted 16 August, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Published at ICCV 2023. 10 pages, 7 figures, 9 pages supplement, 8 supplementary figures

  19. arXiv:2208.08419  [pdf, other]

    cond-mat.soft cond-mat.stat-mech

    Microscopic Activated Dynamics Theory of the Shear Rheology and Stress Overshoot in Ultra-Dense Glass-Forming Fluids and Colloidal Suspensions

    Authors: Ashesh Ghosh, Kenneth S. Schweizer

    Abstract: We formulate a microscopic, force-level, activated dynamics-based statistical-mechanical theory for the continuous startup nonlinear shear-rheology of ultra-dense glass-forming hard-sphere fluids and colloidal suspensions in the context of the ECNLE approach. Activated structural relaxation is described as a coupled local-nonlocal event involving caging and longer-range collective elasticity which…

    Submitted 17 August, 2022; originally announced August 2022.

  20. arXiv:2206.04811  [pdf, other]

    cs.LG physics.comp-ph physics.data-an physics.flu-dyn physics.geo-ph

    Deep learning-enhanced ensemble-based data assimilation for high-dimensional nonlinear dynamical systems

    Authors: Ashesh Chattopadhyay, Ebrahim Nabizadeh, Eviatar Bach, Pedram Hassanzadeh

    Abstract: Data assimilation (DA) is a key component of many forecasting models in science and engineering. DA allows one to estimate better initial conditions using an imperfect dynamical model of the system and noisy/sparse observations available from the system. Ensemble Kalman filter (EnKF) is a DA algorithm that is widely used in applications involving high-dimensional nonlinear dynamical systems. Howev…

    Submitted 9 June, 2022; originally announced June 2022.

  21. arXiv:2206.03198  [pdf, other]

    physics.flu-dyn cs.LG physics.comp-ph

    Explaining the physics of transfer learning a data-driven subgrid-scale closure to a different turbulent flow

    Authors: Adam Subel, Yifei Guan, Ashesh Chattopadhyay, Pedram Hassanzadeh

    Abstract: Transfer learning (TL) is becoming a powerful tool in scientific applications of neural networks (NNs), such as weather/climate prediction and turbulence modeling. TL enables out-of-distribution generalization (e.g., extrapolation in parameters) and effective blending of disparate training sets (e.g., simulations and observations). In TL, selected layers of a NN, already trained for a base system,…

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: 21 pages, 6 figures

  22. arXiv:2205.04601  [pdf, other]

    cs.LG nlin.CD physics.ao-ph physics.flu-dyn physics.geo-ph

    Long-term stability and generalization of observationally-constrained stochastic data-driven models for geophysical turbulence

    Authors: Ashesh Chattopadhyay, Jaideep Pathak, Ebrahim Nabizadeh, Wahid Bhimji, Pedram Hassanzadeh

    Abstract: Recent years have seen a surge in interest in building deep learning-based fully data-driven models for weather prediction. Such deep learning models if trained on observations can mitigate certain biases in current state-of-the-art weather models, some of which stem from inaccurate representation of subgrid-scale processes. However, these data-driven models, being over-parameterized, require a lo…

    Submitted 9 May, 2022; originally announced May 2022.

  23. arXiv:2202.11214  [pdf, other]

    physics.ao-ph cs.LG

    FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators

    Authors: Jaideep Pathak, Shashank Subramanian, Peter Harrington, Sanjeev Raja, Ashesh Chattopadhyay, Morteza Mardani, Thorsten Kurth, David Hall, Zongyi Li, Kamyar Azizzadenesheli, Pedram Hassanzadeh, Karthik Kashinath, Animashree Anandkumar

    Abstract: FourCastNet, short for Fourier Forecasting Neural Network, is a global data-driven weather forecasting model that provides accurate short to medium-range global predictions at $0.25^{\circ}$ resolution. FourCastNet accurately forecasts high-resolution, fast-timescale variables such as the surface wind speed, precipitation, and atmospheric water vapor. It has important implications for planning win…

    Submitted 22 February, 2022; originally announced February 2022.

  24. arXiv:2201.07347  [pdf, ps, other]

    physics.flu-dyn physics.comp-ph

    Learning physics-constrained subgrid-scale closures in the small-data regime for stable and accurate LES

    Authors: Yifei Guan, Adam Subel, Ashesh Chattopadhyay, Pedram Hassanzadeh

    Abstract: We demonstrate how incorporating physics constraints into convolutional neural networks (CNNs) enables learning subgrid-scale (SGS) closures for stable and accurate large-eddy simulations (LES) in the small-data regime (i.e., when the availability of high-quality training data is limited). Using several setups of forced 2D turbulence as the testbeds, we examine the {\it a priori} and {\it a poster…

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: 23 pages, 9 figures

  25. arXiv:2201.02702  [pdf]

    math.DS cs.LG math.OC stat.AP stat.ME

    An Improved Mathematical Model of Sepsis: Modeling, Bifurcation Analysis, and Optimal Control Study for Complex Nonlinear Infectious Disease System

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: Sepsis is a life-threatening medical emergency, which is a major cause of death worldwide and the second highest cause of mortality in the United States. Researching the optimal control treatment or intervention strategy on the comprehensive sepsis system is key in reducing mortality. For this purpose, first, this paper improves a complex nonlinear sepsis model proposed in our previous work. Then,…

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: 25 pages, 7 figures, 1 table

  26. arXiv:2201.00147  [pdf]

    cs.LG math.OC stat.AP stat.ME

    High-dimensional Bayesian Optimization Algorithm with Recurrent Neural Network for Disease Control Models in Time Series

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: Bayesian Optimization algorithm has become a promising approach for nonlinear global optimization problems and many machine learning applications. Over the past few years, improvements and enhancements have been brought forward and they have shown some promising results in solving the complex dynamic problems, systems of ordinary differential equations where the objective functions are computation…

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: 16 pages, 9 figures, 2 tables

  27. arXiv:2110.00546  [pdf, other]

    physics.comp-ph math.NA physics.flu-dyn stat.CO stat.ML

    Discovery of interpretable structural model errors by combining Bayesian sparse regression and data assimilation: A chaotic Kuramoto-Sivashinsky test case

    Authors: Rambod Mojgani, Ashesh Chattopadhyay, Pedram Hassanzadeh

    Abstract: Models of many engineering and natural systems are imperfect. The discrepancy between the mathematical representations of a true physical system and its imperfect model is called the model error. These model errors can lead to substantial differences between the numerical solutions of the model and the state of the system, particularly in those involving nonlinear, multi-scale phenomena. Thus, the…

    Submitted 2 June, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: 9 pages, 2 figures

    Journal ref: Chaos 32, 061105 (2022)

  28. arXiv:2109.13602  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

    Authors: Matt Vitelli, Yan Chang, Yawei Ye, Maciej Wołczyk, Błażej Osiński, Moritz Niendorf, Hugo Grimmett, Qiangui Huang, Ashesh Jain, Peter Ondruska

    Abstract: In this paper we present the first safe system for full control of self-driving vehicles trained from human demonstrations and deployed in challenging, real-world, urban environments. Current industry-standard solutions use rule-based systems for planning. Although they perform reasonably well in common scenarios, the engineering complexity renders this approach incompatible with human-level perfo…

    Submitted 28 September, 2021; originally announced September 2021.

  29. arXiv:2108.02289  [pdf]

    cs.LG math.OC stat.AP stat.ME

    High dimensional Bayesian Optimization Algorithm for Complex System in Time Series

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: At present, high-dimensional global optimization problems with time-series models have received much attention from engineering fields. Since it was proposed, Bayesian optimization has quickly become a popular and promising approach for solving global optimization problems. However, the standard Bayesian optimization algorithm is insufficient to solving the global optimal solution when the model i…

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: 18 pages, 13 figures

  30. arXiv:2108.00062  [pdf]

    stat.ME math.OC stat.AP

    A New Bayesian Optimization Algorithm for Complex High-Dimensional Disease Epidemic Systems

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: This paper presents an Improved Bayesian Optimization (IBO) algorithm to solve complex high-dimensional epidemic models' optimal control solution. Evaluating the total objective function value for disease control models with hundreds of thousands of control time periods is a high computational cost. In this paper, we improve the conventional Bayesian Optimization (BO) approach from two parts. The…

    Submitted 30 July, 2021; originally announced August 2021.

    Comments: 17 pages, 14 figures

  31. arXiv:2107.08142  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Autonomy 2.0: Why is self-driving always 5 years away?

    Authors: Ashesh Jain, Luca Del Pero, Hugo Grimmett, Peter Ondruska

    Abstract: Despite the numerous successes of machine learning over the past decade (image recognition, decision-making, NLP, image synthesis), self-driving technology has not yet followed the same trend. In this paper, we study the history, composition, and development bottlenecks of the modern self-driving stack. We argue that the slow progress is caused by approaches that require too much hand-engineering,…

    Submitted 9 August, 2021; v1 submitted 16 July, 2021; originally announced July 2021.

  32. arXiv:2103.09360  [pdf, other]

    physics.ao-ph cs.AI cs.LG physics.comp-ph

    Towards physically consistent data-driven weather forecasting: Integrating data assimilation with equivariance-preserving deep spatial transformers

    Authors: Ashesh Chattopadhyay, Mustafa Mustafa, Pedram Hassanzadeh, Eviatar Bach, Karthik Kashinath

    Abstract: There is growing interest in data-driven weather prediction (DDWP), for example using convolutional neural networks such as U-NETs that are trained on data from models or reanalysis. Here, we propose 3 components to integrate with commonly used DDWP models in order to improve their physical consistency and forecast accuracy. These components are 1) a deep spatial transformer added to the latent sp…

    Submitted 16 March, 2021; originally announced March 2021.

    Comments: Under review in Geoscientific Model Development

  33. arXiv:2103.02108  [pdf]

    cond-mat.soft

    Linear and Nonlinear Viscoelasticity of Concentrated Thermoresponsive Microgel Suspensions

    Authors: Gaurav Chaudhary, Ashesh Ghosh, Jin Gu Kang, Paul V. Braun, Randy H. Ewoldt, Kenneth S. Schweizer

    Abstract: This is an integrated experimental and theoretical study of the dynamics and rheology of self-crosslinked, slightly charged, temperature responsive soft Poly(N-isopropylacrylamide) (pNIPAM) microgels over a wide range of concentration and temperature spanning the sharp change in particle size and intermolecular interactions across the lower critical solution temperature (LCST). Dramatic, non-monot…

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: additional supplementary information provided

  34. arXiv:2102.11400  [pdf, other]

    physics.flu-dyn physics.ao-ph physics.comp-ph

    Stable a posteriori LES of 2D turbulence using convolutional neural networks: Backscattering analysis and generalization to higher Re via transfer learning

    Authors: Yifei Guan, Ashesh Chattopadhyay, Adam Subel, Pedram Hassanzadeh

    Abstract: There is a growing interest in developing data-driven subgrid-scale (SGS) models for large-eddy simulation (LES) using machine learning (ML). In a priori (offline) tests, some recent studies have found ML-based data-driven SGS models that are trained on high-fidelity data (e.g., from direct numerical simulation, DNS) to outperform baseline physics-based models and accurately capture the inter-scal…

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: 30 pages, 12 figures

  35. Accurate and Clear Precipitation Nowcasting with Consecutive Attention and Rain-map Discrimination

    Authors: Ashesh, Buo-Fu Chen, Treng-Shi Huang, Boyo Chen, Chia-Tung Chang, Hsuan-Tien Lin

    Abstract: Precipitation nowcasting is an important task for weather forecasting. Many recent works aim to predict the high rainfall events more accurately with the help of deep learning techniques, but such events are relatively rare. The rarity is often addressed by formulations that re-weight the rare events. Somehow such a formulation carries a side effect of making "blurry" predictions in low rainfall r…

    Submitted 16 February, 2021; originally announced February 2021.

  36. arXiv:2101.00352  [pdf, other]

    cs.LG stat.ML

    Characterizing Fairness Over the Set of Good Models Under Selective Labels

    Authors: Amanda Coston, Ashesh Rambachan, Alexandra Chouldechova

    Abstract: Algorithmic risk assessments are used to inform decisions in a wide variety of high-stakes settings. Often multiple predictive models deliver similar overall performance but differ markedly in their predictions for individual cases, an empirical phenomenon known as the "Rashomon Effect." These models may have different properties over various groups, and therefore have different predictive fairnes…

    Submitted 30 April, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: Added comparison methods to the empirical lending analysis

  37. arXiv:2012.06664  [pdf, other]

    physics.flu-dyn physics.ao-ph

    Data-driven subgrid-scale modeling of forced Burgers turbulence using deep learning with generalization to higher Reynolds numbers via transfer learning

    Authors: Adam Subel, Ashesh Chattopadhyay, Yifei Guan, Pedram Hassanzadeh

    Abstract: Developing data-driven subgrid-scale (SGS) models for large eddy simulations (LES) has received substantial attention recently. Despite some success, particularly in a priori (offline) tests, challenges have been identified that include numerical instabilities in a posteriori (online) tests and generalization (i.e., extrapolation) of trained data-driven SGS models, for example to higher Reynolds n…

    Submitted 11 December, 2020; originally announced December 2020.

    Journal ref: Physics of Fluids, 2021

  38. arXiv:2010.05274  [pdf]

    physics.data-an

    Full Automation for Rapid Modulator Characterization and Accurate Analysis Using SciPy

    Authors: T. L. Yap, A. Sasidhara, N. X. Ang, X. Guo, W. Wang, K. S. Ang, S. L. Tan

    Abstract: Modulator testing involved complex biasing conditions, hardware connections and data analysis. Also, any optical signal distortion due to the grating coupler effect could potentially induce additional difficulty in setting the correct bias condition for an accurate measurement of the modulator performance. In this paper, we proposed to use SciPy, an open-source scientific computing library, for au…

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: 7 pages with 4 figures

  39. arXiv:2009.06924  [pdf, other]

    cs.CV

    360-Degree Gaze Estimation in the Wild Using Multiple Zoom Scales

    Authors: Ashesh, Chu-Song Chen, Hsuan-Tien Lin

    Abstract: Gaze estimation involves predicting where the person is looking at within an image or video. Technically, the gaze information can be inferred from two different magnification levels: face orientation and eye orientation. The inference is not always feasible for gaze estimation in the wild, given the lack of clear eye patches in conditions like extreme left/right gazes or occlusions. In this work,…

    Submitted 26 October, 2021; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: accepted at BMVC 2021

  40. arXiv:2008.00602  [pdf, other]

    econ.EM stat.ME

    Design-Based Uncertainty for Quasi-Experiments

    Authors: Ashesh Rambachan, Jonathan Roth

    Abstract: This paper develops a finite-population, design-based theory of uncertainty for studying quasi-experimental settings in the social sciences. In our framework, treatment is determined by stochastic idiosyncratic factors, but individuals may differ in their probability of receiving treatment in ways unknown to the researcher, thus allowing for rich selection into treatment. We derive formulas for th…

    Submitted 13 February, 2024; v1 submitted 2 August, 2020; originally announced August 2020.

  41. arXiv:2006.14480  [pdf, other]

    cs.CV cs.LG cs.RO

    One Thousand and One Hours: Self-driving Motion Prediction Dataset

    Authors: John Houston, Guido Zuidhof, Luca Bergamini, Yawei Ye, Long Chen, Ashesh Jain, Sammy Omari, Vladimir Iglovikov, Peter Ondruska

    Abstract: Motivated by the impact of large-scale datasets on ML systems we present the largest self-driving dataset for motion prediction to date, containing over 1,000 hours of data. This was collected by a fleet of 20 autonomous vehicles along a fixed route in Palo Alto, California, over a four-month period. It consists of 170,000 scenes, where each scene is 25 seconds long and captures the perception out…

    Submitted 16 November, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: Presented at CoRL 2020

  42. arXiv:2006.14005  [pdf, other]

    cond-mat.soft

    The Role of Collective Elasticity on Activated Structural Relaxation, Yielding and Steady State Flow in Hard Sphere Fluids and Colloidal Suspensions Under Strong Deformation

    Authors: Ashesh Ghosh, Kenneth S. Schweizer

    Abstract: We theoretically study the effect of external deformation on activated structural relaxation and elementary aspects of the nonlinear mechanical response of glassy hard sphere fluids in the context of the nonequilibrium version of the Elastically Collective Nonlinear Langevin Equation (ECNLE) theory. ECNLE theory describes activated relaxation as a coupled local-nonlocal event involving local cagin…

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: 13 pages, 14 figures

  43. arXiv:2003.09915  [pdf, other]

    stat.ME math.ST

    Panel Experiments and Dynamic Causal Effects: A Finite Population Perspective

    Authors: Iavor Bojinov, Ashesh Rambachan, Neil Shephard

    Abstract: In panel experiments, we randomly assign units to different interventions, measuring their outcomes, and repeating the procedure in several periods. Using the potential outcomes framework, we define finite population dynamic causal effects that capture the relative effectiveness of alternative treatment paths. For a rich class of dynamic causal effects, we provide a nonparametric estimator that is…

    Submitted 27 May, 2021; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: Forthcoming in Quantitative Economics

  44. arXiv:2003.06720  [pdf, other]

    cond-mat.soft cond-mat.stat-mech

    Microscopic Theory of Onset of De-Caging and Bond Breaking Activated Dynamics in Ultra-Dense Fluids with Strong Short Range Attractions

    Authors: Ashesh Ghosh, Kenneth S. Schweizer

    Abstract: We theoretically study thermally activated elementary dynamical processes that precede full structural relaxation in ultra-dense particle liquids interacting via strong short range attractive forces. Our approach is based on a microscopic theory formulated at the particle trajectory level built on the dynamic free energy concept and an explicit treatment of how attractions control physical bonding…

    Submitted 14 March, 2020; originally announced March 2020.

    Comments: 6 pages, 4 figures

    Journal ref: Phys. Rev. E 101, 060601 (2020)

  45. arXiv:2002.11167  [pdf, other]

    physics.ao-ph nlin.CD physics.comp-ph physics.flu-dyn physics.geo-ph stat.ML

    Data-driven super-parameterization using deep learning: Experimentation with multi-scale Lorenz 96 systems and transfer-learning

    Authors: Ashesh Chattopadhyay, Adam Subel, Pedram Hassanzadeh

    Abstract: To make weather/climate modeling computationally affordable, small-scale processes are usually represented in terms of the large-scale, explicitly-resolved processes using physics-based or semi-empirical parameterization schemes. Another approach, computationally more demanding but often more accurate, is super-parameterization (SP), which involves integrating the equations of small-scale processe…

    Submitted 25 February, 2020; originally announced February 2020.

    Journal ref: Journal of Advances in Modeling Earth Systems 2020

  46. arXiv:1910.01284  [pdf]

    cond-mat.soft cond-mat.stat-mech

    Microscopic Theory of the Influence of Strong Attractive Forces on the Activated Dynamics of Dense Glass and Gel Forming Fluids

    Authors: Ashesh Ghosh, Kenneth S. Schweizer

    Abstract: We theoretically study the non-monotonic (re-entrant) activated dynamics associated with a repulsive glass to fluid to attractive glass transition in high density particle suspensions interacting via strong short range attractive forces. The classic theoretical projection approximation that replaces all microscopic forces by a single effective force determined solely by equilibrium pair correlatio…

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: 17 pages, 10 figures (additional Supplementary Material)

  47. Bias In, Bias Out? Evaluating the Folk Wisdom

    Authors: Ashesh Rambachan, Jonathan Roth

    Abstract: We evaluate the folk wisdom that algorithmic decision rules trained on data produced by biased human decision-makers necessarily reflect this bias. We consider a setting where training labels are only generated if a biased decision-maker takes a particular action, and so "biased" training data arise due to discriminatory selection into the training data. In our baseline model, the more biased the…

    Submitted 19 December, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

    Journal ref: 1st Symposium on Foundations of Responsible Computing (FORC 2020)

  48. arXiv:1909.05943  [pdf]

    physics.flu-dyn

    Spline-based Interface Modeling and Optimization (SIMO) for Surface Tension and Contact Angle Measurements

    Authors: Karan Jakhar, Ashesh Chattopadhyay, Atul Thakur, Rishi Raj

    Abstract: Surface tension and contact angle measurements are fundamental characterization techniques relevant to thermal and fluidic applications. Drop shape analysis techniques for the measurement of interfacial tension are powerful, versatile and flexible. Here we develop a Spline-based Interface Modeling and Optimization (SIMO) tool for estimating the surface tension and the contact angle from the profil…

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: 28 pages, 8 Figures. Supporting Information can be accessed on: https://drive.google.com/open?id=13Tp0ILu0lOf0V7lxSX7ERjRMjFCDH3y4

  49. arXiv:1907.11617  [pdf, other]

    physics.ao-ph cs.LG

    Analog forecasting of extreme-causing weather patterns using deep learning

    Authors: Ashesh Chattopadhyay, Ebrahim Nabizadeh, Pedram Hassanzadeh

    Abstract: Numerical weather prediction (NWP) models require ever-growing computing time/resources, but still, have difficulties with predicting weather extremes. Here we introduce a data-driven framework that is based on analog forecasting (prediction using past similar patterns) and employs a novel deep learning pattern-recognition technique (capsule neural networks, CapsNets) and impact-based auto-labelin…

    Submitted 12 January, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: Accepted in Journal of Advances in Modeling Earth Systems

  50. arXiv:1906.08829  [pdf, other]

    cs.LG math.DS nlin.CD stat.ML

    Data-driven prediction of a multi-scale Lorenz 96 chaotic system using deep learning methods: Reservoir computing, ANN, and RNN-LSTM

    Authors: Ashesh Chattopadhyay, Pedram Hassanzadeh, Devika Subramanian

    Abstract: In this paper, the performance of three deep learning methods for predicting short-term evolution and for reproducing the long-term statistics of a multi-scale spatio-temporal Lorenz 96 system is examined. The methods are: echo state network (a type of reservoir computing, RC-ESN), deep feed-forward artificial neural network (ANN), and recurrent neural network with long short-term memory (RNN-LSTM…

    Submitted 5 December, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: Some changes have been made, including to the figures and the addition of an appendix

    Journal ref: Nonlin. Processes Geophys. 2020