Skip to main content

Showing 1–39 of 39 results for author: Ashesh

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03689  [pdf, other

    cs.CL cs.AI

    Evaluating the World Model Implicit in a Generative Model

    Authors: Keyon Vafa, Justin Y. Chen, Jon Kleinberg, Sendhil Mullainathan, Ashesh Rambachan

    Abstract: Recent work suggests that large language models may implicitly learn world models. How should we assess this possibility? We formalize this question for the case where the underlying reality is governed by a deterministic finite automaton. This includes problems as diverse as simple logical reasoning, geographic navigation, game-playing, and chemistry. We propose new evaluation metrics for world m… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2406.01382  [pdf, other

    cs.CL cs.AI

    Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function

    Authors: Keyon Vafa, Ashesh Rambachan, Sendhil Mullainathan

    Abstract: What makes large language models (LLMs) impressive is also what makes them hard to evaluate: their diversity of uses. To evaluate these models, we must understand the purposes they will be used for. We consider a setting where these deployment decisions are made by people, and in particular, people's beliefs about where an LLM will perform well. We model such beliefs as the consequence of a human… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: To appear in ICML 2024

  3. arXiv:2405.16297  [pdf, other

    cs.LG physics.ao-ph physics.comp-ph

    LUCIE: A Lightweight Uncoupled ClImate Emulator with long-term stability and physical consistency for O(1000)-member ensembles

    Authors: Haiwen Guan, Troy Arcomano, Ashesh Chattopadhyay, Romit Maulik

    Abstract: We present LUCIE, a $1000$- member ensemble data-driven atmospheric emulator that remains stable during autoregressive inference for thousands of years without a drifting climatology. LUCIE has been trained on $9.5$ years of coarse-resolution ERA5 data with $4$ prognostic variables on a single A100 GPU for $2.4$ h. Owing to the cheap computational cost of inference, $1000$ model ensembles are exec… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  4. arXiv:2403.11854  [pdf, other

    eess.IV cs.CV

    denoiSplit: a method for joint image splitting and unsupervised denoising

    Authors: Ashesh Ashesh, Florian Jug

    Abstract: In this work we present denoiSplit, a method to tackle a new analysis task, i.e. the challenge of joint semantic image splitting and unsupervised denoising. This dual approach has important applications in fluorescence microscopy, where semantic image splitting has important applications but noise does generally hinder the downstream analysis of image content. Image splitting involves dissecting a… ▽ More

    Submitted 25 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  5. arXiv:2401.17671  [pdf, other

    cs.CL cs.AI q-bio.NC

    Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain

    Authors: Gavin Mischler, Yinghao Aaron Li, Stephan Bickel, Ashesh D. Mehta, Nima Mesgarani

    Abstract: Recent advancements in artificial intelligence have sparked interest in the parallels between large language models (LLMs) and human neural processing, particularly in language comprehension. While prior research has established similarities in the representation of LLMs and the brain, the underlying computational principles that cause this convergence, especially in the context of evolving LLMs,… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 19 pages, 5 figures and 4 supplementary figures

  6. arXiv:2310.00813  [pdf, other

    cs.LG cs.AI nlin.CD physics.ao-ph

    OceanNet: A principled neural operator-based digital twin for regional oceans

    Authors: Ashesh Chattopadhyay, Michael Gray, Tianning Wu, Anna B. Lowe, Ruoying He

    Abstract: While data-driven approaches demonstrate great potential in atmospheric modeling and weather forecasting, ocean modeling poses distinct challenges due to complex bathymetry, land, vertical structure, and flow non-linearity. This study introduces OceanNet, a principled neural operator-based digital twin for ocean circulation. OceanNet uses a Fourier neural operator and predictor-evaluate-corrector… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: Supplementary information can be found in: https://drive.google.com/file/d/1NoxJLa967naJT787a5-IfZ7f_MmRuZMP/view?usp=sharing

  7. arXiv:2306.05014  [pdf, other

    physics.flu-dyn cs.LG physics.ao-ph

    Learning Closed-form Equations for Subgrid-scale Closures from High-fidelity Data: Promises and Challenges

    Authors: Karan Jakhar, Yifei Guan, Rambod Mojgani, Ashesh Chattopadhyay, Pedram Hassanzadeh

    Abstract: There is growing interest in discovering interpretable, closed-form equations for subgrid-scale (SGS) closures/parameterizations of complex processes in Earth systems. Here, we apply a common equation-discovery technique with expansive libraries to learn closures from filtered direct numerical simulations of 2D turbulence and Rayleigh-Bénard convection (RBC). Across common filters (e.g., Gaussian,… ▽ More

    Submitted 12 March, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 40 pages, 4 figures. The codes and data used in this work can be found at https://github.com/jakharkaran/EqsDiscovery_2D-FHIT_RBC and https://doi.org/10.5281/zenodo.7500647, respectively

    MSC Class: 76F65 (Primary) 86A08; 68T01; 76F05; 76F35 (Secondary) ACM Class: J.2; I.2.0; G.1.8

  8. arXiv:2305.00385  [pdf

    eess.IV cs.CV

    Cross-Shaped Windows Transformer with Self-supervised Pretraining for Clinically Significant Prostate Cancer Detection in Bi-parametric MRI

    Authors: Yuheng Li, Jacob Wynne, **g Wang, Richard L. J. Qiu, Justin Roper, Shaoyan Pan, Ashesh B. Jani, Tian Liu, Pretesh R. Patel, Hui Mao, Xiaofeng Yang

    Abstract: Biparametric magnetic resonance imaging (bpMRI) has demonstrated promising results in prostate cancer (PCa) detection using convolutional neural networks (CNNs). Recently, transformers have achieved competitive performance compared to CNNs in computer vision. Large scale transformers need abundant annotated data for training, which are difficult to obtain in medical imaging. Self-supervised learni… ▽ More

    Submitted 17 March, 2024; v1 submitted 30 April, 2023; originally announced May 2023.

  9. arXiv:2304.07029  [pdf, other

    physics.flu-dyn cs.AI cs.LG math.NA physics.ao-ph

    Long-term instabilities of deep learning-based digital twins of the climate system: The cause and a solution

    Authors: Ashesh Chattopadhyay, Pedram Hassanzadeh

    Abstract: Long-term stability is a critical property for deep learning-based data-driven digital twins of the Earth system. Such data-driven digital twins enable sub-seasonal and seasonal predictions of extreme environmental events, probabilistic forecasts, that require a large number of ensemble members, and computationally tractable high-resolution Earth system models where expensive components of the mod… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: Supplementary information is given at https://drive.google.com/file/d/1J0k20Qk___PbDQob0Z4vnSVWEpnDFlif/view?usp=share_link

  10. arXiv:2212.09844  [pdf, other

    econ.EM cs.CY cs.LG stat.ME

    Robust Design and Evaluation of Predictive Algorithms under Unobserved Confounding

    Authors: Ashesh Rambachan, Amanda Coston, Edward Kennedy

    Abstract: Predictive algorithms inform consequential decisions in settings where the outcome is selectively observed given choices made by human decision makers. We propose a unified framework for the robust design and evaluation of predictive algorithms in selectively observed data. We impose general assumptions on how much the outcome may vary on average between unselected and selected units conditional o… ▽ More

    Submitted 19 May, 2024; v1 submitted 19 December, 2022; originally announced December 2022.

  11. arXiv:2211.12872  [pdf, other

    cs.CV cs.LG

    μSplit: efficient image decomposition for microscopy data

    Authors: Ashesh, Alexander Krull, Moises Di Sante, Francesco Silvio Pasqualini, Florian Jug

    Abstract: We present μSplit, a dedicated approach for trained image decomposition in the context of fluorescence microscopy images. We find that best results using regular deep architectures are achieved when large image patches are used during training, making memory consumption the limiting factor to further improving performance. We therefore introduce lateral contextualization (LC), a novel meta-archite… ▽ More

    Submitted 16 August, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Published at ICCV 2023. 10 pages, 7 figures, 9 pages supplement, 8 supplementary figures

  12. arXiv:2206.04811  [pdf, other

    cs.LG physics.comp-ph physics.data-an physics.flu-dyn physics.geo-ph

    Deep learning-enhanced ensemble-based data assimilation for high-dimensional nonlinear dynamical systems

    Authors: Ashesh Chattopadhyay, Ebrahim Nabizadeh, Eviatar Bach, Pedram Hassanzadeh

    Abstract: Data assimilation (DA) is a key component of many forecasting models in science and engineering. DA allows one to estimate better initial conditions using an imperfect dynamical model of the system and noisy/sparse observations available from the system. Ensemble Kalman filter (EnKF) is a DA algorithm that is widely used in applications involving high-dimensional nonlinear dynamical systems. Howev… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  13. arXiv:2206.03198  [pdf, other

    physics.flu-dyn cs.LG physics.comp-ph

    Explaining the physics of transfer learning a data-driven subgrid-scale closure to a different turbulent flow

    Authors: Adam Subel, Yifei Guan, Ashesh Chattopadhyay, Pedram Hassanzadeh

    Abstract: Transfer learning (TL) is becoming a powerful tool in scientific applications of neural networks (NNs), such as weather/climate prediction and turbulence modeling. TL enables out-of-distribution generalization (e.g., extrapolation in parameters) and effective blending of disparate training sets (e.g., simulations and observations). In TL, selected layers of a NN, already trained for a base system,… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: 21 pages, 6 figures

  14. arXiv:2205.04601  [pdf, other

    cs.LG nlin.CD physics.ao-ph physics.flu-dyn physics.geo-ph

    Long-term stability and generalization of observationally-constrained stochastic data-driven models for geophysical turbulence

    Authors: Ashesh Chattopadhyay, Jaideep Pathak, Ebrahim Nabizadeh, Wahid Bhimji, Pedram Hassanzadeh

    Abstract: Recent years have seen a surge in interest in building deep learning-based fully data-driven models for weather prediction. Such deep learning models if trained on observations can mitigate certain biases in current state-of-the-art weather models, some of which stem from inaccurate representation of subgrid-scale processes. However, these data-driven models, being over-parameterized, require a lo… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

  15. arXiv:2202.11214  [pdf, other

    physics.ao-ph cs.LG

    FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators

    Authors: Jaideep Pathak, Shashank Subramanian, Peter Harrington, Sanjeev Raja, Ashesh Chattopadhyay, Morteza Mardani, Thorsten Kurth, David Hall, Zongyi Li, Kamyar Azizzadenesheli, Pedram Hassanzadeh, Karthik Kashinath, Animashree Anandkumar

    Abstract: FourCastNet, short for Fourier Forecasting Neural Network, is a global data-driven weather forecasting model that provides accurate short to medium-range global predictions at $0.25^{\circ}$ resolution. FourCastNet accurately forecasts high-resolution, fast-timescale variables such as the surface wind speed, precipitation, and atmospheric water vapor. It has important implications for planning win… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  16. arXiv:2201.02702  [pdf

    math.DS cs.LG math.OC stat.AP stat.ME

    An Improved Mathematical Model of Sepsis: Modeling, Bifurcation Analysis, and Optimal Control Study for Complex Nonlinear Infectious Disease System

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: Sepsis is a life-threatening medical emergency, which is a major cause of death worldwide and the second highest cause of mortality in the United States. Researching the optimal control treatment or intervention strategy on the comprehensive sepsis system is key in reducing mortality. For this purpose, first, this paper improves a complex nonlinear sepsis model proposed in our previous work. Then,… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: 25 pages, 7 figures, 1 table

  17. arXiv:2201.00147  [pdf

    cs.LG math.OC stat.AP stat.ME

    High-dimensional Bayesian Optimization Algorithm with Recurrent Neural Network for Disease Control Models in Time Series

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: Bayesian Optimization algorithm has become a promising approach for nonlinear global optimization problems and many machine learning applications. Over the past few years, improvements and enhancements have been brought forward and they have shown some promising results in solving the complex dynamic problems, systems of ordinary differential equations where the objective functions are computation… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: 16 pages, 9 figures, 2 tables

  18. arXiv:2109.13602  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

    Authors: Matt Vitelli, Yan Chang, Yawei Ye, Maciej Wołczyk, Błażej Osiński, Moritz Niendorf, Hugo Grimmett, Qiangui Huang, Ashesh Jain, Peter Ondruska

    Abstract: In this paper we present the first safe system for full control of self-driving vehicles trained from human demonstrations and deployed in challenging, real-world, urban environments. Current industry-standard solutions use rule-based systems for planning. Although they perform reasonably well in common scenarios, the engineering complexity renders this approach incompatible with human-level perfo… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

  19. arXiv:2108.02289  [pdf

    cs.LG math.OC stat.AP stat.ME

    High dimensional Bayesian Optimization Algorithm for Complex System in Time Series

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: At present, high-dimensional global optimization problems with time-series models have received much attention from engineering fields. Since it was proposed, Bayesian optimization has quickly become a popular and promising approach for solving global optimization problems. However, the standard Bayesian optimization algorithm is insufficient to solving the global optimal solution when the model i… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: 18 pages, 13 figures

  20. arXiv:2107.08142  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Autonomy 2.0: Why is self-driving always 5 years away?

    Authors: Ashesh Jain, Luca Del Pero, Hugo Grimmett, Peter Ondruska

    Abstract: Despite the numerous successes of machine learning over the past decade (image recognition, decision-making, NLP, image synthesis), self-driving technology has not yet followed the same trend. In this paper, we study the history, composition, and development bottlenecks of the modern self-driving stack. We argue that the slow progress is caused by approaches that require too much hand-engineering,… ▽ More

    Submitted 9 August, 2021; v1 submitted 16 July, 2021; originally announced July 2021.

  21. arXiv:2103.09360  [pdf, other

    physics.ao-ph cs.AI cs.LG physics.comp-ph

    Towards physically consistent data-driven weather forecasting: Integrating data assimilation with equivariance-preserving deep spatial transformers

    Authors: Ashesh Chattopadhyay, Mustafa Mustafa, Pedram Hassanzadeh, Eviatar Bach, Karthik Kashinath

    Abstract: There is growing interest in data-driven weather prediction (DDWP), for example using convolutional neural networks such as U-NETs that are trained on data from models or reanalysis. Here, we propose 3 components to integrate with commonly used DDWP models in order to improve their physical consistency and forecast accuracy. These components are 1) a deep spatial transformer added to the latent sp… ▽ More

    Submitted 16 March, 2021; originally announced March 2021.

    Comments: Under review in Geoscientific Model Development

  22. Accurate and Clear Precipitation Nowcasting with Consecutive Attention and Rain-map Discrimination

    Authors: Ashesh, Buo-Fu Chen, Treng-Shi Huang, Boyo Chen, Chia-Tung Chang, Hsuan-Tien Lin

    Abstract: Precipitation nowcasting is an important task for weather forecasting. Many recent works aim to predict the high rainfall events more accurately with the help of deep learning techniques, but such events are relatively rare. The rarity is often addressed by formulations that re-weight the rare events. Somehow such a formulation carries a side effect of making "blurry" predictions in low rainfall r… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

  23. arXiv:2101.00352  [pdf, other

    cs.LG stat.ML

    Characterizing Fairness Over the Set of Good Models Under Selective Labels

    Authors: Amanda Coston, Ashesh Rambachan, Alexandra Chouldechova

    Abstract: Algorithmic risk assessments are used to inform decisions in a wide variety of high-stakes settings. Often multiple predictive models deliver similar overall performance but differ markedly in their predictions for individual cases, an empirical phenomenon known as the "Rashomon Effect." These models may have different properties over various groups, and therefore have different predictive fairnes… ▽ More

    Submitted 30 April, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: Added comparison methods to the empirical lending analysis

  24. arXiv:2009.06924  [pdf, other

    cs.CV

    360-Degree Gaze Estimation in the Wild Using Multiple Zoom Scales

    Authors: Ashesh, Chu-Song Chen, Hsuan-Tien Lin

    Abstract: Gaze estimation involves predicting where the person is looking at within an image or video. Technically, the gaze information can be inferred from two different magnification levels: face orientation and eye orientation. The inference is not always feasible for gaze estimation in the wild, given the lack of clear eye patches in conditions like extreme left/right gazes or occlusions. In this work,… ▽ More

    Submitted 26 October, 2021; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: accepted at BMVC 2021

  25. arXiv:2006.14480  [pdf, other

    cs.CV cs.LG cs.RO

    One Thousand and One Hours: Self-driving Motion Prediction Dataset

    Authors: John Houston, Guido Zuidhof, Luca Bergamini, Yawei Ye, Long Chen, Ashesh Jain, Sammy Omari, Vladimir Iglovikov, Peter Ondruska

    Abstract: Motivated by the impact of large-scale datasets on ML systems we present the largest self-driving dataset for motion prediction to date, containing over 1,000 hours of data. This was collected by a fleet of 20 autonomous vehicles along a fixed route in Palo Alto, California, over a four-month period. It consists of 170,000 scenes, where each scene is 25 seconds long and captures the perception out… ▽ More

    Submitted 16 November, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: Presente at CoRL2020

  26. Bias In, Bias Out? Evaluating the Folk Wisdom

    Authors: Ashesh Rambachan, Jonathan Roth

    Abstract: We evaluate the folk wisdom that algorithmic decision rules trained on data produced by biased human decision-makers necessarily reflect this bias. We consider a setting where training labels are only generated if a biased decision-maker takes a particular action, and so "biased" training data arise due to discriminatory selection into the training data. In our baseline model, the more biased the… ▽ More

    Submitted 19 December, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

    Journal ref: 1st Symposium on Foundations of Responsible Computing (FORC 2020)

  27. arXiv:1907.11617  [pdf, other

    physics.ao-ph cs.LG

    Analog forecasting of extreme-causing weather patterns using deep learning

    Authors: Ashesh Chattopadhyay, Ebrahim Nabizadeh, Pedram Hassanzadeh

    Abstract: Numerical weather prediction (NWP) models require ever-growing computing time/resources, but still, have difficulties with predicting weather extremes. Here we introduce a data-driven framework that is based on analog forecasting (prediction using past similar patterns) and employs a novel deep learning pattern-recognition technique (capsule neural networks, CapsNets) and impact-based auto-labelin… ▽ More

    Submitted 12 January, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: Accepted in Journal of Advances in Modeling Earth System

  28. arXiv:1906.08829  [pdf, other

    cs.LG math.DS nlin.CD stat.ML

    Data-driven prediction of a multi-scale Lorenz 96 chaotic system using deep learning methods: Reservoir computing, ANN, and RNN-LSTM

    Authors: Ashesh Chattopadhyay, Pedram Hassanzadeh, Devika Subramanian

    Abstract: In this paper, the performance of three deep learning methods for predicting short-term evolution and for reproducing the long-term statistics of a multi-scale spatio-temporal Lorenz 96 system is examined. The methods are: echo state network (a type of reservoir computing, RC-ESN), deep feed-forward artificial neural network (ANN), and recurrent neural network with long short-term memory (RNN-LSTM… ▽ More

    Submitted 5 December, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: Some changes, in Figures, addition of an appendix etc has been done

    Journal ref: Nonlin. Processes Geophys. 2020

  29. arXiv:1811.04817  [pdf, other

    physics.ao-ph cs.CV cs.LG

    A test case for application of convolutional neural networks to spatio-temporal climate data: Re-identifying clustered weather patterns

    Authors: Ashesh Chattopadhyay, Pedram Hassanzadeh, Saba Pasha

    Abstract: Convolutional neural networks (CNNs) can potentially provide powerful tools for classifying and identifying patterns in climate and environmental data. However, because of the inherent complexities of such data, which are often spatio-temporal, chaotic, and non-stationary, the CNN algorithms must be designed/evaluated for each specific dataset and application. Yet to start, CNN, a supervised techn… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

    Journal ref: Scientific Reports, 2020

  30. arXiv:1711.10871  [pdf, other

    cs.CV

    PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation

    Authors: Danfei Xu, Dragomir Anguelov, Ashesh Jain

    Abstract: We present PointFusion, a generic 3D object detection method that leverages both image and 3D point cloud information. Unlike existing methods that either use multi-stage pipelines or hold sensor and dataset-specific assumptions, PointFusion is conceptually simple and application-agnostic. The image data and the raw point cloud data are independently processed by a CNN and a PointNet architecture,… ▽ More

    Submitted 25 August, 2018; v1 submitted 29 November, 2017; originally announced November 2017.

    Comments: CVPR 2018

  31. arXiv:1601.00741  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Preferences for Manipulation Tasks from Online Coactive Feedback

    Authors: Ashesh Jain, Shikhar Sharma, Thorsten Joachims, Ashutosh Saxena

    Abstract: We consider the problem of learning preferences over trajectories for mobile manipulators such as personal robots and assembly line robots. The preferences we learn are more intricate than simple geometric constraints on trajectories; they are rather governed by the surrounding context of various objects and human interactions in the environment. We propose a coactive online learning framework for… ▽ More

    Submitted 5 January, 2016; originally announced January 2016.

    Comments: IJRR accepted (Learning preferences over trajectories from coactive feedback)

  32. arXiv:1601.00740  [pdf, other

    cs.RO cs.CV cs.LG

    Brain4Cars: Car That Knows Before You Do via Sensory-Fusion Deep Learning Architecture

    Authors: Ashesh Jain, Hema S Koppula, Shane Soh, Bharad Raghavan, Avi Singh, Ashutosh Saxena

    Abstract: Advanced Driver Assistance Systems (ADAS) have made driving safer over the last decade. They prepare vehicles for unsafe road conditions and alert drivers if they perform a dangerous maneuver. However, many accidents are unavoidable because by the time drivers are alerted, it is already too late. Anticipating maneuvers beforehand can alert drivers before they perform the maneuver and also give ADA… ▽ More

    Submitted 5 January, 2016; originally announced January 2016.

    Comments: Journal Version (ICCV and ICRA combination with more system details) http://brain4cars.com

  33. arXiv:1511.05298  [pdf, other

    cs.CV cs.LG cs.NE cs.RO

    Structural-RNN: Deep Learning on Spatio-Temporal Graphs

    Authors: Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena

    Abstract: Deep Recurrent Neural Network architectures, though remarkably capable at modeling sequences, lack an intuitive high-level spatio-temporal structure. That is while many problems in computer vision inherently have an underlying high-level structure and can benefit from it. Spatio-temporal graphs are a popular tool for imposing such high-level intuitions in the formulation of real world problems. In… ▽ More

    Submitted 11 April, 2016; v1 submitted 17 November, 2015; originally announced November 2015.

    Comments: CVPR 2016 (Oral)

  34. arXiv:1509.05016  [pdf, other

    cs.CV cs.AI cs.RO

    Recurrent Neural Networks for Driver Activity Anticipation via Sensory-Fusion Architecture

    Authors: Ashesh Jain, Avi Singh, Hema S Koppula, Shane Soh, Ashutosh Saxena

    Abstract: Anticipating the future actions of a human is a widely studied problem in robotics that requires spatio-temporal reasoning. In this work we propose a deep learning approach for anticipation in sensory-rich robotics applications. We introduce a sensory-fusion architecture which jointly learns to anticipate and fuse information from multiple sensory streams. Our architecture consists of Recurrent Ne… ▽ More

    Submitted 16 September, 2015; originally announced September 2015.

    Comments: Follow-up of ICCV 2015 Brain4Cars http://www.brain4cars.com

  35. arXiv:1504.02789  [pdf, other

    cs.CV

    Car that Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models

    Authors: Ashesh Jain, Hema S. Koppula, Bharad Raghavan, Shane Soh, Ashutosh Saxena

    Abstract: Advanced Driver Assistance Systems (ADAS) have made driving safer over the last decade. They prepare vehicles for unsafe road conditions and alert drivers if they perform a dangerous maneuver. However, many accidents are unavoidable because by the time drivers are alerted, it is already too late. Anticipating maneuvers beforehand can alert drivers before they perform the maneuver and also give ADA… ▽ More

    Submitted 19 September, 2015; v1 submitted 10 April, 2015; originally announced April 2015.

    Comments: ICCV 2015, http://brain4cars.com

  36. arXiv:1412.0691  [pdf, other

    cs.AI cs.RO

    RoboBrain: Large-Scale Knowledge Engine for Robots

    Authors: Ashutosh Saxena, Ashesh Jain, Ozan Sener, Aditya Jami, Dipendra K. Misra, Hema S. Koppula

    Abstract: In this paper we introduce a knowledge engine, which learns and shares knowledge representations, for robots to carry out a variety of tasks. Building such an engine brings with it the challenge of dealing with multiple data modalities including symbols, natural language, haptic senses, robot trajectories, visual features and many others. The \textit{knowledge} stored in the engine comes from mult… ▽ More

    Submitted 12 April, 2015; v1 submitted 1 December, 2014; originally announced December 2014.

    Comments: 10 pages, 9 figures

  37. arXiv:1406.2616  [pdf, other

    cs.RO cs.AI cs.LG

    PlanIt: A Crowdsourcing Approach for Learning to Plan Paths from Large Scale Preference Feedback

    Authors: Ashesh Jain, Debarghya Das, Jayesh K Gupta, Ashutosh Saxena

    Abstract: We consider the problem of learning user preferences over robot trajectories for environments rich in objects and humans. This is challenging because the criterion defining a good trajectory varies with users, tasks and interactions in the environment. We represent trajectory preferences using a cost function that the robot learns and uses it to generate good trajectories in new environments. We d… ▽ More

    Submitted 5 January, 2016; v1 submitted 10 June, 2014; originally announced June 2014.

    Comments: PlanIt Camera Ready ICRA'15

  38. arXiv:1306.6294  [pdf, other

    cs.RO cs.AI cs.HC

    Learning Trajectory Preferences for Manipulators via Iterative Improvement

    Authors: Ashesh Jain, Brian Wojcik, Thorsten Joachims, Ashutosh Saxena

    Abstract: We consider the problem of learning good trajectories for manipulation tasks. This is challenging because the criterion defining a good trajectory varies with users, tasks and environments. In this paper, we propose a co-active online learning framework for teaching robots the preferences of its users for object manipulation tasks. The key novelty of our approach lies in the type of feedback expec… ▽ More

    Submitted 5 November, 2013; v1 submitted 26 June, 2013; originally announced June 2013.

    Comments: 9 pages. To appear in NIPS 2013

  39. arXiv:1204.1748  [pdf

    cs.NI

    Bluetooth Navigation System using Wi-Fi Access Points

    Authors: Rohit Agrawal, Ashesh Vasalya

    Abstract: There have been various navigation and tracking systems being developed with the help of technologies like GPS, GSM, Bluetooth, IR, Wi-Fi and Radar. Outdoor positioning systems have been deployed quite successfully using GPS but positioning systems for indoor environments still do not have widespread deployment due to various reasons. Most of these use only a single technology for positioning but… ▽ More

    Submitted 8 April, 2012; originally announced April 2012.

    Comments: 8 pages 2 figures and 1 table International Journal of Distributed and Parallel Systems