Skip to main content

Showing 1–9 of 9 results for author: Jonas, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.06271  [pdf, other

    cs.CV cs.AI cs.LG

    Rotation-Invariant Random Features Provide a Strong Baseline for Machine Learning on 3D Point Clouds

    Authors: Owen Melia, Eric Jonas, Rebecca Willett

    Abstract: Rotational invariance is a popular inductive bias used by many fields in machine learning, such as computer vision and machine learning for quantum chemistry. Rotation-invariant machine learning methods set the state of the art for many tasks, including molecular property prediction and 3D shape classification. These methods generally either rely on task-specific rotation-invariant features, or th… ▽ More

    Submitted 27 July, 2023; originally announced August 2023.

  2. arXiv:2306.07472  [pdf, other

    physics.chem-ph cs.LG stat.ML

    Von Mises Mixture Distributions for Molecular Conformation Generation

    Authors: Kirk Swanson, Jake Williams, Eric Jonas

    Abstract: Molecules are frequently represented as graphs, but the underlying 3D molecular geometry (the locations of the atoms) ultimately determines most molecular properties. However, most molecules are not static and at room temperature adopt a wide variety of geometries or $\textit{conformations}$. The resulting distribution on geometries $p(x)$ is known as the Boltzmann distribution, and many molecular… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  3. arXiv:1902.03383  [pdf, ps, other

    cs.OS

    Cloud Programming Simplified: A Berkeley View on Serverless Computing

    Authors: Eric Jonas, Johann Schleier-Smith, Vikram Sreekanti, Chia-Che Tsai, Anurag Khandelwal, Qifan Pu, Vaishaal Shankar, Joao Carreira, Karl Krauth, Neeraja Yadwadkar, Joseph E. Gonzalez, Raluca Ada Popa, Ion Stoica, David A. Patterson

    Abstract: Serverless cloud computing handles virtually all the system administration operations needed to make it easier for programmers to use the cloud. It provides an interface that greatly simplifies cloud programming, and represents an evolution that parallels the transition from assembly language to high-level programming languages. This paper gives a quick history of cloud computing, including an acc… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.

  4. arXiv:1901.08705  [pdf, other

    cs.DC

    Ambitious Data Science Can Be Painless

    Authors: Hatef Monajemi, Riccardo Murri, Eric Jonas, Percy Liang, Victoria Stodden, David L. Donoho

    Abstract: Modern data science research can involve massive computational experimentation; an ambitious PhD in computational fields may do experiments consuming several million CPU hours. Traditional computing practices, in which researchers use laptops or shared campus-resident resources, are inadequate for experiments at the massive scale and varied scope that we now see in data science. On the other hand,… ▽ More

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: Submitted to Harvard Data Science Review

  5. arXiv:1810.09679  [pdf, other

    cs.DC

    numpywren: serverless linear algebra

    Authors: Vaishaal Shankar, Karl Krauth, Qifan Pu, Eric Jonas, Shivaram Venkataraman, Ion Stoica, Benjamin Recht, Jonathan Ragan-Kelley

    Abstract: Linear algebra operations are widely used in scientific computing and machine learning applications. However, it is challenging for scientists and data analysts to run linear algebra at scales beyond a single machine. Traditional approaches either require access to supercomputing clusters, or impose configuration and cluster management challenges. In this paper we show how the disaggregation of st… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

  6. Flare Prediction Using Photospheric and Coronal Image Data

    Authors: Eric Jonas, Monica G. Bobra, Vaishaal Shankar, J. Todd Hoeksema, Benjamin Recht

    Abstract: The precise physical process that triggers solar flares is not currently understood. Here we attempt to capture the signature of this mechanism in solar image data of various wavelengths and use these signatures to predict flaring activity. We do this by develo** an algorithm that [1] automatically generates features in 5.5 TB of image data taken by the Solar Dynamics Observatory of the solar ph… ▽ More

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: submitted for publication in the Astrophysical Journal

  7. arXiv:1702.04024  [pdf, other

    cs.DC

    Occupy the Cloud: Distributed Computing for the 99%

    Authors: Eric Jonas, Qifan Pu, Shivaram Venkataraman, Ion Stoica, Benjamin Recht

    Abstract: Distributed computing remains inaccessible to a large number of users, in spite of many open source platforms and extensive commercial offerings. While distributed computation frameworks have moved beyond a simple map-reduce model, many users are still left to struggle with complex cluster management and configuration tools, even for running simple embarrassingly parallel jobs. We argue that state… ▽ More

    Submitted 7 June, 2017; v1 submitted 13 February, 2017; originally announced February 2017.

  8. arXiv:1512.01272  [pdf, other

    cs.AI stat.CO stat.ML

    CrossCat: A Fully Bayesian Nonparametric Method for Analyzing Heterogeneous, High Dimensional Data

    Authors: Vikash Mansinghka, Patrick Shafto, Eric Jonas, Cap Petschulat, Max Gasner, Joshua B. Tenenbaum

    Abstract: There is a widespread need for statistical methods that can analyze high-dimensional datasets with- out imposing restrictive or opaque modeling assumptions. This paper describes a domain-general data analysis method called CrossCat. CrossCat infers multiple non-overlap** views of the data, each consisting of a subset of the variables, and uses a separate nonparametric mixture to model each view.… ▽ More

    Submitted 3 December, 2015; originally announced December 2015.

  9. arXiv:1402.4914  [pdf, other

    cs.AI cs.AR stat.CO

    Building fast Bayesian computing machines out of intentionally stochastic, digital parts

    Authors: Vikash Mansinghka, Eric Jonas

    Abstract: The brain interprets ambiguous sensory information faster and more reliably than modern computers, using neurons that are slower and less reliable than logic gates. But Bayesian inference, which underpins many computational models of perception and cognition, appears computationally challenging even given modern transistor speeds and energy budgets. The computational principles and structures need… ▽ More

    Submitted 20 February, 2014; originally announced February 2014.

    Comments: 6 figures