Skip to main content

Showing 1–19 of 19 results for author: van Dijk, D

.
  1. arXiv:2310.01618  [pdf, other

    cs.LG math.NA

    Operator Learning Meets Numerical Analysis: Improving Neural Networks through Iterative Methods

    Authors: Emanuele Zappala, Daniel Levine, Sizhuang He, Syed Rizvi, Sacha Levy, David van Dijk

    Abstract: Deep neural networks, despite their success in numerous applications, often function without established theoretical foundations. In this paper, we bridge this gap by drawing parallels between deep learning and classical numerical analysis. By framing neural networks as operators with fixed points representing desired solutions, we develop a theoretical framework grounded in iterative methods for… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 27 pages (13+14). 8 Figures and 5 tables. Comments are welcome!

  2. arXiv:2301.13338  [pdf, other

    cs.LG cs.CV

    Continuous Spatiotemporal Transformers

    Authors: Antonio H. de O. Fonseca, Emanuele Zappala, Josue Ortega Caro, David van Dijk

    Abstract: Modeling spatiotemporal dynamical systems is a fundamental challenge in machine learning. Transformer models have been very successful in NLP and computer vision where they provide interpretable representations of data. However, a limitation of transformers in modeling continuous dynamical systems is that they are fundamentally discrete time and space models and thus have no guarantees regarding c… ▽ More

    Submitted 28 July, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Updated version, after reviews

  3. arXiv:2210.09475  [pdf, other

    cs.LG

    FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks

    Authors: Syed Asad Rizvi, Nazreen Pallikkavaliyaveetil, David Zhang, Zhuoyang Lyu, Nhi Nguyen, Haoran Lyu, Benjamin Christensen, Josue Ortega Caro, Antonio H. O. Fonseca, Emanuele Zappala, Maryam Bagherian, Christopher Averill, Chadi G. Abdallah, Amin Karbasi, Rex Ying, Maria Brbic, Rahul Madhav Dhodapkar, David van Dijk

    Abstract: Foundation models have achieved remarkable success across many domains, relying on pretraining over vast amounts of data. Graph-structured data often lacks the same scale as unstructured data, making the development of graph foundation models challenging. In this work, we propose Foundation-Informed Message Passing (FIMP), a Graph Neural Network (GNN) message-passing framework that leverages pretr… ▽ More

    Submitted 1 July, 2024; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 16 pages (12 + 4 pages appendix). 5 figures and 4 tables

  4. arXiv:2209.15190  [pdf, other

    cs.LG math.DS math.NA physics.comp-ph

    Neural Integral Equations

    Authors: Emanuele Zappala, Antonio Henrique de Oliveira Fonseca, Josue Ortega Caro, David van Dijk

    Abstract: Integral equations (IEs) are equations that model spatiotemporal systems with non-local interactions. They have found important applications throughout theoretical and applied sciences, including in physics, chemistry, biology, and engineering. While efficient algorithms exist for solving given IEs, no method exists that can learn an IE and its associated dynamics from data alone. In this paper, w… ▽ More

    Submitted 18 May, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: 19 + 20 pages, 18 figures and 11 tables. Comments are welcome! v4: Article expanded to include theoretical guarantees for convergence of the integral equations solver and further experiments on brain data and interpretability of dynamics

  5. arXiv:2206.14282  [pdf, other

    cs.LG

    Neural Integro-Differential Equations

    Authors: Emanuele Zappala, Antonio Henrique de Oliveira Fonseca, Andrew Henry Moberly, Michael James Higley, Chadi Abdallah, Jessica Cardin, David van Dijk

    Abstract: Modeling continuous dynamical systems from discretely sampled observations is a fundamental problem in data science. Often, such dynamics are the result of non-local processes that present an integral over time. As such, these systems are modeled with Integro-Differential Equations (IDEs); generalizations of differential equations that comprise both an integral and a differential component. For ex… ▽ More

    Submitted 29 November, 2022; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: 18 pages (including 8 pages Appendix), 8 figures and 6 tables. v4: Final version with reviewers' comments included, to appear in AAAI-23 (up to formatting differences)

  6. arXiv:2010.05820  [pdf, other

    cs.LG math.PR stat.ML

    Permutation invariant networks to learn Wasserstein metrics

    Authors: Arijit Sehanobish, Neal Ravindra, David van Dijk

    Abstract: Understanding the space of probability measures on a metric space equipped with a Wasserstein distance is one of the fundamental questions in mathematical analysis. The Wasserstein metric has received a lot of attention in the machine learning community especially for its principled way of comparing distributions. In this work, we use a permutation invariant network to map samples from probability… ▽ More

    Submitted 26 February, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: Fix typos, Accepted as a spotlight at Topological Data Analysis and Beyond Workshop at Neurips 2020. Added more experiments and results. Comments welcome

  7. arXiv:2007.04777  [pdf, other

    eess.IV cs.LG q-bio.GN stat.ML

    Self-supervised edge features for improved Graph Neural Network training

    Authors: Arijit Sehanobish, Neal G. Ravindra, David van Dijk

    Abstract: Graph Neural Networks (GNN) have been extensively used to extract meaningful representations from graph structured data and to perform predictive tasks such as node classification and link prediction. In recent years, there has been a lot of work incorporating edge features along with node features for prediction tasks. One of the main difficulties in using edge features is that they are often han… ▽ More

    Submitted 23 June, 2020; originally announced July 2020.

    Comments: Comments welcome. arXiv admin note: substantial text overlap with arXiv:2006.12971

    ACM Class: I.2.4; J.3

  8. arXiv:2006.13297  [pdf, other

    cs.LG physics.comp-ph quant-ph stat.ML

    Learning Potentials of Quantum Systems using Deep Neural Networks

    Authors: Arijit Sehanobish, Hector H. Corzo, Onur Kara, David van Dijk

    Abstract: Attempts to apply Neural Networks (NN) to a wide range of research problems have been ubiquitous and plentiful in recent literature. Particularly, the use of deep NNs for understanding complex physical and chemical phenomena has opened a new niche of science where the analysis tools from Machine Learning (ML) are combined with the computational concepts of the natural sciences. Reports from this u… ▽ More

    Submitted 14 January, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: New density to potential experiments, substantial rearrangement of the paper, Under Review, comments welcome

    Report number: Vol-2964 (0074-2964-3) ACM Class: I.2.6; J.2

    Journal ref: CEUR Workshop Proceedings 2021

  9. arXiv:2006.12971  [pdf, other

    cs.LG q-bio.GN stat.ML

    Gaining Insight into SARS-CoV-2 Infection and COVID-19 Severity Using Self-supervised Edge Features and Graph Neural Networks

    Authors: Arijit Sehanobish, Neal G. Ravindra, David van Dijk

    Abstract: A molecular and cellular understanding of how SARS-CoV-2 variably infects and causes severe COVID-19 remains a bottleneck in develo** interventions to end the pandemic. We sought to use deep learning to study the biology of SARS-CoV-2 infection and COVID-19 severity by identifying transcriptomic patterns and cell types associated with SARS-CoV-2 infection and COVID-19 severity. To do this, we de… ▽ More

    Submitted 15 December, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: To appear at AAAI'21. Previous version (v2) accepted as a spotlight talk at ICML 2020 Workshop on Graph Representation Learning and Beyond (GRL+) and recipient of best paper award for Covid-19 applications. Significant improvements over v2

  10. arXiv:2006.11578  [pdf, other

    cs.CL cs.LG

    Learning aligned embeddings for semi-supervised word translation using Maximum Mean Discrepancy

    Authors: Antonio H. O. Fonseca, David van Dijk

    Abstract: Word translation is an integral part of language translation. In machine translation, each language is considered a domain with its own word embedding. The alignment between word embeddings allows linking semantically equivalent words in multilingual contexts. Moreover, it offers a way to infer cross-lingual meaning for words without a direct translation. Current methods for word embedding alignme… ▽ More

    Submitted 20 June, 2020; originally announced June 2020.

  11. arXiv:2002.07128  [pdf, other

    q-bio.GN cs.LG stat.ML

    Disease State Prediction From Single-Cell Data Using Graph Attention Networks

    Authors: Neal G. Ravindra, Arijit Sehanobish, Jenna L. Pappalardo, David A. Hafler, David van Dijk

    Abstract: Single-cell RNA sequencing (scRNA-seq) has revolutionized biological discovery, providing an unbiased picture of cellular heterogeneity in tissues. While scRNA-seq has been used extensively to provide insight into both healthy systems and diseases, it has not been used for disease prediction or diagnostics. Graph Attention Networks (GAT) have proven to be versatile for a wide range of tasks by lea… ▽ More

    Submitted 12 March, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: Incorporated suggestions from anonymous reviewers, Accepted at ACM CHIL 2020, comments welcome

    ACM Class: J.3; I.2.6

  12. arXiv:2002.04461  [pdf, other

    stat.ML cs.CV cs.LG q-bio.QM

    TrajectoryNet: A Dynamic Optimal Transport Network for Modeling Cellular Dynamics

    Authors: Alexander Tong, Jessie Huang, Guy Wolf, David van Dijk, Smita Krishnaswamy

    Abstract: It is increasingly common to encounter data from dynamic processes captured by static cross-sectional measurements over time, particularly in biomedical settings. Recent attempts to model individual trajectories from this data use optimal transport to create pairwise matchings between time points. However, these methods cannot model continuous dynamics and non-linear paths that entities can take i… ▽ More

    Submitted 26 July, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: Presented at ICML 2020

  13. arXiv:1907.04463  [pdf, other

    cs.HC cs.CV cs.LG q-bio.QM

    Coarse Graining of Data via Inhomogeneous Diffusion Condensation

    Authors: Nathan Brugnone, Alex Gonopolskiy, Mark W. Moyle, Manik Kuchroo, David van Dijk, Kevin R. Moon, Daniel Colon-Ramos, Guy Wolf, Matthew J. Hirn, Smita Krishnaswamy

    Abstract: Big data often has emergent structure that exists at multiple levels of abstraction, which are useful for characterizing complex interactions and dynamics of the observations. Here, we consider multiple levels of abstraction via a multiresolution geometry of data points at different granularities. To construct this geometry we define a time-inhomogeneous diffusion process that effectively condense… ▽ More

    Submitted 9 March, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: 14 pages, 7 figures

    ACM Class: I.5.3

    Journal ref: Proceedings of the 2019 IEEE International Conference on Big Data, pages 2624-2633, 2019

  14. arXiv:1902.00033  [pdf, other

    cs.LG stat.ML

    Compressed Diffusion

    Authors: Scott Gigante, Jay S. Stanley III, Ngan Vu, David van Dijk, Kevin Moon, Guy Wolf, Smita Krishnaswamy

    Abstract: Diffusion maps are a commonly used kernel-based method for manifold learning, which can reveal intrinsic structures in data and embed them in low dimensions. However, as with most kernel methods, its implementation requires a heavy computational load, reaching up to cubic complexity in the number of data points. This limits its usability in modern data analysis. Here, we present a new approach to… ▽ More

    Submitted 10 June, 2019; v1 submitted 31 January, 2019; originally announced February 2019.

    Comments: 4 pages double column, published in SampTA 2019

    Journal ref: Sampling Theory & Applications (2019)

  15. arXiv:1901.09078  [pdf, other

    cs.LG stat.ML

    Finding Archetypal Spaces Using Neural Networks

    Authors: David van Dijk, Daniel Burkhardt, Matthew Amodio, Alex Tong, Guy Wolf, Smita Krishnaswamy

    Abstract: Archetypal analysis is a data decomposition method that describes each observation in a dataset as a convex combination of "pure types" or archetypes. These archetypes represent extrema of a data space in which there is a trade-off between features, such as in biology where different combinations of traits provide optimal fitness for different environments. Existing methods for archetypal analysis… ▽ More

    Submitted 13 November, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: 9 pages, 10 figures, to be presented at IEEE Big Data 2019

  16. arXiv:1810.00424  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Interpretable Neuron Structuring with Graph Spectral Regularization

    Authors: Alexander Tong, David van Dijk, Jay S. Stanley III, Matthew Amodio, Kristina Yim, Rebecca Muhle, James Noonan, Guy Wolf, Smita Krishnaswamy

    Abstract: While neural networks are powerful approximators used to classify or embed data into lower dimensional spaces, they are often regarded as black boxes with uninterpretable features. Here we propose Graph Spectral Regularization for making hidden layers more interpretable without significantly impacting performance on the primary task. Taking inspiration from spatial organization and localization of… ▽ More

    Submitted 14 February, 2020; v1 submitted 30 September, 2018; originally announced October 2018.

    Comments: 12 pages, 6 figures, presented at IDA 2020

  17. arXiv:1805.12198  [pdf, other

    q-bio.QM

    Out-of-Sample Extrapolation with Neuron Editing

    Authors: Matthew Amodio, David van Dijk, Ruth Montgomery, Guy Wolf, Smita Krishnaswamy

    Abstract: While neural networks can be trained to map from one specific dataset to another, they usually do not learn a generalized transformation that can extrapolate accurately outside the space of training. For instance, a generative adversarial network (GAN) exclusively trained to transform images of black-haired men to blond-haired men might not have the same effect on images of black-haired women. Thi… ▽ More

    Submitted 23 January, 2019; v1 submitted 30 May, 2018; originally announced May 2018.

  18. arXiv:1802.03497  [pdf, other

    cs.LG stat.ML

    Modeling Global Dynamics from Local Snapshots with Deep Generative Neural Networks

    Authors: Scott Gigante, David van Dijk, Kevin Moon, Alexander Strzalkowski, Guy Wolf, Smita Krishnaswamy

    Abstract: Complex high dimensional stochastic dynamic systems arise in many applications in the natural sciences and especially biology. However, while these systems are difficult to describe analytically, "snapshot" measurements that sample the output of the system are often available. In order to model the dynamics of such systems given snapshot data, or local transitions, we present a deep neural network… ▽ More

    Submitted 10 June, 2019; v1 submitted 9 February, 2018; originally announced February 2018.

    Comments: Published in SampTA 2019

    Journal ref: Sampling Theory & Applications (2019)

  19. An expanded evaluation of protein function prediction methods shows an improvement in accuracy

    Authors: Yuxiang Jiang, Tal Ronnen Oron, Wyatt T Clark, Asma R Bankapur, Daniel D'Andrea, Rosalba Lepore, Christopher S Funk, Indika Kahanda, Karin M Verspoor, Asa Ben-Hur, Emily Koo, Duncan Penfold-Brown, Dennis Shasha, Noah Youngs, Richard Bonneau, Alexandra Lin, Sayed ME Sahraeian, Pier Luigi Martelli, Giuseppe Profiti, Rita Casadio, Renzhi Cao, Zhaolong Zhong, Jianlin Cheng, Adrian Altenhoff, Nives Skunca , et al. (122 additional authors not shown)

    Abstract: Background: The increasing volume and variety of genotypic and phenotypic data is a major defining characteristic of modern biomedical sciences. At the same time, the limitations in technology for generating data and the inherently stochastic nature of biomolecular events have led to the discrepancy between the volume of data and the amount of knowledge gleaned from it. A major bottleneck in our a… ▽ More

    Submitted 2 January, 2016; originally announced January 2016.

    Comments: Submitted to Genome Biology