Skip to main content

Showing 1–21 of 21 results for author: Bose, A J

.
  1. arXiv:2405.20313  [pdf, other

    cs.LG q-bio.BM

    Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation

    Authors: Guillaume Huguet, James Vuckovic, Kilian Fatras, Eric Thibodeau-Laufer, Pablo Lemos, Riashat Islam, Cheng-Hao Liu, Jarrid Rector-Brooks, Tara Akhound-Sadegh, Michael Bronstein, Alexander Tong, Avishek Joey Bose

    Abstract: Proteins are essential for almost all biological processes and derive their diverse functions from complex 3D structures, which are in turn determined by their amino acid sequences. In this paper, we exploit the rich biological inductive bias of amino acid sequences and introduce FoldFlow-2, a novel sequence-conditioned SE(3)-equivariant flow matching model for protein structure generation. FoldFl… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: preprint

  2. arXiv:2405.14780  [pdf, other

    cs.LG stat.ML

    Metric Flow Matching for Smooth Interpolations on the Data Manifold

    Authors: Kacper Kapusniak, Peter Potaptchik, Teodora Reu, Leo Zhang, Alexander Tong, Michael Bronstein, Avishek Joey Bose, Francesco Di Giovanni

    Abstract: Matching objectives underpin the success of modern generative models and rely on constructing conditional paths that transform a source distribution into a target distribution. Despite being a fundamental building block, conditional paths have been designed principally under the assumption of Euclidean geometry, resulting in straight interpolations. However, this can be particularly restrictive fo… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2405.14664  [pdf, other

    cs.LG cs.AI

    Fisher Flow Matching for Generative Modeling over Discrete Data

    Authors: Oscar Davis, Samuel Kessler, Mircea Petrache, İsmail İlkan Ceylan, Michael Bronstein, Avishek Joey Bose

    Abstract: Generative modeling over discrete data has recently seen numerous success stories, with applications spanning language modeling, biological sequence design, and graph-structured molecular data. The predominant generative modeling paradigm for discrete data is still autoregressive, with more recent alternatives based on diffusion or flow-matching falling short of their impressive performance in con… ▽ More

    Submitted 28 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Preprint, Under Review

  4. arXiv:2402.06121  [pdf, other

    cs.LG stat.ML

    Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

    Authors: Tara Akhound-Sadegh, Jarrid Rector-Brooks, Avishek Joey Bose, Sarthak Mittal, Pablo Lemos, Cheng-Hao Liu, Marcin Sendera, Siamak Ravanbakhsh, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Alexander Tong

    Abstract: Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-body systems, is a foundational problem in science. In this paper, we propose Iterated Denoising Energy Matching (iDEM), an iterative algorithm that uses a novel stochastic score matching objective leveraging solely the energy function and its gradient -- and… ▽ More

    Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024. Code for iDEM is available at https://github.com/jarridrb/dem

  5. arXiv:2310.02391  [pdf, other

    cs.LG cs.AI

    SE(3)-Stochastic Flow Matching for Protein Backbone Generation

    Authors: Avishek Joey Bose, Tara Akhound-Sadegh, Guillaume Huguet, Kilian Fatras, Jarrid Rector-Brooks, Cheng-Hao Liu, Andrei Cristian Nica, Maksym Korablyov, Michael Bronstein, Alexander Tong

    Abstract: The computational design of novel protein structures has the potential to impact numerous scientific disciplines greatly. Toward this goal, we introduce FoldFlow, a series of novel generative models of increasing modeling power based on the flow-matching paradigm over $3\mathrm{D}$ rigid motions -- i.e. the group $\text{SE}(3)$ -- enabling accurate modeling of protein backbones. We first introduce… ▽ More

    Submitted 11 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Spotlight

  6. arXiv:2310.00429  [pdf, other

    cs.LG stat.ML

    On the Stability of Iterative Retraining of Generative Models on their own Data

    Authors: Quentin Bertrand, Avishek Joey Bose, Alexandre Duplessis, Marco Jiralerspong, Gauthier Gidel

    Abstract: Deep generative models have made tremendous progress in modeling complex data, often exhibiting generation quality that surpasses a typical human's ability to discern the authenticity of samples. Undeniably, a key driver of this success is enabled by the massive amounts of web-scale data consumed by these models. Due to these models' striking performance and ease of availability, the web will inev… ▽ More

    Submitted 2 April, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

  7. arXiv:2302.04440  [pdf, other

    cs.LG cs.CV

    Feature Likelihood Divergence: Evaluating the Generalization of Generative Models Using Samples

    Authors: Marco Jiralerspong, Avishek Joey Bose, Ian Gemp, Chongli Qin, Yoram Bachrach, Gauthier Gidel

    Abstract: The past few years have seen impressive progress in the development of deep generative models capable of producing high-dimensional, complex, and photo-realistic data. However, current methods for evaluating such models remain incomplete: standard likelihood-based metrics do not always apply and rarely correlate with perceptual fidelity, while sample-based metrics, such as FID, are insensitive to… ▽ More

    Submitted 12 March, 2024; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: FLD code: https://github.com/marcojira/fld

  8. arXiv:2208.07949  [pdf, other

    cs.LG

    Riemannian Diffusion Models

    Authors: Chin-Wei Huang, Milad Aghajohari, Avishek Joey Bose, Prakash Panangaden, Aaron Courville

    Abstract: Diffusion models are recent state-of-the-art methods for image generation and likelihood estimation. In this work, we generalize continuous-time diffusion models to arbitrary Riemannian manifolds and derive a variational framework for likelihood estimation. Computationally, we propose new methods for computing the Riemannian divergence which is needed in the likelihood estimation. Moreover, in gen… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

  9. arXiv:2110.08649  [pdf, other

    cs.LG cs.AI

    Equivariant Finite Normalizing Flows

    Authors: Avishek Joey Bose, Marcus Brubaker, Ivan Kobyzev

    Abstract: Generative modeling seeks to uncover the underlying factors that give rise to observed data that can often be modeled as the natural symmetries that manifest themselves through invariances and equivariances to certain transformation laws. However, current approaches to representing these symmetries are couched in the formalism of continuous normalizing flows that require the construction of equiva… ▽ More

    Submitted 12 August, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: Preprint

  10. arXiv:2104.08455  [pdf, other

    cs.CL

    Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding

    Authors: Nouha Dziri, Andrea Madotto, Osmar Zaiane, Avishek Joey Bose

    Abstract: Dialogue systems powered by large pre-trained language models (LM) exhibit an innate ability to deliver fluent and natural-looking responses. Despite their impressive generation performance, these models can often generate factually incorrect statements impeding their widespread adoption. In this paper, we focus on the task of improving the faithfulness -- and thus reduce hallucination -- of Neura… ▽ More

    Submitted 14 September, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 18 pages

  11. arXiv:2103.02014  [pdf, other

    cs.LG cs.CR cs.DS

    Online Adversarial Attacks

    Authors: Andjela Mladenovic, Avishek Joey Bose, Hugo Berard, William L. Hamilton, Simon Lacoste-Julien, Pascal Vincent, Gauthier Gidel

    Abstract: Adversarial attacks expose important vulnerabilities of deep learning models, yet little attention has been paid to settings where data arrives as a stream. In this paper, we formalize the online adversarial attack problem, emphasizing two key elements found in real-world use-cases: attackers must operate under partial knowledge of the target model, and the decisions made by the attacker are irrev… ▽ More

    Submitted 22 March, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: ICLR 2022

  12. arXiv:2009.11355  [pdf, other

    cs.LG cs.CL stat.ML

    Structure Aware Negative Sampling in Knowledge Graphs

    Authors: Kian Ahrabian, Aarash Feizi, Yasmin Salehi, William L. Hamilton, Avishek Joey Bose

    Abstract: Learning low-dimensional representations for entities and relations in knowledge graphs using contrastive estimation represents a scalable and effective method for inferring connectivity patterns. A crucial aspect of contrastive learning approaches is the choice of corruption distribution that generates hard negative samples, which force the embedding model to learn discriminative representations… ▽ More

    Submitted 6 October, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: Accepted to EMNLP 2020. Camera-ready submission

  13. arXiv:2007.00720  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Adversarial Example Games

    Authors: Avishek Joey Bose, Gauthier Gidel, Hugo Berard, Andre Cianflone, Pascal Vincent, Simon Lacoste-Julien, William L. Hamilton

    Abstract: The existence of adversarial examples capable of fooling trained neural network classifiers calls for a much better understanding of possible attacks to guide the development of safeguards against them. This includes attack methods in the challenging non-interactive blackbox setting, where adversarial attacks are generated without any access, including queries, to the target model. Prior attacks i… ▽ More

    Submitted 8 January, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Appears in: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  14. arXiv:2002.06336  [pdf, other

    cs.LG stat.ML

    Latent Variable Modelling with Hyperbolic Normalizing Flows

    Authors: Avishek Joey Bose, Ariella Smofsky, Renjie Liao, Prakash Panangaden, William L. Hamilton

    Abstract: The choice of approximate posterior distributions plays a central role in stochastic variational inference (SVI). One effective solution is the use of normalizing flows \cut{defined on Euclidean spaces} to construct flexible posterior distributions. However, one key limitation of existing normalizing flows is that they are restricted to the Euclidean space and are ill-equipped to model data with a… ▽ More

    Submitted 13 August, 2020; v1 submitted 15 February, 2020; originally announced February 2020.

    Comments: Preprint, work under review

  15. arXiv:1912.09867  [pdf, other

    cs.LG cs.SI stat.ML

    Meta-Graph: Few Shot Link Prediction via Meta Learning

    Authors: Avishek Joey Bose, Ankit Jain, Piero Molino, William L. Hamilton

    Abstract: We consider the task of few shot link prediction on graphs. The goal is to learn from a distribution over graphs so that a model is able to quickly infer missing edges in a new graph after a small amount of training. We show that current link prediction methods are generally ill-equipped to handle this task. They cannot effectively transfer learned knowledge from one graph to another and are unabl… ▽ More

    Submitted 1 March, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

  16. arXiv:1906.02771  [pdf, other

    cs.LG cs.AI stat.ML

    Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies

    Authors: Patrick Nadeem Ward, Ariella Smofsky, Avishek Joey Bose

    Abstract: Deep Reinforcement Learning (DRL) algorithms for continuous action spaces are known to be brittle toward hyperparameters as well as \cut{being}sample inefficient. Soft Actor Critic (SAC) proposes an off-policy deep actor critic algorithm within the maximum entropy RL framework which offers greater stability and empirical gains. The choice of policy distribution, a factored Gaussian, is motivated b… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

    Comments: INNF workshop, International Conference on Machine Learning 2019, Long Beach CA, USA

  17. arXiv:1905.11912  [pdf, other

    cs.CL

    A Cross-Domain Transferable Neural Coherence Model

    Authors: Peng Xu, Hamidreza Saghir, ** Sung Kang, Teng Long, Avishek Joey Bose, Yanshuai Cao, Jackie Chi Kit Cheung

    Abstract: Coherence is an important aspect of text quality and is crucial for ensuring its readability. One important limitation of existing coherence models is that training on one domain does not easily generalize to unseen categories of text. Previous work advocates for generative models for cross-domain generalization, because for discriminative models, the space of incoherent sentence orderings to disc… ▽ More

    Submitted 9 July, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Accepted at ACL 2019

  18. arXiv:1905.10864  [pdf, other

    cs.LG cs.CR stat.ML

    Generalizable Adversarial Attacks with Latent Variable Perturbation Modelling

    Authors: Avishek Joey Bose, Andre Cianflone, William L. Hamilton

    Abstract: Adversarial attacks on deep neural networks traditionally rely on a constrained optimization paradigm, where an optimization procedure is used to obtain a single adversarial perturbation for a given input example. In this work we frame the problem as learning a distribution of adversarial perturbations, enabling us to generate diverse adversarial distributions given an unperturbed input. We show t… ▽ More

    Submitted 20 January, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  19. arXiv:1905.10674  [pdf, other

    cs.LG cs.AI stat.ML

    Compositional Fairness Constraints for Graph Embeddings

    Authors: Avishek Joey Bose, William L. Hamilton

    Abstract: Learning high-quality node embeddings is a key building block for machine learning models that operate on graph data, such as social networks and recommender systems. However, existing graph embedding techniques are unable to cope with fairness constraints, e.g., ensuring that the learned representations do not correlate with certain attributes, such as age or gender. Here, we introduce an adversa… ▽ More

    Submitted 16 July, 2019; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97, 2019

  20. arXiv:1805.12302  [pdf, other

    cs.CV cs.LG

    Adversarial Attacks on Face Detectors using Neural Net based Constrained Optimization

    Authors: Avishek Joey Bose, Parham Aarabi

    Abstract: Adversarial attacks involve adding, small, often imperceptible, perturbations to inputs with the goal of getting a machine learning model to misclassifying them. While many different adversarial attack strategies have been proposed on image classification models, object detection pipelines have been much harder to break. In this paper, we propose a novel strategy to craft adversarial examples by s… ▽ More

    Submitted 30 May, 2018; originally announced May 2018.

    Comments: Accepted to IEEE MMSP

  21. arXiv:1805.03642  [pdf, other

    cs.CL cs.AI cs.LG

    Adversarial Contrastive Estimation

    Authors: Avishek Joey Bose, Huan Ling, Yanshuai Cao

    Abstract: Learning by contrasting positive and negative samples is a general strategy adopted by many methods. Noise contrastive estimation (NCE) for word embeddings and translating embeddings for knowledge graphs are examples in NLP employing this approach. In this work, we view contrastive learning as an abstraction of all such methods and augment the negative sampler into a mixture distribution containin… ▽ More

    Submitted 2 August, 2018; v1 submitted 9 May, 2018; originally announced May 2018.

    Comments: Association for Computational Linguistics, 2018