Skip to main content

Showing 1–50 of 103 results for author: Jain, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20971  [pdf, other

    cs.LG cs.CV

    Amortizing intractable inference in diffusion models for vision, language, and control

    Authors: Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, $\mathbf{x}\sim p^{\rm post}(\mathbf{x})\propto p(\mathbf{x})r(\mathbf{x})$, in a model that consists of a diffusion generat… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Code: https://github.com/GFNOrg/diffusion-finetuning

  2. arXiv:2405.18540  [pdf, other

    cs.CL cs.CR cs.LG

    Learning diverse attacks on large language models for robust red-teaming and safety tuning

    Authors: Seanie Lee, Minsu Kim, Lynn Cherif, David Dobre, Juho Lee, Sung Ju Hwang, Kenji Kawaguchi, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Moksh Jain

    Abstract: Red-teaming, or identifying prompts that elicit harmful responses, is a critical step in ensuring the safe and responsible deployment of large language models (LLMs). Develo** effective protection against many modes of attack prompts requires discovering diverse attacks. Automated red-teaming typically uses reinforcement learning to fine-tune an attacker language model to generate prompts that e… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.13546  [pdf, other

    cs.CL cs.IR

    Knowledge-Driven Cross-Document Relation Extraction

    Authors: Monika Jain, Raghava Mutharaju, Kuldeep Singh, Ramakanth Kavuluru

    Abstract: Relation extraction (RE) is a well-known NLP application often treated as a sentence- or document-level task. However, a handful of recent efforts explore it across documents or in the cross-document setting (CrossDocRE). This is distinct from the single document case because different documents often focus on disparate themes, while text within a document tends to have a single goal. Linking find… ▽ More

    Submitted 18 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted in ACL 2024 Findings

  4. arXiv:2405.01616  [pdf, other

    q-bio.BM cs.AI cs.LG

    Generative Active Learning for the Search of Small-molecule Protein Binders

    Authors: Maksym Korablyov, Cheng-Hao Liu, Moksh Jain, Almer M. van der Sloot, Eric Jolicoeur, Edward Ruediger, Andrei Cristian Nica, Emmanuel Bengio, Kostiantyn Lapchevskyi, Daniel St-Cyr, Doris Alexandra Schuetz, Victor Ion Butoi, Jarrid Rector-Brooks, Simon Blackburn, Leo Feng, Hadi Nekoei, SaiKrishna Gottipati, Priyesh Vijayan, Prateek Gupta, Ladislav Rampášek, Sasikanth Avancha, Pierre-Luc Bacon, William L. Hamilton, Brooks Paige, Sanchit Misra , et al. (9 additional authors not shown)

    Abstract: Despite substantial progress in machine learning for scientific discovery in recent years, truly de novo design of small molecules which exhibit a property of interest remains a significant challenge. We introduce LambdaZero, a generative active learning approach to search for synthesizable molecules. Powered by deep reinforcement learning, LambdaZero learns to search over the vast space of molecu… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  5. arXiv:2404.10094  [pdf, other

    cs.LG q-bio.QM

    Towards DNA-Encoded Library Generation with GFlowNets

    Authors: Michał Koziarski, Mohammed Abukalam, Vedant Shah, Louis Vaillancourt, Doris Alexandra Schuetz, Moksh Jain, Almer van der Sloot, Mathieu Bourgey, Anne Marinier, Yoshua Bengio

    Abstract: DNA-encoded libraries (DELs) are a powerful approach for rapidly screening large numbers of diverse compounds. One of the key challenges in using DELs is library design, which involves choosing the building blocks that will be combinatorially combined to produce the final library. In this paper we consider the task of protein-protein interaction (PPI) biased DEL design. To this end, we evaluate se… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  6. arXiv:2404.08423  [pdf, other

    cs.LG physics.soc-ph q-bio.PE

    SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Develo** Economies

    Authors: Maeghal Jain, Ziya Uddin, Wubshet Ibrahim

    Abstract: The outbreak of COVID-19 has highlighted the intricate interplay between public health and economic stability on a global scale. This study proposes a novel reinforcement learning framework designed to optimize health and economic outcomes during pandemics. The framework leverages the SIR model, integrating both lockdown measures (via a stringency index) and vaccination strategies to simulate dise… ▽ More

    Submitted 30 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 27 pages, 12 figures

  7. arXiv:2402.04620  [pdf, other

    cs.HC cs.LG

    CataractBot: An LLM-Powered Expert-in-the-Loop Chatbot for Cataract Patients

    Authors: Pragnya Ramjee, Bhuvan Sachdeva, Satvik Golechha, Shreyas Kulkarni, Geeta Fulari, Kaushik Murali, Mohit Jain

    Abstract: The healthcare landscape is evolving, with patients seeking more reliable information about their health conditions, treatment options, and potential risks. Despite the abundance of information sources, the digital age overwhelms individuals with excess, often inaccurate information. Patients primarily trust doctors and hospital staff, highlighting the need for expert-endorsed health information.… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  8. arXiv:2401.11800  [pdf, other

    cs.IR

    Revisiting Document-Level Relation Extraction with Context-Guided Link Prediction

    Authors: Monika Jain, Raghava Mutharaju, Ramakanth Kavuluru, Kuldeep Singh

    Abstract: Document-level relation extraction (DocRE) poses the challenge of identifying relationships between entities within a document as opposed to the traditional RE setting where a single sentence is input. Existing approaches rely on logical reasoning or contextual cues from entities. This paper reframes document-level RE as link prediction over a knowledge graph with distinct benefits: 1) Our approac… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted in AAAI 2024

  9. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  10. arXiv:2310.08774  [pdf, other

    q-bio.PE cs.LG stat.ML

    PhyloGFN: Phylogenetic inference with generative flow networks

    Authors: Mingyang Zhou, Zichao Yan, Elliot Layne, Nikolay Malkin, Dinghuai Zhang, Moksh Jain, Mathieu Blanchette, Yoshua Bengio

    Abstract: Phylogenetics is a branch of computational biology that studies the evolutionary relationships among biological entities. Its long history and numerous applications notwithstanding, inference of phylogenetic trees from sequence data remains challenging: the high complexity of tree space poses a significant obstacle for the current combinatorial and probabilistic techniques. In this paper, we adopt… ▽ More

    Submitted 24 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  11. arXiv:2310.04363  [pdf, other

    cs.LG cs.CL

    Amortizing intractable inference in large language models

    Authors: Edward J. Hu, Moksh Jain, Eric Elmoznino, Younesse Kaddar, Guillaume Lajoie, Yoshua Bengio, Nikolay Malkin

    Abstract: Autoregressive large language models (LLMs) compress knowledge from their training data through next-token conditional distributions. This limits tractable querying of this knowledge to start-to-end autoregressive sampling. However, many tasks of interest -- including sequence continuation, infilling, and other forms of constrained generation -- involve sampling from intractable posterior distribu… ▽ More

    Submitted 13 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; 23 pages; code: https://github.com/GFNOrg/gfn-lm-tuning

  12. arXiv:2310.03419  [pdf, other

    cs.LG cs.AI

    Pre-Training and Fine-Tuning Generative Flow Networks

    Authors: Ling Pan, Moksh Jain, Kanika Madan, Yoshua Bengio

    Abstract: Generative Flow Networks (GFlowNets) are amortized samplers that learn stochastic policies to sequentially generate compositional objects from a given unnormalized reward distribution. They can generate diverse sets of high-reward objects, which is an important consideration in scientific discovery tasks. However, as they are typically trained from a given extrinsic reward function, it remains an… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  13. arXiv:2309.09495  [pdf, other

    cs.HC cs.SE

    PwR: Exploring the Role of Representations in Conversational Programming

    Authors: Pradyumna YM, Vinod Ganesan, Dinesh Kumar Arumugam, Meghna Gupta, Nischith Shadagopan, Tanay Dixit, Sameer Segal, Pratyush Kumar, Mohit Jain, Sriram Rajamani

    Abstract: Large Language Models (LLMs) have revolutionized programming and software engineering. AI programming assistants such as GitHub Copilot X enable conversational programming, narrowing the gap between human intent and code generation. However, prior literature has identified a key challenge--there is a gap between user's mental model of the system's understanding after a sequence of natural language… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 23 pages, 3 figures, 2 tables, under submission for ACM CHI 2024

    ACM Class: H.5.2

  14. arXiv:2309.06386  [pdf, other

    eess.IV cs.CV

    Lung Diseases Image Segmentation using Faster R-CNNs

    Authors: Mihir Jain

    Abstract: Lung diseases are a leading cause of child mortality in the develo** world, with India accounting for approximately half of global pneumonia deaths (370,000) in 2016. Timely diagnosis is crucial for reducing mortality rates. This paper introduces a low-density neural network structure to mitigate topological challenges in deep networks. The network incorporates parameters into a feature pyramid,… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  15. arXiv:2309.06082  [pdf, other

    cs.LG cs.AI cs.CY

    A Machine Learning Framework to Deconstruct the Primary Drivers for Electricity Market Price Events

    Authors: Milan Jain, Xueqing Sun, Sohom Datta, Abhishek Somani

    Abstract: Power grids are moving towards 100% renewable energy source bulk power grids, and the overall dynamics of power system operations and electricity markets are changing. The electricity markets are not only dispatching resources economically but also taking into account various controllable actions like renewable curtailment, transmission congestion mitigation, and energy storage optimization to ens… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Published in IEEE PES GM 2023

  16. arXiv:2309.01370  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    ReOnto: A Neuro-Symbolic Approach for Biomedical Relation Extraction

    Authors: Monika Jain, Kuldeep Singh, Raghava Mutharaju

    Abstract: Relation Extraction (RE) is the task of extracting semantic relationships between entities in a sentence and aligning them to relations defined in a vocabulary, which is generally in the form of a Knowledge Graph (KG) or an ontology. Various approaches have been proposed so far to address this task. However, applying these techniques to biomedical text often yields unsatisfactory results because i… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted in ECML 2023

  17. arXiv:2308.09726  [pdf, other

    cs.LG cs.AI cs.CY cs.MA

    Equitable Restless Multi-Armed Bandits: A General Framework Inspired By Digital Health

    Authors: Jackson A. Killian, Manish Jain, Yugang Jia, Jonathan Amar, Erich Huang, Milind Tambe

    Abstract: Restless multi-armed bandits (RMABs) are a popular framework for algorithmic decision making in sequential settings with limited resources. RMABs are increasingly being used for sensitive decisions such as in public health, treatment scheduling, anti-poaching, and -- the motivation for this work -- digital health. For such high stakes settings, decisions must both improve outcomes and prevent disp… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 16 pages, 8 figures, 2 tables

  18. arXiv:2306.17693  [pdf, other

    cs.LG

    Thompson sampling for improved exploration in GFlowNets

    Authors: Jarrid Rector-Brooks, Kanika Madan, Moksh Jain, Maksym Korablyov, Cheng-Hao Liu, Sarath Chandar, Nikolay Malkin, Yoshua Bengio

    Abstract: Generative flow networks (GFlowNets) are amortized variational inference algorithms that treat sampling from a distribution over compositional objects as a sequential decision-making problem with a learnable action policy. Unlike other algorithms for hierarchical sampling that optimize a variational bound, GFlowNet algorithms can stably run off-policy, which can be advantageous for discovering mod… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Structured Probabilistic Inference and Generative Modeling (SPIGM) workshop @ ICML 2023

  19. arXiv:2306.15058  [pdf, other

    cs.LG stat.ML

    BatchGFN: Generative Flow Networks for Batch Active Learning

    Authors: Shreshth A. Malik, Salem Lahlou, Andrew Jesson, Moksh Jain, Nikolay Malkin, Tristan Deleu, Yoshua Bengio, Yarin Gal

    Abstract: We introduce BatchGFN -- a novel approach for pool-based active learning that uses generative flow networks to sample sets of data points proportional to a batch reward. With an appropriate reward function to quantify the utility of acquiring a batch, such as the joint mutual information between the batch and the model parameters, BatchGFN is able to construct highly informative batches for active… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted at the Structured Probabilistic Inference & Generative Modeling workshop, ICML 2023

  20. arXiv:2306.14939  [pdf, other

    cs.CL cs.LG

    The Art of Embedding Fusion: Optimizing Hate Speech Detection

    Authors: Mohammad Aflah Khan, Neemesh Yadav, Mohit Jain, Sanyam Goyal

    Abstract: Hate speech detection is a challenging natural language processing task that requires capturing linguistic and contextual nuances. Pre-trained language models (PLMs) offer rich semantic representations of text that can improve this task. However there is still limited knowledge about ways to effectively combine representations across PLMs and leverage their complementary strengths. In this work, w… ▽ More

    Submitted 8 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Published as a Tiny Paper at ICLR 2023, 12 Pages

  21. arXiv:2306.11715  [pdf, other

    cs.LG q-bio.BM

    Multi-Fidelity Active Learning with GFlowNets

    Authors: Alex Hernandez-Garcia, Nikita Saxena, Moksh Jain, Cheng-Hao Liu, Yoshua Bengio

    Abstract: In the last decades, the capacity to generate large amounts of data in science and engineering applications has been growing steadily. Meanwhile, the progress in machine learning has turned it into a suitable tool to process and utilise the available data. Nonetheless, many relevant scientific and engineering problems present challenges where current machine learning methods cannot yet efficiently… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Code: https://github.com/nikita-0209/mf-al-gfn

  22. arXiv:2305.09600  [pdf, other

    cs.AI cs.LG

    Deep Reinforcement Learning to Maximize Arterial Usage during Extreme Congestion

    Authors: Ashutosh Dutta, Milan Jain, Arif Khan, Arun Sathanur

    Abstract: Collisions, crashes, and other incidents on road networks, if left unmitigated, can potentially cause cascading failures that can affect large parts of the system. Timely handling such extreme congestion scenarios is imperative to reduce emissions, enhance productivity, and improve the quality of urban living. In this work, we propose a Deep Reinforcement Learning (DRL) approach to reduce traffic… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  23. arXiv:2305.07552  [pdf, other

    cs.CV cs.AI cs.CY

    Dish detection in food platters: A framework for automated diet logging and nutrition management

    Authors: Mansi Goel, Shashank Dargar, Shounak Ghatak, Nidhi Verma, Pratik Chauhan, Anushka Gupta, Nikhila Vishnumolakala, Hareesh Amuru, Ekta Gambhir, Ronak Chhajed, Meenal Jain, Astha Jain, Samiksha Garg, Nitesh Narwade, Nikhilesh Verhwani, Abhuday Tiwari, Kirti Vashishtha, Ganesh Bagler

    Abstract: Diet is central to the epidemic of lifestyle disorders. Accurate and effortless diet logging is one of the significant bottlenecks for effective diet management and calorie restriction. Dish detection from food platters is a challenging problem due to a visually complex food layout. We present an end-to-end computational framework for diet management, from data compilation, annotation, and state-o… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 11 pages, 5 figures, 5 tables. Submitted to the 8th International Conference on Computer Vision & Image Processing (CVIP-2023)

    ACM Class: I.4.9; I.5.4; J.3

  24. arXiv:2304.14916  [pdf, other

    eess.SP cs.AI cs.HC cs.LG

    "Can't Take the Pressure?": Examining the Challenges of Blood Pressure Estimation via Pulse Wave Analysis

    Authors: Suril Mehta, Nipun Kwatra, Mohit Jain, Daniel McDuff

    Abstract: The use of observed wearable sensor data (e.g., photoplethysmograms [PPG]) to infer health measures (e.g., glucose level or blood pressure) is a very active area of research. Such technology can have a significant impact on health screening, chronic disease management and remote monitoring. A common approach is to collect sensor data and corresponding labels from a clinical grade device (e.g., blo… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  25. Understanding Journalists' Workflows in News Curation

    Authors: Shubham Atreja, Shruthi Srinath, Mohit Jain, Joyojeet Pal

    Abstract: With the increasing dominance of the internet as a source of news consumption, there has been a rise in the production and popularity of email newsletters compiled by individual journalists. However, there is little research on the processes of aggregation, and how these differ between expert journalists and trained machines. In this paper, we interviewed journalists who curate newsletters from ar… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

    Comments: accepted at CHI'23

  26. arXiv:2303.03677  [pdf, other

    cs.CY cs.AI cs.LG

    Training Machine Learning Models to Characterize Temporal Evolution of Disadvantaged Communities

    Authors: Milan Jain, Narmadha Meenu Mohankumar, Heng Wan, Sumitrra Ganguly, Kyle D Wilson, David M Anderson

    Abstract: Disadvantaged communities (DAC), as defined by the Justice40 initiative of the Department of Energy (DOE), USA, identifies census tracts across the USA to determine where benefits of climate and energy investments are or are not currently accruing. The DAC status not only helps in determining the eligibility for future Justice40-related investments but is also critical for exploring ways to achiev… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  27. MFBE: Leveraging Multi-Field Information of FAQs for Efficient Dense Retrieval

    Authors: Debopriyo Banerjee, Mausam Jain, Ashish Kulkarni

    Abstract: In the domain of question-answering in NLP, the retrieval of Frequently Asked Questions (FAQ) is an important sub-area which is well researched and has been worked upon for many languages. Here, in response to a user query, a retrieval system typically returns the relevant FAQs from a knowledge-base. The efficacy of such a system depends on its ability to establish semantic match between the query… ▽ More

    Submitted 21 March, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: The first two authors contributed equally to this work. 12 pages, 3 figures, 5 tables. Accepted at the 2023 Pacific-Asia Conference On Knowledge Discovery And Data Mining (PAKDD)

  28. arXiv:2302.09465  [pdf, other

    cs.LG

    Stochastic Generative Flow Networks

    Authors: Ling Pan, Dinghuai Zhang, Moksh Jain, Longbo Huang, Yoshua Bengio

    Abstract: Generative Flow Networks (or GFlowNets for short) are a family of probabilistic agents that learn to sample complex combinatorial structures through the lens of "inference as control". They have shown great potential in generating high-quality and diverse candidates from a given energy landscape. However, existing GFlowNets can be applied only to deterministic environments, and fail in more genera… ▽ More

    Submitted 24 June, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: UAI 2023

  29. arXiv:2302.06576  [pdf, other

    cs.LG stat.ML

    GFlowNet-EM for learning compositional latent variable models

    Authors: Edward J. Hu, Nikolay Malkin, Moksh Jain, Katie Everett, Alexandros Graikos, Yoshua Bengio

    Abstract: Latent variable models (LVMs) with discrete compositional latents are an important but challenging setting due to a combinatorially large number of possible configurations of the latents. A key tradeoff in modeling the posteriors over latents is between expressivity and tractable optimization. For algorithms based on expectation-maximization (EM), the E-step is often intractable without restrictiv… ▽ More

    Submitted 3 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: ICML 2023; code: https://github.com/GFNOrg/GFlowNet-EM

  30. GFlowNets for AI-Driven Scientific Discovery

    Authors: Moksh Jain, Tristan Deleu, Jason Hartford, Cheng-Hao Liu, Alex Hernandez-Garcia, Yoshua Bengio

    Abstract: Tackling the most pressing problems for humanity, such as the climate crisis and the threat of global pandemics, requires accelerating the pace of scientific discovery. While science has traditionally relied on trial and error and even serendipity to a large extent, the last few decades have seen a surge of data-driven scientific discoveries. However, in order to truly leverage large-scale data se… ▽ More

    Submitted 27 June, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 31 pages, 5 figures. Updated with camera-ready changes

  31. arXiv:2301.02872  [pdf, other

    cs.LG

    Machine Learning to Estimate Gross Loss of Jewelry for Wax Patterns

    Authors: Mihir Jain, Kashish Jain, Sandip Mane

    Abstract: In mass manufacturing of jewellery, the gross loss is estimated before manufacturing to calculate the wax weight of the pattern that would be investment casted to make multiple identical pieces of jewellery. Machine learning is a technology that is a part of AI which helps create a model with decision-making capabilities based on a large set of user-defined data. In this paper, the authors found a… ▽ More

    Submitted 7 January, 2023; originally announced January 2023.

  32. arXiv:2211.12374  [pdf, other

    cs.CL cs.LG

    An Emotion-Aware Multi-Task Approach to Fake News and Rumour Detection using Transfer Learning

    Authors: Arjun Choudhry, Inder Khatri, Minni Jain, Dinesh Kumar Vishwakarma

    Abstract: Social networking sites, blogs, and online articles are instant sources of news for internet users globally. However, in the absence of strict regulations mandating the genuineness of every text on social media, it is probable that some of these texts are fake news or rumours. Their deceptive nature and ability to propagate instantly can have an adverse effect on society. This necessitates the nee… ▽ More

    Submitted 7 December, 2022; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted in IEEE Transaction on Computational Social Systems 18 pages 5 figures

  33. arXiv:2211.01885  [pdf, other

    eess.IV cs.CV q-bio.QM

    Using U-Net Network for Efficient Brain Tumor Segmentation in MRI Images

    Authors: Jason Walsh, Alice Othmani, Mayank Jain, Soumyabrata Dev

    Abstract: Magnetic Resonance Imaging (MRI) is the most commonly used non-intrusive technique for medical image acquisition. Brain tumor segmentation is the process of algorithmically identifying tumors in brain MRI scans. While many approaches have been proposed in the literature for brain tumor segmentation, this paper proposes a lightweight implementation of U-Net. Apart from providing real-time segmentat… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: Published in Healthcare Analytics, 2022

  34. arXiv:2211.00568  [pdf, other

    cs.LG cs.AI

    Consistent Training via Energy-Based GFlowNets for Modeling Discrete Joint Distributions

    Authors: Chanakya Ekbote, Moksh Jain, Payel Das, Yoshua Bengio

    Abstract: Generative Flow Networks (GFlowNets) have demonstrated significant performance improvements for generating diverse discrete objects $x$ given a reward function $R(x)$, indicating the utility of the object and trained independently from the GFlowNet by supervised learning to predict a desirable property $y$ given $x$. We hypothesize that this can lead to incompatibility between the inductive optimi… ▽ More

    Submitted 2 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: 9 Pages, 10 Figures

  35. arXiv:2211.00112  [pdf, other

    cs.MA cs.AI cs.LG math.OC

    Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits

    Authors: Abheek Ghosh, Dheeraj Nagaraj, Manish Jain, Milind Tambe

    Abstract: We study the problem of planning restless multi-armed bandits (RMABs) with multiple actions. This is a popular model for multi-agent systems with applications like multi-channel communication, monitoring and machine maintenance tasks, and healthcare. Whittle index policies, which are based on Lagrangian relaxations, are widely used in these settings due to their simplicity and near-optimality unde… ▽ More

    Submitted 28 February, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

    Comments: 21 pages; AAMAS'23 version with appendix

  36. arXiv:2210.12928  [pdf, other

    cs.LG cs.AI

    GFlowOut: Dropout with Generative Flow Networks

    Authors: Dianbo Liu, Moksh Jain, Bonaventure Dossou, Qianli Shen, Salem Lahlou, Anirudh Goyal, Nikolay Malkin, Chris Emezue, Dinghuai Zhang, Nadhir Hassen, Xu Ji, Kenji Kawaguchi, Yoshua Bengio

    Abstract: Bayesian Inference offers principled tools to tackle many critical problems with modern neural networks such as poor calibration and generalization, and data inefficiency. However, scaling Bayesian inference to large architectures is challenging and requires restrictive approximations. Monte Carlo Dropout has been widely used as a relatively cheap way for approximate Inference and to estimate unce… ▽ More

    Submitted 23 June, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

  37. arXiv:2210.12765  [pdf, other

    cs.LG stat.ML

    Multi-Objective GFlowNets

    Authors: Moksh Jain, Sharath Chandra Raparthy, Alex Hernandez-Garcia, Jarrid Rector-Brooks, Yoshua Bengio, Santiago Miret, Emmanuel Bengio

    Abstract: We study the problem of generating diverse candidates in the context of Multi-Objective Optimization. In many applications of machine learning such as drug discovery and material design, the goal is to generate candidates which simultaneously optimize a set of potentially conflicting objectives. Moreover, these objectives are often imperfect evaluations of some underlying property of interest, mak… ▽ More

    Submitted 17 July, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 23 pages, 8 figures. ICML 2023. Code at: https://github.com/GFNOrg/multi-objective-gfn

  38. arXiv:2210.07241  [pdf, other

    cs.LG cs.RO

    Visual Reinforcement Learning with Self-Supervised 3D Representations

    Authors: Yanjie Ze, Nicklas Hansen, Yinbo Chen, Mohit Jain, Xiaolong Wang

    Abstract: A prominent approach to visual Reinforcement Learning (RL) is to learn an internal state representation using self-supervised methods, which has the potential benefit of improved sample-efficiency and generalization through additional learning signal and inductive biases. However, while the real world is inherently 3D, prior efforts have largely been focused on leveraging 2D computer vision techni… ▽ More

    Submitted 15 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted in RA-L 2023 and IROS 2023. Project page: https://yanjieze.com/3d4rl/

  39. arXiv:2209.13518  [pdf

    q-bio.BM cs.AI cs.LG

    Graph-Based Active Machine Learning Method for Diverse and Novel Antimicrobial Peptides Generation and Selection

    Authors: Bonaventure F. P. Dossou, Dianbo Liu, Xu Ji, Moksh Jain, Almer M. van der Sloot, Roger Palou, Michael Tyers, Yoshua Bengio

    Abstract: As antibiotic-resistant bacterial strains are rapidly spreading worldwide, infections caused by these strains are emerging as a global crisis causing the death of millions of people every year. Antimicrobial Peptides (AMPs) are one of the candidates to tackle this problem because of their potential diversity, and ability to favorably modulate the host immune response. However, large-scale screenin… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: Under Review at Sciences Advances

  40. arXiv:2209.12782  [pdf, other

    cs.LG stat.ML

    Learning GFlowNets from partial episodes for improved convergence and stability

    Authors: Kanika Madan, Jarrid Rector-Brooks, Maksym Korablyov, Emmanuel Bengio, Moksh Jain, Andrei Nica, Tom Bosc, Yoshua Bengio, Nikolay Malkin

    Abstract: Generative flow networks (GFlowNets) are a family of algorithms for training a sequential sampler of discrete objects under an unnormalized target density and have been successfully used for various probabilistic modeling tasks. Existing training objectives for GFlowNets are either local to states or transitions, or propagate a reward signal over an entire sampling trajectory. We argue that these… ▽ More

    Submitted 3 June, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: ICML 2023

  41. arXiv:2208.05552  [pdf, other

    cs.HC cs.CV

    Towards Automating Retinoscopy for Refractive Error Diagnosis

    Authors: Aditya Aggarwal, Siddhartha Gairola, Uddeshya Upadhyay, Akshay P Vasishta, Diwakar Rao, Aditya Goyal, Kaushik Murali, Nipun Kwatra, Mohit Jain

    Abstract: Refractive error is the most common eye disorder and is the key cause behind correctable visual impairment, responsible for nearly 80% of the visual impairment in the US. Refractive error can be diagnosed using multiple methods, including subjective refraction, retinoscopy, and autorefractors. Although subjective refraction is the gold standard, it requires cooperation from the patient and hence i… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: This paper is accepted for publication in IMWUT 2022

  42. arXiv:2206.13395  [pdf, other

    cs.CV cs.AI

    Gait Cycle Reconstruction and Human Identification from Occluded Sequences

    Authors: Abhishek Paul, Manav Mukesh Jain, **esh Jain, Pratik Chattopadhyay

    Abstract: Gait-based person identification from videos captured at surveillance sites using Computer Vision-based techniques is quite challenging since these walking sequences are usually corrupted with occlusion, and a complete cycle of gait is not always available. In this work, we propose an effective neural network-based model to reconstruct the occluded frames in an input sequence before carrying out g… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  43. arXiv:2205.03702  [pdf, other

    eess.IV cs.CV cs.CY

    Keratoconus Classifier for Smartphone-based Corneal Topographer

    Authors: Siddhartha Gairola, Pallavi Joshi, Anand Balasubramaniam, Kaushik Murali, Nipun Kwatra, Mohit Jain

    Abstract: Keratoconus is a severe eye disease that leads to deformation of the cornea. It impacts people aged 10-25 years and is the leading cause of blindness in that demography. Corneal topography is the gold standard for keratoconus diagnosis. It is a non-invasive process performed using expensive and bulky medical devices called corneal topographers. This makes it inaccessible to large populations, espe… ▽ More

    Submitted 7 May, 2022; originally announced May 2022.

    Comments: 4 pages

  44. arXiv:2204.11115  [pdf

    cs.LG

    Time Series Forecasting (TSF) Using Various Deep Learning Models

    Authors: Jimeng Shi, Mahek Jain, Giri Narasimhan

    Abstract: Time Series Forecasting (TSF) is used to predict the target variables at a future time point based on the learning from previous time points. To keep the problem tractable, learning methods use data from a fixed length window in the past as an explicit input. In this paper, we study how the performance of predictive models change as a function of different look-back window sizes and different amou… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: 9 pages

  45. arXiv:2204.02790  [pdf, other

    cs.CY cs.CL

    Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic?

    Authors: Ishani Mondal, Kabir Ahuja, Mohit Jain, Jacki O Neil, Kalika Bali, Monojit Choudhury

    Abstract: The COVID-19 pandemic has brought out both the best and worst of language technology (LT). On one hand, conversational agents for information dissemination and basic diagnosis have seen widespread use, and arguably, had an important role in combating the pandemic. On the other hand, it has also become clear that such technologies are readily available for a handful of languages, and the vast major… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: Under Revision

  46. arXiv:2203.04115  [pdf, other

    q-bio.BM cs.LG

    Biological Sequence Design with GFlowNets

    Authors: Moksh Jain, Emmanuel Bengio, Alex-Hernandez Garcia, Jarrid Rector-Brooks, Bonaventure F. P. Dossou, Chanakya Ekbote, Jie Fu, Tianyu Zhang, Micheal Kilgour, Dinghuai Zhang, Lena Simine, Payel Das, Yoshua Bengio

    Abstract: Design of de novo biological sequences with desired properties, like protein and DNA sequences, often involves an active loop with several rounds of molecule ideation and expensive wet-lab evaluations. These experiments can consist of multiple stages, with increasing levels of precision and cost of evaluation, where candidates are filtered. This makes the diversity of proposed candidates a key con… ▽ More

    Submitted 24 May, 2023; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: ICML 2022. 15 pages, 3 figures. Code available at: https://github.com/MJ10/BioSeq-GFN-AL. Updated GFP results

  47. arXiv:2201.13259  [pdf, other

    cs.LG stat.ML

    Trajectory balance: Improved credit assignment in GFlowNets

    Authors: Nikolay Malkin, Moksh Jain, Emmanuel Bengio, Chen Sun, Yoshua Bengio

    Abstract: Generative flow networks (GFlowNets) are a method for learning a stochastic policy for generating compositional objects, such as graphs or strings, from a given unnormalized density by sequences of actions, where many possible action sequences may lead to the same object. We find previously proposed learning objectives for GFlowNets, flow matching and detailed balance, which are analogous to tempo… ▽ More

    Submitted 4 October, 2023; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: NeurIPS 2022; see footnotes for code; v3 fixes minor errata

  48. arXiv:2201.07779  [pdf, other

    cs.RO cs.CV cs.LG

    Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation

    Authors: Rishabh Jangir, Nicklas Hansen, Sambaran Ghosal, Mohit Jain, Xiaolong Wang

    Abstract: Learning to solve precision-based manipulation tasks from visual feedback using Reinforcement Learning (RL) could drastically reduce the engineering efforts required by traditional robot systems. However, performing fine-grained motor control from visual inputs alone is challenging, especially with a static third-person camera as often used in previous work. We propose a setting for robotic manipu… ▽ More

    Submitted 20 January, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Accepted in Robotics and Automation Letters Journal (RA-L 2022). Website at https://jangirrishabh.github.io/lookcloser .8 Pages

  49. arXiv:2112.08611  [pdf

    cs.SI

    Clickbait in YouTube Prevention, Detection and Analysis of the Bait using Ensemble Learning

    Authors: Peya Mowar, Mini Jain, Ruchika Goel, Dinesh Kumar Vishwakarma

    Abstract: Unscrupulous content creators on YouTube employ deceptive techniques such as spam and clickbait to reach a broad audience and trick users into clicking on their videos to increase their advertisement revenue. Clickbait detection on YouTube requires an in depth examination and analysis of the intricate relationship between the video content and video descriptors title and thumbnail. However, the cu… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 26 pages, 16 figures

  50. arXiv:2112.03849  [pdf, ps, other

    cs.CL cs.AI

    Natural Answer Generation: From Factoid Answer to Full-length Answer using Grammar Correction

    Authors: Manas Jain, Sriparna Saha, Pushpak Bhattacharyya, Gladvin Chinnadurai, Manish Kumar Vatsa

    Abstract: Question Answering systems these days typically use template-based language generation. Though adequate for a domain-specific task, these systems are too restrictive and predefined for domain-independent systems. This paper proposes a system that outputs a full-length answer given a question and the extracted factoid answer (short spans such as named entities) as the input. Our system uses constit… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.