Showing 1–9 of 9 results for author: Yala, A

  1. arXiv:2401.14391

    cs.CV

    Rethinking Patch Dependence for Masked Autoencoders

    Authors: Letian Fu, Long Lian, Renhao Wang, Baifeng Shi, Xudong Wang, Adam Yala, Trevor Darrell, Alexei A. Efros, Ken Goldberg

    Abstract: In this work, we re-examine inter-patch dependencies in the decoding mechanism of masked autoencoders (MAE). We decompose this decoding mechanism for masked patch reconstruction in MAE into self-attention and cross-attention. Our investigations suggest that self-attention between mask patches is not essential for learning good representations. To this end, we propose a novel pretraining framework:…

    Submitted 25 January, 2024; originally announced January 2024.

  2. arXiv:2309.17444

    cs.CV cs.AI cs.CL

    LLM-grounded Video Diffusion Models

    Authors: Long Lian, Baifeng Shi, Adam Yala, Trevor Darrell, Boyi Li

    Abstract: Text-conditioned diffusion models have emerged as a promising tool for neural video generation. However, current models still struggle with intricate spatiotemporal prompts and often generate restricted or incorrect motion. To address these limitations, we introduce LLM-grounded Video Diffusion (LVD). Instead of directly generating videos from the text inputs, LVD first leverages a large language…

    Submitted 4 May, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: ICLR 2024. Project Page: https://llm-grounded-video-diffusion.github.io/

  3. arXiv:2306.10193

    cs.CL cs.LG

    Conformal Language Modeling

    Authors: Victor Quach, Adam Fisch, Tal Schuster, Adam Yala, Jae Ho Sohn, Tommi S. Jaakkola, Regina Barzilay

    Abstract: We propose a novel approach to conformal prediction for generative language models (LMs). Standard conformal prediction produces prediction sets -- in place of single predictions -- that have rigorous, statistical performance guarantees. LM responses are typically sampled from the model's predicted distribution over the large, combinatorial output space of natural language. Translating this proces…

    Submitted 1 June, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: ICLR 2024

  4. arXiv:2305.13655

    cs.CV

    LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

    Authors: Long Lian, Boyi Li, Adam Yala, Trevor Darrell

    Abstract: Recent advancements in text-to-image diffusion models have yielded impressive results in generating realistic and diverse images. However, these models still struggle with complex prompts, such as those that involve numeracy and spatial reasoning. This work proposes to enhance prompt understanding capabilities in diffusion models. Our method leverages a pretrained large language model (LLM) for gr…

    Submitted 4 March, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Transactions on Machine Learning Research (TMLR) 2024, with Featured Certification

  5. arXiv:2304.00047

    cs.LG cs.CR cs.IT

    PEOPL: Characterizing Privately Encoded Open Datasets with Public Labels

    Authors: Homa Esfahanizadeh, Adam Yala, Rafael G. L. D'Oliveira, Andrea J. D. Jaba, Victor Quach, Ken R. Duffy, Tommi S. Jaakkola, Vinod Vaikuntanathan, Manya Ghobadi, Regina Barzilay, Muriel Médard

    Abstract: Allowing organizations to share their data for training of machine learning (ML) models without unintended information leakage is an open problem in practice. A promising technique for this still-open problem is to train models on the encoded data. Our approach, called Privately Encoded Open Datasets with Public Labels (PEOPL), uses a certain class of randomly constructed transforms to encode sens…

    Submitted 31 March, 2023; originally announced April 2023.

    Comments: Submitted to IEEE Transactions on Information Forensics and Security

  6. arXiv:2203.16452

    cs.LG stat.ML

    AI Gone Astray: Technical Supplement

    Authors: Janice Yang, Ludvig Karstens, Casey Ross, Adam Yala

    Abstract: This study is a technical supplement to "AI gone astray: How subtle shifts in patient data send popular algorithms reeling, undermining patient safety." from STAT News, which investigates the effect of time drift on clinically deployed machine learning models. We use MIMIC-IV, a publicly available dataset, to train models that replicate commercial approaches by Dascena and Epic to predict the onse…

    Submitted 28 February, 2022; originally announced March 2022.

  7. arXiv:2201.12406

    cs.LG cs.CR cs.CV

    Syfer: Neural Obfuscation for Private Data Release

    Authors: Adam Yala, Victor Quach, Homa Esfahanizadeh, Rafael G. L. D'Oliveira, Ken R. Duffy, Muriel Médard, Tommi S. Jaakkola, Regina Barzilay

    Abstract: Balancing privacy and predictive utility remains a central challenge for machine learning in healthcare. In this paper, we develop Syfer, a neural obfuscation method to protect against re-identification attacks. Syfer composes trained layers with random neural networks to encode the original data (e.g. X-rays) while maintaining the ability to predict diagnoses from the encoded data. The randomness…

    Submitted 28 January, 2022; originally announced January 2022.

  8. arXiv:2106.02484

    cs.CR cs.AI

    NeuraCrypt: Hiding Private Health Data via Random Neural Networks for Public Training

    Authors: Adam Yala, Homa Esfahanizadeh, Rafael G. L. D'Oliveira, Ken R. Duffy, Manya Ghobadi, Tommi S. Jaakkola, Vinod Vaikuntanathan, Regina Barzilay, Muriel Médard

    Abstract: Balancing the needs of data privacy and predictive utility is a central challenge for machine learning in healthcare. In particular, privacy concerns have led to a dearth of public datasets, complicated the construction of multi-hospital cohorts and limited the utilization of external machine learning resources. To remedy this, new methods are required to enable data owners, such as hospitals, to…

    Submitted 4 June, 2021; originally announced June 2021.

  9. arXiv:1603.07954

    cs.CL

    Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning

    Authors: Karthik Narasimhan, Adam Yala, Regina Barzilay

    Abstract: Most successful information extraction systems operate with access to a large collection of documents. In this work, we explore the task of acquiring and incorporating external evidence to improve extraction accuracy in domains where the amount of training data is scarce. This process entails issuing search queries, extraction from new sources and reconciliation of extracted values, which are repe…

    Submitted 27 September, 2016; v1 submitted 25 March, 2016; originally announced March 2016.

    Comments: Appearing in EMNLP 2016 (12 pages incl. supplementary material)