Skip to main content

Showing 1–50 of 87 results for author: Weinberger, K Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16034  [pdf, other

    cs.CV

    DiffuBox: Refining 3D Object Detection with Point Diffusion

    Authors: Xiangyu Chen, Zhenzhen Liu, Katie Z Luo, Siddhartha Datta, Adhitya Polavaram, Yan Wang, Yurong You, Boyi Li, Marco Pavone, Wei-Lun Chao, Mark Campbell, Bharath Hariharan, Kilian Q. Weinberger

    Abstract: Ensuring robust 3D object detection and localization is crucial for many applications in robotics and autonomous driving. Recent models, however, face difficulties in maintaining high performance when applied to domains with differing sensor setups or geographic locations, often resulting in poor localization accuracy due to domain shift. To overcome this challenge, we introduce a novel diffusion-… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2404.05139  [pdf, other

    cs.CV cs.RO

    Better Monocular 3D Detectors with LiDAR from the Past

    Authors: Yurong You, Cheng Perng Phoo, Carlos Andres Diaz-Ruiz, Katie Z Luo, Wei-Lun Chao, Mark Campbell, Bharath Hariharan, Kilian Q Weinberger

    Abstract: Accurate 3D object detection is crucial to autonomous driving. Though LiDAR-based detectors have achieved impressive performance, the high cost of LiDAR sensors precludes their widespread adoption in affordable vehicles. Camera-based detectors are cheaper alternatives but often suffer inferior performance compared to their LiDAR-based counterparts due to inherent depth ambiguities in images. In th… ▽ More

    Submitted 9 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by ICRA 2024. The code can be found at https://github.com/YurongYou/AsyncDepth

  3. arXiv:2403.18120  [pdf, other

    cs.AI cs.CL cs.LG

    Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization

    Authors: ** Peng Zhou, Charles Staats, Wenda Li, Christian Szegedy, Kilian Q. Weinberger, Yuhuai Wu

    Abstract: Large language models (LLM), such as Google's Minerva and OpenAI's GPT families, are becoming increasingly capable of solving mathematical quantitative reasoning problems. However, they still make unjustified logical and computational errors in their reasoning steps and answers. In this paper, we leverage the fact that if the training corpus of LLMs contained sufficiently many examples of formal m… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  4. arXiv:2402.03545  [pdf, other

    cs.LG

    Online Feature Updates Improve Online (Generalized) Label Shift Adaptation

    Authors: Ruihan Wu, Siddhartha Datta, Yi Su, Dheeraj Baby, Yu-Xiang Wang, Kilian Q. Weinberger

    Abstract: This paper addresses the prevalent issue of label shift in an online setting with missing labels, where data distributions change over time and obtaining timely labels is challenging. While existing methods primarily focus on adjusting or updating the final layer of a pre-trained classifier, we explore the untapped potential of enhancing feature representations using unlabeled data at test-time. O… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  5. arXiv:2402.03292  [pdf, other

    cs.LG cs.CV

    Zero-shot Object-Level OOD Detection with Context-Aware Inpainting

    Authors: Quang-Huy Nguyen, ** Peng Zhou, Zhenzhen Liu, Khanh-Huyen Bui, Kilian Q. Weinberger, Dung D. Le

    Abstract: Machine learning algorithms are increasingly provided as black-box cloud services or pre-trained models, without access to their training data. This motivates the problem of zero-shot out-of-distribution (OOD) detection. Concretely, we aim to detect OOD objects that do not belong to the classifier's label set but are erroneously classified as in-distribution (ID) objects. Our approach, RONIN, uses… ▽ More

    Submitted 6 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  6. arXiv:2401.02957  [pdf, other

    cs.CV

    Denoising Vision Transformers

    Authors: Jiawei Yang, Katie Z Luo, Jiefeng Li, Kilian Q Weinberger, Yonglong Tian, Yue Wang

    Abstract: We delve into a nuanced but significant challenge inherent to Vision Transformers (ViTs): feature maps of these models exhibit grid-like artifacts, which detrimentally hurt the performance of ViTs in downstream tasks. Our investigations trace this fundamental issue down to the positional embeddings at the input stage. To address this, we propose a novel noise model, which is universally applicable… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Project website: https://jiawei-yang.github.io/DenoisingViT/

  7. arXiv:2311.04079  [pdf, other

    cs.CV

    Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps

    Authors: Katie Z Luo, Xinshuo Weng, Yan Wang, Shuang Wu, Jie Li, Kilian Q Weinberger, Yue Wang, Marco Pavone

    Abstract: Autonomous driving has traditionally relied heavily on costly and labor-intensive High Definition (HD) maps, hindering scalability. In contrast, Standard Definition (SD) maps are more affordable and have worldwide coverage, offering a scalable alternative. In this work, we systematically explore the effect of SD maps for real-time lane-topology understanding. We propose a novel framework to integr… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  8. arXiv:2310.19080  [pdf, other

    cs.CV

    Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery

    Authors: Katie Z Luo, Zhenzhen Liu, Xiangyu Chen, Yurong You, Sagie Benaim, Cheng Perng Phoo, Mark Campbell, Wen Sun, Bharath Hariharan, Kilian Q. Weinberger

    Abstract: Recent advances in machine learning have shown that Reinforcement Learning from Human Feedback (RLHF) can improve machine learning models and align them with human preferences. Although very successful for Large Language Models (LLMs), these advancements have not had a comparable impact in research for autonomous vehicles -- where alignment with human expectations can be imperative. In this paper,… ▽ More

    Submitted 5 November, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

  9. arXiv:2310.16176  [pdf, other

    cs.CL cs.AI

    Correction with Backtracking Reduces Hallucination in Summarization

    Authors: Zhenzhen Liu, Chao Wan, Varsha Kishore, ** Peng Zhou, Minmin Chen, Kilian Q. Weinberger

    Abstract: Abstractive summarization aims at generating natural language summaries of a source document that are succinct while preserving the important elements. Despite recent advances, neural text summarization models are known to be susceptible to hallucinating (or more correctly confabulating), that is to produce summaries with details that are not grounded in the source document. In this paper, we intr… ▽ More

    Submitted 31 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

  10. arXiv:2310.14592  [pdf, other

    cs.CV cs.LG

    Pre-Training LiDAR-Based 3D Object Detectors Through Colorization

    Authors: Tai-Yu Pan, Chenyang Ma, Tianle Chen, Cheng Perng Phoo, Katie Z Luo, Yurong You, Mark Campbell, Kilian Q. Weinberger, Bharath Hariharan, Wei-Lun Chao

    Abstract: Accurate 3D object detection and understanding for self-driving cars heavily relies on LiDAR point clouds, necessitating large amounts of labeled data to train. In this work, we introduce an innovative pre-training approach, Grounded Point Colorization (GPC), to bridge the gap between data and labels by teaching the model to colorize LiDAR point clouds, equip** it with valuable semantic cues. To… ▽ More

    Submitted 25 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  11. arXiv:2309.12140  [pdf, other

    cs.CV cs.AI cs.LG

    Unsupervised Domain Adaptation for Self-Driving from Past Traversal Features

    Authors: Travis Zhang, Katie Luo, Cheng Perng Phoo, Yurong You, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: The rapid development of 3D object detection systems for self-driving cars has significantly improved accuracy. However, these systems struggle to generalize across diverse driving environments, which can lead to safety-critical failures in detecting traffic participants. To address this, we propose a method that utilizes unlabeled repeated traversals of multiple locations to adapt object detector… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  12. arXiv:2307.12425  [pdf, other

    cs.CL

    On the Effectiveness of Offline RL for Dialogue Response Generation

    Authors: Paloma Sodhi, Felix Wu, Ethan R. Elenberg, Kilian Q. Weinberger, Ryan McDonald

    Abstract: A common training technique for language models is teacher forcing (TF). TF attempts to match human language exactly, even though identical meanings can be expressed in different ways. This motivates use of sequence-level objectives for dialogue response generation. In this paper, we study the efficacy of various offline reinforcement learning (RL) methods to maximize such objectives. We present a… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

    Comments: Accepted at ICML 2023. 18 pages, 12 figures. Code available at https://github.com/asappresearch/dialogue-offline-rl

  13. arXiv:2307.10323  [pdf, other

    cs.IR cs.CL cs.LG

    IncDSI: Incrementally Updatable Document Retrieval

    Authors: Varsha Kishore, Chao Wan, Justin Lovelace, Yoav Artzi, Kilian Q. Weinberger

    Abstract: Differentiable Search Index is a recently proposed paradigm for document retrieval, that encodes information about a corpus of documents within the parameters of a neural network and directly maps queries to corresponding documents. These models have achieved state-of-the-art performances for document retrieval across many benchmarks. These kinds of models have a significant limitation: it is not… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  14. arXiv:2303.16206  [pdf, other

    eess.IV cs.CV cs.MM

    Learning Iterative Neural Optimizers for Image Steganography

    Authors: Xiangyu Chen, Varsha Kishore, Kilian Q Weinberger

    Abstract: Image steganography is the process of concealing secret information in images through imperceptible changes. Recent work has formulated this task as a classic constrained optimization problem. In this paper, we argue that image steganography is inherently performed on the (elusive) manifold of natural images, and propose an iterative neural network trained to perform the optimization steps. In con… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: International Conference on Learning Representations (ICLR) 2023

  15. arXiv:2303.15286  [pdf, other

    cs.CV cs.LG

    Unsupervised Adaptation from Repeated Traversals for Autonomous Driving

    Authors: Yurong You, Cheng Perng Phoo, Katie Z Luo, Travis Zhang, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: For a self-driving car to operate reliably, its perceptual system must generalize to the end-user's environment -- ideally without additional annotation efforts. One potential solution is to leverage unlabeled data (e.g., unlabeled LiDAR point clouds) collected from the end-users' environments (i.e. target domain) to adapt the system to the difference between training and testing environments. Whi… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted by NeurIPS 2022. Code is available at https://github.com/YurongYou/Rote-DA

  16. arXiv:2302.10326  [pdf, other

    cs.CV cs.LG

    Unsupervised Out-of-Distribution Detection with Diffusion Inpainting

    Authors: Zhenzhen Liu, ** Peng Zhou, Yufan Wang, Kilian Q. Weinberger

    Abstract: Unsupervised out-of-distribution detection (OOD) seeks to identify out-of-domain data by learning only from unlabeled in-domain data. We present a novel approach for this task - Lift, Map, Detect (LMD) - that leverages recent advancement in diffusion models. Diffusion models are one type of generative models. At their core, they learn an iterative denoising process that gradually maps a noisy imag… ▽ More

    Submitted 16 August, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  17. arXiv:2212.10564  [pdf, other

    cs.CL cs.AI cs.LG

    Re-evaluating the Need for Multimodal Signals in Unsupervised Grammar Induction

    Authors: Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty, Serge Belongie, Kilian Q. Weinberger, Jitendra Malik, Trevor Darrell, Dan Klein

    Abstract: Are multimodal inputs necessary for grammar induction? Recent work has shown that multimodal training inputs can improve grammar induction. However, these improvements are based on comparisons to weak text-only baselines that were trained on relatively little textual data. To determine whether multimodal inputs are needed in regimes with large amounts of textual training data, we design a stronger… ▽ More

    Submitted 12 April, 2024; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: NAACL Findings 2024

  18. arXiv:2212.09462  [pdf, other

    cs.CL cs.LG

    Latent Diffusion for Language Generation

    Authors: Justin Lovelace, Varsha Kishore, Chao Wan, Eliot Shekhtman, Kilian Q. Weinberger

    Abstract: Diffusion models have achieved great success in modeling continuous data modalities such as images, audio, and video, but have seen limited use in discrete domains such as language. Recent attempts to adapt diffusion to language have presented diffusion as an alternative to existing pretrained language models. We view diffusion and existing language models as complementary. We demonstrate that enc… ▽ More

    Submitted 7 November, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023

  19. arXiv:2210.10880  [pdf, other

    cs.LG cs.CR

    Learning to Invert: Simple Adaptive Attacks for Gradient Inversion in Federated Learning

    Authors: Ruihan Wu, Xiangyu Chen, Chuan Guo, Kilian Q. Weinberger

    Abstract: Gradient inversion attack enables recovery of training samples from model gradients in federated learning (FL), and constitutes a serious threat to data privacy. To mitigate this vulnerability, prior work proposed both principled defenses based on differential privacy, as well as heuristic defenses based on gradient compression as countermeasures. These defenses have so far been very effective, in… ▽ More

    Submitted 9 June, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  20. arXiv:2209.11673  [pdf, other

    cs.CV cs.RO

    Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs

    Authors: Youya Xia, Josephine Monica, Wei-Lun Chao, Bharath Hariharan, Kilian Q Weinberger, Mark Campbell

    Abstract: A self-driving car must be able to reliably handle adverse weather conditions (e.g., snowy) to operate safely. In this paper, we investigate the idea of turning sensor inputs (i.e., images) captured in an adverse condition into a benign one (i.e., sunny), upon which the downstream tasks (e.g., semantic segmentation) can attain high accuracy. Prior work primarily formulates this as an unpaired imag… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: Submitted to the International Conference on Robotics and Automation (ICRA) 2023

  21. arXiv:2208.01166  [pdf, other

    cs.CV

    Ithaca365: Dataset and Driving Perception under Repeated and Challenging Weather Conditions

    Authors: Carlos A. Diaz-Ruiz, Youya Xia, Yurong You, Jose Nino, Junan Chen, Josephine Monica, Xiangyu Chen, Katie Luo, Yan Wang, Marc Emond, Wei-Lun Chao, Bharath Hariharan, Kilian Q. Weinberger, Mark Campbell

    Abstract: Advances in perception for self-driving cars have accelerated in recent years due to the availability of large-scale datasets, typically collected at specific locations and under nice weather conditions. Yet, to achieve the high safety requirement, these perceptual systems must operate robustly under a wide variety of weather conditions including snow and rain. In this paper, we present a new data… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted by CVPR 2022

  22. arXiv:2206.07998  [pdf, other

    cs.CR cs.LG

    Differentially Private Multi-Party Data Release for Linear Regression

    Authors: Ruihan Wu, Xin Yang, Yuanshun Yao, Jiankai Sun, Tianyi Liu, Kilian Q. Weinberger, Chong Wang

    Abstract: Differentially Private (DP) data release is a promising technique to disseminate data without compromising the privacy of data subjects. However the majority of prior work has focused on scenarios where a single party owns all the data. In this paper we focus on the multi-party setting, where different stakeholders own disjoint sets of attributes belonging to the same group of data subjects. Withi… ▽ More

    Submitted 18 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: UAI 2022

  23. arXiv:2205.07352  [pdf, other

    cs.CL cs.AI

    Long-term Control for Dialogue Generation: Methods and Evaluation

    Authors: Ramya Ramakrishnan, Hashan Buddhika Narangodage, Mauro Schilman, Kilian Q. Weinberger, Ryan McDonald

    Abstract: Current approaches for controlling dialogue response generation are primarily focused on high-level attributes like style, sentiment, or topic. In this work, we focus on constrained long-term dialogue generation, which involves more fine-grained control and requires a given set of control words to appear in generated responses. This setting requires a model to not only consider the generation of t… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

  24. arXiv:2205.01086  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages

    Authors: Felix Wu, Kwangyoun Kim, Shinji Watanabe, Kyu Han, Ryan McDonald, Kilian Q. Weinberger, Yoav Artzi

    Abstract: We introduce Wav2Seq, the first self-supervised approach to pre-train both parts of encoder-decoder models for speech data. We induce a pseudo language as a compact discrete representation, and formulate a self-supervised pseudo speech recognition task -- transcribing audio inputs into pseudo subword sequences. This process stands on its own, or can be applied as low-cost second-stage pre-training… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: Code available at https://github.com/asappresearch/wav2seq

  25. arXiv:2203.15882  [pdf, other

    cs.CV

    Learning to Detect Mobile Objects from LiDAR Scans Without Labels

    Authors: Yurong You, Katie Z Luo, Cheng Perng Phoo, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: Current 3D object detectors for autonomous driving are almost entirely trained on human-annotated data. Although of high quality, the generation of such data is laborious and costly, restricting them to a few specific locations and object types. This paper proposes an alternative approach entirely based on unlabeled data, which can be collected cheaply and in abundance almost everywhere on earth.… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022. Code is available at https://github.com/YurongYou/MODEST

  26. arXiv:2203.11405  [pdf, other

    cs.CV

    Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception

    Authors: Yurong You, Katie Z Luo, Xiangyu Chen, Junan Chen, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: Self-driving cars must detect vehicles, pedestrians, and other traffic participants accurately to operate safely. Small, far-away, or highly occluded objects are particularly challenging because there is limited information in the LiDAR point clouds for detecting them. To address this challenge, we leverage valuable information from the past: in particular, data collected in past traversals of the… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted by ICLR 2022. Code is available at https://github.com/YurongYou/Hindsight

  27. arXiv:2202.12968  [pdf, other

    cs.LG

    Does Label Differential Privacy Prevent Label Inference Attacks?

    Authors: Ruihan Wu, ** Peng Zhou, Kilian Q. Weinberger, Chuan Guo

    Abstract: Label differential privacy (label-DP) is a popular framework for training private ML models on datasets with public features and sensitive private labels. Despite its rigorous privacy guarantee, it has been observed that in practice label-DP does not preclude label inference attacks (LIAs): Models trained with label-DP can be evaluated on the public training features to recover, with high accuracy… ▽ More

    Submitted 3 June, 2023; v1 submitted 25 February, 2022; originally announced February 2022.

  28. arXiv:2201.03546  [pdf, other

    cs.CV cs.CL cs.LG

    Language-driven Semantic Segmentation

    Authors: Boyi Li, Kilian Q. Weinberger, Serge Belongie, Vladlen Koltun, René Ranftl

    Abstract: We present LSeg, a novel model for language-driven semantic image segmentation. LSeg uses a text encoder to compute embeddings of descriptive input labels (e.g., "grass" or "building") together with a transformer-based image encoder that computes dense per-pixel embeddings of the input image. The image encoder is trained with a contrastive objective to align pixel embeddings to the text embedding… ▽ More

    Submitted 2 April, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: ICLR 2022

  29. arXiv:2112.10789  [pdf, other

    quant-ph cond-mat.quant-gas cond-mat.str-el cs.LG

    Machine learning discovery of new phases in programmable quantum simulator snapshots

    Authors: Cole Miles, Rhine Samajdar, Sepehr Ebadi, Tout T. Wang, Hannes Pichler, Subir Sachdev, Mikhail D. Lukin, Markus Greiner, Kilian Q. Weinberger, Eun-Ah Kim

    Abstract: Machine learning has recently emerged as a promising approach for studying complex phenomena characterized by rich datasets. In particular, data-centric approaches lend to the possibility of automatically discovering structures in experimental datasets that manual inspection may miss. Here, we introduce an interpretable unsupervised-supervised hybrid machine learning approach, the hybrid-correlati… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: 9 pages, 5 figures + 12 pages, 10 figures appendix

    Journal ref: Physics Review Research 5, 013026 (2023)

  30. arXiv:2110.11222  [pdf, other

    cs.LG cs.AI

    Is High Variance Unavoidable in RL? A Case Study in Continuous Control

    Authors: Johan Bjorck, Carla P. Gomes, Kilian Q. Weinberger

    Abstract: Reinforcement learning (RL) experiments have notoriously high variance, and minor details can have disproportionately large effects on measured outcomes. This is problematic for creating reproducible research and also serves as an obstacle for real-world applications, where safety and predictability are paramount. In this paper, we investigate causes for this perceived instability. To allow for an… ▽ More

    Submitted 5 February, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: Accepted to ICLR2022

  31. arXiv:2109.06870  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition

    Authors: Felix Wu, Kwangyoun Kim, **g Pan, Kyu Han, Kilian Q. Weinberger, Yoav Artzi

    Abstract: This paper is a study of performance-efficiency trade-offs in pre-trained models for automatic speech recognition (ASR). We focus on wav2vec 2.0, and formalize several architecture designs that influence both the model performance and its efficiency. Putting together all our observations, we introduce SEW (Squeezed and Efficient Wav2vec), a pre-trained model architecture with significant improveme… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Code available at https://github.com/asappresearch/sew

  32. arXiv:2107.04520  [pdf, other

    cs.LG stat.ML

    Online Adaptation to Label Distribution Shift

    Authors: Ruihan Wu, Chuan Guo, Yi Su, Kilian Q. Weinberger

    Abstract: Machine learning models often encounter distribution shifts when deployed in the real world. In this paper, we focus on adaptation to label distribution shift in the online setting, where the test-time label distribution is continually changing and the model must dynamically adapt to it without observing the true label. Leveraging a novel analysis, we show that the lack of true label does not hind… ▽ More

    Submitted 5 January, 2022; v1 submitted 9 July, 2021; originally announced July 2021.

  33. arXiv:2106.01151  [pdf, other

    cs.LG

    Towards Deeper Deep Reinforcement Learning with Spectral Normalization

    Authors: Johan Bjorck, Carla P. Gomes, Kilian Q. Weinberger

    Abstract: In computer vision and natural language processing, innovations in model architecture that increase model capacity have reliably translated into gains in performance. In stark contrast with this trend, state-of-the-art reinforcement learning (RL) algorithms often use small MLPs, and gains in performance typically originate from algorithmic innovations. It is natural to hypothesize that small datas… ▽ More

    Submitted 3 January, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: accepted NeurIPS 2021

  34. arXiv:2103.14198  [pdf, other

    cs.CV

    Exploiting Playbacks in Unsupervised Domain Adaptation for 3D Object Detection

    Authors: Yurong You, Carlos Andres Diaz-Ruiz, Yan Wang, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q Weinberger

    Abstract: Self-driving cars must detect other vehicles and pedestrians in 3D to plan safe routes and avoid collisions. State-of-the-art 3D object detectors, based on deep learning, have shown promising accuracy but are prone to over-fit to domain idiosyncrasies, making them fail in new environments -- a serious problem if autonomous vehicles are meant to operate freely. In this paper, we propose a novel lea… ▽ More

    Submitted 10 July, 2022; v1 submitted 25 March, 2021; originally announced March 2021.

    Comments: Accepted by ICRA 2022

  35. arXiv:2102.13565  [pdf, other

    cs.LG

    Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision

    Authors: Johan Bjorck, Xiangyu Chen, Christopher De Sa, Carla P. Gomes, Kilian Q. Weinberger

    Abstract: Low-precision training has become a popular approach to reduce compute requirements, memory footprint, and energy consumption in supervised learning. In contrast, this promising approach has not yet enjoyed similarly widespread adoption within the reinforcement learning (RL) community, partly because RL agents can be notoriously hard to train even in full precision. In this paper we consider conti… ▽ More

    Submitted 3 June, 2021; v1 submitted 26 February, 2021; originally announced February 2021.

  36. arXiv:2102.06020  [pdf, other

    cs.CR cs.GT cs.LG

    Making Paper Reviewing Robust to Bid Manipulation Attacks

    Authors: Ruihan Wu, Chuan Guo, Felix Wu, Rahul Kidambi, Laurens van der Maaten, Kilian Q. Weinberger

    Abstract: Most computer science conferences rely on paper bidding to assign reviewers to papers. Although paper bidding enables high-quality assignments in days of unprecedented submission numbers, it also opens the door for dishonest reviewers to adversarially influence paper reviewing assignments. Anecdotal evidence suggests that some reviewers bid on papers by "friends" or colluding authors, even though… ▽ More

    Submitted 22 February, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

  37. arXiv:2011.03474  [pdf, other

    cond-mat.str-el cond-mat.dis-nn cond-mat.quant-gas cs.LG physics.comp-ph

    Correlator Convolutional Neural Networks: An Interpretable Architecture for Image-like Quantum Matter Data

    Authors: Cole Miles, Annabelle Bohrdt, Ruihan Wu, Christie Chiu, Muqing Xu, Geoffrey Ji, Markus Greiner, Kilian Q. Weinberger, Eugene Demler, Eun-Ah Kim

    Abstract: Machine learning models are a powerful theoretical tool for analyzing data from quantum simulators, in which results of experiments are sets of snapshots of many-body states. Recently, they have been successfully applied to distinguish between snapshots that can not be identified using traditional one and two point correlation functions. Thus far, the complexity of these models has inhibited new p… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: 7 pages, 4 figures + 13 pages of supplemental material

  38. arXiv:2007.12684  [pdf, other

    cs.CV cs.LG

    Deep Co-Training with Task Decomposition for Semi-Supervised Domain Adaptation

    Authors: Luyu Yang, Yan Wang, Mingfei Gao, Abhinav Shrivastava, Kilian Q. Weinberger, Wei-Lun Chao, Ser-Nam Lim

    Abstract: Semi-supervised domain adaptation (SSDA) aims to adapt models trained from a labeled source domain to a different but related target domain, from which unlabeled data and a small set of labeled data are provided. Current methods that treat source and target supervision without distinction overlook their inherent discrepancy, resulting in a source-dominated model that has not effectively used the t… ▽ More

    Submitted 22 September, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: accepted to ICCV 2021

  39. arXiv:2007.03085  [pdf, other

    cs.CV cs.LG

    Wasserstein Distances for Stereo Disparity Estimation

    Authors: Divyansh Garg, Yan Wang, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

    Abstract: Existing approaches to depth or disparity estimation output a distribution over a set of pre-defined discrete values. This leads to inaccurate results when the true depth or disparity does not match any of these values. The fact that this distribution is usually learned indirectly through a regression loss causes further problems in ambiguous regions around object boundaries. We address these issu… ▽ More

    Submitted 29 March, 2021; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: Accepted to NeurIPS 2020 (spotlight)

  40. arXiv:2006.05987  [pdf, other

    cs.CL cs.LG

    Revisiting Few-sample BERT Fine-tuning

    Authors: Tianyi Zhang, Felix Wu, Arzoo Katiyar, Kilian Q. Weinberger, Yoav Artzi

    Abstract: This paper is a study of fine-tuning of BERT contextual representations, with focus on commonly observed instabilities in few-sample scenarios. We identify several factors that cause this instability: the common use of a non-standard optimization method with biased gradient estimation; the limited applicability of significant parts of the BERT network for down-stream tasks; and the prevalent pract… ▽ More

    Submitted 11 March, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Code available at https://github.com/asappresearch/revisit-bert-finetuning

  41. arXiv:2005.08139  [pdf, other

    cs.CV

    Train in Germany, Test in The USA: Making 3D Object Detectors Generalize

    Authors: Yan Wang, Xiangyu Chen, Yurong You, Li Erran, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

    Abstract: In the domain of autonomous driving, deep learning has substantially improved the 3D object detection accuracy for LiDAR and stereo camera data alike. While deep networks are great at generalization, they are also notorious to over-fit to all kinds of spurious artifacts, such as brightness, car sizes and models, that may appear consistently throughout the data. In fact, most datasets for autonomou… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: Accepted to 2020 Conference on Computer Vision and Pattern Recognition (CVPR 2020)

  42. arXiv:2004.03080  [pdf, other

    cs.CV eess.IV

    End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection

    Authors: Rui Qian, Divyansh Garg, Yan Wang, Yurong You, Serge Belongie, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

    Abstract: Reliable and accurate 3D object detection is a necessity for safe autonomous driving. Although LiDAR sensors can provide accurate 3D point cloud estimates of the environment, they are also prohibitively expensive for many settings. Recently, the introduction of pseudo-LiDAR (PL) has led to a drastic reduction in the accuracy gap between methods based on LiDAR sensors and those based on cheap stere… ▽ More

    Submitted 14 May, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: Accepted to 2020 Conference on Computer Vision and Pattern Recognition (CVPR 2020)

  43. arXiv:2002.11102  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    On Feature Normalization and Data Augmentation

    Authors: Boyi Li, Felix Wu, Ser-Nam Lim, Serge Belongie, Kilian Q. Weinberger

    Abstract: The moments (a.k.a., mean and standard deviation) of latent features are often removed as noise when training image recognition models, to increase stability and reduce training time. However, in the field of image generation, the moments play a much more central role. Studies have shown that the moments extracted from instance normalization and positional normalization can roughly capture style a… ▽ More

    Submitted 30 March, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: CVPR 2021. Code is available at https://github.com/Boyiliee/MoEx

  44. arXiv:2002.10078  [pdf, other

    cs.LG stat.ML

    On Hiding Neural Networks Inside Neural Networks

    Authors: Chuan Guo, Ruihan Wu, Kilian Q. Weinberger

    Abstract: Modern neural networks often contain significantly more parameters than the size of their training data. We show that this excess capacity provides an opportunity for embedding secret machine learning models within a trained neural network. Our novel framework hides the existence of a secret neural network with arbitrary desired functionality within a carrier network. We prove theoretically that t… ▽ More

    Submitted 21 May, 2021; v1 submitted 24 February, 2020; originally announced February 2020.

  45. arXiv:2002.00573  [pdf, other

    cs.LG cs.CV stat.ML

    Revisiting Meta-Learning as Supervised Learning

    Authors: Wei-Lun Chao, Han-Jia Ye, De-Chuan Zhan, Mark Campbell, Kilian Q. Weinberger

    Abstract: Recent years have witnessed an abundance of new publications and approaches on meta-learning. This community-wide enthusiasm has sparked great insights but has also created a plethora of seemingly different frameworks, which can be hard to compare and evaluate. In this paper, we aim to provide a principled, unifying framework by revisiting and strengthening the connection between meta-learning and… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.

    Comments: An extended version of the paper titled "A Meta Understanding of Meta-Learning" presented in ICML 2019 Workshop on Adaptive and Multitask Learning: Algorithms & Systems

  46. arXiv:2001.10528  [pdf, other

    cs.LG cs.CV stat.ML

    Identifying Mislabeled Data using the Area Under the Margin Ranking

    Authors: Geoff Pleiss, Tianyi Zhang, Ethan R. Elenberg, Kilian Q. Weinberger

    Abstract: Not all data in a typical training set help with generalization; some samples can be overly ambiguous or outrightly mislabeled. This paper introduces a new method to identify such samples and mitigate their impact when training neural networks. At the heart of our algorithm is the Area Under the Margin (AUM) statistic, which exploits differences in the training dynamics of clean and mislabeled sam… ▽ More

    Submitted 23 December, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: NeurIPS 2020

  47. arXiv:2001.02394  [pdf, other

    cs.LG cs.CV stat.ML

    Convolutional Networks with Dense Connectivity

    Authors: Gao Huang, Zhuang Liu, Geoff Pleiss, Laurens van der Maaten, Kilian Q. Weinberger

    Abstract: Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output. In this paper, we embrace this observation and introduce the Dense Convolutional Network (DenseNet), which connects each layer to every other layer in a feed-forward fashion.Whereas… ▽ More

    Submitted 8 January, 2020; originally announced January 2020.

    Comments: Journal(PAMI) version of DenseNet(CVPR'17)

  48. arXiv:1911.04623  [pdf, other

    cs.CV

    SimpleShot: Revisiting Nearest-Neighbor Classification for Few-Shot Learning

    Authors: Yan Wang, Wei-Lun Chao, Kilian Q. Weinberger, Laurens van der Maaten

    Abstract: Few-shot learners aim to recognize new object classes based on a small number of labeled training examples. To prevent overfitting, state-of-the-art few-shot learners use meta-learning on convolutional-network features and perform classification using a nearest-neighbor classifier. This paper studies the accuracy of nearest-neighbor baselines without meta-learning. Surprisingly, we find simple fea… ▽ More

    Submitted 15 November, 2019; v1 submitted 11 November, 2019; originally announced November 2019.

  49. arXiv:1910.13955  [pdf, other

    eess.IV cs.CV cs.RO

    LDLS: 3-D Object Segmentation Through Label Diffusion From 2-D Images

    Authors: Brian H. Wang, Wei-Lun Chao, Yan Wang, Bharath Hariharan, Kilian Q. Weinberger, Mark Campbell

    Abstract: Object segmentation in three-dimensional (3-D) point clouds is a critical task for robots capable of 3-D perception. Despite the impressive performance of deep learning-based approaches on object segmentation in 2-D images, deep learning has not been applied nearly as successfully for 3-D point cloud segmentation. Deep networks generally require large amounts of labeled training data, which are re… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters with presentation at IROS 2019

  50. arXiv:1910.07629  [pdf, other

    cs.LG cs.CR stat.ML

    A New Defense Against Adversarial Images: Turning a Weakness into a Strength

    Authors: Tao Yu, Shengyuan Hu, Chuan Guo, Wei-Lun Chao, Kilian Q. Weinberger

    Abstract: Natural images are virtually surrounded by low-density misclassified regions that can be efficiently discovered by gradient-guided search --- enabling the generation of adversarial images. While many techniques for detecting these attacks have been proposed, they are easily bypassed when the adversary has full knowledge of the detection mechanism and adapts the attack strategy accordingly. In this… ▽ More

    Submitted 3 December, 2019; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019, 14 pages