Showing 1–13 of 13 results for author: Mu, N

  1. arXiv:2404.19531

    cs.CV

    MoST: Multi-modality Scene Tokenization for Motion Prediction

    Authors: Norman Mu, Jingwei Ji, Zhenpei Yang, Nate Harada, Haotian Tang, Kan Chen, Charles R. Qi, Runzhou Ge, Kratarth Goel, Zoey Yang, Scott Ettinger, Rami Al-Rfou, Dragomir Anguelov, Yin Zhou

    Abstract: Many existing motion prediction approaches rely on symbolic perception outputs to generate agent trajectories, such as bounding boxes, road graph information and traffic lights. This symbolic representation is a high-level abstraction of the real world, which may render the motion prediction model vulnerable to perception errors (e.g., failures in detecting open-vocabulary obstacles) while missing…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  2. arXiv:2402.12617

    cs.CR cs.AI cs.CL cs.CY cs.LG

    Generative AI Security: Challenges and Countermeasures

    Authors: Banghua Zhu, Norman Mu, Jiantao Jiao, David Wagner

    Abstract: Generative AI's expanding footprint across numerous industries has led to both excitement and increased scrutiny. This paper delves into the unique security challenges posed by Generative AI, and outlines potential research directions for managing these risks.

    Submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2402.09674

    cs.CL cs.AI cs.CR cs.LG

    PAL: Proxy-Guided Black-Box Attack on Large Language Models

    Authors: Chawin Sitawarin, Norman Mu, David Wagner, Alexandre Araujo

    Abstract: Large Language Models (LLMs) have surged in popularity in recent months, but they have demonstrated concerning capabilities to generate harmful content when manipulated. While techniques like safety fine-tuning aim to minimize harmful use, recent works have shown that LLMs remain vulnerable to attacks that elicit toxic responses. In this work, we introduce the Proxy-Guided Attack on LLMs (PAL), th…

    Submitted 14 February, 2024; originally announced February 2024.

  4. arXiv:2402.04249

    cs.LG cs.AI cs.CL cs.CV

    HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

    Authors: Mantas Mazeika, Long Phan, Xuwang Yin, Andy Zou, Zifan Wang, Norman Mu, Elham Sakhaee, Nathaniel Li, Steven Basart, Bo Li, David Forsyth, Dan Hendrycks

    Abstract: Automated red teaming holds substantial promise for uncovering and mitigating the risks associated with the malicious use of large language models (LLMs), yet the field lacks a standardized evaluation framework to rigorously assess new methods. To address this issue, we introduce HarmBench, a standardized evaluation framework for automated red teaming. We identify several desirable properties prev…

    Submitted 26 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: Website: https://www.harmbench.org

  5. arXiv:2312.00273

    cs.CR cs.AI cs.CL

    Mark My Words: Analyzing and Evaluating Language Model Watermarks

    Authors: Julien Piet, Chawin Sitawarin, Vivian Fang, Norman Mu, David Wagner

    Abstract: The capabilities of large language models have grown significantly in recent years and so too have concerns about their misuse. In this context, the ability to distinguish machine-generated text from human-authored content becomes important. Prior works have proposed numerous schemes to watermark text, which would benefit from a systematic evaluation framework. This work focuses on text watermarki…

    Submitted 6 December, 2023; v1 submitted 30 November, 2023; originally announced December 2023.

    Comments: 18 pages, 11 figures

  6. arXiv:2311.04235

    cs.AI cs.CL cs.LG

    Can LLMs Follow Simple Rules?

    Authors: Norman Mu, Sarah Chen, Zifan Wang, Sizhe Chen, David Karamardian, Lulwa Aljeraisy, Basel Alomair, Dan Hendrycks, David Wagner

    Abstract: As Large Language Models (LLMs) are deployed with increasing real-world responsibilities, it is important to be able to specify and constrain the behavior of these systems in a reliable manner. Model developers may wish to set explicit rules for the model, such as "do not generate abusive content", but these may be circumvented by jailbreaking techniques. Existing evaluations of adversarial attack…

    Submitted 8 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Project website: https://eecs.berkeley.edu/~normanmu/llm_rules; revised content

  7. arXiv:2212.02064

    cs.AI

    E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance

    Authors: Can Chang, Ni Mu, Jiajun Wu, Ling Pan, Huazhe Xu

    Abstract: A critical challenge in multi-agent reinforcement learning (MARL) is for multiple agents to efficiently accomplish complex, long-horizon tasks. The agents often have difficulties in cooperating on common goals, dividing complex tasks, and planning through several stages to make progress. We propose to address these challenges by guiding agents with programs designed for parallelization, since progr…

    Submitted 5 December, 2022; originally announced December 2022.

  8. arXiv:2211.09533

    eess.IV cs.CV

    Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation

    Authors: Yiyue Hu, Lei Zhang, Nan Mu, Lei Liu

    Abstract: Transformers have achieved remarkable success in medical image analysis owing to their powerful capability to use flexible self-attention mechanism. However, due to lacking intrinsic inductive bias in modeling visual structural information, they generally require a large-scale pre-training schedule, limiting the clinical applications over expensive small-scale medical data. To this end, we propose…

    Submitted 17 November, 2022; originally announced November 2022.

  9. arXiv:2112.12750

    cs.CV

    SLIP: Self-supervision meets Language-Image Pre-training

    Authors: Norman Mu, Alexander Kirillov, David Wagner, Saining Xie

    Abstract: Recent work has shown that self-supervised pre-training leads to improvements over supervised learning on challenging visual recognition tasks. CLIP, an exciting new approach to learning with language supervision, demonstrates promising performance on a wide variety of benchmarks. In this work, we explore whether self-supervised learning can aid in the use of language supervision for visual repres…

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: Code: https://github.com/facebookresearch/SLIP

  10. arXiv:2006.16241

    cs.CV cs.LG stat.ML

    The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization

    Authors: Dan Hendrycks, Steven Basart, Norman Mu, Saurav Kadavath, Frank Wang, Evan Dorundo, Rahul Desai, Tyler Zhu, Samyak Parajuli, Mike Guo, Dawn Song, Jacob Steinhardt, Justin Gilmer

    Abstract: We introduce four new real-world distribution shift datasets consisting of changes in image style, image blurriness, geographic location, camera operation, and more. With our new datasets, we take stock of previously proposed methods for improving out-of-distribution robustness and put them to the test. We find that using larger models and artificial data augmentations can improve robustness on re…

    Submitted 24 July, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: ICCV 2021; Datasets, code, and models available at https://github.com/hendrycks/imagenet-r

  11. arXiv:1912.02781

    stat.ML cs.CV cs.LG

    AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty

    Authors: Dan Hendrycks, Norman Mu, Ekin D. Cubuk, Barret Zoph, Justin Gilmer, Balaji Lakshminarayanan

    Abstract: Modern deep neural networks can achieve high accuracy when the training distribution and test distribution are identically distributed, but this assumption is frequently violated in practice. When the train and test distributions are mismatched, accuracy can plummet. Currently there are few techniques that improve robustness to unforeseen data shifts encountered during deployment. In this work, we…

    Submitted 17 February, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Code available at https://github.com/google-research/augmix

  12. arXiv:1906.02337

    cs.CV cs.LG

    MNIST-C: A Robustness Benchmark for Computer Vision

    Authors: Norman Mu, Justin Gilmer

    Abstract: We introduce the MNIST-C dataset, a comprehensive suite of 15 corruptions applied to the MNIST test set, for benchmarking out-of-distribution robustness in computer vision. Through several experiments and visualizations we demonstrate that our corruptions significantly degrade performance of state-of-the-art computer vision models while preserving the semantic content of the test images. In contra…

    Submitted 5 June, 2019; originally announced June 2019.

  13. arXiv:1812.01216

    cs.LG

    Parameter Re-Initialization through Cyclical Batch Size Schedules

    Authors: Norman Mu, Zhewei Yao, Amir Gholami, Kurt Keutzer, Michael Mahoney

    Abstract: Optimal parameter initialization remains a crucial problem for neural network training. A poor weight initialization may take longer to train and/or converge to sub-optimal solutions. Here, we propose a method of weight re-initialization by repeated annealing and injection of noise in the training process. We implement this through a cyclical batch size schedule motivated by a Bayesian perspective…

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: Presented at the Systems for Machine Learning Workshop at the NeurIPS'18 conference

    Journal ref: NeurIPS 2018 Workshop