
Showing 1–23 of 23 results for author: Wei, A

Searching in archive cs.
  1. arXiv:2406.20053  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation

    Authors: Danny Halawi, Alexander Wei, Eric Wallace, Tony T. Wang, Nika Haghtalab, Jacob Steinhardt

    Abstract: Black-box finetuning is an emerging interface for adapting state-of-the-art language models to user needs. However, such access may also let malicious actors undermine model safety. To demonstrate the challenge of defending finetuning interfaces, we introduce covert malicious finetuning, a method to compromise model safety via finetuning while evading detection. Our method constructs a malicious d…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 22 pages

  2. arXiv:2403.13213  [pdf, other]

    cs.LG cs.CL cs.CY

    From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards

    Authors: Khaoula Chehbouni, Megha Roshan, Emmanuel Ma, Futian Andrew Wei, Afaf Taik, Jackie CK Cheung, Golnoosh Farnadi

    Abstract: Recent progress in large language models (LLMs) has led to their widespread adoption in various domains. However, these advancements have also introduced additional safety risks and raised concerns regarding their detrimental impact on already marginalized populations. Despite growing mitigation efforts to develop safety safeguards, such as supervised safety-oriented fine-tuning and leveraging saf…

    Submitted 7 June, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures. Accepted to Findings of the Association for Computational Linguistics: ACL 2024

  3. arXiv:2403.09675  [pdf, other]

    cs.CV cs.GR

    Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases

    Authors: Rio Aguina-Kang, Maxim Gumin, Do Heon Han, Stewart Morris, Seung Jean Yoo, Aditya Ganeshan, R. Kenny Jones, Qiuhong Anna Wei, Kailiang Fu, Daniel Ritchie

    Abstract: We present a system for generating indoor scenes in response to text prompts. The prompts are not limited to a fixed vocabulary of scene descriptions, and the objects in generated scenes are not restricted to a fixed set of object categories -- we call this setting open-universe indoor scene generation. Unlike most prior work on indoor scene generation, our system does not require a large training dataset of ex…

    Submitted 4 February, 2024; originally announced March 2024.

    Comments: See ancillary files for link to supplemental material

  4. arXiv:2307.02483  [pdf, other]

    cs.LG cs.CR

    Jailbroken: How Does LLM Safety Training Fail?

    Authors: Alexander Wei, Nika Haghtalab, Jacob Steinhardt

    Abstract: Large language models trained for safety and harmlessness remain susceptible to adversarial misuse, as evidenced by the prevalence of "jailbreak" attacks on early releases of ChatGPT that elicit undesired behavior. Going beyond recognition of the issue, we investigate why such attacks succeed and how they can be created. We hypothesize two failure modes of safety training: competing objectives and…

    Submitted 5 July, 2023; originally announced July 2023.

  5. arXiv:2304.11259  [pdf, other]

    cs.RO

    Consensus Complementarity Control for Multi-Contact MPC

    Authors: Alp Aydinoglu, Adam Wei, Wei-Cheng Huang, Michael Posa

    Abstract: We propose a hybrid model predictive control algorithm, consensus complementarity control (C3), for systems that make and break contact with their environment. Many state-of-the-art controllers for tasks which require initiating contact with the environment, such as locomotion and manipulation, require a priori mode schedules or are too computationally complex to run at real-time rates. We present…

    Submitted 7 March, 2024; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: T-RO submission. Continuation of the work: arXiv:2109.07076v2

  6. arXiv:2301.09629  [pdf, other]

    cs.CV

    LEGO-Net: Learning Regular Rearrangements of Objects in Rooms

    Authors: Qiuhong Anna Wei, Sijie Ding, Jeong Joon Park, Rahul Sajnani, Adrien Poulenard, Srinath Sridhar, Leonidas Guibas

    Abstract: Humans universally dislike the task of cleaning up a messy room. If machines were to help us with this task, they must understand human criteria for regular arrangements, such as several types of symmetry, co-linearity or co-circularity, spacing uniformity in linear or circular patterns, and further inter-object relationships that relate to style and functionality. Previous approaches for this tas…

    Submitted 24 March, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Project page: https://ivl.cs.brown.edu/projects/lego-net

  7. arXiv:2211.05910  [pdf, other]

    eess.IV cs.CV

    Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li, et al. (71 additional authors not shown)

    Abstract: Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256

  8. arXiv:2208.09407  [pdf, other]

    cs.GT cs.DS cs.LG

    Learning in Stackelberg Games with Non-myopic Agents

    Authors: Nika Haghtalab, Thodoris Lykouris, Sloan Nietert, Alex Wei

    Abstract: We study Stackelberg games where a principal repeatedly interacts with a long-lived, non-myopic agent, without knowing the agent's payoff function. Although learning in Stackelberg games is well-understood when the agent is myopic, non-myopic agents pose additional complications. In particular, non-myopic agents may strategically select actions that are inferior in the present to mislead the princ…

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: An extended abstract of this work appeared at the ACM Conference on Economics and Computation (EC) 2022

  9. arXiv:2207.06343  [pdf, other]

    cs.LG cs.DC math.OC stat.ML

    TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

    Authors: Yaodong Yu, Alexander Wei, Sai Praneeth Karimireddy, Yi Ma, Michael I. Jordan

    Abstract: State-of-the-art federated learning methods can perform far worse than their centralized counterparts when clients have dissimilar data distributions. For neural networks, even when centralized SGD easily finds a solution that is simultaneously performant for all clients, current federated optimization methods fail to converge to a comparable solution. We show that this performance disparity can l…

    Submitted 5 October, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS) 2022. V2 releases code

    MSC Class: 68W40; 68W15; 90C25; 90C06 ACM Class: G.1.6; F.2.1; E.4

  10. arXiv:2207.05531  [pdf, other]

    cs.SE

    Fuzzing Deep-Learning Libraries via Automated Relational API Inference

    Authors: Yinlin Deng, Chenyuan Yang, Anjiang Wei, Lingming Zhang

    Abstract: A growing body of research has been dedicated to DL model testing. However, there is still limited work on testing DL libraries, which serve as the foundations for building, training, and running DL models. Prior work on fuzzing DL libraries can only generate tests for APIs which have been invoked by documentation examples, developer tests, or DL models, leaving a large number of APIs untested. In…

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted at ESEC/FSE 2022

  11. arXiv:2203.06176  [pdf, other]

    cs.LG stat.ML

    More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize

    Authors: Alexander Wei, Wei Hu, Jacob Steinhardt

    Abstract: Of theories for why large-scale machine learning models generalize despite being vastly overparameterized, which of their assumptions are needed to capture the qualitative phenomena of generalization in the real world? On one hand, we find that most theoretical analyses fall short of capturing these qualitative phenomena even for kernel regression, when applied to kernels derived from large-scale…

    Submitted 11 March, 2022; originally announced March 2022.

  12. arXiv:2202.05834  [pdf, other]

    cs.LG stat.ML

    Predicting Out-of-Distribution Error with the Projection Norm

    Authors: Yaodong Yu, Zitong Yang, Alexander Wei, Yi Ma, Jacob Steinhardt

    Abstract: We propose a metric -- Projection Norm -- to predict a model's performance on out-of-distribution (OOD) data without access to ground truth labels. Projection Norm first uses model predictions to pseudo-label test samples and then trains a new model on the pseudo-labels. The more the new model's parameters differ from an in-distribution model, the greater the predicted OOD error. Empirically, our…

    Submitted 11 February, 2022; originally announced February 2022.

  13. arXiv:2201.06589  [pdf, other]

    cs.SE

    Free Lunch for Testing: Fuzzing Deep-Learning Libraries from Open Source

    Authors: Anjiang Wei, Yinlin Deng, Chenyuan Yang, Lingming Zhang

    Abstract: Deep learning (DL) systems can make our life much easier, and thus are gaining more and more attention from both academia and industry. Meanwhile, bugs in DL systems can be disastrous, and can even threaten human lives in safety-critical applications. To date, a huge body of research efforts have been dedicated to testing DL models. However, interestingly, there is still limited work for testing t…

    Submitted 25 February, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

  14. arXiv:2108.09922  [pdf]

    cs.SD cs.LG eess.AS

    Subject Envelope based Multitype Reconstruction Algorithm of Speech Samples of Parkinson's Disease

    Authors: Yongming Li, Chengyu Liu, Pin Wang, Hehua Zhang, Anhai Wei

    Abstract: The risk of Parkinson's disease (PD) is extremely serious, and PD speech recognition is an effective method of diagnosis nowadays. However, due to the influence of the disease stage, corpus, and other factors on data collection, the ability of every samples within one subject to reflect the status of PD vary. No samples are useless totally, and not samples are 100% perfect. This characteristic mea…

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: 11 pages, 6 tables

  15. arXiv:2108.08843  [pdf, other]

    cs.LG cs.GT stat.ML

    Learning Equilibria in Matching Markets from Bandit Feedback

    Authors: Meena Jagadeesan, Alexander Wei, Yixin Wang, Michael I. Jordan, Jacob Steinhardt

    Abstract: Large-scale, two-sided matching platforms must find market outcomes that align with user preferences while simultaneously learning these preferences from data. Classical notions of stability (Gale and Shapley, 1962; Shapley and Shubik, 1971) are unfortunately of limited value in the learning setting, given that preferences are inherently uncertain and destabilizing while they are being learned. To…

    Submitted 31 January, 2023; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: Accepted to the Journal of the ACM; conference version appeared at NeurIPS 2021

  16. arXiv:2102.09017  [pdf, other]

    cs.GT

    Designing Approximately Optimal Search on Matching Platforms

    Authors: Nicole Immorlica, Brendan Lucier, Vahideh Manshadi, Alexander Wei

    Abstract: We study the design of a decentralized two-sided matching market in which agents' search is guided by the platform. There are finitely many agent types, each with (potentially random) preferences drawn from known type-specific distributions. Equipped with knowledge of these distributions, the platform guides the search process by determining the meeting rate between each pair of types from the two…

    Submitted 18 August, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

  17. arXiv:2010.11443  [pdf, other]

    cs.LG cs.DS

    Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online Algorithms

    Authors: Alexander Wei, Fred Zhang

    Abstract: We study the problem of improving the performance of online algorithms by incorporating machine-learned predictions. The goal is to design algorithms that are both consistent and robust, meaning that the algorithm performs well when predictions are accurate and maintains worst-case guarantees. Such algorithms have been studied in a recent line of works due to Lykouris and Vassilvitskii (ICML '18)…

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: To appear at NeurIPS 2020

  18. arXiv:2007.09610  [pdf, other]

    eess.IV cs.CV cs.LG

    Self-similarity Student for Partial Label Histopathology Image Segmentation

    Authors: Hsien-Tzu Cheng, Chun-Fu Yeh, Po-Chen Kuo, Andy Wei, Keng-Chi Liu, Mong-Chi Ko, Kuan-Hua Chao, Yu-Ching Peng, Tyng-Luh Liu

    Abstract: Delineation of cancerous regions in gigapixel whole slide images (WSIs) is a crucial diagnostic procedure in digital pathology. This process is time-consuming because of the large search space in the gigapixel WSIs, causing chances of omission and misinterpretation at indistinct tumor lesions. To tackle this, the development of an automated cancerous region segmentation method is imperative. We fr…

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: ECCV 2020

  19. arXiv:2005.13716  [pdf, ps, other]

    cs.DS

    Better and Simpler Learning-Augmented Online Caching

    Authors: Alexander Wei

    Abstract: Lykouris and Vassilvitskii (ICML 2018) introduce a model of online caching with machine-learned advice, where each page request additionally comes with a prediction of when that page will next be requested. In this model, a natural goal is to design algorithms that (1) perform well when the advice is accurate and (2) remain robust in the worst case a la traditional competitive analysis. Lykouris a…

    Submitted 27 May, 2020; originally announced May 2020.

  20. arXiv:2004.12786  [pdf, other]

    eess.IV cs.CV cs.LG

    A Cascaded Learning Strategy for Robust COVID-19 Pneumonia Chest X-Ray Screening

    Authors: Chun-Fu Yeh, Hsien-Tzu Cheng, Andy Wei, Hsin-Ming Chen, Po-Chen Kuo, Keng-Chi Liu, Mong-Chi Ko, Ray-Jade Chen, Po-Chang Lee, Jen-Hsiang Chuang, Chi-Mai Chen, Yi-Chang Chen, Wen-Jeng Lee, Ning Chien, Jo-Yu Chen, Yu-Sen Huang, Yu-Chien Chang, Yu-Cheng Huang, Nai-Kuan Chou, Kuan-Hua Chao, Yi-Chin Tu, Yeun-Chung Chang, Tyng-Luh Liu

    Abstract: We introduce a comprehensive screening platform for the COVID-19 (a.k.a., SARS-CoV-2) pneumonia. The proposed AI-based system works on chest x-ray (CXR) images to predict whether a patient is infected with the COVID-19 disease. Although the recent international joint effort on making the availability of all sorts of open data, the public collection of CXR images is still relatively small for relia…

    Submitted 30 April, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: 14 pages, 6 figures

  21. arXiv:1807.07527  [pdf, ps, other]

    cs.DS

    Optimal Las Vegas Approximate Near Neighbors in $\ell_p$

    Authors: Alexander Wei

    Abstract: We show that approximate near neighbor search in high dimensions can be solved in a Las Vegas fashion (i.e., without false negatives) for $\ell_p$ ($1\le p\le 2$) while matching the performance of optimal locality-sensitive hashing. Specifically, we construct a data-independent Las Vegas data structure with query time $O(dn^{\rho})$ and space usage $O(dn^{1+\rho})$ for $(r, c r)$-approximate near neighbor…

    Submitted 19 July, 2018; originally announced July 2018.

  22. Bayesian and hybrid Cramer-Rao bounds for QAM dynamical phase estimation

    Authors: Jianxiao Yang, Benoit Geller, A Wei

    Abstract: In this paper, we study Bayesian and hybrid Cramer-Rao bounds for the dynamical phase estimation of QAM modulated signals. We present the analytical expressions for the various CRBs. This avoids the calculation of any matrix inversion and thus greatly reduces the computation complexity. Through simulations, we also illustrate the behaviors of the BCRB and of the HCRB with the signal-to-noise rati…

    Submitted 5 November, 2015; originally announced November 2015.

    Journal ref: Acoustics, Speech and Signal Processing, Apr 2009, Taipei, Taiwan. 2009

  23. arXiv:1206.1419  [pdf]

    cs.NI

    Analysis study of time synchronization protocols in wireless sensor networks

    Authors: Salim el Khediri, Nejah Nasri, Mounir Samet, Anne Wei, Abdennaceur Kachouri

    Abstract: One of the main pervasive problems Wireless Sensor Networks (WSN) encounter is to maintain flawless communication sharing and cooperative processing between sensors via radio links to ensure a reliable treatment of information. Many applications based on these WSNs consider local clocks at each sensor node that need to be synchronized to a common notion of time. In this context, the majority of pr…

    Submitted 7 June, 2012; originally announced June 2012.

    Comments: 15 pages, 1 figure, 2 tables