Skip to main content

Showing 1–50 of 120 results for author: Wong, E

.
  1. arXiv:2407.00075  [pdf, other

    cs.AI cs.CL cs.CR cs.LG

    Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

    Authors: Anton Xue, Avishree Khare, Rajeev Alur, Surbhi Goel, Eric Wong

    Abstract: We study how to subvert language models from following the rules. We model rule-following as inference in propositional Horn logic, a mathematical system in which rules have the form "if $P$ and $Q$, then $R$" for some propositions $P$, $Q$, and $R$. We prove that although transformers can faithfully abide by such rules, maliciously crafted prompts can nevertheless mislead even theoretically const… ▽ More

    Submitted 21 June, 2024; originally announced July 2024.

  2. arXiv:2406.18534  [pdf, other

    cs.CL cs.LG

    Towards Compositionality in Concept Learning

    Authors: Adam Stein, Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong

    Abstract: Concept-based interpretability methods offer a lens into the internals of foundation models by decomposing their embeddings into high-level concepts. These concept representations are most useful when they are compositional, meaning that the individual concepts compose to explain the full sample. We show that existing unsupervised concept extraction methods find concepts which are not compositiona… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. 26 pages, 10 figures

  3. arXiv:2406.10952  [pdf, other

    cs.CL

    Avoiding Copyright Infringement via Machine Unlearning

    Authors: Guangyao Dou, Zheyuan Liu, Qing Lyu, Kaize Ding, Eric Wong

    Abstract: Pre-trained Large Language Models (LLMs) have demonstrated remarkable capabilities but also pose risks by learning and generating copyrighted material, leading to significant legal and ethical concerns. To address these issues, it is critical for model owners to be able to unlearn copyrighted content at various time steps. We explore the setting of sequential unlearning, where copyrighted content… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  4. arXiv:2406.06246  [pdf, other

    cs.LG

    Data-Efficient Learning with Neural Programs

    Authors: Alaia Solko-Breslin, Seewon Choi, Ziyang Li, Neelay Velingker, Rajeev Alur, Mayur Naik, Eric Wong

    Abstract: Many computational tasks can be naturally expressed as a composition of a DNN followed by a program written in a traditional programming language or an API call to an LLM. We call such composites "neural programs" and focus on the problem of learning the DNN parameters when the training data consist of end-to-end input-output labels for the composite. When the program is written in a differentiabl… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  5. Rethinking Programming Paradigms in the QC-HPC Context

    Authors: Silvina Caino-Lores, Daniel Claudino, Eugene Dumitrescu, Travis S. Humble, Sonia Lopez Alarcon, Elaine Wong

    Abstract: Programming for today's quantum computers is making significant strides toward modern workflows compatible with high performance computing (HPC), but fundamental challenges still remain in the integration of these vastly different technologies. Quantum computing (QC) programming languages share some common ground, as well as their emerging runtimes and algorithmic modalities. In this short paper,… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Journal ref: WAMTA 2024: Proceedings of the Workshop on Asynchronous Many-Task Systems and Applications. Lecture Notes in Computer Science, Vol 14626, Pages 84-91, Springer, Cham

  6. arXiv:2406.00611  [pdf, other

    cs.LG stat.ME

    DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation

    Authors: Yinjun Wu, Mayank Keoliya, Kan Chen, Neelay Velingker, Ziyang Li, Emily J Getzen, Qi Long, Mayur Naik, Ravi B Parikh, Eric Wong

    Abstract: Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE). ITE prediction models deployed in critical settings such as healthcare should ideally be (i) accurate, and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. 22 pages, 5 figures

  7. arXiv:2405.06692  [pdf, ps, other

    cs.CL

    Analyzing Language Bias Between French and English in Conventional Multilingual Sentiment Analysis Models

    Authors: Ethan Parker Wong, Faten M'hiri

    Abstract: Inspired by the 'Bias Considerations in Bilingual Natural Language Processing' report by Statistics Canada, this study delves into potential biases in multilingual sentiment analysis between English and French. Given a 50-50 dataset of French and English, we aim to determine if there exists a language bias and explore how the incorporation of more diverse datasets in the future might affect the eq… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Undergraduate Research Project

  8. Practice-informed Patterns for Organising Large Groups in Distributed Mixed Reality Collaboration

    Authors: Emily Wong, Juan Sánchez Esquivel, Jens Emil Grønbæk, Germán Leiva, Eduardo Velloso

    Abstract: Collaborating across dissimilar, distributed spaces presents numerous challenges for computer-aided spatial communication. Mixed reality (MR) can blend selected surfaces, allowing collaborators to work in blended f-formations (facing formations), even when their workstations are physically misaligned. Since collaboration often involves more than just participant pairs, this research examines how w… ▽ More

    Submitted 9 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Journal ref: CHI '24, Proceedings of the CHI Conference on Human Factors in Computing Systems, May 11-16 2024, Honolulu, HI, USA

  9. arXiv:2404.19727  [pdf, other

    quant-ph

    Expressiveness of Commutative Quantum Circuits: A Probabilistic Approach

    Authors: Jorge M. Ramirez, Elaine Wong, Caio Alves, Sarah Chehade, Ryan Bennink

    Abstract: This study investigates the frame potential and expressiveness of commutative quantum circuits. Based on the Fourier series representation of these circuits, we express quantum expectation and pairwise fidelity as characteristic functions of random variables, and expressiveness as the recurrence probability of a random walk on a lattice. A central outcome of our work includes formulas to approxima… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  10. arXiv:2404.14299  [pdf, other

    quant-ph cs.SE

    A Cross-Platform Execution Engine for the Quantum Intermediate Representation

    Authors: Elaine Wong, Vicente Leyton Ortega, Daniel Claudino, Seth Johnson, Sharmin Afrose, Meenambika Gowrishankar, Anthony M. Cabrera, Travis S. Humble

    Abstract: Hybrid languages like the Quantum Intermediate Representation (QIR) are essential for programming systems that mix quantum and conventional computing models, while execution of these programs is often deferred to a system-specific implementation. Here, we describe and demonstrate the QIR Execution Engine (QIR-EE) for parsing, interpreting, and executing QIR across multiple hardware platforms. QIR-… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  11. arXiv:2404.01318  [pdf, other

    cs.CR cs.LG

    JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

    Authors: Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramer, Hamed Hassani, Eric Wong

    Abstract: Jailbreak attacks cause large language models (LLMs) to generate harmful, unethical, or otherwise objectionable content. Evaluating these attacks presents a number of challenges, which the current collection of benchmarks and evaluation techniques do not adequately address. First, there is no clear standard of practice regarding jailbreaking evaluation. Second, existing works compute costs and suc… ▽ More

    Submitted 16 June, 2024; v1 submitted 27 March, 2024; originally announced April 2024.

    Comments: JailbreakBench v1.0: more attack artifacts, more test-time defenses, a more accurate jailbreak judge (Llama-3-70B with a custom prompt), a larger dataset of human preferences for selecting a jailbreak judge (300 examples), an over-refusal evaluation dataset (100 benign/borderline behaviors), a semantic refusal judge based on Llama-3-8B

  12. arXiv:2402.16192  [pdf, other

    cs.CL

    Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing

    Authors: Jiabao Ji, Bairu Hou, Alexander Robey, George J. Pappas, Hamed Hassani, Yang Zhang, Eric Wong, Shiyu Chang

    Abstract: Aligned large language models (LLMs) are vulnerable to jailbreaking attacks, which bypass the safeguards of targeted LLMs and fool them into generating objectionable content. While initial defenses show promise against token-based threat models, there do not exist defenses that provide robustness against semantic attacks and avoid unfavorable trade-offs between robustness and nominal performance.… ▽ More

    Submitted 28 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: 37 pages

  13. arXiv:2401.13650  [pdf, other

    eess.IV cs.CV

    Tyche: Stochastic In-Context Learning for Medical Image Segmentation

    Authors: Marianne Rakic, Hallee E. Wong, Jose Javier Gonzalez Ortiz, Beth Cimini, John Guttag, Adrian V. Dalca

    Abstract: Existing learning-based solutions to medical image segmentation have two important shortcomings. First, for most new segmentation task, a new model has to be trained or fine-tuned. This requires extensive resources and machine learning expertise, and is therefore often infeasible for medical researchers and clinicians. Second, most existing segmentation methods produce a single deterministic segme… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  14. arXiv:2312.07381  [pdf, other

    cs.CV eess.IV

    ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image

    Authors: Hallee E. Wong, Marianne Rakic, John Guttag, Adrian V. Dalca

    Abstract: Biomedical image segmentation is a crucial part of both scientific research and clinical care. With enough labelled data, deep learning models can be trained to accurately automate specific biomedical image segmentation tasks. However, manually segmenting images to create training data is highly labor intensive and requires domain expertise. We present ScribblePrompt, a flexible neural network bas… ▽ More

    Submitted 12 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Project Website: https://scribbleprompt.csail.mit.edu Keywords: Interactive Segmentation, Medical Imaging, Segment Anything Model, SAM, Scribble Annotations, Prompt

  15. arXiv:2312.05716  [pdf, other

    cs.CV

    Initialization Matters for Adversarial Transfer Learning

    Authors: Andong Hua, **dong Gu, Zhiyu Xue, Nicholas Carlini, Eric Wong, Yao Qin

    Abstract: With the prevalence of the Pretraining-Finetuning paradigm in transfer learning, the robustness of downstream tasks has become a critical concern. In this work, we delve into adversarial robustness in transfer learning and reveal the critical role of initialization, including both the pretrained model and the linear head. First, we discover the necessity of an adversarially robust pretrained model… ▽ More

    Submitted 30 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: CVPR 2024

  16. arXiv:2312.03231  [pdf, other

    cs.LG cs.AI cs.CV cs.HC eess.AS

    Deep Multimodal Fusion for Surgical Feedback Classification

    Authors: Rafal Kocielnik, Elyssa Y. Wong, Timothy N. Chu, Lydia Lin, De-An Huang, Jiayun Wang, Anima Anandkumar, Andrew J. Hung

    Abstract: Quantification of real-time informal feedback delivered by an experienced surgeon to a trainee during surgery is important for skill improvements in surgical training. Such feedback in the live operating room is inherently multimodal, consisting of verbal conversations (e.g., questions and answers) as well as non-verbal elements (e.g., through visual cues like pointing to anatomic elements). In th… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Journal ref: Published in Proceedings of Machine Learning for Health 2024

  17. arXiv:2310.16316  [pdf, other

    cs.LG cs.AI

    Sum-of-Parts Models: Faithful Attributions for Groups of Features

    Authors: Weiqiu You, Helen Qu, Marco Gatti, Bhuvnesh Jain, Eric Wong

    Abstract: An explanation of a machine learning model is considered "faithful" if it accurately reflects the model's decision-making process. However, explanations such as feature attributions for deep learning are not guaranteed to be faithful, and can produce potentially misleading interpretations. In this work, we develop Sum-of-Parts (SOP), a class of models whose predictions come with grouped feature at… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  18. arXiv:2310.12508  [pdf, other

    cs.LG cs.AI

    SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation

    Authors: Chongyu Fan, Jiancheng Liu, Yihua Zhang, Eric Wong, Dennis Wei, Sijia Liu

    Abstract: With evolving data regulations, machine unlearning (MU) has become an important tool for fostering trust and safety in today's AI models. However, existing MU methods focusing on data and/or weight perspectives often suffer limitations in unlearning accuracy, stability, and cross-domain applicability. To address these challenges, we introduce the concept of 'weight saliency' for MU, drawing parall… ▽ More

    Submitted 4 April, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024 as a Spotlight paper

  19. arXiv:2310.08419  [pdf, other

    cs.LG cs.AI

    Jailbreaking Black Box Large Language Models in Twenty Queries

    Authors: Patrick Chao, Alexander Robey, Edgar Dobriban, Hamed Hassani, George J. Pappas, Eric Wong

    Abstract: There is growing interest in ensuring that large language models (LLMs) align with human values. However, the alignment of such models is vulnerable to adversarial jailbreaks, which coax LLMs into overriding their safety guardrails. The identification of these vulnerabilities is therefore instrumental in understanding inherent weaknesses and preventing future misuse. To this end, we propose Prompt… ▽ More

    Submitted 13 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: 21 pages, 10 figures

  20. arXiv:2310.07135  [pdf, other

    cs.CL

    Comparing Styles across Languages

    Authors: Shreya Havaldar, Matthew Pressimone, Eric Wong, Lyle Ungar

    Abstract: Understanding how styles differ across languages is advantageous for training both humans and computers to generate culturally appropriate text. We introduce an explanation framework to extract stylistic differences from multilingual LMs and compare styles across languages. Our framework (1) generates comprehensive style lexica in any language and (2) consolidates feature importances from LMs into… ▽ More

    Submitted 4 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  21. arXiv:2310.03684  [pdf, other

    cs.LG cs.AI stat.ML

    SmoothLLM: Defending Large Language Models Against Jailbreaking Attacks

    Authors: Alexander Robey, Eric Wong, Hamed Hassani, George J. Pappas

    Abstract: Despite efforts to align large language models (LLMs) with human intentions, widely-used LLMs such as GPT, Llama, and Claude are susceptible to jailbreaking attacks, wherein an adversary fools a targeted LLM into generating objectionable content. To address this vulnerability, we propose SmoothLLM, the first algorithm designed to mitigate jailbreaking attacks. Based on our finding that adversarial… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  22. arXiv:2309.03861  [pdf, other

    astro-ph.EP

    Saturn's ancient regular satellites

    Authors: Emily. W. Wong, Ramon Brasser, Stephanie. C. Werner, Michelle. R. Kirchoff

    Abstract: Are Saturn's regular satellites young or old? And how old are Enceladus' cratered plains? To answer these questions we computed model surface ages of the most heavily cratered terrains on Saturn's regular icy satellites using new high-resolution outer Solar System evolution simulations, and coupled with improved estimates of the trans-Neptunian objects populations. The output of the simulations al… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted for publication in Icarus. 30 pages, 11 figures; including Appendix and Appendix Figures

  23. Fe substitution in URu$_2$Si$_2$: singlet magnetism in an extended Doniach phase diagram

    Authors: Andrea Marino, Denise S. Christovam, Chun-Fu Chang, Johannes Falke, Chang-Yang Kuo, Chi-Nan Wu, Martin Sundermann, Andrea Amorese, Hlynur Gretarsson, Eric Lee Wong, Camilla M. Moir, Yuang Deng, M. Brian Maple, Peter Thalmeier, Liu Hao Tjeng, Andrea Severing

    Abstract: The application of pressure as well as the successive substitution of Ru with Fe in the hidden order (HO) compound URu$_2$Si$_2$ leads to the formation of the large moment antiferromagnetic phase (LMAFM). Here we have investigated the substitution series URu$_{2-x}$Fe$_x$Si$_2$ from $x$\,=\,0.0 to 2.0 by U\,4$f$ core-level photoelectron spectroscopy and have observed non-monotonic changes in the s… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: 10 pages, 7 figures

    Journal ref: Phys. Rev. B 108, 085128 (2023)

  24. arXiv:2308.06686  [pdf, other

    cs.DB cs.LG cs.SE

    TorchQL: A Programming Framework for Integrity Constraints in Machine Learning

    Authors: Aaditya Naik, Adam Stein, Yinjun Wu, Mayur Naik, Eric Wong

    Abstract: Finding errors in machine learning applications requires a thorough exploration of their behavior over data. Existing approaches used by practitioners are often ad-hoc and lack the abstractions needed to scale this process. We present TorchQL, a programming framework to evaluate and improve the correctness of machine learning applications. TorchQL allows users to write queries to specify and check… ▽ More

    Submitted 14 February, 2024; v1 submitted 13 August, 2023; originally announced August 2023.

  25. arXiv:2308.02712  [pdf, other

    astro-ph.IM astro-ph.EP

    A Search for Technosignatures Around 11,680 Stars with the Green Bank Telescope at 1.15-1.73 GHz

    Authors: Jean-Luc Margot, Megan G. Li, Pavlo Pinchuk, Nathan Myhrvold, Larry Lesyna, Lea E. Alcantara, Megan T. Andrakin, Jeth Arunseangroj, Damien S. Baclet, Madison H. Belk, Zerxes R. Bhadha, Nicholas W. Brandis, Robert E. Carey, Harrison P. Cassar, Sai S. Chava, Calvin Chen, James Chen, Kellen T. Cheng, Alessia Cimbri, Benjamin Cloutier, Jordan A. Combitsis, Kelly L. Couvrette, Brandon P. Coy, Kyle W. Davis, Antoine F. Delcayre , et al. (56 additional authors not shown)

    Abstract: We conducted a search for narrowband radio signals over four observing sessions in 2020-2023 with the L-band receiver (1.15-1.73 GHz) of the 100 m diameter Green Bank Telescope. We pointed the telescope in the directions of 62 TESS Objects of Interest, capturing radio emissions from a total of ~11,680 stars and planetary systems in the ~9 arcminute beam of the telescope. All detections were either… ▽ More

    Submitted 15 October, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: 21 pages, 9 figures, in press at AJ

    Journal ref: AJ 166 206 (2023)

  26. arXiv:2307.05902  [pdf, other

    cs.LG cs.AI

    Stability Guarantees for Feature Attributions with Multiplicative Smoothing

    Authors: Anton Xue, Rajeev Alur, Eric Wong

    Abstract: Explanation methods for machine learning models tend not to provide any formal guarantees and may not reflect the underlying decision-making process. In this work, we analyze stability as a property for reliable feature attribution methods. We prove that relaxed variants of stability are guaranteed if the model is sufficiently Lipschitz with respect to the masking of features. We develop a smoothi… ▽ More

    Submitted 26 October, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

  27. arXiv:2306.14414  [pdf, ps, other

    math.NT cs.CR cs.IT math.CO

    Rationality of Four-Valued Families of Weil Sums of Binomials

    Authors: Daniel J. Katz, Allison E. Wong

    Abstract: We investigate the rationality of Weil sums of binomials of the form $W^{K,s}_u=\sum_{x \in K} ψ(x^s - u x)$, where $K$ is a finite field whose canonical additive character is $ψ$, and where $u$ is an element of $K^{\times}$ and $s$ is a positive integer relatively prime to $|K^\times|$, so that $x \mapsto x^s$ is a permutation of $K$. The Weil spectrum for $K$ and $s$, which is the family of valu… ▽ More

    Submitted 6 April, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 33 pages

    MSC Class: 11T24; 11L05; 11L40; 11T22; 11G25; 11T71; 94A55; 94A60; 94B15

  28. arXiv:2306.03304  [pdf, other

    physics.plasm-ph

    Plasma flows during the ablation stage of an over-massed pulsed-power-driven exploding planar wire array

    Authors: R. Datta, J. Angel, J. B. Greenly, S. N. Bland, J. P. Chittenden, E. S. Lavine, W. M. Potter, D. Robinson, T. W. O. Varnish, E. Wong, D. A. Hammer, B. R. Kusse, J. D. Hare

    Abstract: We characterize the plasma flows generated during the ablation stage of an over-massed exploding planar wire array, fielded on the COBRA pulsed-power facility (1 MA peak current, 250 ns rise time). The planar wire array is designed to provide a driving magnetic field (80-100 T) and current per wire distribution (about 60 kA), similar to that in a 10 MA cylindrical exploding wire array fielded on t… ▽ More

    Submitted 12 July, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 14 pages; 14 figures

    Journal ref: Phys. Plasmas 30, 092104 (2023)

  29. arXiv:2306.00976  [pdf, other

    cs.CL

    TopEx: Topic-based Explanations for Model Comparison

    Authors: Shreya Havaldar, Adam Stein, Eric Wong, Lyle Ungar

    Abstract: Meaningfully comparing language models is challenging with current explanation methods. Current explanations are overwhelming for humans due to large vocabularies or incomparable across models. We present TopEx, an explanation method that enables a level playing field for comparing language models via model-agnostic topics. We demonstrate how TopEx can identify similarities and differences between… ▽ More

    Submitted 1 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICLR 2023, Tiny Papers Track

  30. arXiv:2305.16308  [pdf, other

    cs.LG

    Rectifying Group Irregularities in Explanations for Distribution Shift

    Authors: Adam Stein, Yinjun Wu, Eric Wong, Mayur Naik

    Abstract: It is well-known that real-world changes constituting distribution shift adversely affect model performance. How to characterize those changes in an interpretable manner is poorly understood. Existing techniques to address this problem take the form of shift explanations that elucidate how to map samples from the original distribution toward the shifted one by reducing the disparity between these… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 19 pages, 5 figures

  31. arXiv:2305.01209  [pdf, other

    econ.GN

    Cooperation and Cognition in Social Networks

    Authors: Edoardo Gallo, Joseph Lee, Yohanes Eko Riyanto, Erwin Wong

    Abstract: Social networks can sustain cooperation by amplifying the consequences of a single defection through a cascade of relationship losses. Building on Jackson et al. (2012), we introduce a novel robustness notion to characterize low cognitive complexity (LCC) networks - a subset of equilibrium networks that imposes a minimal cognitive burden to calculate and comprehend the consequences of defection. W… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  32. arXiv:2303.09603  [pdf, ps, other

    math.CO cs.SC

    Rigorous Analytic Combinatorics in Several Variables in SageMath

    Authors: Benjamin Hackl, Andrew Luo, Stephen Melczer, Jesse Selover, Elaine Wong

    Abstract: We introduce the new sage_acsv package for the SageMath computer algebra system, allowing users to rigorously compute asymptotics for a large variety of multivariate sequences with rational generating functions. Using Sage's support for exact computations over the algebraic number field, this package provides the first rigorous implementation of algorithms from the theory of analytic combinatorics… ▽ More

    Submitted 31 August, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: 8 pages; Package: https://pypi.org/project/sage-acsv/

    Journal ref: Séminaire Lotharingiende Combinatoire 89B (2023): Proceedings of the 35th FPSAC Conference, Article #90,12pp

  33. arXiv:2303.01433  [pdf, other

    cs.LG cs.AI

    Do Machine Learning Models Learn Statistical Rules Inferred from Data?

    Authors: Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong

    Abstract: Machine learning models can make critical errors that are easily hidden within vast amounts of data. Such errors often run counter to rules based on human intuition. However, rules based on human knowledge are challenging to scale or to even formalize. We thereby seek to infer statistical rules from the data and quantify the extent to which a model has learned them. We propose a framework SQRL tha… ▽ More

    Submitted 6 June, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  34. arXiv:2302.11042  [pdf, other

    cs.CL cs.LG

    In-context Example Selection with Influences

    Authors: Tai Nguyen, Eric Wong

    Abstract: In-context learning (ICL) is a powerful paradigm emerged from large language models (LLMs). Despite its promises, ICL performance is known to be highly sensitive to input examples. In this work, we use $\textit{in-context influences}$ to analyze few-shot ICL performance directly from the in-context examples. Our proposed influence-based example selection method can identify both positive and negat… ▽ More

    Submitted 5 June, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  35. arXiv:2302.04237  [pdf, other

    cs.LG

    Black Box Adversarial Prompting for Foundation Models

    Authors: Natalie Maus, Patrick Chao, Eric Wong, Jacob Gardner

    Abstract: Prompting interfaces allow users to quickly adjust the output of generative models in both vision and language. However, small changes and design choices in the prompt can lead to significant differences in the output. In this work, we develop a black-box framework for generating adversarial prompts for unstructured image and text generation. These prompts, which can be standalone or prepended to… ▽ More

    Submitted 29 May, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

  36. A Unified Approach to Unimodality of Gaussian Polynomials

    Authors: Christoph Koutschan, Ali K. Uncu, Elaine Wong

    Abstract: In 2013, Pak and Panova proved the strict unimodality property of $q$-binomial coefficients $\binom{\ell+m}{m}_q$ (as polynomials in $q$) based on the combinatorics of Young tableaux and the semigroup property of Kronecker coefficients. They showed it to be true for all $\ell,m\geq 8$ and a few other cases. We propose a different approach to this problem based on computer algebra, where we establi… ▽ More

    Submitted 31 August, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: Supplementary material at https://wongey.github.io/unimodality

    Journal ref: ISSAC 2023: Proceedings of the 2023 International Symposium on Symbolic and Algebraic Computation, July 2023, Pages 434-442

  37. arXiv:2302.00464  [pdf

    stat.AP physics.ed-ph

    Improving Models for Student Retention and Graduation using Markov Chains

    Authors: Mason N Tedeschi, Tiana M Hose, Emily K Mehlman, Scott Franklin, Tony E Wong

    Abstract: Graduation rates are a key measure of the long-term efficacy of academic interventions. However, challenges to using traditional estimates of graduation rates for underrepresented students include inherently small sample sizes and high data requirements. Here, we show that a Markov model increases confidence and reduces biases in estimated graduation rates for underrepresented minority and first-g… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  38. arXiv:2301.13379  [pdf, other

    cs.CL

    Faithful Chain-of-Thought Reasoning

    Authors: Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-Burch

    Abstract: While Chain-of-Thought (CoT) prompting boosts Language Models' (LM) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (aka. faithfulness). We propose Faithful CoT, a reasoning framework involving two stages: Translation (Natural Language query $\rightarrow$ symbolic reasoning chain) and Problem Solving… ▽ More

    Submitted 20 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: IJCNLP-AACL 2023 camera-ready version

  39. arXiv:2301.02721  [pdf, other

    cond-mat.mtrl-sci

    WS$_2$ Band Gap Renormalization Induced by Tomonaga Luttinger Liquid Formation in Mirror Twin Boundaries

    Authors: Antonio Rossi, John C. Thomas, Johannes T. Küchle, Elyse Barré, Zhuohang Yu, Da Zhou, Shalini Kumari, Hsin-Zon Tsai, Ed Wong, Chris Jozwiak, Aaron Bostwick, Joshua A. Robinson, Mauricio Terrones, Archana Raja, Adam Schwartzberg, D. Frank Ogletree, Jeffrey B. Neaton, Michael F. Crommie, Francesco Allegretti, Willi Auwärter, Eli Rotenberg, Alexander Weber-Bargioni

    Abstract: Tomonaga-Luttinger liquid (TLL) behavior in one-dimensional systems has been predicted and shown to occur at semiconductor-to-metal transitions within two-dimensional materials. Reports of mirror twin boundaries (MTBs) hosting a Fermi liquid or a TLL have suggested a dependence on the underlying substrate, however, unveiling the physical details of electronic contributions from the substrate requi… ▽ More

    Submitted 18 January, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: Main text is 13 pages, 4 figures; Supplementary text is 14 pages, 11 figures

  40. arXiv:2211.16460  [pdf

    physics.ao-ph stat.AP

    Sea Level and Socioeconomic Uncertainty Drives High-End Coastal Adaptation Costs

    Authors: Tony E. Wong, Catherine Ledna, Lisa Rennels, Hannah Sheets, Frank C. Errickson, Delavane Diaz, David Anthoff

    Abstract: Sea-level rise and associated flood hazards pose severe risks to the millions of people globally living in coastal zones. Models representing coastal adaptation and impacts are important tools to inform the design of strategies to manage these risks. Representing the often deep uncertainties influencing these risks poses nontrivial challenges. A common uncertainty characterization approach is to u… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  41. arXiv:2211.08624  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Map** for Single-channel Speech Enhancement

    Authors: Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu

    Abstract: Most speech enhancement (SE) models learn a point estimate and do not make use of uncertainty estimation in the learning process. In this paper, we show that modeling heteroscedastic uncertainty by minimizing a multivariate Gaussian negative log-likelihood (NLL) improves SE performance at no extra cost. During training, our approach augments a model learning complex spectral map** with a tempora… ▽ More

    Submitted 8 March, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 5 pages. Accepted at ICASSP 2023

  42. arXiv:2211.07734  [pdf, other

    physics.app-ph

    Superconducting Niobium Tip Electron Beam Source

    Authors: Cameron W. Johnson, Andreas K. Schmid, Marian Mankos, Robin Röpke, Nicole Kerker, Ing-Shouh Hwang, Ed K. Wong, D. Frank Ogletree, Andrew M. Minor, Alexander Stibor

    Abstract: Modern electron microscopy and spectroscopy is a key technology for studying the structure and composition of quantum and biological materials in fundamental and applied sciences. High-resolution spectroscopic techniques and aberration-corrected microscopes are often limited by the relatively large energy distribution of currently available beam sources. This can be improved by a monochromator, wi… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

  43. arXiv:2209.08422  [pdf

    cs.LG stat.ML

    Computed Decision Weights and a New Learning Algorithm for Neural Classifiers

    Authors: Eugene Wong

    Abstract: In this paper we consider the possibility of computing rather than training the decision layer weights of a neural classifier. Such a possibility arises in two way, from making an appropriate choice of loss function and by solving a problem of constrained optimization. The latter formulation leads to a promising new learning process for pre-decision weights with both simplicity and efficacy.

    Submitted 17 September, 2022; originally announced September 2022.

  44. arXiv:2209.02446  [pdf, other

    cs.CY cs.MM

    Web3 Challenges and Opportunities for the Market

    Authors: Dan Sheridan, James Harris, Frank Wear, Jerry Cowell Jr, Easton Wong, Abbas Yazdinejad

    Abstract: The inability of a computer to think has been a limiter in its usefulness and a point of reassurance for humanity since the first computers were created. The semantic web is the first step toward removing that barrier, enabling computers to operate based on conceptual understanding, and AI and ML are the second. Both semantic knowledge and the ability to learn are fundamental to web3, as are block… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  45. arXiv:2207.07912  [pdf

    q-bio.NC

    A Reservoir Model of Explicit Human Intelligence

    Authors: Eric C. Wong

    Abstract: A fundamental feature of human intelligence is that we accumulate and transfer knowledge as a society and across generations. We describe here a network architecture for the human brain that may support this feature and suggest that two key innovations were the ability to consider an offline model of the world, and the use of language to record and communicate knowledge within this model. We propo… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 8 pages

  46. arXiv:2207.05739  [pdf, other

    cs.LG

    A Data-Based Perspective on Transfer Learning

    Authors: Saachi Jain, Hadi Salman, Alaa Khaddaj, Eric Wong, Sung Min Park, Aleksander Madry

    Abstract: It is commonly believed that in transfer learning including more pre-training data translates into better performance. However, recent evidence suggests that removing data from the source dataset can actually help too. In this work, we take a closer look at the role of the source dataset's composition in transfer learning and present a framework for probing its impact on downstream performance. Ou… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  47. arXiv:2207.02842  [pdf, other

    cs.LG

    When does Bias Transfer in Transfer Learning?

    Authors: Hadi Salman, Saachi Jain, Andrew Ilyas, Logan Engstrom, Eric Wong, Aleksander Madry

    Abstract: Using transfer learning to adapt a pre-trained "source model" to a downstream "target task" can dramatically increase performance with seemingly no downside. In this work, we demonstrate that there can exist a downside after all: bias transfer, or the tendency for biases of the source model to persist even after adapting the model to the target class. Through a combination of synthetic and natural… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

  48. arXiv:2206.07843  [pdf, other

    math.OC

    Recent Developments in Security-Constrained AC Optimal Power Flow: Overview of Challenge 1 in the ARPA-E Grid Optimization Competition

    Authors: Ignacio Aravena, Daniel K. Molzahn, Shixuan Zhang, Cosmin G. Petra, Frank E. Curtis, Shenyinying Tu, Andreas Wächter, Ermin Wei, Elizabeth Wong, Amin Gholami, Kaizhao Sun, Xu Andy Sun, Stephen T. Elbert, Jesse T. Holzer, Arun Veeramany

    Abstract: The optimal power flow problem is central to many tasks in the design and operation of electric power grids. This problem seeks the minimum cost operating point for an electric power grid while satisfying both engineering requirements and physical laws describing how power flows through the electric network. By additionally considering the possibility of component failures and using an accurate AC… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  49. arXiv:2205.14059  [pdf, ps, other

    math.CO math.NT

    Convexity, Squeezing, and the Elekes-Szabó Theorem

    Authors: Oliver Roche-Newton, Elaine Wong

    Abstract: This paper explores the relationship between convexity and sum sets. In particular, we show that elementary number theoretical methods, principally the application of a squeezing principle, can be augmented with the Elekes-Szabó Theorem in order to give new information. Namely, if we let $A \subset \mathbb R$, we prove that there exist $a,a' \in A$ such that \[\left | \frac{(aA+1)^{(2)}(a'A+1)^{(2… ▽ More

    Submitted 14 January, 2024; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: 20 pages, 2 figures

    Journal ref: The Electronic Journal of Combinatorics, Volume 31(1), P1.3 (2024)

  50. arXiv:2205.05767  [pdf, other

    physics.app-ph quant-ph

    Near-monochromatic tuneable cryogenic niobium electron field emitter

    Authors: Cameron W. Johnson, Andreas K. Schmid, Marian Mankos, Robin Röpke, Nicole Kerker, Ed K. Wong, D. Frank Ogletree, Andrew M. Minor, Alexander Stibor

    Abstract: Creating, manipulating, and detecting coherent electrons is at the heart of future quantum microscopy and spectroscopy technologies. Leveraging and specifically altering the quantum features of an electron beam source at low temperatures can enhance its emission properties. Here, we describe electron field emission from a monocrystalline, superconducting niobium nanotip at a temperature of 5.9 K.… ▽ More

    Submitted 6 October, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

    Comments: to be published in Phys. Rev. Lett. (2022)