Showing 1–50 of 171 results for author: Goel, S

  1. arXiv:2407.00075  [pdf, other]

    cs.AI cs.CL cs.CR cs.LG

    Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

    Authors: Anton Xue, Avishree Khare, Rajeev Alur, Surbhi Goel, Eric Wong

    Abstract: We study how to subvert language models from following the rules. We model rule-following as inference in propositional Horn logic, a mathematical system in which rules have the form "if $P$ and $Q$, then $R$" for some propositions $P$, $Q$, and $R$. We prove that although transformers can faithfully abide by such rules, maliciously crafted prompts can nevertheless mislead even theoretically const… ▽ More

    Submitted 21 June, 2024; originally announced July 2024.

  2. arXiv:2406.09000  [pdf, other]

    cs.CR

    A Passwordless MFA Utlizing Biometrics, Proximity and Contactless Communication

    Authors: Sneha Shukla, Gaurav Varshney, Shreya Singh, Swati Goel

    Abstract: Despite being more secure and strongly promoted, two-factor (2FA) or multi-factor (MFA) schemes either fail to protect against recent phishing threats such as real-time MITM, controls/relay MITM, malicious browser extension-based phishing attacks, and/or need the users to purchase and carry other hardware for additional account protection. Leveraging the unprecedented popularity of NFC and BLE-ena… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.02742  [pdf, ps, other]

    cs.DS cs.LG

    Tolerant Algorithms for Learning with Arbitrary Covariate Shift

    Authors: Surbhi Goel, Abhishek Shetty, Konstantinos Stavropoulos, Arsen Vasilyan

    Abstract: We study the problem of learning under arbitrary distribution shift, where the learner is trained on a labeled set from one distribution but evaluated on a different, potentially adversarially generated test distribution. We focus on two frameworks: PQ learning [Goldwasser, A. Kalai, Y. Kalai, Montasser NeurIPS 2020], allowing abstention on adversarially generated parts of the test distribution, a… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  4. arXiv:2406.01895  [pdf, other]

    cs.LG cs.CL stat.ML

    Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks

    Authors: Mahdi Sabbaghi, George Pappas, Hamed Hassani, Surbhi Goel

    Abstract: Despite the success of Transformers on language understanding, code generation, and logical reasoning, they still fail to generalize over length on basic arithmetic tasks such as addition and multiplication. A major reason behind this failure is the vast difference in structure between numbers and text; for example, the numbers are typically parsed from right to left, and there is a correspondence… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 32 pages, 16 figures

  5. arXiv:2405.07331  [pdf, other]

    cs.LG cs.DS stat.ML

    Stochastic Bandits with ReLU Neural Networks

    Authors: Kan Xu, Hamsa Bastani, Surbhi Goel, Osbert Bastani

    Abstract: We study the stochastic bandit problem with ReLU neural network structure. We show that a $\tilde{O}(\sqrt{T})$ regret guarantee is achievable by considering bandits with one-layer ReLU neural networks; to the best of our knowledge, our work is the first to achieve such a guarantee. In this specific setting, we propose an OFU-ReLU algorithm that can achieve this upper bound. The algorithm first ex… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  6. arXiv:2404.05072  [pdf, other]

    cs.CV

    Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind

    Authors: Chiara Plizzari, Shubham Goel, Toby Perrett, Jacob Chalk, Angjoo Kanazawa, Dima Damen

    Abstract: As humans move around, performing their daily tasks, they are able to recall where they have positioned objects in their environment, even if these objects are currently out of sight. In this paper, we aim to mimic this spatial cognition ability. We thus formulate the task of Out of Sight, Not Out of Mind - 3D tracking active objects using observations captured through an egocentric camera. We int… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 21 pages including references and appendix. Project Webpage: http://dimadamen.github.io/OSNOM/

  7. arXiv:2404.04603  [pdf, ps, other]

    cs.HC cs.CY

    Analyzing LLM Usage in an Advanced Computing Class in India

    Authors: Chaitanya Arora, Utkarsh Venaik, Pavit Singh, Sahil Goyal, Jatin Tyagi, Shyama Goel, Ujjwal Singhal, Dhruv Kumar

    Abstract: This paper investigates the usage patterns of undergraduate and graduate students when engaging with large language models (LLMs) to tackle programming assignments in the context of advanced computing courses. Existing work predominantly focuses on the influence of LLMs in introductory programming contexts. Additionally, there is a scarcity of studies analyzing actual conversations between student… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Under review: 12 pages

  8. arXiv:2404.03652  [pdf, other]

    cs.CV

    The More You See in 2D, the More You Perceive in 3D

    Authors: Xinyang Han, Zelin Gao, Angjoo Kanazawa, Shubham Goel, Yossi Gandelsman

    Abstract: Humans can infer 3D structure from 2D images of an object based on past experience and improve their 3D understanding as they see more images. Inspired by this behavior, we introduce SAP3D, a system for 3D reconstruction and novel view synthesis from an arbitrary number of unposed images. Given a few unposed images of an object, we adapt a pre-trained view-conditioned diffusion model together with… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Project page: https://sap3d.github.io/

  9. arXiv:2404.03086  [pdf, other]

    stat.AP cs.CL

    Auditing the Use of Language Models to Guide Hiring Decisions

    Authors: Johann D. Gaebler, Sharad Goel, Aziz Huq, Prasanna Tambe

    Abstract: Regulatory efforts to protect against algorithmic bias have taken on increased urgency with rapid advances in large language models (LLMs), which are machine learning models that can achieve performance rivaling human experts on a wide array of tasks. A key theme of these initiatives is algorithmic "auditing," but current regulations -- as well as the scientific literature -- provide little guidan… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  10. arXiv:2403.03375  [pdf, other]

    cs.LG

    Complexity Matters: Dynamics of Feature Learning in the Presence of Spurious Correlations

    Authors: GuanWen Qiu, Da Kuang, Surbhi Goel

    Abstract: Existing research often posits spurious features as easier to learn than core features in neural network optimization, but the impact of their relative simplicity remains under-explored. Moreover, studies mainly focus on end performance rather than the learning dynamics of feature learning. In this paper, we propose a theoretical framework and an associated synthetic dataset grounded in boolean fu… ▽ More

    Submitted 16 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted to ICML 2024 with the title:"Complexity Matters: Feature Learning in the Presence of Spurious Correlations"

  11. arXiv:2403.03218  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY

    The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Authors: Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer, et al. (32 additional authors not shown)

    Abstract: The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in developing biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are developing evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing furthe… ▽ More

    Submitted 15 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: See the project page at https://wmdp.ai

  12. arXiv:2402.15890  [pdf, ps, other]

    econ.TH cs.GT

    Optimality of weighted contracts for multi-agent contract design with a budget

    Authors: Sumit Goel, Wade Hann-Caruthers

    Abstract: We study a contract design problem between a principal and multiple agents. Each agent participates in an independent task with binary outcomes (success or failure), in which it may exert costly effort towards improving its probability of success, and the principal has a fixed budget which it can use to provide outcome-dependent rewards to the agents. Crucially, we assume the principal cares only… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 26 pages

  13. arXiv:2402.14015  [pdf, other]

    cs.LG cs.AI cs.CR cs.CV

    Corrective Machine Unlearning

    Authors: Shashwat Goel, Ameya Prabhu, Philip Torr, Ponnurangam Kumaraguru, Amartya Sanyal

    Abstract: Machine Learning models increasingly face data integrity challenges due to the use of large-scale training datasets drawn from the internet. We study what model developers can do if they detect that some data was manipulated or incorrect. Such manipulated data can cause adverse effects like vulnerability to backdoored samples, systematic biases, and in general, reduced accuracy on certain input do… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 17 pages, 7 figures

  14. arXiv:2402.11004  [pdf, other]

    cs.LG

    The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains

    Authors: Benjamin L. Edelman, Ezra Edelman, Surbhi Goel, Eran Malach, Nikolaos Tsilivis

    Abstract: Large language models have the ability to generate text that mimics patterns in their inputs. We introduce a simple Markov Chain sequence modeling task in order to study how this in-context learning (ICL) capability emerges. In our setting, each example is sampled from a Markov chain drawn from a prior distribution over Markov chains. Transformers trained on this task form \emph{statistical induct… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  15. arXiv:2402.10494  [pdf, other]

    cs.LO math.LO

    Mechanised uniform interpolation for modal logics K, GL, and iSL

    Authors: Hugo Férée, Iris van der Giessen, Sam van Gool, Ian Shillito

    Abstract: The uniform interpolation property in a given logic can be understood as the definability of propositional quantifiers. We mechanise the computation of these quantifiers and prove correctness in the Coq proof assistant for three modal logics, namely: (1) the modal logic K, for which a pen-and-paper proof exists; (2) Gödel-Löb logic GL, for which our formalisation clarifies an important point in an… ▽ More

    Submitted 29 April, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 18 pages, to appear in IJCAR 2024

    MSC Class: 03B45; 03B20; 03F45; 68V20 ACM Class: F.4.1; I.2.3; I.2.4

  16. arXiv:2401.07321  [pdf, ps, other]

    math.NT

    On the moments of averages of Ramanujan sums

    Authors: Shivani Goel, M. Ram Murty

    Abstract: Chan and Kumchev studied averages of the first and second moments of Ramanujan sums. In this article, we extend this investigation by estimating the higher moments of averages of Ramanujan sums using the Brèteche Tauberian theorem. We also give a result for the moments of averages of Cohen-Ramanujan sums.

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 11 Pages

    MSC Class: 11A05; 11L03; 11N37

  17. arXiv:2401.05052  [pdf, ps, other]

    math.NT

    Moments of Averages of Ramanujan Sums over Number Fields

    Authors: Sneha Chaubey, Shivani Goel

    Abstract: Assuming the generalized Lindelöf hypothesis, we provide asymptotic formulas for the mean values of the first and second moments of Ramanujan sums over any number field. Additionally, unconditionally, we estimate the second moment of Ramanujan sums over cyclotomic number fields.

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 15 pages. arXiv admin note: text overlap with arXiv:2109.09398

    MSC Class: 11M06; 11R18; 11N37

  18. arXiv:2401.03546  [pdf, other]

    cs.AI

    NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds

    Authors: Shivam Goel, Yichen Wei, Panagiotis Lymperopoulos, Klara Chura, Matthias Scheutz, Jivko Sinapov

    Abstract: As AI agents leave the lab and venture into the real world as autonomous vehicles, delivery robots, and cooking robots, it is increasingly necessary to design and comprehensively evaluate algorithms that tackle the "open-world". To this end, we introduce NovelGym, a flexible and adaptable ecosystem designed to simulate gridworld environments, serving as a robust platform for benchmarking reinfor… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: Accepted at AAMAS-2024

  19. arXiv:2312.05745  [pdf, other]

    cs.CV cs.AI

    Open World Object Detection in the Era of Foundation Models

    Authors: Orr Zohar, Alejandro Lozano, Shelly Goel, Serena Yeung, Kuan-Chieh Wang

    Abstract: Object detection is integral to a bevy of real-world applications, from robotics to medical image analysis. To be used reliably in such applications, models must be capable of handling unexpected - or novel - objects. The open world object detection (OWD) paradigm addresses this challenge by enabling models to detect unknown objects and learn discovered ones incrementally. However, OWD method deve… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  20. arXiv:2310.08836  [pdf, other]

    cs.RO cs.AI cs.LG

    A Framework for Few-Shot Policy Transfer through Observation Mapping and Behavior Cloning

    Authors: Yash Shukla, Bharat Kesari, Shivam Goel, Robert Wright, Jivko Sinapov

    Abstract: Despite recent progress in Reinforcement Learning for robotics applications, many tasks remain prohibitively difficult to solve because of the expensive interaction cost. Transfer learning helps reduce the training time in the target domain by transferring knowledge learned in a source domain. Sim2Real transfer helps transfer knowledge from a simulated robotic domain to a physical target domain. K… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Paper accepted to the IROS 2023 Conference

  21. arXiv:2310.01405  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.CY

    Representation Engineering: A Top-Down Approach to AI Transparency

    Authors: Andy Zou, Long Phan, Sarah Chen, James Campbell, Phillip Guo, Richard Ren, Alexander Pan, Xuwang Yin, Mantas Mazeika, Ann-Kathrin Dombrowski, Shashwat Goel, Nathaniel Li, Michael J. Byun, Zifan Wang, Alex Mallen, Steven Basart, Sanmi Koyejo, Dawn Song, Matt Fredrikson, J. Zico Kolter, Dan Hendrycks

    Abstract: In this paper, we identify and characterize the emerging area of representation engineering (RepE), an approach to enhancing the transparency of AI systems that draws on insights from cognitive neuroscience. RepE places population-level representations, rather than neurons or circuits, at the center of analysis, equipping us with novel methods for monitoring and manipulating high-level cognitive p… ▽ More

    Submitted 10 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Code is available at https://github.com/andyzoujm/representation-engineering

  22. arXiv:2309.03800  [pdf, other]

    cs.LG cs.AI stat.ML

    Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

    Authors: Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang

    Abstract: In modern deep learning, algorithmic choices (such as width, depth, and learning rate) are known to modulate nuanced resource tradeoffs. This work investigates how these complexities necessarily arise for feature learning in the presence of computational-statistical gaps. We begin by considering offline sparse parity learning, a supervised classification problem which admits a statistical query lo… ▽ More

    Submitted 30 October, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: v2: NeurIPS 2023 camera-ready updates

  23. arXiv:2308.11462  [pdf, other]

    cs.CL cs.AI cs.CY

    LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

    Authors: Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia, et al. (15 additional authors not shown)

    Abstract: The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisc… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 143 pages, 79 tables, 4 figures

  24. arXiv:2308.04449  [pdf, other]

    cs.CY

    The Disparate Impacts of College Admissions Policies on Asian American Applicants

    Authors: Joshua Grossman, Sabina Tomkins, Lindsay Page, Sharad Goel

    Abstract: There is debate over whether Asian American students are admitted to selective colleges and universities at lower rates than white students with similar academic qualifications. However, there have been few empirical investigations of this issue, in large part due to a dearth of data. Here we present the results from analyzing 685,709 applications from Asian American and white students to a subset… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 14 pages, 4 figures, 1 table + appendix: 42 pages, 5 figures, 21 tables

  25. arXiv:2306.14858  [pdf, other]

    cs.GT cs.LG

    Proportional Aggregation of Preferences for Sequential Decision Making

    Authors: Nikhil Chandak, Shashwat Goel, Dominik Peters

    Abstract: We study the problem of fair sequential decision making given voter preferences. In each round, a decision rule must choose a decision from a set of alternatives where each voter reports which of these alternatives they approve. Instead of going with the most popular choice in each round, we aim for proportional representation. We formalize this aim using axioms based on Proportional Justified Rep… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 35 pages

  26. arXiv:2306.13119  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Adversarial Resilience in Sequential Prediction via Abstention

    Authors: Surbhi Goel, Steve Hanneke, Shay Moran, Abhishek Shetty

    Abstract: We study the problem of sequential prediction in the stochastic setting with an adversary that is allowed to inject clean-label adversarial (or out-of-distribution) examples. Algorithms designed to handle purely stochastic data tend to fail in the presence of such adversarial examples, often leading to erroneous predictions. This is undesirable in many high-stakes applications such as medical reco… ▽ More

    Submitted 24 January, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

  27. arXiv:2306.12389  [pdf, other]

    stat.AP

    Automated Reminders Reduce Incarceration for Missed Court Dates: Evidence from a Text Message Experiment

    Authors: Alex Chohlas-Wood, Madison Coots, Joe Nudell, Julian Nyarko, Emma Brunskill, Todd Rogers, Sharad Goel

    Abstract: Millions of Americans must attend mandatory court dates every year. To boost appearance rates, jurisdictions nationwide are increasingly turning to automated reminders, but previous research offers mixed evidence on their effectiveness. In partnership with the Santa Clara County Public Defender Office, we randomly assigned 5,709 public defender clients to either receive automated text message remi… ▽ More

    Submitted 22 March, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

  28. arXiv:2306.10220  [pdf, other]

    stat.AP

    Reevaluating the Role of Race and Ethnicity in Diabetes Screening

    Authors: Madison Coots, Soroush Saghafian, David Kent, Sharad Goel

    Abstract: There is active debate over whether to consider patient race and ethnicity when estimating disease risk. By accounting for race and ethnicity, it is possible to improve the accuracy of risk predictions, but there is concern that their use may encourage a racialized view of medicine. In diabetes risk models, despite substantial gains in statistical accuracy from using race and ethnicity, the gains… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 11 pages, 4 figures

  29. arXiv:2306.00946  [pdf, other]

    cs.LG cs.CL

    Exposing Attention Glitches with Flip-Flop Language Modeling

    Authors: Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang

    Abstract: Why do large language models sometimes output factual inaccuracies and exhibit erroneous reasoning? The brittleness of these models, particularly when executing long chains of reasoning, currently seems to be an inevitable price to pay for their advanced capabilities of coherently synthesizing knowledge, pragmatics, and abstract thought. Towards making sense of this fundamentally unsolved problem,… ▽ More

    Submitted 30 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: v2: NeurIPS 2023 camera-ready + data release

  30. arXiv:2305.20091  [pdf, other]

    cs.CV

    Humans in 4D: Reconstructing and Tracking Humans with Transformers

    Authors: Shubham Goel, Georgios Pavlakos, Jathushan Rajasegaran, Angjoo Kanazawa, Jitendra Malik

    Abstract: We present an approach to reconstruct humans and track them over time. At the core of our approach, we propose a fully "transformerized" version of a network for human mesh recovery. This network, HMR 2.0, advances the state of the art and shows the capability to analyze unusual poses that have in the past been difficult to reconstruct from single images. To analyze video, we use 3D reconstruction… ▽ More

    Submitted 31 August, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: In ICCV 2023. Project Webpage: https://shubham-goel.github.io/4dhumans/

  31. arXiv:2305.12638  [pdf, other]

    cs.CY stat.AP

    Risk Scores, Label Bias, and Everything but the Kitchen Sink

    Authors: Michael Zanger-Tishler, Julian Nyarko, Sharad Goel

    Abstract: In designing risk assessment algorithms, many scholars promote a "kitchen sink" approach, reasoning that more information yields more accurate predictions. We show, however, that this rationale often fails when algorithms are trained to predict a proxy of the true outcome, as is typically the case. With such "label bias", one should exclude a feature if its correlation with the proxy and its corre… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 19 pages, 4 figures

  32. arXiv:2305.10067  [pdf, ps, other]

    math.NT

    Pair correlation of real-valued vector sequences

    Authors: Sneha Chaubey, Shivani Goel

    Abstract: In this article, we investigate the fine-scale statistics of real-valued arithmetic sequences. In particular, we focus on real-valued vector sequences and show the Poissonian behavior of the pair correlation function for certain classes of such sequences, thereby extending previous works of Boca et al. and the first author on local statistics of integer-valued and rational-valued vector sequences.

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 18 pages

    MSC Class: 11K06; 11J83; 11K99

  33. Optimal tie-breaking rules

    Authors: Sumit Goel, Amit Goyal

    Abstract: We consider two-player contests with the possibility of ties and study the effect of different tie-breaking rules on effort. For ratio-form and difference-form contests that admit pure-strategy Nash equilibrium, we find that the effort of both players is monotone decreasing in the probability that ties are broken in favor of the stronger player. Thus, the effort-maximizing tie-breaking rule commit… ▽ More

    Submitted 30 August, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 25 pages

    Journal ref: Journal of Mathematical Economics (2023)

  34. arXiv:2304.10524  [pdf, other]

    cs.LG cs.DS stat.ML

    Learning Narrow One-Hidden-Layer ReLU Networks

    Authors: Sitan Chen, Zehao Dou, Surbhi Goel, Adam R Klivans, Raghu Meka

    Abstract: We consider the well-studied problem of learning a linear combination of $k$ ReLU activations with respect to a Gaussian distribution on inputs in $d$ dimensions. We give the first polynomial-time algorithm that succeeds whenever $k$ is a constant. All prior polynomial-time learners require additional assumptions on the network, such as positive combining coefficients or the matrix of hidden weigh… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 33 pages, comments welcome

  35. arXiv:2304.08530  [pdf, other]

    cs.CY

    Popular Support for Balancing Equity and Efficiency in Resource Allocation: A Case Study in Online Advertising to Increase Welfare Program Awareness

    Authors: Allison Koenecke, Eric Giannella, Robb Willer, Sharad Goel

    Abstract: Algorithmically optimizing the provision of limited resources is commonplace across domains from healthcare to lending. Optimization can lead to efficient resource allocation, but, if deployed without additional scrutiny, can also exacerbate inequality. Little is known about popular preferences regarding acceptable efficiency-equity trade-offs, making it difficult to design algorithms that are res… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: This paper will be presented at the 2023 International Conference on Web and Social Media (ICWSM'23)

  36. arXiv:2304.05286  [pdf, other]

    quant-ph cond-mat.other cond-mat.str-el physics.optics

    Unveiling the non-Abelian statistics of $D(S_3)$ anyons via photonic simulation

    Authors: Suraj Goel, Matthew Reynolds, Matthew Girling, Will McCutcheon, Saroch Leedumrongwatthanakun, Vatshal Srivastav, David Jennings, Mehul Malik, Jiannis K. Pachos

    Abstract: Simulators can realise novel phenomena by separating them from the complexities of a full physical implementation. Here we put forward a scheme that can simulate the exotic statistics of $D(S_3)$ non-Abelian anyons with minimal resources. The qudit lattice representation of this planar code supports local encoding of $D(S_3)$ anyons. As a proof-of-principle demonstration we employ a photonic simul… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  37. arXiv:2304.04288  [pdf, ps, other]

    math.CO

    Distance matrix of enhanced power graphs of finite groups

    Authors: Anita Arora, Hiranya Kishore Dey, Shivani Goel

    Abstract: The enhanced power graph of a group $G$ is the graph $\mathcal{G}_E(G)$ with vertex set $G$ and edge set $ \{(u,v): u, v \in \langle w \rangle,~\mbox{for some}~ w \in G\}$. In this paper, we compute the spectrum of the distance matrix of the enhanced power graph of non-abelian groups of order $pq$, dihedral groups, dicyclic groups, elementary abelian groups $\mathrm{El}(p^n)$ and the non-cyclic ab… ▽ More

    Submitted 21 June, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    MSC Class: 05C50; 05C25; 20K01

  38. arXiv:2303.16041  [pdf, other]

    physics.optics physics.comp-ph

    Referenceless characterisation of complex media using physics-informed neural networks

    Authors: Suraj Goel, Claudio Conti, Saroch Leedumrongwatthanakun, Mehul Malik

    Abstract: In this work, we present a method to characterise the transmission matrices of complex scattering media using a physics-informed, multi-plane neural network (MPNN) without the requirement of a known optical reference field. We use this method to accurately measure the transmission matrix of a commercial multi-mode fiber without the problems of output-phase ambiguity and dark spots, leading to up to… ▽ More

    Submitted 26 September, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Journal ref: Optics Express 31, 32824-32839 (2023)

  39. arXiv:2303.03139  [pdf, other]

    cs.AI cs.CY

    Low impact agency: review and discussion

    Authors: Danilo Naiff, Shashwat Goel

    Abstract: Powerful artificial intelligence poses an existential threat if the AI decides to drastically change the world in pursuit of its goals. The hope of low-impact artificial intelligence is to incentivize AI to not do that just because this causes a large impact in the world. In this work, we first review the concept of low-impact agency and previous proposals to approach the problem, and then propose… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Work done as part of the SERIMATS 3.0 training program

  40. arXiv:2302.09157  [pdf, other]

    cs.LG cs.CY

    Designing Equitable Algorithms

    Authors: Alex Chohlas-Wood, Madison Coots, Sharad Goel, Julian Nyarko

    Abstract: Predictive algorithms are now used to help distribute a large share of our society's resources and sanctions, such as healthcare, loans, criminal detentions, and tax audits. Under the right circumstances, these algorithms can improve the efficiency and equity of decision-making. At the same time, there is a danger that the algorithms themselves could entrench and exacerbate disparities, particular… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  41. Deciding Equations in the Time Warp Algebra

    Authors: Sam van Gool, Adrien Guatto, George Metcalfe, Simon Santschi

    Abstract: Join-preserving maps on the discrete time scale $ω^+$, referred to as time warps, have been proposed as graded modalities that can be used to quantify the growth of information in the course of program execution. The set of time warps forms a simple distributive involutive residuated lattice -- called the time warp algebra -- that is equipped with residual operations relevant to potential applicat… ▽ More

    Submitted 25 January, 2024; v1 submitted 15 January, 2023; originally announced February 2023.

    Journal ref: Logical Methods in Computer Science (January 26, 2024) lmcs:10937

  42. Profinite lambda-terms and parametricity

    Authors: Sam van Gool, Paul-André Melliès, Vincent Moreau

    Abstract: Combining ideas coming from Stone duality and Reynolds parametricity, we formulate in a clean and principled way a notion of profinite lambda-term which, we show, generalizes at every type the traditional notion of profinite word coming from automata theory. We start by defining the Stone space of profinite lambda-terms as a projective limit of finite sets of usual lambda-terms, considered modulo… ▽ More

    Submitted 18 November, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: For the proceedings of MFPS2023

    Journal ref: Electronic Notes in Theoretical Informatics and Computer Science, Volume 3 - Proceedings of MFPS XXXIX (November 23, 2023) entics:12280

  43. arXiv:2301.04205  [pdf, other]

    cs.LO

    A Performance Verification Methodology for Resource Allocation Heuristics

    Authors: Saksham Goel, Benjamin Mikek, Jehad Aly, Venkat Arun, Ahmed Saeed, Aditya Akella

    Abstract: Performance verification is a nascent but promising tool for understanding the performance and limitations of heuristics under realistic assumptions. Bespoke performance verification tools have already demonstrated their value in settings like congestion control and packet scheduling. In this paper, we aim to emphasize the broad applicability and utility of performance verification. To that end, w…

    Submitted 28 February, 2024; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: 12 pages, 11 figures

  44. arXiv:2211.16309  [pdf, other]

    cs.RO cs.LG stat.AP

    A Contextual Bandit Approach for Learning to Plan in Environments with Probabilistic Goal Configurations

    Authors: Sohan Rudra, Saksham Goel, Anirban Santara, Claudio Gentile, Laurent Perron, Fei Xia, Vikas Sindhwani, Carolina Parada, Gaurav Aggarwal

    Abstract: Object-goal navigation (Object-nav) entails searching, recognizing and navigating to a target object. Object-nav has been extensively studied by the Embodied-AI community, but most solutions are often restricted to considering static objects (e.g., television, fridge, etc.). We propose a modular framework for object-nav that is able to efficiently search indoor environments for not just static obj…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Shorter version accepted at NeurIPS 2022 Workshop on Robot Learning: Trustworthy Robotics

  45. arXiv:2211.05369  [pdf, other]

    cs.CY

    Decomposing the Fundamentals of Creepy Stories

    Authors: Sakshi Goel, Haripriya Dharmala, Yuchen Zhang, Keith Burghardt

    Abstract: Fear is a universal concept; people crave it in urban legends, scary movies, and modern stories. Open questions remain, however, about why these stories are scary and more generally what scares people. In this study, we explore these questions by analyzing tens of thousands of scary stories on forums (known as subreddits) in a social media website, Reddit. We first explore how writing styles have…

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 9 pages, 4 figures

  46. arXiv:2210.10749  [pdf, other]

    cs.LG cs.FL stat.ML

    Transformers Learn Shortcuts to Automata

    Authors: Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang

    Abstract: Algorithmic reasoning requires capabilities which are most naturally understood through recurrent models of computation, like the Turing machine. However, Transformer models, while lacking recurrence, are able to perform such reasoning using far fewer layers than the number of reasoning steps. This raises the question: what solutions are learned by these shallow and non-recurrent models? We find t…

    Submitted 2 May, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  47. On duality and model theory for polyadic spaces

    Authors: Sam van Gool, Jérémie Marquès

    Abstract: This paper is a study of first-order coherent logic from the point of view of duality and categorical logic. We prove a duality theorem between coherent hyperdoctrines and open polyadic Priestley spaces, which we subsequently apply to prove completeness, omitting types, and Craig interpolation theorems for coherent or intuitionistic logic. Our approach emphasizes the role of interpolation and open…

    Submitted 31 October, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 45 pages

    MSC Class: 03C07 (Primary); 06D50; 18C10; 18A15 (Secondary)

  48. arXiv:2209.00735  [pdf, other]

    cs.LG stat.ML

    Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms

    Authors: Surbhi Goel, Sham Kakade, Adam Tauman Kalai, Cyril Zhang

    Abstract: Neural networks (NNs) struggle to efficiently solve certain problems, such as learning parities, even when there are simple learning algorithms for those problems. Can NNs discover learning algorithms on their own? We exhibit a NN architecture that, in polynomial time, learns as well as any efficient learning algorithm describable by a constant-sized program. For example, on parity problems, the N…

    Submitted 15 January, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: v2: final camera-ready revisions for NeurIPS 2022

  49. arXiv:2207.08799  [pdf, other]

    cs.LG cs.NE math.OC stat.ML

    Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit

    Authors: Boaz Barak, Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang

    Abstract: There is mounting evidence of emergent phenomena in the capabilities of deep learning methods as we scale up datasets, model sizes, and training times. While there are some accounts of how these resources modulate statistical capacity, far less is known about their effect on the computational problem of model training. This work conducts such an exploration through the lens of learning a $k$-spars…

    Submitted 15 January, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: v3: final camera-ready revisions for NeurIPS 2022

  50. arXiv:2207.05302  [pdf, other]

    cs.LG cs.AI cs.CY

    Causal Conceptions of Fairness and their Consequences

    Authors: Hamed Nilforoshan, Johann Gaebler, Ravi Shroff, Sharad Goel

    Abstract: Recent work highlights the role of causality in designing equitable decision-making algorithms. It is not immediately clear, however, how existing causal conceptions of fairness relate to one another, or what the consequences are of using these definitions as design principles. Here, we first assemble and categorize popular causal definitions of algorithmic fairness into two broad families: (1) th…

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: ICML 2022. arXiv admin note: substantial text overlap with arXiv:1808.00023

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022