Skip to main content

Showing 1–50 of 202 results for author: Cohen, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20838  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    einspace: Searching for Neural Architectures from Fundamental Operations

    Authors: Linus Ericsson, Miguel Espinosa, Chenhongyi Yang, Antreas Antoniou, Amos Storkey, Shay B. Cohen, Steven McDonagh, Elliot J. Crowley

    Abstract: Neural architecture search (NAS) finds high performing networks for a given task. Yet the results of NAS are fairly prosaic; they did not e.g. create a shift from convolutional structures to transformers. This is not least because the search spaces in NAS often aren't diverse enough to include such transformations a priori. Instead, for NAS to provide greater potential for fundamental design shift… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Project page at https://linusericsson.github.io/einspace/

  2. arXiv:2405.09719  [pdf, other

    cs.CL cs.AI cs.LG

    Spectral Editing of Activations for Large Language Model Alignment

    Authors: Yifu Qiu, Zheng Zhao, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

    Abstract: Large language models (LLMs) often exhibit undesirable behaviours, such as generating untruthful or biased content. Editing their internal representations has been shown to be effective in mitigating such behaviours on top of the existing alignment methods. We propose a novel inference-time editing method, namely spectral editing of activations (SEA), to project the input representations into dire… ▽ More

    Submitted 25 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  3. arXiv:2404.16123  [pdf, other

    cs.CV cs.AI cs.CL

    FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication

    Authors: Eric Slyman, Stefan Lee, Scott Cohen, Kushal Kafle

    Abstract: Recent dataset deduplication techniques have demonstrated that content-aware dataset pruning can dramatically reduce the cost of training Vision-Language Pretrained (VLP) models without significant performance losses compared to training on the original dataset. These results have been based on pruning commonly used image-caption datasets collected from the web -- datasets that are known to harbor… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Conference paper at CVPR 2024. 6 pages, 8 figures. Project Page: https://ericslyman.com/fairdedup/

    ACM Class: I.4.10; I.2.7; E.0

  4. arXiv:2404.14715  [pdf, other

    cs.CV cs.CL

    FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

    Authors: Hang Hua, **g Shi, Kushal Kafle, Simon Jenni, Daoan Zhang, John Collomosse, Scott Cohen, Jiebo Luo

    Abstract: Recent progress in large-scale pre-training has led to the development of advanced vision-language models (VLMs) with remarkable proficiency in comprehending and generating multimodal content. Despite the impressive ability to perform complex reasoning for VLMs, current models often struggle to effectively and precisely capture the compositional information on both the image and text sides. To add… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  5. arXiv:2403.13312  [pdf, other

    cs.CL

    LeanReasoner: Boosting Complex Logical Reasoning with Lean

    Authors: Dongwei Jiang, Marcio Fonseca, Shay B. Cohen

    Abstract: Large language models (LLMs) often struggle with complex logical reasoning due to logical inconsistencies and the inherent difficulty of such reasoning. We use Lean, a theorem proving framework, to address these challenges. By formalizing logical reasoning problems into theorems within Lean, we can solve them by proving or disproving the corresponding theorems. This method reduces the risk of logi… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024 main conference

  6. arXiv:2403.10701  [pdf, other

    cs.CV

    IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

    Authors: Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, He Zhang, Wei Xiong, Daniel Aliaga

    Abstract: Generative object compositing emerges as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting practical usage of most existing methods. In response, this paper introduces IMPRINT, a novel diffusion-based generative model trained with a two-stage learning framework that decouples learning of identity… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  7. arXiv:2403.08828  [pdf, other

    cs.HC cs.AI cs.RO

    People Attribute Purpose to Autonomous Vehicles When Explaining Their Behavior

    Authors: Balint Gyevnar, Stephanie Droop, Tadeg Quillien, Shay B. Cohen, Neil R. Bramley, Christopher G. Lucas, Stefano V. Albrecht

    Abstract: Cognitive science can help us understand which explanations people might expect, and in which format they frame these explanations, whether causal, counterfactual, or teleological (i.e., purpose-oriented). Understanding the relevance of these concepts is crucial for building good explainable AI (XAI) which offers recourse and actionability. Focusing on autonomous driving, a complex decision-making… ▽ More

    Submitted 30 April, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  8. arXiv:2403.02817  [pdf, other

    cs.CR

    Here Comes The AI Worm: Unleashing Zero-click Worms that Target GenAI-Powered Applications

    Authors: Stav Cohen, Ron Bitton, Ben Nassi

    Abstract: In the past year, numerous companies have incorporated Generative AI (GenAI) capabilities into new and existing applications, forming interconnected Generative AI (GenAI) ecosystems consisting of semi/fully autonomous agents powered by GenAI services. While ongoing research highlighted risks associated with the GenAI layer of agents (e.g., dialog poisoning, membership inference, prompt leaking, ja… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Website: https://sites.google.com/view/compromptmized

  9. arXiv:2402.17783  [pdf, other

    eess.SP cs.AI cs.LG

    BagStacking: An Integrated Ensemble Learning Approach for Freezing of Gait Detection in Parkinson's Disease

    Authors: Seffi Cohen, Lior Rokach

    Abstract: This paper introduces BagStacking, a novel ensemble learning method designed to enhance the detection of Freezing of Gait (FOG) in Parkinson's Disease (PD) by using a lower-back sensor to track acceleration. Building on the principles of bagging and stacking, BagStacking aims to achieve the variance reduction benefit of bagging's bootstrap sampling while also learning sophisticated blending throug… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  10. arXiv:2402.15055  [pdf, other

    cs.CL cs.AI cs.LG

    Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions

    Authors: Clement Neo, Shay B. Cohen, Fazl Barez

    Abstract: In this paper, we investigate the interplay between attention heads and specialized "next-token" neurons in the Multilayer Perceptron that predict specific tokens. By prompting an LLM like GPT-4 to explain these model internals, we can elucidate attention mechanisms that activate certain next-token neurons. Our analysis identifies attention heads that recognize contexts relevant to predicting a pa… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 15 pages, 11 figures

  11. arXiv:2402.10643  [pdf, other

    cs.CL cs.AI

    `Keep it Together': Enforcing Cohesion in Extractive Summaries by Simulating Human Memory

    Authors: Ronald Cardenas, Matthias Galle, Shay B. Cohen

    Abstract: Extractive summaries are usually presented as lists of sentences with no expected cohesion between them. In this paper, we aim to enforce cohesion whilst controlling for informativeness and redundancy in summaries, in cases where the input exhibits high redundancy. The pipeline controls for redundancy in long inputs as it is consumed, and balances informativeness and cohesion during sentence selec… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  12. arXiv:2402.07025  [pdf, other

    stat.ML cs.IT cs.LG

    Generalization Error of Graph Neural Networks in the Mean-field Regime

    Authors: Gholamali Aminian, Yixuan He, Gesine Reinert, Łukasz Szpruch, Samuel N. Cohen

    Abstract: This work provides a theoretical framework for assessing the generalization error of graph neural networks in the over-parameterized regime, where the number of parameters surpasses the quantity of data points. We explore two widely utilized types of graph neural networks: graph convolutional neural networks and message passing graph neural networks. Prior to this study, existing bounds on the gen… ▽ More

    Submitted 1 July, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted in ICML 2024

  13. arXiv:2402.05534  [pdf, other

    cs.SI cs.DS

    Robust Parameter Fitting to Realistic Network Models via Iterative Stochastic Approximation

    Authors: Thomas Bläsius, Sarel Cohen, Philipp Fischbeck, Tobias Friedrich, Martin S. Krejca

    Abstract: Random graph models are widely used to understand network properties and graph algorithms. Key to such analyses are the different parameters of each model, which affect various network features, such as its size, clustering, or degree distribution. The exact effect of the parameters on these features is not well understood, mainly because we lack tools to thoroughly investigate this relation. More… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  14. arXiv:2401.10415  [pdf, other

    cs.CL cs.AI

    Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?

    Authors: Marcio Fonseca, Shay B. Cohen

    Abstract: In this work, we investigate the controllability of large language models (LLMs) on scientific summarization tasks. We identify key stylistic and content coverage factors that characterize different types of summaries such as paper reviews, abstracts, and lay summaries. By controlling stylistic features, we find that non-fine-tuned LLMs outperform humans in the MuP review generation task, both in… ▽ More

    Submitted 27 June, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: ACL 2024 camera ready

  15. arXiv:2401.01814  [pdf, other

    cs.AI

    Large Language Models Relearn Removed Concepts

    Authors: Michelle Lo, Shay B. Cohen, Fazl Barez

    Abstract: Advances in model editing through neuron pruning hold promise for removing undesirable concepts from large language models. However, it remains unclear whether models have the capacity to reacquire pruned concepts after editing. To investigate this, we evaluate concept relearning in models by tracking concept saliency and similarity in pruned neurons during retraining. Our findings reveal that mod… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  16. arXiv:2312.11476  [pdf

    physics.geo-ph cs.LG

    The geometry of flow: Advancing predictions of river geometry with multi-model machine learning

    Authors: Shuyu Y Chang, Zahra Ghahremani, Laura Manuel, Mohammad Erfani, Chaopeng Shen, Sagy Cohen, Kimberly Van Meter, Jennifer L Pierce, Ehab A Meselhe, Erfan Goharian

    Abstract: Hydraulic geometry parameters describing river hydrogeomorphic is important for flood forecasting. Although well-established, power-law hydraulic geometry curves have been widely used to understand riverine systems and map** flooding inundation worldwide for the past 70 years, we have become increasingly aware of the limitations of these approaches. In the present study, we have moved beyond the… ▽ More

    Submitted 27 November, 2023; originally announced December 2023.

    Comments: 30 pages, 10 figures

  17. arXiv:2312.03480  [pdf, other

    cs.CL

    AMR Parsing is Far from Solved: GrAPES, the Granular AMR Parsing Evaluation Suite

    Authors: Jonas Groschwitz, Shay B. Cohen, Lucia Donatelli, Meaghan Fowlie

    Abstract: We present the Granular AMR Parsing Evaluation Suite (GrAPES), a challenge set for Abstract Meaning Representation (AMR) parsing with accompanying evaluation metrics. AMR parsers now obtain high scores on the standard AMR evaluation metric Smatch, close to or even above reported inter-annotator agreement. But that does not mean that AMR parsing is solved; in fact, human evaluation in previous work… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted at EMNLP 2023. For the associated GitHub repository, see https://github.com/jgroschwitz/GrAPES

    ACM Class: J.5

  18. arXiv:2312.02156  [pdf, other

    cs.CV cs.AI

    Latent Feature-Guided Diffusion Models for Shadow Removal

    Authors: Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, Vishal M. Patel

    Abstract: Recovering textures under shadows has remained a challenging problem due to the difficulty of inferring shadow-free scenes from shadow images. In this paper, we propose the use of diffusion models as they offer a promising approach to gradually refine the details of shadow regions during the diffusion process. Our method improves this process by conditioning on a learned latent feature space that… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: project page see https://kfmei.page/shadow-diffusion/index.html

  19. arXiv:2311.12901  [pdf, other

    q-bio.QM cs.LG

    From Microbes to Methane: AI-Based Predictive Modeling of Feed Additive Efficacy in Dairy Cows

    Authors: Yaniv Altshuler, Tzruya Calvao Chebach, Shalom Cohen

    Abstract: In an era of increasing pressure to achieve sustainable agriculture, the optimization of livestock feed for enhancing yield and minimizing environmental impact is a paramount objective. This study presents a pioneering approach towards this goal, using rumen microbiome data to predict the efficacy of feed additives in dairy cattle. We collected an extensive dataset that includes methane emission… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 51 pages, 24 figures, 11 tables, 93 references

  20. arXiv:2311.09467  [pdf, other

    cs.CL cs.AI

    Think While You Write: Hypothesis Verification Promotes Faithful Knowledge-to-Text Generation

    Authors: Yifu Qiu, Varun Embar, Shay B. Cohen, Benjamin Han

    Abstract: Knowledge-to-text generators often struggle to faithfully generate descriptions for the input facts: they may produce hallucinations that contradict the input, or describe facts not present in the input. To reduce hallucinations, we propose a decoding-only method, TWEAK (Think While Effectively Articulating Knowledge), which can be integrated with any generator without retraining. TWEAK treats the… ▽ More

    Submitted 3 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024 (Findings)

  21. arXiv:2311.08704  [pdf, other

    cs.CL cs.AI

    Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains

    Authors: Marcio Fonseca, Shay B. Cohen

    Abstract: Although large language models (LLMs) exhibit remarkable capacity to leverage in-context demonstrations, it is still unclear to what extent they can learn new concepts or facts from ground-truth labels. To address this question, we examine the capacity of instruction-tuned LLMs to follow in-context concept guidelines for sentence labeling tasks. We design guidelines that present different types of… ▽ More

    Submitted 26 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: ACL 2024 camera ready

  22. arXiv:2311.08398  [pdf, other

    cs.CL cs.AI

    Are Large Language Models Temporally Grounded?

    Authors: Yifu Qiu, Zheng Zhao, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

    Abstract: Are Large language models (LLMs) temporally grounded? Since LLMs cannot perceive and interact with the environment, it is impossible to answer this question directly. Instead, we provide LLMs with textual narratives and probe them with respect to their common-sense knowledge of the structure and duration of events, their ability to order events along a timeline, and self-consistency within their t… ▽ More

    Submitted 16 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

  23. arXiv:2310.15513  [pdf, other

    cs.CL

    A Joint Matrix Factorization Analysis of Multilingual Representations

    Authors: Zheng Zhao, Yftah Ziser, Bonnie Webber, Shay B. Cohen

    Abstract: We present an analysis tool based on joint matrix factorization for comparing latent representations of multilingual and monolingual models. An alternative to probing, this tool allows us to analyze multiple sets of representations in a joint manner. Using this tool, we study to what extent and how morphosyntactic features are reflected in the representations learned by multilingual pre-trained mo… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted to Findings of EMNLP 2023

  24. arXiv:2308.12910  [pdf, other

    cs.CV

    SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data

    Authors: Ziyan Yang, Kushal Kafle, Zhe Lin, Scott Cohen, Zhihong Ding, Vicente Ordonez

    Abstract: We propose Subject-Conditional Relation Detection SCoRD, where conditioned on an input subject, the goal is to predict all its relations to other objects in a scene along with their locations. Based on the Open Images dataset, we propose a challenging OIv6-SCoRD benchmark such that the training and testing splits have a distribution shift in terms of the occurrence statistics of $\langle$subject,… ▽ More

    Submitted 4 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: WACV 2024

  25. arXiv:2308.02927  [pdf, other

    cs.DC

    Subquadratic Multivalued Asynchronous Byzantine Agreement WHP

    Authors: Shir Cohen, Idit Keidar

    Abstract: There have been several reductions from multivalued consensus to binary consensus over the past 20 years. To the best of our knowledge, none of them solved it for Byzantine asynchronous settings. In this paper, we close this gap. Moreover, we do so in subquadratic communication, using newly developed subquadratic binary Byzantine Agreement techniques.

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2002.06545

  26. arXiv:2308.01999  [pdf, other

    quant-ph cs.PF cs.SE

    cuQuantum SDK: A High-Performance Library for Accelerating Quantum Science

    Authors: Harun Bayraktar, Ali Charara, David Clark, Saul Cohen, Timothy Costa, Yao-Lung L. Fang, Yang Gao, Jack Guan, John Gunnels, Azzam Haidar, Andreas Hehn, Markus Hohnerbach, Matthew Jones, Tom Lubowe, Dmitry Lyakh, Shinya Morino, Paul Springer, Sam Stanwyck, Igor Terentyev, Satya Varadhan, Jonathan Wong, Takuma Yamaguchi

    Abstract: We present the NVIDIA cuQuantum SDK, a state-of-the-art library of composable primitives for GPU-accelerated quantum circuit simulations. As the size of quantum devices continues to increase, making their classical simulation progressively more difficult, the availability of fast and scalable quantum circuit simulators becomes vital for quantum algorithm developers, as well as quantum hardware eng… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: paper accepted at QCE 2023, journal reference will be updated whenever available

    MSC Class: 68Q12; 68Q09; 81P68;

  27. arXiv:2307.11677  [pdf, ps, other

    cs.DS

    Improved Approximate Distance Oracles: Bypassing the Thorup-Zwick Bound in Dense Graphs

    Authors: Davide Bilò, Shiri Chechik, Keerti Choudhary, Sarel Cohen, Tobias Friedrich, Martin Schirneck

    Abstract: Despite extensive research on distance oracles, there are still large gaps between the best constructions for spanners and distance oracles. Notably, there exist sparse spanners with a multiplicative stretch of $1+\varepsilon$ plus some additive stretch. A fundamental open problem is whether such a bound is achievable for distance oracles as well. Specifically, can we construct a distance oracle w… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  28. arXiv:2307.10810  [pdf, other

    cs.LG cs.AI

    On Combining Expert Demonstrations in Imitation Learning via Optimal Transport

    Authors: Ilana Sebag, Samuel Cohen, Marc Peter Deisenroth

    Abstract: Imitation learning (IL) seeks to teach agents specific tasks through expert demonstrations. One of the key approaches to IL is to define a distance between agent and expert and to find an agent policy that minimizes that distance. Optimal transport methods have been widely used in imitation learning as they provide ways to measure meaningful distances between agent and expert trajectories. However… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Journal ref: NeurIPS Workshop on Optimal Transport and Machine Learning, 2021

  29. arXiv:2306.15041  [pdf

    q-bio.QM cs.DB

    A Comparison of Neuroelectrophysiology Databases

    Authors: Priyanka Subash, Alex Gray, Misque Boswell, Samantha L. Cohen, Rachael Garner, Sana Salehi, Calvary Fisher, Samuel Hobel, Satrajit Ghosh, Yaroslav Halchenko, Benjamin Dichter, Russell A. Poldrack, Chris Markiewicz, Dora Hermes, Arnaud Delorme, Scott Makeig, Brendan Behan, Alana Sparks, Stephen R Arnott, Zhengjia Wang, John Magnotti, Michael S. Beauchamp, Nader Pouratian, Arthur W. Toga, Dominique Duncan

    Abstract: As data sharing has become more prevalent, three pillars - archives, standards, and analysis tools - have emerged as critical components in facilitating effective data sharing and collaboration. This paper compares four freely available intracranial neuroelectrophysiology data repositories: Data Archive for the BRAIN Initiative (DABI), Distributed Archives for Neurophysiology Data Integration (DAN… ▽ More

    Submitted 30 August, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 22 pages, 6 figures, 5 tables

  30. arXiv:2306.11623  [pdf, ps, other

    stat.ML cs.LG math.ST

    Mean-field Analysis of Generalization Errors

    Authors: Gholamali Aminian, Samuel N. Cohen, Łukasz Szpruch

    Abstract: We propose a novel framework for exploring weak and $L_2$ generalization errors of algorithms through the lens of differential calculus on the space of probability measures. Specifically, we consider the KL-regularized empirical risk minimization problem and establish generic conditions under which the generalization error convergence rate, when training on a sample of size $n$, is… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 49 pages

    MSC Class: 62B10; 60F99; 49N80; 46N30

  31. arXiv:2305.19911  [pdf, other

    cs.LG cs.CL

    Neuron to Graph: Interpreting Language Model Neurons at Scale

    Authors: Alex Foote, Neel Nanda, Esben Kran, Ioannis Konstas, Shay Cohen, Fazl Barez

    Abstract: Advances in Large Language Models (LLMs) have led to remarkable capabilities, yet their inner mechanisms remain largely unknown. To understand these models, we need to unravel the functions of individual neurons and their contribution to the network. This paper introduces a novel automated approach designed to scale interpretability techniques across a vast array of neurons within LLMs, to make th… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  32. arXiv:2305.19734  [pdf, other

    cs.AI cs.CL cs.DB

    Knowledge Base Question Answering for Space Debris Queries

    Authors: Paul Darm, Antonio Valerio Miceli-Barone, Shay B. Cohen, Annalisa Riccardi

    Abstract: Space agencies execute complex satellite operations that need to be supported by the technical knowledge contained in their extensive information systems. Knowledge bases (KB) are an effective way of storing and accessing such information at scale. In this work we present a system, developed for the European Space Agency (ESA), that can answer complex natural language queries, to support engineers… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 7 pages, ACL 2023 industry track

    ACM Class: I.2.7

  33. arXiv:2305.16947  [pdf, other

    cs.CL

    Sentence-Incremental Neural Coreference Resolution

    Authors: Matt Grenander, Shay B. Cohen, Mark Steedman

    Abstract: We propose a sentence-incremental neural coreference resolution system which incrementally builds clusters after marking mention boundaries in a shift-reduce method. The system is aimed at bridging two recent approaches at coreference resolution: (1) state-of-the-art non-incremental models that incur quadratic complexity in document length with high computational cost, and (2) memory network-based… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2022

  34. arXiv:2305.15507  [pdf, other

    cs.CL cs.AI

    The Larger They Are, the Harder They Fail: Language Models do not Recognize Identifier Swaps in Python

    Authors: Antonio Valerio Miceli-Barone, Fazl Barez, Ioannis Konstas, Shay B. Cohen

    Abstract: Large Language Models (LLMs) have successfully been applied to code generation tasks, raising the question of how well these models understand programming. Typical programming languages have invariances and equivariances in their semantics that human programmers intuitively understand and exploit, such as the (near) invariance to the renaming of identifiers. We show that LLMs not only fail to prop… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 17 pages, 5 figure, ACL 2023

  35. arXiv:2305.13632  [pdf, other

    cs.CL cs.AI cs.LG

    Detecting and Mitigating Hallucinations in Multilingual Summarisation

    Authors: Yifu Qiu, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

    Abstract: Hallucinations pose a significant challenge to the reliability of neural models for abstractive summarisation. While automatically generated summaries may be fluent, they often lack faithfulness to the original document. This issue becomes even more pronounced in low-resource settings, such as cross-lingual transfer. With the existing faithful metrics focusing on English, even measuring the extent… ▽ More

    Submitted 26 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  36. Approximate Distance Sensitivity Oracles in Subquadratic Space

    Authors: Davide Bilò, Shiri Chechik, Keerti Choudhary, Sarel Cohen, Tobias Friedrich, Simon Krogmann, Martin Schirneck

    Abstract: An $f$-edge fault-tolerant distance sensitive oracle ($f$-DSO) with stretch $σ\ge 1$ is a data structure that preprocesses a given undirected, unweighted graph $G$ with $n$ vertices and $m$ edges, and a positive integer $f$. When queried with a pair of vertices $s, t$ and a set $F$ of at most $f$ edges, it returns a $σ$-approximation of the $s$-$t$-distance in $G-F$. We study $f$-DSOs that take… ▽ More

    Submitted 4 June, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: The is the arXiv version of the eponymous paper that appeared first at STOC 2023 and then was extended to a journal version, published in TheoretiCS

    Journal ref: TheoretiCS, Volume 3 (2024), Article 15, 1-47

  37. arXiv:2305.08828  [pdf, other

    cs.CL

    PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India

    Authors: Ashok Urlana, Pinzhen Chen, Zheng Zhao, Shay B. Cohen, Manish Shrivastava, Barry Haddow

    Abstract: This paper introduces PMIndiaSum, a multilingual and massively parallel summarization corpus focused on languages in India. Our corpus provides a training and testing ground for four language families, 14 languages, and the largest to date with 196 language pairs. We detail our construction workflow including data acquisition, processing, and quality assurance. Furthermore, we publish benchmarks f… ▽ More

    Submitted 19 October, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

    ACM Class: I.2.7

  38. arXiv:2305.07494  [pdf, other

    cs.GT cs.AI cs.DS

    Temporal Network Creation Games

    Authors: Davide Bilò, Sarel Cohen, Tobias Friedrich, Hans Gawendowicz, Nicolas Klodt, Pascal Lenzner, George Skretas

    Abstract: Most networks are not static objects, but instead they change over time. This observation has sparked rigorous research on temporal graphs within the last years. In temporal graphs, we have a fixed set of nodes and the connections between them are only available at certain time steps. This gives rise to a plethora of algorithmic problems on such graphs, most prominently the problem of finding temp… ▽ More

    Submitted 21 May, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: To appear at the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), full version

  39. arXiv:2305.06000  [pdf, ps, other

    math.NA cs.LG

    Global Convergence of Deep Galerkin and PINNs Methods for Solving Partial Differential Equations

    Authors: Deqing Jiang, Justin Sirignano, Samuel N. Cohen

    Abstract: Numerically solving high-dimensional partial differential equations (PDEs) is a major challenge. Conventional methods, such as finite difference methods, are unable to solve high-dimensional PDEs due to the curse-of-dimensionality. A variety of deep learning methods have been recently developed to try and solve high-dimensional PDEs by approximating the solution using a neural network. In this pap… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

  40. arXiv:2305.03697  [pdf, other

    cs.DS

    Fault-Tolerant ST-Diameter Oracles

    Authors: Davide Bilò, Keerti Choudhary, Sarel Cohen, Tobias Friedrich, Simon Krogmann, Martin Schirneck

    Abstract: We study the problem of estimating the $ST$-diameter of a graph that is subject to a bounded number of edge failures. An $f$-edge fault-tolerant $ST$-diameter oracle ($f$-FDO-$ST$) is a data structure that preprocesses a given graph $G$, two sets of vertices $S,T$, and positive integer $f$. When queried with a set $F$ of at most $f$ edges, the oracle returns an estimate $\widehat{D}$ of the $ST$-d… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: accepted at ICALP 2023

  41. arXiv:2304.14184  [pdf, other

    cs.DS

    Compact Distance Oracles with Large Sensitivity and Low Stretch

    Authors: Davide Bilò, Keerti Choudhary, Sarel Cohen, Tobias Friedrich, Simon Krogmann, Martin Schirneck

    Abstract: An $f$-edge fault-tolerant distance sensitive oracle ($f$-DSO) with stretch $σ\geq 1$ is a data structure that preprocesses an input graph $G$. When queried with the triple $(s,t,F)$, where $s, t \in V$ and $F \subseteq E$ contains at most $f$ edges of $G$, the oracle returns an estimate $\widehat{d}_{G-F}(s,t)$ of the distance $d_{G-F}(s,t)$ between $s$ and $t$ in the graph $G-F$ such that… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: accepted at WADS 2023

  42. arXiv:2304.11743  [pdf, other

    cs.CV

    GamutMLP: A Lightweight MLP for Color Loss Recovery

    Authors: Hoang M. Le, Brian Price, Scott Cohen, Michael S. Brown

    Abstract: Cameras and image-editing software often process images in the wide-gamut ProPhoto color space, encompassing 90% of all visible colors. However, when images are encoded for sharing, this color-rich representation is transformed and clipped to fit within the small-gamut standard RGB (sRGB) color space, representing only 30% of visible colors. Recovering the lost color information is challenging due… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  43. arXiv:2304.03372  [pdf, other

    cs.CV

    TopNet: Transformer-based Object Placement Network for Image Compositing

    Authors: Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen

    Abstract: We investigate the problem of automatically placing an object into a background image for image compositing. Given a background image and a segmented object, the goal is to train a model to predict plausible placements (location and scale) of the object for compositing. The quality of the composite image highly depends on the predicted location/scale. Existing works either generate candidate bound… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: CVPR

  44. arXiv:2303.13971  [pdf, other

    cs.LG

    Optimal Transport for Offline Imitation Learning

    Authors: Yicheng Luo, Zhengyao Jiang, Samuel Cohen, Edward Grefenstette, Marc Peter Deisenroth

    Abstract: With the advent of large datasets, offline reinforcement learning (RL) is a promising framework for learning good decision-making policies without the need to interact with the real environment. However, offline RL requires the dataset to be reward-annotated, which presents practical challenges when reward engineering is difficult or when obtaining reward annotations is labor-intensive. In this pa… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: Published in ICLR 2023

  45. arXiv:2302.10809  [pdf, other

    cs.AI cs.RO

    Causal Explanations for Sequential Decision-Making in Multi-Agent Systems

    Authors: Balint Gyevnar, Cheng Wang, Christopher G. Lucas, Shay B. Cohen, Stefano V. Albrecht

    Abstract: We present CEMA: Causal Explanations in Multi-Agent systems; a framework for creating causal natural language explanations of an agent's decisions in dynamic sequential multi-agent systems to build more trustworthy autonomous agents. Unlike prior work that assumes a fixed causal structure, CEMA only requires a probabilistic model for forward-simulating the state of the system. Using such a model,… ▽ More

    Submitted 14 February, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted in 23rd International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2024

    ACM Class: I.2.9

  46. arXiv:2302.09350  [pdf, other

    cs.CL

    BERT is not The Count: Learning to Match Mathematical Statements with Proofs

    Authors: Weixian Waylon Li, Yftah Ziser, Maximin Coavoux, Shay B. Cohen

    Abstract: We introduce a task consisting in matching a proof to a given mathematical statement. The task fits well within current research on Mathematical Information Retrieval and, more generally, mathematical article analysis (Mathematical Sciences, 2014). We present a dataset for the task (the MATcH dataset) consisting of over 180k statement-proof pairs extracted from modern mathematical research article… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: Accepted to the Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023; 14 pages. arXiv admin note: substantial text overlap with arXiv:2102.02110

  47. arXiv:2302.02997  [pdf, other

    cs.CL

    Erasure of Unaligned Attributes from Neural Representations

    Authors: Shun Shao, Yftah Ziser, Shay Cohen

    Abstract: We present the Assignment-Maximization Spectral Attribute removaL (AMSAL) algorithm, which erases information from neural representations when the information to be erased is implicit rather than directly being aligned to each input example. Our algorithm works by alternating between two steps. In one, it finds an assignment of the input representations to the information to be erased, and in the… ▽ More

    Submitted 18 April, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted to Transactions of the Association for Computational Linguistics, 22 pages (pre-MIT Press publication version)

  48. arXiv:2212.06310  [pdf, other

    cs.CV cs.GR

    Structure-Guided Image Completion with Image-level and Object-level Semantic Discriminators

    Authors: Haitian Zheng, Zhe Lin, **gwan Lu, Scott Cohen, Eli Shechtman, Connelly Barnes, Jianming Zhang, Qing Liu, Yuqian Zhou, Sohrab Amirghodsi, Jiebo Luo

    Abstract: Structure-guided image completion aims to inpaint a local region of an image according to an input guidance map from users. While such a task enables many practical applications for interactive editing, existing methods often struggle to hallucinate realistic object instances in complex natural scenes. Such a limitation is partially due to the lack of semantic-level constraints inside the hole reg… ▽ More

    Submitted 23 April, 2024; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: 18 pages, 16 figures

  49. arXiv:2212.00932  [pdf, other

    cs.CV

    ObjectStitch: Generative Object Compositing

    Authors: Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, Daniel Aliaga

    Abstract: Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results. Furthermore, annotating training data pairs for compositing requires substantial manual effort from professionals, and is hardly scalable. Thus, with the recent advances in generat… ▽ More

    Submitted 5 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

  50. arXiv:2211.11540  [pdf, other

    cs.CR

    A Framework for Auditable Synthetic Data Generation

    Authors: Florimond Houssiau, Samuel N. Cohen, Lukasz Szpruch, Owen Daniel, Michaela G. Lawrence, Robin Mitra, Henry Wilde, Callum Mole

    Abstract: Synthetic data has gained significant momentum thanks to sophisticated machine learning tools that enable the synthesis of high-dimensional datasets. However, many generation techniques do not give the data controller control over what statistical patterns are captured, leading to concerns over privacy protection. While synthetic records are not linked to a particular real-world individual, they c… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.