Skip to main content

Showing 1–28 of 28 results for author: Davidson, T

.
  1. arXiv:2407.06946  [pdf, other

    cs.CL cs.AI cs.LG

    Self-Recognition in Language Models

    Authors: Tim R. Davidson, Viacheslav Surkov, Veniamin Veselovsky, Giuseppe Russo, Robert West, Caglar Gulcehre

    Abstract: A rapidly growing number of applications rely on a small set of closed-source language models (LMs). This dependency might introduce novel security risks if LMs develop self-recognition capabilities. Inspired by human identity verification methods, we propose a novel approach for assessing self-recognition in LMs using model-generated "security questions". Our test can be externally administered t… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Code to reproduce experiments and replicate findings is made available at https://github.com/trdavidson/self-recognition

  2. arXiv:2405.02150  [pdf, other

    cs.CY

    The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates

    Authors: Giuseppe Russo Latona, Manoel Horta Ribeiro, Tim R. Davidson, Veniamin Veselovsky, Robert West

    Abstract: Journals and conferences worry that peer reviews assisted by artificial intelligence (AI), in particular, large language models (LLMs), may negatively influence the validity and fairness of the peer-review system, a cornerstone of modern science. In this work, we address this concern with a quasi-experimental study of the prevalence and impact of AI-assisted peer reviews in the context of the 2024… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Manoel Horta Ribeiro, Tim R. Davidson, and Veniamin Veselovsky contributed equally to this work

  3. arXiv:2401.04536  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating Language Model Agency through Negotiations

    Authors: Tim R. Davidson, Veniamin Veselovsky, Martin Josifoski, Maxime Peyrard, Antoine Bosselut, Michal Kosinski, Robert West

    Abstract: We introduce an approach to evaluate language model (LM) agency using negotiation games. This approach better reflects real-world use cases and addresses some of the shortcomings of alternative LM benchmarks. Negotiation games enable us to study multi-turn, and cross-model interactions, modulate complexity, and side-step accidental evaluation data leakage. We use our approach to test six widely us… ▽ More

    Submitted 16 March, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR 2024, code and link to project data are made available at https://github.com/epfl-dlab/LAMEN

  4. arXiv:2312.07413  [pdf, other

    cs.AI cs.LG

    AI capabilities can be significantly improved without expensive retraining

    Authors: Tom Davidson, Jean-Stanislas Denain, Pablo Villalobos, Guillem Bas

    Abstract: State-of-the-art AI systems can be significantly improved without expensive retraining via "post-training enhancements"-techniques applied after initial training like fine-tuning the system to use a web browser. We review recent post-training enhancements, categorizing them into five types: tool-use, prompting methods, scaffolding, solution selection, and data generation. Different enhancements im… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 30 pages, 24 figures

  5. arXiv:2306.01985  [pdf, other

    cs.CL

    COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements

    Authors: Xuhui Zhou, Hao Zhu, Akhila Yerukola, Thomas Davidson, Jena D. Hwang, Swabha Swayamdipta, Maarten Sap

    Abstract: Warning: This paper contains content that may be offensive or upsetting. Understanding the harms and offensiveness of statements requires reasoning about the social and situational context in which statements are made. For example, the utterance "your English is very good" may implicitly signal an insult when uttered by a white man to a non-white colleague, but uttered by an ESL teacher to their s… ▽ More

    Submitted 8 June, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted to Findings of ACL 2023

  6. arXiv:2206.09024  [pdf

    cs.SI

    Partisan US News Media Representations of Syrian Refugees

    Authors: Keyu Chen, Marzieh Babaeianjelodar, Yiwen Shi, Kamila Janmohamed, Rupak Sarkar, Ingmar Weber, Thomas Davidson, Munmun De Choudhury, Jonathan Huang, Shweta Yadav, Ashique Khudabukhsh, Preslav Ivanov Nakov, Chris Bauch, Orestis Papakyriakopoulos, Kaveh Khoshnood, Navin Kumar

    Abstract: We investigate how representations of Syrian refugees (2011-2021) differ across US partisan news outlets. We analyze 47,388 articles from the online US media about Syrian refugees to detail differences in reporting between left- and right-leaning media. We use various NLP techniques to understand these differences. Our polarization and question answering results indicated that left-leaning media t… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  7. arXiv:2102.13011  [pdf, other

    cs.CV

    Learning for Unconstrained Space-Time Video Super-Resolution

    Authors: Zhihao Shi, Xiaohong Liu, Chengqi Li, Linhui Dai, Jun Chen, Timothy N. Davidson, Jiying Zhao

    Abstract: Recent years have seen considerable research activities devoted to video enhancement that simultaneously increases temporal frame rate and spatial resolution. However, the existing methods either fail to explore the intrinsic relationship between temporal and spatial information or lack flexibility in the choice of final temporal/spatial resolution. In this work, we propose an unconstrained space-… ▽ More

    Submitted 31 August, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

  8. arXiv:2010.13681  [pdf, other

    cs.DC cs.HC

    Aggregate-Driven Trace Visualizations for Performance Debugging

    Authors: Vaastav Anand, Matheus Stolet, Thomas Davidson, Ivan Beschastnikh, Tamara Munzner, Jonathan Mace

    Abstract: Performance issues in cloud systems are hard to debug. Distributed tracing is a widely adopted approach that gives engineers visibility into cloud systems. Existing trace analysis approaches focus on debugging single request correctness issues but not debugging single request performance issues. Diagnosing a performance issue in a given request requires comparing the performance of the offending r… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

  9. arXiv:2005.13041  [pdf, other

    cs.CL cs.SI

    Examining Racial Bias in an Online Abuse Corpus with Structural Topic Modeling

    Authors: Thomas Davidson, Debasmita Bhattacharya

    Abstract: We use structural topic modeling to examine racial bias in data collected to train models to detect hate speech and abusive language in social media posts. We augment the abusive language dataset by adding an additional feature indicating the predicted probability of the tweet being written in African-American English. We then use structural topic modeling to examine the content of the tweets and… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

    Comments: Please cite the published version, see proceedings of ICWSM 2020

  10. arXiv:1910.02912  [pdf, other

    stat.ML cs.LG

    Increasing Expressivity of a Hyperspherical VAE

    Authors: Tim R. Davidson, Jakub M. Tomczak, Efstratios Gavves

    Abstract: Learning suitable latent representations for observed, high-dimensional data is an important research topic underlying many recent advances in machine learning. While traditionally the Gaussian normal distribution has been the go-to latent parameterization, recently a variety of works have successfully proposed the use of manifold-valued latents. In one such work (Davidson et al., 2018), the autho… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019, in Workshop on Bayesian Deep Learning

  11. arXiv:1905.12516  [pdf, ps, other

    cs.CL cs.LG

    Racial Bias in Hate Speech and Abusive Language Detection Datasets

    Authors: Thomas Davidson, Debasmita Bhattacharya, Ingmar Weber

    Abstract: Technologies for abusive language detection are being developed and applied with little consideration of their potential biases. We examine racial bias in five different sets of Twitter data annotated for hate speech and abusive language. We train classifiers on these datasets and compare the predictions of these classifiers on tweets written in African-American English with those written in Stand… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: To appear in the proceedings of the Third Abusive Language Workshop (https://sites.google.com/view/alw3/) at the Annual Meeting for the Association for Computational Linguistics 2019. Please cite the published version

  12. arXiv:1903.02958  [pdf, other

    stat.ML cs.CG cs.LG math.PR math.RT

    Reparameterizing Distributions on Lie Groups

    Authors: Luca Falorsi, Pim de Haan, Tim R. Davidson, Patrick Forré

    Abstract: Reparameterizable densities are an important way to learn probability distributions in a deep learning setting. For many distributions it is possible to create low-variance gradient estimators by utilizing a `reparameterization trick'. Due to the absence of a general reparameterization trick, much research has recently been devoted to extend the number of reparameterizable distributional families.… ▽ More

    Submitted 7 March, 2019; originally announced March 2019.

    Comments: AISTATS (2019), code available at https://github.com/pimdh/relie

  13. arXiv:1809.07453  [pdf, ps, other

    cs.IT

    Uplink Resource Allocation for Multiple Access Computational Offloading (Extended Version)

    Authors: Mahsa Salmani, Timothy N. Davidson

    Abstract: The mobile edge computing framework offers the opportunity to reduce the energy that devices must expend to complete computational tasks. The extent of that energy reduction depends on the nature of the tasks, and on the choice of the multiple access scheme. In this paper, we first address the uplink communication resource allocation for offloading systems that exploit the full capabilities of the… ▽ More

    Submitted 29 April, 2019; v1 submitted 19 September, 2018; originally announced September 2018.

  14. arXiv:1807.04689  [pdf, other

    stat.ML cs.LG

    Explorations in Homeomorphic Variational Auto-Encoding

    Authors: Luca Falorsi, Pim de Haan, Tim R. Davidson, Nicola De Cao, Maurice Weiler, Patrick Forré, Taco S. Cohen

    Abstract: The manifold hypothesis states that many kinds of high-dimensional data are concentrated near a low-dimensional manifold. If the topology of this data manifold is non-trivial, a continuous encoder network cannot embed it in a one-to-one manner without creating holes of low density in the latent space. This is at odds with the Gaussian prior assumption typically made in Variational Auto-Encoders (V… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: 16 pages, 8 figures, ICML workshop on Theoretical Foundations and Applications of Deep Generative Models

  15. arXiv:1805.04981  [pdf, other

    cs.IT

    Multiple Access Computational Offloading: Communication Resource Allocation in the Two-User Case (Extended Version)

    Authors: Mahsa Salmani, Timothy N. Davidson

    Abstract: By offering shared computational facilities to which mobile devices can offload their computational tasks, the mobile edge computing framework is expanding the scope of applications that can be provided on resource-constrained devices. When multiple devices seek to use such a facility simultaneously, both the available computational resources and the available communication resources need to be ap… ▽ More

    Submitted 14 October, 2018; v1 submitted 13 May, 2018; originally announced May 2018.

    Comments: 50 pages (single-column), 12 figures, A condensed version of this manuscript is submitted to TSP

  16. arXiv:1804.00891  [pdf, other

    stat.ML cs.LG

    Hyperspherical Variational Auto-Encoders

    Authors: Tim R. Davidson, Luca Falorsi, Nicola De Cao, Thomas Kipf, Jakub M. Tomczak

    Abstract: The Variational Auto-Encoder (VAE) is one of the most used unsupervised machine learning models. But although the default choice of a Gaussian distribution for both the prior and posterior represents a mathematically convenient distribution often leading to competitive results, we show that this parameterization fails to model data with a latent hyperspherical structure. To address this issue we p… ▽ More

    Submitted 27 September, 2022; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: Code at http://github.com/nicola-decao/s-vae-tf and https://github.com/nicola-decao/s-vae-pytorch, Blogpost: https://nicola-decao.github.io/s-vae

    Journal ref: Uncertainty in Artificial Intelligence (UAI). Proceedings of the Thirty-Fourth Conference (2018) 856- 865

  17. arXiv:1710.09786  [pdf, ps, other

    eess.SP cs.IT

    Offset-Based Beamforming: A New Approach to Robust Downlink Transmission

    Authors: Mostafa Medra, Yongwei Huang, Timothy N. Davidson

    Abstract: The design of a set of beamformers for the multiuser multiple-input single-output (MISO) downlink that provides the receivers with prespecified levels of quality-of-service (QoS) can be quite challenging when the channel state information is not perfectly known at the base station. The constraint of having the SINR meet or exceed a given threshold with high probability is intractable in general, w… ▽ More

    Submitted 26 October, 2017; originally announced October 2017.

  18. arXiv:1705.09899  [pdf, ps, other

    cs.CL

    Understanding Abuse: A Typology of Abusive Language Detection Subtasks

    Authors: Zeerak Waseem, Thomas Davidson, Dana Warmsley, Ingmar Weber

    Abstract: As the body of research on abusive language detection and analysis grows, there is a need for critical consideration of the relationships between different subtasks that have been grouped under this label. Based on work on hate speech, cyberbullying, and online abuse we propose a typology that captures central similarities and differences between subtasks and we discuss its implications for data a… ▽ More

    Submitted 30 May, 2017; v1 submitted 28 May, 2017; originally announced May 2017.

    Comments: To appear in the proceedings of the 1st Workshop on Abusive Language Online. Please cite that version

  19. Low-Complexity Robust MISO Downlink Precoder Design With Per-Antenna Power Constraints

    Authors: Mostafa Medra, Timothy N. Davidson

    Abstract: This paper considers the design of the beamformers for a multiple-input single-output (MISO) downlink system that seeks to mitigate the impact of the imperfections in the channel state information (CSI) that is available at the base station (BS). The goal of the design is to minimize the outage probability of specified signal-to-interference-and-noise ratio (SINR) targets, while satisfying per-ant… ▽ More

    Submitted 25 April, 2017; originally announced April 2017.

  20. arXiv:1703.04009  [pdf, other

    cs.CL

    Automated Hate Speech Detection and the Problem of Offensive Language

    Authors: Thomas Davidson, Dana Warmsley, Michael Macy, Ingmar Weber

    Abstract: A key challenge for automatic hate-speech detection on social media is the separation of hate speech from other instances of offensive language. Lexical detection methods tend to have low precision because they classify all messages containing particular terms as hate speech and previous work using supervised learning has failed to distinguish between the two categories. We used a crowd-sourced ha… ▽ More

    Submitted 11 March, 2017; originally announced March 2017.

    Comments: To appear in the Proceedings of ICWSM 2017. Please cite that version

  21. Coordinate Update Algorithms for Robust Power Loading for the MU-MISO Downlink with Outage Constraints

    Authors: Foad Sohrabi, Timothy N. Davidson

    Abstract: We consider the problem of power allocation for the single-cell multi-user (MU) multiple-input single-output (MISO) downlink with quality-of-service (QoS) constraints. The base station acquires an estimate of the channels and, for a given beamforming structure, designs the power allocation so as to minimize the total transmission power required to ensure that target signal-to-interference-and-nois… ▽ More

    Submitted 26 February, 2016; originally announced February 2016.

    Comments: 14 pages, 6 figures, to appear in IEEE Transactions on Signal Processing, 2016

  22. arXiv:1210.0614  [pdf, ps, other

    cs.LO cs.PL quant-ph

    Analysis of a Quantum Error Correcting Code using Quantum Process Calculus

    Authors: Timothy A. S. Davidson, Simon J. Gay, Rajagopal Nagarajan, Ittoop Vergheese Puthoor

    Abstract: We describe the use of quantum process calculus to describe and analyze quantum communication protocols, following the successful field of formal methods from classical computer science. The key idea is to define two systems, one modelling a protocol and one expressing a specification, and prove that they are behaviourally equivalent. We summarize the necessary theory in the process calculus CQP,… ▽ More

    Submitted 1 October, 2012; originally announced October 2012.

    Comments: In Proceedings QPL 2011, arXiv:1210.0298

    ACM Class: D.3.1; F.3.1

    Journal ref: EPTCS 95, 2012, pp. 67-80

  23. arXiv:1108.0469  [pdf, ps, other

    cs.LO quant-ph

    Formal Analysis of Quantum Systems using Process Calculus

    Authors: Timothy A. S. Davidson, Simon J. Gay, Rajagopal Nagarajan

    Abstract: Quantum communication and cryptographic protocols are well on the way to becoming an important practical technology. Although a large amount of successful research has been done on proving their correctness, most of this work does not make use of familiar techniques from formal methods, such as formal logics for specification, formal modelling languages, separation of levels of abstraction, and co… ▽ More

    Submitted 1 August, 2011; originally announced August 2011.

    Comments: In Proceedings ICE 2011, arXiv:1108.0144

    Journal ref: EPTCS 59, 2011, pp. 104-110

  24. arXiv:0911.0660  [pdf, ps, other

    cs.IT

    The capacity region of a product of two unmatched Gaussian broadcast channels with three particular messages and a common message

    Authors: Ramy H. Gohary, Timothy N. Davidson

    Abstract: This paper considers a Gaussian broadcast channel with two unmatched degraded components, three particular messages, and a common message that is intended for all three receivers. It is shown that for this channel superposition coding and Gaussian signalling is sufficient to achieve every point in the capacity region.

    Submitted 3 November, 2009; originally announced November 2009.

  25. arXiv:0804.2473  [pdf, ps, other

    cs.IT

    A Design Framework for Limited Feedback MIMO Systems with Zero-Forcing DFE

    Authors: Michael Botros Shenouda, Timothy Davidson

    Abstract: We consider the design of multiple-input multiple-output communication systems with a linear precoder at the transmitter, zero-forcing decision feedback equalization (ZF-DFE) at the receiver, and a low-rate feedback channel that enables communication from the receiver to the transmitter. The channel state information (CSI) available at the receiver is assumed to be perfect, and based on this inf… ▽ More

    Submitted 15 April, 2008; v1 submitted 15 April, 2008; originally announced April 2008.

    Comments: Submitted to JSAC: Manuscript submitted 4 November 2007; revised 15 April 2008

  26. arXiv:0712.1659  [pdf, ps, other

    cs.IT

    Non-linear and Linear Broadcasting with QoS Requirements: Tractable Approaches for Bounded Channel Uncertainties

    Authors: Michael Botros Shenouda, Timothy N. Davidson

    Abstract: We consider the downlink of a cellular system in which the base station employs multiple transmit antennas, each receiver has a single antenna, and the users specify. We consider communication schemes in which the users have certain Quality of Service (QoS) requirements. We study the design of robust broadcasting schemes that minimize the transmission power necessary to guarantee that the QoS re… ▽ More

    Submitted 11 December, 2007; originally announced December 2007.

    Comments: Submitted to IEEE Transaction of Signal Processing

  27. arXiv:cs/0701169  [pdf, ps, other

    cs.IT

    A Framework for Designing MIMO systems with Decision Feedback Equalization or Tomlinson-Harashima Precoding

    Authors: Michael Botros Shenouda, T. N. Davidson

    Abstract: We consider joint transceiver design for general Multiple-Input Multiple-Output communication systems that implement interference (pre-)subtraction, such as those based on Decision Feedback Equalization (DFE) or Tomlinson-Harashima precoding (THP). We develop a unified framework for joint transceiver design by considering design criteria that are expressed as functions of the Mean Square Error (… ▽ More

    Submitted 27 January, 2007; v1 submitted 25 January, 2007; originally announced January 2007.

    Comments: To appear in ICASSP 2007

  28. Design of Block Transceivers with Decision Feedback Detection

    Authors: Fang Xu, Tim Davidson, Jian-Kang Zhang, K. Max Wong

    Abstract: This paper presents a method for jointly designing the transmitter-receiver pair in a block-by-block communication system that employs (intra-block) decision feedback detection. We provide closed-form expressions for transmitter-receiver pairs that simultaneously minimize the arithmetic mean squared error (MSE) at the decision point (assuming perfect feedback), the geometric MSE, and the bit err… ▽ More

    Submitted 5 April, 2005; originally announced April 2005.

    Comments: 14 pages, 8 figures, to appear in the IEEE Transactions on Signal Processing