Skip to main content

Showing 1–15 of 15 results for author: Smith, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.18761  [pdf, other

    cs.CL

    Can training neural language models on a curriculum with developmentally plausible data improve alignment with human reading behavior?

    Authors: Aryaman Chobey, Oliver Smith, Anzi Wang, Grusha Prasad

    Abstract: The use of neural language models to model human behavior has met with mixed success. While some work has found that the surprisal estimates from these models can be used to predict a wide range of human neural and behavioral responses, other work studying more complex syntactic phenomena has found that these surprisal estimates generate incorrect behavioral predictions. This paper explores the ex… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: To appear in the proceedings of BabyLM shared task CoNLL 2023

  2. arXiv:2305.14223  [pdf, other

    cs.MA cs.AI cs.GT cs.LG

    Co-Learning Empirical Games and World Models

    Authors: Max Olan Smith, Michael P. Wellman

    Abstract: Game-based decision-making involves reasoning over both world dynamics and strategic interactions among the agents. Typically, empirical models capturing these respective aspects are learned and used separately. We investigate the potential gain from co-learning these elements: a world model for dynamics and an empirical game for strategic interactions. Empirical games drive world models toward a… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  3. arXiv:2303.03196  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning

    Authors: Marc Lanctot, John Schultz, Neil Burch, Max Olan Smith, Daniel Hennes, Thomas Anthony, Julien Perolat

    Abstract: Progress in fields of machine learning and adversarial planning has benefited significantly from benchmark domains, from checkers and the classic UCI data sets to Go and Diplomacy. In sequential decision-making, agent evaluation has largely been restricted to few interactions against experts, with the aim to reach some desired level of performance (e.g. beating a human professional player). We pro… ▽ More

    Submitted 31 October, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: 25 pages, 8 figures, Accepted at TMLR October 2023

  4. arXiv:2106.01901  [pdf, other

    cs.MA

    Iterative Empirical Game Solving via Single Policy Best Response

    Authors: Max Olan Smith, Thomas Anthony, Michael P. Wellman

    Abstract: Policy-Space Response Oracles (PSRO) is a general algorithmic framework for learning policies in multiagent systems by interleaving empirical game analysis with deep reinforcement learning (Deep RL). At each iteration, Deep RL is invoked to train a best response to a mixture of opponent policies. The repeated application of Deep RL poses an expensive computational burden as we look to apply this a… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Journal ref: ICLR 2021

  5. arXiv:2103.01904  [pdf, other

    cs.LG stat.ML

    A Spectral Enabled GAN for Time Series Data Generation

    Authors: Kaleb E. Smith, Anthony O. Smith

    Abstract: Time dependent data is a main source of information in today's data driven world. Generating this type of data though has shown its challenges and made it an interesting research area in the field of generative machine learning. One such approach was that by Smith et al. who developed Time Series Generative Adversarial Network (TSGAN) which showed promising performance in generating time dependent… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

  6. arXiv:2009.14180  [pdf, other

    cs.MA

    Learning to Play against Any Mixture of Opponents

    Authors: Max Olan Smith, Thomas Anthony, Yongzhao Wang, Michael P. Wellman

    Abstract: Intuitively, experience playing against one mixture of opponents in a given domain should be relevant for a different mixture in the same domain. We propose a transfer learning method, Q-Mixing, that starts by learning Q-values against each pure-strategy opponent. Then a Q-value for any distribution of opponent strategies is approximated by appropriately averaging the separately learned Q-values.… ▽ More

    Submitted 3 June, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

  7. arXiv:2006.16477  [pdf, other

    cs.LG stat.ML

    Conditional GAN for timeseries generation

    Authors: Kaleb E Smith, Anthony O Smith

    Abstract: It is abundantly clear that time dependent data is a vital source of information in the world. The challenge has been for applications in machine learning to gain access to a considerable amount of quality data needed for algorithm development and analysis. Modeling synthetic data using a Generative Adversarial Network (GAN) has been at the heart of providing a viable solution. Our work focuses on… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

  8. arXiv:1911.07575  [pdf, ps, other

    cs.SD eess.AS

    A Spatial Sampling Approach to Wave Field Synthesis: PBAP and Huygens Arrays

    Authors: Julius O. Smith III

    Abstract: A simple approach to microphone- and speaker-arrays is described in which the microphone array is regarded as a sampling grid for the acoustic field, and the corresponding speaker-array is treated as a "spatial digital to analog converter" that reconstructs the acoustic field from its spatial samples. Advantages of this approach include ease of understanding and teaching, ease of deployment, effec… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: 42 pages

  9. arXiv:1909.02128  [pdf, other

    cs.AI cs.LG cs.MA

    No Press Diplomacy: Modeling Multi-Agent Gameplay

    Authors: Philip Paquette, Yuchen Lu, Steven Bocco, Max O. Smith, Satya Ortiz-Gagne, Jonathan K. Kummerfeld, Satinder Singh, Joelle Pineau, Aaron Courville

    Abstract: Diplomacy is a seven-player non-stochastic, non-cooperative game, where agents acquire resources through a mix of teamwork and betrayal. Reliance on trust and coordination makes Diplomacy the first non-cooperative multi-agent benchmark for complex sequential social dilemmas in a rich environment. In this work, we focus on training an agent that learns to play the No Press version of Diplomacy wher… ▽ More

    Submitted 19 November, 2019; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted at NeurIPS 2019

  10. arXiv:1904.06132  [pdf

    cs.HC

    Looking At Situationally-Induced Impairments And Disabilities (SIIDs) With People With Cognitive Brain Injury

    Authors: Osian Smith, Stephen Lindsay

    Abstract: In this document, we discuss our work into a speaker recognition to support people with prosopagnosia and the limitations of alerting the user of whom they are in discussion with. We will discuss how current research into Situationally Induced Impairments Disabilities (SIIDs) can assist people with disabilities and vice versa and how our work can support people who may find themselves in a situati… ▽ More

    Submitted 12 April, 2019; originally announced April 2019.

    Comments: Presented at the CHI'19 Workshop: Addressing the Challenges of Situationally-Induced Impairments and Disabilities in Mobile Interaction, 2019 (arXiv:1904.05382)

    Report number: SIID/2019/no10

  11. arXiv:1811.11866  [pdf, other

    cs.IR cs.SI

    A Review on Recommendation Systems: Context-aware to Social-based

    Authors: S. M. Mahdi Seyednezhad, Kailey Nobuko Cozart, John Anthony Bowllan, Anthony O. Smith

    Abstract: The number of Internet users had grown rapidly enticing companies and cooperations to make full use of recommendation infrastructures. Consequently, online advertisement companies emerged to aid us in the presence of numerous items and users. Even as a user, you may find yourself drowned in a set of items that you think you might need, but you are not sure if you should try them. Those items could… ▽ More

    Submitted 28 November, 2018; originally announced November 2018.

    Comments: 44 pages without bibliography, 4 chapters, Slide presentation: https://www.slideshare.net/MahdiSeyednejad/recommender-systems-97094937

    MSC Class: 68T35 ACM Class: I.2.1

  12. arXiv:1801.01589  [pdf, other

    cs.SD cs.MM eess.AS

    Neural Style Transfer for Audio Spectograms

    Authors: Prateek Verma, Julius O. Smith

    Abstract: There has been fascinating work on creating artistic transformations of images by Gatys. This was revolutionary in how we can in some sense alter the 'style' of an image while generally preserving its 'content'. In our work, we present a method for creating new sounds using a similar approach, treating it as a style-transfer problem, starting from a random-noise input signal and iteratively using… ▽ More

    Submitted 4 January, 2018; originally announced January 2018.

    Comments: Appeared in 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA at the workshop for Machine Learning for Creativity and Design

  13. arXiv:1610.08838  [pdf, other

    stat.ML cs.LG

    A Category Space Approach to Supervised Dimensionality Reduction

    Authors: Anthony O. Smith, Anand Rangarajan

    Abstract: Supervised dimensionality reduction has emerged as an important theme in the last decade. Despite the plethora of models and formulations, there is a lack of a simple model which aims to project the set of patterns into a space defined by the classes (or categories). To this end, we set up a model in which each class is represented as a 1D subspace of the vector space formed by the features. Assum… ▽ More

    Submitted 27 October, 2016; originally announced October 2016.

  14. arXiv:1606.06154  [pdf, other

    cs.CE cs.SD eess.SY

    Closed Form Fractional Integration and Differentiation via Real Exponentially Spaced Pole-Zero Pairs

    Authors: Julius Orion Smith, Harrison Freeman Smith

    Abstract: We derive closed-form expressions for the poles and zeros of approximate fractional integrator/differentiator filters, which correspond to spectral roll-off filters having any desired log-log slope to a controllable degree of accuracy over any bandwidth. The filters can be described as a uniform exponential distribution of poles along the negative-real axis of the s plane, with zeros interleaving… ▽ More

    Submitted 7 June, 2016; originally announced June 2016.

    Comments: 10 pages, 8 figures

  15. Efficient Synthesis of Room Acoustics via Scattering Delay Networks

    Authors: Enzo De Sena, Huseyin Hacihabiboglu, Zoran Cvetkovic, Julius O. Smith III

    Abstract: An acoustic reverberator consisting of a network of delay lines connected via scattering junctions is proposed. All parameters of the reverberator are derived from physical properties of the enclosure it simulates. It allows for simulation of unequal and frequency-dependent wall absorption, as well as directional sources and microphones. The reverberator renders the first-order reflections exactly… ▽ More

    Submitted 9 July, 2015; v1 submitted 19 February, 2015; originally announced February 2015.

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 23, No. 9, September 2015