Showing 1–37 of 37 results for author: Gordon, M

Searching in archive cs.
  1. arXiv:2405.14105  [pdf, other]

    cs.DC cs.AI cs.CL cs.LG

    Distributed Speculative Inference of Large Language Models

    Authors: Nadav Timor, Jonathan Mamou, Daniel Korat, Moshe Berchansky, Oren Pereg, Moshe Wasserblat, Tomer Galanti, Michal Gordon, David Harel

    Abstract: Accelerating the inference of large language models (LLMs) is an important challenge in artificial intelligence. This paper introduces distributed speculative inference (DSI), a novel distributed inference algorithm that is provably faster than speculative inference (SI) [leviathan2023fast, chen2023accelerating, miao2023specinfer] and traditional autoregressive inference (non-SI). Like other SI al…

    Submitted 28 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2403.19851  [pdf, other]

    cs.CL cs.CR cs.LG stat.ML

    Localizing Paragraph Memorization in Language Models

    Authors: Niklas Stoehr, Mitchell Gordon, Chiyuan Zhang, Owen Lewis

    Abstract: Can we localize the weights and mechanisms used by a language model to memorize and recite entire paragraphs of its training data? In this paper, we show that while memorization is spread across multiple layers and model components, gradients of memorized paragraphs have a distinguishable spatial pattern, being larger in lower model layers than gradients of non-memorized examples. Moreover, the me…

    Submitted 28 March, 2024; originally announced March 2024.

  3. arXiv:2402.05070  [pdf, other]

    cs.AI cs.CL cs.IR

    A Roadmap to Pluralistic Alignment

    Authors: Taylor Sorensen, Jared Moore, Jillian Fisher, Mitchell Gordon, Niloofar Mireshghallah, Christopher Michael Rytting, Andre Ye, Liwei Jiang, Ximing Lu, Nouha Dziri, Tim Althoff, Yejin Choi

    Abstract: With increased power and prevalence of AI systems, it is ever more critical that AI systems are designed to serve all, i.e., people with diverse values and perspectives. However, aligning models to serve pluralistic human values remains an open research question. In this piece, we propose a roadmap to pluralistic alignment, specifically using language models as a test bed. We identify and formaliz…

    Submitted 7 February, 2024; originally announced February 2024.

  4. arXiv:2312.04836  [pdf, other]

    cs.ET cond-mat.stat-mech cs.AI

    Thermodynamic Computing System for AI Applications

    Authors: Denis Melanson, Mohammad Abu Khater, Maxwell Aifer, Kaelan Donatella, Max Hunter Gordon, Thomas Ahle, Gavin Crooks, Antonio J. Martinez, Faris Sbahi, Patrick J. Coles

    Abstract: Recent breakthroughs in artificial intelligence (AI) algorithms have highlighted the need for novel computing hardware in order to truly unlock the potential for AI. Physics-based hardware, such as thermodynamic computing, has the potential to provide a fast, low-power means to accelerate AI primitives, especially generative AI and probabilistic AI. In this work, we present the first continuous-va…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 26 pages, 22 figures

  5. Cura: Curation at Social Media Scale

    Authors: Wanrong He, Mitchell L. Gordon, Lindsay Popowski, Michael S. Bernstein

    Abstract: How can online communities execute a focused vision for their space? Curation offers one approach, where community leaders manually select content to share with the community. Curation enables leaders to shape a space that matches their taste, norms, and values, but the practice is often intractable at social media scale: curators cannot realistically sift through hundreds or thousands of submissi…

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: CSCW 2023

  6. arXiv:2308.05660  [pdf, other]

    cond-mat.stat-mech cs.ET quant-ph

    Thermodynamic Linear Algebra

    Authors: Maxwell Aifer, Kaelan Donatella, Max Hunter Gordon, Samuel Duffield, Thomas Ahle, Daniel Simpson, Gavin E. Crooks, Patrick J. Coles

    Abstract: Linear algebraic primitives are at the core of many modern algorithms in engineering, science, and machine learning. Hence, accelerating these primitives with novel computing hardware would have tremendous economic impact. Quantum computing has been proposed for this purpose, although the resource requirements are far beyond current technological capabilities, so this approach remains long-term in…

    Submitted 10 June, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: 15+22 pages, 6 figures

  7. arXiv:2304.09050  [pdf]

    q-bio.NC cs.LG stat.ML

    Decoding Neural Activity to Assess Individual Latent State in Ecologically Valid Contexts

    Authors: Stephen M. Gordon, Jonathan R. McDaniel, Kevin W. King, Vernon J. Lawhern, Jonathan Touryan

    Abstract: There exist very few ways to isolate cognitive processes, historically defined via highly controlled laboratory studies, in more ecologically valid contexts. Specifically, it remains unclear as to what extent patterns of neural activity observed under such constraints actually manifest outside the laboratory in a manner that can be used to make an accurate inference about the latent state, associa…

    Submitted 18 April, 2023; originally announced April 2023.

    Journal ref: Journal of Neural Engineering, vol. 20(4), 2023

  8. arXiv:2302.07416  [pdf, other]

    cs.LG

    Deep Convolutional Neural Network for Plume Rise Measurements in Industrial Environments

    Authors: Mohammad Koushafar, Gunho Sohn, Mark Gordon

    Abstract: Estimating Plume Cloud (PC) height is essential for various applications, such as global climate models. Smokestack Plume Rise (PR) is the constant height at which the PC is carried downwind as its momentum dissipates and the PC and the ambient temperatures equalize. Although different parameterizations are used in most air-quality models to predict PR, they have yet to be verified thoroughly. Thi…

    Submitted 12 March, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  9. arXiv:2211.04965  [pdf, other]

    quant-ph cs.LG stat.ML

    Resource frugal optimizer for quantum machine learning

    Authors: Charles Moussa, Max Hunter Gordon, Michal Baczyk, M. Cerezo, Lukasz Cincio, Patrick J. Coles

    Abstract: Quantum-enhanced data science, also known as quantum machine learning (QML), is of growing interest as an application of near-term quantum computers. Variational QML algorithms have the potential to solve practical problems on real hardware, particularly when involving quantum data. However, training these algorithms can be challenging and calls for tailored optimization procedures. Specifically,…

    Submitted 28 July, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 22 pages, 6 figures - extra quantum autoencoder results added - extra affiliation

    Report number: LA-UR-22-31774

    Journal ref: Quantum Sci. Technol. 8 045019 (2023)

  10. arXiv:2211.03555  [pdf, ps, other]

    cs.HC

    Can laypeople predict the replicability of social science studies without expert intervention: an exploratory study

    Authors: Juntao Wang, Jonathan Lei, Anna Dreber, Michael Gordon, Magnus Johannesson, Thomas Pfeiffer, Yiling Chen

    Abstract: The low replication rate of published studies has long concerned the social science community, making understanding the replicability a critical problem. Several studies have shown that relevant research communities can make predictions about the replicability of individual studies with above-chance accuracy. Follow-up work further indicates that laypeople can also achieve above-chance accuracy in…

    Submitted 2 November, 2022; originally announced November 2022.

  11. arXiv:2206.09919  [pdf, other]

    quant-ph cs.LG stat.ML

    Inference-Based Quantum Sensing

    Authors: C. Huerta Alderete, Max Hunter Gordon, Frederic Sauvage, Akira Sone, Andrew T. Sornborger, Patrick J. Coles, M. Cerezo

    Abstract: In a standard Quantum Sensing (QS) task one aims at estimating an unknown parameter $θ$, encoded into an $n$-qubit probe state, via measurements of the system. The success of this task hinges on the ability to correlate changes in the parameter to changes in the system response $\mathcal{R}(θ)$ (i.e., changes in the measurement outcomes). For simple cases the form of $\mathcal{R}(θ)$ is known, but…

    Submitted 4 August, 2023; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: 7+13 pages, 3+7 figures

    Report number: LA-UR-22-25337

    Journal ref: Phys. Rev. Lett. 129, 190501 (2022)

  12. arXiv:2204.03495  [pdf, other]

    quant-ph cs.LG stat.ML

    Covariance matrix preparation for quantum principal component analysis

    Authors: Max Hunter Gordon, M. Cerezo, Lukasz Cincio, Patrick J. Coles

    Abstract: Principal component analysis (PCA) is a dimensionality reduction method in data analysis that involves diagonalizing the covariance matrix of the dataset. Recently, quantum algorithms have been formulated for PCA based on diagonalizing a density matrix. These algorithms assume that the covariance matrix can be encoded in a density matrix, but a concrete protocol for this encoding has been lacking.…

    Submitted 24 October, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: 13 + 3 pages, 8 figures

    Report number: LA-UR-22-23149

    Journal ref: PRX Quantum 3, 030334 (2022)

  13. arXiv:2202.12950  [pdf, other]

    eess.SP cs.AI cs.LG

    2021 BEETL Competition: Advancing Transfer Learning for Subject Independence & Heterogenous EEG Data Sets

    Authors: Xiaoxi Wei, A. Aldo Faisal, Moritz Grosse-Wentrup, Alexandre Gramfort, Sylvain Chevallier, Vinay Jayaram, Camille Jeunet, Stylianos Bakas, Siegfried Ludwig, Konstantinos Barmpas, Mehdi Bahri, Yannis Panagakis, Nikolaos Laskaris, Dimitrios A. Adamos, Stefanos Zafeiriou, William C. Duong, Stephen M. Gordon, Vernon J. Lawhern, Maciej Śliwowski, Vincent Rouanne, Piotr Tempczyk

    Abstract: Transfer learning and meta-learning offer some of the most promising avenues to unlock the scalability of healthcare and consumer technologies driven by biosignal data. This is because current methods cannot generalise well across human subjects' data and handle learning from different heterogeneously collected data sets, thus limiting the scale of training data. On the other side, developments in…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: PrePrint of the NeurIPS2021 BEETL Competition Submitted to Proceedings of Machine Learning Research (PMLR)

  14. arXiv:2202.02950  [pdf, other]

    cs.HC cs.AI cs.LG

    Jury Learning: Integrating Dissenting Voices into Machine Learning Models

    Authors: Mitchell L. Gordon, Michelle S. Lam, Joon Sung Park, Kayur Patel, Jeffrey T. Hancock, Tatsunori Hashimoto, Michael S. Bernstein

    Abstract: Whose labels should a machine learning (ML) algorithm learn to emulate? For ML tasks ranging from online comment toxicity to misinformation detection to medical diagnosis, different groups in society may have irreconcilable disagreements about ground truth labels. Supervised ML today resolves these label disagreements implicitly using majority vote, which overrides minority groups' labels. We intr…

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: To appear at CHI 2022

  15. Visual Intelligence through Human Interaction

    Authors: Ranjay Krishna, Mitchell Gordon, Li Fei-Fei, Michael Bernstein

    Abstract: Over the last decade, Computer Vision, the branch of Artificial Intelligence aimed at understanding the visual world, has evolved from simply recognizing objects in images to describing pictures, answering questions about images, helping robots maneuver around physical spaces and even generating novel visual content. As these tasks and applications have modernized, so too has the reliance on more d…

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: This is a preprint of the following chapter: Ranjay Krishna, Mitchell Gordon, Li Fei-Fei, Michael Bernstein, Visual Intelligence through Human Interaction, published in Artificial Intelligence for Human Computer Interaction: A Modern Approach, edited by Yang Li and Otmar Hilliges, 2021, Springer reproduced with permission of Springer Nature. arXiv admin note: substantial text overlap with arXiv:1602.04506, arXiv:1904.01121

  16. arXiv:2012.11634  [pdf, other]

    cs.AI

    Exploring and Analyzing Machine Commonsense Benchmarks

    Authors: Henrique Santos, Minor Gordon, Zhicheng Liang, Gretchen Forbush, Deborah L. McGuinness

    Abstract: Commonsense question-answering (QA) tasks, in the form of benchmarks, are constantly being introduced for challenging and comparing commonsense QA systems. The benchmarks provide question sets that systems' developers can use to train and test new models before submitting their implementations to official leaderboards. Although these tasks are created to evaluate systems in identified dimensions (…

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: Commonsense Knowledge Graphs Workshop 2021 (CSKGs) @AAAI-21

  17. arXiv:2011.14722  [pdf, other]

    cs.DS nlin.AO q-bio.QM

    Distributed Algorithms from Arboreal Ants for the Shortest Path Problem

    Authors: Shivam Garg, Kirankumar Shiragur, Deborah M. Gordon, Moses Charikar

    Abstract: Colonies of the arboreal turtle ant create networks of trails that link nests and food sources on the graph formed by branches and vines in the canopy of the tropical forest. Ants put down a volatile pheromone on edges as they traverse them. At each vertex, the next edge to traverse is chosen using a decision rule based on the current pheromone level. There is a bidirectional flow of ants around t…

    Submitted 30 June, 2023; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: Changes in presentation. Published in PNAS 2023

  18. A Semantic Framework for Enabling Radio Spectrum Policy Management and Evaluation

    Authors: H. Santos, A. Mulvehill, J. S. Erickson, J. P. McCusker, M. Gordon, O. Xie, S. Stouffer, G. Capraro, A. Pidwerbetsky, J. Burgess, A. Berlinsky, K. Turck, J. Ashdown, D. L. McGuinness

    Abstract: Because radio spectrum is a finite resource, its usage and sharing is regulated by government agencies. These agencies define policies to manage spectrum allocation and assignment across multiple organizations, systems, and devices. With more portions of the radio spectrum being licensed for commercial use, the importance of providing an increased level of automation when evaluating such policies…

    Submitted 8 November, 2020; originally announced November 2020.

    Journal ref: The Semantic Web - ISWC 2020. ISWC 2020. Lecture Notes in Computer Science, vol 12507

  19. Mitiq: A software package for error mitigation on noisy quantum computers

    Authors: Ryan LaRose, Andrea Mari, Sarah Kaiser, Peter J. Karalekas, Andre A. Alves, Piotr Czarnik, Mohamed El Mandouh, Max H. Gordon, Yousef Hindy, Aaron Robertson, Purva Thakre, Misty Wahl, Danny Samuel, Rahul Mistri, Maxime Tremblay, Nick Gardner, Nathaniel T. Stemen, Nathan Shammah, William J. Zeng

    Abstract: We introduce Mitiq, a Python package for error mitigation on noisy quantum computers. Error mitigation techniques can reduce the impact of noise on near-term quantum computers with minimal overhead in quantum resources by relying on a mixture of quantum sampling and classical post-processing techniques. Mitiq is an extensible toolkit of different error mitigation methods, including zero-noise extr…

    Submitted 1 August, 2022; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: 33 pages, 8 figures. The Mitiq GitHub is https://github.com/unitaryfund/mitiq and the Mitiq documentation is https://mitiq.readthedocs.io/en/stable/

    Journal ref: Quantum 6, 774 (2022)

  20. arXiv:2008.12371  [pdf, ps, other]

    eess.IV cs.CV

    Improving the Segmentation of Scanning Probe Microscope Images using Convolutional Neural Networks

    Authors: Steff Farley, Jo E. A. Hodgkinson, Oliver M. Gordon, Joanna Turner, Andrea Soltoggio, Philip J. Moriarty, Eugenie Hunsicker

    Abstract: A wide range of techniques can be considered for segmentation of images of nanostructured surfaces. Manually segmenting these images is time-consuming and results in a user-dependent segmentation bias, while there is currently no consensus on the best automated segmentation methods for particular techniques, image classes, and samples. Any image segmentation approach must minimise the noise in the…

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: 21 pages, 10 figures

  21. arXiv:2006.06316  [pdf, other]

    cs.CV

    RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams

    Authors: Vasiliki Kougia, John Pavlopoulos, Panagiotis Papapetrou, Max Gordon

    Abstract: This paper introduces RTEx, a novel methodology for a) ranking radiography exams based on their probability to contain an abnormality, b) generating abnormality tags for abnormal exams, and c) providing a diagnostic explanation in natural language for each abnormal exam. The task of ranking radiography exams is an important first step for practitioners who want to identify and prioritize those rad…

    Submitted 11 June, 2020; originally announced June 2020.

  22. arXiv:2005.04543  [pdf, other]

    cs.CY cs.AI cs.SI

    Replication Markets: Results, Lessons, Challenges and Opportunities in AI Replication

    Authors: Yang Liu, Michael Gordon, Juntao Wang, Michael Bishop, Yiling Chen, Thomas Pfeiffer, Charles Twardy, Domenico Viganola

    Abstract: The last decade saw the emergence of systematic large-scale replication projects in the social and behavioral sciences (Camerer et al., 2016, 2018; Ebersole et al., 2016; Klein et al., 2014, 2018; Collaboration, 2015). These projects were driven by theoretical and conceptual concerns about a high fraction of "false positives" in the scientific publications (Ioannidis, 2005) (and a high prevalence…

    Submitted 9 May, 2020; originally announced May 2020.

    Comments: Appeared at AAAI workshop on Reproducible AI (RAI), 2020

  23. arXiv:2003.02877  [pdf, other]

    cs.CL

    Distill, Adapt, Distill: Training Small, In-Domain Models for Neural Machine Translation

    Authors: Mitchell A. Gordon, Kevin Duh

    Abstract: We explore best practices for training small, memory efficient machine translation models with sequence-level knowledge distillation in the domain adaptation setting. While both domain adaptation and knowledge distillation are widely-used, their interaction remains little understood. Our large-scale empirical results in machine translation (on three language pairs with three domains each) suggest…

    Submitted 23 June, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted to WNGT 2020 Workshop at ACL 2020 Conference. Code is at http://github.com/mitchellgordon95/kd-aug

  24. arXiv:2002.08307  [pdf, other]

    cs.CL

    Compressing BERT: Studying the Effects of Weight Pruning on Transfer Learning

    Authors: Mitchell A. Gordon, Kevin Duh, Nicholas Andrews

    Abstract: Pre-trained universal feature extractors, such as BERT for natural language processing and VGG for computer vision, have become effective methods for improving deep learning models without requiring more labeled data. While effective, feature extractors like BERT may be prohibitively large for some deployment scenarios. We explore weight pruning for BERT and ask: how does compression during pre-tr…

    Submitted 14 May, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Accepted to Rep4NLP 2020 Workshop at ACL 2020 Conference

  25. arXiv:1912.12121  [pdf, other]

    cs.CV cs.LG stat.ML

    Approximating Human Judgment of Generated Image Quality

    Authors: Y. Alex Kolchinski, Sharon Zhou, Shengjia Zhao, Mitchell Gordon, Stefano Ermon

    Abstract: Generative models have made immense progress in recent years, particularly in their ability to generate high quality images. However, that quality has been difficult to evaluate rigorously, with evaluation dominated by heuristic approaches that do not correlate well with human judgment, such as the Inception Score and Fréchet Inception Distance. Real human labels have also been used in evaluation,…

    Submitted 30 November, 2019; originally announced December 2019.

    Comments: To appear in the Shared Visual Representations in Human and Machine Intelligence workshop at NeurIPS 2019. The first two authors contributed equally to the manuscript

  26. arXiv:1912.03334  [pdf, other]

    cs.CL

    Explaining Sequence-Level Knowledge Distillation as Data-Augmentation for Neural Machine Translation

    Authors: Mitchell A. Gordon, Kevin Duh

    Abstract: Sequence-level knowledge distillation (SLKD) is a model compression technique that leverages large, accurate teacher models to train smaller, under-parameterized student models. Why does pre-processing MT data with SLKD help us train smaller models? We test the common hypothesis that SLKD addresses a capacity deficiency in students by "simplifying" noisy data points and find it unlikely in our cas…

    Submitted 6 December, 2019; originally announced December 2019.

  27. Goal-setting And Achievement In Activity Tracking Apps: A Case Study Of MyFitnessPal

    Authors: Mitchell L. Gordon, Tim Althoff, Jure Leskovec

    Abstract: Activity tracking apps often make use of goals as one of their core motivational tools. There are two critical components to this tool: setting a goal, and subsequently achieving that goal. Despite its crucial role in how a number of prominent self-tracking apps function, there has been relatively little investigation of the goal-setting and achievement aspects of self-tracking apps. Here we exp…

    Submitted 4 April, 2019; originally announced April 2019.

    Journal ref: WWW 2019: The Web Conference 2019

  28. arXiv:1904.01121  [pdf, other]

    cs.CV cs.HC cs.LG

    HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models

    Authors: Sharon Zhou, Mitchell L. Gordon, Ranjay Krishna, Austin Narcomey, Li Fei-Fei, Michael S. Bernstein

    Abstract: Generative models often use human evaluations to measure the perceived quality of their outputs. Automated metrics are noisy indirect proxies, because they rely on heuristics or pretrained embeddings. However, up until now, direct human evaluation strategies have been ad-hoc, neither standardized nor validated. Our work establishes a gold standard human benchmark for generative realism. We constru…

    Submitted 31 October, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: https://hype.stanford.edu

  29. An efficient MPI/OpenMP parallelization of the Hartree-Fock method for the second generation of Intel Xeon Phi processor

    Authors: Vladimir Mironov, Yuri Alexeev, Kristopher Keipert, Michael D'mello, Alexander Moskovsky, Mark S. Gordon

    Abstract: Modern OpenMP threading techniques are used to convert the MPI-only Hartree-Fock code in the GAMESS program to a hybrid MPI/OpenMP algorithm. Two separate implementations that differ by the sharing or replication of key data structures among threads are considered, density and Fock matrices. All implementations are benchmarked on a super-computer of 3,000 Intel Xeon Phi processors. With 64 cores p…

    Submitted 14 August, 2017; v1 submitted 31 July, 2017; originally announced August 2017.

    Comments: SC17 conference paper, 12 pages, 7 figures

    ACM Class: D.1.3; J.2

  30. arXiv:1701.07479  [pdf, other]

    physics.soc-ph cs.SI

    Epidemiological modeling of the 2005 French riots: a spreading wave and the role of contagion

    Authors: Laurent Bonnasse-Gahot, Henri Berestycki, Marie-Aude Depuiset, Mirta B. Gordon, Sebastian Roché, Nancy Rodriguez, Jean-Pierre Nadal

    Abstract: As a large-scale instance of dramatic collective behaviour, the 2005 French riots started in a poor suburb of Paris, then spread in all of France, lasting about three weeks. Remarkably, although there were no displacements of rioters, the riot activity did travel. Access to daily national police data has allowed us to explore the dynamics of riot propagation. Here we show that an epidemic-like mod…

    Submitted 8 January, 2018; v1 submitted 25 January, 2017; originally announced January 2017.

    Comments: Authors version of the manuscript accepted for publication (2 Dec. 2017). 21 pages, 8 figures, 10 SI pages, 7 SI figures, 2 SI Tables, 4 SI videos - the SI videos are included in the source package, and are also available as online SI videos with the published paper

    Journal ref: Scientific Reports, Nature Publishing Group, 2018, 8 (1), #107

  31. arXiv:1611.08024  [pdf, other]

    cs.LG q-bio.NC stat.ML

    EEGNet: A Compact Convolutional Network for EEG-based Brain-Computer Interfaces

    Authors: Vernon J. Lawhern, Amelia J. Solon, Nicholas R. Waytowich, Stephen M. Gordon, Chou P. Hung, Brent J. Lance

    Abstract: Brain computer interfaces (BCI) enable direct communication with a computer, using neural activity as the control signal. This neural signal is generally chosen from a variety of well-studied electroencephalogram (EEG) signals. For a given BCI paradigm, feature extractors and classifiers are tailored to the distinct characteristics of its expected EEG control signal, limiting its application to th…

    Submitted 15 May, 2018; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: 30 pages, 10 figures. Added additional feature relevance analyses. Minor change to EEGNet architecture. Source code can be found at https://github.com/vlawhern/arl-eegmodels

  32. arXiv:1506.07118  [pdf, ps, other]

    cs.DC

    Idle Ants Have a Role

    Authors: Yehuda Afek, Deborah M. Gordon, Moshe Sulamy

    Abstract: Using elementary distributed computing techniques we suggest an explanation for two unexplained phenomena in regards to ant colonies, (a) a substantial amount of ants in an ant colony are idle, and (b) the observed low survivability of new ant colonies in nature. Ant colonies employ task allocation, in which ants progress from one task to the other, to meet changing demands introduced by the envir…

    Submitted 20 May, 2016; v1 submitted 23 June, 2015; originally announced June 2015.

  33. arXiv:0906.0470  [pdf, ps, other]

    cs.LG

    An optimal linear separator for the Sonar Signals Classification task

    Authors: Juan-Manuel Torres-Moreno, Mirta B. Gordon

    Abstract: The problem of classifying sonar signals from rocks and mines first studied by Gorman and Sejnowski has become a benchmark against which many learning algorithms have been tested. We show that both the training set and the test set of this benchmark are linearly separable, although with different hyperplanes. Moreover, the complete set of learning and test patterns together, is also linearly sep…

    Submitted 2 June, 2009; originally announced June 2009.

    Comments: 8 pages, 6 tables

  34. arXiv:0904.4587  [pdf, ps, other]

    cs.AI cs.NE

    Adaptive Learning with Binary Neurons

    Authors: Juan-Manuel Torres-Moreno, Mirta B. Gordon

    Abstract: An efficient incremental learning algorithm for classification tasks, called NetLines, well adapted for both binary and real-valued input patterns is presented. It generates small compact feedforward neural networks with one hidden layer of binary units and binary output units. A convergence theorem ensures that solutions with a finite number of hidden units exist for both binary and real-valued…

    Submitted 29 April, 2009; originally announced April 2009.

    Comments: 29 pages, 7 figures

  35. arXiv:0904.0776  [pdf]

    stat.ML cs.LG

    Induction of High-level Behaviors from Problem-solving Traces using Machine Learning Tools

    Authors: Vivien Robinet, Gilles Bisson, Mirta B. Gordon, Benoît Lemaire

    Abstract: This paper applies machine learning techniques to student modeling. It presents a method for discovering high-level student behaviors from a very large set of low-level traces corresponding to problem-solving actions in a learning environment. Basic actions are encoded into sets of domain-dependent attribute-value patterns called cases. Then a domain-independent hierarchical clustering identifie…

    Submitted 5 April, 2009; originally announced April 2009.

    Journal ref: IEEE Intelligent Systems 22, 4 (2007) 22

  36. arXiv:0806.3284  [pdf, ps, other]

    cs.IT

    Optimal hash functions for approximate closest pairs on the n-cube

    Authors: Daniel M. Gordon, Victor Miller, Peter Ostapenko

    Abstract: One way to find closest pairs in large datasets is to use hash functions. In recent years locality-sensitive hash functions for various metrics have been given: projecting an n-cube onto k bits is a simple hash function that performs well. In this paper we investigate alternatives to projection. For various parameters hash functions given by complete decoding algorithms for codes work better, and…

    Submitted 15 October, 2009; v1 submitted 20 June, 2008; originally announced June 2008.

    Comments: IEEE Transactions on Information Theory, to appear

  37. arXiv:cond-mat/9703183  [pdf, ps, other]

    cond-mat.stat-mech cond-mat.dis-nn cs.AI cs.LG

    Finite size scaling of the bayesian perceptron

    Authors: A. Buhot, J.-M. Torres Moreno, M. B. Gordon

    Abstract: We study numerically the properties of the bayesian perceptron through a gradient descent on the optimal cost function. The theoretical distribution of stabilities is deduced. It predicts that the optimal generalizer lies close to the boundary of the space of (error-free) solutions. The numerical simulations are in good agreement with the theoretical distribution. The extrapolation of the genera…

    Submitted 20 March, 1997; originally announced March 1997.

    Comments: RevTeX, 7 pages, 7 figures, submitted to Phys. Rev. E