Showing 1–24 of 24 results for author: Roman, A

Searching in archive cs.
  1. arXiv:2407.00805

    cs.AI

    Towards shutdownable agents via stochastic choice

    Authors: Elliott Thornley, Alexander Roman, Christos Ziakas, Leyton Ho, Louis Thomson

    Abstract: Some worry that advanced artificial agents may resist being shut down. The Incomplete Preferences Proposal (IPP) is an idea for ensuring that doesn't happen. A key part of the IPP is using a novel 'Discounted REward for Same-Length Trajectories (DREST)' reward function to train agents to (1) pursue goals effectively conditional on each trajectory-length (be 'USEFUL'), and (2) choose stochastically…

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2404.16710

    cs.CL cs.AI cs.LG

    LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

    Authors: Mostafa Elhoushi, Akshat Shrivastava, Diana Liskovich, Basil Hosmer, Bram Wasti, Liangzhen Lai, Anas Mahmoud, Bilge Acun, Saurabh Agarwal, Ahmed Roman, Ahmed A Aly, Beidi Chen, Carole-Jean Wu

    Abstract: We present LayerSkip, an end-to-end solution to speed up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for earlier layers and higher dropout rates for later layers, and an early exit loss where all transformer layers share the same exit. Second, during inference, we show that this training recipe increases the accuracy of early exi…

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Code open sourcing is in progress

  3. arXiv:2402.02633

    cs.CL cs.LG

    Predicting Machine Translation Performance on Low-Resource Languages: The Role of Domain Similarity

    Authors: Eric Khiu, Hasti Toossi, David Anugraha, Jinyu Liu, Jiaxu Li, Juan Armando Parra Flores, Leandro Acros Roman, A. Seza Doğruöz, En-Shiun Annie Lee

    Abstract: Fine-tuning and testing a multilingual large language model is expensive and challenging for low-resource languages (LRLs). While previous studies have predicted the performance of natural language processing (NLP) tasks using machine learning methods, they primarily focus on high-resource languages, overlooking LRLs and shifts across domains. Focusing on LRLs, we investigate three factors: the si…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures, accepted to EACL 2024, findings

  4. arXiv:2401.17129

    cs.SD cs.AI eess.AS

    Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes

    Authors: Adrian S. Roman, Baladithya Balamurugan, Rithik Pothuganti

    Abstract: This technical report details our work towards building an enhanced audio-visual sound event localization and detection (SELD) network. We build on top of the audio-only SELDnet23 model and adapt it to be audio-visual by merging both audio and video information prior to the gated recurrent unit (GRU) of the audio-only network. Our model leverages YOLO and DETIC object detectors. We also build a fr…

    Submitted 29 January, 2024; originally announced January 2024.

  5. arXiv:2401.12238

    eess.AS cs.LG cs.SD

    Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms

    Authors: Iran R. Roman, Christopher Ick, Sivan Ding, Adrian S. Roman, Brian McFee, Juan P. Bello

    Abstract: Sound event localization and detection (SELD) is an important task in machine listening. Major advancements rely on simulated data with sound events in specific rooms and strong spatio-temporal labels. SELD data is simulated by convolving spatially-localized room impulse responses (RIRs) with sound waveforms to place sound events in a soundscape. However, RIRs require manual collection in specific…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 5 pages, 4 figures, 1 table, to be presented at ICASSP 2024 in Seoul, South Korea

  6. arXiv:2401.08717

    cs.SD eess.AS

    Robust DOA estimation using deep acoustic imaging

    Authors: Adrian S. Roman, Iran R. Roman, Juan P. Bello

    Abstract: Direction of arrival estimation (DoAE) aims at tracking a sound in azimuth and elevation. Recent advancements include data-driven models with inputs derived from ambisonics intensity vectors or correlations between channels in a microphone array. A spherical intensity map (SIM), or acoustic image, is an alternative input representation that remains underexplored. SIMs benefit from high-resolution…

    Submitted 15 January, 2024; originally announced January 2024.

  7. arXiv:2312.06153

    cs.LG cs.AI cs.HC

    Open Datasheets: Machine-readable Documentation for Open Datasets and Responsible AI Assessments

    Authors: Anthony Cintron Roman, Jennifer Wortman Vaughan, Valerie See, Steph Ballard, Jehu Torres, Caleb Robinson, Juan M. Lavista Ferres

    Abstract: This paper introduces a no-code, machine-readable documentation framework for open datasets, with a focus on responsible AI (RAI) considerations. The framework aims to improve the comprehensibility and usability of open datasets, facilitating easier discovery and use, better understanding of content and context, and evaluation of dataset quality and accuracy. The proposed framework is designed to str…

    Submitted 27 March, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  8. arXiv:2309.07860

    hep-ph cs.LG hep-th math-ph math.GR

    Identifying the Group-Theoretic Structure of Machine-Learned Symmetries

    Authors: Roy T. Forestano, Konstantin T. Matchev, Katia Matcheva, Alexander Roman, Eyup B. Unlu, Sarunas Verner

    Abstract: Deep learning was recently successfully used in deriving symmetry transformations that preserve important physics quantities. Being completely agnostic, these techniques postpone the identification of the discovered symmetries to a later stage. In this letter we propose methods for examining and identifying the group-theoretic structure of such machine-learned symmetries. We design loss functions…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 10 pages, 8 figures, 2 tables

  9. arXiv:2309.00329

    cs.SD cs.LG cs.SE eess.AS

    Mi-Go: Test Framework which uses YouTube as Data Source for Evaluating Speech Recognition Models like OpenAI's Whisper

    Authors: Tomasz Wojnar, Jaroslaw Hryszko, Adam Roman

    Abstract: This article introduces Mi-Go, a novel testing framework aimed at evaluating the performance and adaptability of general-purpose speech recognition machine learning models across diverse real-world scenarios. The framework leverages YouTube as a rich and continuously updated data source, accounting for multiple languages, accents, dialects, speaking styles, and audio quality levels. To demonstrate…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 25 pages, 9 tables, 3 figures

  10. arXiv:2307.04891

    hep-th cs.LG hep-ph math-ph math.GR

    Accelerated Discovery of Machine-Learned Symmetries: Deriving the Exceptional Lie Groups G2, F4 and E6

    Authors: Roy T. Forestano, Konstantin T. Matchev, Katia Matcheva, Alexander Roman, Eyup B. Unlu, Sarunas Verner

    Abstract: Recent work has applied supervised deep learning to derive continuous symmetry transformations that preserve the data labels and to obtain the corresponding algebras of symmetry generators. This letter introduces two improved algorithms that significantly speed up the discovery of these symmetry transformations. The new methods are demonstrated by deriving the complete set of generators for the un…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 11 pages, 7 figures

  11. arXiv:2306.06191

    cs.LG cs.IR

    Open Data on GitHub: Unlocking the Potential of AI

    Authors: Anthony Cintron Roman, Kevin Xu, Arfon Smith, Jehu Torres Vega, Caleb Robinson, Juan M Lavista Ferres

    Abstract: GitHub is the world's largest platform for collaborative software development, with over 100 million users. GitHub is also used extensively for open data collaboration, hosting more than 800 million open data files, totaling 142 terabytes of data. This study highlights the potential of open data on GitHub and demonstrates how it can accelerate AI research. We analyze the existing landscape of open…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: In submission to NeurIPS 2023 Track Datasets and Benchmarks

  12. arXiv:2302.05383

    hep-ph cs.LG math-ph math.GR

    Discovering Sparse Representations of Lie Groups with Machine Learning

    Authors: Roy T. Forestano, Konstantin T. Matchev, Katia Matcheva, Alexander Roman, Eyup B. Unlu, Sarunas Verner

    Abstract: Recent work has used deep learning to derive symmetry transformations, which preserve conserved quantities, and to obtain the corresponding algebras of generators. In this letter, we extend this technique to derive sparse representations of arbitrary Lie algebras. We show that our method reproduces the canonical (sparse) representations of the generators of the Lorentz group, as well as the…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: 14 pages, 6 figures

  13. arXiv:2302.00806

    cs.LG hep-ph math.GR stat.ML

    Oracle-Preserving Latent Flows

    Authors: Alexander Roman, Roy T. Forestano, Konstantin T. Matchev, Katia Matcheva, Eyup B. Unlu

    Abstract: We develop a deep learning methodology for the simultaneous discovery of multiple nontrivial continuous symmetries across an entire labelled dataset. The symmetry transformations and the corresponding generators are modeled with fully connected neural networks trained with a specially constructed loss function ensuring the desired symmetry properties. The two new elements in this work are the use…

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: 9 pages, 8 figures

  14. arXiv:2301.05638

    hep-ph cs.LG physics.data-an

    Deep Learning Symmetries and Their Lie Groups, Algebras, and Subalgebras from First Principles

    Authors: Roy T. Forestano, Konstantin T. Matchev, Katia Matcheva, Alexander Roman, Eyup Unlu, Sarunas Verner

    Abstract: We design a deep-learning algorithm for the discovery and identification of the continuous group of symmetries present in a labeled dataset. We use fully connected neural networks to model the symmetry transformations and the corresponding generators. We construct loss functions that ensure that the applied transformations are symmetries and that the corresponding set of generators forms a closed…

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: 21 pages, 26 figures with 47 panels

  15. arXiv:2207.00962

    physics.data-an cs.IT math.ST

    Low probability states, data statistics, and entropy estimation

    Authors: Damián G. Hernández, Ahmed Roman, Ilya Nemenman

    Abstract: A fundamental problem in analysis of complex systems is getting a reliable estimate of entropy of their probability distributions over the state space. This is difficult because unsampled states can contribute substantially to the entropy, while they do not contribute to the Maximum Likelihood estimator of entropy, which replaces probabilities by the observed frequencies. Bayesian estimators overc…

    Submitted 3 July, 2022; originally announced July 2022.

  16. arXiv:2201.02696

    astro-ph.EP cs.LG physics.data-an

    Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet Transmission Spectra

    Authors: Konstantin T. Matchev, Katia Matcheva, Alexander Roman

    Abstract: Transit spectroscopy is a powerful tool to decode the chemical composition of the atmospheres of extrasolar planets. In this paper we focus on unsupervised techniques for analyzing spectral data from transiting exoplanets. We demonstrate methods for i) cleaning and validating the data, ii) initial exploratory data analysis based on summary statistics (estimates of location and variability), iii) e…

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: 10 pages, 11 figures, submitted to MNRAS

  17. arXiv:2112.11600

    astro-ph.EP cs.LG cs.SC physics.data-an

    Analytical Modelling of Exoplanet Transit Spectroscopy with Dimensional Analysis and Symbolic Regression

    Authors: Konstantin T. Matchev, Katia Matcheva, Alexander Roman

    Abstract: The physical characteristics and atmospheric chemical composition of newly discovered exoplanets are often inferred from their transit spectra which are obtained from complex numerical models of radiative transfer. Alternatively, simple analytical expressions provide insightful physical intuition into the relevant atmospheric processes. The deep learning revolution has opened the door for deriving…

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: Submitted to AAS Journals, 24 pages, 7 figures

  18. arXiv:2106.05215

    cs.CV

    A machine learning pipeline for aiding school identification from child trafficking images

    Authors: Sumit Mukherjee, Tina Sederholm, Anthony C. Roman, Ria Sankar, Sherrie Caltagirone, Juan Lavista Ferres

    Abstract: Child trafficking is a serious problem around the world. Every year there are more than 4 million victims of child trafficking around the world, many of them for the purposes of child sexual exploitation. In collaboration with UK Police and a non-profit focused on child abuse prevention, Global Emancipation Network, we developed a proof-of-concept machine learning pipeline to aid the identificatio…

    Submitted 9 June, 2021; originally announced June 2021.

  19. Machine Learning and Glioblastoma: Treatment Response Monitoring Biomarkers in 2021

    Authors: Thomas Booth, Bernice Akpinar, Andrei Roman, Haris Shuaib, Aysha Luis, Alysha Chelliah, Ayisha Al Busaidi, Ayesha Mirchandani, Burcu Alparslan, Nina Mansoor, Keyoumars Ashkan, Sebastien Ourselin, Marc Modat

    Abstract: The aim of the systematic review was to assess recently published studies on diagnostic test accuracy of glioblastoma treatment response monitoring biomarkers in adults, developed through machine learning (ML). Articles were searched for using MEDLINE, EMBASE, and the Cochrane Register. Included study participants were adult patients with high grade glioma who had undergone standard treatment (max…

    Submitted 15 April, 2021; originally announced April 2021.

    Journal ref: Kia S.M. et al. (eds) Machine Learning in Clinical Neuroimaging and Radiogenomics in Neuro-oncology. MLCN 2020, RNO-AI 2020. Lecture Notes in Computer Science, vol 12449

  20. arXiv:2008.04137

    cs.LG stat.ML

    SplitNN-driven Vertical Partitioning

    Authors: Iker Ceballos, Vivek Sharma, Eduardo Mugica, Abhishek Singh, Alberto Roman, Praneeth Vepakomma, Ramesh Raskar

    Abstract: In this work, we introduce SplitNN-driven Vertical Partitioning, a configuration of a distributed deep learning method called SplitNN to facilitate learning from vertically distributed features. SplitNN does not share raw data or model details with collaborating institutions. The proposed configuration allows training among institutions holding diverse sources of data without the need of complex e…

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: First version, please provide feedback

  21. arXiv:1910.12086

    cs.SD cs.LG eess.AS stat.ML

    A holistic approach to polyphonic music transcription with neural networks

    Authors: Miguel A. Román, Antonio Pertusa, Jorge Calvo-Zaragoza

    Abstract: We present a framework based on neural networks to extract music scores directly from polyphonic audio in an end-to-end fashion. Most previous Automatic Music Transcription (AMT) methods seek a piano-roll representation of the pitches, that can be further transformed into a score by incorporating tempo estimation, beat tracking, key estimation or rhythm quantization. Unlike these methods, our appr…

    Submitted 26 October, 2019; originally announced October 2019.

    Comments: Source code available at https://github.com/mangelroman/audio2score

  22. arXiv:1412.0799

    cs.FL

    Complexity of Road Coloring with Prescribed Reset Words

    Authors: Vojtěch Vorel, Adam Roman

    Abstract: By the Road Coloring Theorem (Trahtman, 2008), the edges of any aperiodic directed multigraph with a constant out-degree can be colored such that the resulting automaton admits a reset word. There may also be a need for a particular reset word to be admitted. For certain words it is NP-complete to decide whether there is a suitable coloring of a given multigraph. We present a classification of all…

    Submitted 2 December, 2014; originally announced December 2014.

    Comments: To be presented at LATA 2015

  23. arXiv:1403.4749

    cs.FL

    Parameterized Complexity of Synchronization and Road Coloring

    Authors: Vojtěch Vorel, Adam Roman

    Abstract: First, we close the multivariate analysis of a canonical problem concerning short reset words (SYN), as it was started by Fernau et al. (2013). Namely, we prove that the problem, parameterized by the number of states, does not admit a polynomial kernel unless the polynomial hierarchy collapses. Second, we consider a related canonical problem concerning synchronizing road colorings (SRCP). Here we…

    Submitted 23 June, 2014; v1 submitted 19 March, 2014; originally announced March 2014.

    ACM Class: F.1.1; F.2.2

  24. arXiv:1201.0418

    math.ST cs.IT math.PR

    A New Family of Bounded Divergence Measures and Application to Signal Detection

    Authors: Shivakumar Jolad, Ahmed Roman, Mahesh C. Shastry, Mihir Gadgil, Ayanendranath Basu

    Abstract: We introduce a new one-parameter family of divergence measures, called bounded Bhattacharyya distance (BBD) measures, for quantifying the dissimilarity between probability distributions. These measures are bounded, symmetric and positive semi-definite and do not require absolute continuity. In the asymptotic limit, BBD measure approaches the squared Hellinger distance. A generalized BBD measure fo…

    Submitted 9 April, 2016; v1 submitted 1 January, 2012; originally announced January 2012.

    Comments: 12 pages, 4 figures

    MSC Class: 94A17; 94A12; 94B70; 97K50 ACM Class: G.3; H.1.1

    Journal ref: Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2016), pages 72-83, ISBN: 978-989-758-173-1