Showing 1–21 of 21 results for author: O'Connor, M

Searching in archive cs.
  1. arXiv:2312.09107  [pdf]

    cs.DL

    A Comprehensive Approach to Ensuring Quality in Spreadsheet-Based Metadata

    Authors: Martin J. O'Connor, Marcos Martínez-Romero, Mete Ugur Akdogan, Josef Hardi, Mark A. Musen

    Abstract: While scientists increasingly recognize the importance of metadata in describing their data, spreadsheets remain the preferred tool for supplying this information despite their limitations in ensuring compliance and quality. Various tools have been developed to address these limitations, but they suffer from their own shortcomings, such as steep learning curves and limited customization. In this p…

    Submitted 14 December, 2023; originally announced December 2023.

  2. Extending GCC-PHAT using Shift Equivariant Neural Networks

    Authors: Axel Berg, Mark O'Connor, Kalle Åström, Magnus Oskarsson

    Abstract: Speaker localization using microphone arrays depends on accurate time delay estimation techniques. For decades, methods based on the generalized cross correlation with phase transform (GCC-PHAT) have been widely adopted for this purpose. Recently, the GCC-PHAT has also been used to provide input features to neural networks in order to remove the effects of noise and reverberation, but at the cost…

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: Proceedings of INTERSPEECH

    Journal ref: Proc. Interspeech 2022, 1791-1795

  3. arXiv:2208.02836  [pdf]

    cs.DL

    Modeling community standards for metadata as templates makes data FAIR

    Authors: Mark A. Musen, Martin J. O'Connor, Erik Schultes, Marcos Martinez-Romero, Josef Hardi, John Graybeal

    Abstract: It is challenging to determine whether datasets are findable, accessible, interoperable, and reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria regarding the metadata used to annotate datasets. Specifically, the FAIR principles require metadata to be "rich" and to adhere to "domain-relevant" community standards. Scientific communities should be able to defin…

    Submitted 14 October, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: 20 pages, 1 table, 5 figures

  4. arXiv:2204.03957  [pdf, other]

    cs.CV

    Points to Patches: Enabling the Use of Self-Attention for 3D Shape Recognition

    Authors: Axel Berg, Magnus Oskarsson, Mark O'Connor

    Abstract: While the Transformer architecture has become ubiquitous in the machine learning field, its adaptation to 3D shape recognition is non-trivial. Due to its quadratic computational complexity, the self-attention operator quickly becomes inefficient as the set of input points grows larger. Furthermore, we find that the attention mechanism struggles to find useful connections between individual points…

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted to the 26th International Conference on Pattern Recognition

  5. arXiv:2104.00769  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Keyword Transformer: A Self-Attention Model for Keyword Spotting

    Authors: Axel Berg, Mark O'Connor, Miguel Tairum Cruz

    Abstract: The Transformer architecture has been successful across many domains, including natural language processing, computer vision and speech recognition. In keyword spotting, self-attention has primarily been used on top of convolutional or recurrent encoders. We investigate a range of ways to adapt the Transformer architecture to keyword spotting and introduce the Keyword Transformer (KWT), a fully se…

    Submitted 15 June, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: Proceedings of INTERSPEECH

    Journal ref: Proc. Interspeech 2021, 4249-4253

  6. arXiv:2012.14294  [pdf, other]

    cs.CY cs.DC cs.NI

    I-Health: Leveraging Edge Computing and Blockchain for Epidemic Management

    Authors: Alaa Awad Abdellatif, Lutfi Samara, Amr Mohamed, Aiman Erbad, Carla Fabiana Chiasserini, Mohsen Guizani, Mark Dennis O'Connor, James Laughton

    Abstract: Epidemic situations typically demand intensive data collection and management from different locations/entities within a strict time constraint. Such demand can be fulfilled by leveraging the intensive and easy deployment of the Internet of Things (IoT) devices. The management and containment of such situations also rely on cross-organizational and national collaboration. Thus, this paper proposes…

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: A version of this paper has been submitted in IEEE Internet of Things Journal. arXiv admin note: text overlap with arXiv:2006.10843

  7. Deep Ordinal Regression with Label Diversity

    Authors: Axel Berg, Magnus Oskarsson, Mark O'Connor

    Abstract: Regression via classification (RvC) is a common method used for regression problems in deep learning, where the target variable belongs to a set of continuous values. By discretizing the target into a set of non-overlapping classes, it has been shown that training a classifier can improve neural network accuracy compared to using a standard regression approach. However, it is not clear how the set…

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: Accepted to ICPR2020

    Journal ref: 2020 25th International Conference on Pattern Recognition (ICPR), 2021, pp. 2740-2747

  8. arXiv:2005.03807  [pdf, other]

    cs.LG stat.ML

    Variance Constrained Autoencoding

    Authors: D. T. Braithwaite, M. O'Connor, W. B. Kleijn

    Abstract: Recent state-of-the-art autoencoder based generative models have an encoder-decoder structure and learn a latent representation with a pre-defined distribution that can be sampled from. Implementing the encoder networks of these models in a stochastic manner provides a natural and common approach to avoid overfitting and enforce a smooth decoder function. However, we show that for stochastic encod…

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: 20 pages

  9. The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata that Describe Scientific Experiments

    Authors: Rafael S. Gonçalves, Martin J. O'Connor, Marcos Martínez-Romero, Attila L. Egyedi, Debra Willrett, John Graybeal, Mark A. Musen

    Abstract: The Center for Expanded Data Annotation and Retrieval (CEDAR) aims to revolutionize the way that metadata describing scientific experiments are authored. The software we have developed--the CEDAR Workbench--is a suite of Web-based tools and REST APIs that allows users to construct metadata templates, to fill in templates to generate high-quality metadata, and to share and manage these resources. T…

    Submitted 15 May, 2019; originally announced May 2019.

  10. DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis

    Authors: Sangkug Lym, Donghyuk Lee, Mike O'Connor, Niladrish Chatterjee, Mattan Erez

    Abstract: Training convolutional neural networks (CNNs) requires intense compute throughput and high memory bandwidth. Especially, convolution layers account for the majority of the execution time of CNN training, and GPUs are commonly used to accelerate these layer workloads. GPU design optimization for efficient CNN training acceleration requires the accurate modeling of how their performance improves whe…

    Submitted 2 April, 2019; originally announced April 2019.

  11. Using association rule mining and ontologies to generate metadata recommendations from multiple biomedical databases

    Authors: Marcos Martínez-Romero, Martin J. O'Connor, Attila L. Egyedi, Debra Willrett, Josef Hardi, John Graybeal, Mark A. Musen

    Abstract: Metadata-the machine-readable descriptions of the data-are increasingly seen as crucial for describing the vast array of biomedical datasets that are currently being deposited in public repositories. While most public repositories have firm requirements that metadata must accompany submitted datasets, the quality of those metadata is generally very poor. A key problem is that the typical metadata…

    Submitted 21 March, 2019; originally announced March 2019.

  12. arXiv:1903.02596  [pdf, other]

    cs.AR

    Buddy Compression: Enabling Larger Memory for Deep Learning and HPC Workloads on GPUs

    Authors: Esha Choukse, Michael Sullivan, Mike O'Connor, Mattan Erez, Jeff Pool, David Nellans, Steve Keckler

    Abstract: GPUs offer orders-of-magnitude higher memory bandwidth than traditional CPU-only systems. However, GPU device memory tends to be relatively small and the memory capacity can not be increased by the user. This paper describes Buddy Compression, a scheme to increase both the effective GPU memory capacity and bandwidth while avoiding the downsides of conventional memory-expanding strategies. Buddy Co…

    Submitted 15 April, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

  13. arXiv:1902.01827  [pdf]

    physics.chem-ph cs.HC physics.bio-ph physics.ed-ph

    Interactive molecular dynamics in virtual reality from quantum chemistry to drug binding: An open-source multi-person framework

    Authors: Michael O'Connor, Simon J. Bennie, Helen M. Deeks, Alexander Jamieson-Binnie, Alex J. Jones, Robin J. Shannon, Rebecca Walters, Thomas J. Mitchell, Adrian J. Mulholland, David R. Glowacki

    Abstract: As molecular scientists have made progress in their ability to engineer nano-scale molecular structure, we are facing new challenges in our ability to engineer molecular dynamics (MD) and flexibility. Dynamics at the molecular scale differs from the familiar mechanics of everyday objects, because it involves a complicated, highly correlated, and three-dimensional many-body dynamical choreography w…

    Submitted 1 May, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

  14. arXiv:1807.05102  [pdf, other]

    cs.AR

    What Your DRAM Power Models Are Not Telling You: Lessons from a Detailed Experimental Study

    Authors: Saugata Ghose, Abdullah Giray Yağlıkçı, Raghav Gupta, Donghyuk Lee, Kais Kudrolli, William X. Liu, Hasan Hassan, Kevin K. Chang, Niladrish Chatterjee, Aditya Agrawal, Mike O'Connor, Onur Mutlu

    Abstract: Main memory (DRAM) consumes as much as half of the total system power in a computer today, resulting in a growing need to develop new DRAM architectures and systems that consume less power. Researchers have long relied on DRAM power models that are based off of standardized current measurements provided by vendors, called IDD values. Unfortunately, we find that these models are highly inaccurate,…

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: presented at SIGMETRICS 2018

  15. arXiv:1807.02391  [pdf, other]

    cs.IR cs.CL cs.LG stat.ML

    Extracting Actionable Knowledge from Domestic Violence Discourses on Social Media

    Authors: Sudha Subramani, Manjula O'Connor

    Abstract: Domestic Violence (DV) is considered a big social issue, and there exists a strong relationship between DV and health impacts on the public. Existing research studies have focused on social media to track and analyse real world events like emerging trends, natural disasters, user sentiment analysis, political opinions, and health care. However, less attention has been given to social welfare issues…

    Submitted 4 July, 2018; originally announced July 2018.

  16. arXiv:1805.03175  [pdf, other]

    cs.AR

    Voltron: Understanding and Exploiting the Voltage-Latency-Reliability Trade-Offs in Modern DRAM Chips to Improve Energy Efficiency

    Authors: Kevin K. Chang, Abdullah Giray Yağlıkçı, Saugata Ghose, Aditya Agrawal, Niladrish Chatterjee, Abhijith Kashyap, Donghyuk Lee, Mike O'Connor, Hasan Hassan, Onur Mutlu

    Abstract: This paper summarizes our work on experimental characterization and analysis of reduced-voltage operation in modern DRAM chips, which was published in SIGMETRICS 2017, and examines the work's significance and future potential. We take a comprehensive approach to understanding and exploiting the latency and reliability characteristics of modern DRAM when the DRAM supply voltage is lowered below t…

    Submitted 8 May, 2018; originally announced May 2018.

  17. arXiv:1708.01286  [pdf]

    cs.DB

    Metadata in the BioSample Online Repository are Impaired by Numerous Anomalies

    Authors: Rafael S. Gonçalves, Martin J. O'Connor, Marcos Martínez-Romero, John Graybeal, Mark A. Musen

    Abstract: The metadata about scientific experiments are crucial for finding, reproducing, and reusing the data that the metadata describe. We present a study of the quality of the metadata stored in BioSample--a repository of metadata about samples used in biomedical experiments managed by the U.S. National Center for Biotechnology Information (NCBI). We tested whether 6.6 million BioSample metadata…

    Submitted 3 August, 2017; originally announced August 2017.

  18. Understanding Reduced-Voltage Operation in Modern DRAM Chips: Characterization, Analysis, and Mechanisms

    Authors: Kevin K. Chang, Abdullah Giray Yağlıkçı, Saugata Ghose, Aditya Agrawal, Niladrish Chatterjee, Abhijith Kashyap, Donghyuk Lee, Mike O'Connor, Hasan Hassan, Onur Mutlu

    Abstract: The energy consumption of DRAM is a critical concern in modern computing systems. Improvements in manufacturing process technology have allowed DRAM vendors to lower the DRAM supply voltage conservatively, which reduces some of the DRAM energy consumption. We would like to reduce the DRAM supply voltage more aggressively, to further reduce energy. Aggressive supply voltage reduction requires a tho…

    Submitted 29 May, 2017; originally announced May 2017.

    Comments: 25 pages, 25 figures, 7 tables, Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS)

  19. arXiv:1705.01626  [pdf, other]

    cs.LG cs.AR

    Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks

    Authors: Minsoo Rhu, Mike O'Connor, Niladrish Chatterjee, Jeff Pool, Stephen W. Keckler

    Abstract: Popular deep learning frameworks require users to fine-tune their memory usage so that the training data of a deep neural network (DNN) fits within the GPU physical memory. Prior work tries to address this restriction by virtualizing the memory usage of DNNs, enabling both CPU and GPU memory to be utilized for memory allocations. Despite its merits, virtualizing memory can incur significant perfor…

    Submitted 3 May, 2017; originally announced May 2017.

  20. NCBO Ontology Recommender 2.0: An Enhanced Approach for Biomedical Ontology Recommendation

    Authors: Marcos Martinez-Romero, Clement Jonquet, Martin J. O'Connor, John Graybeal, Alejandro Pazos, Mark A. Musen

    Abstract: Biomedical researchers use ontologies to annotate their data with ontology terms, enabling better data integration and interoperability. However, the number, variety and complexity of current biomedical ontologies make it cumbersome for researchers to determine which ones to reuse for their specific needs. To overcome this problem, in 2010 the National Center for Biomedical Ontology (NCBO) release…

    Submitted 25 May, 2017; v1 submitted 17 November, 2016; originally announced November 2016.

    Comments: 29 pages, 8 figures, 11 tables

    ACM Class: I.2.4

    Journal ref: Journal of Biomedical Semantics 8 (2017) 1-22

  21. arXiv:1602.00722  [pdf, other]

    cs.AR

    Enabling Efficient Dynamic Resizing of Large DRAM Caches via A Hardware Consistent Hashing Mechanism

    Authors: Kevin K. Chang, Gabriel H. Loh, Mithuna Thottethodi, Yasuko Eckert, Mike O'Connor, Srilatha Manne, Lisa Hsu, Lavanya Subramanian, Onur Mutlu

    Abstract: Die-stacked DRAM has been proposed for use as a large, high-bandwidth, last-level cache with hundreds or thousands of megabytes of capacity. Not all workloads (or phases) can productively utilize this much cache space, however. Unfortunately, the unused (or under-used) cache continues to consume power due to leakage in the peripheral circuitry and periodic DRAM refresh. Dynamically adjusting the a…

    Submitted 1 February, 2016; originally announced February 2016.

    Comments: 13 pages