Skip to main content

Showing 1–16 of 16 results for author: Morrison, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03569  [pdf, other

    math.NA cs.LG

    GFN: A graph feedforward network for resolution-invariant reduced operator learning in multifidelity applications

    Authors: Oisín M. Morrison, Federico Pichi, Jan S. Hesthaven

    Abstract: This work presents a novel resolution-invariant model order reduction strategy for multifidelity applications. We base our architecture on a novel neural network layer developed in this work, the graph feedforward network, which extends the concept of feedforward networks to graph-structured data by creating a direct link between the weights of a neural network and the nodes of a mesh, enhancing t… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2402.17735  [pdf, other

    eess.AS cs.SD

    High-Fidelity Neural Phonetic Posteriorgrams

    Authors: Cameron Churchwell, Max Morrison, Bryan Pardo

    Abstract: A phonetic posteriorgram (PPG) is a time-varying categorical distribution over acoustic units of speech (e.g., phonemes). PPGs are a popular representation in speech generation due to their ability to disentangle pronunciation features from speaker identity, allowing accurate reconstruction of pronunciation (e.g., voice conversion) and coarse-grained pronunciation editing (e.g., foreign accent con… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to ICASSP 2024 Workshop on Explainable Machine Learning for Speech and Audio

  3. arXiv:2402.11151  [pdf

    cs.SE

    A Landscape Study of Open Source and Proprietary Tools for Software Bill of Materials (SBOM)

    Authors: Mehdi Mirakhorli, Derek Garcia, Schuyler Dillon, Kevin Laporte, Matthew Morrison, Henry Lu, Viktoria Koscinski, Christopher Enoch

    Abstract: Modern software applications heavily rely on diverse third-party components, libraries, and frameworks sourced from various vendors and open source repositories, presenting a complex challenge for securing the software supply chain. To address this complexity, the adoption of a Software Bill of Materials (SBOM) has emerged as a promising solution, offering a centralized repository that inventories… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  4. arXiv:2401.11042  [pdf

    cs.HC

    Does Using ChatGPT Result in Human Cognitive Augmentation?

    Authors: Ron Fulbright, Miranda Morrison

    Abstract: Human cognitive performance is enhanced by the use of tools. For example, a human can produce a much greater, and more accurate, volume of mathematical calculation in a unit of time using a calculator or a spreadsheet application on a computer. Such tools have taken over the burden of lower level cognitive grunt work but the human still serves the role of the expert performing higher level thinkin… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 12 pages, 5 figures

  5. arXiv:2310.08464  [pdf, other

    eess.AS cs.SD

    Crowdsourced and Automatic Speech Prominence Estimation

    Authors: Max Morrison, Pranav Pawar, Nathan Pruyne, Jennifer Cole, Bryan Pardo

    Abstract: The prominence of a spoken word is the degree to which an average native listener perceives the word as salient or emphasized relative to its context. Speech prominence estimation is the process of assigning a numeric value to the prominence of each word in an utterance. These prominence labels are useful for linguistic analysis, as well as training automated systems to perform emphasis-controlled… ▽ More

    Submitted 22 December, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at ICASSP 2024

  6. arXiv:2301.12258  [pdf, other

    eess.AS cs.SD

    Cross-domain Neural Pitch and Periodicity Estimation

    Authors: Max Morrison, Caedon Hsieh, Nathan Pruyne, Bryan Pardo

    Abstract: Pitch is a foundational aspect of our perception of audio signals. Pitch contours are commonly used to analyze speech and music signals and as input features for many audio tasks, including music transcription, singing voice synthesis, and prosody editing. In this paper, we describe a set of techniques for improving the accuracy of widely-used neural pitch and periodicity estimators to achieve sta… ▽ More

    Submitted 9 June, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

  7. arXiv:2208.12387  [pdf, other

    cs.SD cs.LG eess.AS

    Music Separation Enhancement with Generative Modeling

    Authors: Noah Schaffer, Boaz Cogan, Ethan Manilow, Max Morrison, Prem Seetharaman, Bryan Pardo

    Abstract: Despite phenomenal progress in recent years, state-of-the-art music separation systems produce source estimates with significant perceptual shortcomings, such as adding extraneous noise or removing harmonics. We propose a post-processing model (the Make it Sound Good (MSG) post-processor) to enhance the output of music source separation systems. We apply our post-processing model to state-of-the-a… ▽ More

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: Accepted to ISMIR 2022

  8. arXiv:2203.04451  [pdf, other

    cs.SI physics.soc-ph

    Transitions between peace and systemic war as bifurcations in a signed network dynamical system

    Authors: Megan Morrison, J. Nathan Kutz, Michael Gabbay

    Abstract: We investigate structural features and processes associated with the onset of systemic conflict using an approach which integrates complex systems theory with network modeling and analysis. We present a signed network model of cooperation and conflict dynamics in the context of international relations between states. The model evolves ties between nodes under the influence of a structural balance… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    MSC Class: 91D30; 37G99; 37N99; 91C20; 34H20 ACM Class: J.4

  9. arXiv:2203.04444  [pdf, other

    cs.HC cs.LG

    Reproducible Subjective Evaluation

    Authors: Max Morrison, Brian Tang, Gefei Tan, Bryan Pardo

    Abstract: Human perceptual studies are the gold standard for the evaluation of many research tasks in machine learning, linguistics, and psychology. However, these studies require significant time and cost to perform. As a result, many researchers use objective measures that can correlate poorly with human evaluation. When subjective evaluations are performed, they are often not reported with sufficient det… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: Submitted to ICLR 2022 Workshop on Setting up ML Evaluation Standards to Accelerate Progress

  10. arXiv:2110.10139  [pdf, other

    eess.AS cs.SD

    Chunked Autoregressive GAN for Conditional Waveform Synthesis

    Authors: Max Morrison, Rithesh Kumar, Kundan Kumar, Prem Seetharaman, Aaron Courville, Yoshua Bengio

    Abstract: Conditional waveform synthesis models learn a distribution of audio waveforms given conditioning such as text, mel-spectrograms, or MIDI. These systems employ deep generative models that model the waveform via either sequential (autoregressive) or parallel (non-autoregressive) sampling. Generative adversarial networks (GANs) have become a common choice for non-autoregressive waveform synthesis. Ho… ▽ More

    Submitted 3 March, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at ICLR 2022

  11. arXiv:2110.02360  [pdf, other

    eess.AS cs.SD

    Neural Pitch-Shifting and Time-Stretching with Controllable LPCNet

    Authors: Max Morrison, Zeyu **, Nicholas J. Bryan, Juan-Pablo Caceres, Bryan Pardo

    Abstract: Modifying the pitch and timing of an audio signal are fundamental audio editing operations with applications in speech manipulation, audio-visual synchronization, and singing voice editing and synthesis. Thus far, methods for pitch-shifting and time-stretching that use digital signal processing (DSP) have been favored over deep learning approaches due to their speed and relatively higher quality.… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: Submitted to ICASSP 2022

  12. arXiv:2102.08328  [pdf, other

    eess.AS cs.LG cs.SD

    Context-Aware Prosody Correction for Text-Based Speech Editing

    Authors: Max Morrison, Lucas Rencker, Zeyu **, Nicholas J. Bryan, Juan-Pablo Caceres, Bryan Pardo

    Abstract: Text-based speech editors expedite the process of editing speech recordings by permitting editing via intuitive cut, copy, and paste operations on a speech transcript. A major drawback of current systems, however, is that edited recordings often sound unnatural because of prosody mismatches around edited regions. In our work, we propose a new context-aware method for more natural sounding text-bas… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: To appear in proceedings of ICASSP 2021

  13. arXiv:2010.03660  [pdf, other

    cs.DC

    Fast Stencil-Code Computation on a Wafer-Scale Processor

    Authors: Kamil Rocki, Dirk Van Essendelft, Ilya Sharapov, Robert Schreiber, Michael Morrison, Vladimir Kibardin, Andrey Portnoy, Jean Francois Dietiker, Madhava Syamlal, Michael James

    Abstract: The performance of CPU-based and GPU-based systems is often low for PDE codes, where large, sparse, and often structured systems of linear equations must be solved. Iterative solvers are limited by data movement, both between caches and memory and between nodes. Here we describe the solution of such systems of equations on the Cerebras Systems CS-1, a wafer-scale processor that has the memory band… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: SC 20: The International Conference for High Performance Computing, Networking, Storage, and Analysis, to appear

  14. arXiv:2008.03388  [pdf, other

    eess.AS cs.LG cs.SD

    Controllable Neural Prosody Synthesis

    Authors: Max Morrison, Zeyu **, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore

    Abstract: Speech synthesis has recently seen significant improvements in fidelity, driven by the advent of neural vocoders and neural prosody generators. However, these systems lack intuitive user controls over prosody, making them unable to rectify prosody errors (e.g., misplaced emphases and contextually inappropriate emotions) or generate prosodies with diverse speaker excitement levels and emotions. We… ▽ More

    Submitted 11 August, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: To appear in proceedings of INTERSPEECH 2020

  15. arXiv:1912.07772  [pdf, other

    cs.SI physics.soc-ph

    Community detectability and structural balance dynamics in signed networks

    Authors: Megan Morrison, Michael Gabbay

    Abstract: We investigate signed networks with community structure with respect to their spectrum and their evolution under a dynamical model of structural balance, a prominent theory of signed social networks. The spectrum of the adjacency matrix generated by a stochastic block model with two equal size communities shows detectability transitions in which the community structure becomes manifest when its si… ▽ More

    Submitted 16 December, 2019; originally announced December 2019.

    MSC Class: 91C20; 15B52; 91D30

    Journal ref: Phys. Rev. E 102, 012304 (2020)

  16. arXiv:1911.02073  [pdf, other

    cs.SD cs.LG eess.AS

    OtoMechanic: Auditory Automobile Diagnostics via Query-by-Example

    Authors: Max Morrison, Bryan Pardo

    Abstract: Early detection and repair of failing components in automobiles reduces the risk of vehicle failure in life-threatening situations. Many automobile components in need of repair produce characteristic sounds. For example, loose drive belts emit a high-pitched squeaking sound, and bad starter motors have a characteristic whirring or clicking noise. Often drivers can tell that the sound of their car… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: Submitted to Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE2019)