Skip to main content

Showing 1–10 of 10 results for author: Staib, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2109.08564  [pdf, other

    cs.CL cs.IR cs.LG

    Slot Filling for Biomedical Information Extraction

    Authors: Yannis Papanikolaou, Marlene Staib, Justin Grace, Francine Bennett

    Abstract: Information Extraction (IE) from text refers to the task of extracting structured knowledge from unstructured text. The task typically consists of a series of sub-tasks such as Named Entity Recognition and Relation Extraction. Sourcing entity and relation type specific training data is a major bottleneck in domains with limited resources such as biomedicine. In this work we present a slot filling… ▽ More

    Submitted 11 April, 2022; v1 submitted 17 September, 2021; originally announced September 2021.

  2. arXiv:2106.08352  [pdf, other

    eess.AS cs.LG cs.SD

    Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis

    Authors: Devang S Ram Mohan, Vivian Hu, Tian Huey Teh, Alexandra Torresquintero, Christopher G. R. Wallis, Marlene Staib, Lorenzo Foglianti, Jiameng Gao, Simon King

    Abstract: Text does not fully specify the spoken form, so text-to-speech models must be able to learn from speech data that vary in ways not explained by the corresponding text. One way to reduce the amount of unexplained variation in training data is to provide acoustic information as an additional learning signal. When generating speech, modifying this acoustic information enables multiple distinct rendit… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: To be published in Interspeech 2021. 5 pages, 4 figures

  3. Phonological Features for 0-shot Multilingual Speech Synthesis

    Authors: Marlene Staib, Tian Huey Teh, Alexandra Torresquintero, Devang S Ram Mohan, Lorenzo Foglianti, Raphael Lenain, Jiameng Gao

    Abstract: Code-switching---the intra-utterance use of multiple languages---is prevalent across the world. Within text-to-speech (TTS), multilingual models have been found to enable code-switching. By modifying the linguistic input to sequence-to-sequence TTS, we show that code-switching is possible for languages unseen during training, even within monolingual models. We use a small set of phonological featu… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: 5 pages, to be presented at INTERSPEECH 2020

  4. arXiv:2008.03096  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning

    Authors: Devang S Ram Mohan, Raphael Lenain, Lorenzo Foglianti, Tian Huey Teh, Marlene Staib, Alexandra Torresquintero, Jiameng Gao

    Abstract: Modern approaches to text to speech require the entire input character sequence to be processed before any audio is synthesised. This latency limits the suitability of such models for time-sensitive tasks like simultaneous interpretation. Interleaving the action of reading a character with that of synthesising audio reduces this latency. However, the order of this sequence of interleaved actions v… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: To be published in Interspeech 2020. 5 pages, 4 figures

  5. arXiv:1905.10943  [pdf, other

    cs.LG stat.ML

    Distributionally Robust Optimization and Generalization in Kernel Methods

    Authors: Matthew Staib, Stefanie Jegelka

    Abstract: Distributionally robust optimization (DRO) has attracted attention in machine learning due to its connections to regularization, generalization, and robustness. Existing work has considered uncertainty sets based on phi-divergences and Wasserstein distances, each of which have drawbacks. In this paper, we study DRO with uncertainty sets measured via maximum mean discrepancy (MMD). We show that MMD… ▽ More

    Submitted 26 May, 2019; originally announced May 2019.

  6. arXiv:1901.09149  [pdf, other

    cs.LG math.OC stat.ML

    Esca** Saddle Points with Adaptive Gradient Methods

    Authors: Matthew Staib, Sashank J. Reddi, Satyen Kale, Sanjiv Kumar, Suvrit Sra

    Abstract: Adaptive methods such as Adam and RMSProp are widely used in deep learning but are not well understood. In this paper, we seek a crisp, clean and precise characterization of their behavior in nonconvex settings. To this end, we first provide a novel view of adaptive methods as preconditioned SGD, where the preconditioner is estimated in an online manner. By studying the preconditioner on its own,… ▽ More

    Submitted 3 February, 2020; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: Update Theorem 4.1 and proof to use martingale concentration bounds, i.e. matrix Freedman

  7. arXiv:1901.00032  [pdf, other

    cond-mat.mtrl-sci cs.AI stat.ML

    Inorganic Materials Synthesis Planning with Literature-Trained Neural Networks

    Authors: Edward Kim, Zach Jensen, Alexander van Grootel, Kevin Huang, Matthew Staib, Sheshera Mysore, Haw-Shiuan Chang, Emma Strubell, Andrew McCallum, Stefanie Jegelka, Elsa Olivetti

    Abstract: Leveraging new data sources is a key step in accelerating the pace of materials design and discovery. To complement the strides in synthesis planning driven by historical, experimental, and computed data, we present an automated method for connecting scientific literature to synthesis insights. Starting from natural language text, we apply word embeddings from language models, which are fed into a… ▽ More

    Submitted 17 February, 2019; v1 submitted 31 December, 2018; originally announced January 2019.

    Comments: Added new funding support to the acknowledgments section in this version

  8. arXiv:1802.05249  [pdf, other

    cs.LG math.OC stat.ML

    Distributionally Robust Submodular Maximization

    Authors: Matthew Staib, Bryan Wilder, Stefanie Jegelka

    Abstract: Submodular functions have applications throughout machine learning, but in many settings, we do not have direct access to the underlying function $f$. We focus on stochastic functions that are given as an expectation of functions over a distribution $P$. In practice, we often have only a limited set of samples $f_i$ from $P$. The standard approach indirectly optimizes $f$ by maximizing the sum of… ▽ More

    Submitted 5 June, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

  9. arXiv:1705.07443  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    Parallel Streaming Wasserstein Barycenters

    Authors: Matthew Staib, Sebastian Claici, Justin Solomon, Stefanie Jegelka

    Abstract: Efficiently aggregating data from different sources is a challenging problem, particularly when samples from each source are distributed differently. These differences can be inherent to the inference task or present for other reasons: sensors in a sensor network may be placed far apart, affecting their individual measurements. Conversely, it is computationally advantageous to split Bayesian infer… ▽ More

    Submitted 13 November, 2017; v1 submitted 21 May, 2017; originally announced May 2017.

    Comments: NIPS 2017

  10. arXiv:1702.08791  [pdf, other

    cs.LG cs.AI cs.DS cs.SI math.OC

    Robust Budget Allocation via Continuous Submodular Functions

    Authors: Matthew Staib, Stefanie Jegelka

    Abstract: The optimal allocation of resources for maximizing influence, spread of information or coverage, has gained attention in the past years, in particular in machine learning and data mining. But in applications, the parameters of the problem are rarely known exactly, and using wrong parameters can lead to undesirable outcomes. We hence revisit a continuous version of the Budget Allocation or Bipartit… ▽ More

    Submitted 13 June, 2017; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: ICML 2017