Skip to main content

Showing 1–11 of 11 results for author: Mao, H H

.
  1. arXiv:2210.04243  [pdf, other

    cs.LG

    Fine-Tuning Pre-trained Transformers into Decaying Fast Weights

    Authors: Huanru Henry Mao

    Abstract: Autoregressive Transformers are strong language models but incur O(T) complexity during per-token generation due to the self-attention mechanism. Recent work proposes kernel-based methods to approximate causal self-attention by replacing it with recurrent formulations with various update rules and feature maps to achieve O(1) time and memory complexity. We explore these approaches and find that th… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  2. Sampling Through the Lens of Sequential Decision Making

    Authors: Jason Xiaotian Dou, Alvin Qingkai Pan, Runxue Bao, Haiyi Harry Mao, Lei Luo, Zhi-Hong Mao

    Abstract: Sampling is ubiquitous in machine learning methodologies. Due to the growth of large datasets and model complexity, we want to learn and adapt the sampling process while training a representation. Towards achieving this grand goal, a variety of sampling techniques have been proposed. However, most of them either use a fixed sampling scheme or adjust the sampling scheme based on simple heuristics.… ▽ More

    Submitted 13 December, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  3. arXiv:2007.00800  [pdf, other

    cs.LG stat.ML

    A Survey on Self-supervised Pre-training for Sequential Transfer Learning in Neural Networks

    Authors: Huanru Henry Mao

    Abstract: Deep neural networks are typically trained under a supervised learning framework where a model learns a single task using labeled data. Instead of relying solely on labeled data, practitioners can harness unlabeled or related data to improve model performance, which is often more accessible and ubiquitous. Self-supervised pre-training for transfer learning is becoming an increasingly popular techn… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  4. arXiv:2005.08072  [pdf, other

    eess.AS cs.LG cs.SD

    Speech Recognition and Multi-Speaker Diarization of Long Conversations

    Authors: Huanru Henry Mao, Shuyang Li, Julian McAuley, Garrison Cottrell

    Abstract: Speech recognition (ASR) and speaker diarization (SD) models have traditionally been trained separately to produce rich conversation transcripts with speaker labels. Recent advances have shown that joint ASR and SD models can learn to leverage audio-lexical inter-dependencies to improve word diarization performance. We introduce a new benchmark of hour-long podcasts collected from the weekly This… ▽ More

    Submitted 4 November, 2020; v1 submitted 16 May, 2020; originally announced May 2020.

  5. arXiv:2003.04887  [pdf, other

    cs.LG cs.CL stat.ML

    ReZero is All You Need: Fast Convergence at Large Depth

    Authors: Thomas Bachlechner, Bodhisattwa Prasad Majumder, Huanru Henry Mao, Garrison W. Cottrell, Julian McAuley

    Abstract: Deep networks often suffer from vanishing or exploding gradients due to inefficient signal propagation, leading to long training times or convergence difficulties. Various architecture designs, sophisticated residual-style networks, and initialization schemes have been shown to improve deep signal propagation. Recently, Pennington et al. used free probability theory to show that dynamical isometry… ▽ More

    Submitted 24 June, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

  6. arXiv:2002.03340  [pdf, ps, other

    physics.plasm-ph

    Growth, saturation and collapse of laser-driven plasma density gratings

    Authors: H. H. Ma, S. M. Weng, P. Li, X. F. Li, Y. X. Wang, S. H. Yew, M. Chen, P. McKenna, Z. M. Sheng

    Abstract: The plasma density grating induced by intersecting intense laser pulses can be utilized as an optical compressors, polarizers, waveplates and photonic crystals for the manipulation of ultra-high-power laser pulses. However, the formation and evolution of the plasma density grating are still not fully understood as linear models are adopted to describe them usually. In this paper, two nonlinear the… ▽ More

    Submitted 28 July, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: 15pages, 5figures

    Journal ref: Phys. Plasmas 27, 073105 (2020)

  7. arXiv:1908.09451  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Improving Neural Story Generation by Targeted Common Sense Grounding

    Authors: Huanru Henry Mao, Bodhisattwa Prasad Majumder, Julian McAuley, Garrison W. Cottrell

    Abstract: Stories generated with neural language models have shown promise in grammatical and stylistic consistency. However, the generated stories are still lacking in common sense reasoning, e.g., they often contain sentences deprived of world knowledge. We propose a simple multi-task learning scheme to achieve quantitatively better common sense reasoning in language models by leveraging auxiliary trainin… ▽ More

    Submitted 27 February, 2020; v1 submitted 25 August, 2019; originally announced August 2019.

  8. arXiv:1907.04868  [pdf, other

    cs.SD cs.LG cs.MM eess.AS stat.ML

    LakhNES: Improving multi-instrumental music generation with cross-domain pre-training

    Authors: Chris Donahue, Huanru Henry Mao, Yiting Ethan Li, Garrison W. Cottrell, Julian McAuley

    Abstract: We are interested in the task of generating multi-instrumental music scores. The Transformer architecture has recently shown great promise for the task of piano score generation; here we adapt it to the multi-instrumental setting. Transformers are complex, high-dimensional language models which are capable of capturing long-term structure in sequence data, but require large amounts of data to fit.… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: Published as a conference paper at ISMIR 2019

  9. arXiv:1806.04278  [pdf, other

    cs.SD cs.LG cs.NE eess.AS

    The NES Music Database: A multi-instrumental dataset with expressive performance attributes

    Authors: Chris Donahue, Huanru Henry Mao, Julian McAuley

    Abstract: Existing research on music generation focuses on composition, but often ignores the expressive performance characteristics required for plausible renditions of resultant pieces. In this paper, we introduce the Nintendo Entertainment System Music Database (NES-MDB), a large corpus allowing for separate examination of the tasks of composition and performance. NES-MDB contains thousands of multi-inst… ▽ More

    Submitted 11 June, 2018; originally announced June 2018.

    Comments: Published as a conference paper at ISMIR 2018

  10. DeepJ: Style-Specific Music Generation

    Authors: Huanru Henry Mao, Taylor Shin, Garrison W. Cottrell

    Abstract: Recent advances in deep neural networks have enabled algorithms to compose music that is comparable to music composed by humans. However, few algorithms allow the user to generate music with tunable parameters. The ability to tune properties of generated music will yield more practical benefits for aiding artists, filmmakers, and composers in their creative tasks. In this paper, we introduce DeepJ… ▽ More

    Submitted 2 January, 2018; originally announced January 2018.

  11. arXiv:1603.08471  [pdf

    cond-mat.str-el cond-mat.mes-hall

    Liquid-Gated High Mobility and Quantum Oscillation of the Two-Dimensional Electron Gas at an Oxide Interface

    Authors: Shengwei Zeng, Weiming Lü, Zhen Huang, Zhiqi Liu, Kun Han, Kalon Gopinadhan, Changjian Li, Rui Guo, Wenxiong Zhou, Haijiao Harsan Ma, Linke Jian, T Venkatesan, Ariando

    Abstract: Electric field effect in electronic double layer transistor (EDLT) configuration with ionic liquids as the dielectric materials is a powerful means of exploring various properties in different materials. Here we demonstrate the modulation of electrical transport properties and extremely high mobility of two-dimensional electron gas at LaAlO$_3$/SrTiO$_3$ (LAO/STO) interface through ionic liquid-as… ▽ More

    Submitted 28 March, 2016; originally announced March 2016.

    Comments: 22 pages, 4 figures, ACS Nano, March 09, 2016