Skip to main content

Showing 1–24 of 24 results for author: Huang, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2403.15145  [pdf, ps, other

    cs.IT eess.SP

    Robust Resource Allocation for STAR-RIS Assisted SWIPT Systems

    Authors: Guangyu Zhu, Xidong Mu, Li Guo, Ao Huang, Shibiao Xu

    Abstract: A simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted simultaneous wireless information and power transfer (SWIPT) system is proposed. More particularly, an STAR-RIS is deployed to assist in the information/power transfer from a multi-antenna access point (AP) to multiple single-antenna information users (IUs) and energy users (EUs), where two practica… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  2. arXiv:2403.15130  [pdf, ps, other

    cs.IT eess.SP

    Coexisting Passive RIS and Active Relay Assisted NOMA Systems

    Authors: Ao Huang, Li Guo, Xidong Mu, Chao Dong, Yuanwei Liu

    Abstract: A novel coexisting passive reconfigurable intelligent surface (RIS) and active decode-and-forward (DF) relay assisted non-orthogonal multiple access (NOMA) transmission framework is proposed. In particular, two communication protocols are conceived, namely Hybrid NOMA (H-NOMA) and Full NOMA (F-NOMA). Based on the proposed two protocols, both the sum rate maximization and max-min rate fairness prob… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  3. arXiv:2403.15120  [pdf, ps, other

    cs.IT eess.SP

    STAR-RIS Assisted Downlink Active and Uplink Backscatter Communications with NOMA

    Authors: Ao Huang, Xidong Mu, Li Guo

    Abstract: A simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted downlink (DL) active and uplink (UL) backscatter communication (BackCom) framework is proposed. More particularly, a full-duplex (FD) base station (BS) communicates with the DL users via the STAR-RIS's transmission link, while exciting and receiving the information from the UL BackCom devices with t… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  4. arXiv:2403.01956  [pdf, ps, other

    cs.IT eess.SP

    Hybrid Active-Passive RIS Transmitter Enabled Energy-Efficient Multi-User Communications

    Authors: Ao Huang, Xidong Mu, Li Guo, Guangyu Zhu

    Abstract: A novel hybrid active-passive reconfigurable intelligent surface (RIS) transmitter enabled downlink multi-user communication system is investigated. Specifically, RISs are exploited to serve as transmitter antennas, where each element can flexibly switch between active and passive modes to deliver information to multiple users. The system energy efficiency (EE) maximization problem is formulated b… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  5. arXiv:2303.07821  [pdf, ps, other

    cs.IT eess.SP

    Self-attention for Enhanced OAMP Detection in MIMO Systems

    Authors: Alexander Fuchs, Christian Knoll, Nima N. Moghadam, Alexey Pak **liang Huang, Erik Leitinger, Franz Pernkopf

    Abstract: Multiple-Input Multiple-Output (MIMO) systems are essential for wireless communications. Sinceclassical algorithms for symbol detection in MIMO setups require large computational resourcesor provide poor results, data-driven algorithms are becoming more popular. Most of the proposedalgorithms, however, introduce approximations leading to degraded performance for realistic MIMOsystems. In this pape… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 8 pages, 2 figures, ICASSP 2023

    ACM Class: I.2.1; H.1.1

  6. arXiv:2303.07406  [pdf

    cs.AR cs.CR eess.IV physics.app-ph

    Infra-Red, In-Situ (IRIS) Inspection of Silicon

    Authors: Andrew 'bunnie' Huang

    Abstract: This paper introduces the Infra-Red, In Situ (IRIS) inspection method, which uses short-wave IR (SWIR) light to non-destructively "see through" the backside of chips and image them with lightly modified conventional digital CMOS cameras. With a ~1050 nm light source, IRIS is capable of constraining macro- and meso-scale features of a chip. This hardens existing micro-scale self-test verification t… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

    Comments: 8 pages, 19 figures

    ACM Class: B.m

  7. arXiv:2205.03997  [pdf, other

    cs.AR cs.LG eess.IV

    A Real Time Super Resolution Accelerator with Tilted Layer Fusion

    Authors: An-Jung Huang, Kai-Chieh Hsu, Tian-Sheuan Chang

    Abstract: Deep learning based superresolution achieves high-quality results, but its heavy computational workload, large buffer, and high external memory bandwidth inhibit its usage in mobile devices. To solve the above issues, this paper proposes a real-time hardware accelerator with the tilted layer fusion method that reduces the external DRAM bandwidth by 92\% and just needs 102KB on-chip memory. The des… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: 5 pages, 6 figures, published in ISCAS 2022

  8. arXiv:2203.15140  [pdf, other

    cs.SD eess.AS

    Improving Source Separation by Explicitly Modeling Dependencies Between Sources

    Authors: Ethan Manilow, Curtis Hawthorne, Cheng-Zhi Anna Huang, Bryan Pardo, Jesse Engel

    Abstract: We propose a new method for training a supervised source separation system that aims to learn the interdependent relationships between all combinations of sources in a mixture. Rather than independently estimating each source from a mix, we reframe the source separation problem as an Orderless Neural Autoregressive Density Estimator (NADE), and estimate each source from both the mix and a random s… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: To appear at ICASSP 2022

  9. arXiv:2112.09312  [pdf, other

    cs.SD cs.LG eess.AS

    MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling

    Authors: Yusong Wu, Ethan Manilow, Yi Deng, Rigel Swavely, Kyle Kastner, Tim Cooijmans, Aaron Courville, Cheng-Zhi Anna Huang, Jesse Engel

    Abstract: Musical expression requires control of both what notes are played, and how they are performed. Conventional audio synthesizers provide detailed expressive controls, but at the cost of realism. Black-box neural audio synthesis and concatenative samplers can produce realistic audio, but have few mechanisms for control. In this work, we introduce MIDI-DDSP a hierarchical model of musical instruments… ▽ More

    Submitted 17 March, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted by International Conference on Learning Representations (ICLR) 2022

  10. arXiv:2111.14951  [pdf, other

    cs.HC cs.LG cs.SD eess.AS

    Expressive Communication: A Common Framework for Evaluating Developments in Generative Models and Steering Interfaces

    Authors: Ryan Louie, Jesse Engel, Anna Huang

    Abstract: There is an increasing interest from ML and HCI communities in empowering creators with better generative models and more intuitive interfaces with which to control them. In music, ML researchers have focused on training models capable of generating pieces with increasing long-range structure and musical coherence, while HCI researchers have separately focused on designing steering interfaces that… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 15 pages, 6 figures, submitted to ACM Intelligent User Interfaces 2022 Conference

  11. arXiv:2106.08846  [pdf, other

    cs.LG cs.AI cs.CL eess.SY

    Algorithm to Compilation Co-design: An Integrated View of Neural Network Sparsity

    Authors: Fu-Ming Guo, Austin Huang

    Abstract: Reducing computation cost, inference latency, and memory footprint of neural networks are frequently cited as research motivations for pruning and sparsity. However, operationalizing those benefits and understanding the end-to-end effect of algorithm design and regularization on the runtime execution is not often examined in depth. Here we apply structured and unstructured pruning to attention w… ▽ More

    Submitted 17 June, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

  12. arXiv:2010.09776  [pdf, other

    cs.MA cs.AI cs.GT cs.LG eess.SY

    SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

    Authors: Ming Zhou, Jun Luo, Julian Villella, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Aurora Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Dong Chen, Zhengbang Zhu, Nhat Nguyen, Mohamed Elsayed, Kun Shao, Sanjeevan Ahilan, Baokuan Zhang, Jiannan Wu, Zhengang Fu, Kasra Rezaee, Peyman Yadmellat , et al. (12 additional authors not shown)

    Abstract: Multi-agent interaction is a fundamental aspect of autonomous driving in the real world. Despite more than a decade of research and development, the problem of how to competently interact with diverse road users in diverse scenarios remains largely unsolved. Learning methods have much to offer towards solving this problem. But they require a realistic multi-agent simulator that generates diverse a… ▽ More

    Submitted 31 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 20 pages, 11 figures. Paper accepted to CoRL 2020

  13. arXiv:2010.05388  [pdf, other

    cs.SD cs.HC cs.LG eess.AS

    AI Song Contest: Human-AI Co-Creation in Songwriting

    Authors: Cheng-Zhi Anna Huang, Hendrik Vincent Koops, Ed Newton-Rex, Monica Dinculescu, Carrie J. Cai

    Abstract: Machine learning is challenging the way we make music. Although research in deep generative models has dramatically improved the capability and fluency of music models, recent work has shown that it can be challenging for humans to partner with this new class of algorithms. In this paper, we present findings on what 13 musician/developer teams, a total of 61 users, needed when co-creating a song w… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: 6 pages + 3 pages of references

    ACM Class: J.5; I.2

    Journal ref: ISMIR 2020

  14. arXiv:2007.05500  [pdf, other

    cs.CV cs.LG eess.IV

    Scientific Discovery by Generating Counterfactuals using Image Translation

    Authors: Arunachalam Narayanaswamy, Subhashini Venugopalan, Dale R. Webster, Lily Peng, Greg Corrado, Paisan Ruamviboonsuk, Pinal Bavishi, Rory Sayres, Abigail Huang, Siva Balasubramanian, Michael Brenner, Philip Nelson, Avinash V. Varadarajan

    Abstract: Model explanation techniques play a critical role in understanding the source of a model's performance and making its decisions transparent. Here we investigate if explanation techniques can also be used as a mechanism for scientific discovery. We make three contributions: first, we propose a framework to convert predictions from explanation techniques to a mechanism of discovery. Second, we show… ▽ More

    Submitted 19 July, 2020; v1 submitted 10 July, 2020; originally announced July 2020.

    Comments: Accepted at MICCAI 2020. This version combines camera-ready and supplement

    Journal ref: MICCAI 2020

  15. Large-Signal Stability Criteria in DC Power Grids with Distributed-Controlled Converters and Constant Power Loads

    Authors: Fangyuan Chang, Xiaofan Cui, Mengqi Wang, Wencong Su, Alex Q. Huang

    Abstract: The increasing adoption of power electronic devices may lead to large disturbance and destabilization of future power systems. However, stability criteria are still an unsolved puzzle, since traditional small-signal stability analysis is not applicable to power electronics-enabled power systems when a large disturbance occurs, such as a fault, a pulse power load, or load switching. To address this… ▽ More

    Submitted 25 May, 2020; originally announced May 2020.

  16. arXiv:2002.02451  [pdf, other

    eess.SP cs.NI

    Federated Orchestration for Network Slicing of Bandwidth and Computational Resource

    Authors: Yingyu Li, Anqi Huang, Yong Xiao, Xiaohu Ge, Sumei Sun, Han-Chieh Chao

    Abstract: Network slicing has been considered as one of the key enablers for 5G to support diversified IoT services and application scenarios. This paper studies the distributed network slicing for a massive scale IoT network supported by 5G with fog computing. Multiple services with various requirements need to be supported by both spectrum resource offered by 5G network and computational resourc of the fo… ▽ More

    Submitted 6 February, 2020; originally announced February 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2002.01101

  17. arXiv:1908.08669  [pdf

    eess.SY

    A Novel Synchronous Reference Frame Frequency-Locked Loop

    Authors: Xiangjun Quan, Qinran Hu, Alex Q. Huang, Xiaobo Dou, Zaijun Wu

    Abstract: This letter proposes a new design of frequency-locked loop (FLL) which is based on synchronous (dq) reference frame instead of stationary (α\b{eta}) reference frame. First, a synchronous reference frame FLL (briefly called SRF-FLL0) equivalent to the conventional FLL is proposed. Then the SRF-FLL0 is improved by utilizing the phase error to acquire a better performance. The small-signal modeling a… ▽ More

    Submitted 26 September, 2019; v1 submitted 23 August, 2019; originally announced August 2019.

    Comments: 4 pages, 6 figures

  18. arXiv:1907.06637  [pdf, other

    cs.SD cs.HC cs.LG eess.AS stat.ML

    The Bach Doodle: Approachable music composition with machine learning at scale

    Authors: Cheng-Zhi Anna Huang, Curtis Hawthorne, Adam Roberts, Monica Dinculescu, James Wexler, Leon Hong, Jacob Howcroft

    Abstract: To make music composition more approachable, we designed the first AI-powered Google Doodle, the Bach Doodle, where users can create their own melody and have it harmonized by a machine learning model Coconet (Huang et al., 2017) in the style of Bach. For users to input melodies, we designed a simplified sheet-music based interface. To support an interactive experience at scale, we re-implemented… ▽ More

    Submitted 14 July, 2019; originally announced July 2019.

    Comments: Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2019

  19. arXiv:1905.08632  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Human Vocal Sentiment Analysis

    Authors: Andrew Huang, Puwei Bao

    Abstract: In this paper, we use several techniques with conventional vocal feature extraction (MFCC, STFT), along with deep-learning approaches such as CNN, and also context-level analysis, by providing the textual data, and combining different approaches for improved emotion-level classification. We explore models that have not been tested to gauge the difference in performance and accuracy. We apply hyper… ▽ More

    Submitted 19 May, 2019; originally announced May 2019.

    Comments: NYU Shanghai CSCS 2019

  20. arXiv:1903.07227  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Counterpoint by Convolution

    Authors: Cheng-Zhi Anna Huang, Tim Cooijmans, Adam Roberts, Aaron Courville, Douglas Eck

    Abstract: Machine learning models of music typically break up the task of composition into a chronological process, composing a piece of music in a single pass from beginning to end. On the contrary, human composers write music in a nonlinear fashion, scribbling motifs here and there, often revisiting choices previously made. In order to better approximate this process, we train a convolutional neural netwo… ▽ More

    Submitted 17 March, 2019; originally announced March 2019.

    Comments: Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017

    ACM Class: H.5.5; I.2

  21. arXiv:1811.09914  [pdf, other

    eess.SY cs.AI cs.MA cs.RO

    RADMPC: A Fast Decentralized Approach for Chance-Constrained Multi-Vehicle Path-Planning

    Authors: Aaron Huang, Benjamin J. Ayton, Brian C. Williams

    Abstract: Robust multi-vehicle path-planning is important for ensuring the safety of multi-vehicle systems in applications like transportation, search and rescue, and robotic exploration. Chance-constrained methods like Iterative Risk Allocation (IRA)\cite{IRA} have been developed for situations where environmental disturbances are unbounded. However, chance-constrained methods for the multi-vehicle case ge… ▽ More

    Submitted 24 November, 2018; originally announced November 2018.

  22. arXiv:1810.12247  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset

    Authors: Curtis Hawthorne, Andriy Stasyuk, Adam Roberts, Ian Simon, Cheng-Zhi Anna Huang, Sander Dieleman, Erich Elsen, Jesse Engel, Douglas Eck

    Abstract: Generating musical audio directly with neural networks is notoriously difficult because it requires coherently modeling structure at many different timescales. Fortunately, most music is also highly structured and can be represented as discrete note events played on musical instruments. Herein, we show that by using notes as an intermediate representation, we can train a suite of models capable of… ▽ More

    Submitted 17 January, 2019; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: Examples available at https://goo.gl/magenta/maestro-examples

  23. arXiv:1809.04281  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Music Transformer

    Authors: Cheng-Zhi Anna Huang, Ashish Vaswani, Jakob Uszkoreit, Noam Shazeer, Ian Simon, Curtis Hawthorne, Andrew M. Dai, Matthew D. Hoffman, Monica Dinculescu, Douglas Eck

    Abstract: Music relies heavily on repetition to build structure and meaning. Self-reference occurs on multiple timescales, from motifs to phrases to reusing of entire sections of music, such as in pieces with ABA structure. The Transformer (Vaswani et al., 2017), a sequence model based on self-attention, has achieved compelling results in many generation tasks that require maintaining long-range coherence.… ▽ More

    Submitted 12 December, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: Improved skewing section and accompanying figures. Previous titles are "An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation" and "Music Transformer"

  24. arXiv:1410.2792  [pdf, other

    cs.RO eess.SY

    Convex Model Predictive Control for Vehicular Systems

    Authors: Tiffany A. Huang, Matanya B. Horowitz, Joel W. Burdick

    Abstract: In this work, we present a method to perform Model Predictive Control (MPC) over systems whose state is an element of $SO(n)$ for $n=2,3$. This is done without charts or any local linearization, and instead is performed by operating over the orbitope of rotation matrices. This results in a novel MPC scheme without the drawbacks associated with conventional linearization techniques. Instead, second… ▽ More

    Submitted 10 October, 2014; originally announced October 2014.