Showing 1–50 of 54 results for author: Ruan, Y

Searching in archive cs.
  1. arXiv:2406.13281  [pdf, other]

    cs.CV

    ECAFormer: Low-light Image Enhancement using Cross Attention

    Authors: Yudi Ruan, Hao Ma, Weikai Li, Xiao Wang

    Abstract: Low-light image enhancement (LLIE) is vital for autonomous driving. Despite the importance, existing LLIE methods often prioritize robustness in overall brightness adjustment, which can come at the expense of detail preservation. To overcome this limitation, we propose the Hierarchical Mutual Enhancement via Cross-Attention transformer (ECAFormer), a novel network that utilizes Dual Multi-head Self…

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.13161  [pdf, other]

    cs.AI cs.CL cs.LG cs.PL

    APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts

    Authors: Honghua Dong, Qidong Su, Yubo Gao, Zhaoyu Li, Yangjun Ruan, Gennady Pekhimenko, Chris J. Maddison, Xujie Si

    Abstract: Large Language Models (LLMs) have become increasingly capable of handling diverse tasks with the aid of well-crafted prompts and integration of external tools, but as task complexity rises, the workflow involving LLMs can be complicated and thus challenging to implement and maintain. To address this challenge, we propose APPL, A Prompt Programming Language that acts as a bridge between computer pr…

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.02240  [pdf, other]

    cs.NI

    Quantum Computing in Wireless Communications and Networking: A Tutorial-cum-Survey

    Authors: Wei Zhao, Tangjie Weng, Yue Ruan, Zhi Liu, Xuangou Wu, Xiao Zheng, Nei Kato

    Abstract: Owing to its outstanding parallel computing capabilities, quantum computing (QC) has been a subject of continuous attention. With the gradual maturation of QC platforms, it has increasingly played a significant role in various fields such as transportation, pharmaceuticals, and industrial manufacturing, achieving unprecedented milestones. In modern society, wireless communication stands as an indis…

    Submitted 4 June, 2024; originally announced June 2024.

  4. arXiv:2405.10938  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Observational Scaling Laws and the Predictability of Language Model Performance

    Authors: Yangjun Ruan, Chris J. Maddison, Tatsunori Hashimoto

    Abstract: Understanding how language model performance varies with scale is critical to benchmark and algorithm development. Scaling laws are one approach to building this understanding, but the requirement of training models across many different scales has limited their use. We propose an alternative, observational approach that bypasses model training and instead builds scaling laws from ~80 publicly a…

    Submitted 17 May, 2024; originally announced May 2024.

  5. arXiv:2404.13841  [pdf, other]

    cs.LG cs.AI

    Fair Concurrent Training of Multiple Models in Federated Learning

    Authors: Marie Siew, Haoran Zhang, Jong-Ik Park, Yuezhou Liu, Yichen Ruan, Lili Su, Stratis Ioannidis, Edmund Yeh, Carlee Joe-Wong

    Abstract: Federated learning (FL) enables collaborative learning across multiple clients. In most FL work, all clients train a single learning task. However, the recent proliferation of FL applications may increasingly require multiple FL tasks to be trained simultaneously, sharing clients' computing and communication resources, which we call Multiple-Model Federated Learning (MMFL). Current MMFL algorithms…

    Submitted 21 April, 2024; originally announced April 2024.

  6. arXiv:2403.05268  [pdf, ps, other]

    cs.CL cs.LG

    Deep Prompt Multi-task Network for Abuse Language Detection

    Authors: Jian Zhu, Yuping Ruan, Jingfei Chang, Wenhui Sun, Hui Wan, Jian Long, Cheng Luo

    Abstract: The detection of abusive language remains a long-standing challenge with the extensive use of social networks. The detection task of abusive language suffers from limited accuracy. We argue that the existing detection methods utilize the fine-tuning technique of the pre-trained language models (PLMs) to handle downstream tasks. Hence, these methods fail to stimulate the general knowledge of the PL…

    Submitted 24 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted by the International Conference on Pattern Recognition (ICPR) 2024

  7. arXiv:2403.04246  [pdf, other]

    stat.ML cs.AI cs.LG

    Efficient CNN-LSTM based Parameter Estimation of Levy Driven Stochastic Differential Equations

    Authors: Shuaiyu Li, Yang Ruan, Changzhou Long, Yuzhong Cheng

    Abstract: This study addresses the challenges in parameter estimation of stochastic differential equations driven by non-Gaussian noises, which are critical in understanding dynamic phenomena such as price fluctuations and the spread of infectious diseases. Previous research highlighted the potential of LSTM networks in estimating parameters of alpha stable Levy driven SDEs but faced limitations including h…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 2023 International Conference on Machine Learning and Applications (ICMLA)

  8. arXiv:2312.10386  [pdf, other]

    cs.LG

    RedCore: Relative Advantage Aware Cross-modal Representation Learning for Missing Modalities with Imbalanced Missing Rates

    Authors: Jun Sun, Xinxin Zhang, Shoukang Han, Yu-Ping Ruan, Taihao Li

    Abstract: Multimodal learning is susceptible to modality missing, which poses a major obstacle for its practical applications and, thus, invigorates increasing research interest. In this paper, we investigate two challenging problems: 1) when modality missing exists in the training data, how to exploit the incomplete samples while guaranteeing that they are properly supervised? 2) when the missing rates of…

    Submitted 16 December, 2023; originally announced December 2023.

  9. arXiv:2312.07048  [pdf, other]

    cs.CV

    Edge Wasserstein Distance Loss for Oriented Object Detection

    Authors: Yuke Zhu, Yumeng Ruan, Zihua Xiong, Sheng Guo

    Abstract: Regression loss design is an essential topic for oriented object detection. Due to the periodicity of the angle and the ambiguity of width and height definition, traditional L1-distance loss and its variants have suffered from the metric discontinuity and the square-like problem. As a solution, the distribution-based methods show significant advantages by representing oriented boxes as distri…

    Submitted 12 December, 2023; originally announced December 2023.

  10. arXiv:2311.18358  [pdf, other]

    cs.CV

    TIDE: Test Time Few Shot Object Detection

    Authors: Weikai Li, Hongfeng Wei, Yanlai Wu, Jie Yang, Yudi Ruan, Yuan Li, Ying Tang

    Abstract: Few-shot object detection (FSOD) aims to extract semantic knowledge from limited object instances of novel categories within a target domain. Recent advances in FSOD focus on fine-tuning the base model based on a few objects via meta-learning or data augmentation. Despite their success, the majority of them are grounded with parametric readjustment to generalize on novel objects, which face consid…

    Submitted 30 November, 2023; originally announced November 2023.

  11. arXiv:2310.13901  [pdf, other]

    cs.LG eess.SY

    Towards Hyperparameter-Agnostic DNN Training via Dynamical System Insights

    Authors: Carmel Fiscko, Aayushya Agarwal, Yihan Ruan, Soummya Kar, Larry Pileggi, Bruno Sinopoli

    Abstract: We present a stochastic first-order optimization method specialized for deep neural networks (DNNs), ECCO-DNN. This method models the optimization variable trajectory as a dynamical system and develops a discretization algorithm that adaptively selects step sizes based on the trajectory's shape. This provides two key insights: designing the dynamical system for fast continuous-time convergence and…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 25 pages, 11 figures

  12. On Synthetic Data for Back Translation

    Authors: Jiahao Xu, Yubin Ruan, Wei Bi, Guoping Huang, Shuming Shi, Lihui Chen, Lemao Liu

    Abstract: Back translation (BT) is one of the most significant technologies in NMT research fields. Existing attempts on BT share a common characteristic: they employ either beam search or random sampling to generate synthetic data with a backward model, but little work has studied the role of synthetic data in the performance of BT. This motivates us to ask a fundamental question: what kind of synthetic da…

    Submitted 20 October, 2023; originally announced October 2023.

    Journal ref: In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 419--430, Seattle, United States. Association for Computational Linguistics

  13. arXiv:2310.05694  [pdf, other]

    cs.CL

    A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics

    Authors: Kai He, Rui Mao, Qika Lin, Yucheng Ruan, Xiang Lan, Mengling Feng, Erik Cambria

    Abstract: The utilization of large language models (LLMs) in the Healthcare domain has generated both excitement and concern due to their ability to effectively respond to free-text queries with certain professional knowledge. This survey outlines the capabilities of the currently developed LLMs for Healthcare and explicates their development process, with the aim of providing an overview of the development…

    Submitted 11 June, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  14. arXiv:2309.15817  [pdf, other]

    cs.AI cs.CL cs.LG

    Identifying the Risks of LM Agents with an LM-Emulated Sandbox

    Authors: Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto

    Abstract: Recent advances in Language Model (LM) agents and tool use, exemplified by applications like ChatGPT Plugins, enable a rich set of capabilities but also amplify potential risks - such as leaking private data or causing financial losses. Identifying these risks is labor-intensive, necessitating implementing the tools, setting up the environment for each test scenario manually, and finding risky cas…

    Submitted 17 May, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  15. arXiv:2308.09346  [pdf, other]

    cs.CV

    Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching

    Authors: Jiazheng Xing, Mengmeng Wang, Yudi Ruan, Bofan Chen, Yaowei Guo, Boyu Mu, Guang Dai, Jingdong Wang, Yong Liu

    Abstract: Class prototype construction and matching are core aspects of few-shot action recognition. Previous methods mainly focus on designing spatiotemporal relation modeling modules or complex temporal alignment algorithms. Despite the promising results, they ignored the value of class prototype construction and matching, leading to unsatisfactory performance in recognizing similar categories in every ta…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV2023

  16. arXiv:2305.11304  [pdf, other]

    cs.LG

    pTSE: A Multi-model Ensemble Method for Probabilistic Time Series Forecasting

    Authors: Yunyi Zhou, Zhixuan Chu, Yijia Ruan, Ge Jin, Yuchen Huang, Sheng Li

    Abstract: Various probabilistic time series forecasting models have sprung up and shown remarkably good performance. However, the choice of model highly relies on the characteristics of the input time series and the fixed distribution that the model is based on. Due to the fact that the probability distributions cannot be averaged over different models straightforwardly, the current time series model ensemb…

    Submitted 30 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: The 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)

  17. PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba

    Authors: Jianying Wang, Tongliang Li, Haoze Song, Xinjun Yang, Wenchao Zhou, Feifei Li, Baoyue Yan, Qianqian Wu, Yukun Liang, Chengjun Ying, Yujie Wang, Baokai Chen, Chang Cai, Yubin Ruan, Xiaoyi Weng, Shibin Chen, Liang Yin, Chengzhong Yang, Xin Cai, Hongyan Xing, Nanlong Yu, Xiaofei Chen, Dapeng Huang, Jianling Sun

    Abstract: Cloud-native databases have become the de-facto choice for mission-critical applications on the cloud due to the need for high availability, resource elasticity, and cost efficiency. Meanwhile, driven by the increasing connectivity between data generation and analysis, users prefer a single database to efficiently process both OLTP and OLAP workloads, which enhances data freshness and reduces the…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 14 pages, 16 figures, to be published in ACM SIGMOD 2023

  18. arXiv:2305.05835  [pdf, other]

    eess.IV cs.CV cs.LG

    Reference-based OCT Angiogram Super-resolution with Learnable Texture Generation

    Authors: Yuyan Ruan, Dawei Yang, Ziqi Tang, An Ran Ran, Carol Y. Cheung, Hao Chen

    Abstract: Optical coherence tomography angiography (OCTA) is a new imaging modality to visualize retinal microvasculature and has been readily adopted in clinics. High-resolution OCT angiograms are important to qualitatively and quantitatively identify potential biomarkers for different retinal diseases accurately. However, one significant problem of OCTA is the inevitable decrease in resolution when increa…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 12 pages, 11 figures

    MSC Class: 68T07 ACM Class: I.2; I.4

  19. arXiv:2303.17408  [pdf, other]

    cs.CL

    P-Transformer: A Prompt-based Multimodal Transformer Architecture For Medical Tabular Data

    Authors: Yucheng Ruan, Xiang Lan, Daniel J. Tan, Hairil Rizal Abdullah, Mengling Feng

    Abstract: Medical tabular data, abundant in Electronic Health Records (EHRs), is a valuable resource for diverse medical tasks such as risk prediction. While deep learning approaches, particularly transformer-based models, have shown remarkable performance in tabular data prediction, problems remain before existing work can be effectively adapted to the medical domain, such as under-utilization…

    Submitted 9 January, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

  20. arXiv:2302.03916  [pdf, other]

    cs.LG

    QS-ADN: Quasi-Supervised Artifact Disentanglement Network for Low-Dose CT Image Denoising by Local Similarity Among Unpaired Data

    Authors: Yuhui Ruan, Qiao Yuan, Chuang Niu, Chen Li, Yudong Yao, Ge Wang, Yueyang Teng

    Abstract: Deep learning has been successfully applied to low-dose CT (LDCT) image denoising for reducing potential radiation risk. However, the widely reported supervised LDCT denoising networks require a training set of paired images, which is expensive to obtain and cannot be perfectly simulated. Unsupervised learning utilizes unpaired data and is highly desirable for LDCT denoising. As an example, an art…

    Submitted 8 February, 2023; originally announced February 2023.

  21. arXiv:2211.09981  [pdf, other]

    cs.LG cs.AI stat.ML

    Weighted Ensemble Self-Supervised Learning

    Authors: Yangjun Ruan, Saurabh Singh, Warren Morningstar, Alexander A. Alemi, Sergey Ioffe, Ian Fischer, Joshua V. Dillon

    Abstract: Ensembling has proven to be a powerful technique for boosting model performance, uncertainty estimation, and robustness in supervised learning. Advances in self-supervised learning (SSL) enable leveraging large unlabeled corpora for state-of-the-art few-shot and supervised learning performance. In this paper, we explore how ensemble methods can improve recent SSL techniques by developing a framewo…

    Submitted 9 April, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted by ICLR 2023

  22. arXiv:2211.08682  [pdf, other]

    cs.CL

    Parameter-Efficient Tuning on Layer Normalization for Pre-trained Language Models

    Authors: Wang Qi, Yu-Ping Ruan, Yuan Zuo, Taihao Li

    Abstract: Conventional fine-tuning encounters increasing difficulties given the size of current Pre-trained Language Models, which makes parameter-efficient tuning the focal point of frontier research. Previous methods in this field add tunable adapters into the MHA and/or FFN of Transformer blocks to enable PLMs to achieve transferability. However, as an important part of Transformer architecture, the powe…

    Submitted 9 December, 2022; v1 submitted 16 November, 2022; originally announced November 2022.

  23. arXiv:2210.06361  [pdf, other]

    cs.CV

    MFFN: Multi-view Feature Fusion Network for Camouflaged Object Detection

    Authors: Dehua Zheng, Xiaochen Zheng, Laurence T. Yang, Yuan Gao, Chenlu Zhu, Yiheng Ruan

    Abstract: Recent research about camouflaged object detection (COD) aims to segment highly concealed objects hidden in complex surroundings. The tiny, fuzzy camouflaged objects result in visually indistinguishable properties. However, current single-view COD detectors are sensitive to background distractors. Therefore, blurred boundaries and variable shapes of the camouflaged objects are challenging to be fu…

    Submitted 19 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: In Proceedings of the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

  24. arXiv:2206.08497  [pdf, other]

    cs.GR cs.CV

    Unsupervised Kinematic Motion Detection for Part-segmented 3D Shape Collections

    Authors: Xianghao Xu, Yifan Ruan, Srinath Sridhar, Daniel Ritchie

    Abstract: 3D models of manufactured objects are important for populating virtual worlds and for synthetic data generation for vision and robotics. To be most useful, such objects should be articulated: their parts should move when interacted with. While articulated object datasets exist, creating them is labor-intensive. Learning-based prediction of part motions can help, but all existing methods require an…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: SIGGRAPH 2022

  25. arXiv:2204.07763  [pdf, other]

    cs.SD cs.LG eess.AS

    UFRC: A Unified Framework for Reliable COVID-19 Detection on Crowdsourced Cough Audio

    Authors: Jiangeng Chang, Yucheng Ruan, Cui Shaoze, John Soong Tshon Yit, Mengling Feng

    Abstract: We suggest a unified system with core components of data augmentation, ImageNet-pretrained ResNet-50, cost-sensitive loss, deep ensemble learning, and uncertainty estimation to quickly and consistently detect COVID-19 using acoustic evidence. To increase the model's capacity to identify a minority class (infected samples), data augmentation and cost-sensitive loss are incorporated. In the COVID-…

    Submitted 30 June, 2022; v1 submitted 16 April, 2022; originally announced April 2022.

  26. arXiv:2202.08396  [pdf, other]

    cs.LG cs.AI cs.LO

    Augment with Care: Contrastive Learning for Combinatorial Problems

    Authors: Haonan Duan, Pashootan Vaezipoor, Max B. Paulus, Yangjun Ruan, Chris J. Maddison

    Abstract: Supervised learning can improve the design of state-of-the-art solvers for combinatorial problems, but labelling large numbers of combinatorial instances is often impractical due to exponential worst-case complexity. Inspired by the recent success of contrastive pre-training for images, we conduct a scientific study of the effect of augmentation design on contrastive pre-training for the Boolean s…

    Submitted 20 June, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  27. arXiv:2201.07366  [pdf, other]

    cs.CV

    TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval

    Authors: Yue Ruan, Han-Hung Lee, Yiming Zhang, Ke Zhang, Angel X. Chang

    Abstract: Text-to-shape retrieval is an increasingly relevant problem with the growth of 3D shape data. Recent work on contrastive losses for learning joint embeddings over multimodal data has been successful at tasks such as retrieval and classification. Thus far, work on joint representation learning for 3D shapes and text has focused on improving embeddings through modeling of complex attention between r…

    Submitted 27 December, 2023; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: Accepted by WACV 2024

  28. arXiv:2201.00057  [pdf, other]

    cs.LG cs.AI cs.IT stat.ML

    Optimal Representations for Covariate Shift

    Authors: Yangjun Ruan, Yann Dubois, Chris J. Maddison

    Abstract: Machine learning systems often experience a distribution shift between training and testing. In this paper, we introduce a simple variational objective whose optima are exactly the set of all representations on which risk minimizers are guaranteed to be robust to any distribution shift that preserves the Bayes predictor, e.g., covariate shifts. Our objective has two components. First, a representa…

    Submitted 14 March, 2022; v1 submitted 31 December, 2021; originally announced January 2022.

    Comments: Accepted at ICLR 2022

  29. arXiv:2112.06053  [pdf, other]

    cs.LG

    FedSoft: Soft Clustered Federated Learning with Proximal Local Updating

    Authors: Yichen Ruan, Carlee Joe-Wong

    Abstract: Traditionally, clustered federated learning groups clients with the same data distribution into a cluster, so that every client is uniquely associated with one data distribution and helps train a model for this distribution. We relax this hard association assumption to soft clustered federated learning, which allows every local dataset to follow a mixture of multiple source distributions. We propo…

    Submitted 22 March, 2022; v1 submitted 11 December, 2021; originally announced December 2021.

  30. arXiv:2105.14879  [pdf, other]

    cs.CL

    SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning

    Authors: Boyuan Zheng, Xiaoyu Yang, Yu-Ping Ruan, Zhenhua Ling, Quan Liu, Si Wei, Xiaodan Zhu

    Abstract: This paper introduces the SemEval-2021 shared task 4: Reading Comprehension of Abstract Meaning (ReCAM). This shared task is designed to help evaluate the ability of machines in representing and understanding abstract concepts. Given a passage and the corresponding question, a participating system is expected to choose the correct answer from five candidates of abstract concepts in a cloze-style m…

    Submitted 1 June, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

  31. Emotion-Regularized Conditional Variational Autoencoder for Emotional Response Generation

    Authors: Yu-Ping Ruan, Zhen-Hua Ling

    Abstract: This paper presents an emotion-regularized conditional variational autoencoder (Emo-CVAE) model for generating emotional conversation responses. In conventional CVAE-based emotional response generation, emotion labels are simply used as additional conditions in prior, posterior and decoder networks. Considering that emotion styles are naturally entangled with semantic contents in the language spac…

    Submitted 18 April, 2021; originally announced April 2021.

    Comments: Accepted by IEEE Transactions on Affective Computing

  32. arXiv:2102.11086  [pdf, other]

    cs.LG cs.AI cs.IT stat.CO

    Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding

    Authors: Yangjun Ruan, Karen Ullrich, Daniel Severo, James Townsend, Ashish Khisti, Arnaud Doucet, Alireza Makhzani, Chris J. Maddison

    Abstract: Latent variable models have been successfully applied in lossless compression with the bits-back coding algorithm. However, bits-back suffers from an increase in the bitrate equal to the KL divergence between the approximate posterior and the true posterior. In this paper, we show how to remove this gap asymptotically by deriving bits-back coding algorithms from tighter variational bounds. The key…

    Submitted 14 June, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

  33. arXiv:2007.01980  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design

    Authors: Yufei Ruan, Jiaqi Yang, Yuan Zhou

    Abstract: Motivated by practical needs such as large-scale learning, we study the impact of adaptivity constraints on linear contextual bandits, a central problem in online active learning. We consider two popular limited adaptivity models in the literature: batch learning and rare policy switches. We show that, when the context vectors are adversarially chosen in $d$-dimensional linear contextual bandits, the…

    Submitted 23 April, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

  34. arXiv:2006.06954  [pdf, other]

    cs.LG stat.ML

    Towards Flexible Device Participation in Federated Learning

    Authors: Yichen Ruan, Xiaoxi Zhang, Shu-Che Liang, Carlee Joe-Wong

    Abstract: Traditional federated learning algorithms impose strict requirements on the participation rates of devices, which limit the potential reach of federated learning. This paper extends the current learning paradigm to include devices that may become inactive, compute incomplete updates, and depart or arrive in the middle of training. We derive analytical results to illustrate how allowing more flexib…

    Submitted 25 February, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

  35. arXiv:2004.08488  [pdf, other]

    cs.DC

    Network-Aware Optimization of Distributed Learning for Fog Computing

    Authors: Su Wang, Yichen Ruan, Yuwei Tu, Satyavrat Wagle, Christopher G. Brinton, Carlee Joe-Wong

    Abstract: Fog computing promises to enable machine learning tasks to scale to large amounts of data by distributing processing across connected devices. Two key challenges to achieving this goal are heterogeneity in devices' compute resources and topology constraints on which devices can communicate with each other. We address these challenges by developing the first network-aware distributed learning optimi…

    Submitted 21 April, 2021; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: Accepted for publication in IEEE/ACM Transactions on Networking (16 pages)

  36. arXiv:2004.01940  [pdf, other]

    cs.CL

    Pre-Trained and Attention-Based Neural Networks for Building Noetic Task-Oriented Dialogue Systems

    Authors: Jia-Chen Gu, Tianda Li, Quan Liu, Xiaodan Zhu, Zhen-Hua Ling, Yu-Ping Ruan

    Abstract: The NOESIS II challenge, as the Track 2 of the 8th Dialogue System Technology Challenges (DSTC 8), is the extension of DSTC 7. This track incorporates new elements that are vital for the creation of a deployed task-oriented dialogue system. This paper describes our systems that are evaluated on all subtasks under this challenge. We study the problem of employing pre-trained attention-based network…

    Submitted 4 April, 2020; originally announced April 2020.

    Comments: Accepted by AAAI 2020, Workshop on DSTC8

  37. arXiv:2002.00943  [pdf, other]

    quant-ph cs.CC physics.app-ph

    Quantum approximate algorithm for NP optimization problems with constraints

    Authors: Yue Ruan, Samuel Marsh, Xilin Xue, Xi Li, Zhihao Liu, Jingbo Wang

    Abstract: The Quantum Approximate Optimization Algorithm (QAOA) is an algorithmic framework for finding approximate solutions to combinatorial optimization problems, derived from an approximation to the Quantum Adiabatic Algorithm (QAA). In solving combinatorial optimization problems with constraints in the context of QAOA or QAA, one needs to find a way to encode problem constraints into the scheme. In thi…

    Submitted 31 January, 2020; originally announced February 2020.

    Comments: 27 pages, 10 figures (including 27 subfigures); submitted to Quantum Information Processing

  38. arXiv:2002.00181  [pdf, other]

    cs.CL

    Fine-Tuning BERT for Schema-Guided Zero-Shot Dialogue State Tracking

    Authors: Yu-Ping Ruan, Zhen-Hua Ling, Jia-Chen Gu, Quan Liu

    Abstract: We present our work on Track 4 in the Dialogue System Technology Challenges 8 (DSTC8). The DSTC8-Track 4 aims to perform dialogue state tracking (DST) under the zero-shot settings, in which the model needs to generalize on unseen service APIs given a schema definition of these target APIs. Serving as the core for many virtual assistants such as Siri, Alexa, and Google Assistant, the DST keeps trac…

    Submitted 1 February, 2020; originally announced February 2020.

    Comments: Presented at the DSTC8 Workshop @ AAAI-2020

  39. arXiv:1911.04862  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    An End-to-end Approach for Lexical Stress Detection based on Transformer

    Authors: Yong Ruan, Xiangdong Wang, Hong Liu, Zhigang Ou, Yun Gao, Jianfeng Cheng, Yueliang Qian

    Abstract: The dominant automatic lexical stress detection method is to split the utterance into syllable segments using the phoneme sequence and its time-aligned boundaries. Features are then extracted from each syllable, and a classification method is used to classify the lexical stress. However, we cannot get very accurate time boundaries for each phoneme, and we have to design some features in the syllable segments to classi…

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: Submission to ICASSP 2020

  40. arXiv:1910.09464  [pdf, other]

    cs.LG stat.ML

    Learning to Learn by Zeroth-Order Oracle

    Authors: Yangjun Ruan, Yuanhao Xiong, Sashank Reddi, Sanjiv Kumar, Cho-Jui Hsieh

    Abstract: In the learning to learn (L2L) framework, we cast the design of optimization algorithms as a machine learning problem and use deep neural networks to learn the update rules. In this paper, we extend the L2L framework to the zeroth-order (ZO) optimization setting, where no explicit gradient information is available. Our learned optimizer, modeled as a recurrent neural network (RNN), first approximates gr…

    Submitted 7 February, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Published as a conference paper at ICLR 2020

  41. arXiv:1909.00948  [pdf, other]

    cs.CV

    HarDNet: A Low Memory Traffic Network

    Authors: Ping Chao, Chao-Yang Kao, Yu-Shan Ruan, Chien-Hsiang Huang, Youn-Long Lin

    Abstract: State-of-the-art neural network architectures such as ResNet, MobileNet, and DenseNet have achieved outstanding accuracy over low MACs and small model size counterparts. However, these metrics might not be accurate for predicting the inference time. We suggest that memory traffic for accessing intermediate feature maps can be a factor dominating the inference latency, especially in such tasks as r…

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: ICCV 2019

  42. arXiv:1905.09263  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    FastSpeech: Fast, Robust and Controllable Text to Speech

    Authors: Yi Ren, Yangjun Ruan, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu

    Abstract: Neural network based end-to-end text to speech (TTS) has significantly improved the quality of synthesized speech. Prominent methods (e.g., Tacotron 2) usually first generate mel-spectrogram from text, and then synthesize speech from the mel-spectrogram using vocoder such as WaveNet. Compared with traditional concatenative and statistical parametric approaches, neural network based end-to-end mode…

    Submitted 20 November, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: Accepted by NeurIPS 2019

  43. arXiv:1904.10610  [pdf, other]

    cs.CL cs.AI

    Condition-Transforming Variational AutoEncoder for Conversation Response Generation

    Authors: Yu-Ping Ruan, Zhen-Hua Ling, Quan Liu, Zhigang Chen, Nitin Indurkhya

    Abstract: This paper proposes a new model, called condition-transforming variational autoencoder (CTVAE), to improve the performance of conversation response generation using conditional variational autoencoders (CVAEs). In conventional CVAEs, the prior distribution of latent variable z follows a multivariate Gaussian distribution with mean and variance modulated by the input conditions. Previous work foun…

    Submitted 23 April, 2019; originally announced April 2019.

    Comments: ICASSP 2019, oral

  44. arXiv:1904.09705  [pdf, other]

    cs.CL cs.AI

    Exploring Unsupervised Pretraining and Sentence Structure Modelling for Winograd Schema Challenge

    Authors: Yu-Ping Ruan, Xiaodan Zhu, Zhen-Hua Ling, Zhan Shi, Quan Liu, Si Wei

    Abstract: Winograd Schema Challenge (WSC) was proposed as an AI-hard problem in testing computers' intelligence on common sense representation and reasoning. This paper presents the new state-of-the-art on WSC, achieving an accuracy of 71.1%. We demonstrate that the leading performance benefits from jointly modelling sentence structures, utilizing knowledge learned from cutting-edge pretraining models, and p…

    Submitted 21 April, 2019; originally announced April 2019.

    Comments: 7 pages

  45. arXiv:1901.09444  [pdf, other]

    cs.CL

    Promoting Diversity for End-to-End Conversation Response Generation

    Authors: Yu-Ping Ruan, Zhen-Hua Ling, Quan Liu, Jia-Chen Gu, Xiaodan Zhu

    Abstract: We present our work on Track 2 in the Dialog System Technology Challenges 7 (DSTC7). The DSTC7-Track 2 aims to evaluate the response generation of fully data-driven conversation models in knowledge-grounded settings, which provides the contextual-relevant factual texts. The Sequence-to-Sequence models have been widely used for end-to-end generative conversation modelling and achieved impressive res…

    Submitted 30 January, 2019; v1 submitted 27 January, 2019; originally announced January 2019.

    Comments: To be presented at the AAAI19 Workshop on Dialog System Technology Challenges 7 (DSTC7)

  46. arXiv:1812.00686  [pdf, other]

    cs.CL

    Building Sequential Inference Models for End-to-End Response Selection

    Authors: Jia-Chen Gu, Zhen-Hua Ling, Yu-Ping Ruan, Quan Liu

    Abstract: This paper presents an end-to-end response selection model for Track 1 of the 7th Dialogue System Technology Challenges (DSTC7). This task focuses on selecting the correct next utterance from a set of candidates given a partial conversation. We propose an end-to-end neural network based on enhanced sequential inference model (ESIM) for this task. Our proposed model differs from the original ESIM m…

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: Accepted by AAAI 2019, Workshop on DSTC7

  47. arXiv:1810.06118  [pdf, other]

    cond-mat.mtrl-sci cs.LG physics.data-an stat.ML

    Learning to fail: Predicting fracture evolution in brittle material models using recurrent graph convolutional neural networks

    Authors: Max Schwarzer, Bryce Rogan, Yadong Ruan, Zhengming Song, Diana Y. Lee, Allon G. Percus, Viet T. Chau, Bryan A. Moore, Esteban Rougier, Hari S. Viswanathan, Gowri Srinivasan

    Abstract: We propose a machine learning approach to address a key challenge in materials science: predicting how fractures propagate in brittle materials under stress, and how these materials ultimately fail. Our methods use deep learning and train on simulation data from high-fidelity models, emulating the results of these models while avoiding the overwhelming computational demands associated with running…

    Submitted 15 March, 2019; v1 submitted 14 October, 2018; originally announced October 2018.

    Report number: LA-UR-18-29693

    Journal ref: Computational Materials Science 162, 322-332 (2019)

  48. A Sequential Neural Encoder with Latent Structured Description for Modeling Sentences

    Authors: Yu-Ping Ruan, Qian Chen, Zhen-Hua Ling

    Abstract: In this paper, we propose a sequential neural encoder with latent structured description (SNELSD) for modeling sentences. This model introduces latent chunk-level representations into conventional sequential neural encoders, i.e., recurrent neural networks (RNNs) with long short-term memory (LSTM) units, to consider the compositionality of languages in semantic modeling. An SNELSD model has a hier…

    Submitted 15 November, 2017; originally announced November 2017.

    Comments: Accepted by IEEE Transactions on Audio, Speech, and Language Processing

  49. arXiv:1608.02021  [pdf, ps, other]

    cs.SI cs.IR

    An Integrated Recommender Algorithm for Rating Prediction

    Authors: Yefeng Ruan, Tzu-Chun Lin

    Abstract: Recommender systems are currently widely used in many e-commerce systems, such as Amazon, eBay, and so on. They aim to help users find items in which they may be interested. In the literature, neighborhood-based collaborative filtering and matrix factorization are two common methods used in recommender systems. In this paper, we combine these two methods with personalized weights on them. Rather than…

    Submitted 5 August, 2016; originally announced August 2016.

  50. arXiv:1301.2279  [pdf]

    cs.AI

    A Bayesian Approach to Tackling Hard Computational Problems

    Authors: Eric J. Horvitz, Yongshao Ruan, Carla P. Gomes, Henry Kautz, Bart Selman, David Maxwell Chickering

    Abstract: We are developing a general framework for using learned Bayesian models for decision-theoretic control of search and reasoning algorithms. We illustrate the approach on the specific task of controlling both general and domain-specific solvers on a hard class of structured constraint satisfaction problems. A successful strategy for reducing the high (and even infinite) variance in running time typi…

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-235-244