Skip to main content

Showing 1–47 of 47 results for author: Ying, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.19885  [pdf, other

    cs.LG cs.RO

    Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning

    Authors: Hengkai Tan, Songming Liu, Kai Ma, Chengyang Ying, Xingxing Zhang, Hang Su, Jun Zhu

    Abstract: Transformer has shown promise in reinforcement learning to model time-varying features for obtaining generalized low-level robot policies on diverse robotics datasets in embodied learning. However, it still suffers from the issues of low data efficiency and high inference latency. In this paper, we propose to investigate the task from a new perspective of the frequency domain. We first observe tha… ▽ More

    Submitted 5 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2405.19600  [pdf, ps, other

    cs.LG cs.AI

    Do spectral cues matter in contrast-based graph self-supervised learning?

    Authors: Xiangru Jian, Xinjian Zhao, Wei Pang, Chaolong Ying, Yimu Wang, Yaoyao Xu, Tianshu Yu

    Abstract: The recent surge in contrast-based graph self-supervised learning has prominently featured an intensified exploration of spectral cues. However, an intriguing paradox emerges, as methods grounded in seemingly conflicting assumptions or heuristic approaches regarding the spectral domain demonstrate notable enhancements in learning performance. This paradox prompts a critical inquiry into the genuin… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2405.14073  [pdf, other

    cs.LG

    PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning

    Authors: Chengyang Ying, Zhongkai Hao, Xinning Zhou, Xuezhou Xu, Hang Su, Xingxing Zhang, Jun Zhu

    Abstract: Designing generalizable agents capable of adapting to diverse embodiments has achieved significant attention in Reinforcement Learning (RL), which is critical for deploying RL agents in various real-world applications. Previous Cross-Embodiment RL approaches have focused on transferring knowledge across embodiments within specific tasks. These methods often result in knowledge tightly coupled with… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2405.08816  [pdf, other

    cs.CV cs.RO

    The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

    Authors: Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang , et al. (66 additional authors not shown)

    Abstract: In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that c… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICRA 2024; 32 pages, 24 figures, 5 tables; Code at https://robodrive-24.github.io/

  5. arXiv:2404.12186  [pdf, other

    cs.LG cs.CR

    Privacy-Preserving UCB Decision Process Verification via zk-SNARKs

    Authors: Xikun Jiang, He Lyu, Chenhao Ying, Yibin Xu, Boris Düdder, Yuan Luo

    Abstract: With the increasingly widespread application of machine learning, how to strike a balance between protecting the privacy of data and algorithm parameters and ensuring the verifiability of machine learning has always been a challenge. This study explores the intersection of reinforcement learning and data privacy, specifically addressing the Multi-Armed Bandit (MAB) problem with the Upper Confidenc… ▽ More

    Submitted 6 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  6. arXiv:2404.07794  [pdf, other

    cs.CV

    DGMamba: Domain Generalization via Generalized State Space Model

    Authors: Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, Shuicheng Yan

    Abstract: Domain generalization~(DG) aims at solving distribution shift problems in various scenes. Existing approaches are based on Convolution Neural Networks (CNNs) or Vision Transformers (ViTs), which suffer from limited receptive fields or quadratic complexities issues. Mamba, as an emerging state space model (SSM), possesses superior linear complexity and global receptive fields. Despite this, it can… ▽ More

    Submitted 9 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  7. arXiv:2403.10652  [pdf, other

    cs.LG q-fin.RM

    Improving Fairness in Credit Lending Models using Subgroup Threshold Optimization

    Authors: Cecilia Ying, Stephen Thomas

    Abstract: In an effort to improve the accuracy of credit lending decisions, many financial intuitions are now using predictions from machine learning models. While such predictions enjoy many advantages, recent research has shown that the predictions have the potential to be biased and unfair towards certain subgroups of the population. To combat this, several techniques have been introduced to help remove… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Neural Information Processing Systems (NeurIPS) Workshop in Strategic ML

  8. arXiv:2403.03542  [pdf, other

    cs.LG math.NA

    DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training

    Authors: Zhongkai Hao, Chang Su, Songming Liu, Julius Berner, Chengyang Ying, Hang Su, Anima Anandkumar, Jian Song, Jun Zhu

    Abstract: Pre-training has been investigated to improve the efficiency and performance of training neural operators in data-scarce settings. However, it is largely in its infancy due to the inherent complexity and diversity, such as long trajectories, multiple scales and varying dimensions of partial differential equations (PDEs) data. In this paper, we present a new auto-regressive denoising pre-training s… ▽ More

    Submitted 6 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  9. arXiv:2402.16402  [pdf, other

    cs.LG cs.AI

    Graph Learning with Distributional Edge Layouts

    Authors: Xinjian Zhao, Chaolong Ying, Tianshu Yu

    Abstract: Graph Neural Networks (GNNs) learn from graph-structured data by passing local messages between neighboring nodes along edges on certain topological layouts. Typically, these topological layouts in modern GNNs are deterministically computed (e.g., attention-based GNNs) or locally sampled (e.g., GraphSage) under heuristic assumptions. In this paper, we for the first time pose that these layouts can… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 20 pages, 10 figures

  10. arXiv:2402.16346  [pdf, other

    cs.LG math.AT

    Boosting Graph Pooling with Persistent Homology

    Authors: Chaolong Ying, Xinjian Zhao, Tianshu Yu

    Abstract: Recently, there has been an emerging trend to integrate persistent homology (PH) into graph neural networks (GNNs) to enrich expressive power. However, naively plugging PH features into GNN layers always results in marginal improvement with low interpretability. In this paper, we investigate a novel mechanism for injecting global topological invariance into pooling layers using PH, motivated by th… ▽ More

    Submitted 1 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  11. arXiv:2309.16483  [pdf, other

    cs.CV

    Rethinking Domain Generalization: Discriminability and Generalizability

    Authors: Shaocong Long, Qianyu Zhou, Chenhao Ying, Lizhuang Ma, Yuan Luo

    Abstract: Domain generalization (DG) endeavors to develop robust models that possess strong generalizability while preserving excellent discriminability. Nonetheless, pivotal DG techniques tend to improve the feature generalizability by learning domain-invariant representations, inadvertently overlooking the feature discriminability. On the one hand, the simultaneous attainment of generalizability and discr… ▽ More

    Submitted 30 November, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

  12. arXiv:2309.16460  [pdf, other

    cs.CV

    Diverse Target and Contribution Scheduling for Domain Generalization

    Authors: Shaocong Long, Qianyu Zhou, Chenhao Ying, Lizhuang Ma, Yuan Luo

    Abstract: Generalization under the distribution shift has been a great challenge in computer vision. The prevailing practice of directly employing the one-hot labels as the training targets in domain generalization~(DG) can lead to gradient conflicts, making it insufficient for capturing the intrinsic class characteristics and hard to increase the intra-class variation. Besides, existing methods in DG mostl… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  13. arXiv:2309.09985  [pdf, other

    physics.comp-ph cs.AI

    Molecular Conformation Generation via Shifting Scores

    Authors: Zihan Zhou, Ruiying Liu, Chaolong Ying, Ruimao Zhang, Tianshu Yu

    Abstract: Molecular conformation generation, a critical aspect of computational chemistry, involves producing the three-dimensional conformer geometry for a given molecule. Generating molecular conformation via diffusion requires learning to reverse a noising process. Diffusion on inter-atomic distances instead of conformation preserves SE(3)-equivalence and shows superior performance compared to alternativ… ▽ More

    Submitted 7 October, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 18 pages, 7 figures

  14. arXiv:2305.18694  [pdf, other

    cs.LG cs.AI

    NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data

    Authors: Songming Liu, Zhongkai Hao, Chengyang Ying, Hang Su, Ze Cheng, Jun Zhu

    Abstract: The neural operator has emerged as a powerful tool in learning map**s between function spaces in PDEs. However, when faced with real-world physical data, which are often highly non-uniformly distributed, it is challenging to use mesh-based techniques such as the FFT. To address this, we introduce the Non-Uniform Neural Operator (NUNO), a comprehensive framework designed for efficient operator le… ▽ More

    Submitted 31 May, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  15. arXiv:2305.16793  [pdf, other

    cs.GT cs.CR

    Incentive Mechanism for Uncertain Tasks under Differential Privacy

    Authors: Xikun Jiang, Chenhao Ying, Lei Li, Boris Düdder, Haiqin Wu, Haiming **, Yuan Luo

    Abstract: Mobile crowd sensing (MCS) has emerged as an increasingly popular sensing paradigm due to its cost-effectiveness. This approach relies on platforms to outsource tasks to participating workers when prompted by task publishers. Although incentive mechanisms have been devised to foster widespread participation in MCS, most of them focus only on static tasks (i.e., tasks for which the timing and type… ▽ More

    Submitted 6 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  16. PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba

    Authors: Jianying Wang, Tongliang Li, Haoze Song, Xinjun Yang, Wenchao Zhou, Feifei Li, Baoyue Yan, Qianqian Wu, Yukun Liang, Chengjun Ying, Yujie Wang, Baokai Chen, Chang Cai, Yubin Ruan, Xiaoyi Weng, Shibin Chen, Liang Yin, Chengzhong Yang, Xin Cai, Hongyan Xing, Nanlong Yu, Xiaofei Chen, Dapeng Huang, Jianling Sun

    Abstract: Cloud-native databases have become the de-facto choice for mission-critical applications on the cloud due to the need for high availability, resource elasticity, and cost efficiency. Meanwhile, driven by the increasing connectivity between data generation and analysis, users prefer a single database to efficiently process both OLTP and OLAP workloads, which enhances data freshness and reduces the… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 14 pages, 16 figures, to be published in ACM SIGMOD 2023

  17. arXiv:2303.05092  [pdf, other

    cs.LG

    Task Aware Dreamer for Task Generalization in Reinforcement Learning

    Authors: Chengyang Ying, Zhongkai Hao, Xinning Zhou, Hang Su, Songming Liu, Dong Yan, Jun Zhu

    Abstract: A long-standing goal of reinforcement learning is to acquire agents that can learn on training tasks and generalize well on unseen tasks that may share a similar dynamic but with different reward functions. The ability to generalize across tasks is important as it determines an agent's adaptability to real-world scenarios where reward mechanisms might vary. In this work, we first show that trainin… ▽ More

    Submitted 2 February, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

  18. arXiv:2302.14376  [pdf, other

    cs.LG math.NA physics.comp-ph

    GNOT: A General Neural Operator Transformer for Operator Learning

    Authors: Zhongkai Hao, Zhengyi Wang, Hang Su, Chengyang Ying, Yinpeng Dong, Songming Liu, Ze Cheng, Jian Song, Jun Zhu

    Abstract: Learning partial differential equations' (PDEs) solution operators is an essential problem in machine learning. However, there are several challenges for learning operators in practical applications like the irregular mesh, multiple input functions, and complexity of the PDEs' solution. To address these challenges, we propose a general neural operator transformer (GNOT), a scalable and effective t… ▽ More

    Submitted 14 June, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

  19. arXiv:2211.08064  [pdf, other

    cs.LG cs.AI cs.CV math.NA

    Physics-Informed Machine Learning: A Survey on Problems, Methods and Applications

    Authors: Zhongkai Hao, Songming Liu, Yichi Zhang, Chengyang Ying, Yao Feng, Hang Su, Jun Zhu

    Abstract: Recent advances of data-driven machine learning have revolutionized fields like computer vision, reinforcement learning, and many scientific and engineering domains. In many real-world and scientific problems, systems that generate data are governed by physical laws. Recent work shows that it provides potential benefits for machine learning models by incorporating the physical prior and collected… ▽ More

    Submitted 6 March, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  20. arXiv:2210.03837  [pdf, other

    eess.IV cs.CV

    Self-Supervised Deep Equilibrium Models for Inverse Problems with Theoretical Guarantees

    Authors: Weijie Gan, Chunwei Ying, Parna Eshraghi, Tongyao Wang, Cihat Eldeniz, Yuyang Hu, Jiaming Liu, Yasheng Chen, Hongyu An, Ulugbek S. Kamilov

    Abstract: Deep equilibrium models (DEQ) have emerged as a powerful alternative to deep unfolding (DU) for image reconstruction. DEQ models-implicit neural networks with effectively infinite number of layers-were shown to achieve state-of-the-art image reconstruction without the memory complexity associated with DU. While the performance of DEQ has been widely investigated, the existing work has primarily fo… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  21. arXiv:2210.03526  [pdf, other

    cs.LG

    A Unified Hard-Constraint Framework for Solving Geometrically Complex PDEs

    Authors: Songming Liu, Zhongkai Hao, Chengyang Ying, Hang Su, Jun Zhu, Ze Cheng

    Abstract: We present a unified hard-constraint framework for solving geometrically complex PDEs with neural networks, where the most commonly used Dirichlet, Neumann, and Robin boundary conditions (BCs) are considered. Specifically, we first introduce the "extra fields" from the mixed finite element method to reformulate the PDEs so as to equivalently transform the three types of BCs into linear equations.… ▽ More

    Submitted 4 June, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

    Journal ref: Advances in Neural Information Processing Systems 35 (2022): 20287-20299

  22. arXiv:2209.14548  [pdf, other

    cs.LG

    Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling

    Authors: Huayu Chen, Cheng Lu, Chengyang Ying, Hang Su, Jun Zhu

    Abstract: In offline reinforcement learning, weighted regression is a common method to ensure the learned policy stays close to the behavior policy and to prevent selecting out-of-sample actions. In this work, we show that due to the limited distributional expressivity of policy models, previous methods might still select unseen actions during training, which deviates from their initial motivation. To addre… ▽ More

    Submitted 27 February, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

  23. arXiv:2209.07075  [pdf, other

    cs.LG

    Bi-level Physics-Informed Neural Networks for PDE Constrained Optimization using Broyden's Hypergradients

    Authors: Zhongkai Hao, Chengyang Ying, Hang Su, Jun Zhu, Jian Song, Ze Cheng

    Abstract: Deep learning based approaches like Physics-informed neural networks (PINNs) and DeepONets have shown promise on solving PDE constrained optimization (PDECO) problems. However, existing methods are insufficient to handle those PDE constraints that have a complicated or nonlinear dependency on optimization targets. In this paper, we present a novel bi-level optimization framework to resolve the cha… ▽ More

    Submitted 11 April, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

  24. arXiv:2209.07074  [pdf, other

    cs.LG

    On the Reuse Bias in Off-Policy Reinforcement Learning

    Authors: Chengyang Ying, Zhongkai Hao, Xinning Zhou, Hang Su, Dong Yan, Jun Zhu

    Abstract: Importance sampling (IS) is a popular technique in off-policy evaluation, which re-weights the return of trajectories in the replay buffer to boost sample efficiency. However, training with IS can be unstable and previous attempts to address this issue mainly focus on analyzing the variance of IS. In this paper, we reveal that the instability is also related to a new notion of Reuse Bias of IS --… ▽ More

    Submitted 21 May, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Journal ref: IJCAI 2023

  25. Consistent Attack: Universal Adversarial Perturbation on Embodied Vision Navigation

    Authors: Chengyang Ying, You Qiaoben, Xinning Zhou, Hang Su, Wenbo Ding, Jianyong Ai

    Abstract: Embodied agents in vision navigation coupled with deep neural networks have attracted increasing attention. However, deep neural networks have been shown vulnerable to malicious adversarial noises, which may potentially cause catastrophic failures in Embodied Vision Navigation. Among different adversarial noises, universal adversarial perturbations (UAP), i.e., a constant image-agnostic perturbati… ▽ More

    Submitted 25 March, 2023; v1 submitted 12 June, 2022; originally announced June 2022.

    Journal ref: Pattern Recognition Letters (PRL), 2023

  26. arXiv:2206.04436  [pdf, other

    cs.LG cs.AI

    Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk

    Authors: Chengyang Ying, Xinning Zhou, Hang Su, Dong Yan, Ning Chen, Jun Zhu

    Abstract: Though deep reinforcement learning (DRL) has obtained substantial success, it may encounter catastrophic failures due to the intrinsic uncertainty of both transition and observation. Most of the existing methods for safe reinforcement learning can only handle transition disturbance or observation disturbance since these two kinds of disturbance affect different parts of the agent; besides, the pop… ▽ More

    Submitted 17 September, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Journal ref: IJCAI 2022

  27. arXiv:2206.04310  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    GSmooth: Certified Robustness against Semantic Transformations via Generalized Randomized Smoothing

    Authors: Zhongkai Hao, Chengyang Ying, Yinpeng Dong, Hang Su, Jun Zhu, Jian Song

    Abstract: Certified defenses such as randomized smoothing have shown promise towards building reliable machine learning systems against $\ell_p$-norm bounded attacks. However, existing methods are insufficient or unable to provably defend against semantic transformations, especially those without closed-form expressions (such as defocus blur and pixelate), which are more common in practice and often unrestr… ▽ More

    Submitted 29 June, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

  28. arXiv:2112.12961   

    stat.ML cs.LG

    Optimal Model Averaging of Support Vector Machines in Diverging Model Spaces

    Authors: Chaoxia Yuan, Chao Ying, Zhou Yu, Fang Fang

    Abstract: Support vector machine (SVM) is a powerful classification method that has achieved great success in many fields. Since its performance can be seriously impaired by redundant covariates, model selection techniques are widely used for SVM with high dimensional covariates. As an alternative to model selection, significant progress has been made in the area of model averaging in the past decades. Yet… ▽ More

    Submitted 22 July, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

    Comments: need to be improved further

  29. arXiv:2106.15860  [pdf, other

    cs.LG

    Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning

    Authors: You Qiaoben, Chengyang Ying, Xinning Zhou, Hang Su, Jun Zhu, Bo Zhang

    Abstract: Deep reinforcement learning models are vulnerable to adversarial attacks that can decrease a victim's cumulative expected reward by manipulating the victim's observations. Despite the efficiency of previous optimization-based methods for generating adversarial noise in supervised learning, such methods might not be able to achieve the lowest cumulative reward since they do not explore the environm… ▽ More

    Submitted 27 February, 2023; v1 submitted 30 June, 2021; originally announced June 2021.

  30. arXiv:2106.08279  [pdf, ps, other

    cs.LG

    First Place Solution of KDD Cup 2021 & OGB Large-Scale Challenge Graph Prediction Track

    Authors: Chengxuan Ying, Mingqi Yang, Shuxin Zheng, Guolin Ke, Shengjie Luo, Tianle Cai, Chenglin Wu, Yuxin Wang, Yanming Shen, Di He

    Abstract: In this technical report, we present our solution of KDD Cup 2021 OGB Large-Scale Challenge - PCQM4M-LSC Track. We adopt Graphormer and ExpC as our basic models. We train each model by 8-fold cross-validation, and additionally train two Graphormer models on the union of training and validation sets with different random seeds. For final submission, we use a naive ensemble for these 18 models by ta… ▽ More

    Submitted 20 June, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

  31. arXiv:2106.05234  [pdf, other

    cs.LG cs.AI

    Do Transformers Really Perform Bad for Graph Representation?

    Authors: Chengxuan Ying, Tianle Cai, Shengjie Luo, Shuxin Zheng, Guolin Ke, Di He, Yanming Shen, Tie-Yan Liu

    Abstract: The Transformer architecture has become a dominant choice in many domains, such as natural language processing and computer vision. Yet, it has not achieved competitive performance on popular leaderboards of graph-level prediction compared to mainstream GNN variants. Therefore, it remains a mystery how Transformers could perform well for graph representation learning. In this paper, we solve this… ▽ More

    Submitted 23 November, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Journal ref: NeurIPS 2021

  32. arXiv:2105.11271  [pdf, ps, other

    cs.IT

    Constructions of Binary Optimal Locally Repairable Codes via Intersection Subspaces

    Authors: Wenqin Zhang, Deng Tang, Chenhao Ying, Yuan Luo

    Abstract: Locally repairable codes (LRCs), which can recover any symbol of a codeword by reading only a small number of other symbols, have been widely used in real-world distributed storage systems, such as Microsoft Azure Storage and Ceph Storage Cluster. Since binary linear LRCs can significantly reduce coding and decoding complexity, constructions of binary LRCs are of particular interest. The aim of th… ▽ More

    Submitted 15 October, 2023; v1 submitted 24 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in the SCIENCE CHINA Information Sciences

  33. arXiv:2102.12702  [pdf, other

    cs.CL cs.AI

    LazyFormer: Self Attention with Lazy Update

    Authors: Chengxuan Ying, Guolin Ke, Di He, Tie-Yan Liu

    Abstract: Improving the efficiency of Transformer-based language pre-training is an important task in NLP, especially for the self-attention module, which is computationally expensive. In this paper, we propose a simple but effective solution, called \emph{LazyFormer}, which computes the self-attention distribution infrequently. LazyFormer composes of multiple lazy blocks, each of which contains multiple Tr… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

  34. arXiv:2011.00476  [pdf, other

    cs.CL

    Transformer-based Multi-Aspect Modeling for Multi-Aspect Multi-Sentiment Analysis

    Authors: Zhen Wu, Chengcan Ying, Xinyu Dai, Shujian Huang, Jiajun Chen

    Abstract: Aspect-based sentiment analysis (ABSA) aims at analyzing the sentiment of a given aspect in a sentence. Recently, neural network-based methods have achieved promising results in existing ABSA datasets. However, these datasets tend to degenerate to sentence-level sentiment analysis because most sentences contain only one aspect or multiple aspects with the same sentiment polarity. To facilitate the… ▽ More

    Submitted 1 November, 2020; originally announced November 2020.

    Comments: Accepted by NLPCC 2020

  35. arXiv:2011.00474  [pdf, other

    cs.CL

    Opinion Transmission Network for Jointly Improving Aspect-oriented Opinion Words Extraction and Sentiment Classification

    Authors: Chengcan Ying, Zhen Wu, Xinyu Dai, Shujian Huang, Jiajun Chen

    Abstract: Aspect-level sentiment classification (ALSC) and aspect oriented opinion words extraction (AOWE) are two highly relevant aspect-based sentiment analysis (ABSA) subtasks. They respectively aim to detect the sentiment polarity and extract the corresponding opinion words toward a given aspect in a sentence. Previous works separate them and focus on one of them by training neural models on small-scale… ▽ More

    Submitted 1 November, 2020; originally announced November 2020.

    Comments: Accepted by NLPCC 2020

  36. arXiv:2010.04640  [pdf, other

    cs.CL

    Grid Tagging Scheme for Aspect-oriented Fine-grained Opinion Extraction

    Authors: Zhen Wu, Chengcan Ying, Fei Zhao, Zhifang Fan, Xinyu Dai, Rui Xia

    Abstract: Aspect-oriented Fine-grained Opinion Extraction (AFOE) aims at extracting aspect terms and opinion terms from review in the form of opinion pairs or additionally extracting sentiment polarity of aspect term to form opinion triplet. Because of containing several opinion factors, the complete AFOE task is usually divided into multiple subtasks and achieved in the pipeline. However, pipeline approach… ▽ More

    Submitted 3 November, 2020; v1 submitted 9 October, 2020; originally announced October 2020.

    Comments: Accepted by Findings of EMNLP

  37. arXiv:2005.04073  [pdf, other

    cs.LG q-bio.GN stat.ML

    Multi-Instance Multi-Label Learning for Gene Mutation Prediction in Hepatocellular Carcinoma

    Authors: Kaixin Xu, Ziyuan Zhao, Jiapan Gu, Zeng Zeng, Chan Wan Ying, Lim Kheng Choon, Thng Choon Hua, Pierce KH Chow

    Abstract: Gene mutation prediction in hepatocellular carcinoma (HCC) is of great diagnostic and prognostic value for personalized treatments and precision medicine. In this paper, we tackle this problem with multi-instance multi-label learning to address the difficulties on label correlations, label representations, etc. Furthermore, an effective oversampling strategy is applied for data imbalance. Experime… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Accepted version to be published in the 42nd IEEE Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2020, Montreal, Canada

    Journal ref: 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  38. arXiv:2005.04069  [pdf, other

    q-bio.QM cs.CV eess.IV q-bio.GN

    Multi-Phase Cross-modal Learning for Noninvasive Gene Mutation Prediction in Hepatocellular Carcinoma

    Authors: Jiapan Gu, Ziyuan Zhao, Zeng Zeng, Yuzhe Wang, Zhengyiren Qiu, Bharadwaj Veeravalli, Brian Kim Poh Goh, Glenn Kunnath Bonney, Krishnakumar Madhavan, Chan Wan Ying, Lim Kheng Choon, Thng Choon Hua, Pierce KH Chow

    Abstract: Hepatocellular carcinoma (HCC) is the most common type of primary liver cancer and the fourth most common cause of cancer-related death worldwide. Understanding the underlying gene mutations in HCC provides great prognostic value for treatment planning and targeted therapy. Radiogenomics has revealed an association between non-invasive imaging features and molecular genomics. However, imaging feat… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Accepted version to be published in the 42nd IEEE Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2020, Montreal, Canada

    Journal ref: 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  39. arXiv:1911.12732  [pdf, other

    stat.ML cs.LG math.ST

    Distributed estimation of principal support vector machines for sufficient dimension reduction

    Authors: Jun **, Chao Ying, Zhou Yu

    Abstract: The principal support vector machines method (Li et al., 2011) is a powerful tool for sufficient dimension reduction that replaces original predictors with their low-dimensional linear combinations without loss of information. However, the computational burden of the principal support vector machines method constrains its use for massive data. To address this issue, we in this paper propose two di… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

  40. arXiv:1906.02901  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Decompose-and-Integrate Learning for Multi-class Segmentation in Medical Images

    Authors: Yizhe Zhang, Michael T. C. Ying, Danny Z. Chen

    Abstract: Segmentation maps of medical images annotated by medical experts contain rich spatial information. In this paper, we propose to decompose annotation maps to learn disentangled and richer feature transforms for segmentation problems in medical images. Our new scheme consists of two main stages: decompose and integrate. Decompose: by annotation map decomposition, the original segmentation problem is… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

    Comments: To appear in MICCAI 2019

  41. arXiv:1902.09635  [pdf, other

    cs.LG stat.ML

    NAS-Bench-101: Towards Reproducible Neural Architecture Search

    Authors: Chris Ying, Aaron Klein, Esteban Real, Eric Christiansen, Kevin Murphy, Frank Hutter

    Abstract: Recent advances in neural architecture search (NAS) demand tremendous computational resources, which makes it difficult to reproduce experiments and imposes a barrier-to-entry to researchers without access to large-scale computation. We aim to ameliorate these problems by introducing NAS-Bench-101, the first public architecture dataset for NAS research. To build NAS-Bench-101, we carefully constru… ▽ More

    Submitted 14 May, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: Published in the Proceedings of the 36th International Conference on Machine Learning

  42. arXiv:1902.06192  [pdf, ps, other

    cs.DM math.CO

    Enumerating Unique Computational Graphs via an Iterative Graph Invariant

    Authors: Chris Ying

    Abstract: In this report, we describe a novel graph invariant for computational graphs (colored directed acylic graphs) and how we used it to generate all distinct computational graphs up to isomorphism for small graphs. The algorithm iteratively applies isomorphism-invariant operations, which take into account the graph structure and coloring, and outputs a fixed-length hash that is identical for all isomo… ▽ More

    Submitted 16 February, 2019; originally announced February 2019.

  43. arXiv:1901.08256  [pdf, other

    cs.LG stat.ML

    Large-Batch Training for LSTM and Beyond

    Authors: Yang You, Jonathan Hseu, Chris Ying, James Demmel, Kurt Keutzer, Cho-Jui Hsieh

    Abstract: Large-batch training approaches have enabled researchers to utilize large-scale distributed processing and greatly accelerate deep-neural net (DNN) training. For example, by scaling the batch size from 256 to 32K, researchers have been able to reduce the training time of ResNet50 on ImageNet from 29 hours to 2.2 minutes (Ying et al., 2018). In this paper, we propose a new approach called linear-ep… ▽ More

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: Preprint. Work in progress. We may update this draft recently

  44. arXiv:1811.06992  [pdf, other

    cs.LG cs.DC stat.ML

    Image Classification at Supercomputer Scale

    Authors: Chris Ying, Sameer Kumar, Dehao Chen, Tao Wang, Youlong Cheng

    Abstract: Deep learning is extremely computationally intensive, and hardware vendors have responded by building faster accelerators in large clusters. Training deep learning models at petaFLOPS scale requires overcoming both algorithmic and systems software challenges. In this paper, we discuss three systems-related optimizations: (1) distributed batch normalization to control per-replica batch sizes, (2) i… ▽ More

    Submitted 1 December, 2018; v1 submitted 16 November, 2018; originally announced November 2018.

    Comments: Presented as part of Systems for ML Workshop @ NIPS 2018

  45. arXiv:1806.00593  [pdf, other

    cs.CV

    BoxNet: Deep Learning Based Biomedical Image Segmentation Using Boxes Only Annotation

    Authors: Lin Yang, Yizhe Zhang, Zhuo Zhao, Hao Zheng, Peixian Liang, Michael T. C. Ying, Anil T. Ahuja, Danny Z. Chen

    Abstract: In recent years, deep learning (DL) methods have become powerful tools for biomedical image segmentation. However, high annotation efforts and costs are commonly needed to acquire sufficient biomedical training data for DL models. To alleviate the burden of manual annotation, in this paper, we propose a new weakly supervised DL approach for biomedical image segmentation using boxes only annotation… ▽ More

    Submitted 2 June, 2018; originally announced June 2018.

  46. arXiv:1801.00508  [pdf, other

    cs.CV

    Depth-Adaptive Computational Policies for Efficient Visual Tracking

    Authors: Chris Ying, Katerina Fragkiadaki

    Abstract: Current convolutional neural networks algorithms for video object tracking spend the same amount of computation for each object and video frame. However, it is harder to track an object in some frames than others, due to the varying amount of clutter, scene complexity, amount of motion, and object's distinctiveness against its background. We propose a depth-adaptive convolutional Siamese network t… ▽ More

    Submitted 1 January, 2018; originally announced January 2018.

    Comments: presented at EMMCVPR 2017 in Venice, Italy

  47. arXiv:1711.00489  [pdf, other

    cs.LG cs.CV cs.DC stat.ML

    Don't Decay the Learning Rate, Increase the Batch Size

    Authors: Samuel L. Smith, Pieter-Jan Kindermans, Chris Ying, Quoc V. Le

    Abstract: It is common practice to decay the learning rate. Here we show one can usually obtain the same learning curve on both training and test sets by instead increasing the batch size during training. This procedure is successful for stochastic gradient descent (SGD), SGD with momentum, Nesterov momentum, and Adam. It reaches equivalent test accuracies after the same number of training epochs, but with… ▽ More

    Submitted 23 February, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: 11 pages, 8 figures. Published as a conference paper at ICLR 2018