Skip to main content

Showing 1–50 of 68 results for author: Gong, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19226  [pdf, other

    cs.CL cs.HC

    Simulating Classroom Education with LLM-Empowered Agents

    Authors: Zheyuan Zhang, Daniel Zhang-Li, Jifan Yu, Linlu Gong, **chang Zhou, Zhiyuan Liu, Lei Hou, Juanzi Li

    Abstract: Large language models (LLMs) have been employed in various intelligent educational tasks to assist teaching. While preliminary explorations have focused on independent LLM-empowered agents for specific educational tasks, the potential for LLMs within a multi-agent collaborative framework to simulate a classroom with real user participation remains unexplored. In this work, we propose SimClass, a m… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2404.15638  [pdf, other

    cs.CV cs.AI

    PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing

    Authors: Yutong Chen, Zhang Wen, Chao Wang, Lei Gong, Zhongchao Yi

    Abstract: Hazy images degrade visual quality, and dehazing is a crucial prerequisite for subsequent processing tasks. Most current dehazing methods rely on neural networks and face challenges such as high computational parameter pressure and weak generalization capabilities. This paper introduces PriorNet--a novel, lightweight, and highly applicable dehazing network designed to significantly improve the cla… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 8 pages, 4 figures

  3. FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation

    Authors: Hanfang Lyu, Yuanchen Bai, Xin Liang, Ujaan Das, Chuhan Shi, Leiliang Gong, Yingchi Li, Mingfei Sun, Ming Ge, Xiaojuan Ma

    Abstract: Preference-based learning aims to align robot task objectives with human values. One of the most common methods to infer human preferences is by pairwise comparisons of robot task trajectories. Traditional comparison-based preference labeling systems seldom support labelers to digest and identify critical differences between complex trajectories recorded in videos. Our formative study (N = 12) sug… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Accepted to ACM Conference on Intelligent User Interfaces (IUI) 2024, March 18-21, 2024, Greenville, SC, USA

  4. arXiv:2403.04814  [pdf, other

    cs.CL cs.AI cs.LG cs.SE

    Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks

    Authors: Linyuan Gong, Sida Wang, Mostafa Elhoushi, Alvin Cheung

    Abstract: We introduce Syntax-Aware Fill-In-the-Middle (SAFIM), a new benchmark for evaluating Large Language Models (LLMs) on the code Fill-in-the-Middle (FIM) task. This benchmark focuses on syntax-aware completions of program structures such as code blocks and conditional expressions, and includes 17,720 examples from multiple programming languages, sourced from recent code submissions after April 2022 t… ▽ More

    Submitted 22 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 22 pages; ICML 2024 Oral: https://icml.cc/virtual/2024/oral/35482

  5. arXiv:2403.02827  [pdf, other

    cs.CV

    Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation

    Authors: Weijie Li, Litong Gong, Yiran Zhu, Fanda Fan, Biao Wang, Tiezheng Ge, Bo Zheng

    Abstract: Image-to-video (I2V) generation tasks always suffer from kee** high fidelity in the open domains. Traditional image animation techniques primarily focus on specific domains such as faces or human poses, making them difficult to generalize to open domains. Several recent I2V frameworks based on diffusion models can generate dynamic content for open domain images but fail to maintain fidelity. We… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  6. arXiv:2403.01800  [pdf, other

    cs.CV

    AtomoVideo: High Fidelity Image-to-Video Generation

    Authors: Litong Gong, Yiran Zhu, Weijie Li, Xiaoyang Kang, Biao Wang, Tiezheng Ge, Bo Zheng

    Abstract: Recently, video generation has achieved significant rapid development based on superior text-to-image generation techniques. In this work, we propose a high fidelity framework for image-to-video generation, named AtomoVideo. Based on multi-granularity image injection, we achieve higher fidelity of the generated video to the given image. In addition, thanks to high quality datasets and training str… ▽ More

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Technical report. Page: https://atomo-video.github.io/

  7. arXiv:2402.02012  [pdf, other

    cs.CV

    Precise Knowledge Transfer via Flow Matching

    Authors: Shitong Shao, Zhiqiang Shen, Linrui Gong, Huanran Chen, Xu Dai

    Abstract: In this paper, we propose a novel knowledge transfer framework that introduces continuous normalizing flows for progressive knowledge transformation and leverages multi-step sampling strategies to achieve precision knowledge transfer. We name this framework Knowledge Transfer with Flow Matching (FM-KT), which can be integrated with a metric-based distillation method with any form (\textit{e.g.} va… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  8. arXiv:2401.11824  [pdf, other

    cs.CV

    Rethinking Centered Kernel Alignment in Knowledge Distillation

    Authors: Zikai Zhou, Yunhang Shen, Shitong Shao, Linrui Gong, Shaohui Lin

    Abstract: Knowledge distillation has emerged as a highly effective method for bridging the representation discrepancy between large-scale models and lightweight models. Prevalent approaches involve leveraging appropriate metrics to minimize the divergence or distance between the knowledge extracted from the teacher model and the knowledge learned by the student model. Centered Kernel Alignment (CKA) is wide… ▽ More

    Submitted 30 April, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  9. arXiv:2401.03003  [pdf, other

    cs.SE cs.CL cs.LG

    AST-T5: Structure-Aware Pretraining for Code Generation and Understanding

    Authors: Linyuan Gong, Mostafa Elhoushi, Alvin Cheung

    Abstract: Large language models (LLMs) have made significant advancements in code-related tasks, yet many LLMs treat code as simple sequences, neglecting its structured nature. We introduce AST-T5, a novel pretraining paradigm that leverages the Abstract Syntax Tree (AST) for enhanced code generation, transpilation, and understanding. Using dynamic programming, our AST-Aware Segmentation retains code struct… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 15 pages; ICML 2024: https://icml.cc/virtual/2024/poster/33601

  10. arXiv:2312.14896  [pdf, other

    cs.NE eess.SY math.DS q-bio.NC

    Strong anti-Hebbian plasticity alters the convexity of network attractor landscapes

    Authors: Lulu Gong, Xudong Chen, ShiNung Ching

    Abstract: In this paper, we study recurrent neural networks in the presence of pairwise learning rules. We are specifically interested in how the attractor landscapes of such networks become altered as a function of the strength and nature (Hebbian vs. anti-Hebbian) of learning, which may have a bearing on the ability of such rules to mediate large-scale optimization problems. Through formal analysis, we sh… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures

  11. arXiv:2311.08066  [pdf, other

    cs.SE

    How to get better embeddings with code pre-trained models? An empirical study

    Authors: Yu Zhao, Lina Gong, Haoxiang Zhang, Yaoshen Yu, Zhiqiu Huang

    Abstract: Pre-trained language models have demonstrated powerful capabilities in the field of natural language processing (NLP). Recently, code pre-trained model (PTM), which draw from the experiences of the NLP field, have also achieved state-of-the-art results in many software engineering (SE) downstream tasks. These code PTMs take into account the differences between programming languages and natural lan… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  12. arXiv:2311.03508  [pdf, other

    q-bio.NC cs.AI

    Astrocytes as a mechanism for meta-plasticity and contextually-guided network function

    Authors: Lulu Gong, Fabio Pasqualetti, Thomas Papouin, ShiNung Ching

    Abstract: Astrocytes are a ubiquitous and enigmatic type of non-neuronal cell and are found in the brain of all vertebrates. While traditionally viewed as being supportive of neurons, it is increasingly recognized that astrocytes may play a more direct and active role in brain function and neural computation. On account of their sensitivity to a host of physiological covariates and ability to modulate neuro… ▽ More

    Submitted 10 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 42 pages, 14 figures

  13. arXiv:2310.15985  [pdf, other

    cs.CV

    Vision-Language Pseudo-Labels for Single-Positive Multi-Label Learning

    Authors: Xin Xing, Zhexiao Xiong, Abby Stylianou, Srikumar Sastry, Liyu Gong, Nathan Jacobs

    Abstract: This paper presents a novel approach to Single-Positive Multi-label Learning. In general multi-label learning, a model learns to predict multiple labels or categories for a single input image. This is in contrast with standard multi-class image classification, where the task is predicting a single label from many possible labels for an image. Single-Positive Multi-label Learning (SPML) specificall… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  14. arXiv:2310.05136  [pdf, other

    cs.AI cs.CV

    InstructDET: Diversifying Referring Object Detection with Generalized Instructions

    Authors: Ronghao Dang, Jiangyan Feng, Haodong Zhang, Chongjian Ge, Lin Song, Lijun Gong, Chengju Liu, Qijun Chen, Feng Zhu, Rui Zhao, Yibing Song

    Abstract: We propose InstructDET, a data-centric method for referring object detection (ROD) that localizes target objects based on user instructions. While deriving from referring expressions (REC), the instructions we leverage are greatly diversified to encompass common user intentions related to object detection. For one image, we produce tremendous instructions that refer to every single object and diff… ▽ More

    Submitted 11 March, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: 29 pages (include Appendix) Published in ICLR

  15. arXiv:2310.03379  [pdf, other

    cs.RO

    Safe Reinforcement Learning via Hierarchical Adaptive Chance-Constraint Safeguards

    Authors: Zhaorun Chen, Zhuokai Zhao, Tairan He, Binhao Chen, Xuhao Zhao, Liang Gong, Chengliang Liu

    Abstract: Ensuring safety in Reinforcement Learning (RL), typically framed as a Constrained Markov Decision Process (CMDP), is crucial for real-world exploration applications. Current approaches in handling CMDP struggle to balance optimality and feasibility, as direct optimization methods cannot ensure state-wise in-training safety, and projection-based methods correct actions inefficiently through lengthy… ▽ More

    Submitted 6 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: 8 pages, 6 figures, 3 tables

  16. arXiv:2309.02119  [pdf, other

    cs.CV

    Hierarchical Masked 3D Diffusion Model for Video Outpainting

    Authors: Fanda Fan, Chaoxu Guo, Litong Gong, Biao Wang, Tiezheng Ge, Yuning Jiang, Chunjie Luo, Jianfeng Zhan

    Abstract: Video outpainting aims to adequately complete missing areas at the edges of video frames. Compared to image outpainting, it presents an additional challenge as the model should maintain the temporal consistency of the filled area. In this paper, we introduce a masked 3D diffusion model for video outpainting. We use the technique of mask modeling to train the 3D diffusion model. This allows us to u… ▽ More

    Submitted 19 January, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted to ACM MM 2023

  17. arXiv:2308.12481  [pdf

    eess.SP cs.LG

    Fall Detection using Knowledge Distillation Based Long short-term memory for Offline Embedded and Low Power Devices

    Authors: Hannah Zhou, Allison Chen, Celine Buer, Emily Chen, Kayleen Tang, Lauryn Gong, Zhiqi Liu, Jianbin Tang

    Abstract: This paper presents a cost-effective, low-power approach to unintentional fall detection using knowledge distillation-based LSTM (Long Short-Term Memory) models to significantly improve accuracy. With a primary focus on analyzing time-series data collected from various sensors, the solution offers real-time detection capabilities, ensuring prompt and reliable identification of falls. The authors i… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: 4 pages

  18. arXiv:2307.15339  [pdf, other

    cs.IT cs.CV cs.LG

    The Radon Signed Cumulative Distribution Transform and its applications in classification of Signed Images

    Authors: Le Gong, Shiying Li, Naqib Sad Pathan, Mohammad Shifat-E-Rabbi, Gustavo K. Rohde, Abu Hasnat Mohammad Rubaiyat, Sumati Thareja

    Abstract: Here we describe a new image representation technique based on the mathematics of transport and optimal transport. The method relies on the combination of the well-known Radon transform for images and a recent signal representation method called the Signed Cumulative Distribution Transform. The newly proposed method generalizes previous transport-related image representation methods to arbitrary f… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    MSC Class: 65R10; 68U10; 68T45

  19. arXiv:2306.09296  [pdf, other

    cs.CL

    KoLA: Carefully Benchmarking World Knowledge of Large Language Models

    Authors: Jifan Yu, Xiaozhi Wang, Shangqing Tu, Shulin Cao, Daniel Zhang-Li, Xin Lv, Hao Peng, Zijun Yao, Xiaohan Zhang, Hanming Li, Chunyang Li, Zheyuan Zhang, Yushi Bai, Yantao Liu, Amy Xin, Nianyi Lin, Kaifeng Yun, Linlu Gong, Jianhui Chen, Zhili Wu, Yunjia Qi, Weikai Li, Yong Guan, Kaisheng Zeng, Ji Qi , et al. (10 additional authors not shown)

    Abstract: The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations. Rather than merely exploring the breadth of LLM abilities, we believe meticulous and thoughtful designs are essential to thorough, unbiased, and applicable evaluations. Given the importance of world knowledge to LLMs, we construct a Knowledge-oriented LLM Assessment benchmark (KoLA), in which we… ▽ More

    Submitted 30 June, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted by ICLR 2024

  20. Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers

    Authors: Linyuan Gong, Chenyan Xiong, Xiaodong Liu, Payal Bajaj, Yiqing Xie, Alvin Cheung, Jianfeng Gao, Xia Song

    Abstract: This paper explores the effectiveness of model-generated signals in improving zero-shot generalization of text-to-text Transformers such as T5. We study various designs to pretrain T5 using an auxiliary model to construct more challenging token replacements for the main model to denoise. Key aspects under study include the decoding target, the location of the RTD head, and the masking pattern. Bas… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper at ACL 2023. 9 pages

  21. arXiv:2303.17766  [pdf, other

    cs.CV

    Joint Depth Estimation and Mixture of Rain Removal From a Single Image

    Authors: Yongzhen Wang, Xuefeng Yan, Yanbiao Niu, Lina Gong, Yanwen Guo, Mingqiang Wei

    Abstract: Rainy weather significantly deteriorates the visibility of scene objects, particularly when images are captured through outdoor camera lenses or windshields. Through careful observation of numerous rainy photos, we have found that the images are generally affected by various rainwater artifacts such as raindrops, rain streaks, and rainy haze, which impact the image quality from both near and far d… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 11 pages, 7 figures, 5 tables

  22. arXiv:2303.03593  [pdf, other

    cs.CL cs.LG

    ADELT: Transpilation Between Deep Learning Frameworks

    Authors: Linyuan Gong, Jiayi Wang, Alvin Cheung

    Abstract: We propose the Adversarial DEep Learning Transpiler (ADELT), a novel approach to source-to-source transpilation between deep learning frameworks. ADELT uniquely decouples code skeleton transpilation and API keyword map**. For code skeleton transpilation, it uses few-shot prompting on large language models (LLMs), while for API keyword map**, it uses contextual embeddings from a code-specific B… ▽ More

    Submitted 8 May, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 19 pages, to be published in the main track of IJCAI 2024

  23. arXiv:2301.12738  [pdf, other

    cs.RO

    FLYOVER: A Model-Driven Method to Generate Diverse Highway Interchanges for Autonomous Vehicle Testing

    Authors: Yuan Zhou, Gengjie Lin, Yun Tang, Kairui Yang, Wei **g, ** Zhang, Junbo Chen, Liang Gong, Yang Liu

    Abstract: It has become a consensus that autonomous vehicles (AVs) will first be widely deployed on highways. However, the complexity of highway interchanges becomes the bottleneck for deploying AVs. An AV should be sufficiently tested under different highway interchanges, which is still challenging due to the lack of available datasets containing diverse highway interchanges. In this paper, we propose a mo… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted by ICRA 2023

  24. arXiv:2301.11119  [pdf, other

    cs.CR

    Robust multi-party semi-quantum private comparison protocols with decoherence-free states against collective noises

    Authors: Lihua Gong, Zhenyong Chen, Liguo Qin, Jiehui Huang

    Abstract: Based on decoherence-free states, two multi-party semi-quantum private comparison protocols are proposed to counteract collective noises. One could resist the collective-dephasing noise well, whereas the other could resist the collective-rotation noise. Multiple classical participants could compare their secret information by performing the proposed protocols once. It is manifested that the propos… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  25. arXiv:2212.05422  [pdf, other

    cs.CV

    Teaching What You Should Teach: A Data-Based Distillation Method

    Authors: Shitong Shao, Huanran Chen, Zhen Huang, Linrui Gong, Shuai Wang, Xinxiao Wu

    Abstract: In real teaching scenarios, an excellent teacher always teaches what he (or she) is good at but the student is not. This gives the student the best assistance in making up for his (or her) weaknesses and becoming a good one overall. Enlightened by this, we introduce the "Teaching what you Should Teach" strategy into a knowledge distillation framework, and propose a data-based distillation method n… ▽ More

    Submitted 20 May, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

    Comments: 13 pages, 4 figures Accepted by IJCAI2023

  26. arXiv:2211.09518  [pdf, other

    cs.CV

    ImLiDAR: Cross-Sensor Dynamic Message Propagation Network for 3D Object Detection

    Authors: Yiyang Shen, Rongwei Yu, Peng Wu, Haoran Xie, Lina Gong, **g Qin, Mingqiang Wei

    Abstract: LiDAR and camera, as two different sensors, supply geometric (point clouds) and semantic (RGB images) information of 3D scenes. However, it is still challenging for existing methods to fuse data from the two cross sensors, making them complementary for quality 3D object detection (3OD). We propose ImLiDAR, a new 3OD paradigm to narrow the cross-sensor discrepancies by progressively fusing the mult… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 12 pages

  27. arXiv:2211.01664  [pdf, other

    cs.CV

    PointSee: Image Enhances Point Cloud

    Authors: Lipeng Gu, Xuefeng Yan, Peng Cui, Lina Gong, Haoran Xie, Fu Lee Wang, ** Qin, Mingqiang Wei

    Abstract: There is a trend to fuse multi-modal information for 3D object detection (3OD). However, the challenging problems of low lightweightness, poor flexibility of plug-and-play, and inaccurate alignment of features are still not well-solved, when designing multi-modal fusion newtorks. We propose PointSee, a lightweight, flexible and effective multi-modal fusion solution to facilitate various 3OD networ… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  28. arXiv:2210.06003  [pdf, other

    cs.RO

    A Complementary Framework for Human-Robot Collaboration with a Mixed AR-Haptic Interface

    Authors: Xiangjie Yan, Yongpeng Jiang, Chen Chen, Leiliang Gong, Ming Ge, Tao Zhang, Xiang Li

    Abstract: There is invariably a trade-off between safety and efficiency for collaborative robots (cobots) in human-robot collaborations. Robots that interact minimally with humans can work with high speed and accuracy but cannot adapt to new tasks or respond to unforeseen changes, whereas robots that work closely with humans can but only by becoming passive to humans, meaning that their main tasks suspended… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  29. arXiv:2209.08721  [pdf, other

    cs.CL cs.AI cs.LG

    Joint Language Semantic and Structure Embedding for Knowledge Graph Completion

    Authors: Jianhao Shen, Chenguang Wang, Linyuan Gong, Dawn Song

    Abstract: The task of completing knowledge triplets has broad downstream applications. Both structural and semantic information plays an important role in knowledge graph completion. Unlike previous approaches that rely on either the structures or semantics of the knowledge graphs, we propose to jointly embed the semantics in the natural language description of the knowledge triplets with their structure in… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: COLING 2022

  30. arXiv:2209.01373  [pdf, other

    cs.CV

    TogetherNet: Bridging Image Restoration and Object Detection Together via Dynamic Enhancement Learning

    Authors: Yongzhen Wang, Xuefeng Yan, Kaiwen Zhang, Lina Gong, Haoran Xie, Fu Lee Wang, Mingqiang Wei

    Abstract: Adverse weather conditions such as haze, rain, and snow often impair the quality of captured images, causing detection networks trained on normal images to generalize poorly in these scenarios. In this paper, we raise an intriguing question - if the combination of image restoration and object detection, can boost the performance of cutting-edge detectors in adverse weather conditions. To answer it… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 12 pages, 9 figures

  31. arXiv:2209.00977  [pdf, other

    cs.CV

    Contrastive Semantic-Guided Image Smoothing Network

    Authors: Jie Wang, Yongzhen Wang, Yidan Feng, Lina Gong, Xuefeng Yan, Haoran Xie, Fu Lee Wang, Mingqiang Wei

    Abstract: Image smoothing is a fundamental low-level vision task that aims to preserve salient structures of an image while removing insignificant details. Deep learning has been explored in image smoothing to deal with the complex entanglement of semantic structures and trivial details. However, current methods neglect two important facts in smoothing: 1) naive pixel-level regression supervised by the limi… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

  32. arXiv:2208.02712  [pdf, other

    cs.CV

    UTOPIC: Uncertainty-aware Overlap Prediction Network for Partial Point Cloud Registration

    Authors: Zhilei Chen, Honghua Chen, Lina Gong, Xuefeng Yan, Jun Wang, Yanwen Guo, **g Qin, Mingqiang Wei

    Abstract: High-confidence overlap prediction and accurate correspondences are critical for cutting-edge models to align paired point clouds in a partial-to-partial manner. However, there inherently exists uncertainty between the overlap** and non-overlap** regions, which has always been neglected and significantly affects the registration performance. Beyond the current wisdom, we propose a novel uncert… ▽ More

    Submitted 1 September, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: Accept to Pacific Graphics 2022

  33. Revisiting the Impact of Dependency Network Metrics on Software Defect Prediction

    Authors: Lina Gong, Gopi Krishnan Rajbahadur, Ahmed E. Hassan, Shujuan Jiang

    Abstract: Software dependency network metrics extracted from the dependency graph of the software modules by the application of Social Network Analysis (SNA metrics) have been shown to improve the performance of the Software Defect prediction (SDP) models. However, the relative effectiveness of these SNA metrics over code metrics in improving the performance of the SDP models has been widely debated with no… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

  34. arXiv:2109.13005  [pdf

    cs.LG cs.HC cs.RO eess.SY

    Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration

    Authors: Zhaorun Chen, Binhao Chen, Shenghan Xie, Liang Gong, Chengliang Liu, Zhengfeng Zhang, Jun** Zhang

    Abstract: In complex environments with high dimension, training a reinforcement learning (RL) model from scratch often suffers from lengthy and tedious collection of agent-environment interactions. Instead, leveraging expert demonstration to guide RL agent can boost sample efficiency and improve final convergence. In order to better integrate expert prior with on-policy RL models, we propose a generic frame… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: This paper is accepted at The 3rd International Symposium on Robotics & Intelligent Manufacturing Technology (ISRIMT 2021) (https://conferences.ieee.org/conferences_events/conferences/conferencedetails/53730) and nominated as the "best paper"

  35. arXiv:2109.08642  [pdf, other

    cs.RO cs.AI

    POAR: Efficient Policy Optimization via Online Abstract State Representation Learning

    Authors: Zhaorun Chen, Siqi Fan, Yuan Tan, Liang Gong, Binhao Chen, Te Sun, David Filliat, Natalia Díaz-Rodríguez, Chengliang Liu

    Abstract: While the rapid progress of deep learning fuels end-to-end reinforcement learning (RL), direct application, especially in high-dimensional space like robotic scenarios still suffers from low sample efficiency. Therefore State Representation Learning (SRL) is proposed to specifically learn to encode task-relevant features from complex sensory data into low-dimensional states. However, the pervasive… ▽ More

    Submitted 9 December, 2023; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: 19 pages

  36. arXiv:2103.05864  [pdf, ps, other

    cs.DB

    MP-RW-LSH: An Efficient Multi-Probe LSH Solution to ANNS in $L_1$ Distance

    Authors: Huayi Wang, **gfan Meng, Long Gong, Jun Xu, Mitsunori Ogihara

    Abstract: Approximate Nearest Neighbor Search (ANNS) is a fundamental algorithmic problem, with numerous applications in many areas of computer science. Locality-sensitive hashing (LSH) is one of the most popular solution approaches for ANNS. A common shortcoming of many LSH schemes is that since they probe only a single bucket in a hash table, they need to use a large number of hash tables to achieve a hig… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

  37. arXiv:2103.05232  [pdf, other

    cs.CV cs.CR cs.LG eess.IV

    Stabilized Medical Image Attacks

    Authors: Gege Qi, Lijun Gong, Yibing Song, Kai Ma, Yefeng Zheng

    Abstract: Convolutional Neural Networks (CNNs) have advanced existing medical systems for automatic disease diagnosis. However, a threat to these systems arises that adversarial attacks make CNNs vulnerable. Inaccurate diagnosis results make a negative influence on human healthcare. There is a need to investigate potential adversarial attacks to robustify deep medical diagnosis systems. On the other side, t… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: ICLR 2021 (Spotlight)

  38. arXiv:2102.11495  [pdf, other

    cs.LG

    Anytime Sampling for Autoregressive Models via Ordered Autoencoding

    Authors: Yilun Xu, Yang Song, Sahaj Garg, Linyuan Gong, Rui Shu, Aditya Grover, Stefano Ermon

    Abstract: Autoregressive models are widely used for tasks such as image and audio generation. The sampling process of these models, however, does not allow interruptions and cannot adapt to real-time computational resources. This challenge impedes the deployment of powerful autoregressive models, which involve a slow sampling process that is sequential in nature and typically scales linearly with respect to… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: Accepted by ICLR 2021

  39. arXiv:2010.08620  [pdf, other

    cs.NI

    Sliding-Window QPS (SW-QPS): A Perfect Parallel Iterative Switching Algorithm for Input-Queued Switches

    Authors: **gfan Meng, Long Gong, Jun, Xu

    Abstract: In this work, we first propose a parallel batch switching algorithm called Small-Batch Queue-Proportional Sampling (SB-QPS). Compared to other batch switching algorithms, SB-QPS significantly reduces the batch size without sacrificing the throughput performance and hence has much lower delay when traffic load is light to moderate. It also achieves the lowest possible time complexity of $O(1)$ per… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 8 pages, 5 figures, to be published in ACM Performance Evaluation Review (PER)

  40. arXiv:2007.14569  [pdf, ps, other

    cs.DB cs.DS

    Space- and Computationally-Efficient Set Reconciliation via Parity Bitmap Sketch (PBS)

    Authors: Long Gong, Ziheng Liu, Liang Liu, Jun Xu, Mitsunori Ogihara, Tong Yang

    Abstract: Set reconciliation is a fundamental algorithmic problem that arises in many networking, system, and database applications. In this problem, two large sets A and B of objects (bitcoins, files, records, etc.) are stored respectively at two different network-connected hosts, which we name Alice and Bob respectively. Alice and Bob communicate with each other to learn $AΔB$, the difference between A an… ▽ More

    Submitted 15 August, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

  41. arXiv:2007.09979  [pdf, other

    cs.CV

    Distractor-Aware Neuron Intrinsic Learning for Generic 2D Medical Image Classifications

    Authors: Lijun Gong, Kai Ma, Yefeng Zheng

    Abstract: Medical image analysis benefits Computer Aided Diagnosis (CADx). A fundamental analyzing approach is the classification of medical images, which serves for skin lesion diagnosis, diabetic retinopathy grading, and cancer classification on histological images. When learning these discriminative classifiers, we observe that the convolutional neural networks (CNNs) are vulnerable to distractor interfe… ▽ More

    Submitted 21 July, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: MICCAI2020

  42. arXiv:2007.09968  [pdf, other

    cs.CV

    GREEN: a Graph REsidual rE-ranking Network for Grading Diabetic Retinopathy

    Authors: Shaoteng Liu, Lijun Gong, Kai Ma, Yefeng Zheng

    Abstract: The automatic grading of diabetic retinopathy (DR) facilitates medical diagnosis for both patients and physicians. Existing researches formulate DR grading as an image classification problem. As the stages/categories of DR correlate with each other, the relationship between different classes cannot be explicitly described via a one-hot label because it is empirically estimated by different physici… ▽ More

    Submitted 21 July, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: MICCAI2020

  43. arXiv:2006.05744  [pdf, other

    cs.CL cs.LG

    MC-BERT: Efficient Language Pre-Training via a Meta Controller

    Authors: Zhenhui Xu, Linyuan Gong, Guolin Ke, Di He, Shuxin Zheng, Liwei Wang, Jiang Bian, Tie-Yan Liu

    Abstract: Pre-trained contextual representations (e.g., BERT) have become the foundation to achieve state-of-the-art results on many NLP tasks. However, large-scale pre-training is computationally expensive. ELECTRA, an early attempt to accelerate pre-training, trains a discriminative model that predicts whether each input token was replaced by a generator. Our studies reveal that ELECTRA's success is mainl… ▽ More

    Submitted 16 June, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

  44. arXiv:2004.05523  [pdf, other

    cs.CR cs.CV eess.IV

    DeepEDN: A Deep Learning-based Image Encryption and Decryption Network for Internet of Medical Things

    Authors: Yi Ding, Guozheng Wu, Dajiang Chen, Ning Zhang, Linpeng Gong, Mingsheng Cao, Zhiguang Qin

    Abstract: Internet of Medical Things (IoMT) can connect many medical imaging equipments to the medical information network to facilitate the process of diagnosing and treating for doctors. As medical image contains sensitive information, it is of importance yet very challenging to safeguard the privacy or security of the patient. In this work, a deep learning based encryption and decryption network (DeepEDN… ▽ More

    Submitted 5 May, 2020; v1 submitted 11 April, 2020; originally announced April 2020.

    Journal ref: IEEE Internet of Things Journal,2020

  45. arXiv:2003.09405  [pdf, other

    cs.CV

    Explainable Object-induced Action Decision for Autonomous Vehicles

    Authors: Yiran Xu, Xiaoyin Yang, Lihang Gong, Hsuan-Chu Lin, Tz-Ying Wu, Yunsheng Li, Nuno Vasconcelos

    Abstract: A new paradigm is proposed for autonomous driving. The new paradigm lies between the end-to-end and pipelined approaches, and is inspired by how humans solve the problem. While it relies on scene understanding, the latter only considers objects that could originate hazard. These are denoted as action-inducing, since changes in their state should trigger vehicle actions. They also define a set of e… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

  46. arXiv:2002.12089  [pdf, other

    cs.RO

    Exploration-efficient Deep Reinforcement Learning with Demonstration Guidance for Robot Control

    Authors: Ke Lin, Liang Gong, Xudong Li, Te Sun, Binhao Chen, Chengliang Liu, Zhengfeng Zhang, Jian Pu, Jun** Zhang

    Abstract: Although deep reinforcement learning (DRL) algorithms have made important achievements in many control tasks, they still suffer from the problems of sample inefficiency and unstable training process, which are usually caused by sparse rewards. Recently, some reinforcement learning from demonstration (RLfD) methods have shown to be promising in overcoming these problems. However, they usually requi… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

  47. JNET: Learning User Representations via Joint Network Embedding and Topic Embedding

    Authors: Lin Gong, Lu Lin, Weihao Song, Hongning Wang

    Abstract: User representation learning is vital to capture diverse user preferences, while it is also challenging as user intents are latent and scattered among complex and different modalities of user-generated data, thus, not directly measurable. Inspired by the concept of user schema in social psychology, we take a new perspective to perform user representation learning by constructing a shared latent sp… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

  48. arXiv:1911.06191  [pdf, other

    cs.CL cs.LG stat.ML

    Microsoft Research Asia's Systems for WMT19

    Authors: Yingce Xia, Xu Tan, Fei Tian, Fei Gao, Weicong Chen, Yang Fan, Linyuan Gong, Yichong Leng, Renqian Luo, Yiren Wang, Lijun Wu, **hua Zhu, Tao Qin, Tie-Yan Liu

    Abstract: We Microsoft Research Asia made submissions to 11 language directions in the WMT19 news translation tasks. We won the first place for 8 of the 11 directions and the second place for the other three. Our basic systems are built on Transformer, back translation and knowledge distillation. We integrate several of our rececent techniques to enhance the baseline systems: multi-agent dual learning (MADL… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: Accepted to "Fourth Conference on Machine Translation (WMT19)"

  49. arXiv:1907.03958  [pdf, other

    cs.CV

    Attentive CT Lesion Detection Using Deep Pyramid Inference with Multi-Scale Booster

    Authors: Qingbin Shao, Lijun Gong, Kai Ma, Hualuo Liu, Yefeng Zheng

    Abstract: Accurate lesion detection in computer tomography (CT) slices benefits pathologic organ analysis in the medical diagnosis process. More recently, it has been tackled as an object detection problem using the Convolutional Neural Networks (CNNs). Despite the achievements from off-the-shelf CNN models, the current detection accuracy is limited by the inability of CNNs on lesions at vastly different sc… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

  50. arXiv:1905.05392  [pdf, ps, other

    cs.PF

    QPS-r: A Cost-Effective Crossbar Scheduling Algorithm and Its Stability and Delay Analysis

    Authors: Long Gong, Jun Xu, Liang Liu, Siva Theja Maguluri

    Abstract: In an input-queued switch, a crossbar schedule, or a matching between the input ports and the output ports needs to be computed in each switching cycle, or time slot. Designing switching algorithms with very low computational complexity, that lead to high throughput and small delay is a challenging problem. There appears to be a fundamental tradeoff between the computational complexity of the swit… ▽ More

    Submitted 10 September, 2020; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: 10 pages