Skip to main content

Showing 1–50 of 100 results for author: Fu, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14088  [pdf, other

    cs.DC cs.AI cs.CL cs.LG

    ReaLHF: Optimized RLHF Training for Large Language Models through Parameter Reallocation

    Authors: Zhiyu Mei, Wei Fu, Kaiwei Li, Guangju Wang, Huanchen Zhang, Yi Wu

    Abstract: Reinforcement Learning from Human Feedback (RLHF) stands as a pivotal technique in empowering large language model (LLM) applications. Since RLHF involves diverse computational workloads and intricate dependencies among multiple LLMs, directly adopting parallelization techniques from supervised training can result in sub-optimal performance. To overcome this limitation, we propose a novel approach… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 13 pages (15 pages with references), 13 figures

  2. arXiv:2406.05707  [pdf, other

    cs.CL cs.AI

    QGEval: A Benchmark for Question Generation Evaluation

    Authors: Wei** Fu, Bifan Wei, Jianxiang Hu, Zhongmin Cai, Jun Liu

    Abstract: Automatically generated questions often suffer from problems such as unclear expression or factual inaccuracies, requiring a reliable and comprehensive evaluation of their quality. Human evaluation is frequently used in the field of question generation (QG) and is one of the most accurate evaluation methods. It also serves as the standard for automatic metrics. However, there is a lack of unified… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2406.03065  [pdf, other

    cs.LG cs.CV

    Decision Boundary-aware Knowledge Consolidation Generates Better Instance-Incremental Learner

    Authors: Qiang Nie, Weifu Fu, Yuhuan Lin, Jialin Li, Yifeng Zhou, Yong Liu, Lei Zhu, Chengjie Wang

    Abstract: Instance-incremental learning (IIL) focuses on learning continually with data of the same classes. Compared to class-incremental learning (CIL), the IIL is seldom explored because IIL suffers less from catastrophic forgetting (CF). However, besides retaining knowledge, in real-world deployment scenarios where the class space is always predefined, continual and cost-effective model promotion with t… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 14 pages

  4. arXiv:2404.10719  [pdf, other

    cs.CL

    Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

    Authors: Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is currently the most widely used method to align large language models (LLMs) with human preferences. Existing RLHF methods can be roughly categorized as either reward-based or reward-free. Novel applications such as ChatGPT and Claude leverage reward-based methods that first learn a reward model and apply actor-critic algorithms, such as Proximal… ▽ More

    Submitted 21 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 16 pages, 2 figures, 14 tables

  5. arXiv:2403.17980  [pdf, other

    cs.CR cs.LG

    EG-ConMix: An Intrusion Detection Method based on Graph Contrastive Learning

    Authors: Li** Wu, Shanshan Lei, Feilong Liao, Yuanjun Zheng, Yuxin Liu, Wentao Fu, Hao Song, Jiajun Zhou

    Abstract: As the number of IoT devices increases, security concerns become more prominent. The impact of threats can be minimized by deploying Network Intrusion Detection System (NIDS) by monitoring network traffic, detecting and discovering intrusions, and issuing security alerts promptly. Most intrusion detection research in recent years has been directed towards the pair of traffic itself without conside… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  6. arXiv:2403.05500  [pdf, other

    cs.RO

    Using Fiber Optic Bundles to Miniaturize Vision-Based Tactile Sensors

    Authors: Julia Di, Zdravko Dugonjic, Will Fu, Tingfan Wu, Romeo Mercado, Kevin Sawyer, Victoria Rose Most, Gregg Kammerer, Stefanie Speidel, Richard E. Fan, Geoffrey Sonn, Mark R. Cutkosky, Mike Lambeta, Roberto Calandra

    Abstract: Vision-based tactile sensors have recently become popular due to their combination of low cost, very high spatial resolution, and ease of integration using widely available miniature cameras. The associated field of view and focal length, however, are difficult to package in a human-sized finger. In this paper we employ optical fiber bundles to achieve a form factor that, at 15 mm diameter, is sma… ▽ More

    Submitted 11 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: We open source the design of DIGIT Pinki at https://github.com/facebookresearch/digit-design

  7. arXiv:2403.04303  [pdf, other

    cs.CV

    LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking

    Authors: Jialin Li, Qiang Nie, Weifu Fu, Yuhuan Lin, Guangpin Tao, Yong Liu, Chengjie Wang

    Abstract: Deep learning models, particularly those based on transformers, often employ numerous stacked structures, which possess identical architectures and perform similar functions. While effective, this stacking paradigm leads to a substantial increase in the number of parameters, posing challenges for practical applications. In today's landscape of increasingly large models, stacking depth can even rea… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 9 pages, 5 figures, 11 tables, CVPR2024 accepted

  8. arXiv:2403.01652  [pdf, other

    cs.NI

    Towards Memory-Efficient Traffic Policing in Time-Sensitive Networking

    Authors: Xuyan Jiang, Xiangrui Yang, Tongqing Zhou, Wenwen Fu, Wei Quan, Yihao Jiao, Yinhan Sun, Zhigang Sun

    Abstract: Time-Sensitive Networking (TSN) is an emerging real-time Ethernet technology that provides deterministic communication for time-critical traffic. At its core, TSN relies on Time-Aware Shaper (TAS) for pre-allocating frames in specific time intervals and Per-Stream Filtering and Policing (PSFP) for mitigating the fatal disturbance of unavoidable frame drift. However, as first identified in this wor… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  9. arXiv:2402.11954  [pdf, other

    cs.SD cs.MM eess.AS

    Multimodal Emotion Recognition from Raw Audio with Sinc-convolution

    Authors: Xiaohui Zhang, Wenjie Fu, Mangui Liang

    Abstract: Speech Emotion Recognition (SER) is still a complex task for computers with average recall rates usually about 70% on the most realistic datasets. Most SER systems use hand-crafted features extracted from audio signal such as energy, zero crossing rate, spectral information, prosodic, mel frequency cepstral coefficient (MFCC), and so on. More recently, using raw waveform for training neural networ… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  10. arXiv:2402.11931  [pdf, other

    cs.SD eess.AS q-bio.NC

    Soft-Weighted CrossEntropy Loss for Continous Alzheimer's Disease Detection

    Authors: Xiaohui Zhang, Wenjie Fu, Mangui Liang

    Abstract: Alzheimer's disease is a common cognitive disorder in the elderly. Early and accurate diagnosis of Alzheimer's disease (AD) has a major impact on the progress of research on dementia. At present, researchers have used machine learning methods to detect Alzheimer's disease from the speech of participants. However, the recognition accuracy of current methods is unsatisfactory, and most of them focus… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  11. arXiv:2402.02146  [pdf, other

    cs.AI cs.LG cs.NI eess.SP

    Emergency Computing: An Adaptive Collaborative Inference Method Based on Hierarchical Reinforcement Learning

    Authors: Weiqi Fu, Lianming Xu, Xin Wu, Li Wang, Aiguo Fei

    Abstract: In achieving effective emergency response, the timely acquisition of environmental information, seamless command data transmission, and prompt decision-making are crucial. This necessitates the establishment of a resilient emergency communication dedicated network, capable of providing communication and sensing services even in the absence of basic infrastructure. In this paper, we propose an Emer… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  12. arXiv:2402.01728  [pdf, other

    cs.CL cs.AI cs.AR

    Hardware Phi-1.5B: A Large Language Model Encodes Hardware Domain Specific Knowledge

    Authors: Weimin Fu, Shijie Li, Yifang Zhao, Haocheng Ma, Raj Dutta, Xuan Zhang, Kaichen Yang, Yier **, Xiaolong Guo

    Abstract: In the rapidly evolving semiconductor industry, where research, design, verification, and manufacturing are intricately linked, the potential of Large Language Models to revolutionize hardware design and security verification is immense. The primary challenge, however, lies in the complexity of hardware specific issues that are not adequately addressed by the natural language or software code know… ▽ More

    Submitted 27 January, 2024; originally announced February 2024.

    Comments: 6 pages, 6 figures

    Journal ref: 29th IEEE/ACM Asia and South Pacific Design Automation Conference (ASP-DAC); 2024 January; Incheon Songdo Convensia, South Korea

  13. arXiv:2401.18019  [pdf, other

    cs.DB

    Joining Entities Across Relation and Graph with a Unified Model

    Authors: Wenzhi Fu

    Abstract: This paper introduces RG (Relational Genetic) model, a revised relational model to represent graph-structured data in RDBMS while preserving its topology, for efficiently and effectively extracting data in different formats from disparate sources. Along with: (a) SQL$_δ$, an SQL dialect augmented with graph pattern queries and tuple-vertex joins, such that one can extract graph properties via grap… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 24 pages, 16 figures, 5 tables

    ACM Class: H.2

  14. LLM4SecHW: Leveraging Domain Specific Large Language Model for Hardware Debugging

    Authors: Weimin Fu, Kaichen Yang, Raj Gautam Dutta, Xiaolong Guo, Gang Qu

    Abstract: This paper presents LLM4SecHW, a novel framework for hardware debugging that leverages domain specific Large Language Model (LLM). Despite the success of LLMs in automating various software development tasks, their application in the hardware security domain has been limited due to the constraints of commercial LLMs and the scarcity of domain specific data. To address these challenges, we propose… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 6 pages. 1 figure

    Journal ref: 2023 Asian Hardware Oriented Security and Trust Symposium (AsianHOST), Tian**, China, 2023, pp. 1-6

  15. arXiv:2401.03804  [pdf, other

    cs.CL cs.AI

    TeleChat Technical Report

    Authors: Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang, Bingkai Yang, Xuewei he, Zhuoru Jiang , et al. (11 additional authors not shown)

    Abstract: In this technical report, we present TeleChat, a collection of large language models (LLMs) with parameters of 3 billion, 7 billion and 12 billion. It includes pretrained language models as well as fine-tuned chat models that is aligned with human preferences. TeleChat is initially pretrained on an extensive corpus containing a diverse collection of texts from both English and Chinese languages, i… ▽ More

    Submitted 1 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: 28 pages, 2 figures

    ACM Class: I.2.7

  16. arXiv:2312.06580  [pdf, ps, other

    cs.AR

    VGF: Value-Guided Fuzzing -- Fuzzing Hardware as Hardware

    Authors: Ruochen Dai, Michael Lee, Patrick Hoey, Weimin Fu, Tuba Yavuz, Xiaolong Guo, Shuo Wang, Dean Sullivan, Orlando Arias

    Abstract: As the complexity of logic designs increase, new avenues for testing digital hardware becomes necessary. Fuzz Testing (fuzzing) has recently received attention as a potential candidate for input vector generation on hardware designs. Using this technique, a fuzzer is used to generate an input to a logic design. Using a simulation engine, the logic design is given the generated stimulus and some me… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 20 pages, 7 figures, 7 tables

  17. Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks

    Authors: Ling Luo, **zhong Ning, Yingwen Zhao, Zhijun Wang, Zeyuan Ding, Peng Chen, Weiru Fu, Qinyu Han, Guangtao Xu, Yunzhi Qiu, Dinghao Pan, Jiru Li, Hao Li, Wenduo Feng, Senbo Tu, Yuqi Liu, Zhihao Yang, Jian Wang, Yuanyuan Sun, Hongfei Lin

    Abstract: Objective: Most existing fine-tuned biomedical large language models (LLMs) focus on enhancing performance in monolingual biomedical question answering and conversation tasks. To investigate the effectiveness of the fine-tuned LLMs on diverse biomedical NLP tasks in different languages, We present Taiyi, a bilingual fine-tuned LLM for diverse biomedical tasks. Materials and Methods: We first curat… ▽ More

    Submitted 19 December, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Journal ref: Journal of the American Medical Informatics Association, 2024, ocae037

  18. arXiv:2311.06062  [pdf, other

    cs.CL cs.CR cs.LG

    Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration

    Authors: Wenjie Fu, Huandong Wang, Chen Gao, Guanghua Liu, Yong Li, Tao Jiang

    Abstract: Membership Inference Attacks (MIA) aim to infer whether a target data record has been utilized for model training or not. Prior attempts have quantified the privacy risks of language models (LMs) via MIAs, but there is still no consensus on whether existing MIA algorithms can cause remarkable privacy leakage on practical Large Language Models (LLMs). Existing MIAs designed for LMs can be classifie… ▽ More

    Submitted 25 June, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: Repo: https://github.com/wjfu99/MIA-LLMs

  19. arXiv:2311.06049  [pdf, other

    cs.SI cs.CY

    Privacy-Preserving Individual-Level COVID-19 Infection Prediction via Federated Graph Learning

    Authors: Wenjie Fu, Huandong Wang, Chen Gao, Guanghua Liu, Yong Li, Tao Jiang

    Abstract: Accurately predicting individual-level infection state is of great value since its essential role in reducing the damage of the epidemic. However, there exists an inescapable risk of privacy leakage in the fine-grained user mobility trajectories required by individual-level infection prediction. In this paper, we focus on develo** a framework of privacy-preserving individual-level infection pred… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: accepted by TOIS

  20. arXiv:2311.05818  [pdf, other

    cs.RO

    Learning Agile Bipedal Motions on a Quadrupedal Robot

    Authors: Yunfei Li, **han Li, Wei Fu, Yi Wu

    Abstract: Can a quadrupedal robot perform bipedal motions like humans? Although develo** human-like behaviors is more often studied on costly bipedal robot platforms, we present a solution over a lightweight quadrupedal robot that unlocks the agility of the quadruped in an upright standing pose and is capable of a variety of human-like motions. Our framework is with a hierarchical structure. At the low le… ▽ More

    Submitted 3 March, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Camera ready for ICRA 2024

  21. arXiv:2310.14509  [pdf, other

    cs.LG cs.AI

    Iteratively Learn Diverse Strategies with State Distance Information

    Authors: Wei Fu, Weihua Du, **gwei Li, Sunli Chen, **gzhao Zhang, Yi Wu

    Abstract: In complex reinforcement learning (RL) problems, policies with similar rewards may have substantially different behaviors. It remains a fundamental challenge to optimize rewards while also discovering as many diverse strategies as possible, which can be crucial in many practical applications. Our study examines two design choices for tackling this challenge, i.e., diversity measure and computation… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  22. arXiv:2309.16306  [pdf, other

    cs.CV

    Can the Query-based Object Detector Be Designed with Fewer Stages?

    Authors: Jialin Li, Weifu Fu, Yuhuan Lin, Qiang Nie, Yong Liu

    Abstract: Query-based object detectors have made significant advancements since the publication of DETR. However, most existing methods still rely on multi-stage encoders and decoders, or a combination of both. Despite achieving high accuracy, the multi-stage paradigm (typically consisting of 6 stages) suffers from issues such as heavy computational burden, prompting us to reconsider its necessity. In this… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  23. arXiv:2308.15855  [pdf, other

    cs.CV

    IIDM: Inter and Intra-domain Mixing for Semi-supervised Domain Adaptation in Semantic Segmentation

    Authors: Weifu Fu, Qiang Nie, Jialin Li, Yuhuan Lin, Kai Wu, Jian Li, Yabiao Wang, Yong Liu, Chengjie Wang

    Abstract: Despite recent advances in semantic segmentation, an inevitable challenge is the performance degradation caused by the domain shift in real applications. Current dominant approach to solve this problem is unsupervised domain adaptation (UDA). However, the absence of labeled target data in UDA is overly restrictive and limits performance. To overcome this limitation, a more practical scenario calle… ▽ More

    Submitted 11 April, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: 7 pages, 4 figures

  24. arXiv:2308.12143  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    A Probabilistic Fluctuation based Membership Inference Attack for Diffusion Models

    Authors: Wenjie Fu, Huandong Wang, Chen Gao, Guanghua Liu, Yong Li, Tao Jiang

    Abstract: Membership Inference Attack (MIA) identifies whether a record exists in a machine learning model's training set by querying the model. MIAs on the classic classification models have been well-studied, and recent works have started to explore how to transplant MIA onto generative models. Our investigation indicates that existing MIAs designed for generative models mainly depend on the overfitting i… ▽ More

    Submitted 25 June, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: Repo: https://github.com/wjfu99/MIA-Gen

  25. arXiv:2307.09288  [pdf, other

    cs.CL cs.AI

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    Authors: Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini , et al. (43 additional authors not shown)

    Abstract: In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be… ▽ More

    Submitted 19 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

  26. arXiv:2306.16688  [pdf, other

    cs.DC cs.AI cs.LG

    SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

    Authors: Zhiyu Mei, Wei Fu, Jiaxuan Gao, Guangju Wang, Huanchen Zhang, Yi Wu

    Abstract: The ever-growing complexity of reinforcement learning (RL) tasks demands a distributed system to efficiently generate and process a massive amount of data. However, existing open-source libraries suffer from various limitations, which impede their practical use in challenging scenarios where large-scale training is necessary. In this paper, we present a novel abstraction on the dataflows of RL tra… ▽ More

    Submitted 21 June, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: Published at ICLR 2024. 10 pages (24 pages with references and appendix), 7 figures

  27. arXiv:2305.14516  [pdf, other

    cs.LG cs.DC

    Chakra: Advancing Performance Benchmarking and Co-design using Standardized Execution Traces

    Authors: Srinivas Sridharan, Taekyung Heo, Louis Feng, Zhaodong Wang, Matt Bergeron, Wenyin Fu, Shengbao Zheng, Brian Coutinho, Saeed Rashidi, Changhai Man, Tushar Krishna

    Abstract: Benchmarking and co-design are essential for driving optimizations and innovation around ML models, ML software, and next-generation hardware. Full workload benchmarks, e.g. MLPerf, play an essential role in enabling fair comparison across different software and hardware stacks especially once systems are fully designed and deployed. However, the pace of AI innovation demands a more agile methodol… ▽ More

    Submitted 26 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  28. arXiv:2303.09035  [pdf, other

    cs.CV

    Extracting the Brain-like Representation by an Improved Self-Organizing Map for Image Classification

    Authors: Jiahong Zhang, Lihong Cao, Moning Zhang, Wenlong Fu

    Abstract: Backpropagation-based supervised learning has achieved great success in computer vision tasks. However, its biological plausibility is always controversial. Recently, the bio-inspired Hebbian learning rule (HLR) has received extensive attention. Self-Organizing Map (SOM) uses the competitive HLR to establish connections between neurons, obtaining visual features in an unsupervised way. Although th… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: This paper has been accepted by ICASSP-2023

  29. arXiv:2303.09005  [pdf, other

    cs.CV cs.LG

    Conditional Synthetic Food Image Generation

    Authors: Wen** Fu, Yue Han, Jiangpeng He, Sriram Baireddy, Mridul Gupta, Fengqing Zhu

    Abstract: Generative Adversarial Networks (GAN) have been widely investigated for image synthesis based on their powerful representation learning ability. In this work, we explore the StyleGAN and its application of synthetic food image generation. Despite the impressive performance of GAN for natural image generation, food images suffer from high intra-class diversity and inter-class similarity, resulting… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  30. arXiv:2301.04122  [pdf, other

    cs.DC cs.AI

    Mystique: Enabling Accurate and Scalable Generation of Production AI Benchmarks

    Authors: Mingyu Liang, Wenyin Fu, Louis Feng, Zhongyi Lin, Pavani Panakanti, Shengbao Zheng, Srinivas Sridharan, Christina Delimitrou

    Abstract: Building large AI fleets to support the rapidly growing DL workloads is an active research topic for modern cloud providers. Generating accurate benchmarks plays an essential role in designing the fast-paced software and hardware solutions in this space. Two fundamental challenges to make this scalable are (i) workload representativeness and (ii) the ability to quickly incorporate changes to the f… ▽ More

    Submitted 11 April, 2023; v1 submitted 16 December, 2022; originally announced January 2023.

    Comments: Accepted to ISCA 2023

  31. arXiv:2207.00493  [pdf, other

    q-fin.ST cs.LG q-fin.CP

    Simulating financial time series using attention

    Authors: Weilong Fu, Ali Hirsa, Jörg Osterrieder

    Abstract: Financial time series simulation is a central topic since it extends the limited real data for training and evaluation of trading strategies. It is also challenging because of the complex statistical properties of the real financial data. We introduce two generative adversarial networks (GANs), which utilize the convolutional networks with attention and the transformers, for financial time series… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

  32. arXiv:2206.07505  [pdf, other

    cs.AI

    Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning

    Authors: Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, Yi Wu

    Abstract: Many advances in cooperative multi-agent reinforcement learning (MARL) are based on two common design principles: value decomposition and parameter sharing. A typical MARL algorithm of this fashion decomposes a centralized Q-function into local Q-networks with parameters shared across agents. Such an algorithmic paradigm enables centralized training and decentralized execution (CTDE) and leads to… ▽ More

    Submitted 7 August, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: 16 pages, published as a conference paper in ICML 2022

  33. arXiv:2206.06936  [pdf, ps, other

    cs.IT eess.SP

    Worst-case Design for RIS-aided Over-the-air Computation with Imperfect CSI

    Authors: Wenhui Zhang, **dan Xu, Wei Xu, Xiaohu You, Weijie Fu

    Abstract: Over-the-air computation (AirComp) enables fast wireless data aggregation at the receiver through concurrent transmission by sensors in the application of Internet-of-Things (IoT). To further improve the performance of AirComp under unfavorable propagation channel conditions, we consider the problem of computation distortion minimization in a reconfigurable intelligent surface (RIS)-aided AirComp… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  34. arXiv:2205.11407  [pdf

    cond-mat.mtrl-sci cs.LG

    Deep-learning-based prediction of nanoparticle phase transitions during in situ transmission electron microscopy

    Authors: Wenkai Fu, Steven R. Spurgeon, Chongmin Wang, Yuyan Shao, Wei Wang, Amra Peles

    Abstract: We develop the machine learning capability to predict a time sequence of in-situ transmission electron microscopy (TEM) video frames based on the combined long-short-term-memory (LSTM) algorithm and the features de-entanglement method. We train deep learning models to predict a sequence of future video frames based on the input of a sequence of previous frames. This unique capability provides insi… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 16 pages, 13 figures

  35. arXiv:2205.03850  [pdf, other

    cs.CR cs.LG eess.SY

    SeqNet: An Efficient Neural Network for Automatic Malware Detection

    Authors: Jiawei Xu, Wenxuan Fu, Haoyu Bu, Zhi Wang, Lingyun Ying

    Abstract: Malware continues to evolve rapidly, and more than 450,000 new samples are captured every day, which makes manual malware analysis impractical. However, existing deep learning detection models need manual feature engineering or require high computational overhead for long training processes, which might be laborious to select feature space and difficult to retrain for mitigating model aging. There… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

  36. arXiv:2204.02246  [pdf, other

    cs.LG cs.AI

    Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization

    Authors: Zihan Zhou, Wei Fu, Bingliang Zhang, Yi Wu

    Abstract: We present Reward-Switching Policy Optimization (RSPO), a paradigm to discover diverse strategies in complex RL environments by iteratively finding novel policies that are both locally optimal and sufficiently different from existing ones. To encourage the learning policy to consistently converge towards a previously undiscovered local optimum, RSPO switches between extrinsic and intrinsic rewards… ▽ More

    Submitted 3 May, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: 30 pages, 15 figures, published as a conference paper at ICLR 2022

  37. arXiv:2203.07975  [pdf, other

    cs.LG cond-mat.dis-nn cs.AI math.AG math.CT math.DG

    Categorical Representation Learning and RG flow operators for algorithmic classifiers

    Authors: Artan Sheshmani, Yizhuang You, Wenbo Fu, Ahmadreza Azizi

    Abstract: Following the earlier formalism of the categorical representation learning (arXiv:2103.14770) by the first two authors, we discuss the construction of the "RG-flow based categorifier". Borrowing ideas from theory of renormalization group flows (RG) in quantum field theory, holographic duality, and hyperbolic geometry, and mixing them with neural ODE's, we construct a new algorithmic natural langua… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: 31 pages, comments are very welcome

    MSC Class: 03B70; 03-04; 03D10; 11Y16

    Journal ref: Machine Learning: Science and Technology, 2023

  38. arXiv:2203.01934  [pdf

    eess.IV cs.AI cs.CV cs.LG

    Quality or Quantity: Toward a Unified Approach for Multi-organ Segmentation in Body CT

    Authors: Fakrul Islam Tushar, Husam Nujaim, Wanyi Fu, Ehsan Abadi, Maciej A. Mazurowski, Ehsan Samei, William P. Segars, Joseph Y. Lo

    Abstract: Organ segmentation of medical images is a key step in virtual imaging trials. However, organ segmentation datasets are limited in terms of quality (because labels cover only a few organs) and quantity (since case numbers are limited). In this study, we explored the tradeoffs between quality and quantity. Our goal is to create a unified approach for multi-organ segmentation of body CT, which will f… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 6 pages, 3 figures, 2 tables, Accepted and Presented at SPIE Medical Imaging 2022

  39. arXiv:2202.11124  [pdf, other

    cs.CV

    Learning with Free Object Segments for Long-Tailed Instance Segmentation

    Authors: Cheng Zhang, Tai-Yu Pan, Tianle Chen, Jike Zhong, Wen** Fu, Wei-Lun Chao

    Abstract: One fundamental challenge in building an instance segmentation model for a large number of classes in complex scenes is the lack of training examples, especially for rare objects. In this paper, we explore the possibility to increase the training examples without laborious data collection and annotation. We find that an abundance of instance segments can potentially be obtained freely from object-… ▽ More

    Submitted 4 October, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: Accepted to ECCV 2022

  40. arXiv:2111.02668  [pdf, other

    cs.CV

    LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation

    Authors: WeiFu Fu, CongChong Nie, Ting Sun, Jun Liu, TianLiang Zhang, Yong Liu

    Abstract: This report introduces the technical details of the team FuXi-Fresher for LVIS Challenge 2021. Our method focuses on the problem in following two aspects: the long-tail distribution and the segmentation quality of mask and boundary. Based on the advanced HTC instance segmentation algorithm, we connect transformer backbone(Swin-L) through composite connections inspired by CBNetv2 to enhance the bas… ▽ More

    Submitted 4 November, 2021; v1 submitted 4 November, 2021; originally announced November 2021.

  41. An Improved Positioning Accuracy Method of a Robot Based on Particle Filter

    Authors: Rashid Ali, Dil Nawaz Hakro, Yong** He, Wenpeng Fu, Zhiqiang Cao

    Abstract: This paper aims to improve the performance and positioning accuracy of a robot by using the particle filter method. The laser range information is a wireless navigation system mainly used to measure, position, and control autonomous robots. Its localization is more flexible to control than wired guidance systems. However, the navigation through the laser range finder occurs with a large positionin… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: 12 pages, 6 figures, conference

  42. arXiv:2107.11099  [pdf, other

    quant-ph cs.CV cs.LG

    RGB Image Classification with Quantum Convolutional Ansaetze

    Authors: Yu **g, Xiaogang Li, Yang Yang, Chonghang Wu, Wenbing Fu, Wei Hu, Yuanyuan Li, Hua Xu

    Abstract: With the rapid growth of qubit numbers and coherence times in quantum hardware technology, implementing shallow neural networks on the so-called Noisy Intermediate-Scale Quantum (NISQ) devices has attracted a lot of interest. Many quantum (convolutional) circuit ansaetze are proposed for grayscale images classification tasks with promising empirical results. However, when applying these ansaetze o… ▽ More

    Submitted 22 February, 2022; v1 submitted 23 July, 2021; originally announced July 2021.

    Comments: https://link.springer.com/article/10.1007/s11128-022-03442-8

    Journal ref: Quantum Inf Process 21, 101 (2022)

  43. arXiv:2107.07113  [pdf, other

    cs.CL cs.AI

    Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example Mining

    Authors: Guowei Xu, Wenbiao Ding, Wei** Fu, Zhongqin Wu, Zitao Liu

    Abstract: Many real-world applications involve the use of Optical Character Recognition (OCR) engines to transform handwritten images into transcripts on which downstream Natural Language Processing (NLP) models are applied. In this process, OCR engines may introduce errors and inputs to downstream NLP models become noisy. Despite that pre-trained models achieve state-of-the-art performance in many NLP benc… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: ECML-PKDD'21: The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021

  44. arXiv:2107.04140  [pdf, other

    cs.AR

    First-Generation Inference Accelerator Deployment at Facebook

    Authors: Michael Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix, Michael Gschwind, Aravind Kalaiah, Changkyu Kim, Jaewon Lee, Jason Liang, Haixin Liu, Yinghai Lu, Jack Montgomery, Arun Moorthy, Satish Nadathur, Sam Naghshineh, Avinash Nayak, Jongsoo Park, Chris Petersen, Martin Schatz, Narayanan Sundaram, Bangsheng Tang, Peter Tang, Amy Yang, Jiecao Yu , et al. (90 additional authors not shown)

    Abstract: In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and network bandwidth requirements. We co-designed a high-performance, energy-efficient inference accelerator platform based on these requirements. We describe the in… ▽ More

    Submitted 4 August, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  45. arXiv:2106.07894  [pdf, other

    cs.AR cs.DC cs.LG

    S2Engine: A Novel Systolic Architecture for Sparse Convolutional Neural Networks

    Authors: Jianlei Yang, Wenzhi Fu, Xingzhou Cheng, Xucheng Ye, Pengcheng Dai, Weisheng Zhao

    Abstract: Convolutional neural networks (CNNs) have achieved great success in performing cognitive tasks. However, execution of CNNs requires a large amount of computing resources and generates heavy memory traffic, which imposes a severe challenge on computing system design. Through optimizing parallel executions and data reuse in convolution, systolic architecture demonstrates great advantages in accelera… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: 13 pages, 17 figures

    Journal ref: IEEE Transactions on Computers, 2021

  46. arXiv:2106.03648  [pdf, other

    cs.RO

    Cost-effective Map** of Mobile Robot Based on the Fusion of UWB and Short-range 2D LiDAR

    Authors: Ran Liu, Yong** He, Chau Yuen, Billy Pik Lik Lau, Rashid Ali, Wenpeng Fu, Zhiqiang Cao

    Abstract: Environment map** is an essential prerequisite for mobile robots to perform different tasks such as navigation and mission planning. With the availability of low-cost 2D LiDARs, there are increasing applications of such 2D LiDARs in industrial environments. However, environment map** in an unknown and feature-less environment with such low-cost 2D LiDARs remains a challenge. The challenge main… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted by IEEE/ASME TRANSACTIONS ON MECHATRONICS

  47. arXiv:2103.11526  [pdf, other

    cs.LG

    ExAD: An Ensemble Approach for Explanation-based Adversarial Detection

    Authors: Raj Vardhan, Ninghao Liu, Phakpoom Chinprutthiwong, Weijie Fu, Zhenyu Hu, Xia Ben Hu, Guofei Gu

    Abstract: Recent research has shown Deep Neural Networks (DNNs) to be vulnerable to adversarial examples that induce desired misclassifications in the models. Such risks impede the application of machine learning in security-sensitive domains. Several defense methods have been proposed against adversarial attacks to detect adversarial examples at test time or to make machine learning models more robust. How… ▽ More

    Submitted 21 March, 2021; originally announced March 2021.

    Comments: 15 pages, 10 figures

  48. arXiv:2012.00058  [pdf

    cs.LG cs.DB

    PMLB v1.0: An open source dataset collection for benchmarking machine learning methods

    Authors: Joseph D. Romano, Trang T. Le, William La Cava, John T. Gregg, Daniel J. Goldberg, Natasha L. Ray, Praneel Chakraborty, Daniel Himmelstein, Weixuan Fu, Jason H. Moore

    Abstract: Motivation: Novel machine learning and statistical modeling studies rely on standardized comparisons to existing methods using well-studied benchmark datasets. Few tools exist that provide rapid access to many of these datasets through a standardized, user-friendly interface that integrates well with popular data science workflows. Results: This release of PMLB provides the largest collection of… ▽ More

    Submitted 6 April, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: 4 pages, 1 figure. *: These authors contributed equally

    ACM Class: H.2.8

  49. arXiv:2008.08730  [pdf

    physics.med-ph cs.CV eess.IV

    iPhantom: a framework for automated creation of individualized computational phantoms and its application to CT organ dosimetry

    Authors: Wanyi Fu, Shobhit Sharma, Ehsan Abadi, Alexandros-Stavros Iliopoulos, Qi Wang, Joseph Y. Lo, Xiaobai Sun, William P. Segars, Ehsan Samei

    Abstract: Objective: This study aims to develop and validate a novel framework, iPhantom, for automated creation of patient-specific phantoms or digital-twins (DT) using patient medical images. The framework is applied to assess radiation dose to radiosensitive organs in CT imaging of individual patients. Method: From patient CT images, iPhantom segments selected anchor organs (e.g. liver, bones, pancreas)… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

    Comments: Main text: 11 pages, 8 figures; Supplement material: 7 pages, 5 figures, 7 tables

  50. arXiv:2008.03911  [pdf, other

    cs.LG stat.ML

    A Survey on Large-scale Machine Learning

    Authors: Meng Wang, Weijie Fu, Xiangnan He, Shijie Hao, Xindong Wu

    Abstract: Machine learning can provide deep insights into data, allowing machines to make high-quality predictions and having been widely used in real-world applications, such as text mining, visual classification, and recommender systems. However, most sophisticated machine learning approaches suffer from huge time costs when operating on large-scale data. This issue calls for the need of {Large-scale Mach… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.