Skip to main content

Showing 1–26 of 26 results for author: Mei, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14088  [pdf, other

    cs.DC cs.AI cs.CL cs.LG

    ReaLHF: Optimized RLHF Training for Large Language Models through Parameter Reallocation

    Authors: Zhiyu Mei, Wei Fu, Kaiwei Li, Guangju Wang, Huanchen Zhang, Yi Wu

    Abstract: Reinforcement Learning from Human Feedback (RLHF) stands as a pivotal technique in empowering large language model (LLM) applications. Since RLHF involves diverse computational workloads and intricate dependencies among multiple LLMs, directly adopting parallelization techniques from supervised training can result in sub-optimal performance. To overcome this limitation, we propose a novel approach… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 13 pages (15 pages with references), 13 figures

  2. arXiv:2404.10719  [pdf, other

    cs.CL

    Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

    Authors: Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is currently the most widely used method to align large language models (LLMs) with human preferences. Existing RLHF methods can be roughly categorized as either reward-based or reward-free. Novel applications such as ChatGPT and Claude leverage reward-based methods that first learn a reward model and apply actor-critic algorithms, such as Proximal… ▽ More

    Submitted 21 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 16 pages, 2 figures, 14 tables

  3. arXiv:2403.08185  [pdf, other

    cs.RO eess.SY

    Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception

    Authors: Anushri Dixit, Zhiting Mei, Meghan Booker, Mariko Storey-Matsutani, Allen Z. Ren, Anirudha Majumdar

    Abstract: Rapid advances in perception have enabled large pre-trained models to be used out of the box for processing high-dimensional, noisy, and partial observations of the world into rich geometric representations (e.g., occupancy predictions). However, safe integration of these models onto robots remains challenging due to a lack of reliable performance in unfamiliar environments. In this work, we prese… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Videos and code can be found at https://perceive-with-confidence.github.io

  4. arXiv:2402.12957  [pdf, other

    cs.DC

    Energy-Efficient Wireless Federated Learning via Doubly Adaptive Quantization

    Authors: Xuefeng Han, Wen Chen, Jun Li, Ming Ding, Qingqing Wu, Kang Wei, Xiumei Deng, Zhen Mei

    Abstract: Federated learning (FL) has been recognized as a viable distributed learning paradigm for training a machine learning model across distributed clients without uploading raw data. However, FL in wireless networks still faces two major challenges, i.e., large communication overhead and high energy consumption, which are exacerbated by client heterogeneity in dataset sizes and wireless channels. Whil… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  5. arXiv:2312.00774  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Context Retrieval via Normalized Contextual Latent Interaction for Conversational Agent

    Authors: Junfeng Liu, Zhuocheng Mei, Kewen Peng, Ranga Raju Vatsavai

    Abstract: Conversational agents leveraging AI, particularly deep learning, are emerging in both academic research and real-world applications. However, these applications still face challenges, including disrespecting knowledge and facts, not personalizing to user preferences, and enormous demand for computational resources during training and inference. Recent research efforts have been focused on addressi… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 2023 IEEE International Conference on Data Mining Workshops (ICDMW)

  6. arXiv:2310.09002  [pdf, other

    cs.LG

    Federated Meta-Learning for Few-Shot Fault Diagnosis with Representation Encoding

    Authors: Jixuan Cui, Jun Li, Zhen Mei, Kang Wei, Sha Wei, Ming Ding, Wen Chen, Song Guo

    Abstract: Deep learning-based fault diagnosis (FD) approaches require a large amount of training data, which are difficult to obtain since they are located across different entities. Federated learning (FL) enables multiple clients to collaboratively train a shared model with data privacy guaranteed. However, the domain discrepancy and data scarcity problems among clients deteriorate the performance of the… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  7. arXiv:2310.01966  [pdf

    cs.IT

    Throughput Maximization for Instantly Decodable Network Coded NOMA in Broadcast Communication Systems

    Authors: Zhonghui Mei

    Abstract: Non-orthogonal multiple access (NOMA) is a promising transmission scheme employed at the physical layer to improve the spectral efficiency. In this paper, we develop a novel cross-layer approach by employing NOMA at the physical layer and instantly decodable network coding (IDNC) at the network layer in downlink cellular networks. Following this approach, two IDNC packets are selected for each tra… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  8. arXiv:2308.03521  [pdf, other

    cs.LG cs.AI cs.DC

    Analysis and Optimization of Wireless Federated Learning with Data Heterogeneity

    Authors: Xuefeng Han, Jun Li, Wen Chen, Zhen Mei, Kang Wei, Ming Ding, H. Vincent Poor

    Abstract: With the rapid proliferation of smart mobile devices, federated learning (FL) has been widely considered for application in wireless networks for distributed model training. However, data heterogeneity, e.g., non-independently identically distributions and different sizes of training data among clients, poses major challenges to wireless FL. Limited communication resources complicate the implement… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  9. arXiv:2306.16688  [pdf, other

    cs.DC cs.AI cs.LG

    SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

    Authors: Zhiyu Mei, Wei Fu, Jiaxuan Gao, Guangju Wang, Huanchen Zhang, Yi Wu

    Abstract: The ever-growing complexity of reinforcement learning (RL) tasks demands a distributed system to efficiently generate and process a massive amount of data. However, existing open-source libraries suffer from various limitations, which impede their practical use in challenging scenarios where large-scale training is necessary. In this paper, we present a novel abstraction on the dataflows of RL tra… ▽ More

    Submitted 21 June, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: Published at ICLR 2024. 10 pages (24 pages with references and appendix), 7 figures

  10. arXiv:2211.03093  [pdf, other

    cs.RO

    SRIBO: An Efficient and Resilient Single-Range and Inertia Based Odometry for Flying Robots

    Authors: Wei Dong, Zheyuan Mei, Yuanjiong Ying, Sijia Chen, Yichen ie, Xiangyang Zhu

    Abstract: Positioning with one inertial measurement unit and one ranging sensor is commonly thought to be feasible only when trajectories are in certain patterns ensuring observability. For this reason, to pursue observable patterns, it is required either exciting the trajectory or searching key nodes in a long interval, which is commonly highly nonlinear and may also lack resilience. Therefore, such a posi… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

  11. arXiv:2209.12139  [pdf, other

    cs.CV

    Lightweight Image Codec via Multi-Grid Multi-Block-Size Vector Quantization (MGBVQ)

    Authors: Yifan Wang, Zhanxuan Mei, Ioannis Katsavounidis, C. -C. Jay Kuo

    Abstract: A multi-grid multi-block-size vector quantization (MGBVQ) method is proposed for image coding in this work. The fundamental idea of image coding is to remove correlations among pixels before quantization and entropy coding, e.g., the discrete cosine transform (DCT) and intra predictions, adopted by modern image coding standards. We present a new method to remove pixel correlations. First, by decom… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

    Comments: GIC-python-v2

  12. arXiv:2208.05271  [pdf, other

    cs.CV cs.AI

    Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

    Authors: Peng Ye, Baopu Li, Tao Chen, Jiayuan Fan, Zhen Mei, Chen Lin, Chongyan Zuo, Qinghua Chi, Wanli Ouyan

    Abstract: Semantic segmentation is a popular research topic in computer vision, and many efforts have been made on it with impressive results. In this paper, we intend to search an optimal network structure that can run in real-time for this problem. Towards this goal, we jointly search the depth, channel, dilation rate and feature spatial resolution, which results in a search space consisting of about 2.78… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

  13. arXiv:2202.00129  [pdf, other

    cs.RO cs.AI cs.IT cs.LG math.OC

    Fundamental Limits for Sensor-Based Robot Control

    Authors: Anirudha Majumdar, Zhiting Mei, Vincent Pacelli

    Abstract: Our goal is to develop theory and algorithms for establishing fundamental limits on performance imposed by a robot's sensors for a given task. In order to achieve this, we define a quantity that captures the amount of task-relevant information provided by a sensor. Using a novel version of the generalized Fano inequality from information theory, we demonstrate that this quantity provides an upper… ▽ More

    Submitted 11 July, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: Extended version of paper presented at the 2022 Robotics: Science and Systems (RSS) conference

  14. arXiv:2109.08927  [pdf, other

    cs.CL cs.AI

    Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy Logic

    Authors: Zijun Wu, Zi Xuan Zhang, Atharva Naik, Zhijian Mei, Mauajama Firdaus, Lili Mou

    Abstract: Natural language inference (NLI) aims to determine the logical relationship between two sentences, such as Entailment, Contradiction, and Neutral. In recent years, deep learning models have become a prevailing approach to NLI, but they lack interpretability and explainability. In this work, we address the explainability of NLI by weakly supervised logical reasoning, and propose an Explainable Phra… ▽ More

    Submitted 22 February, 2023; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: Accepted by ICLR 2023

  15. arXiv:2105.03649  [pdf, other

    cs.NE cs.DC cs.ET

    In-Hardware Learning of Multilayer Spiking Neural Networks on a Neuromorphic Processor

    Authors: Amar Shrestha, Haowen Fang, Daniel Patrick Rider, Zaidao Mei, Qinru Qiu

    Abstract: Although widely used in machine learning, backpropagation cannot directly be applied to SNN training and is not feasible on a neuromorphic processor that emulates biological neuron and synapses. This work presents a spike-based backpropagation algorithm with biological plausible local update rules and adapts it to fit the constraint in a neuromorphic hardware. The algorithm is implemented on Intel… ▽ More

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: 6 pages, 5 figures, accepted for Design Automation Conference (DAC) 2021

  16. arXiv:2104.10712  [pdf, other

    cs.AR cs.NE

    Neuromorphic Algorithm-hardware Codesign for Temporal Pattern Learning

    Authors: Haowen Fang, Brady Taylor, Ziru Li, Zaidao Mei, Hai Li, Qinru Qiu

    Abstract: Neuromorphic computing and spiking neural networks (SNN) mimic the behavior of biological systems and have drawn interest for their potential to perform cognitive tasks with high energy efficiency. However, some factors such as temporal dynamics and spike timings prove critical for information processing but are often ignored by existing works, limiting the performance and applications of neuromor… ▽ More

    Submitted 6 May, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

  17. arXiv:2102.00502  [pdf, other

    cs.MM eess.IV

    A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

    Authors: Yifan Wang, Zhanxuan Mei, Chia-Yang Tsai, Ioannis Katsavounidis, C. -C. Jay Kuo

    Abstract: The design of the optimal inverse discrete cosine transform (IDCT) to compensate the quantization error is proposed for effective lossy image compression in this work. The forward and inverse DCTs are designed in pair in current image/video coding standards without taking the quantization effect into account. Yet, the distribution of quantized DCT coefficients deviate from that of original DCT coe… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

    Comments: conference

  18. arXiv:2004.05340  [pdf, ps, other

    cs.IT cs.LG

    DNN-aided Read-voltage Threshold Optimization for MLC Flash Memory with Finite Block Length

    Authors: Cheng Wang, Kang Wei, Lingjun Kong, Long Shi, Zhen Mei, Jun Li, Kui Cai

    Abstract: The error correcting performance of multi-level-cell (MLC) NAND flash memory is closely related to the block length of error correcting codes (ECCs) and log-likelihood-ratios (LLRs) of the read-voltage thresholds. Driven by this issue, this paper optimizes the read-voltage thresholds for MLC flash memory to improve the decoding performance of ECCs with finite block length. First, through the analy… ▽ More

    Submitted 11 April, 2020; originally announced April 2020.

  19. arXiv:1907.03938  [pdf, ps, other

    cs.IT

    Deep Learning-Aided Dynamic Read Thresholds Design For Multi-Level-Cell Flash Memories

    Authors: Zhen Mei, Kui Cai, Xuan He

    Abstract: The practical NAND flash memory suffers from various non-stationary noises that are difficult to be predicted. Furthermore, the data retention noise induced channel offset is unknown during the readback process. This severely affects the data recovery from the memory cell. In this paper, we first propose a novel recurrent neural network (RNN)-based detector to effectively detect the data symbols s… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

  20. arXiv:1907.02944  [pdf

    cs.IT

    Proceedings of the 11th Asia-Europe Workshop on Concepts in Information Theory

    Authors: A. J. Han Vinck, Kees A. Schouhamer Immink, Tadashi Wadayama, Van Khu Vu, Akiko Manada, Kui Cai, Shunsuke Horii, Yoshiki Abe, Mitsugu Iwamoto, Kazuo Ohta, Xingwei Zhong, Zhen Mei, Renfei Bu, J. H. Weber, Vitaly Skachek, Hiroyoshi Morita, N. Hovhannisyan, Hiroshi Kamabe, Shan Lu, Hirosuke Yamamoto, Kengo Hasimoto, O. Ytrehus, Shigeaki Kuzuoaka, Mikihiko Nishiara, Han Mao Kiah , et al. (2 additional authors not shown)

    Abstract: This year, 2019 we celebrate 30 years of our friendship between Asian and European scientists at the AEW11 in Rotterdam, the Netherlands. Many of the 1989 participants are also present at the 2019 event. This year we have many participants from different parts of Asia and Europe. It shows the importance of this event. It is a good tradition to pay a tribute to a special lecturer in our community.… ▽ More

    Submitted 26 June, 2019; originally announced July 2019.

  21. arXiv:1904.13245  [pdf, ps, other

    cs.IT

    Design of Protograph Codes for Additive White Symmetric Alpha-Stable Noise Channels

    Authors: Xingwei Zhong, Kui Cai, **** Chen, Zhen Mei

    Abstract: The protograph low-density parity-check (LDPC) codes possess many attractive properties, such as the low encoding/decoding complexity and better error floor performance, and hence have been successfully applied to different types of communication and data storage channels. In this paper,we design protograph LDPC codes for communication systems corrupted by the impulsive noise, which are modeled as… ▽ More

    Submitted 30 April, 2019; originally announced April 2019.

  22. arXiv:1904.06666  [pdf, other

    cs.IT

    Mutual Information-Maximizing Quantized Belief Propagation Decoding of Regular LDPC Codes

    Authors: Xuan He, Kui Cai, Zhen Mei, Peng Kang, Xiaohu Tang

    Abstract: In this paper, we propose a class of finite alphabet iterative decoder (FAID), called mutual information-maximizing quantized belief propagation (MIM-QBP) decoder, for decoding regular low-density parity-check (LDPC) codes. Our decoder follows the reconstruction-calculation-quantization (RCQ) decoding architecture that is widely used in FAIDs. We present the first complete and systematic design fr… ▽ More

    Submitted 16 December, 2022; v1 submitted 14 April, 2019; originally announced April 2019.

  23. arXiv:1902.06289  [pdf, ps, other

    cs.IT cs.LG

    Neural Network-Based Dynamic Threshold Detection for Non-Volatile Memories

    Authors: Zhen Mei, Kui Cai, Xingwei Zhong

    Abstract: The memory physics induced unknown offset of the channel is a critical and difficult issue to be tackled for many non-volatile memories (NVMs). In this paper, we first propose novel neural network (NN) detectors by using the multilayer perceptron (MLP) network and the recurrent neural network (RNN), which can effectively tackle the unknown offset of the channel. However, compared with the conventi… ▽ More

    Submitted 17 February, 2019; originally announced February 2019.

    Comments: A six-page version of this paper has been accepted by ICC 2019

  24. arXiv:1901.01659  [pdf, other

    cs.IT

    Dynamic Programming for Sequential Deterministic Quantization of Discrete Memoryless Channels

    Authors: Xuan He, Kui Cai, Wentu Song, Zhen Mei

    Abstract: In this paper, under a general cost function $C$, we present a dynamic programming (DP) method to obtain an optimal sequential deterministic quantizer (SDQ) for $q$-ary input discrete memoryless channel (DMC). The DP method has complexity $O(q (N-M)^2 M)$, where $N$ and $M$ are the alphabet sizes of the DMC output and quantizer output, respectively. Then, starting from the quadrangle inequality, t… ▽ More

    Submitted 23 February, 2021; v1 submitted 6 January, 2019; originally announced January 2019.

    Comments: 14 pages, 3 figures, accepted by TCOM

  25. Information Theoretic Bounds Based Channel Quantization Design for Emerging Memories

    Authors: Zhen Mei, Kui Cai, Long Shi

    Abstract: Channel output quantization plays a vital role in high-speed emerging memories such as the spin-torque transfer magnetic random access memory (STT-MRAM), where high-precision analog-to-digital converters (ADCs) are not applicable. In this paper, we investigate the design of the 1-bit quantizer which is highly suitable for practical applications. We first propose a quantized channel model for STT-M… ▽ More

    Submitted 9 November, 2018; originally announced November 2018.

    Comments: This paper is accepted by ITW 2018

  26. arXiv:1712.00983  [pdf, ps, other

    cs.IT

    Design of Polar Codes with Single and Multi-Carrier Modulation on Impulsive Noise Channels using Density Evolution

    Authors: Zhen Mei, Bin Dai, Martin Johnston, Rolando Carrasco

    Abstract: In this paper, density evolution-based construction methods to design good polar codes on impulsive noise channels for single-carrier and multi-carrier systems are proposed and evaluated. For a single-carrier system, the tight bound of the block error probability (BLEP) is derived by applying density evolution and the performance of the proposed construction methods are compared. For the multi-car… ▽ More

    Submitted 4 December, 2017; originally announced December 2017.

    Comments: 5 pages, 3 figures