Showing 1–11 of 11 results for author: Du, B

Searching in archive stat.
  1. arXiv:2406.10576  [pdf, other]

    cs.LG cs.CL stat.ML

    Optimization-based Structural Pruning for Large Language Models without Back-Propagation

    Authors: Yuan Gao, Zujing Liu, Weizhong Zhang, Bo Du, Gui-Song Xia

    Abstract: Compared to the moderate size of neural network models, structural weight pruning on the Large-Language Models (LLMs) imposes a novel challenge on the efficiency of the pruning algorithms, due to the heavy computation/memory demands of the LLMs. Recent efficient LLM pruning methods typically operate at the post-training phase without the expensive weight finetuning, however, their pruning criteria…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 17 pages

  2. arXiv:2404.01273  [pdf, other]

    cs.LG cs.CL stat.ME

    TWIN-GPT: Digital Twins for Clinical Trials via Large Language Model

    Authors: Yue Wang, Tianfan Fu, Yinlong Xu, Zihan Ma, Hongxia Xu, Yingzhou Lu, Bang Du, Honghao Gao, Jian Wu

    Abstract: Clinical trials are indispensable for medical research and the development of new treatments. However, clinical trials often involve thousands of participants and can span several years to complete, with a high probability of failure during the process. Recently, there has been a burgeoning interest in virtual clinical trials, which simulate real-world scenarios and hold the potential to significa…

    Submitted 28 June, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  3. arXiv:2110.02588  [pdf, ps, other]

    stat.ME

    Hypothesis Testing of One-Sample Mean Vector in Distributed Frameworks

    Authors: Bin Du, Junlong Zhao

    Abstract: Distributed frameworks are widely used to handle massive data, where sample size $n$ is very large, and data are often stored in $k$ different machines. For a random vector $X\in \mathbb{R}^p$ with expectation $μ$, testing the mean vector $H_0: μ=μ_0$ vs $H_1: μ\ne μ_0$ for a given vector $μ_0$ is a basic problem in statistics. The centralized test statistics require heavy communication costs, whi…

    Submitted 6 October, 2021; originally announced October 2021.

  4. arXiv:2103.00719  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    LocalDrop: A Hybrid Regularization for Deep Neural Networks

    Authors: Ziqing Lu, Chang Xu, Bo Du, Takashi Ishida, Lefei Zhang, Masashi Sugiyama

    Abstract: In neural networks, developing regularization algorithms to settle overfitting is one of the major study areas. We propose a new approach for the regularization of neural networks by the local Rademacher complexity called LocalDrop. A new regularization function for both fully-connected networks (FCNs) and convolutional neural networks (CNNs), including drop rates and weight matrices, has been dev…

    Submitted 28 February, 2021; originally announced March 2021.

  5. arXiv:2011.05885  [pdf, other]

    cs.LG stat.ML

    Leveraged Matrix Completion with Noise

    Authors: Xinjian Huang, Weiwei Liu, Bo Du, Dacheng Tao

    Abstract: Completing low-rank matrices from subsampled measurements has received much attention in the past decade. Existing works indicate that $\mathcal{O}(nr\log^2(n))$ datums are required to theoretically secure the completion of an $n \times n$ noisy matrix of rank $r$ with high probability, under some quite restrictive assumptions: (1) the underlying matrix must be incoherent; (2) observations follow…

    Submitted 14 August, 2023; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: This manuscript has been accepted for publication as a regular paper in the IEEE Transactions on Cybernetics

  6. arXiv:1909.02902  [pdf, other]

    cs.LG cs.CV stat.ML

    Dynamic Spatial-Temporal Representation Learning for Traffic Flow Prediction

    Authors: Lingbo Liu, Jiajie Zhen, Guanbin Li, Geng Zhan, Zhaocheng He, Bowen Du, Liang Lin

    Abstract: As a crucial component in intelligent transportation systems, traffic flow prediction has recently attracted widespread research interest in the field of artificial intelligence (AI) with the increasing availability of massive traffic mobility data. Its key challenge lies in how to integrate diverse factors (such as temporal rules and spatial dependencies) to infer the evolution trend of traffic f…

    Submitted 12 June, 2020; v1 submitted 1 September, 2019; originally announced September 2019.

    Comments: Accepted by IEEE Transactions on Intelligent Transportation Systems. arXiv admin note: text overlap with arXiv:1809.00101

  7. arXiv:1908.09002  [pdf, other]

    cs.CV cs.LG cs.NI stat.ML

    Autonomous Learning for Face Recognition in the Wild via Ambient Wireless Cues

    Authors: Chris Xiaoxuan Lu, Xuan Kan, Bowen Du, Changhao Chen, Hongkai Wen, Andrew Markham, Niki Trigoni, John Stankovic

    Abstract: Facial recognition is a key enabling component for emerging Internet of Things (IoT) services such as smart homes or responsive offices. Through the use of deep neural networks, facial recognition has achieved excellent performance. However, this is only possible when trained with hundreds of images of each user in different viewing and lighting conditions. Clearly, this level of effort in enrolme…

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: 11 pages, accepted in the Web Conference (WWW'2019)

  8. arXiv:1905.06133  [pdf, other]

    eess.IV cs.LG stat.ML

    Multi-scale Dynamic Graph Convolutional Network for Hyperspectral Image Classification

    Authors: Sheng Wan, Chen Gong, Ping Zhong, Bo Du, Lefei Zhang, Jian Yang

    Abstract: Convolutional Neural Network (CNN) has demonstrated impressive ability to represent hyperspectral images and to achieve promising results in hyperspectral image classification. However, traditional CNN models can only operate convolution on regular square image regions with fixed size and weights, so they cannot universally adapt to the distinct local regions with various object distributions and…

    Submitted 14 May, 2019; originally announced May 2019.

  9. arXiv:1904.06685  [pdf, other]

    cs.LG stat.ML

    Exploring Representativeness and Informativeness for Active Learning

    Authors: Bo Du, Zengmao Wang, Lefei Zhang, Liangpei Zhang, Wei Liu, Jialie Shen, Dacheng Tao

    Abstract: How can we find a general way to choose the most suitable samples for training a classifier? Even with very limited prior information? Active learning, which can be regarded as an iterative optimization procedure, plays a key role to construct a refined training set to improve the classification performance in a variety of applications, such as text analysis, image recognition, social network mode…

    Submitted 14 April, 2019; originally announced April 2019.

  10. arXiv:1809.00101  [pdf, other]

    cs.LG cs.CV stat.ML

    Attentive Crowd Flow Machines

    Authors: Lingbo Liu, Ruimao Zhang, Jiefeng Peng, Guanbin Li, Bowen Du, Liang Lin

    Abstract: Traffic flow prediction is crucial for urban traffic management and public safety. Its key challenges lie in how to adaptively integrate the various factors that affect the flow changes. In this paper, we propose a unified neural network module to address this problem, called Attentive Crowd Flow Machine (ACFM), which is able to infer the evolution of the crowd flow by learning dynamic representat…

    Submitted 31 August, 2018; originally announced September 2018.

    Comments: ACM MM, full paper

  11. arXiv:1808.06206  [pdf, other]

    cs.LG cs.AI stat.ML

    TLR: Transfer Latent Representation for Unsupervised Domain Adaptation

    Authors: Pan Xiao, Bo Du, Jia Wu, Lefei Zhang, Ruimin Hu, Xuelong Li

    Abstract: Domain adaptation refers to the process of learning prediction models in a target domain by making use of data from a source domain. Many classic methods solve the domain adaptation problem by establishing a common latent space, which may cause the loss of many important properties across both domains. In this manuscript, we develop a novel method, transfer latent representation (TLR), to learn a…

    Submitted 19 August, 2018; originally announced August 2018.