Showing 1–15 of 15 results for author: Diao, H

  1. arXiv:2406.11832  [pdf, other]

    cs.CV cs.MM

    Unveiling Encoder-Free Vision-Language Models

    Authors: Haiwen Diao, Yufeng Cui, Xiaotong Li, Yueze Wang, Huchuan Lu, Xinlong Wang

    Abstract: Existing vision-language models (VLMs) mostly rely on vision encoders to extract visual features followed by large language models (LLMs) for visual-language tasks. However, the vision encoders set a strong inductive bias in abstracting visual representation, e.g., resolution, aspect ratio, and semantic priors, which could impede the flexibility and efficiency of the VLMs. Training pure VLMs that…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 16 pages, 7 figures

  2. arXiv:2404.18114  [pdf, other]

    cs.CV cs.MM

    Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching

    Authors: Haiwen Diao, Ying Zhang, Shang Gao, Xiang Ruan, Huchuan Lu

    Abstract: Image-text matching remains a challenging task due to heterogeneous semantic diversity across modalities and insufficient distance separability within triplets. Different from previous approaches focusing on enhancing multi-modal representations or exploiting cross-modal correspondence for more accurate retrieval, in this paper we aim to leverage the knowledge transfer between peer branches in a b…

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 12 pages, 9 figures, Accepted by TIP2024

  3. arXiv:2403.17651  [pdf, other]

    cs.CV

    Exploring Dynamic Transformer for Efficient Object Tracking

    Authors: Jiawen Zhu, Xin Chen, Haiwen Diao, Shuai Li, Jun-Yan He, Chenyang Li, Bin Luo, Dong Wang, Huchuan Lu

    Abstract: The speed-precision trade-off is a critical problem for visual object tracking which usually requires low latency and deployment on constrained resources. Existing solutions for efficient tracking mainly focus on adopting light-weight backbones or modules, which nevertheless come at the cost of a sacrifice in precision. In this paper, inspired by dynamic network routing, we propose DyTrack, a dyna…

    Submitted 26 March, 2024; originally announced March 2024.

  4. arXiv:2402.18167  [pdf, other]

    cs.LG

    Decentralised Traffic Incident Detection via Network Lasso

    Authors: Qiyuan Zhu, A. K. Qin, Prabath Abeysekara, Hussein Dia, Hanna Grzybowska

    Abstract: Traffic incident detection plays a key role in intelligent transportation systems, which has gained great attention in transport engineering. In the past, traditional machine learning (ML) based detection methods achieved good performance under a centralised computing paradigm, where all data are transmitted to a central server for building ML models therein. Nowadays, deep neural networks based f…

    Submitted 28 February, 2024; originally announced February 2024.

  5. arXiv:2308.14316  [pdf, other]

    cs.CV cs.MM

    UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory

    Authors: Haiwen Diao, Bo Wan, Ying Zhang, Xu Jia, Huchuan Lu, Long Chen

    Abstract: Parameter-efficient transfer learning (PETL), i.e., fine-tuning a small portion of parameters, is an effective strategy for adapting pre-trained models to downstream domains. To further reduce the memory demand, recent PETL works focus on the more valuable memory-efficient characteristic. In this paper, we argue that the scalability, adaptability, and generalizability of state-of-the-art methods a…

    Submitted 11 March, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 15 pages, 11 figures, Accepted by CVPR2024

  6. arXiv:2307.03920  [pdf]

    cs.NE cs.LG

    Training Physics-Informed Neural Networks via Multi-Task Optimization for Traffic Density Prediction

    Authors: Bo Wang, A. K. Qin, Sajjad Shafiei, Hussein Dia, Adriana-Simona Mihaita, Hanna Grzybowska

    Abstract: Physics-informed neural networks (PINNs) are a newly emerging research frontier in machine learning, which incorporate certain physical laws that govern a given data set, e.g., those described by partial differential equations (PDEs), into the training of the neural network (NN) based on such a data set. In PINNs, the NN acts as the solution approximator for the PDE while the PDE acts as the prior…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Accepted by the 2023 IEEE International Joint Conference on Neural Networks (IJCNN 2023)

  7. Plug-and-Play Regulators for Image-Text Matching

    Authors: Haiwen Diao, Ying Zhang, Wei Liu, Xiang Ruan, Huchuan Lu

    Abstract: Exploiting fine-grained correspondence and visual-semantic alignments has shown great potential in image-text matching. Generally, recent approaches first employ a cross-modal attention unit to capture latent region-word interactions, and then integrate all the alignments to obtain the final similarity. However, most of them adopt one-time forward association or aggregation strategies with complex…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 13 pages, 9 figures, Accepted by TIP2023

  8. arXiv:2212.10200  [pdf, other]

    cs.CV

    Redistribution of Weights and Activations for AdderNet Quantization

    Authors: Ying Nie, Kai Han, Haikang Diao, Chuanjian Liu, Enhua Wu, Yunhe Wang

    Abstract: Adder Neural Network (AdderNet) provides a new way for developing energy-efficient neural networks by replacing the expensive multiplications in convolution with cheaper additions (i.e., l1-norm). To achieve higher hardware efficiency, it is necessary to further study the low-bit quantization of AdderNet. Due to the limitation that the commutative law in multiplication does not hold in l1-norm, the…

    Submitted 20 December, 2022; originally announced December 2022.

  9. arXiv:2207.03088  [pdf, other]

    cs.LG cs.AI

    Attention Round for Post-Training Quantization

    Authors: Huabin Diao, Gongyan Li, Shaoyun Xu, Yuexing Hao

    Abstract: At present, the quantification methods of neural network models are mainly divided into post-training quantization (PTQ) and quantization aware training (QAT). Post-training quantization only need a small part of the data to complete the quantification process, but the performance of its quantitative model is not as good as the quantization aware training. This paper presents a novel quantificatio…

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: 18 pages, 5 figures, 5 tables

    MSC Class: 68T07; 68T45

    ACM Class: I.4.2

  10. arXiv:2106.01589  [pdf]

    cs.CY

    The Emotion coding and Propagation based on improved Genetic algorithm

    Authors: Hongyuan Diao, Fuzhong Nian, Xuelong Yu, Xirui Liu, Xinhao Liu

    Abstract: Computational communication research on information has been prevalent in recent years, as people are progressively inquisitive in social behavior and public opinion. Nevertheless, it is of great significance to analyze the direction of predominant sentiment from the sentiment communication perspective. In this paper, the information emotion propagation model is established by introducing revamp g…

    Submitted 3 June, 2021; originally announced June 2021.

  11. arXiv:2101.01368  [pdf, other]

    cs.CV cs.MM

    Similarity Reasoning and Filtration for Image-Text Matching

    Authors: Haiwen Diao, Ying Zhang, Lin Ma, Huchuan Lu

    Abstract: Image-text matching plays a critical role in bridging the vision and language, and great progress has been made by exploiting the global alignment between image and sentence, or local alignments between regions and words. However, how to make the most of these alignments to infer more accurate matching scores is still underexplored. In this paper, we propose a novel Similarity Graph Reasoning and…

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 14 pages, 8 figures, Accepted by AAAI2021

  12. arXiv:2010.00174  [pdf]

    cs.SI

    Information Propagation Model in Hybrid Networks

    Authors: Fuzhong Nian, Hongyuan Diao

    Abstract: It is in practice impossible to describe the topology of a real network or its message propagation process using a single dynamic model. To address this issue, we constructed a new hybrid network model based on scale-free (SF), small-world (SW) features that functions as closely as possible to a real network. And the hybrid propagation model is constructed with susceptible-infected-susceptible (SI…

    Submitted 29 May, 2021; v1 submitted 30 September, 2020; originally announced October 2020.

  13. arXiv:1909.13384  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Optimal Sketching for Kronecker Product Regression and Low Rank Approximation

    Authors: Huaian Diao, Rajesh Jayaram, Zhao Song, Wen Sun, David P. Woodruff

    Abstract: We study the Kronecker product regression problem, in which the design matrix is a Kronecker product of two or more matrices. Given $A_i \in \mathbb{R}^{n_i \times d_i}$ for $i=1,2,\dots,q$ where $n_i \gg d_i$ for each $i$, and $b \in \mathbb{R}^{n_1 n_2 \cdots n_q}$, let $\mathcal{A} = A_1 \otimes A_2 \otimes \cdots \otimes A_q$. Then for $p \in [1,2]$, the goal is to find…

    Submitted 29 September, 2019; originally announced September 2019.

    Comments: A preliminary version of this paper appeared in NeurIPS 2019

  14. arXiv:1909.12441  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Total Least Squares Regression in Input Sparsity Time

    Authors: Huaian Diao, Zhao Song, David P. Woodruff, Xin Yang

    Abstract: In the total least squares problem, one is given an $m \times n$ matrix $A$, and an $m \times d$ matrix $B$, and one seeks to "correct" both $A$ and $B$, obtaining matrices $\hat{A}$ and $\hat{B}$, so that there exists an $X$ satisfying the equation $\hat{A}X = \hat{B}$. Typically the problem is overconstrained, meaning that $m \gg \max(n,d)$. The cost of the solution $\hat{A}, \hat{B}$ is given b…

    Submitted 26 September, 2019; originally announced September 2019.

  15. arXiv:1712.09473  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Sketching for Kronecker Product Regression and P-splines

    Authors: Huaian Diao, Zhao Song, Wen Sun, David P. Woodruff

    Abstract: TensorSketch is an oblivious linear sketch introduced in Pagh'13 and later used in Pham, Pagh'13 in the context of SVMs for polynomial kernels. It was shown in Avron, Nguyen, Woodruff'14 that TensorSketch provides a subspace embedding, and therefore can be used for canonical correlation analysis, low rank approximation, and principal component regression for the polynomial kernel. We take TensorSk…

    Submitted 26 December, 2017; originally announced December 2017.

    Comments: AISTATS 2018