Skip to main content

Showing 1–50 of 204 results for author: Qiu, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00529  [pdf, other

    cs.LG cs.SD eess.AS math.ST stat.ML

    Detecting and Identifying Selection Structure in Sequential Data

    Authors: Yujia Zheng, Zeyu Tang, Yiwen Qiu, Bernhard Schölkopf, Kun Zhang

    Abstract: We argue that the selective inclusion of data points based on latent objectives is common in practical situations, such as music sequences. Since this selection process often distorts statistical analysis, previous work primarily views it as a bias to be corrected and proposes various methods to mitigate its effect. However, while controlling this bias is crucial, selection also offers an opportun… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: ICML 2024

  2. arXiv:2406.16087  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy

    Authors: Chen Wang, Kaiyi Ji, Junyi Geng, Zhongqiang Ren, Taimeng Fu, Fan Yang, Yifan Guo, Haonan He, Xiangyu Chen, Zitong Zhan, Qiwei Du, Shaoshu Su, Bowen Li, Yuheng Qiu, Yi Du, Qihang Li, Yifan Yang, Xiao Lin, Zhipeng Zhao

    Abstract: Data-driven methods such as reinforcement and imitation learning have achieved remarkable success in robot autonomy. However, their data-centric nature still hinders them from generalizing well to ever-changing environments. Moreover, collecting large datasets for robotic tasks is often impractical and expensive. To overcome these challenges, we introduce a new self-supervised neural-symbolic (NeS… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  3. arXiv:2406.15222  [pdf

    eess.IV cs.AI cs.CV

    Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, **gyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, Zhenpeng Yuan , et al. (15 additional authors not shown)

    Abstract: Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: under peer review

  4. arXiv:2406.09693  [pdf, other

    cs.CV eess.IV

    Compressed Video Quality Enhancement with Temporal Group Alignment and Fusion

    Authors: Qiang Zhu, Yajun Qiu, Yu Liu, Shuyuan Zhu, Bing Zeng

    Abstract: In this paper, we propose a temporal group alignment and fusion network to enhance the quality of compressed videos by using the long-short term correlations between frames. The proposed model consists of the intra-group feature alignment (IntraGFA) module, the inter-group feature fusion (InterGFF) module, and the feature enhancement (FE) module. We form the group of pictures (GoP) by selecting fr… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.09423  [pdf, other

    cs.DC cs.GR

    MSz: An Efficient Parallel Algorithm for Correcting Morse-Smale Segmentations in Error-Bounded Lossy Compressors

    Authors: Yuxiao Li, Xin Liang, Bei Wang, Yongfeng Qiu, Lin Yan, Hanqi Guo

    Abstract: This research explores a novel paradigm for preserving topological segmentations in existing error-bounded lossy compressors. Today's lossy compressors rarely consider preserving topologies such as Morse-Smale complexes, and the discrepancies in topology between original and decompressed datasets could potentially result in erroneous interpretations or even incorrect scientific conclusions. In thi… ▽ More

    Submitted 5 April, 2024; originally announced June 2024.

  6. arXiv:2406.03793  [pdf, other

    cs.LG cs.CV

    Low-Rank Similarity Mining for Multimodal Dataset Distillation

    Authors: Yue Xu, Zhilin Lin, Yusong Qiu, Cewu Lu, Yong-Lu Li

    Abstract: Though dataset distillation has witnessed rapid development in recent years, the distillation of multimodal data, e.g., image-text pairs, poses unique and under-explored challenges. Unlike unimodal data, image-text contrastive learning (ITC) data lack inherent categorization and should instead place greater emphasis on modality correspondence. In this work, we propose Low-Rank Similarity Mining (L… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024

  7. arXiv:2405.18085  [pdf, other

    cs.SI cs.MA eess.SY

    Network Diffusion -- Framework to Simulate Spreading Processes in Complex Networks

    Authors: Michał Czuba, Mateusz Nurek, Damian Serwata, Yu-Xuan Qiu, Mingshan Jia, Katarzyna Musial, Radosław Michalski, Piotr Bródka

    Abstract: With the advancement of computational network science, its research scope has significantly expanded beyond static graphs to encompass more complex structures. The introduction of streaming, temporal, multilayer, and hypernetwork approaches has brought new possibilities and imposed additional requirements. For instance, by utilising these advancements, one can model structures such as social netwo… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: To be published in: Big Data Mining and Analytics (https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8254253)

  8. arXiv:2405.17478  [pdf, other

    cs.LG stat.ML

    ROSE: Register Assisted General Time Series Forecasting with Decomposed Frequency Learning

    Authors: Yihang Wang, Yuying Qiu, Peng Chen, Kai Zhao, Yang Shu, Zhongwen Rao, Lujia Pan, Bin Yang, Chenjuan Guo

    Abstract: With the increasing collection of time series data from various domains, there arises a strong demand for general time series forecasting models pre-trained on a large number of time-series datasets to support a variety of downstream prediction tasks. Enabling general time series forecasting faces two challenges: how to obtain unified representations from multi-domian time series data, and how to… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  9. arXiv:2405.17398  [pdf, other

    cs.CV cs.AI

    Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability

    Authors: Shenyuan Gao, Jiazhi Yang, Li Chen, Kashyap Chitta, Yihang Qiu, Andreas Geiger, Jun Zhang, Hongyang Li

    Abstract: World models can foresee the outcomes of different actions, which is of paramount importance for autonomous driving. Nevertheless, existing driving world models still have limitations in generalization to unseen environments, prediction fidelity of critical details, and action controllability for flexible application. In this paper, we present Vista, a generalizable driving world model with high f… ▽ More

    Submitted 6 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Code and model: https://github.com/OpenDriveLab/Vista, video demos: https://vista-demo.github.io

  10. arXiv:2405.12484  [pdf, other

    cs.GR

    Meta-Homogenization for Knitwear Simulation

    Authors: Chun Yuan, Kui Wu, Haoyang Shi, Lei Lan, Yuxing Qiu, Cem Yuksel, Huamin Wang, Chenfanfu Jiang, Yin Yang

    Abstract: This paper presents meta-homogenization, a spatially varying homogenization scheme for knitwear simulation. We are motivated by the observation that macro-scale fabric dynamics is strongly correlated with its underlying knitting patterns. Therefore, homogenization towards a single material is less effective when the knitting is complex and non-repetitive. Our method tackles this challenge by homog… ▽ More

    Submitted 23 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  11. arXiv:2405.12356  [pdf, other

    physics.bio-ph cs.LG physics.chem-ph physics.data-an

    Coarse-graining conformational dynamics with multi-dimensional generalized Langevin equation: how, when, and why

    Authors: Pinchen Xie, Yunrui Qiu, Weinan E

    Abstract: A data-driven ab initio generalized Langevin equation (AIGLE) approach is developed to learn and simulate high-dimensional, heterogeneous, coarse-grained conformational dynamics. Constrained by the fluctuation-dissipation theorem, the approach can build coarse-grained models in dynamical consistency with all-atom molecular dynamics. We also propose practical criteria for AIGLE to enforce long-term… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  12. arXiv:2405.09719  [pdf, other

    cs.CL cs.AI cs.LG

    Spectral Editing of Activations for Large Language Model Alignment

    Authors: Yifu Qiu, Zheng Zhao, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

    Abstract: Large language models (LLMs) often exhibit undesirable behaviours, such as generating untruthful or biased content. Editing their internal representations has been shown to be effective in mitigating such behaviours on top of the existing alignment methods. We propose a novel inference-time editing method, namely spectral editing of activations (SEA), to project the input representations into dire… ▽ More

    Submitted 25 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  13. arXiv:2405.09125  [pdf, other

    cs.CV cs.AI

    HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition

    Authors: Honghui Chen, Yuhang Qiu, Jiabao Wang, **** Chen, Nam Ling

    Abstract: Internal Language Model (LM)-based methods use permutation language modeling (PLM) to solve the error correction caused by conditional independence in external LM-based methods. However, random permutations of human interference cause fit oscillations in the model training, and Iterative Refinement (IR) operation to improve multimodal information decoupling also introduces additional overhead. To… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 12 pages, 10 figures

    MSC Class: 68T01 ACM Class: I.2.10

  14. arXiv:2405.06214  [pdf, other

    cs.CV

    Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering

    Authors: Xiaohan Zhang, Yukui Qiu, Zhenyu Sun, Qi Liu

    Abstract: Recent progress in large-scale scene rendering has yielded Neural Radiance Fields (NeRF)-based models with an impressive ability to synthesize scenes across small objects and indoor scenes. Nevertheless, extending this idea to large-scale aerial rendering poses two critical problems. Firstly, a single NeRF cannot render the entire scene with high-precision for complex large-scale aerial datasets s… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  15. Optimizing E-commerce Search: Toward a Generalizable and Rank-Consistent Pre-Ranking Model

    Authors: Enqiang Xu, Yiming Qiu, Junyang Bai, ** Zhang, Dadong Miao, Songlin Wang, Guoyu Tang, Lin Liu, Mingming Li

    Abstract: In large e-commerce platforms, search systems are typically composed of a series of modules, including recall, pre-ranking, and ranking phases. The pre-ranking phase, serving as a lightweight module, is crucial for filtering out the bulk of products in advance for the downstream ranking module. Industrial efforts on optimizing the pre-ranking model have predominantly focused on enhancing ranking c… ▽ More

    Submitted 13 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    ACM Class: H.3.3

  16. arXiv:2405.02942  [pdf, other

    physics.optics cs.CV cs.RO eess.IV

    Design, analysis, and manufacturing of a glass-plastic hybrid minimalist aspheric panoramic annular lens

    Authors: Shaohua Gao, Qi Jiang, Yiqi Liao, Yi Qiu, Wanglei Ying, Kailun Yang, Kaiwei Wang, Benhao Zhang, Jian Bai

    Abstract: We propose a high-performance glass-plastic hybrid minimalist aspheric panoramic annular lens (ASPAL) to solve several major limitations of the traditional panoramic annular lens (PAL), such as large size, high weight, and complex system. The field of view (FoV) of the ASPAL is 360°x(35°~110°) and the imaging quality is close to the diffraction limit. This large FoV ASPAL is composed of only 4 len… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: Accepted to Optics & Laser Technology

  17. arXiv:2405.01312  [pdf, other

    cs.DB cs.CR

    Privacy-Enhanced Database Synthesis for Benchmark Publishing

    Authors: Yongrui Zhong, Yunqing Ge, Jianbin Qin, Shuyuan Zheng, Bo Tang, Yu-Xuan Qiu, Rui Mao, Ye Yuan, Makoto Onizuka, Chuan Xiao

    Abstract: Benchmarking is crucial for evaluating a DBMS, yet existing benchmarks often fail to reflect the varied nature of user workloads. As a result, there is increasing momentum toward creating databases that incorporate real-world user data to more accurately mirror business environments. However, privacy concerns deter users from directly sharing their data, underscoring the importance of creating syn… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  18. arXiv:2404.17644  [pdf, other

    stat.ML cs.AI cs.LG

    A Conditional Independence Test in the Presence of Discretization

    Authors: Boyang Sun, Yu Yao, Huangyuan Hao, Yumou Qiu, Kun Zhang

    Abstract: Testing conditional independence has many applications, such as in Bayesian network learning and causal discovery. Different test methods have been proposed. However, existing methods generally can not work when only discretized observations are available. Specifically, consider $X_1$, $\tilde{X}_2$ and $X_3$ are observed variables, where $\tilde{X}_2$ is a discretization of latent variables… ▽ More

    Submitted 3 May, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

  19. arXiv:2404.15615  [pdf, other

    cs.HC cs.LG

    MDDD: Manifold-based Domain Adaptation with Dynamic Distribution for Non-Deep Transfer Learning in Cross-subject and Cross-session EEG-based Emotion Recognition

    Authors: Ting Luo, **g Zhang, Yingwei Qiu, Li Zhang, Yaohua Hu, Zhuliang Yu, Zhen Liang

    Abstract: Emotion decoding using Electroencephalography (EEG)-based affective brain-computer interfaces represents a significant area within the field of affective computing. In the present study, we propose a novel non-deep transfer learning method, termed as Manifold-based Domain adaptation with Dynamic Distribution (MDDD). The proposed MDDD includes four main modules: manifold feature transformation, dyn… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  20. arXiv:2404.08237  [pdf, other

    cs.CV cs.AI

    IFViT: Interpretable Fixed-Length Representation for Fingerprint Matching via Vision Transformer

    Authors: Yuhang Qiu, Honghui Chen, Xingbo Dong, Zheng Lin, Iman Yi Liao, Massimo Tistarelli, Zhe **

    Abstract: Determining dense feature points on fingerprints used in constructing deep fixed-length representations for accurate matching, particularly at the pixel level, is of significant interest. To explore the interpretability of fingerprint matching, we propose a multi-stage interpretable fingerprint matching network, namely Interpretable Fixed-length Representation for Fingerprint Matching via Vision T… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: ready to submit to IEEE Transactions on Information Forensics and Security (TIFS)

  21. arXiv:2404.05680  [pdf, other

    cs.CV

    SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation

    Authors: Heyuan Li, Ce Chen, Tianhao Shi, Yuda Qiu, Sizhe An, Guanying Chen, Xiaoguang Han

    Abstract: While recent advances in 3D-aware Generative Adversarial Networks (GANs) have aided the development of near-frontal view human face synthesis, the challenge of comprehensively synthesizing a full 3D head viewable from all angles still persists. Although PanoHead proves the possibilities of using a large-scale dataset with images of both frontal and back views for full-head synthesis, it often caus… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: project page: https://lhyfst.github.io/spherehead

  22. arXiv:2404.01780  [pdf, other

    astro-ph.IM astro-ph.GA cs.CV

    CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

    Authors: Xu Li, Ruiqi Sun, Jiameng Lv, Peng Jia, Nan Li, Chengliang Wei, Zou Hu, Xinzhong Er, Yun Chen, Zhang Ban, Yuedong Fang, Qi Guo, Dezi Liu, Guoliang Li, Lin Lin, Ming Li, Ran Li, Xiaobo Li, Yu Luo, Xianmin Meng, Jundan Nie, Zhaoxiang Qi, Yisheng Qiu, Li Shao, Hao Tian , et al. (7 additional authors not shown)

    Abstract: Strong gravitational lensing is a powerful tool for investigating dark matter and dark energy properties. With the advent of large-scale sky surveys, we can discover strong lensing systems on an unprecedented scale, which requires efficient tools to extract them from billions of astronomical objects. The existing mainstream lens-finding tools are based on machine learning algorithms and applied to… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: The paper is accepted by the AJ. The complete code could be downloaded with DOI of: 10.12149/101393. Comments are welcome

  23. arXiv:2403.10780  [pdf, other

    cs.CV cs.AI

    Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation

    Authors: Mariia Khan, Yue Qiu, Yuren Cong, Jumana Abu-Khalaf, David Suter, Bodo Rosenhahn

    Abstract: Multi-class multi-instance segmentation is the task of identifying masks for multiple object classes and multiple instances of the same class within an image. The foundational Segment Anything Model (SAM) is designed for promptable multi-class multi-instance segmentation but tends to output part or sub-part masks in the "everything" mode for various real-world applications. Whole object segmentati… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  24. arXiv:2403.09630  [pdf, other

    cs.CV

    Generalized Predictive Model for Autonomous Driving

    Authors: Jiazhi Yang, Shenyuan Gao, Yihang Qiu, Li Chen, Tianyu Li, Bo Dai, Kashyap Chitta, Penghao Wu, Jia Zeng, ** Luo, Jun Zhang, Andreas Geiger, Yu Qiao, Hongyang Li

    Abstract: In this paper, we introduce the first large-scale video prediction model in the autonomous driving discipline. To eliminate the restriction of high-cost data collection and empower the generalization ability of our model, we acquire massive data from the web and pair it with diverse and high-quality text descriptions. The resultant dataset accumulates over 2000 hours of driving videos, spanning ar… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  25. arXiv:2403.05146  [pdf, other

    cs.CV

    Motion-Guided Dual-Camera Tracker for Low-Cost Skill Evaluation of Gastric Endoscopy

    Authors: Yuelin Zhang, Wanquan Yan, Kim Yan, Chun ** Lam, Yufu Qiu, Pengyu Zheng, Raymond Shing-Yan Tang, Shing Shin Cheng

    Abstract: Gastric simulators with objective educational feedback have been proven useful for endoscopy training. Existing electronic simulators with feedback are however not commonly adopted due to their high cost. In this work, a motion-guided dual-camera tracker is proposed to provide reliable endoscope tip position feedback at a low cost inside a mechanical simulator for endoscopy skill evaluation, tackl… ▽ More

    Submitted 20 April, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  26. arXiv:2403.04763  [pdf, other

    cs.LG

    BloomGML: Graph Machine Learning through the Lens of Bilevel Optimization

    Authors: Amber Yijia Zheng, Tong He, Yixuan Qiu, Minjie Wang, David Wipf

    Abstract: Bilevel optimization refers to scenarios whereby the optimal solution of a lower-level energy function serves as input features to an upper-level objective of interest. These optimal features typically depend on tunable parameters of the lower-level energy in such a way that the entire bilevel pipeline can be trained end-to-end. Although not generally presented as such, this paper demonstrates how… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Publication at AISTATS 2024

  27. arXiv:2403.02176  [pdf, other

    cs.CL

    EEE-QA: Exploring Effective and Efficient Question-Answer Representations

    Authors: Zhanghao Hu, Yijun Yang, Junjie Xu, Yifu Qiu, Pinzhen Chen

    Abstract: Current approaches to question answering rely on pre-trained language models (PLMs) like RoBERTa. This work challenges the existing question-answer encoding convention and explores finer representations. We begin with testing various pooling methods compared to using the begin-of-sentence token as a question representation for better quality. Next, we explore opportunities to simultaneously embed… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  28. arXiv:2403.01692  [pdf, other

    astro-ph.IM astro-ph.GA cs.CV eess.IV

    PI-AstroDeconv: A Physics-Informed Unsupervised Learning Method for Astronomical Image Deconvolution

    Authors: Shulei Ni, Yisheng Qiu, Yunchun Chen, Zihao Song, Hao Chen, Xuejian Jiang, Huaxi Chen

    Abstract: In the imaging process of an astronomical telescope, the deconvolution of its beam or Point Spread Function (PSF) is a crucial task. However, deconvolution presents a classical and challenging inverse computation problem. In scenarios where the beam or PSF is complex or inaccurately measured, such as in interferometric arrays and certain radio telescopes, the resultant blurry images are often chal… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  29. arXiv:2402.19242  [pdf, other

    cs.LG cs.CE math.NA

    Derivative-enhanced Deep Operator Network

    Authors: Yuan Qiu, Nolan Bridges, Peng Chen

    Abstract: Deep operator networks (DeepONets), a class of neural operators that learn map**s between function spaces, have recently been developed as surrogate models for parametric partial differential equations (PDEs). In this work we propose a derivative-enhanced deep operator network (DE-DeepONet), which leverages the derivative information to enhance the prediction accuracy, and provide a more accurat… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  30. arXiv:2402.19072  [pdf, other

    cs.LG cs.AI

    TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables

    Authors: Yuxuan Wang, Haixu Wu, Jiaxiang Dong, Yong Liu, Yunzhong Qiu, Haoran Zhang, Jianmin Wang, Mingsheng Long

    Abstract: Recent studies have demonstrated remarkable performance in time series forecasting. However, due to the partially-observed nature of real-world applications, solely focusing on the target of interest, so-called endogenous variables, is usually insufficient to guarantee accurate forecasting. Notably, a system is often recorded into multiple variables, where the exogenous series can provide valuable… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  31. arXiv:2402.14494  [pdf, other

    cs.CL

    Noise-BERT: A Unified Perturbation-Robust Framework with Noise Alignment Pre-training for Noisy Slot Filling Task

    Authors: **xu Zhao, Guanting Dong, Yueyan Qiu, Tingfeng Hui, Xiaoshuai Song, Daichi Guo, Weiran Xu

    Abstract: In a realistic dialogue system, the input information from users is often subject to various types of input perturbations, which affects the slot-filling task. Although rule-based data augmentation methods have achieved satisfactory results, they fail to exhibit the desired generalization when faced with unknown noise disturbances. In this study, we address the challenges posed by input perturbati… ▽ More

    Submitted 6 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted by ICASSP 2024

  32. arXiv:2402.10876  [pdf, other

    cs.DC

    Accelerating Sparse DNNs Based on Tiled GEMM

    Authors: Cong Guo, Fengchen Xue, **gwen Leng, Yuxian Qiu, Yue Guan, Weihao Cui, Quan Chen, Minyi Guo

    Abstract: Network pruning can reduce the computation cost of deep neural network (DNN) models. However, sparse models often produce randomly-distributed weights to maintain accuracy, leading to irregular computations. Consequently, unstructured sparse models cannot achieve meaningful speedup on commodity hardware built for dense matrix computations. Accelerators are usually modified or designed with structu… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE Transactions on Computers. arXiv admin note: substantial text overlap with arXiv:2008.13006

  33. Can Channels be Fully Inferred Between Two Antenna Panels?

    Authors: Y. Qiu, D. W, Y. Zeng

    Abstract: This letter considers a two-panel massive multiple-input multiple-output (MIMO) communication system, where the base station (BS) is equipped with two antenna panels that may use different frequency bands for communication. By exploiting the geometric relationships between antenna panels, efficient channel inference methods across antenna panels are proposed to reduce the overhead of real-time cha… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 5 pages, 6figures, IEEE Communications Letters

  34. arXiv:2402.04013  [pdf, other

    cs.CV

    Privacy Leakage on DNNs: A Survey of Model Inversion Attacks and Defenses

    Authors: Hao Fang, Yixiang Qiu, Hongyao Yu, Wenbo Yu, Jiawei Kong, Baoli Chong, Bin Chen, Xuan Wang, Shu-Tao Xia

    Abstract: Model Inversion (MI) attacks aim to disclose private information about the training data by abusing access to the pre-trained models. These attacks enable adversaries to reconstruct high-fidelity data that closely aligns with the private training data, which has raised significant privacy concerns. Despite the rapid advances in the field, we lack a comprehensive overview of existing MI attacks and… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  35. arXiv:2402.02475  [pdf, other

    cs.LG

    TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling

    Authors: Jiaxiang Dong, Haixu Wu, Yuxuan Wang, Yunzhong Qiu, Li Zhang, Jianmin Wang, Mingsheng Long

    Abstract: Time series pre-training has recently garnered wide attention for its potential to reduce labeling expenses and benefit various downstream tasks. Prior methods are mainly based on pre-training techniques well-acknowledged in vision or language, such as masked modeling and contrastive learning. However, randomly masking time series or calculating series-wise similarity will distort or neglect inher… ▽ More

    Submitted 7 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  36. arXiv:2402.00825  [pdf, other

    math.NA cs.LG

    Resolution invariant deep operator network for PDEs with complex geometries

    Authors: Jianguo Huang, Yue Qiu

    Abstract: Neural operators (NO) are discretization invariant deep learning methods with functional output and can approximate any continuous operator. NO have demonstrated the superiority of solving partial differential equations (PDEs) over other deep learning methods. However, the spatial domain of its input function needs to be identical to its output, which limits its applicability. For instance, the wi… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  37. arXiv:2401.17509  [pdf, other

    cs.CV

    Anything in Any Scene: Photorealistic Video Object Insertion

    Authors: Chen Bai, Zeman Shao, Guoxiang Zhang, Di Liang, Jie Yang, Zhuorui Zhang, Yujian Guo, Chengzhang Zhong, Yiqiao Qiu, Zhendong Wang, Yichen Guan, Xiaoyin Zheng, Tao Wang, Cheng Lu

    Abstract: Realistic video simulation has shown significant potential across diverse applications, from virtual reality to film production. This is particularly true for scenarios where capturing videos in real-world settings is either impractical or expensive. Existing approaches in video simulation often fail to accurately model the lighting environment, represent the object geometry, or achieve high level… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  38. arXiv:2401.16947  [pdf, other

    cs.CR

    WGAN-AFL: Seed Generation Augmented Fuzzer with Wasserstein-GAN

    Authors: Liqun Yang, Chunan Li, Yongxin Qiu, Chaoren Wei, Jian Yang, Hongcheng Guo, **xin Ma, Zhoujun Li

    Abstract: The importance of addressing security vulnerabilities is indisputable, with software becoming crucial in sectors such as national defense and finance. Consequently, The security issues caused by software vulnerabilities cannot be ignored. Fuzz testing is an automated software testing technology that can detect vulnerabilities in the software. However, most previous fuzzers encounter challenges tha… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  39. arXiv:2401.16858  [pdf, other

    cs.IT

    Low-Rate, Low-Distortion Compression with Wasserstein Distortion

    Authors: Yang Qiu, Aaron B. Wagner

    Abstract: Wasserstein distortion is a one-parameter family of distortion measures that was recently proposed to unify fidelity and realism constraints. After establishing continuity results for Wasserstein in the extreme cases of pure fidelity and pure realism, we prove the first coding theorems for compression under Wasserstein distortion focusing on the regime in which both the rate and the distortion are… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.03629

    MSC Class: 94A34 ACM Class: E.4

  40. arXiv:2401.11825  [pdf, other

    math.NA cs.LG

    Sparse discovery of differential equations based on multi-fidelity Gaussian process

    Authors: Yuhuang Meng, Yue Qiu

    Abstract: Sparse identification of differential equations aims to compute the analytic expressions from the observed data explicitly. However, there exist two primary challenges. Firstly, it exhibits sensitivity to the noise in the observed data, particularly for the derivatives computations. Secondly, existing literature predominantly concentrates on single-fidelity (SF) data, which imposes limitations on… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  41. arXiv:2401.05424  [pdf, other

    cs.CY cs.IR cs.LG stat.AP

    A Toolbox for Modelling Engagement with Educational Videos

    Authors: Yuxiang Qiu, Karim Djemili, Denis Elezi, Aaneel Shalman, María Pérez-Ortiz, Emine Yilmaz, John Shawe-Taylor, Sahan Bulathwela

    Abstract: With the advancement and utility of Artificial Intelligence (AI), personalising education to a global population could be a cornerstone of new educational systems in the future. This work presents the PEEKC dataset and the TrueLearn Python library, which contains a dataset and a series of online learner state models that are essential to facilitate research on learner engagement modelling.TrueLear… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: In Proceedings of AAAI Conference on Artificial Intelligence 2024. arXiv admin note: text overlap with arXiv:2309.11527

    ACM Class: H.3.3; J.1; I.2.0

  42. arXiv:2312.17648  [pdf, other

    cs.CV cs.AI

    Bridging Modality Gap for Visual Grounding with Effecitve Cross-modal Distillation

    Authors: Jiaxi Wang, Wenhui Hu, Xueyang Liu, Beihu Wu, Yuting Qiu, YingYing Cai

    Abstract: Visual grounding aims to align visual information of specific regions of images with corresponding natural language expressions. Current visual grounding methods leverage pre-trained visual and language backbones separately to obtain visual features and linguistic features. Although these two types of features are then fused via delicately designed networks, the heterogeneity of the features makes… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  43. arXiv:2312.17052  [pdf, other

    cs.CV

    Multi-Attention Fusion Drowsy Driving Detection Model

    Authors: Shulei QU, Zhenguo Gao, Xiaoxiao Wu, Yuanyuan Qiu

    Abstract: Drowsy driving represents a major contributor to traffic accidents, and the implementation of driver drowsy driving detection systems has been proven to significantly reduce the occurrence of such accidents. Despite the development of numerous drowsy driving detection algorithms, many of them impose specific prerequisites such as the availability of complete facial images, optimal lighting conditi… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 8 pages, 6 figures

  44. arXiv:2312.11545  [pdf, other

    cs.MA cs.AI cs.LG

    Robust Communicative Multi-Agent Reinforcement Learning with Active Defense

    Authors: Lebin Yu, Yunbo Qiu, Quanming Yao, Yuan Shen, Xudong Zhang, Jian Wang

    Abstract: Communication in multi-agent reinforcement learning (MARL) has been proven to effectively promote cooperation among agents recently. Since communication in real-world scenarios is vulnerable to noises and adversarial attacks, it is crucial to develop robust communicative MARL technique. However, existing research in this domain has predominantly focused on passive defense strategies, where agents… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  45. arXiv:2312.11146  [pdf, other

    cs.CV

    OsmLocator: locating overlap** scatter marks with a non-training generative perspective

    Authors: Yuming Qiu, Aleksandra Pizurica, Qi Ming, Nicolas Nadisic

    Abstract: Automated mark localization in scatter images, greatly helpful for discovering knowledge and understanding enormous document images and reasoning in visual question answering AI systems, is a highly challenging problem because of the ubiquity of overlap** marks. Locating overlap** marks faces many difficulties such as no texture, less contextual information, hallow shape and tiny size. Here, w… ▽ More

    Submitted 22 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 22pages

  46. arXiv:2312.02790  [pdf, other

    cs.SI

    A Low-cost, High-impact Node Injection Approach for Attacking Social Network Alignment

    Authors: Shuyu Jiang, Yunxiang Qiu, Xian Mo, Rui Tang, Wei Wang

    Abstract: Social network alignment (SNA) holds significant importance for various downstream applications, prompting numerous professionals to develop and share SNA tools. Unfortunately, these tools can be exploited by malicious actors to integrate sensitive user information, posing cybersecurity risks. While many researchers have explored attacking SNA (ASNA) through a network modification attack way, prac… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  47. arXiv:2312.00401  [pdf, other

    cs.CV cs.MM

    VIoTGPT: Learning to Schedule Vision Tools towards Intelligent Video Internet of Things

    Authors: Yaoyao Zhong, Mengshi Qi, Rui Wang, Yuhan Qiu, Yang Zhang, Huadong Ma

    Abstract: Video Internet of Things (VIoT) has shown full potential in collecting an unprecedented volume of video data. Learning to schedule perceiving models and analyzing the collected videos intelligently will be potential sparks for VIoT. In this paper, to address the challenges posed by the fine-grained and interrelated vision tool usage of VIoT, we build VIoTGPT, the framework based on LLMs to correct… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  48. arXiv:2311.12198  [pdf, other

    cs.GR cs.AI cs.CV cs.LG

    PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics

    Authors: Tianyi Xie, Zeshun Zong, Yuxing Qiu, Xuan Li, Yutao Feng, Yin Yang, Chenfanfu Jiang

    Abstract: We introduce PhysGaussian, a new method that seamlessly integrates physically grounded Newtonian dynamics within 3D Gaussians to achieve high-quality novel motion synthesis. Employing a custom Material Point Method (MPM), our approach enriches 3D Gaussian kernels with physically meaningful kinematic deformation and mechanical stress attributes, all evolved in line with continuum mechanics principl… ▽ More

    Submitted 15 April, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted by CVPR 2024

  49. Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks

    Authors: Ling Luo, **zhong Ning, Yingwen Zhao, Zhijun Wang, Zeyuan Ding, Peng Chen, Weiru Fu, Qinyu Han, Guangtao Xu, Yunzhi Qiu, Dinghao Pan, Jiru Li, Hao Li, Wenduo Feng, Senbo Tu, Yuqi Liu, Zhihao Yang, Jian Wang, Yuanyuan Sun, Hongfei Lin

    Abstract: Objective: Most existing fine-tuned biomedical large language models (LLMs) focus on enhancing performance in monolingual biomedical question answering and conversation tasks. To investigate the effectiveness of the fine-tuned LLMs on diverse biomedical NLP tasks in different languages, We present Taiyi, a bilingual fine-tuned LLM for diverse biomedical tasks. Materials and Methods: We first curat… ▽ More

    Submitted 19 December, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Journal ref: Journal of the American Medical Informatics Association, 2024, ocae037

  50. arXiv:2311.11591  [pdf, other

    cs.AI cs.HC

    DesignGPT: Multi-Agent Collaboration in Design

    Authors: Shiying Ding, Xinyi Chen, Yan Fang, Wenrui Liu, Yiwu Qiu, Chunlei Chai

    Abstract: Generative AI faces many challenges when entering the product design workflow, such as interface usability and interaction patterns. Therefore, based on design thinking and design process, we developed the DesignGPT multi-agent collaboration framework, which uses artificial intelligence agents to simulate the roles of different positions in the design company and allows human designers to collabor… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.