Showing 1–50 of 122 results for author: Tian, L

Searching in archive cs.
  1. arXiv:2406.13170  [pdf, other]

    cs.AI cs.CL

    Amphista: Accelerate LLM Inference with Bi-directional Multiple Drafting Heads in a Non-autoregressive Style

    Authors: Zeping Li, Xinlong Yang, Ziheng Gao, Ji Liu, Zhuang Liu, Dong Li, Jinzhang Peng, Lu Tian, Emad Barsoum

    Abstract: Large Language Models (LLMs) inherently use autoregressive decoding, which lacks parallelism in inference and results in significantly slow inference speeds, especially when hardware parallel accelerators and memory bandwidth are not fully utilized. In this work, we propose Amphista, a speculative decoding algorithm that adheres to a non-autoregressive decoding paradigm. Owing to the increased par… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.07177  [pdf, other]

    cs.LG

    TernaryLLM: Ternarized Large Language Model

    Authors: Tianqi Chen, Zhe Li, Weixiang Xu, Zeyu Zhu, Dong Li, Lu Tian, Emad Barsoum, Peisong Wang, Jian Cheng

    Abstract: Large language models (LLMs) have achieved remarkable performance on Natural Language Processing (NLP) tasks, but they are hindered by high computational costs and memory requirements. Ternarization, an extreme form of quantization, offers a solution by reducing memory usage and enabling energy-efficient floating-point additions. However, applying ternarization to LLMs faces challenges stemming fr… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2406.06025  [pdf, other]

    cs.SE cs.CL cs.LG

    RepoQA: Evaluating Long Context Code Understanding

    Authors: Jiawei Liu, Jia Le Tian, Vijay Daita, Yuxiang Wei, Yifeng Ding, Yuhan Katherine Wang, Jun Yang, Lingming Zhang

    Abstract: Recent advances have been improving the context windows of Large Language Models (LLMs). To quantify the real long-context capabilities of LLMs, evaluators such as the popular Needle in a Haystack have been developed to test LLMs over a large chunk of raw texts. While effective, current evaluations overlook the insight of how LLMs work with long-context code, i.e., repositories. To this end, we in… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2405.16738  [pdf, other]

    cs.CV

    CARL: A Framework for Equivariant Image Registration

    Authors: Hastings Greer, Lin Tian, Francois-Xavier Vialard, Roland Kwitt, Raul San Jose Estepar, Marc Niethammer

    Abstract: Image registration estimates spatial correspondences between a pair of images. These estimates are typically obtained via numerical optimization or regression by a deep network. A desirable property of such estimators is that a correspondence estimate (e.g., the true oracle correspondence) for an image pair is maintained under deformations of the input images. Formally, the estimator should be equ… ▽ More

    Submitted 28 May, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  5. arXiv:2405.11850  [pdf, other]

    cs.CV

    Rethinking Overlooked Aspects in Vision-Language Models

    Authors: Yuan Liu, Le Tian, Xiao Zhou, Jie Zhou

    Abstract: Recent advancements in large vision-language models (LVLMs), such as GPT4-V and LLaVA, have been substantial. LLaVA's modular architecture, in particular, offers a blend of simplicity and efficiency. Recent works mainly focus on introducing more pre-training and instruction tuning data to improve model's performance. This paper delves into the often-neglected aspects of data efficiency during pre-… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  6. arXiv:2405.10621  [pdf, other]

    cs.LG cs.AI

    Historically Relevant Event Structuring for Temporal Knowledge Graph Reasoning

    Authors: Jinchuan Zhang, Bei Hui, Chong Mu, Ming Sun, Ling Tian

    Abstract: Temporal Knowledge Graph (TKG) reasoning focuses on predicting events through historical information within snapshots distributed on a timeline. Existing studies mainly concentrate on two perspectives of leveraging the history of TKGs, including capturing evolution of each recent snapshot or correlations among global historical facts. Despite the achieved significant accomplishments, these models… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  7. arXiv:2404.17270  [pdf, other]

    cs.IT eess.SP

    Empirical Studies of Propagation Characteristics and Modeling Based on XL-MIMO Channel Measurement: From Far-Field to Near-Field

    Authors: Haiyang Miao, Jianhua Zhang, Pan Tang, Lei Tian, Weirang Zuo, Qi Wei, Guangyi Liu

    Abstract: In the sixth-generation (6G), the extremely large-scale multiple-input-multiple-output (XL-MIMO) is considered a promising enabling technology. With the further expansion of array element number and frequency bands, near-field effects will be more likely to occur in 6G communication systems. The near-field radio communications (NFRC) will become crucial in 6G communication systems. It is known tha… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  8. arXiv:2404.12675  [pdf, other]

    cs.CR

    ESPM-D: Efficient Sparse Polynomial Multiplication for Dilithium on ARM Cortex-M4 and Apple M2

    Authors: Jieyu Zheng, Hong Zhang, Le Tian, Zhuo Zhang, Hanyu Wei, Zhiwei Chu, Yafang Yang, Yunlei Zhao

    Abstract: Dilithium is a lattice-based digital signature scheme standardized by the NIST post-quantum cryptography (PQC) project. In this study, we focus on developing efficient sparse polynomial multiplication implementations of Dilithium for ARM Cortex-M4 and Apple M2, which are both based on the ARM architecture. The ARM Cortex-M4 is commonly utilized in resource-constrained devices such as sensors. Conv… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 19 pages, 1 figure

  9. arXiv:2404.11108  [pdf, other]

    cs.CV

    LADDER: An Efficient Framework for Video Frame Interpolation

    Authors: Tong Shen, Dong Li, Ziheng Gao, Lu Tian, Emad Barsoum

    Abstract: Video Frame Interpolation (VFI) is a crucial technique in various applications such as slow-motion generation, frame rate conversion, video frame restoration etc. This paper introduces an efficient video frame interpolation framework that aims to strike a favorable balance between efficiency and quality. Our framework follows a general paradigm consisting of a flow estimator and a refinement modul… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  10. arXiv:2404.11100  [pdf, other]

    cs.CV cs.LG

    Synthesizing Realistic Data for Table Recognition

    Authors: Qiyu Hou, Jun Wang, Meixuan Qiao, Lujun Tian

    Abstract: To overcome the limitations and challenges of current automatic table data annotation methods and random table data synthesis approaches, we propose a novel method for synthesizing annotation data specifically designed for table recognition. This method utilizes the structure and content of existing complex tables, facilitating the efficient creation of tables that closely replicate the authentic… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: ICDAR 2024

  11. arXiv:2404.10343  [pdf, other]

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  12. arXiv:2404.07821  [pdf, other]

    cs.CV

    Sparse Laneformer

    Authors: Ji Liu, Zifeng Zhang, Mingjie Lu, Hongyang Wei, Dong Li, Yile Xie, Jinzhang Peng, Lu Tian, Ashish Sirasao, Emad Barsoum

    Abstract: Lane detection is a fundamental task in autonomous driving, and has achieved great progress as deep learning emerges. Previous anchor-based methods often design dense anchors, which highly depend on the training dataset and remain fixed during inference. We analyze that dense anchors are not necessary for lane detection, and propose a transformer-based lane detection framework based on a sparse an… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  13. arXiv:2403.09475  [pdf, other]

    cs.CR

    Covert Communication for Untrusted UAV-Assisted Wireless Systems

    Authors: Chan Gao, Linying Tian, Dong Zheng

    Abstract: Wireless systems are of paramount importance for providing ubiquitous data transmission for smart cities. However, due to the broadcasting and openness of wireless channels, such systems face potential security challenges. UAV-assisted covert communication is a supporting technology for improving covert performances and has become a hot issue in the research of wireless communication security. Thi… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  14. arXiv:2403.05780  [pdf, other]

    cs.CV

    uniGradICON: A Foundation Model for Medical Image Registration

    Authors: Lin Tian, Hastings Greer, Roland Kwitt, Francois-Xavier Vialard, Raul San Jose Estepar, Sylvain Bouix, Richard Rushmore, Marc Niethammer

    Abstract: Conventional medical image registration approaches directly optimize over the parameters of a transformation model. These approaches have been highly successful and are used generically for registrations of different anatomical regions. Recent deep registration networks are incredibly fast and accurate but are only trained for specific tasks. Hence, they are no longer generic registration approach… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  15. Improving Visual Perception of a Social Robot for Controlled and In-the-wild Human-robot Interaction

    Authors: Wangjie Zhong, Leimin Tian, Duy Tho Le, Hamid Rezatofighi

    Abstract: Social robots often rely on visual perception to understand their users and the environment. Recent advancements in data-driven approaches for computer vision have demonstrated great potentials for applying deep-learning models to enhance a social robot's visual perception. However, the high computational demands of deep-learning methods, as opposed to the more resource-efficient shallow-learning… ▽ More

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: accepted to HRI 2024 (LBR track)

  16. arXiv:2402.17485  [pdf, other]

    cs.CV

    EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

    Authors: Linrui Tian, Qi Wang, Bang Zhang, Liefeng Bo

    Abstract: In this work, we tackle the challenge of enhancing the realism and expressiveness in talking head video generation by focusing on the dynamic and nuanced relationship between audio cues and facial movements. We identify the limitations of traditional techniques that often fail to capture the full spectrum of human expressions and the uniqueness of individual facial styles. To address these issues,… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  17. arXiv:2402.08155  [pdf, other]

    cs.CL cs.AI

    CMA-R: Causal Mediation Analysis for Explaining Rumour Detection

    Authors: Lin Tian, Xiuzhen Zhang, Jey Han Lau

    Abstract: We apply causal mediation analysis to explain the decision-making process of neural models for rumour detection on Twitter. Interventions at the input and network level reveal the causal impacts of tweets and words in the model output. We find that our approach CMA-R -- Causal Mediation Analysis for Rumour detection -- identifies salient tweets that explain model predictions and show strong agreem… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 9 pages, 7 figures, Accepted by EACL 2024 Findings

  18. arXiv:2401.09084  [pdf, other]

    cs.CV

    UniVG: Towards UNIfied-modal Video Generation

    Authors: Ludan Ruan, Lei Tian, Chuanwei Huang, Xu Zhang, Xinyan Xiao

    Abstract: Diffusion based video generation has received extensive attention and achieved considerable success within both the academic and industrial communities. However, current efforts are mainly concentrated on single-objective or single-task video generation, such as generation driven by text, by image, or by a combination of text and image. This cannot fully meet the needs of real-world application sc… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  19. Walert: Putting Conversational Search Knowledge into Action by Building and Evaluating a Large Language Model-Powered Chatbot

    Authors: Sachin Pathiyan Cherumanal, Lin Tian, Futoon M. Abushaqra, Angel Felipe Magnossao de Paula, Kaixin Ji, Danula Hettiachchi, Johanne R. Trippas, Halil Ali, Falk Scholer, Damiano Spina

    Abstract: Creating and deploying customized applications is crucial for operational success and enriching user experiences in the rapidly evolving modern business world. A prominent facet of modern user experiences is the integration of chatbots or voice assistants. The rapid evolution of Large Language Models (LLMs) has provided a powerful tool to build conversational applications. We present Walert, a cus… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: Accepted at 2024 ACM SIGIR CHIIR

  20. arXiv:2401.07061  [pdf, other]

    cs.CV

    Dual-View Data Hallucination with Semantic Relation Guidance for Few-Shot Image Recognition

    Authors: Hefeng Wu, Guangzhi Ye, Ziyang Zhou, Ling Tian, Qing Wang, Liang Lin

    Abstract: Learning to recognize novel concepts from just a few image samples is very challenging as the learned model is easily overfitted on the few data and results in poor generalizability. One promising but underexplored solution is to compensate the novel classes by generating plausible samples. However, most existing works of this line exploit visual information only, rendering the generated data easy… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 13 pages

  21. arXiv:2401.06426  [pdf, other]

    cs.CV cs.AI

    UPDP: A Unified Progressive Depth Pruner for CNN and Vision Transformer

    Authors: Ji Liu, Dehua Tang, Yuanxian Huang, Li Zhang, Xiaocheng Zeng, Dong Li, Mingjie Lu, Jinzhang Peng, Yu Wang, Fan Jiang, Lu Tian, Ashish Sirasao

    Abstract: Traditional channel-wise pruning methods by reducing network channels struggle to effectively prune efficient CNN models with depth-wise convolutional layers and certain efficient modules, such as popular inverted residual blocks. Prior depth pruning methods by reducing network depths are not suitable for pruning some efficient models due to the existence of some normalization layers. Moreover, fi… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  22. arXiv:2401.05870  [pdf, other]

    cs.CV cs.AI

    HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models

    Authors: Hanzhang Wang, Haoran Wang, Jinze Yang, Zhongrui Yu, Zeke Xie, Lei Tian, Xinyan Xiao, Junjun Jiang, Xianming Liu, Mingming Sun

    Abstract: The goal of Arbitrary Style Transfer (AST) is injecting the artistic features of a style reference into a given image/video. Existing methods usually focus on pursuing the balance between style and content, whereas ignoring the significant demand for flexible and customized stylization results and thereby limiting their practical application. To address this critical issue, a novel AST approach na… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  23. arXiv:2401.00683  [pdf, ps, other]

    cs.IT

    Asymptotically Optimal Sequence Sets With Low/Zero Ambiguity Zone Properties

    Authors: Liying Tian, Xiaoshi Song, Zilong Liu, Yubo Li

    Abstract: Sequences with low/zero ambiguity zone (LAZ/ZAZ) properties are useful for modern wireless communication and radar systems operating in mobile environments. This paper first presents a new family of ZAZ sequence sets by generalizing an earlier construction of zero correlation zone (ZCZ) sequences arising from perfect nonlinear functions. We then introduce a second family of ZAZ sequence sets with… ▽ More

    Submitted 1 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  24. Learning Multi-graph Structure for Temporal Knowledge Graph Reasoning

    Authors: Jinchuan Zhang, Bei Hui, Chong Mu, Ling Tian

    Abstract: Temporal Knowledge Graph (TKG) reasoning that forecasts future events based on historical snapshots distributed over timestamps is denoted as extrapolation and has gained significant attention. Owing to its extreme versatility and variation in spatial and temporal correlations, TKG reasoning presents a challenging task, demanding efficient capture of concurrent structures and evolutional interacti… ▽ More

    Submitted 26 February, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  25. arXiv:2311.14986  [pdf, other]

    cs.CV

    SAME++: A Self-supervised Anatomical eMbeddings Enhanced medical image registration framework using stable sampling and regularized transformation

    Authors: Lin Tian, Zi Li, Fengze Liu, Xiaoyu Bai, Jia Ge, Le Lu, Marc Niethammer, Xianghua Ye, Ke Yan, Daikai Jin

    Abstract: Image registration is a fundamental medical image analysis task. Ideally, registration should focus on aligning semantically corresponding voxels, i.e., the same anatomical locations. However, existing methods often optimize similarity measures computed directly on intensities or on hand-crafted features, which lack anatomical semantic information. These similarity measures may lead to sub-optimal… ▽ More

    Submitted 25 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

  26. arXiv:2311.14762  [pdf, other]

    cs.CV cs.AI

    The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024

    Authors: Benjamin Kiefer, Lojze Žust, Matej Kristan, Janez Perš, Matija Teršek, Arnold Wiliem, Martin Messmer, Cheng-Yen Yang, Hsiang-Wei Huang, Zhongyu Jiang, Heng-Cheng Kuo, Jie Mei, Jenq-Neng Hwang, Daniel Stadler, Lars Sommer, Kaer Huang, Aiguo Zheng, Weitu Chong, Kanokphan Lertniphonphan, Jun Xie, Feng Chen, Jian Li, Zhepeng Wang, Luca Zedda, Andrea Loddo, et al. (24 additional authors not shown)

    Abstract: The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024 addresses maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicles (USV). Three challenges categories are considered: (i) UAV-based Maritime Object Tracking with Re-identification, (ii) USV-based Maritime Obstacle Segmentation and Detection, (iii) USV-based Maritime Boat Tracking. The USV-based Maritime Obst… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Part of 2nd Workshop on Maritime Computer Vision (MaCVi) 2024 IEEE Xplore submission as part of WACV 2024

  27. arXiv:2311.04441  [pdf, other]

    cs.LG cs.AI cs.SI

    MixTEA: Semi-supervised Entity Alignment with Mixture Teaching

    Authors: Feng Xie, Xin Song, Xiang Zeng, Xuechen Zhao, Lei Tian, Bin Zhou, Yusong Tan

    Abstract: Semi-supervised entity alignment (EA) is a practical and challenging task because of the lack of adequate labeled mappings as training data. Most works address this problem by generating pseudo mappings for unlabeled entities. However, they either suffer from the erroneous (noisy) pseudo mappings or largely ignore the uncertainty of pseudo mappings. In this paper, we propose a novel semi-supervise… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Findings of EMNLP 2023; 11 pages, 4 figures; code see https://github.com/Xiefeng69/MixTEA

  28. arXiv:2311.01033  [pdf, other]

    cs.LG cs.AI cs.SI

    Non-Autoregressive Diffusion-based Temporal Point Processes for Continuous-Time Long-Term Event Prediction

    Authors: Wang-Tao Zhou, Zhao Kang, Ling Tian

    Abstract: Continuous-time long-term event prediction plays an important role in many application scenarios. Most existing works rely on autoregressive frameworks to predict event sequences, which suffer from error accumulation, thus compromising prediction quality. Inspired by the success of denoising diffusion probabilistic models, we propose a diffusion-based non-autoregressive temporal point process mode… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  29. arXiv:2311.00567  [pdf]

    eess.IV cs.CV cs.LG physics.med-ph q-bio.QM

    A Robust Deep Learning Method with Uncertainty Estimation for the Pathological Classification of Renal Cell Carcinoma based on CT Images

    Authors: Ni Yao, Hang Hu, Kaicong Chen, Chen Zhao, Yuan Guo, Boya Li, Jiaofen Nan, Yanting Li, Chuang Han, Fubao Zhu, Weihua Zhou, Li Tian

    Abstract: Objectives To develop and validate a deep learning-based diagnostic model incorporating uncertainty estimation so as to facilitate radiologists in the preoperative differentiation of the pathological subtypes of renal cell carcinoma (RCC) based on CT images. Methods Data from 668 consecutive patients, pathologically proven RCC, were retrospectively collected from Center 1. By using five-fold cross… ▽ More

    Submitted 12 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 16 pages, 6 figures

  30. arXiv:2310.14228  [pdf, other]

    cs.CV

    Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection

    Authors: Ruiying Lu, YuJie Wu, Long Tian, Dongsheng Wang, Bo Chen, Xiyang Liu, Ruimin Hu

    Abstract: Unsupervised image Anomaly Detection (UAD) aims to learn robust and discriminative representations of normal samples. While separate solutions per class endow expensive computation and limited generalizability, this paper focuses on building a unified framework for multiple classes. Under such a challenging setting, popular reconstruction-based networks with continuous latent representation assump… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  31. arXiv:2309.09301  [pdf, other]

    cs.CV

    RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation

    Authors: Lijun Li, Linrui Tian, Xindi Zhang, Qi Wang, Bang Zhang, Mengyuan Liu, Chen Chen

    Abstract: The current interacting hand (IH) datasets are relatively simplistic in terms of background and texture, with hand joints being annotated by a machine annotator, which may result in inaccuracies, and the diversity of pose distribution is limited. However, the variability of background, pose distribution, and texture can greatly influence the generalization ability. Therefore, we present a large-sc… ▽ More

    Submitted 27 September, 2023; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV 2023

  32. arXiv:2309.07322  [pdf, other]

    cs.CV

    $\texttt{NePhi}$: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration

    Authors: Lin Tian, Hastings Greer, Raúl San José Estépar, Soumyadip Sengupta, Marc Niethammer

    Abstract: This work proposes NePhi, a generalizable neural deformation model which results in approximately diffeomorphic transformations. In contrast to the predominant voxel-based transformation fields used in learning-based registration approaches, NePhi represents deformations functionally, leading to great flexibility within the design space of memory consumption during training and inference, inferenc… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

  33. arXiv:2308.04748  [pdf, other]

    cs.SE cs.LG

    Fuzz4All: Universal Fuzzing with Large Language Models

    Authors: Chunqiu Steven Xia, Matteo Paltenghi, Jia Le Tian, Michael Pradel, Lingming Zhang

    Abstract: Fuzzing has achieved tremendous success in discovering bugs and vulnerabilities in various software systems. Systems under test (SUTs) that take in programming or formal language as inputs, e.g., compilers, runtime engines, constraint solvers, and software libraries with accessible APIs, are especially important as they are fundamental building blocks of software development. However, existing fuz… ▽ More

    Submitted 15 January, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted at ICSE 2024

  34. arXiv:2308.03047  [pdf, other]

    cs.CV

    Prototypes-oriented Transductive Few-shot Learning with Conditional Transport

    Authors: Long Tian, Jingyi Feng, Wenchao Chen, Xiaoqiang Chai, Liming Wang, Xiyang Liu, Bo Chen

    Abstract: Transductive Few-Shot Learning (TFSL) has recently attracted increasing attention since it typically outperforms its inductive peer by leveraging statistics of query samples. However, previous TFSL methods usually encode uniform prior that all the classes within query samples are equally likely, which is biased in imbalanced TFSL and causes severe performance degradation. Given this pivotal issu… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV2023

  35. arXiv:2307.09727  [pdf, other]

    cs.CV

    SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid

    Authors: Zi Li, Lin Tian, Tony C. W. Mok, Xiaoyu Bai, Puyang Wang, Jia Ge, Jingren Zhou, Le Lu, Xianghua Ye, Ke Yan, Dakai Jin

    Abstract: Estimating displacement vector field via a cost volume computed in the feature space has shown great success in image registration, but it suffers excessive computation burdens. Moreover, existing feature descriptors only extract local features incapable of representing the global semantic information, which is especially important for solving large transformations. To address the discussed issues… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  36. arXiv:2307.06207  [pdf, other]

    eess.IV cs.LG physics.comp-ph physics.optics

    Local Conditional Neural Fields for Versatile and Generalizable Large-Scale Reconstructions in Computational Imaging

    Authors: Hao Wang, Jiabei Zhu, Yunzhe Li, QianWan Yang, Lei Tian

    Abstract: Deep learning has transformed computational imaging, but traditional pixel-based representations limit their ability to capture continuous, multiscale details of objects. Here we introduce a novel Local Conditional Neural Fields (LCNF) framework, leveraging a continuous implicit neural representation to address this limitation. LCNF enables flexible object representation and facilitates the recons… ▽ More

    Submitted 22 July, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

  37. arXiv:2307.01557  [pdf, other]

    cs.CV cs.AI

    Separated RoadTopoFormer

    Authors: Mingjie Lu, Yuanxian Huang, Ji Liu, Jinzhang Peng, Lu Tian, Ashish Sirasao

    Abstract: Understanding driving scenarios is crucial to realizing autonomous driving. Previous works such as map learning and BEV lane detection neglect the connection relationship between lane instances, and traffic elements detection tasks usually neglect the relationship with lane lines. To address these issues, the task is presented which includes 4 sub-tasks, the detection of traffic elements, the dete… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  38. arXiv:2307.00266  [pdf, other]

    cs.CL cs.AI

    Hierarchical Pretraining for Biomedical Term Embeddings

    Authors: Bryan Cai, Sihang Zeng, Yucong Lin, Zheng Yuan, Doudou Zhou, Lu Tian

    Abstract: Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

  39. arXiv:2306.14072  [pdf, other]

    cs.LG cs.AI cs.SI

    Intensity-free Convolutional Temporal Point Process: Incorporating Local and Global Event Contexts

    Authors: Wang-Tao Zhou, Zhao Kang, Ling Tian, Yi Su

    Abstract: Event prediction in the continuous-time domain is a crucial but rather difficult task. Temporal point process (TPP) learning models have shown great advantages in this area. Existing models mainly focus on encoding global contexts of events using techniques like recurrent neural networks (RNNs) or self-attention mechanisms. However, local event contexts also play an important role in the occurrenc… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: Accepted to Information Sciences

  40. arXiv:2306.04424  [pdf, other]

    cs.CL

    Examining Bias in Opinion Summarisation Through the Perspective of Opinion Diversity

    Authors: Nannan Huang, Lin Tian, Haytham Fayek, Xiuzhen Zhang

    Abstract: Opinion summarisation is a task that aims to condense the information presented in the source documents while retaining the core message and opinions. A summary that only represents the majority opinions will leave the minority opinions unrepresented in the summary. In this paper, we use the stance towards a certain target as an opinion. We study bias in opinion summarisation from the perspective… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 9 pages, 3 figures, accepted at WASSA, ACL 2023

  41. arXiv:2305.09176  [pdf, other]

    cs.GR

    Stochastic Porous Microstructures

    Authors: Zhongren Wang, Lihao Tian, Xiaokang Liu, Andrei Sharf, Lin Lu

    Abstract: Stochastic porous structures are ubiquitous in natural phenomena and have gained considerable traction across diverse domains owing to their exceptional physical properties. The recent surge in interest in microstructures can be attributed to their impressive attributes, such as a high strength-to-weight ratio, isotropic elasticity, and bio-inspired design principles. Notwithstanding, extant stoch… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 27 pages, 16 figures

  42. arXiv:2305.00087  [pdf, other]

    cs.CV

    Inverse Consistency by Construction for Multistep Deep Registration

    Authors: Hastings Greer, Lin Tian, Francois-Xavier Vialard, Roland Kwitt, Sylvain Bouix, Raul San Jose Estepar, Richard Rushmore, Marc Niethammer

    Abstract: Inverse consistency is a desirable property for image registration. We propose a simple technique to make a neural registration network inverse consistent by construction, as a consequence of its structure, as long as it parameterizes its output transform by a Lie group. We extend this technique to multi-step neural registration by composing many such networks in a way that preserves inverse consi… ▽ More

    Submitted 9 October, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

  43. arXiv:2304.10712  [pdf, other]

    cs.CV cs.AI

    Adversarial Infrared Blocks: A Multi-view Black-box Attack to Thermal Infrared Detectors in Physical World

    Authors: Chengyin Hu, Weiwen Shi, Tingsong Jiang, Wen Yao, Ling Tian, Xiaoqian Chen

    Abstract: Infrared imaging systems have a vast array of potential applications in pedestrian detection and autonomous driving, and their safety performance is of great concern. However, few studies have explored the safety of infrared imaging systems in real-world settings. Previous research has used physical perturbations such as small bulbs and thermal "QR codes" to attack infrared imaging detectors, but…

    Submitted 28 July, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  44. arXiv:2304.09421  [pdf, other]

    cs.CL cs.CV cs.LG cs.SI

    TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection

    Authors: Quanjiang Guo, Zhao Kang, Ling Tian, Zhouguo Chen

    Abstract: Fake news detection aims to detect fake news widely spreading on social media platforms, which can negatively influence the public and the government. Many approaches have been developed to exploit relevant information from news images, text, or videos. However, these methods may suffer from the following limitations: (1) ignore the inherent emotional information of the news, which could be benefi…

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: Appears at IJCNN 2023

  45. MetaTroll: Few-shot Detection of State-Sponsored Trolls with Transformer Adapters

    Authors: Lin Tian, Xiuzhen Zhang, Jey Han Lau

    Abstract: State-sponsored trolls are the main actors of influence campaigns on social media and automatic troll detection is important to combat misinformation at scale. Existing troll detection models are developed based on training data for known campaigns (e.g. the influence campaign by Russia's Internet Research Agency on the 2016 US Election), and they fall short when dealing with novel campaign…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 11 pages, 2 figures, Accepted by the Web Conference 2023 (WWW 2023)

  46. arXiv:2303.06879  [pdf, other]

    cs.LG cs.CV

    Spacecraft Anomaly Detection with Attention Temporal Convolution Network

    Authors: Liang Liu, Ling Tian, Zhao Kang, Tianqi Wan

    Abstract: Spacecraft faces various situations when carrying out exploration missions in complex space, thus monitoring the anomaly status of spacecraft is crucial to the development of the aerospace industry. The time series telemetry data generated by on-orbit spacecraft contains important information about the status of spacecraft. However, traditional domain knowledge-…

    Submitted 13 March, 2023; originally announced March 2023.

  47. arXiv:2303.03912  [pdf, other]

    cs.CL cs.AI cs.LG cs.SI

    Document-level Relation Extraction with Cross-sentence Reasoning Graph

    Authors: Hongfei Liu, Zhao Kang, Lizong Zhang, Ling Tian, Fujun Hua

    Abstract: Relation extraction (RE) has recently moved from the sentence-level to document-level, which requires aggregating document information and using entities and mentions for reasoning. Existing works put entity nodes and mention nodes with similar representations in a document-level graph, whose complex edges may incur redundant information. Furthermore, existing studies only focus on entity-level re…

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: This paper is accepted by PAKDD 2023

  48. arXiv:2303.00146  [pdf, other]

    cs.HC cs.RO cs.SD eess.AS

    I Know Your Feelings Before You Do: Predicting Future Affective Reactions in Human-Computer Dialogue

    Authors: Yuanchao Li, Koji Inoue, Leimin Tian, Changzeng Fu, Carlos Ishi, Hiroshi Ishiguro, Tatsuya Kawahara, Catherine Lai

    Abstract: Current Spoken Dialogue Systems (SDSs) often serve as passive listeners that respond only after receiving user speech. To achieve human-like dialogue, we propose a novel future prediction architecture that allows an SDS to anticipate future affective reactions based on its current behaviors before the user speaks. In this work, we investigate two scenarios: speech and laughter. In speech, we propo…

    Submitted 17 March, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

    Comments: Accepted to CHI2023 Late-Breaking Work

  49. arXiv:2302.12261  [pdf, ps, other]

    math.OC cs.LG

    Testing Stationarity Concepts for ReLU Networks: Hardness, Regularity, and Robust Algorithms

    Authors: Lai Tian, Anthony Man-Cho So

    Abstract: We study the computational problem of the stationarity test for the empirical loss of neural networks with ReLU activation functions. Our contributions are: Hardness: We show that checking a certain first-order approximate stationarity concept for a piecewise linear function is co-NP-hard. This implies that testing a certain stationarity concept for a modern nonsmooth neural network is in genera…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: 42 pages

  50. arXiv:2302.08710  [pdf, other]

    cs.LG

    Cross-Domain Label Propagation for Domain Adaptation with Discriminative Graph Self-Learning

    Authors: Lei Tian, Yongqiang Tang, Liangchen Hu, Wensheng Zhang

    Abstract: Domain adaptation manages to transfer the knowledge of well-labeled source data to unlabeled target data. Many recent efforts focus on improving the prediction accuracy of target pseudo-labels to reduce conditional distribution shift. In this paper, we propose a novel domain adaptation method, which infers target pseudo-labels through cross-domain label propagation, such that the underlying manifo…

    Submitted 17 February, 2023; originally announced February 2023.