Showing 51–100 of 702 results for author: Tan, Z

  1. arXiv:2404.13985 [pdf, other]

    cs.CL

    Information Re-Organization Improves Reasoning in Large Language Models

    Authors: Xiaoxia Cheng, Zeqi Tan, Wei Xue, Weiming Lu

    Abstract: Improving the reasoning capabilities of large language models (LLMs) has attracted considerable interest. Recent approaches primarily focus on improving the reasoning process to yield a more precise final answer. However, in scenarios involving contextually aware reasoning, these methods neglect the importance of first identifying logical relationships from the context before proceeding with the r…

    Submitted 24 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 15 pages, 4 figures

  2. arXiv:2404.13788 [pdf, other]

    cs.CV cs.AI

    AnyPattern: Towards In-context Image Copy Detection

    Authors: Wenhao Wang, Yifan Sun, Zhentao Tan, Yi Yang

    Abstract: This paper explores in-context learning for image copy detection (ICD), i.e., prompting an ICD model to identify replicated images with new tampering patterns without the need for additional training. The prompts (or the contexts) are from a small set of image-replica pairs that reflect the new patterns and are used at inference time. Such in-context ICD has good realistic value, because it requir…

    Submitted 28 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: The project is publicly available at https://anypattern.github.io. arXiv admin note: text overlap with arXiv:2403.06098

  3. arXiv:2404.13446 [pdf, other]

    cs.DS

    New Structures and Algorithms for Length-Constrained Expander Decompositions

    Authors: Bernhard Haeupler, D Ellis Hershkowitz, Zihan Tan

    Abstract: Expander decompositions form the basis of one of the most flexible paradigms for close-to-linear-time graph algorithms. Length-constrained expander decompositions generalize this paradigm to better work for problems with lengths, distances and costs. Roughly, an $(h,s)$-length $φ$-expander decomposition is a small collection of length increases to a graph so that nodes within distance $h$ can rout…

    Submitted 15 May, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

    Comments: Added funding info

  4. arXiv:2404.07850 [pdf, other]

    cs.CV cs.AI

    MindBridge: A Cross-Subject Brain Decoding Framework

    Authors: Shizun Wang, Songhua Liu, Zhenxiong Tan, Xinchao Wang

    Abstract: Brain decoding, a pivotal field in neuroscience, aims to reconstruct stimuli from acquired brain signals, primarily utilizing functional magnetic resonance imaging (fMRI). Currently, brain decoding is confined to a per-subject-per-model paradigm, limiting its applicability to the same individual for whom the decoding model is trained. This constraint stems from three key challenges: 1) the inheren…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 highlight. Code is available at https://github.com/littlepure2333/MindBridge

  5. arXiv:2404.05052 [pdf, other]

    cs.CV

    Facial Affective Behavior Analysis with Instruction Tuning

    Authors: Yifan Li, Anh Dao, Wentao Bao, Zhen Tan, Tianlong Chen, Huan Liu, Yu Kong

    Abstract: Facial affective behavior analysis (FABA) is crucial for understanding human mental states from images. However, traditional approaches primarily deploy models to discriminate among discrete emotion categories, and lack the fine granularity and reasoning capability for complex facial behaviors. The advent of Multi-modal Large Language Models (MLLMs) has been proven successful in general visual und…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: V1.0

  6. arXiv:2404.00513 [pdf, other]

    cs.CV

    Transformer based Pluralistic Image Completion with Reduced Information Loss

    Authors: Qiankun Liu, Yuqi Jiang, Zhentao Tan, Dongdong Chen, Ying Fu, Qi Chu, Gang Hua, Nenghai Yu

    Abstract: Transformer based methods have achieved great success in image inpainting recently. However, we find that these solutions regard each pixel as a token, thus suffering from an information loss issue from two aspects: 1) They downsample the input image into much lower resolutions for efficiency consideration. 2) They quantize $256^3$ RGB values to a small number (such as 512) of quantized color valu…

    Submitted 14 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted by TPAMI (2024). arXiv admin note: text overlap with arXiv:2205.05076

  7. arXiv:2404.00091 [pdf, other]

    quant-ph

    Non-Abelian braiding of Fibonacci anyons with a superconducting processor

    Authors: Shibo Xu, Zheng-Zhi Sun, Ke Wang, Hekang Li, Zitian Zhu, Hang Dong, Jinfeng Deng, Xu Zhang, Jiachen Chen, Yaozu Wu, Chuanyu Zhang, Feitong Jin, Xuhao Zhu, Yu Gao, Aosai Zhang, Ning Wang, Yiren Zou, Ziqi Tan, Fanhao Shen, Jiarun Zhong, Zehang Bao, Weikang Li, Wenjie Jiang, Li-Wei Yu, Zixuan Song, et al. (7 additional authors not shown)

    Abstract: Non-Abelian topological orders offer an intriguing path towards fault-tolerant quantum computation, where information can be encoded and manipulated in a topologically protected manner immune to arbitrary local noises and perturbations. However, realizing non-Abelian topologically ordered states is notoriously challenging in both condensed matter and programmable quantum systems, and it was not un…

    Submitted 29 March, 2024; originally announced April 2024.

  8. arXiv:2403.19695 [pdf, ps, other]

    math.AG

    Classification of quasi-affine Generalized Dynkin Diagrams with Rank $> 5$

    Authors: Zhengtang Tan, Shouchuan Zhang

    Abstract: All quasi-affine connected Generalized Dynkin Diagrams with rank $> 5$ are found. All quasi-affine Nichols (Lie braided) algebras with rank $> 5$ are also found.

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 77 pages. arXiv admin note: substantial text overlap with arXiv:2202.09940

    MSC Class: 16W30; 16G10

  9. arXiv:2403.19177 [pdf, other]

    cs.CV cs.AI

    Rethinking Information Loss in Medical Image Segmentation with Various-sized Targets

    Authors: Tianyi Liu, Zhaorui Tan, Kaizhu Huang, Haochuan Jiang

    Abstract: Medical image segmentation presents the challenge of segmenting various-size targets, demanding the model to effectively capture both local and global information. Despite recent efforts using CNNs and ViTs to predict annotations of different scales, these approaches often struggle to effectively balance the detection of targets across varying sizes. Simply utilizing local information from CNNs an…

    Submitted 28 March, 2024; originally announced March 2024.

  10. arXiv:2403.18560 [pdf, other]

    eess.AS cs.LG cs.SD

    Noise-Robust Keyword Spotting through Self-supervised Pretraining

    Authors: Jacob Mørk, Holger Severin Bovbjerg, Gergely Kiss, Zheng-Hua Tan

    Abstract: Voice assistants are now widely available, and to activate them a keyword spotting (KWS) algorithm is used. Modern KWS systems are mainly trained using supervised learning methods and require a large amount of labelled data to achieve a good performance. Leveraging unlabelled data through self-supervised learning (SSL) has been shown to increase the accuracy in clean conditions. This paper explore…

    Submitted 27 March, 2024; originally announced March 2024.

    MSC Class: 68T10; ACM Class: I.2.6

  11. arXiv:2403.17701   

    eess.IV cs.CV cs.LG

    Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation

    Authors: Hao Tang, Lianglun Cheng, Guoheng Huang, Zhengguang Tan, Junhao Lu, Kaihong Wu

    Abstract: Image segmentation holds a vital position in the realms of diagnosis and treatment within the medical domain. Traditional convolutional neural networks (CNNs) and Transformer models have made significant advancements in this realm, but they still encounter challenges because of limited receptive field or high computing complexity. Recently, State Space Models (SSMs), particularly Mamba and its var…

    Submitted 3 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Experimental method encountered errors, undergoing experiment again

  12. arXiv:2403.16935 [pdf, other]

    quant-ph

    Measuring Spectral Form Factor in Many-Body Chaotic and Localized Phases of Quantum Processors

    Authors: Hang Dong, Pengfei Zhang, Ceren B. Dag, Yu Gao, Ning Wang, Jinfeng Deng, Xu Zhang, Jiachen Chen, Shibo Xu, Ke Wang, Yaozu Wu, Chuanyu Zhang, Feitong Jin, Xuhao Zhu, Aosai Zhang, Yiren Zou, Ziqi Tan, Zhengyi Cui, Zitian Zhu, Fanhao Shen, Tingting Li, Jiarun Zhong, Zehang Bao, Hekang Li, Zhen Wang, et al. (6 additional authors not shown)

    Abstract: The spectral form factor (SFF) captures universal spectral fluctuations as signatures of quantum chaos, and has been instrumental in advancing multiple frontiers of physics including the studies of black holes and quantum many-body systems. However, the measurement of SFF in many-body systems is challenging due to the difficulty in resolving level spacings that become exponentially small with incr…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures

  13. arXiv:2403.15611 [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Electrically Switchable Circular Photogalvanic Effect in Methylammonium Lead Iodide Microcrystals

    Authors: Yuqing Zhu, Ziyi Song, Rodrigo Becerra Silva, Bob Minyu Wang, Henry Clark Travaglini, Andrew C Grieder, Yuan Ping, Liang Z. Tan, Dong Yu

    Abstract: We investigate the circular photogalvanic effect (CPGE) in single-crystalline methylammonium lead iodide microcrystals under a static electric field. The external electric field can enhance the magnitude of the helicity dependent photocurrent (HDPC) by two orders of magnitude and flip its sign, which we attribute to magnetic shift currents induced by the Rashba-Edelstein effect. This HDPC induced…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 7 pages, 3 figures in main text. 20 pages, 14 figures in supplementary material

  14. arXiv:2403.10754 [pdf, other]

    astro-ph.GA astro-ph.CO

    CSST large-scale structure analysis pipeline: I. constructing reference mock galaxy redshift surveys

    Authors: Yizhou Gu, Xiaohu Yang, Jiaxin Han, Yirong Wang, Qingyang Li, Zhenlin Tan, Wenkang Jiang, Yaru Wang, Jiaqi Wang, Antonios Katsianis, Xiaoju Xu, Haojie Xu, Wensheng Hong, Houjun Mo, Run Wen, Xianzhong Zheng, Feng Shi, Pengjie Zhang, Zhongxu Zhai, Chengze Liu, Wenting Wang, Ying Zu, Hong Guo, Youcai Zhang, Yi Lu, et al. (7 additional authors not shown)

    Abstract: In this paper, we set out to construct a set of reference mock galaxy redshift surveys (MGRSs) for the future Chinese Space-station Survey Telescope (CSST) observation, where subsequent survey selection effects can be added and evaluated. This set of MGRSs is generated using the dark matter subhalos extracted from a high-resolution Jiutian $N$-body simulation of the standard $Λ$CDM cosmogony with…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 13 pages, 9 figures, accepted for publication in MNRAS

  15. arXiv:2403.10744 [pdf, ps, other]

    cs.AI

    Game and Reference: Policy Combination Synthesis for Epidemic Prevention and Control

    Authors: Zhiyi Tan, Bingkun Bao

    Abstract: In recent years, epidemic policy-making models are increasingly being used to provide reference for governors on prevention and control policies against catastrophic epidemics such as SARS, H1N1 and COVID-19. Existing studies are currently constrained by two issues: First, previous methods develop policies based on effect evaluation, since few of factors in real-world decision-making can be modele…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 16 pages, single line, 7 figures, written with Springer conference template

  16. How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses

    Authors: Peter Leer, Jesper Jensen, Zheng-Hua Tan, Jan Østergaard, Lars Bramsløw

    Abstract: Advanced auditory models are useful in designing signal-processing algorithms for hearing-loss compensation or speech enhancement. Such auditory models provide rich and detailed descriptions of the auditory pathway, and might allow for individualization of signal-processing strategies, based on physiological measurements. However, these auditory models are often computationally demanding, requirin…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing. This version is the authors' version and may vary from the final publication in details

  17. arXiv:2403.10420 [pdf, other]

    eess.AS

    Neural Networks Hear You Loud And Clear: Hearing Loss Compensation Using Deep Neural Networks

    Authors: Peter Leer, Jesper Jensen, Laurel Carney, Zheng-Hua Tan, Jan Østergaard, Lars Bramsløw

    Abstract: This article investigates the use of deep neural networks (DNNs) for hearing-loss compensation. Hearing loss is a prevalent issue affecting millions of people worldwide, and conventional hearing aids have limitations in providing satisfactory compensation. DNNs have shown remarkable performance in various auditory tasks, including speech recognition, speaker identification, and music classificatio…

    Submitted 15 March, 2024; originally announced March 2024.

  18. Thought Graph: Generating Thought Process for Biological Reasoning

    Authors: Chi-Yang Hsu, Kyle Cox, Jiawei Xu, Zhen Tan, Tianhua Zhai, Mengzhou Hu, Dexter Pratt, Tianlong Chen, Ziniu Hu, Ying Ding

    Abstract: We present the Thought Graph as a novel framework to support complex reasoning and use gene set analysis as an example to uncover semantic relationships between biological processes. Our framework stands out for its ability to provide a deeper understanding of gene sets, significantly surpassing GSEA by 40.28% and LLM baselines by 5.38% based on cosine similarity to human annotations. Our analysis…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 4 pages. Accepted by Web Conf 2024

  19. arXiv:2403.06374 [pdf]

    physics.optics

    Intrinsic polarization conversion and avoided-mode crossing in X-cut lithium niobate microrings

    Authors: Zelin Tan, Jianfa Zhang, Zhihong Zhu, Wei Chen, Zhengzheng Shao, Ken Liu, Shiqiao Qin

    Abstract: Compared with well-developed free space polarization converters, polarization conversion between TE and TM modes in waveguide is generally considered to be caused by shape birefringence, like curvature, morphology of waveguide cross section and scattering. Here, we reveal a hidden polarization conversion mechanism in X-cut lithium niobate microrings, that is the conversion can be implemented by bi…

    Submitted 10 March, 2024; originally announced March 2024.

  20. arXiv:2403.05636 [pdf, other]

    cs.AI cs.CL

    Tuning-Free Accountable Intervention for LLM Deployment -- A Metacognitive Approach

    Authors: Zhen Tan, Jie Peng, Tianlong Chen, Huan Liu

    Abstract: Large Language Models (LLMs) have catalyzed transformative advances across a spectrum of natural language processing tasks through few-shot or zero-shot prompting, bypassing the need for parameter tuning. While convenient, this modus operandi aggravates ``hallucination'' concerns, particularly given the enigmatic ``black-box'' nature behind their gigantic model sizes. Such concerns are exacerbated…

    Submitted 8 March, 2024; originally announced March 2024.

  21. arXiv:2403.05239 [pdf, other]

    cs.CV cs.AI cs.LG

    Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation

    Authors: Junyan Wang, Zhenhong Sun, Zhiyu Tan, Xuanbai Chen, Weihua Chen, Hao Li, Cheng Zhang, Yang Song

    Abstract: Vanilla text-to-image diffusion models struggle with generating accurate human images, commonly resulting in imperfect anatomies such as unnatural postures or disproportionate limbs. Existing methods address this issue mostly by fine-tuning the model with extra images or adding additional controls -- human-centric priors such as pose or depth maps -- during the image generation phase. This paper ex…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  22. arXiv:2403.04009 [pdf, other]

    cs.SI cs.CL cs.CY physics.soc-ph

    Media Bias Matters: Understanding the Impact of Politically Biased News on Vaccine Attitudes in Social Media

    Authors: Bohan Jiang, Lu Cheng, Zhen Tan, Ruocheng Guo, Huan Liu

    Abstract: News media has been utilized as a political tool to stray from facts, presenting biased claims without evidence. Amid the COVID-19 pandemic, politically biased news (PBN) has significantly undermined public trust in vaccines, despite strong medical evidence supporting their efficacy. In this paper, we analyze: (i) how inherent vaccine stances subtly influence individuals' selection of news sources…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 9 pages, 6 figures, 3 tables

  23. arXiv:2403.03675 [pdf, other]

    cs.IT eess.SP

    ZF Beamforming Tensor Compression for Massive MIMO Fronthaul

    Authors: Libin Zheng, Zihao Wang, Minru Bai, Zhenjie Tan

    Abstract: In the rapidly evolving landscape of 5G and beyond 5G (B5G) mobile cellular communications, efficient data compression and reconstruction strategies become paramount, especially in massive multiple-input multiple-output (MIMO) systems. A critical challenge in these systems is the capacity-limited fronthaul, particularly in the context of the Ethernet-based common public radio interface (eCPRI) con…

    Submitted 6 March, 2024; originally announced March 2024.

  24. arXiv:2403.02148 [pdf, other]

    cs.CV

    MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection

    Authors: Tianxiang Chen, Zi Ye, Zhentao Tan, Tao Gong, Yue Wu, Qi Chu, Bin Liu, Nenghai Yu, Jieping Ye

    Abstract: Recently, infrared small target detection (ISTD) has made significant progress, thanks to the development of basic models. Specifically, the models combining CNNs with transformers can successfully extract both local and global features. However, the disadvantage of the transformer is also inherited, i.e., the quadratic computational complexity to sequence length. Inspired by the recent basic mode…

    Submitted 24 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: The first Mamba-based model for infrared small target detection

  25. arXiv:2403.01071 [pdf, other]

    cs.LG cs.AI

    GraphRCG: Self-conditioned Graph Generation via Bootstrapped Representations

    Authors: Song Wang, Zhen Tan, Xinyu Zhao, Tianlong Chen, Huan Liu, Jundong Li

    Abstract: Graph generation generally aims to create new graphs that closely align with a specific graph distribution. Existing works often implicitly capture this distribution through the optimization of generators, potentially overlooking the intricacies of the distribution itself. Furthermore, these approaches generally neglect the insights offered by the learned distribution for graph generation. In cont…

    Submitted 1 March, 2024; originally announced March 2024.

  26. arXiv:2403.00081 [pdf, other]

    cs.CY

    The Constitutions of Web3

    Authors: Joshua Z. Tan, Max Langenkamp, Anna Weichselbraun, Ann Brody, Lucia Korpas

    Abstract: The governance of online communities has been a critical issue since the first USENET groups, and a number of serious constitutions -- declarations of goals, values, and rights -- have emerged since the mid-1990s. More recently, decentralized autonomous organizations (DAOs) have begun to publish their own constitutions, manifestos, and other governance documents. There are two unique aspects to th…

    Submitted 29 February, 2024; originally announced March 2024.

  27. arXiv:2402.18853 [pdf, other]

    cs.LG cs.AI cs.CV

    Rethinking Multi-domain Generalization with A General Learning Objective

    Authors: Zhaorui Tan, Xi Yang, Kaizhu Huang

    Abstract: Multi-domain generalization (mDG) is universally aimed to minimize the discrepancy between training and testing distributions to enhance marginal-to-label distribution mapping. However, existing mDG literature lacks a general learning objective paradigm and often imposes constraints on static target marginal distributions. In this paper, we propose to leverage a $Y$-mapping to relax the constraint…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted by CVPR24

  28. arXiv:2402.17574 [pdf, other]

    cs.AI cs.CL

    Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization

    Authors: Wenqi Zhang, Ke Tang, Hai Wu, Mengna Wang, Yongliang Shen, Guiyang Hou, Zeqi Tan, Peng Li, Yueting Zhuang, Weiming Lu

    Abstract: Large Language Models (LLMs) exhibit robust problem-solving capabilities for diverse tasks. However, most LLM-based agents are designed as specific task solvers with sophisticated prompt engineering, rather than agents capable of learning and evolving through interactions. These task solvers necessitate manually crafted prompts to inform task rules and regulate LLM behaviors, inherently incapacita…

    Submitted 6 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL-2024 Main, camera-ready version

  29. arXiv:2402.16638 [pdf]

    q-bio.QM

    The structure is the message: preserving experimental context through tensor decomposition

    Authors: Zhixin Cyrillus Tan, Aaron S. Meyer

    Abstract: Recent biological studies have been revolutionized in scale and granularity by multiplex and high-throughput assays. Profiling cell responses across several experimental parameters, such as perturbations, time, and genetic contexts, leads to richer and more generalizable findings. However, these multidimensional datasets necessitate a reevaluation of the conventional methods for their representati…

    Submitted 26 February, 2024; originally announced February 2024.

  30. arXiv:2402.14859 [pdf, other]

    cs.CR cs.AI cs.CY cs.LG

    The Wolf Within: Covert Injection of Malice into MLLM Societies via an MLLM Operative

    Authors: Zhen Tan, Chengshuai Zhao, Raha Moraffah, Yifan Li, Yu Kong, Tianlong Chen, Huan Liu

    Abstract: Due to their unprecedented ability to process and respond to various types of data, Multimodal Large Language Models (MLLMs) are constantly defining the new boundary of Artificial General Intelligence (AGI). As these advanced generative models increasingly form collaborative networks for complex tasks, the integrity and security of these systems are crucial. Our paper, ``The Wolf Within'', explore…

    Submitted 2 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted to workshop on ReGenAI@CVPR 2024

  31. arXiv:2402.14230 [pdf, other]

    cs.IR cs.AI

    MerRec: A Large-scale Multipurpose Mercari Dataset for Consumer-to-Consumer Recommendation Systems

    Authors: Lichi Li, Zainul Abi Din, Zhen Tan, Sam London, Tianlong Chen, Ajay Daptardar

    Abstract: In the evolving e-commerce field, recommendation systems crucially shape user experience and engagement. The rise of Consumer-to-Consumer (C2C) recommendation systems, noted for their flexibility and ease of access for customer vendors, marks a significant trend. However, the academic focus remains largely on Business-to-Consumer (B2C) models, leaving a gap filled by the limited C2C recommendation…

    Submitted 21 February, 2024; originally announced February 2024.

  32. arXiv:2402.13446 [pdf, other]

    cs.CL

    Large Language Models for Data Annotation: A Survey

    Authors: Zhen Tan, Dawei Li, Song Wang, Alimohammad Beigi, Bohan Jiang, Amrita Bhattacharjee, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu

    Abstract: Data annotation generally refers to the labeling or generating of raw data with relevant information, which could be used for improving the efficacy of machine learning models. The process, however, is labor-intensive and costly. The emergence of advanced Large Language Models (LLMs), exemplified by GPT-4, presents an unprecedented opportunity to automate the complicated process of data annotation…

    Submitted 23 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  33. arXiv:2402.12753 [pdf]

    physics.optics

    Giant enhancement of higher-order harmonics of an optical-tweezer phonon laser

    Authors: Guangzong Xiao, Tengfang Kuang, Yutong He, Xinlin Chen, Wei Xiong, Xiang Han, Zhongqi Tan, Hui Luo, Hui Jing

    Abstract: Phonon lasers, as mechanical analogues of optical lasers, are unique tools for not only fundamental studies of phononics but also diverse applications such as acoustic imaging and force sensing. Very recently, by levitating a micro-size sphere in an optical tweezer, higher-order mechanical harmonics were observed in the phonon-lasing regime, as the first step towards nonlinear levitated optomechan…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 15 pages, 4 figures

  34. arXiv:2402.12128 [pdf, other]

    cs.CV

    3D Vascular Segmentation Supervised by 2D Annotation of Maximum Intensity Projection

    Authors: Zhanqiang Guo, Zimeng Tan, Jianjiang Feng, Jie Zhou

    Abstract: Vascular structure segmentation plays a crucial role in medical analysis and clinical applications. The practical adoption of fully supervised segmentation models is impeded by the intricacy and time-consuming nature of annotating vessels in the 3D space. This has spurred the exploration of weakly-supervised approaches that reduce reliance on expensive segmentation annotations. Despite this, exist…

    Submitted 19 February, 2024; originally announced February 2024.

  35. arXiv:2402.11453 [pdf, other]

    cs.CL

    MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization

    Authors: Zhiyu Yang, Zihan Zhou, Shuo Wang, Xin Cong, Xu Han, Yukun Yan, Zhenghao Liu, Zhixing Tan, Pengyuan Liu, Dong Yu, Zhiyuan Liu, Xiaodong Shi, Maosong Sun

    Abstract: Scientific data visualization plays a crucial role in research by enabling the direct display of complex information and assisting researchers in identifying implicit patterns. Despite its importance, the use of Large Language Models (LLMs) for scientific data visualization remains rather unexplored. In this study, we introduce MatPlotAgent, an efficient model-agnostic LLM agent framework designed…

    Submitted 19 March, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: Work in Progress

  36. arXiv:2402.10551 [pdf, other]

    cs.LG q-bio.QM

    Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Information

    Authors: Aishwarya Jayagopal, Hansheng Xue, Ziyang He, Robert J. Walsh, Krishna Kumar Hariprasannan, David Shao Peng Tan, Tuan Zea Tan, Jason J. Pitt, Anand D. Jeyasekharan, Vaibhav Rajan

    Abstract: Cancer remains a global challenge due to its growing clinical and economic burden. Its uniquely personal manifestation, which makes treatment difficult, has fuelled the quest for personalized treatment strategies. Thus, genomic profiling is increasingly becoming part of clinical diagnostic panels. Effective use of such panels requires accurate drug response prediction (DRP) models, which are chall…

    Submitted 16 February, 2024; originally announced February 2024.

  37. arXiv:2402.10426 [pdf, other]

    cs.CL

    DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

    Authors: Herun Wan, Shangbin Feng, Zhaoxuan Tan, Heng Wang, Yulia Tsvetkov, Minnan Luo

    Abstract: Large language models are limited by challenges in factuality and hallucinations to be directly employed off-the-shelf for judging the veracity of news articles, where factual accuracy is paramount. In this work, we propose DELL that identifies three key stages in misinformation detection where LLMs could be incorporated as part of the pipeline: 1) LLMs could \emph{generate news reactions} to repr…

    Submitted 15 February, 2024; originally announced February 2024.

  38. arXiv:2402.10058 [pdf, other]

    cs.CL

    Towards Safer Large Language Models through Machine Unlearning

    Authors: Zheyuan Liu, Guangyao Dou, Zhaoxuan Tan, Yijun Tian, Meng Jiang

    Abstract: The rapid advancement of Large Language Models (LLMs) has demonstrated their vast potential across various domains, attributed to their extensive pretraining knowledge and exceptional generalizability. However, LLMs often encounter challenges in generating harmful content when faced with problematic prompts. To address this problem, existing work attempted to implement a gradient ascent based appr…

    Submitted 5 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 Findings

  39. arXiv:2402.07386 [pdf, other]

    cs.CL

    Chain-of-Layer: Iteratively Prompting Large Language Models for Taxonomy Induction from Limited Examples

    Authors: Qingkai Zeng, Yuyang Bai, Zhaoxuan Tan, Shangbin Feng, Zhenwen Liang, Zhihan Zhang, Meng Jiang

    Abstract: Automatic taxonomy induction is crucial for web search, recommendation systems, and question answering. Manual curation of taxonomies is expensive in terms of human effort, making automatic taxonomy construction highly desirable. In this work, we introduce Chain-of-Layer which is an in-context learning framework designed to induct taxonomies from a given set of entities. Chain-of-Layer breaks down…

    Submitted 11 February, 2024; originally announced February 2024.

  40. arXiv:2402.05366 [pdf, other]

    physics.med-ph

    Model-Based Reconstruction for Joint Estimation of $T_{1}$, $R_{2}^{*}$ and $B_{0}$ Field Maps Using Single-Shot Inversion-Recovery Multi-Echo Radial FLASH

    Authors: Xiaoqing Wang, Nick Scholand, Zhengguo Tan, Daniel Mackner, Vitali Telezki, Moritz Blumenthal, Philip Schaten, Martin Uecker

    Abstract: Purpose: To develop a model-based nonlinear reconstruction for simultaneous water-specific $T_{1}$, $R_{2}^{*}$, $B_{0}$ field and/or fat fraction (FF) mapping using single-shot inversion-recovery (IR) multi-echo radial FLASH. Methods: The proposed model-based reconstruction jointly estimates water-specific $T_{1}$, $R_{2}^{*}$, $B_{0}$ field and/or FF maps, as well as a set of coil sensitivitie…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Part of this work has been presented at the ISMRM Workshop on Data Sampling and Image Reconstruction, Sedona, 2023. Submitted to Magnetic Resonance in Medicine

  41. arXiv:2402.05136 [pdf, other]

    cs.CL

    LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K

    Authors: Tao Yuan, Xuefei Ning, Dong Zhou, Zhijie Yang, Shiyao Li, Minghui Zhuang, Zheyue Tan, Zhuyu Yao, Dahua Lin, Boxun Li, Guohao Dai, Shengen Yan, Yu Wang

    Abstract: State-of-the-art large language models (LLMs) are now claiming remarkable supported context lengths of 256k or even more. In contrast, the average context lengths of mainstream benchmarks are insufficient (5k-21k), and they suffer from potential knowledge leakage and inaccurate metrics, resulting in biased evaluation. This paper introduces LV-Eval, a challenging long-context benchmark with five le…

    Submitted 6 February, 2024; originally announced February 2024.

  42. arXiv:2402.04401  [pdf, other]

    cs.CL

    Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning

    Authors: Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang

    Abstract: Personalization in large language models (LLMs) is increasingly important, aiming to align LLM's interactions, content, and recommendations with individual user preferences. Recent advances in LLM personalization have spotlighted effective prompt design, by enriching user queries with non-parametric knowledge through behavior history retrieval and textual profiles. However, these approaches were l…

    Submitted 6 February, 2024; originally announced February 2024.

  43. arXiv:2402.03471  [pdf, other]

    cs.LG cs.AI cs.CL cs.IT

    The Information of Large Language Model Geometry

    Authors: Zhiquan Tan, Chenghai Li, Weiran Huang

    Abstract: This paper investigates the information encoded in the embeddings of large language models (LLMs). We conduct simulations to analyze the representation entropy and discover a power law relationship with model sizes. Building upon this observation, we propose a theory based on (conditional) entropy to elucidate the scaling law phenomenon. Furthermore, we delve into the auto-regressive structure of…

    Submitted 1 February, 2024; originally announced February 2024.

  44. arXiv:2402.02327  [pdf, other]

    cs.CV cs.SD eess.AS

    Bootstrapping Audio-Visual Segmentation by Strengthening Audio Cues

    Authors: Tianxiang Chen, Zhentao Tan, Tao Gong, Qi Chu, Yue Wu, Bin Liu, Le Lu, Jieping Ye, Nenghai Yu

    Abstract: How to effectively interact audio with vision has garnered considerable interest within the multi-modality research field. Recently, a novel audio-visual segmentation (AVS) task has been proposed, aiming to segment the sounding objects in video frames under the guidance of audio cues. However, most existing AVS methods are hindered by a modality imbalance where the visual features tend to dominate…

    Submitted 6 February, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  45. arXiv:2402.02046  [pdf, other]

    cs.CV

    TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target Detection

    Authors: Tianxiang Chen, Zhentao Tan, Qi Chu, Yue Wu, Bin Liu, Nenghai Yu

    Abstract: Infrared small target detection (ISTD) is critical to national security and has been extensively applied in military areas. ISTD aims to segment small target pixels from background. Most ISTD networks focus on designing feature extraction blocks or feature fusion modules, but rarely describe the ISTD process from the feature map evolution perspective. In the ISTD process, the network attention gra…

    Submitted 3 February, 2024; originally announced February 2024.

  46. arXiv:2402.01729  [pdf, other]

    cs.CL cs.AI

    Contextualization Distillation from Large Language Model for Knowledge Graph Completion

    Authors: Dawei Li, Zhen Tan, Tianlong Chen, Huan Liu

    Abstract: While textual information significantly enhances the performance of pre-trained language models (PLMs) in knowledge graph completion (KGC), the static and noisy nature of existing corpora collected from Wikipedia articles or synsets definitions often limits the potential of PLM-based KGC models. To surmount these challenges, we introduce the Contextualization Distillation strategy, a versatile plu…

    Submitted 24 February, 2024; v1 submitted 28 January, 2024; originally announced February 2024.

    Comments: Accepted by EACL 2024 findings v3: add missing citations

  47. arXiv:2402.00986  [pdf, other]

    cs.PL

    The Parallel Semantics Program Dependence Graph

    Authors: Brian Homerding, Atmn Patel, Enrico Armenio Deiana, Yian Su, Zujun Tan, Ziyang Xu, Bhargav Reddy Godala, David I. August, Simone Campanoni

    Abstract: A compiler's intermediate representation (IR) defines a program's execution plan by encoding its instructions and their relative order. Compiler optimizations aim to replace a given execution plan with a semantically-equivalent one that increases the program's performance for the target architecture. Alternative representations of an IR, like the Program Dependence Graph (PDG), aid this process by…

    Submitted 1 February, 2024; originally announced February 2024.

  48. arXiv:2402.00629  [pdf, other]

    cs.AR

    Cocco: Hardware-Mapping Co-Exploration towards Memory Capacity-Communication Optimization

    Authors: Zhanhong Tan, Zijian Zhu, Kaisheng Ma

    Abstract: Memory is a critical design consideration in current data-intensive DNN accelerators, as it profoundly determines energy consumption, bandwidth requirements, and area costs. As DNN structures become more complex, a larger on-chip memory capacity is required to reduce data movement overhead, but at the expense of silicon costs. Some previous works have proposed memory-oriented optimizations, such a…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted by 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'24)

  49. arXiv:2402.00380  [pdf, ps, other]

    math.NA

    $n$-Dimensional Volumetric Stretch Energy Minimization for Volume-/Mass-Preserving Parameterizations

    Authors: Zhong-Heng Tan, Tiexiang Li, Wen-Wei Lin, Shing-Tung Yau

    Abstract: In this paper, we develop an $n$ dimensional volumetric stretch energy ($n$-VSE) functional for the volume-/mass-preserving parameterization of the $n$-manifolds topologically equivalent to $n$-ball. The $n$-VSE has a lower bound and equal to it if and only if the map is volume-/mass-preserving. This motivates us to minimize the $n$-VSE to achieve the ideal volume-/mass-preserving parameterization…

    Submitted 1 February, 2024; originally announced February 2024.

    MSC Class: 49Q10; 52C26; 65D18; 65F05; 68U05

  50. arXiv:2402.00371  [pdf, other]

    cs.CL

    What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

    Authors: Shangbin Feng, Herun Wan, Ningnan Wang, Zhaoxuan Tan, Minnan Luo, Yulia Tsvetkov

    Abstract: Social media bot detection has always been an arms race between advancements in machine learning bot detectors and adversarial bot strategies to evade detection. In this work, we bring the arms race to the next level by investigating the opportunities and risks of state-of-the-art large language models (LLMs) in social bot detection. To investigate the opportunities, we design novel LLM-based bot…

    Submitted 1 February, 2024; originally announced February 2024.