Skip to main content

Showing 151–200 of 2,649 results for author: Chen, P

.
  1. arXiv:2402.13724  [pdf, other

    cs.HC cs.CV

    Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters

    Authors: Zechen Bai, Peng Chen, Xiaolan Peng, Lu Liu, Hui Chen, Mike Zheng Shou, Feng Tian

    Abstract: Animating virtual characters has always been a fundamental research problem in virtual reality (VR). Facial animations play a crucial role as they effectively convey emotions and attitudes of virtual humans. However, creating such facial animations can be challenging, as current methods often involve utilization of expensive motion capture devices or significant investments of time and effort from… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 9 pages. To appear in IEEE-VR

  2. arXiv:2402.13502  [pdf, other

    astro-ph.SR

    Statistical Analyses of Solar Prominences and Active Region Features in 304 Å Filtergrams detected via Deep Learning

    Authors: T. Zhang, Q. Hao, P. F. Chen

    Abstract: Solar active regions (ARs) are areas on the Sun with very strong magnetic fields where various activities take place. Prominences are one of the typical solar features in the solar atmosphere, whose eruptions often lead to solar flares and coronal mass ejections (CMEs). Therefore, studying their morphological features and their relationship with solar activity is useful in predicting eruptive even… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 27 pages, 32 figures, Accepted for publication in ApJS

  3. arXiv:2402.11592  [pdf, other

    cs.LG cs.CL

    Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

    Authors: Yihua Zhang, **zhi Li, Junyuan Hong, Jiaxiang Li, Yimeng Zhang, Wenqing Zheng, Pin-Yu Chen, Jason D. Lee, Wotao Yin, Mingyi Hong, Zhangyang Wang, Sijia Liu, Tianlong Chen

    Abstract: In the evolving landscape of natural language processing (NLP), fine-tuning pre-trained Large Language Models (LLMs) with first-order (FO) optimizers like SGD and Adam has become standard. Yet, as LLMs grow {in size}, the substantial memory overhead from back-propagation (BP) for FO gradient computation presents a significant challenge. Addressing this issue is crucial, especially for applications… ▽ More

    Submitted 27 May, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  4. arXiv:2402.08875  [pdf, other

    cs.CV

    Advancing Human Action Recognition with Foundation Models trained on Unlabeled Public Videos

    Authors: Yang Qian, Yinan Sun, Ali Kargarandehkordi, Onur Cezmi Mutlu, Saimourya Surabhi, **yi Chen, Zain Jabbar, Dennis Paul Wall, Peter Washington

    Abstract: The increasing variety and quantity of tagged multimedia content on platforms such as TikTok provides an opportunity to advance computer vision modeling. We have curated a distinctive dataset of 283,582 unique video clips categorized under 386 hashtags relating to modern human actions. We release this dataset as a valuable resource for building domain-specific foundation models for human movement… ▽ More

    Submitted 19 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 10 pages

  5. arXiv:2402.05956  [pdf, other

    cs.LG

    Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting

    Authors: Peng Chen, Yingying Zhang, Yunyao Cheng, Yang Shu, Yihang Wang, Qingsong Wen, Bin Yang, Chenjuan Guo

    Abstract: Transformers for time series forecasting mainly model time series from limited or fixed scales, making it challenging to capture different characteristics spanning various scales. We propose Pathformer, a multi-scale Transformer with adaptive pathways. It integrates both temporal resolution and temporal distance for multi-scale modeling. Multi-scale division divides the time series into different… ▽ More

    Submitted 6 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by the 12th International Conference on Learning Representations (ICLR 2024)

  6. arXiv:2402.05637  [pdf, other

    cs.CV

    Learning pseudo-contractive denoisers for inverse problems

    Authors: Deliang Wei, Peng Chen, Fang Li

    Abstract: Deep denoisers have shown excellent performance in solving inverse problems in signal and image processing. In order to guarantee the convergence, the denoiser needs to satisfy some Lipschitz conditions like non-expansiveness. However, enforcing such constraints inevitably compromises recovery performance. This paper introduces a novel training strategy that enforces a weaker constraint on the dee… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    MSC Class: 68T07; 68U10; 68U10; 47J07; 94A08; 94A08; 90C25

  7. arXiv:2402.05457  [pdf, other

    cs.CL cs.AI cs.MM cs.SD eess.AS

    It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition

    Authors: Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Ensiong Chng, Chao-Han Huck Yang

    Abstract: Recent studies have successfully shown that large language models (LLMs) can be successfully used for generative error correction (GER) on top of the automatic speech recognition (ASR) output. Specifically, an LLM is utilized to carry out a direct map** from the N-best hypotheses list generated by an ASR system to the predicted output transcription. However, despite its effectiveness, GER introd… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted to ICLR 2024, 17 pages. This work will be open sourced under MIT license

  8. arXiv:2402.05410  [pdf, other

    cs.CV

    SpirDet: Towards Efficient, Accurate and Lightweight Infrared Small Target Detector

    Authors: Qianchen Mao, Qiang Li, Bingshu Wang, Yongjun Zhang, Tao Dai, C. L. Philip Chen

    Abstract: In recent years, the detection of infrared small targets using deep learning methods has garnered substantial attention due to notable advancements. To improve the detection capability of small targets, these methods commonly maintain a pathway that preserves high-resolution features of sparse and tiny targets. However, it can result in redundant and expensive computations. To tackle this challeng… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  9. arXiv:2402.04699  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Breaking Free: How to Hack Safety Guardrails in Black-Box Diffusion Models!

    Authors: Shashank Kotyan, Po-Yuan Mao, Pin-Yu Chen, Danilo Vasconcellos Vargas

    Abstract: Deep neural networks can be exploited using natural adversarial samples, which do not impact human perception. Current approaches often rely on deep neural networks' white-box nature to generate these adversarial samples or synthetically alter the distribution of adversarial samples compared to the training distribution. In contrast, we propose EvoSeed, a novel evolutionary strategy-based algorith… ▽ More

    Submitted 22 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  10. arXiv:2402.02417  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Revealing flat bands and hybridization gaps in a twisted bilayer graphene device with microARPES

    Authors: Zhihao Jiang, Kimberly Hsieh, Alfred J. H. Jones, Paulina Majchrzak, Chakradhar Sahoo, Kenji Watanabe, Takashi Taniguchi, Jill A. Miwa, Yong P. Chen, Søren Ulstrup

    Abstract: Controlling the electronic structure of two-dimensional materials using the combination of twist angle and electrostatic do** is an effective means to induce emergent phenomena. In bilayer graphene with an interlayer twist angle near the magic angle, the electronic dispersion is strongly modified by a manifold of hybridizing moiré Dirac cones leading to flat band segments with strong electronic… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 21 pages, 5 figures

    Journal ref: 2D Mater. 10, 045027 (2023)

  11. arXiv:2402.02155  [pdf, ps, other

    math.OC

    Penalty-based Methods for Simple Bilevel Optimization under Hölderian Error Bounds

    Authors: Pengyu Chen, Xu Shi, Rujun Jiang, Jiulin Wang

    Abstract: This paper investigates simple bilevel optimization problems where the upper-level objective minimizes a composite convex function over the optimal solutions of a composite convex lower-level problem. Existing methods for such problems either only guarantee asymptotic convergence, have slow sublinear rates, or require strong assumptions. To address these challenges, we develop a novel penalty-base… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  12. arXiv:2402.02140  [pdf, other

    cs.CV eess.IV

    Generative Visual Compression: A Review

    Authors: Bolin Chen, Shanzhi Yin, Peilin Chen, Shiqi Wang, Yan Ye

    Abstract: Artificial Intelligence Generated Content (AIGC) is leading a new technical revolution for the acquisition of digital content and impelling the progress of visual compression towards competitive performance gains and diverse functionalities over traditional codecs. This paper provides a thorough review on the recent advances of generative visual compression, illustrating great potentials and promi… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  13. arXiv:2402.01911  [pdf, other

    cs.LG

    From PEFT to DEFT: Parameter Efficient Finetuning for Reducing Activation Density in Transformers

    Authors: Bharat Runwal, Tejaswini Pedapati, Pin-Yu Chen

    Abstract: Pretrained Language Models (PLMs) have become the de facto starting point for fine-tuning on downstream tasks. However, as model sizes continue to increase, traditional fine-tuning of all parameters becomes challenging. To address this, parameter-efficient fine-tuning (PEFT) methods have gained popularity as a means to adapt PLMs effectively. In parallel, recent studies have revealed the presence… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Preprint

  14. arXiv:2402.01162  [pdf, other

    cs.CV cs.AI

    2AFC Prompting of Large Multimodal Models for Image Quality Assessment

    Authors: Hanwei Zhu, Xiangjie Sui, Baoliang Chen, Xuelin Liu, Peilin Chen, Yuming Fang, Shiqi Wang

    Abstract: While abundant research has been conducted on improving high-level visual understanding and reasoning capabilities of large multimodal models~(LMMs), their visual quality assessment~(IQA) ability has been relatively under-explored. Here we take initial steps towards this goal by employing the two-alternative forced choice~(2AFC) prompting, as 2AFC is widely regarded as the most reliable way of col… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  15. arXiv:2401.17203  [pdf, other

    cs.CV

    CPR++: Object Localization via Single Coarse Point Supervision

    Authors: Xuehui Yu, Pengfei Chen, Kuiran Wang, Xumeng Han, Guorong Li, Zhenjun Han, Qixiang Ye, Jianbin Jiao

    Abstract: Point-based object localization (POL), which pursues high-performance object sensing under low-cost data annotation, has attracted increased attention. However, the point annotation mode inevitably introduces semantic variance due to the inconsistency of annotated points. Existing POL heavily rely on strict annotation rules, which are difficult to define and apply, to handle the problem. In this s… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accpted by TPAMI 2024

  16. arXiv:2401.15951  [pdf, other

    quant-ph

    Observation of quantum strong Mpemba effect

    Authors: Jie Zhang, Gang Xia, Chun-Wang Wu, Ting Chen, Qian Zhang, Yi Xie, Wen-Bo Su, Wei Wu, Cheng-Wei Qiu, **-xing Chen, Weibin Li, Hui **g, Yan-Li Zhou

    Abstract: An ancient and counterintuitive phenomenon know as the Mpemba effect (water can cool faster when initially heated up) showcases the critical role of initial conditions in relaxation processes. How to realize and utilize this effect for speeding up relaxation is an important but challenging task in purely quantum system till now. Here, we report the first experiment, as far as we know,about the str… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  17. arXiv:2401.15148  [pdf, other

    astro-ph.HE astro-ph.SR

    Spectroscopic observations of progenitor activity 100 days before a Type Ibn supernova

    Authors: S. J. Brennan, J. Sollerman, I. Irani, S. Schulze, P. Chen, K. K. Das, K. De, C. Fransson, A. Gal-Yam, A. Gkini, K. R. Hinds, R. Lunnan, D. Perley, YJ. Qin, R. Stein, J. Wise, L. Yan, E. A. Zimmerman, S. Anand, R. J. Bruch, R. Dekany, A. J. Drake, C. Fremling, B. Healy, V. Karambelkar , et al. (8 additional authors not shown)

    Abstract: Obtaining spectroscopic observations of the progenitors of core-collapse supernovae is often unfeasible due to an inherent lack of knowledge as to which stars will go supernova and when they will explode. In this letter, we present photometric and spectroscopic observations of the progenitor activity of SN 2023fyq in the preceding 150 days before the He-rich progenitor exploded as a Type Ibn super… ▽ More

    Submitted 25 March, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 7 Pages, 5 Figures, accepted to A&A Letters

    Journal ref: A&A 684, L18 (2024)

  18. arXiv:2401.14044  [pdf

    physics.app-ph

    Electrical switching of the perpendicular Neel order in a collinear antiferromagnet

    Authors: Wenqing He, Tianyi Zhang, Yongjian Zhou, Caihua Wan, Hao Wu, Baoshan Cui, Jihao Xia, Ran Zhang, Tengyu Guo, Peng Chen, Mingkun Zhao, Leina Jiang, Alexander Grutter, Purnima P. Balakrishnan, Andrew J. Caruana, Christy J. Kinane, Sean Langridge, Guoqiang Yu, Cheng Song, Xiufeng Han

    Abstract: Electrical manipulation of magnetic order by current-induced spin torques lays the foundation for spintronics. One promising approach is encoding information in the Néel vector of antiferromagnetic (AFM) materials, particularly to collinear antiferromagnets with the perpendicular magnetic anisotropy (PMA), as the negligible stray fields and terahertz spin dynamics can enable memory devices with hi… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  19. arXiv:2401.14034  [pdf, other

    cs.CV

    Unsupervised Spatial-Temporal Feature Enrichment and Fidelity Preservation Network for Skeleton based Action Recognition

    Authors: Chuankun Li, Shuai Li, Yanbo Gao, ** Chen, Jian Li, Wanqing Li

    Abstract: Unsupervised skeleton based action recognition has achieved remarkable progress recently. Existing unsupervised learning methods suffer from severe overfitting problem, and thus small networks are used, significantly reducing the representation capability. To address this problem, the overfitting mechanism behind the unsupervised learning for skeleton based action recognition is first investigated… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  20. arXiv:2401.13886  [pdf

    cond-mat.mes-hall

    Observation of possible excitonic charge density waves and metal-insulator transitions in atomically thin semimetals

    Authors: Qiang Gao, Yang-hao Chan, Pengfei Jiao, Haiyang Chen, Shuaishuai Yin, Kanjanaporn Tangprapha, Yichen Yang, Xiaolong Li, Zhengtai Liu, Dawei Shen, Shengwei Jiang, Peng Chen

    Abstract: Charge density wave (CDW) is a collective quantum phenomenon with a charge modulation in solids1-2. Condensation of electron and hole pairs with finite momentum will lead to such an ordered state3-7. However, lattice symmetry breaking manifested as the softening of phonon modes can occur simultaneously, which makes it difficult to disentangle the origin of the transition8-14. Here, we report a con… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: https://www.nature.com/articles/s41567-023-02349-0 published in Nature Physics

  21. arXiv:2401.13280  [pdf, other

    cs.CV cs.CE

    DDI-CoCo: A Dataset For Understanding The Effect Of Color Contrast In Machine-Assisted Skin Disease Detection

    Authors: Ming-Chang Chiu, Yingfei Wang, Yen-Ju Kuo, Pin-Yu Chen

    Abstract: Skin tone as a demographic bias and inconsistent human labeling poses challenges in dermatology AI. We take another angle to investigate color contrast's impact, beyond skin tones, on malignancy detection in skin disease datasets: We hypothesize that in addition to skin tones, the color difference between the lesion area and skin also plays a role in malignancy detection performance of dermatology… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 5 pages, 4 figures, 2 tables, Accepted to ICASSP 2024

  22. arXiv:2401.12728  [pdf, other

    astro-ph.SR astro-ph.GA

    Filamentary Network and Magnetic Field Structures Revealed with BISTRO in the High-Mass Star-Forming Region NGC2264 : Global Properties and Local Magnetogravitational Configurations

    Authors: Jia-Wei Wang, Patrick M. Koch, Seamus D. Clarke, Gary Fuller, Nicolas Peretto, Ya-Wen Tang, Hsi-Wei Yen, Shih-** Lai, Nagayoshi Ohashi, Doris Arzoumanian, Doug Johnstone, Ray Furuya, Shu-ichiro Inutsuka, Chang Won Lee, Derek Ward-Thompson, Valentin J. M. Le Gouellec, Hong-Li Liu, Lapo Fanciullo, Jihye Hwang, Kate Pattle, Frédérick Poidevin, Mehrnoosh Tahani, Takashi Onaka, Mark G. Rawlings, Eun Jung Chung , et al. (132 additional authors not shown)

    Abstract: We report 850 $μ$m continuum polarization observations toward the filamentary high-mass star-forming region NGC 2264, taken as part of the B-fields In STar forming Regions Observations (BISTRO) large program on the James Clerk Maxwell Telescope (JCMT). These data reveal a well-structured non-uniform magnetic field in the NGC 2264C and 2264D regions with a prevailing orientation around 30 deg from… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in the Astrophysical Journal. 43 pages, 32 figures, and 4 tables (including Appendix)

  23. arXiv:2401.11946  [pdf, other

    cs.CR

    A Dynamic YOLO-Based Sequence-Matching Model for Efficient Coverless Image Steganography

    Authors: Jiajun Liu, Lina Tan, Zhili Zhou, Yi Li, Peng Chen

    Abstract: Many existing coverless steganography methods establish a map** relationship between cover images and hidden data. There exists an issue that the number of images stored in the database grows exponentially as the steganographic capacity rises. The need for a high steganographic capacity makes it challenging to build an image database. To improve the image library utilization and anti-attack capa… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  24. arXiv:2401.11436  [pdf, other

    cs.CV

    Geometric Prior Guided Feature Representation Learning for Long-Tailed Classification

    Authors: Yanbiao Ma, Licheng Jiao, Fang Liu, Shuyuan Yang, Xu Liu, Puhua Chen

    Abstract: Real-world data are long-tailed, the lack of tail samples leads to a significant limitation in the generalization ability of the model. Although numerous approaches of class re-balancing perform well for moderate class imbalance problems, additional knowledge needs to be introduced to help the tail class recover the underlying true distribution when the observed distribution from a few tail sample… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: This work was accepted by the IJCV

  25. arXiv:2401.10446  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Large Language Models are Efficient Learners of Noise-Robust Speech Recognition

    Authors: Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng

    Abstract: Recent advances in large language models (LLMs) have promoted generative error correction (GER) for automatic speech recognition (ASR), which leverages the rich linguistic knowledge and powerful reasoning ability of LLMs to improve recognition results. The latest work proposes a GER benchmark with HyPoradise dataset to learn the map** from ASR N-best hypotheses to ground-truth transcription by e… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR 2024, Spotlight top 5%, 24 pages. This work will be open sourced at: https://github.com/YUCHEN005/RobustGER under MIT license

  26. arXiv:2401.08577  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.RO

    MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World

    Authors: Yining Hong, Zishuo Zheng, Peihao Chen, Yian Wang, Junyan Li, Chuang Gan

    Abstract: Human beings possess the capability to multiply a melange of multisensory cues while actively exploring and interacting with the 3D world. Current multi-modal large language models, however, passively absorb sensory data as inputs, lacking the capacity to actively interact with the objects in the 3D environment and dynamically collect their multisensory information. To usher in the study of this a… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Project page: https://vis-www.cs.umass.edu/multiply

  27. arXiv:2401.08276  [pdf, other

    cs.CV cs.CL

    AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics Perception

    Authors: Yipo Huang, Quan Yuan, Xiangfei Sheng, Zhichao Yang, Haoning Wu, Pengfei Chen, Yuzhe Yang, Leida Li, Weisi Lin

    Abstract: With collective endeavors, multimodal large language models (MLLMs) are undergoing a flourishing development. However, their performances on image aesthetics perception remain indeterminate, which is highly desired in real-world applications. An obvious obstacle lies in the absence of a specific benchmark to evaluate the effectiveness of MLLMs on aesthetic perception. This blind gro** may impede… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  28. arXiv:2401.08107  [pdf, other

    cs.CV cs.MM

    Deep Shape-Texture Statistics for Completely Blind Image Quality Evaluation

    Authors: Yixuan Li, Peilin Chen, Hanwei Zhu, Keyan Ding, Leida Li, Shiqi Wang

    Abstract: Opinion-Unaware Blind Image Quality Assessment (OU-BIQA) models aim to predict image quality without training on reference images and subjective quality scores. Thereinto, image statistical comparison is a classic paradigm, while the performance is limited by the representation ability of visual descriptors. Deep features as visual descriptors have advanced IQA in recent research, but they are dis… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  29. The role of electromagnetic interaction in the $X(3872)$ and its analogs

    Authors: ** Chen, Zhan-Wei Liu, Zi-Le Zhang, Si-Qiang Luo, Fu-Lai Wang, Jun-Zhang Wang, Xiang Liu

    Abstract: We investigate the role of the electromagnetic interaction in the formation and decay of the $X(3872)$. The binding properties of the $X(3872)$ are studied by assuming the molecular nature and considering the $S$-$D$ wave mixing, isospin breaking, and coupled channel effects, and in particular the correction from the electromagnetic interaction. The radiative decays can better reflect the differen… ▽ More

    Submitted 1 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 17 pages, 8 figures, 7 tables

    Journal ref: Phys.Rev.D 109, 094002 (2024)

  30. arXiv:2401.05800  [pdf, other

    cs.LG cs.AI

    Graph Spatiotemporal Process for Multivariate Time Series Anomaly Detection with Missing Values

    Authors: Yu Zheng, Huan Yee Koh, Ming **, Lianhua Chi, Haishuai Wang, Khoa T. Phan, Yi-** Phoebe Chen, Shirui Pan, Wei Xiang

    Abstract: The detection of anomalies in multivariate time series data is crucial for various practical applications, including smart power grids, traffic flow forecasting, and industrial process control. However, real-world time series data is usually not well-structured, posting significant challenges to existing approaches: (1) The existence of missing values in multivariate time series data along variabl… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted by Information Fusion

  31. arXiv:2401.05561  [pdf, other

    cs.CL

    TrustLLM: Trustworthiness in Large Language Models

    Authors: Lichao Sun, Yue Huang, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang , et al. (45 additional authors not shown)

    Abstract: Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in… ▽ More

    Submitted 17 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: This work is still under work and we welcome your contribution

  32. arXiv:2401.03371  [pdf

    physics.app-ph physics.data-an

    Advancing Noise-Resilient Twist Angle Characterization in Bilayer Graphene through Raman Spectroscopy via GAN-CNN Modeling

    Authors: Dan Hu, Ting-Fung Chung, Yong P. Chen, Ya** Qi

    Abstract: In this study, we introduce an innovative methodology for robust twist angle identification in bilayer graphene using Raman spectroscopy, featuring the integration of generative adversarial network and convolutional neural network (GAN-CNN). Our proposed approach showcases remarkable resistance to noise interference, particularly in ultra-low Signal-to-Noise Ratio (SNR) conditions. We demonstrate… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  33. arXiv:2401.02651  [pdf, other

    cs.CV

    Benchmarking PathCLIP for Pathology Image Analysis

    Authors: Sunyi Zheng, Xiaonan Cui, Yuxuan Sun, **gxiong Li, Honglin Li, Yunlong Zhang, **yi Chen, Xue** **g, Zhaoxiang Ye, Lin Yang

    Abstract: Accurate image classification and retrieval are of importance for clinical diagnosis and treatment decision-making. The recent contrastive language-image pretraining (CLIP) model has shown remarkable proficiency in understanding natural images. Drawing inspiration from CLIP, PathCLIP is specifically designed for pathology image analysis, utilizing over 200,000 image and text pairs in training. Whi… ▽ More

    Submitted 12 June, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

  34. arXiv:2401.02611  [pdf, other

    cs.CV

    MOODv2: Masked Image Modeling for Out-of-Distribution Detection

    Authors: **gyao Li, Pengguang Chen, Shaozuo Yu, Shu Liu, Jiaya Jia

    Abstract: The crux of effective out-of-distribution (OOD) detection lies in acquiring a robust in-distribution (ID) representation, distinct from OOD samples. While previous methods predominantly leaned on recognition-based techniques for this purpose, they often resulted in shortcut learning, lacking comprehensive representations. In our study, we conducted a comprehensive analysis, exploring distinct pret… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  35. arXiv:2401.01921  [pdf, other

    cs.MS cond-mat.str-el

    The Cytnx Library for Tensor Networks

    Authors: Kai-Hsin Wu, Chang-Teng Lin, Ke Hsu, Hao-Ti Hung, Manuel Schneider, Chia-Min Chung, Ying-Jer Kao, Pochung Chen

    Abstract: We introduce a tensor network library designed for classical and quantum physics simulations called Cytnx (pronounced as sci-tens). This library provides almost an identical interface and syntax for both C++ and Python, allowing users to effortlessly switch between two languages. Aiming at a quick learning process for new users of tensor network algorithms, the interfaces resemble the popular Pyth… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  36. arXiv:2401.01517  [pdf, other

    astro-ph.GA

    The rotation curve and mass distribution of M31

    Authors: Xiangwei Zhang, Bingqiu Chen, Pinjian Chen, Jiarui Sun, Zhijia Tian

    Abstract: To gain a better understanding of the Andromeda galaxy M31 and its role in the Local Group, measuring its mass precisely is essential. In this work, we have constructed the rotation curve of M31 out to $\sim$125 kpc using 13,679 M31 objects obtained from various sources, including the LAMOST data release 9 (LAMOST DR9), the DESI survey, and relevant literature. We divide all objects in our sample… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  37. arXiv:2312.17677  [pdf, other

    cs.CR cs.SE

    Prompt Fuzzing for Fuzz Driver Generation

    Authors: Yunlong Lyu, Yuxuan Xie, Peng Chen, Hao Chen

    Abstract: Crafting high-quality fuzz drivers not only is time-consuming but also requires a deep understanding of the library. However, the state-of-the-art automatic fuzz driver generation techniques fall short of expectations. While fuzz drivers derived from consumer code can reach deep states, they have limited coverage. Conversely, interpretative fuzzing can explore most API calls but requires numerous… ▽ More

    Submitted 29 May, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: To appear in the ACM CCS 2024

  38. arXiv:2312.17611  [pdf, other

    cs.CV

    P2M2-Net: Part-Aware Prompt-Guided Multimodal Point Cloud Completion

    Authors: Linlian Jiang, Pan Chen, Ye Wang, Tieru Wu, Rui Ma

    Abstract: Inferring missing regions from severely occluded point clouds is highly challenging. Especially for 3D shapes with rich geometry and structure details, inherent ambiguities of the unknown parts are existing. Existing approaches either learn a one-to-one map** in a supervised manner or train a generative model to synthesize the missing points for the completion of 3D point cloud shapes. These met… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: Best Poster Award of CAD/Graphics 2023

  39. arXiv:2312.17080  [pdf, other

    cs.CL

    MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation

    Authors: Zhongshen Zeng, Pengguang Chen, Shu Liu, Haiyun Jiang, Jiaya Jia

    Abstract: In this work, we introduce a novel evaluation paradigm for Large Language Models (LLMs) that compels them to transition from a traditional question-answering role, akin to a student, to a solution-scoring role, akin to a teacher. This paradigm, focusing on "reasoning about reasoning," hence termed meta-reasoning, shifts the emphasis from result-oriented assessments, which often neglect the reasoni… ▽ More

    Submitted 5 June, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Code: https://github.com/dvlab-research/MR-GSM8K

  40. arXiv:2312.16467  [pdf, other

    cs.CL cs.LG

    Transfer and Alignment Network for Generalized Category Discovery

    Authors: Wenbin An, Feng Tian, Wenkai Shi, Yan Chen, Yaqiang Wu, Qianying Wang, ** Chen

    Abstract: Generalized Category Discovery is a crucial real-world task. Despite the improved performance on known categories, current methods perform poorly on novel categories. We attribute the poor performance to two reasons: biased knowledge transfer between labeled and unlabeled data and noisy representation learning on the unlabeled data. To mitigate these two issues, we propose a Transfer and Alignment… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  41. arXiv:2312.15960  [pdf, other

    cs.LG cs.PL cs.SE

    MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks

    Authors: **gyao Li, Pengguang Chen, Jiaya Jia

    Abstract: Large Language Models (LLMs) have showcased impressive capabilities in handling straightforward programming tasks. However, their performance tends to falter when confronted with more challenging programming problems. We observe that conventional models often generate solutions as monolithic code blocks, restricting their effectiveness in tackling intricate questions. To overcome this limitation,… ▽ More

    Submitted 5 January, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: Model: https://huggingface.co/**gyaoLi/MoTCoder-15B-v1.0. Code: https://github.com/dvlab-research/MoTCoder

  42. arXiv:2312.15944  [pdf, other

    cs.LG cs.CV

    BAL: Balancing Diversity and Novelty for Active Learning

    Authors: **gyao Li, Pengguang Chen, Shaozuo Yu, Shu Liu, Jiaya Jia

    Abstract: The objective of Active Learning is to strategically label a subset of the dataset to maximize performance within a predetermined labeling budget. In this study, we harness features acquired through self-supervised learning. We introduce a straightforward yet potent metric, Cluster Distance Difference, to identify diverse data. Subsequently, we introduce a novel framework, Balancing Active Learnin… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: Our paper is accepted by TPAMI

  43. arXiv:2312.15895  [pdf, other

    cs.CV

    Semantic-aware SAM for Point-Prompted Instance Segmentation

    Authors: Zhaoyang Wei, Pengfei Chen, Xuehui Yu, Guorong Li, Jianbin Jiao, Zhenjun Han

    Abstract: Single-point annotation in visual tasks, with the goal of minimizing labelling costs, is becoming increasingly prominent in research. Recently, visual foundation models, such as Segment Anything (SAM), have gained widespread usage due to their robust zero-shot capabilities and exceptional annotation performance. However, SAM's class-agnostic output and high confidence in local segmentation introdu… ▽ More

    Submitted 26 May, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 16 pages, 8 figures, CVPR2024

  44. arXiv:2312.14810  [pdf, other

    cs.CE math.OC stat.ME

    Accurate, scalable, and efficient Bayesian Optimal Experimental Design with derivative-informed neural operators

    Authors: **woo Go, Peng Chen

    Abstract: We consider optimal experimental design (OED) problems in selecting the most informative observation sensors to estimate model parameters in a Bayesian framework. Such problems are computationally prohibitive when the parameter-to-observable (PtO) map is expensive to evaluate, the parameters are high-dimensional, and the optimization for sensor selection is combinatorial and high-dimensional. To a… ▽ More

    Submitted 27 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    MSC Class: 62K05; 35Q62; 62F15; 35R30; 35Q93; 65C60; 90C27 ACM Class: G.1.8; I.5.2; I.6.4

  45. arXiv:2312.14018  [pdf, ps, other

    eess.SP

    Enabling Secure Wireless Communications via Movable Antennas

    Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Xiaoming She, Chongjun Ouyang, Peng Chen

    Abstract: A pioneering secure transmission scheme is proposed, which harnesses movable antennas (MAs) to optimize antenna positions for augmenting the physical layer security. Particularly, an MA-enabled secure wireless system is considered, where a multi-antenna transmitter communicates with a single-antenna receiver in the presence of an eavesdropper. The beamformer and antenna positions at the transmitte… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted by IEEE ICASSP 2024

  46. arXiv:2312.14002  [pdf, other

    cond-mat.str-el

    Tensor Network Finite-Size Scaling for Two-Dimensional 3-state Clock Model

    Authors: Debasmita Maiti, Sing-Hong Chan, Pochung Chen

    Abstract: We benchmark recently proposed tensor network based finite-size scaling analysis in Phys. Rev. B 107, 205123 (2023) against two-dimensional classical 3-state clock model. Due to the higher complexity of the model, more complicated crossover behavior is observed. We advocate that the crossover behavior can be understood from the perspective of finite bond dimension inducing relevant perturbation. T… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  47. arXiv:2312.12436  [pdf, other

    cs.CV cs.AI cs.CL cs.MM

    A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

    Authors: Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

    Abstract: The surge of interest towards Multi-modal Large Language Models (MLLMs), e.g., GPT-4V(ision) from OpenAI, has marked a significant trend in both academia and industry. They endow Large Language Models (LLMs) with powerful capabilities in visual understanding, enabling them to tackle diverse multi-modal tasks. Very recently, Google released Gemini, its newest and most capable MLLM built from the gr… ▽ More

    Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Total 120 pages. See our project at https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models

  48. arXiv:2312.11911  [pdf, other

    cs.CV cs.RO

    EVI-SAM: Robust, Real-time, Tightly-coupled Event-Visual-Inertial State Estimation and 3D Dense Map**

    Authors: Weipeng Guan, Peiyu Chen, Huibin Zhao, Yu Wang, Peng Lu

    Abstract: Event cameras are bio-inspired, motion-activated sensors that demonstrate substantial potential in handling challenging situations, such as motion blur and high-dynamic range. In this paper, we proposed EVI-SAM to tackle the problem of 6 DoF pose tracking and 3D reconstruction using monocular event camera. A novel event-based hybrid tracking framework is designed to estimate the pose, leveraging t… ▽ More

    Submitted 23 May, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  49. arXiv:2312.11686  [pdf

    physics.optics physics.app-ph quant-ph

    All-optical modulation with single-photons using electron avalanche

    Authors: Demid V. Sychev, Peigang Chen, Morris Yang, Colton Fruhling, Alexei Lagutchev, Alexander V. Kildishev, Alexandra Boltasseva, Vladimir M. Shalaev

    Abstract: The distinctive characteristics of light such as high-speed propagation, low-loss, low cross-talk and power consumption as well as quantum properties, make it uniquely suitable for various critical applications in communication, high-resolution imaging, optical computing, and emerging quantum information technologies. One limiting factor though is the weak optical nonlinearity of conventional medi… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  50. arXiv:2312.11583  [pdf, other

    cs.LG cs.AI

    AI-Based Energy Transportation Safety: Pipeline Radial Threat Estimation Using Intelligent Sensing System

    Authors: Chengyuan Zhu, Yiyuan Yang, Kaixiang Yang, Haifeng Zhang, Qinmin Yang, C. L. Philip Chen

    Abstract: The application of artificial intelligence technology has greatly enhanced and fortified the safety of energy pipelines, particularly in safeguarding against external threats. The predominant methods involve the integration of intelligent sensors to detect external vibration, enabling the identification of event types and locations, thereby replacing manual detection methods. However, practical im… ▽ More

    Submitted 25 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)