
Showing 1–50 of 746 results for author: Cheng, L

  1. arXiv:2407.00994  [pdf, other]

    cs.CL

    LLM Uncertainty Quantification through Directional Entailment Graph and Claim Level Response Augmentation

    Authors: Longchao Da, Tiejin Chen, Lu Cheng, Hua Wei

    Abstract: Large language models (LLMs) have showcased superior capabilities in sophisticated tasks across various domains. Starting from basic question answering (QA), they are nowadays used as decision assistants or explainers for unfamiliar content. However, they are not always correct, due to data sparsity in domain-specific corpora or the model's hallucination problems. Given this, how much should w…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 11 pages main content, 5 pages appendix

    ACM Class: I.2.7

  2. arXiv:2407.00499  [pdf, other]

    cs.CL cs.AI cs.LG

    ConU: Conformal Uncertainty in Large Language Models with Correctness Coverage Guarantees

    Authors: Zhiyuan Wang, Jinhao Duan, Lu Cheng, Yue Zhang, Qingni Wang, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu

    Abstract: Uncertainty quantification (UQ) in natural language generation (NLG) tasks remains an open challenge, exacerbated by the intricate nature of the recent large language models (LLMs). This study investigates adapting conformal prediction (CP), which can convert any heuristic measure of uncertainty into rigorous theoretical guarantees by constructing prediction sets, for black-box LLMs in open-ended…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 13 pages, 9 figures, 6 tables

  3. arXiv:2406.18961  [pdf, other]

    cs.MA

    Formation Under Communication Constraints: Control Performance Meets Channel Capacity

    Authors: Yaru Chen, Yirui Cong, Xiangyun Zhou, Long Cheng, Xiangke Wang

    Abstract: In wireless communication-based formation control systems, the control performance is significantly impacted by the channel capacity of each communication link between agents. This relationship, however, remains under-investigated in the existing studies. To address this gap, the formation control problem of classical second-order multi-agent systems with bounded process noises was considered taki…

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.18763  [pdf, other]

    cs.LG cs.AI

    Conformalized Link Prediction on Graph Neural Networks

    Authors: Tianyi Zhao, Jian Kang, Lu Cheng

    Abstract: Graph Neural Networks (GNNs) excel in diverse tasks, yet their applications in high-stakes domains are often hampered by unreliable predictions. Although numerous uncertainty quantification methods have been proposed to address this limitation, they often lack \textit{rigorous} uncertainty estimates. This work makes the first attempt to introduce a distribution-free and model-agnostic uncertainty…

    Submitted 26 June, 2024; originally announced June 2024.

  5. arXiv:2406.17795  [pdf, other]

    cs.CV cs.GR

    RACon: Retrieval-Augmented Simulated Character Locomotion Control

    Authors: Yuxuan Mu, Shihao Zou, Kangning Yin, Zheng Tian, Li Cheng, Weinan Zhang, Jun Wang

    Abstract: In computer animation, driving a simulated character with lifelike motion is challenging. Current generative models, though able to generalize to diverse motions, often pose challenges to the responsiveness of end-user control. To address these issues, we introduce RACon: Retrieval-Augmented Simulated Character Locomotion Control. Our end-to-end hierarchical reinforcement learning method utilizes…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted in ICME2024 for oral presentation

  6. arXiv:2406.16253  [pdf, other]

    cs.CL

    LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

    Authors: Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, et al. (15 additional authors not shown)

    Abstract: This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th…

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  7. arXiv:2406.13168  [pdf]

    math.OC

    Stochastic Multi-objective Multi-trip AMR Routing Problem with Time Windows

    Authors: Lulu Cheng, Ning Zhao

    Abstract: In recent years, with the rapidly aging population, alleviating the pressure on medical staff has become a critical issue. To improve the work efficiency of medical staff and reduce the risk of infection, we consider the multi-trip autonomous mobile robot (AMR) routing problem with the stochastic environment to find the solution to minimizing the total expected operating cost and maximizing the to…

    Submitted 18 June, 2024; originally announced June 2024.

  8. arXiv:2406.12779  [pdf, other]

    cs.CL

    Composited-Nested-Learning with Data Augmentation for Nested Named Entity Recognition

    Authors: Xingming Liao, Nankai Lin, Haowen Li, Lianglun Cheng, Zhuowei Wang, Chong Chen

    Abstract: Nested Named Entity Recognition (NNER) focuses on addressing overlapped entity recognition. Compared to Flat Named Entity Recognition (FNER), annotated resources are scarce in the corpus for NNER. Data augmentation is an effective approach to address the insufficient annotated corpus. However, there is a significant lack of exploration in data augmentation methods for NNER. Due to the presence of…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by CSCWD 2024

  9. arXiv:2406.12315  [pdf, other]

    cs.AI

    PruningBench: A Comprehensive Benchmark of Structural Pruning

    Authors: Haoling Li, Changhao Li, Mengqi Xue, Gongfan Fang, Sheng Zhou, Zunlei Feng, Huiqiong Wang, Yong Wang, Lechao Cheng, Mingli Song, Jie Song

    Abstract: Structural pruning has emerged as a promising approach for producing more efficient models. Nevertheless, the community suffers from a lack of standardized benchmarks and metrics, leaving the progress in this area not fully comprehended. To fill this gap, we present the first comprehensive benchmark, termed \textit{PruningBench}, for structural pruning. PruningBench showcases the following three c…

    Submitted 28 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: Submitted to NeurIPS 2024 Datasets and Benchmarks Track

  10. arXiv:2406.11515  [pdf, other]

    cs.CR

    Obfuscating IoT Device Scanning Activity via Adversarial Example Generation

    Authors: Haocong Li, Yaxin Zhang, Long Cheng, Wenjia Niu, Haining Wang, Qiang Li

    Abstract: Nowadays, attackers target Internet of Things (IoT) devices for security exploitation, and search engines for devices and services compromise user privacy, including IP addresses, open ports, device types, vendors, and products. Typically, application banners are used to recognize IoT device profiles during network measurement and reconnaissance. In this paper, we propose a novel approach to obfusc…

    Submitted 17 June, 2024; originally announced June 2024.

  11. arXiv:2406.11169   

    eess.AS cs.SD

    Self-Distillation Prototypes Network: Learning Robust Speaker Representations without Supervision

    Authors: Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen, Shiliang Zhang, Wen Wang

    Abstract: Training speaker-discriminative and robust speaker verification systems without explicit speaker labels remains a persisting challenge. In this paper, we propose a new self-supervised speaker verification approach, Self-Distillation Prototypes Network (SDPN), which effectively facilitates self-supervised speaker representation learning. SDPN assigns the representation of the augmented views of an…

    Submitted 25 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: We update this paper to an earlier paper

  12. arXiv:2406.09181  [pdf, other]

    cs.CV cs.AI

    A Large-scale Universal Evaluation Benchmark For Face Forgery Detection

    Authors: Yijun Bei, Hengrui Lou, Jinsong Geng, Erteng Liu, Lechao Cheng, Jie Song, Mingli Song, Zunlei Feng

    Abstract: With the rapid development of AI-generated content (AIGC) technology, the production of realistic fake facial images and videos that deceive human visual perception has become possible. Consequently, various face forgery detection techniques have been proposed to identify such fake facial content. However, evaluating the effectiveness and generalizability of these detection techniques remains a si…

    Submitted 13 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: This is a paper about constructing a large-scale universal evaluation benchmark for face forgery detection. The full text is 30 pages

  13. arXiv:2406.08235  [pdf, other]

    physics.atom-ph

    Vibrational Branching Ratios for Laser-Cooling of Nonlinear Strontium-Containing Molecules

    Authors: Alexander Frenett, Zack Lasner, Lan Cheng, John M. Doyle

    Abstract: The vibrational branching ratios from the lowest excited electronic state for $\textrm{SrOCH}_3$, $\textrm{SrNH}_2$, and $\textrm{SrSH}$ are measured at the $< 0.1\%$ level. Spectra are obtained by driving the $\tilde{X} - \tilde{A}$ transitions and dispersing the fluorescence on a grating spectrometer. We also perform $\textit{ab initio}$ calculations for the energies of vibrational levels releva…

    Submitted 12 June, 2024; originally announced June 2024.

  14. arXiv:2406.06736  [pdf, other]

    cs.LG cs.AI cs.CY

    Long-Term Fairness Inquiries and Pursuits in Machine Learning: A Survey of Notions, Methods, and Challenges

    Authors: Usman Gohar, Zeyu Tang, Jialu Wang, Kun Zhang, Peter L. Spirtes, Yang Liu, Lu Cheng

    Abstract: The widespread integration of Machine Learning systems in daily life, particularly in high-stakes domains, has raised concerns about the fairness implications. While prior works have investigated static fairness measures, recent studies reveal that automated decision-making has long-term implications and that off-the-shelf fairness approaches may not serve the purpose of achieving long-term fairne…

    Submitted 10 June, 2024; originally announced June 2024.

  15. arXiv:2406.05392  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas

    Authors: Chengyuan Deng, Yiqun Duan, Xin Jin, Heng Chang, Yijun Tian, Han Liu, Henry Peng Zou, Yiqiao Jin, Yijia Xiao, Yichen Wang, Shenghao Wu, Zongxing Xie, Kuofeng Gao, Sihong He, Jun Zhuang, Lu Cheng, Haohan Wang

    Abstract: Large Language Models (LLMs) have achieved unparalleled success across diverse language modeling tasks in recent years. However, this progress has also intensified ethical concerns, impacting the deployment of LLMs in everyday contexts. This paper provides a comprehensive survey of ethical challenges associated with LLMs, from longstanding issues such as copyright infringement, systematic bias, an…

    Submitted 8 June, 2024; originally announced June 2024.

  16. arXiv:2406.03081  [pdf, other]

    quant-ph

    A Quantum Neural Network-Based Approach to Power Quality Disturbances Detection and Recognition

    Authors: Guo-Dong Li, Hai-Yan He, Yue Li, Xin-Hao Li, Hao Liu, Qing-Le Wang, Long Cheng

    Abstract: Power quality disturbances (PQDs) significantly impact the stability and reliability of power systems, necessitating accurate and efficient detection and recognition methods. While numerous classical algorithms for PQDs detection and recognition have been extensively studied and applied, related work in the quantum domain is still in its infancy. In this paper, an improved quantum neural networks…

    Submitted 5 June, 2024; originally announced June 2024.

  17. arXiv:2406.02167  [pdf, other]

    eess.AS eess.SP

    ERes2NetV2: Boosting Short-Duration Speaker Verification Performance with Computational Efficiency

    Authors: Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Qian Chen, Shiliang Zhang, Junjie Li

    Abstract: Speaker verification systems experience significant performance degradation when tasked with short-duration trial recordings. To address this challenge, a multi-scale feature fusion approach has been proposed to effectively capture speaker characteristics from short utterances. Constrained by the model's size, a robust backbone Enhanced Res2Net (ERes2Net) combining global and local feature fusion…

    Submitted 4 June, 2024; originally announced June 2024.

  18. arXiv:2406.00278  [pdf, ps, other]

    math.MG math.CO

    Two new proofs of partial Godbersen's Conjecture

    Authors: Lin Cheng

    Abstract: Two new proofs are provided, offering two new perspectives on Godbersen's conjecture. One of the proofs utilizes Helly's theorem to provide a concise and elegant proof of the inequality in Godbersen's conjecture. The other proof utilizes the Brunn-Minkowski inequality to provide a completely new proof of the inclusion $-K\subset nK$ for convex bodies $K$ with centroid at the origin, thereby provin…

    Submitted 5 June, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

  19. arXiv:2405.18435  [pdf, other]

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag, et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de…

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  20. arXiv:2405.17921  [pdf]

    cs.AI cs.CY

    Towards Clinical AI Fairness: Filling Gaps in the Puzzle

    Authors: Mingxuan Liu, Yilin Ning, Salinelat Teixayavong, Xiaoxuan Liu, Mayli Mertens, Yuqing Shang, Xin Li, Di Miao, Jie Xu, Daniel Shu Wei Ting, Lionel Tim-Ee Cheng, Jasmine Chiat Ling Ong, Zhen Ling Teo, Ting Fang Tan, Narrendar RaviChandran, Fei Wang, Leo Anthony Celi, Marcus Eng Hock Ong, Nan Liu

    Abstract: The ethical integration of Artificial Intelligence (AI) in healthcare necessitates addressing fairness, a concept that is highly context-specific across medical fields. Extensive studies have been conducted to expand the technical components of AI fairness, while tremendous calls for AI fairness have been raised from healthcare. Despite this, a significant disconnect persists between technical adva…

    Submitted 28 May, 2024; originally announced May 2024.

  21. arXiv:2405.17267  [pdf, other]

    cs.LG cs.CV

    FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation

    Authors: Yuting Ma, Lechao Cheng, Yaxiong Wang, Zhun Zhong, Xiaohua Xu, Meng Wang

    Abstract: Federated learning (FL) is a popular privacy-preserving paradigm that enables distributed clients to collaboratively train models with a central server while keeping raw data locally. In practice, distinct model architectures, varying data distributions, and limited resources across local clients inevitably cause model performance degradation and a slowdown in convergence speed. However, existing…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 35 pages

  22. arXiv:2405.17129  [pdf, other]

    cs.CL cs.AI

    TEII: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection

    Authors: Long Cheng, Qihao Shao, Christine Zhao, Sheng Bi, Gina-Anne Levow

    Abstract: Cross-lingual emotion detection allows us to analyze global trends, public opinion, and social phenomena at scale. We participated in the Explainability of Cross-lingual Emotion Detection (EXALT) shared task, achieving an F1-score of 0.6046 on the evaluation set for the emotion detection sub-task. Our system outperformed the baseline by more than 0.16 F1-score absolute, and ranked second amongst c…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: (Under review) Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis

  23. arXiv:2405.16402  [pdf, other]

    cs.CL cs.AI

    Assessing Empathy in Large Language Models with Real-World Physician-Patient Interactions

    Authors: Man Luo, Christopher J. Warren, Lu Cheng, Haidar M. Abdul-Muhsin, Imon Banerjee

    Abstract: The integration of Large Language Models (LLMs) into the healthcare domain has the potential to significantly enhance patient care and support through the development of empathetic, patient-facing chatbots. This study investigates an intriguing question: Can ChatGPT respond with a greater degree of empathy than those typically offered by physicians? To answer this question, we collect a de-identifi…

    Submitted 25 May, 2024; originally announced May 2024.

  24. arXiv:2405.14210  [pdf, other]

    cs.CV eess.IV

    Eidos: Efficient, Imperceptible Adversarial 3D Point Clouds

    Authors: Hanwei Zhang, Luo Cheng, Qisong He, Wei Huang, Renjue Li, Ronan Sicre, Xiaowei Huang, Holger Hermanns, Lijun Zhang

    Abstract: Classification of 3D point clouds is a challenging machine learning (ML) task with important real-world applications in a spectrum from autonomous driving and robot-assisted surgery to earth observation from low orbit. As with other ML tasks, classification models are notoriously brittle in the presence of adversarial attacks. These are rooted in imperceptible changes to inputs with the effect tha…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Preprint

  25. arXiv:2405.13388  [pdf, other]

    cs.CV

    Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation

    Authors: Dingwen Zhang, Hao Li, Diqi He, Nian Liu, Lechao Cheng, Jingdong Wang, Junwei Han

    Abstract: In recent times, following the paradigm of DETR (DEtection TRansformer), query-based end-to-end instance segmentation (QEIS) methods have exhibited superior performance compared to CNN-based models, particularly when trained on large-scale datasets. Nevertheless, the effectiveness of these QEIS methods diminishes significantly when confronted with limited training data. This limitation arises from…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: https://github.com/lifuguan/UPLVP

  26. arXiv:2405.11847  [pdf, other]

    math.NA

    Understanding the ultraspherical spectral method

    Authors: Lu Cheng, Kuan Xu

    Abstract: The ultraspherical spectral method features high accuracy and fast solution. In this article, we determine the sources of error arising from the ultraspherical spectral method and derive its effective condition number, which explains why its backward error is consistent with a numerical method with bounded condition number. In addition, we show the cause for the Cauchy error to go below the machin…

    Submitted 20 May, 2024; originally announced May 2024.

    MSC Class: 15A12; 65L07; 65L10; 65L20; 65L70

  27. arXiv:2405.11566  [pdf, other]

    cs.LG

    Uncertainty-Aware PPG-2-ECG for Enhanced Cardiovascular Diagnosis using Diffusion Models

    Authors: Omer Belhasin, Idan Kligvasser, George Leifman, Regev Cohen, Erin Rainaldi, Li-Fang Cheng, Nishant Verma, Paul Varghese, Ehud Rivlin, Michael Elad

    Abstract: Analyzing the cardiovascular system condition via Electrocardiography (ECG) is a common and highly effective approach, and it has been practiced and perfected over many decades. ECG sensing is non-invasive and relatively easy to acquire, and yet it is still cumbersome for holter monitoring tests that may span over hours and even days. A possible alternative in this context is Photoplethysmography…

    Submitted 19 May, 2024; originally announced May 2024.

  28. arXiv:2405.08889  [pdf, other]

    hep-ph hep-ex

    Incorporating Physical Priors into Weakly-Supervised Anomaly Detection

    Authors: Chi Lung Cheng, Gurpreet Singh, Benjamin Nachman

    Abstract: We propose a new machine-learning-based anomaly detection strategy for comparing data with a background-only reference (a form of weak supervision). The sensitivity of previous strategies degrades significantly when the signal is too rare or there are many unhelpful features. Our Prior-Assisted Weak Supervision (PAWS) method incorporates information from a class of signal models in order to signif…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 7 pages, 2 figures

  29. arXiv:2405.08538  [pdf, other]

    cs.LG

    Self-Distillation Improves DNA Sequence Inference

    Authors: Tong Yu, Lei Cheng, Ruslan Khalitov, Erland Brandser Olsson, Zhirong Yang

    Abstract: Self-supervised pretraining (SSP) has been recognized as a method to enhance prediction accuracy in various downstream tasks. However, its efficacy for DNA sequences remains somewhat constrained. This limitation stems primarily from the fact that most existing SSP approaches in genomics focus on masked language modeling of individual sequences, neglecting the crucial aspect of encoding statistics…

    Submitted 14 May, 2024; originally announced May 2024.

  30. arXiv:2405.08125  [pdf, other]

    cs.CY cs.AI cs.LG

    AI-Cybersecurity Education Through Designing AI-based Cyberharassment Detection Lab

    Authors: Ebuka Okpala, Nishant Vishwamitra, Keyan Guo, Song Liao, Long Cheng, Hongxin Hu, Yongkai Wu, Xiaohong Yuan, Jeannette Wade, Sajad Khorsandroo

    Abstract: Cyberharassment is a critical, socially relevant cybersecurity problem because of the adverse effects it can have on targeted groups or individuals. While progress has been made in understanding cyber-harassment, its detection, attacks on artificial intelligence (AI) based cyberharassment systems, and the social problems in cyberharassment detectors, little has been done in designing experiential…

    Submitted 16 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 pages

  31. arXiv:2405.08013  [pdf, other]

    cs.LG cs.AI cs.SI

    CTRL: Continuous-Time Representation Learning on Temporal Heterogeneous Information Network

    Authors: Chenglin Li, Yuanzhen Xie, Chenyun Yu, Lei Cheng, Bo Hu, Zang Li, Di Niu

    Abstract: Inductive representation learning on temporal heterogeneous graphs is crucial for scalable deep learning on heterogeneous information networks (HINs) which are time-varying, such as citation networks. However, most existing approaches are not inductive and thus cannot handle new nodes or edges. Moreover, previous temporal graph embedding methods are often trained with the temporal link prediction…

    Submitted 10 May, 2024; originally announced May 2024.

  32. arXiv:2405.04490  [pdf, other]

    cs.DC quant-ph

    Resource-Efficient and Self-Adaptive Quantum Search in a Quantum-Classical Hybrid System

    Authors: Zihao Jiang, Zefan Du, Shaolun Ruan, Juntao Chen, Yong Wang, Long Cheng, Rajkumar Buyya, Ying Mao

    Abstract: Over the past decade, the rapid advancement of deep learning and big data applications has been driven by vast datasets and high-performance computing systems. However, as we approach the physical limits of semiconductor fabrication in the post-Moore's Law era, questions arise about the future of these applications. In parallel, quantum computing has made significant progress with the potential to…

    Submitted 7 May, 2024; originally announced May 2024.

  33. arXiv:2405.04008  [pdf, other]

    physics.chem-ph

    A new computational framework for spinor-based relativistic exact two-component calculations using contracted basis functions

    Authors: Chaoqun Zhang, Kirk A. Peterson, Kenneth G. Dyall, Lan Cheng

    Abstract: A new computational framework for spinor-based relativistic exact two-component (X2C) calculations is developed using contracted basis sets with a spin-orbit contraction scheme. Generally contracted j-adapted basis sets using primitive functions in the correlation-consistent basis sets are constructed for the X2C Hamiltonian with atomic mean-field spin-orbit integrals (the X2CAMF scheme). The cont…

    Submitted 7 May, 2024; originally announced May 2024.

  34. arXiv:2405.02313  [pdf, ps, other]

    physics.flu-dyn

    Physics-informed Data-driven Cavitation Model for a Specific MG EOS

    Authors: Minsheng Huang, Chengbao Yao, Pan Wang, Lidong Cheng, Wenjun Ying

    Abstract: We present a novel one-fluid cavitation model of a specific Mie-Grüneisen equation of state (EOS), named polynomial EOS, based on an artificial neural network. Not only the physics-informed equation but also the experimental data are embedded into the proposed model by an optimization problem. The physics-informed data-driven model provides the concerned pressure within the cavitation region, where…

    Submitted 5 April, 2024; originally announced May 2024.

    Comments: 29 pages, 18 figures

  35. arXiv:2404.17844  [pdf, other]

    cs.IR

    Towards Robust Recommendation: A Review and an Adversarial Robustness Evaluation Library

    Authors: Lei Cheng, Xiaowen Huang, Jitao Sang, Jian Yu

    Abstract: Recently, recommender systems have achieved significant success. However, due to the openness of recommender systems, they remain vulnerable to malicious attacks. Additionally, natural noise in training data and issues such as data sparsity can also degrade the performance of recommender systems. Therefore, enhancing the robustness of recommender systems has become an increasingly important research…

    Submitted 27 April, 2024; originally announced April 2024.

  36. arXiv:2404.16295  [pdf, other]

    q-fin.MF q-fin.PR

    Joint calibration to SPX and VIX Derivative Markets with Composite Change of Time Models

    Authors: Liexin Cheng, Xue Cheng, Xianhua Peng

    Abstract: The Chicago Board Options Exchange Volatility Index (VIX) is calculated from SPX options and derivatives of VIX are also traded in market, which leads to the so-called "consistent modeling" problem. This paper proposes a time-changed Lévy model for log price with a composite change of time structure to capture both features of the implied SPX volatility and the implied volatility of volatility. Co…

    Submitted 24 April, 2024; originally announced April 2024.

  37. arXiv:2404.14445  [pdf, other]

    cs.LG cs.AI cs.CL

    A Multi-Faceted Evaluation Framework for Assessing Synthetic Data Generated by Large Language Models

    Authors: Yefeng Yuan, Yuhong Liu, Liang Cheng

    Abstract: The rapid advancements in generative AI and large language models (LLMs) have opened up new avenues for producing synthetic data, particularly in the realm of structured tabular formats, such as product reviews. Despite the potential benefits, concerns regarding privacy leakage have surfaced, especially when personal information is utilized in the training datasets. In addition, there is an absenc…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 10 pages, 1 figure, 4 tables

  38. arXiv:2404.13396  [pdf]

    cond-mat.mtrl-sci

    Angle-Resolved Magneto-Chiral Anisotropy in a Non-Centrosymmetric Atomic Layer Superlattice

    Authors: Long Cheng, Mingrui Bao, Jingxian Zhang, Xue Zhang, Qun Yang, Qiang Li, Hui Cao, Dawei Qiu, Jia Liu, Fei Ye, Qing Wang, Genhao Liang, Hui Li, Guanglei Cheng, Hua Zhou, Jian-Min Zuo, Xiaodong Zhou, Jian Shen, Zhifeng Zhu, Sai Mu, Wenbo Wang, Xiaofang Zhai

    Abstract: Chirality in solid-state materials has sparked significant interest due to potential applications of topologically-protected chiral states in next-generation information technology. The electrical magneto-chiral effect (eMChE), arising from relativistic spin-orbit interactions, shows great promise for developing chiral materials and devices for electronic integration. Here we demonstrate an angle-…

    Submitted 20 April, 2024; originally announced April 2024.

  39. arXiv:2404.11613  [pdf, other]

    cs.CV

    InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior

    Authors: Zhiheng Liu, Hao Ouyang, Qiuyu Wang, Ka Leong Cheng, Jie Xiao, Kai Zhu, Nan Xue, Yu Liu, Yujun Shen, Yang Cao

    Abstract: 3D Gaussians have recently emerged as an efficient representation for novel view synthesis. This work studies its editability with a particular focus on the inpainting task, which aims to supplement an incomplete set of 3D Gaussians with additional points for visually harmonious rendering. Compared to 2D inpainting, the crux of inpainting 3D Gaussians is to figure out the rendering-relevant proper…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Project page: https://johanan528.github.io/Infusion

  40. arXiv:2404.08966  [pdf, other]

    cs.CV

    LoopGaussian: Creating 3D Cinemagraph with Multi-view Images via Eulerian Motion Field

    Authors: Jiyang Li, Lechao Cheng, Zhangye Wang, Tingting Mu, Jingxuan He

    Abstract: Cinemagraph is a unique form of visual media that combines elements of still photography and subtle motion to create a captivating experience. However, the majority of videos generated by recent works lack depth information and are confined to the constraints of 2D image space. In this paper, inspired by significant progress in the field of novel view synthesis (NVS) achieved by 3D Gaussian Splatt…

    Submitted 16 April, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

    Comments: 10 pages

  41. arXiv:2404.07810  [pdf, other]

    math.OC

    Problem-Driven Scenario Reduction Framework for Power System Stochastic Operation

    Authors: Yingrui Zhuang, Lin Cheng, Ning Qi, Mads R. Almassalkhi, Feng Liu

    Abstract: Scenario reduction (SR) aims to identify a small yet representative scenario set to depict the underlying uncertainty, which is critical to scenario-based stochastic optimization (SBSO) of power systems. Existing SR techniques commonly aim to achieve statistical approximation to the original scenario set. However, SR and SBSO are commonly treated as two distinct and decoupled processes, which…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: This is a manuscript submitted to IEEE Transactions on Power Systems. This manuscript contains 10 pages, 5 figures

  42. arXiv:2404.05953  [pdf, other]

    cs.RO

    3D Branch Point Cloud Completion for Robotic Pruning in Apple Orchards

    Authors: Tian Qiu, Alan Zoubi, Nikolai Spine, Lailiang Cheng, Yu Jiang

    Abstract: Robotic branch pruning is a significantly growing research area to cope with the shortage of labor force in the context of agriculture. One fundamental requirement in robotic pruning is the perception of detailed geometry and topology of branches. However, the point clouds obtained in agricultural settings often exhibit incompleteness due to several constraints, thereby restricting the accuracy of…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Submitted to IROS2024

  43. arXiv:2404.00802  [pdf, other]

    physics.app-ph

    The Mechanics and Physics of Twisted and Coiled Polymer Actuators

    Authors: Qiong Wang, Anan Ghrayeb, SeongHyeon Kim, Liuyang Cheng, Sameh Tawfick

    Abstract: Twisted and coiled polymer actuators (TCPAs) generate large contractile mechanical work mimicking natural muscles, which makes them suitable for robotics and health-assistive devices. Understanding the mechanism of nylon TCPA remains challenging due to the interplay between their intricate geometry, chirality, residual stresses, and material microstructure. This study integrates a material microst…

    Submitted 31 March, 2024; originally announced April 2024.

  44. arXiv:2404.00464  [pdf, other]

    cs.LG

    Leveraging Pre-trained and Transformer-derived Embeddings from EHRs to Characterize Heterogeneity Across Alzheimer's Disease and Related Dementias

    Authors: Matthew West, Colin Magdamo, Lily Cheng, Yingnan He, Sudeshna Das

    Abstract: Alzheimer's disease is a progressive, debilitating neurodegenerative disease that affects 50 million people globally. Despite this substantial health burden, available treatments for the disease are limited and its fundamental causes remain poorly understood. Previous work has suggested the existence of clinically-meaningful sub-types, which it is suggested may correspond to distinct etiologies, d…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 14 pages, 5 figures in main text

  45. arXiv:2403.19971  [pdf, other]

    eess.AS eess.SP

    3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization

    Authors: Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Tinglong Zhu, Changhe Song, Rongjie Huang, Ziyang Ma, Qian Chen, Shiliang Zhang, Xihao Li

    Abstract: This paper introduces 3D-Speaker-Toolkit, an open source toolkit for multi-modal speaker verification and diarization. It is designed for the needs of academic researchers and industrial practitioners. The 3D-Speaker-Toolkit adeptly leverages the combined strengths of acoustic, semantic, and visual data, seamlessly fusing these modalities to offer robust speaker recognition capabilities. The acous…

    Submitted 29 March, 2024; originally announced March 2024.

  46. arXiv:2403.17701   

    eess.IV cs.CV cs.LG

    Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation

    Authors: Hao Tang, Lianglun Cheng, Guoheng Huang, Zhengguang Tan, Junhao Lu, Kaihong Wu

    Abstract: Image segmentation holds a vital position in the realms of diagnosis and treatment within the medical domain. Traditional convolutional neural networks (CNNs) and Transformer models have made significant advancements in this realm, but they still encounter challenges because of limited receptive field or high computing complexity. Recently, State Space Models (SSMs), particularly Mamba and its var…

    Submitted 3 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Experimental method encountered errors, undergoing experiment again

  47. arXiv:2403.11517  [pdf, other]

    q-bio.NC cs.HC

    Inter-individual and inter-site neural code conversion and image reconstruction without shared stimuli

    Authors: Haibao Wang, Jun Kai Ho, Fan L. Cheng, Shuntaro C. Aoki, Yusuke Muraki, Misato Tanaka, Yukiyasu Kamitani

    Abstract: The human brain demonstrates substantial inter-individual variability in fine-grained functional topography, posing challenges in identifying common neural representations across individuals. Functional alignment has the potential to harmonize these individual differences. However, it typically requires an identical set of stimuli presented to different individuals, which is often unavailable. To…

    Submitted 18 March, 2024; originally announced March 2024.

  48. arXiv:2403.11366  [pdf, other]

    cs.LG cs.CL cs.DC

    JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning

    Authors: Anique Tahir, Lu Cheng, Huan Liu

    Abstract: The scaling of Large Language Models (LLMs) for retrieval-based tasks, particularly in Retrieval Augmented Generation (RAG), faces significant memory constraints, especially when fine-tuning extensive prompt sequences. Current open-source libraries support full-model inference and fine-tuning across multiple GPUs but fall short of accommodating the efficient parameter distribution required for ret…

    Submitted 19 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  49. arXiv:2403.08441  [pdf, other]

    quant-ph

    Stabilizer ground states: theory, algorithms and applications

    Authors: Jiace Sun, Lixue Cheng, Shi-Xin Zhang

    Abstract: Stabilizer states have been commonly utilized in quantum information, quantum error correction, and quantum circuit simulation due to their simple mathematical structure. In this work, we apply stabilizer states to tackle quantum many-body problems and introduce the concept of stabilizer ground states. We present a simplified equivalent formalism for identifying stabilizer ground states of general…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 22 pages, 6 figures

  50. arXiv:2403.05881  [pdf, other]

    cs.CL

    KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques

    Authors: Rui Yang, Haoran Liu, Edison Marrese-Taylor, Qingcheng Zeng, Yu He Ke, Wanxin Li, Lechao Cheng, Qingyu Chen, James Caverlee, Yutaka Matsuo, Irene Li

    Abstract: Large Language Models (LLMs) have significantly advanced healthcare innovation on generation capabilities. However, their application in real clinical settings is challenging due to potential deviations from medical facts and inherent biases. In this work, we develop an augmented LLM framework, KG-Rank, which leverages a medical knowledge graph (KG) with ranking and re-ranking techniques, aiming t…

    Submitted 18 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.