Showing 1–50 of 273 results for author: Xia, R

  1. arXiv:2406.15126  [pdf, other]

    cs.CL

    On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey

    Authors: Lin Long, Rui Wang, Ruixuan Xiao, Junbo Zhao, Xiao Ding, Gang Chen, Haobo Wang

    Abstract: Within the evolving landscape of deep learning, the dilemma of data quantity and quality has been a long-standing problem. The recent advent of Large Language Models (LLMs) offers a data-centric solution to alleviate the limitations of real-world data with synthetic data generation. However, current investigations into this field lack a unified framework and mostly stay on the surface. Therefore,…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: A survey on LLMs-driven synthetic data generation, curation and evaluation

  2. arXiv:2406.14884  [pdf, other]

    cs.CL

    FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents

    Authors: Ruixuan Xiao, Wentao Ma, Ke Wang, Yuchuan Wu, Junbo Zhao, Haobo Wang, Fei Huang, Yongbin Li

    Abstract: LLM-based agents have emerged as promising tools, which are crafted to fulfill complex tasks by iterative planning and action. However, these agents are susceptible to undesired planning hallucinations when lacking specific knowledge for expertise-intensive tasks. To address this, preliminary attempts are made to enhance planning reliability by incorporating external workflow-related knowledge. De…

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.11633  [pdf, other]

    cs.CV

    DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models

    Authors: Renqiu Xia, Song Mao, Xiangchao Yan, Hongbin Zhou, Bo Zhang, Haoyang Peng, Jiahao Pi, Daocheng Fu, Wenjie Wu, Hancheng Ye, Shiyang Feng, Bin Wang, Chao Xu, Conghui He, Pinlong Cai, Min Dou, Botian Shi, Sheng Zhou, Yongwei Wang, Bin Wang, Junchi Yan, Fei Wu, Yu Qiao

    Abstract: Scientific documents record research findings and valuable human knowledge, comprising a vast corpus of high-quality data. Leveraging multi-modality data extracted from these documents and assessing large models' abilities to handle scientific document-oriented tasks is therefore meaningful. Despite promising advancements, large models still perform poorly on multi-page scientific document extract…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Homepage of DocGenome: https://unimodal4reasoning.github.io/DocGenome_page 22 pages, 11 figures

  4. arXiv:2406.07961  [pdf, other]

    cs.CV cs.AI

    Accurate Explanation Model for Image Classifiers using Class Association Embedding

    Authors: Ruitao Xie, Jingbang Chen, Limai Jiang, Rui Xiao, Yi Pan, Yunpeng Cai

    Abstract: Image classification is a primary task in data analysis where explainable models are crucially demanded in various applications. Although amounts of methods have been proposed to obtain explainable knowledge from the black-box classifiers, these approaches lack the efficiency of extracting global knowledge regarding the classification task, thus is vulnerable to local traps and often leads to poor…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 40th IEEE International Conference on Data Engineering

  5. arXiv:2406.07571  [pdf, other]

    cs.CY

    Supporting Self-Reflection at Scale with Large Language Models: Insights from Randomized Field Experiments in Classrooms

    Authors: Harsh Kumar, Ruiwei Xiao, Benjamin Lawson, Ilya Musabirov, Jiakai Shi, Xinyuan Wang, Huayin Luo, Joseph Jay Williams, Anna Rafferty, John Stamper, Michael Liut

    Abstract: Self-reflection on learning experiences constitutes a fundamental cognitive process, essential for the consolidation of knowledge and the enhancement of learning efficacy. However, traditional methods to facilitate reflection often face challenges in personalization, immediacy of feedback, engagement, and scalability. Integration of Large Language Models (LLMs) into the reflection process could mi…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: Accepted at L@S'24

  6. arXiv:2406.07268  [pdf, other]

    cs.MM cs.CL cs.CV

    Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation

    Authors: Jinyuan Li, Ziyan Li, Han Li, Jianfei Yu, Rui Xia, Di Sun, Gang Pan

    Abstract: Grounded Multimodal Named Entity Recognition (GMNER) task aims to identify named entities, entity types and their corresponding visual regions. GMNER task exhibits two challenging attributes: 1) The tenuous correlation between images and text on social media contributes to a notable proportion of named entities being ungroundable. 2) There exists a distinction between coarse-grained noun phrases u…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Extension of our Findings of EMNLP 2023 & ACL 2024 paper

  7. arXiv:2406.02852  [pdf]

    cond-mat.mtrl-sci

    Isolated anions induced high ionic conductivity

    Authors: Qifan Yang, Jing Xu, Yuqi Wang, Xiao Fu, Ruijuan Xiao, Hong Li

    Abstract: One of the key materials in solid-state lithium batteries is fast ion conductors. However, the Li+ ion transport in inorganic crystals involves complex factors, making it a mystery to find and design ion conductors with low migration barriers. In this work, a distinctive structural characteristic involving isolated anions has been discovered to enhance high ionic conductivity in crystals. It is an…

    Submitted 4 June, 2024; originally announced June 2024.

  8. arXiv:2405.18435  [pdf, other]

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag, et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de…

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  9. arXiv:2405.13049  [pdf, other]

    cs.CL cs.AI cs.MM

    SemEval-2024 Task 3: Multimodal Emotion Cause Analysis in Conversations

    Authors: Fanfan Wang, Heqing Ma, Jianfei Yu, Rui Xia, Erik Cambria

    Abstract: The ability to understand emotions is an essential component of human-like artificial intelligence, as emotions greatly influence human cognition, decision making, and social interactions. In addition to emotion recognition in conversations, the task of identifying the potential causes behind an individual's emotional state in conversations, is of great importance in many application scenarios. We…

    Submitted 10 June, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: 12 pages, 3 figures, 4 Tables

    Journal ref: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)

  10. arXiv:2405.09758  [pdf]

    physics.optics

    Spatial-temporal manipulations of visible nanosecond sub-pulse sequences in an actively Q-switched Pr:YLF laser

    Authors: Shengbo Xu, Yunru Chen, Ran Xia, Changcheng Duan, Qingrui Zeng, Yu Xiao, Xiahui Tang, Gang Xu

    Abstract: Pulsed visible lasers either by Q-switching or mode locking have been attracting intense attentions both in solid-state laser and fiber laser. Here, we report on the simultaneous manipulation of reconfigurable sub-pulse sequences and customizable high-order vortex beams in an actively Q-switched visible laser. On the one hand, pulse sequences with up to 4 sub-pulses could be generated and fully co…

    Submitted 15 May, 2024; originally announced May 2024.

  11. arXiv:2405.09478  [pdf, ps, other]

    astro-ph.GA

    Active Galactic Nuclei and STaR fOrmation in Nearby Galaxies (AGNSTRONG). I. Sample and Strategy

    Authors: Huynh Anh N. Le, Chen Qin, Yongquan Xue, Shifu Zhu, Kim Ngan N. Nguyen, Ruisong Xia, Xiaozhi Lin

    Abstract: We introduce our project, AGNSTRONG (Active Galactic Nuclei and STaR fOrmation in Nearby Galaxies). Our research goals encompass investigating the kinematic properties of ionized and molecular gas outflows, understanding the impact of AGN feedback, and exploring the coevolution dynamics between AGN strength activity and star formation activity. We aim to conduct a thorough analysis to determine wh…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 13 pages, accepted for publication in The Astronomical Journal

  12. arXiv:2405.04645  [pdf, other]

    cs.HC cs.CY

    Enhancing LLM-Based Feedback: Insights from Intelligent Tutoring Systems and the Learning Sciences

    Authors: John Stamper, Ruiwei Xiao, Xinying Hou

    Abstract: The field of Artificial Intelligence in Education (AIED) focuses on the intersection of technology, education, and psychology, placing a strong emphasis on supporting learners' needs with compassion and understanding. The growing prominence of Large Language Models (LLMs) has led to the development of scalable solutions within educational settings, including generating different types of feedback…

    Submitted 11 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted to 25th International Conference on Artificial Intelligence in Education (AIED 2024) BlueSky special track

  13. arXiv:2405.03466  [pdf]

    cond-mat.mtrl-sci

    A family of air-stable chalcogenide solid electrolytes in Li$_2$BMQ$_4$ (B = Ca, Sr and Ba; M = Si, Ge and Sn; Q = O, S and Se) systems

    Authors: Huican Mao, Xiang Zhu, Guangmao Li, Jie Pang, Junfeng Hao, Liqi Wang, Hailong Yu, Youguo Shi, Fan Wu, Shilie Pan, Ruijuan Xiao, Hong Li, Liquan Chen

    Abstract: Combining high-throughput first-principles calculations and experimental measurements, we have identified a novel family of fast lithium-ion chalcogenide conductors in Li$_2$BMQ$_4$ (2114, B = Ca, Sr and Ba; M = Si, Ge and Sn; Q = O, S and Se) systems. Our calculations demonstrate that most of the thermodynamically and kinetically stable sulfides and selenides in this new system exhibit ultralow L…

    Submitted 6 May, 2024; originally announced May 2024.

  14. arXiv:2405.03002  [pdf, other]

    cond-mat.str-el

    Anomalous electronic energy relaxation and soft phonons in the Dirac semimetal Cd$_3$As$_2$

    Authors: Rishi Bhandia, David Barbalas, Run Xiao, Juan R. Chamorro, Tyrel M. McQueen, Nitin Samarth, N. P. Armitage

    Abstract: We have used a combination of linear response time-domain THz spectroscopy (TDTS) and high-field non-linear THz spectroscopy to separately probe the electronic momentum and energy relaxation rates respectively of the Dirac semimetal Cd$_3$As$_2$. We find, consistent with prior measurements, that Cd$_3$As$_2$ has enormous nonlinearities in the THz frequency range. We extract the momentum relaxat…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures

  15. arXiv:2405.00313  [pdf, other]

    cs.CV

    Streamlining Image Editing with Layered Diffusion Brushes

    Authors: Peyman Gholami, Robert Xiao

    Abstract: Denoising diffusion models have recently gained prominence as powerful tools for a variety of image generation and manipulation tasks. Building on this, we propose a novel tool for real-time editing of images that provides users with fine-grained region-targeted supervision in addition to existing prompt-based controls. Our novel editing technique, termed Layered Diffusion Brushes, leverages promp…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2306.00219

  16. arXiv:2404.15937  [pdf, other]

    hep-ph hep-ex

    Probing Neutral Triple Gauge Couplings via $\boldsymbol{Zγ\,(\ell^+\ell^-γ)}$ Production at $\boldsymbol{e^+e^-}$ Colliders

    Authors: Danning Liu, Rui-Qing Xiao, Shu Li, John Ellis, Hong-Jian He, Rui Yuan

    Abstract: Neutral triple gauge couplings (nTGCs) are absent in the Standard Model (SM) and at the dimension-6 level in the Standard Model Effective Field Theory (SMEFT), arising first from dimension-8 operators. As such, they provide a unique window for probing new physics beyond the SM. These dimension-8 operators can be mapped to nTGC form factors whose structure is consistent with the spontaneously-broke…

    Submitted 1 July, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Frontiers of Physics (in Press), 22 pages, 10 Figs and 10 Tables

    Report number: KCL-PH-TH/2024-18, CERN-TH-2024-046

  17. arXiv:2404.15675  [pdf, other]

    cs.IR

    Hi-Gen: Generative Retrieval For Large-Scale Personalized E-commerce Search

    Authors: Yanjing Wu, Yinfu Feng, Jian Wang, Wenji Zhou, Yunan Ye, Rong Xiao

    Abstract: Leveraging generative retrieval (GR) techniques to enhance search systems is an emerging methodology that has shown promising results in recent years. In GR, a text-to-text model maps string queries directly to relevant document identifiers (docIDs), so it dramatically simplifies the whole retrieval process. However, when applying most GR models in large-scale E-commerce for personalized item sear…

    Submitted 24 April, 2024; originally announced April 2024.

  18. arXiv:2404.15353  [pdf, other]

    eess.SP cs.AI cs.LG

    SQUWA: Signal Quality Aware DNN Architecture for Enhanced Accuracy in Atrial Fibrillation Detection from Noisy PPG Signals

    Authors: Runze Yan, Cheng Ding, Ran Xiao, Aleksandr Fedorov, Randall J Lee, Fadi Nahab, Xiao Hu

    Abstract: Atrial fibrillation (AF), a common cardiac arrhythmia, significantly increases the risk of stroke, heart disease, and mortality. Photoplethysmography (PPG) offers a promising solution for continuous AF monitoring, due to its cost efficiency and integration into wearable devices. Nonetheless, PPG signals are susceptible to corruption from motion artifacts and other factors often encountered in ambu…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 15 pages; 9 figures; 2024 Conference on Health, Inference, and Learning (CHIL)

  19. arXiv:2404.11889  [pdf, other]

    eess.IV cs.CV

    Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans

    Authors: Lixing Tan, Shuang Song, Kangneng Zhou, Chengbo Duan, Lanying Wang, Huayang Ren, Linlin Liu, Wei Zhang, Ruoxiu Xiao

    Abstract: X-ray images play a vital role in the intraoperative processes due to their high resolution and fast imaging speed and greatly promote the subsequent segmentation, registration and reconstruction. However, over-dosed X-rays superimpose potential risks to human health to some extent. Data-driven algorithms from volume scans to X-ray images are restricted by the scarcity of paired X-ray and volume d…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 13 pages, 10 figures

  20. arXiv:2404.11398  [pdf, other]

    cond-mat.mes-hall cond-mat.other cond-mat.str-el

    Revealing the spatial nature of sublattice symmetry

    Authors: Rong Xiao, Y. X. Zhao

    Abstract: The sublattice symmetry on a bipartite lattice is commonly regarded as the chiral symmetry in the AIII class of the tenfold Altland-Zirnbauer classification. Here, we reveal the spatial nature of sublattice symmetry, and show that this assertion holds only if the periodicity of primitive unit cells agrees with that of the sublattice labeling. In cases where the periodicity does not agree, sublatti…

    Submitted 8 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: 10 pages for the main text and 5 pages for the supplementary information

    Journal ref: Nat Commun 15, 3787 (2024)

  21. arXiv:2404.05226  [pdf, ps, other]

    math.CO math.DS math.NT

    Monochromatic Polynomial sumset structures on $\mathbb{N}$

    Authors: Zhengxing Lian, Rongzhong Xiao

    Abstract: In the paper, we search for monochromatic infinite additive structures involving polynomials on $\mathbb{N}$. Ultimately, we can prove that for any $r\in \mathbb{N}$, any distinct natural numbers $a,b$ and any $2$-coloring of $\mathbb{N}$, there exist subsets $B,C\subset \mathbb{N}$ with $|B|=r$ and $|C|=\infty$ such that there exists a color containing $B+aC$ and $B+bC$. In fact, for the specific…

    Submitted 15 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Welcome comments

  22. arXiv:2404.02213  [pdf, other]

    cs.HC cs.AI cs.CY

    Exploring How Multiple Levels of GPT-Generated Programming Hints Support or Disappoint Novices

    Authors: Ruiwei Xiao, Xinying Hou, John Stamper

    Abstract: Recent studies have integrated large language models (LLMs) into diverse educational contexts, including providing adaptive programming hints, a type of feedback focuses on helping students move forward during problem-solving. However, most existing LLM-based hint systems are limited to one single hint type. To investigate whether and how different levels of hints can support students' problem-sol…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted CHI 2024 LBW - 10 pages

  23. arXiv:2403.17777  [pdf, other]

    econ.EM

    Deconvolution from two order statistics

    Authors: JoonHwan Cho, Yao Luo, Ruli Xiao

    Abstract: Economic data are often contaminated by measurement errors and truncated by ranking. This paper shows that the classical measurement error model with independent and additive measurement errors is identified nonparametrically using only two order statistics of repeated measurements. The identification result confirms a hypothesis by Athey and Haile (2002) for a symmetric ascending auction model wi…

    Submitted 26 March, 2024; originally announced March 2024.

  24. arXiv:2403.15901  [pdf, other]

    cs.AI cs.CV

    MatchSeg: Towards Better Segmentation via Reference Image Matching

    Authors: Ruiqiang Xiao, Jiayu Huo, Haotian Zheng, Yang Liu, Sebastien Ourselin, Rachel Sparks

    Abstract: Recently, automated medical image segmentation methods based on deep learning have achieved great success. However, they heavily rely on large annotated datasets, which are costly and time-consuming to acquire. Few-shot learning aims to overcome the need for annotated data by using a small labeled dataset, known as a support set, to guide predicting labels for new, unlabeled images, known as the q…

    Submitted 19 June, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

  25. arXiv:2403.15835  [pdf, other]

    cs.CV

    Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

    Authors: Hancheng Ye, Chong Yu, Peng Ye, Renqiu Xia, Yansong Tang, Jiwen Lu, Tao Chen, Bo Zhang

    Abstract: Recent Vision Transformer Compression (VTC) works mainly follow a two-stage scheme, where the importance score of each model unit is first evaluated or preset in each submodule, followed by the sparsity score evaluation according to the target sparsity constraint. Such a separate evaluation process induces the gap between importance and sparsity score distributions, thus causing high search costs…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024. Our code will be available at www.github.com/HankYe/Once-for-Both

  26. arXiv:2403.14131  [pdf]

    cond-mat.mtrl-sci

    Efficient Learning Strategy for Predicting Glass Forming Ability in Imbalanced Datasets of Bulk Metallic Glasses

    Authors: Xuhe Gong, Jiazi Bi, Xiaobin Liu, Ran Li, Ruijuan Xiao, Tao Zhang, Hong Li

    Abstract: The prediction of glass forming ability (GFA) and various properties in bulk metallic glasses (BMGs) pose a challenge due to the unique disordered atomic structure in this type of materials. Machine learning shows the potential ability to find a way out. However, the training set from the experimental data of BMGs faces the issue of data imbalance, including the distribution of data related to ele…

    Submitted 21 March, 2024; originally announced March 2024.

  27. arXiv:2403.14116  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    Mechanistic Insights into Temperature Effects for Ionic Conductivity in Li6PS5Cl

    Authors: Zicun Li, Jianxing Huang, Xinguo Ren, Jinbin Li, Ruijuan Xiao, Hong Li

    Abstract: Ensuring solid-state lithium batteries perform well across a wide temperature range is crucial for their practical use. Molecular dynamics (MD) simulations can provide valuable insights into the temperature dependence of the battery materials, however, the high computational cost of ab initio MD poses challenges for simulating ion migration dynamics at low temperatures. To address this issue, accu…

    Submitted 21 March, 2024; originally announced March 2024.

  28. arXiv:2403.02799  [pdf, other]

    cs.CL cs.AI

    DPPA: Pruning Method for Large Language Model to Model Merging

    Authors: Yaochen Zhu, Rui Xia, Jiajun Zhang

    Abstract: Model merging is to combine fine-tuned models derived from multiple domains, with the intent of enhancing the model's proficiency across various domains. The principal concern is the resolution of parameter conflicts. A substantial amount of existing research remedy this issue during the merging stage, with the latest study focusing on resolving this issue throughout the pruning stage. The DARE ap…

    Submitted 5 March, 2024; originally announced March 2024.

  29. arXiv:2402.17213  [pdf, other]

    cs.CV cs.AI

    VCD: Knowledge Base Guided Visual Commonsense Discovery in Images

    Authors: Xiangqing Shen, Yurun Song, Siwei Wu, Rui Xia

    Abstract: Visual commonsense contains knowledge about object properties, relationships, and behaviors in visual data. Discovering visual commonsense can provide a more comprehensive and richer understanding of images, and enhance the reasoning and decision-making capabilities of computer vision systems. However, the visual commonsense defined in existing visual commonsense discovery studies is coarse-graine…

    Submitted 27 February, 2024; originally announced February 2024.

  30. arXiv:2402.12185  [pdf, other]

    cs.CV

    ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

    Authors: Renqiu Xia, Bo Zhang, Hancheng Ye, Xiangchao Yan, Qi Liu, Hongbin Zhou, Zijun Chen, Min Dou, Botian Shi, Junchi Yan, Yu Qiao

    Abstract: Recently, many versatile Multi-modal Large Language Models (MLLMs) have emerged continuously. However, their capacity to query information depicted in visual charts and engage in reasoning based on the queried contents remains under-explored. In this paper, to comprehensively and rigorously benchmark the ability of the off-the-shelf MLLMs in the chart domain, we construct ChartX, a multi-modal eva…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Code and dataset are available for downloading at: https://github.com/UniModal4Reasoning/ChartVLM 22 pages, 15 figures

  31. arXiv:2402.11809  [pdf, other]

    cs.CL cs.AI cs.LG

    Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding

    Authors: Hanling Yi, Feng Lin, Hongbin Li, Peiyang Ning, Xiaotian Yu, Rong Xiao

    Abstract: This research aims to accelerate the inference speed of large language models (LLMs) with billions of parameters. We propose Smart Parallel Auto-Correct dEcoding (SPACE), an innovative approach designed for achieving lossless acceleration of LLMs. By integrating semi-autoregressive inference and speculative decoding capabilities, SPACE uniquely enables…

    Submitted 19 May, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 Findings

  32. arXiv:2402.07913  [pdf, other]

    cs.CL cs.AI cs.HC

    QACP: An Annotated Question Answering Dataset for Assisting Chinese Python Programming Learners

    Authors: Rui Xiao, Lu Han, Xiaoying Zhou, Jiong Wang, Na Zong, Pengyu Zhang

    Abstract: In online learning platforms, particularly in rapidly growing computer programming courses, addressing the thousands of students' learning queries requires considerable human cost. The creation of intelligent assistant large language models (LLMs) tailored for programming education necessitates distinct data support. However, in real application scenarios, the data resources for training such LLMs…

    Submitted 22 February, 2024; v1 submitted 30 January, 2024; originally announced February 2024.

  33. arXiv:2401.13588  [pdf]

    cs.CL cs.AI cs.SE

    Evaluation of General Large Language Models in Contextually Assessing Semantic Concepts Extracted from Adult Critical Care Electronic Health Record Notes

    Authors: Darren Liu, Cheng Ding, Delgersuren Bold, Monique Bouvier, Jiaying Lu, Benjamin Shickel, Craig S. Jabaley, Wenhui Zhang, Soojin Park, Michael J. Young, Mark S. Wainwright, Gilles Clermont, Parisa Rashidi, Eric S. Rosenthal, Laurie Dimisko, Ran Xiao, Joo Heung Yoon, Carl Yang, Xiao Hu

    Abstract: The field of healthcare has increasingly turned its focus towards Large Language Models (LLMs) due to their remarkable performance. However, their performance in actual clinical applications has been underexplored. Traditional evaluations based on question-answering tasks don't fully capture the nuanced contexts. This gap highlights the need for more in-depth and practical assessments of LLMs in r…

    Submitted 24 January, 2024; originally announced January 2024.

  34. arXiv:2401.12522  [pdf, other]

    cs.CL cs.AI cs.LG

    BiTA: Bi-Directional Tuning for Lossless Acceleration in Large Language Models

    Authors: Feng Lin, Hanling Yi, Hongbin Li, Yifan Yang, Xiaotian Yu, Guangming Lu, Rong Xiao

    Abstract: Large language models (LLMs) commonly employ autoregressive generation during inference, leading to high memory bandwidth demand and consequently extended latency. To mitigate this inefficiency, we present Bi-directional Tuning for lossless Acceleration (BiTA), an innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification. Inspired by the concept of pro…

    Submitted 25 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: An appendix has been included. Source code at https://github.com/linfeng93/BiTA

  35. arXiv:2401.04926  [pdf, ps, other]

    astro-ph.HE astro-ph.GA

    First Observational Evidence for an Interconnected Evolution between Time Lag and QPO Frequency among AGNs

    Authors: Ruisong Xia, Hao Liu, Yongquan Xue

    Abstract: Quasi-periodic oscillations (QPOs) have been widely observed in black hole X-ray binaries (BHBs), which often exhibit significant X-ray variations. Extensive research has explored the long-term evolution of the properties of QPOs in BHBs. In contrast, such evolution in active galactic nuclei (AGNs) has remained largely unexplored due to limited observational data. By using the 10 new XMM-Newton ob…

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 11 pages, 4 figures, accepted for publication in ApJ Letters

  36. arXiv:2401.02847  [pdf, other]

    cs.CV cs.GR cs.LG

    Generating Non-Stationary Textures using Self-Rectification

    Authors: Yang Zhou, Rongjun Xiao, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: This paper addresses the challenge of example-based non-stationary texture synthesis. We introduce a novel two-step approach wherein users first modify a reference texture using standard image editing tools, yielding an initial rough target for the synthesis. Subsequently, our proposed method, termed "self-rectification", automatically refines this target into a coherent, seamless texture, while fa…

    Submitted 30 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Project page: https://github.com/xiaorongjun000/Self-Rectification

  37. arXiv:2401.02134  [pdf]

    econ.GN

    Female Entrepreneur on Board: Assessing the Effect of Gender on Corporate Financial Constraints

    Authors: Ruiying Xiao

    Abstract: This study investigates the impact of female leadership on the financial constraints of firms, which are publicly listed entrepreneurial enterprises in China. Utilizing data from 938 companies on the China Growth Enterprise Market (GEM) over a period of 2013-2022, this paper explores how the female presence in CEO positions, senior management, and board membership influences a firm's ability to ma…

    Submitted 4 January, 2024; originally announced January 2024.

  38. arXiv:2312.17120  [pdf, other]

    cs.CL cs.AI cs.LG

    Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math

    Authors: Zengzhi Wang, Rui Xia, Pengfei Liu

    Abstract: High-quality, large-scale corpora are the cornerstone of building foundation models. In this work, we introduce MathPile, a diverse and high-quality math-centric corpus comprising about 9.5 billion tokens. Throughout its creation, we adhered to the principle of "less is more", firmly believing in the supremacy of data quality over quantity, even in the pre-training phase. Our met…

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 37 pages. Working in Progress. https://github.com/GAIR-NLP/MathPile/

  39. arXiv:2312.16330  [pdf]

    physics.optics cond-mat.mes-hall physics.app-ph

    Achieving 100% amplitude modulation depth in a graphene-based tuneable capacitance metamaterial

    Authors: Ruqiao Xia, Nikita W. Almond, Stephen J. Kindness, Sergey A. Mikhailov, Wadood Tadbier, Riccardo Degl'Innocenti, Yuezhen Lu, Abbie Lowe, Ben Ramsay, Lukas A. Jakob, James Dann, Stephan Hofmann, Harvey E. Beere, David A. Ritchie, Wladislaw Michailow

    Abstract: Effective control of terahertz radiation requires the development of efficient and fast modulators with a large modulation depth. This challenge is often tackled by using metamaterials, artificial sub-wavelength optical structures engineered to resonate at the desired terahertz frequency. Metamaterial-based devices exploiting graphene as the active tuneable element have been proven to be a highly…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 16 pages, 5 figures

  40. arXiv:2312.08718  [pdf, other]

    cs.RO

    Trajectory Planning and Tracking of Hybrid Flying-Crawling Quadrotors

    Authors: Dongnan Hu, Ruihao Xia, Xin Jin, Yang Tang

    Abstract: Hybrid Flying-Crawling Quadrotors (HyFCQs) are transformable robots with the ability of terrestrial and aerial hybrid motion. This article presents a trajectory planning and tracking framework designed for HyFCQs. In this framework, a terrestrial-aerial path-searching method with the crawling limitation of HyFCQs is proposed to guarantee the dynamical feasibility of trajectories. Additionally, a t…

    Submitted 14 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

  41. arXiv:2312.07075  [pdf, other]

    cs.RO

    Motion Planning and Control of A Morphing Quadrotor in Restricted Scenarios

    Authors: Guiyang Cui, Ruihao Xia, Xin Jin, Yang Tang

    Abstract: Morphing quadrotors with four external actuators can adapt to different restricted scenarios by changing their geometric structure. However, previous works mainly focus on the improvements in structures and controllers, and existing planning algorithms don't consider the morphological modifications, which leads to safety and dynamic feasibility issues. In this paper, we propose a unified planning…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 8 pages, 9 figures

  42. arXiv:2312.04353  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.supr-con

    Interface-Induced Superconductivity in Magnetic Topological Insulator-Iron Chalcogenide Heterostructures

    Authors: Hemian Yi, Yi-Fan Zhao, Ying-Ting Chan, Jiaqi Cai, Ruobing Mei, Xianxin Wu, Zi-Jie Yan, Ling-Jie Zhou, Ruoxi Zhang, Zihao Wang, Stephen Paolini, Run Xiao, Ke Wang, Anthony R. Richardella, John Singleton, Laurel E. Winter, Thomas Prokscha, Zaher Salman, Andreas Suter, Purnima P. Balakrishnan, Alexander J. Grutter, Moses H. W. Chan, Nitin Samarth, Xiaodong Xu, Weida Wu, et al. (2 additional authors not shown)

    Abstract: When two different electronic materials are brought together, the resultant interface often shows unexpected quantum phenomena, including interfacial superconductivity and Fu-Kane topological superconductivity (TSC). Here, we use molecular beam epitaxy (MBE) to synthesize heterostructures formed by stacking together two magnetic materials, a ferromagnetic topological insulator (TI) and an antiferr…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 14 pages, 4 figures. Accepted by Science. Comments are welcome

  43. arXiv:2312.03582  [pdf]

    physics.app-ph

    The Extended Resonant Modal Theory and Its Applications

    Authors: Ruqi Xiao, Wen Geyi, Guo Yang, Wen Wu

    Abstract: In this paper, we extend the resonant modal theory (RMT) developed previously for a metal object to an arbitrary source region consisting of metals, dielectrics, or the combination of both. The influences of dielectrics on the fields are replaced by equivalent volume sources through the use of the compensation theorem in electromagnetic theory. The resonant frequencies can be determined by finding…

    Submitted 6 December, 2023; originally announced December 2023.

  44. arXiv:2312.02300  [pdf]

    cs.LG eess.SP

    Reconsideration on evaluation of machine learning models in continuous monitoring using wearables

    Authors: Cheng Ding, Zhicheng Guo, Cynthia Rudin, Ran Xiao, Fadi B Nahab, Xiao Hu

    Abstract: This paper explores the challenges in evaluating machine learning (ML) models for continuous health monitoring using wearable devices beyond conventional metrics. We state the complexities posed by real-world variability, disease dynamics, user-specific characteristics, and the prevalence of false notifications, necessitating novel evaluation strategies. Drawing insights from large-scale heart stu…

    Submitted 4 December, 2023; originally announced December 2023.

  45. arXiv:2311.18399  [pdf, other]

    eess.AS cs.SD

    Audio Prompt Tuning for Universal Sound Separation

    Authors: Yuzhuo Liu, Xubo Liu, Yan Zhao, Yuanyuan Wang, Rui Xia, Jinchuan Tain, Yuxuan Wang

    Abstract: Universal sound separation (USS) is a task to separate arbitrary sounds from an audio mixture. Existing USS systems are capable of separating arbitrary sources, given a few examples of the target sources as queries. However, separating arbitrary sounds with a single system is challenging, and the robustness is not always guaranteed. In this work, we propose audio prompt tuning (APT), a simple yet…

    Submitted 30 November, 2023; originally announced November 2023.

  46. arXiv:2311.15614  [pdf, other]

    cs.CL

    FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models

    Authors: Ruixuan Xiao, Yiwen Dong, Junbo Zhao, Runze Wu, Minmin Lin, Gang Chen, Haobo Wang

    Abstract: Collecting high-quality labeled data for model training is notoriously time-consuming and labor-intensive for various NLP tasks. While copious solutions, such as active learning for small language models (SLMs) and prevalent in-context learning in the era of large language models (LLMs), have been proposed and alleviate the labeling burden to some extent, their performances are still subject to hu…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023 (Main conference)

  47. In-Context Learning for Knowledge Base Question Answering for Unmanned Systems based on Large Language Models

    Authors: Yunlong Chen, Yaming Zhang, Jianfei Yu, Li Yang, Rui Xia

    Abstract: Knowledge Base Question Answering (KBQA) aims to answer factoid questions based on knowledge bases. However, generating the most appropriate knowledge base query code based on Natural Language Questions (NLQ) poses a significant challenge in KBQA. In this work, we focus on the CCKS2023 Competition of Question Answering with Knowledge Graph Inference for Unmanned Systems. Inspired by the recent suc…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Runner up of the CCKS 2023 question answering with knowledge graph inference for unmanned systems evaluation task, accepted as an evaluation paper

    ACM Class: I.2.7

  48. arXiv:2310.17237  [pdf, other]

    math.OC

    A Unified Framework for Rank-based Loss Minimization

    Authors: Rufeng Xiao, Yuze Ge, Rujun Jiang, Yifan Yan

    Abstract: The empirical loss, commonly referred to as the average loss, is extensively utilized for training machine learning models. However, in order to address the diverse performance requirements of machine learning models, the use of the rank-based loss is prevalent, replacing the empirical loss in many cases. The rank-based loss comprises a weighted sum of sorted individual losses, encompassing both c…

    Submitted 3 January, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: conference

  49. arXiv:2310.16780  [pdf, ps, other]

    math.DS

    Pointwise convergence of some continuous-time polynomial ergodic averages

    Authors: Wen Huang, Song Shao, Rongzhong Xiao

    Abstract: In this paper, we study the pointwise convergence of some continuous-time polynomial ergodic averages. Our method is based on the topological models of measurable flows. One of the main results of the paper is as follows. Let $(X,\mathcal{X},\mu, (T^{t})_{t\in \mathbb{R}})$ and $(X,\mathcal{X},\mu, (S^{t})_{t\in \mathbb{R}})$ be two measurable flows, $a\in \mathbb{Q}$, and $Q\in \mathbb{R}[t]$ with…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 46 pages

  50. arXiv:2310.14155  [pdf]

    eess.SP

    Photoplethysmography based atrial fibrillation detection: an updated review from July 2019

    Authors: Cheng Ding, Ran Xiao, Weijia Wang, Elizabeth Holdsworth, Xiao Hu

    Abstract: Atrial fibrillation (AF) is a prevalent cardiac arrhythmia associated with significant health ramifications, including an elevated susceptibility to ischemic stroke, heart disease, and heightened mortality. Photoplethysmography (PPG) has emerged as a promising technology for continuous AF monitoring for its cost-effectiveness and widespread integration into wearable devices. Our team previously co…

    Submitted 21 October, 2023; originally announced October 2023.