Showing 51–100 of 549 results for author: Bian, J

  1. arXiv:2312.10324 [pdf, other]

    cs.LG cs.CV

    Federated Learning with Instance-Dependent Noisy Label

    Authors: Lei Wang, Jieming Bian, Jie Xu

    Abstract: Federated learning (FL) with noisy labels poses a significant challenge. Existing methods designed for handling noisy labels in centralized learning tend to lose their effectiveness in the FL setting, mainly due to the small dataset size and the heterogeneity of client data. While some attempts have been made to tackle FL with noisy labels, they primarily focused on scenarios involving class-condi…

    Submitted 9 January, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  2. arXiv:2312.07899 [pdf]

    q-bio.QM cs.AI cs.CV cs.LG

    Morphological Profiling for Drug Discovery in the Era of Deep Learning

    Authors: Qiaosi Tang, Ranjala Ratnayake, Gustavo Seabra, Zhe Jiang, Ruogu Fang, Lina Cui, Yousong Ding, Tamer Kahveci, Jiang Bian, Chenglong Li, Hendrik Luesch, Yanjun Li

    Abstract: Morphological profiling is a valuable tool in phenotypic drug discovery. The advent of high-throughput automated imaging has enabled the capturing of a wide range of morphological features of cells or organisms in response to perturbations at the single-cell resolution. Concurrently, significant advances in machine learning and deep learning, especially in computer vision, have led to substantial…

    Submitted 15 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 44 pages, 5 figures, 5 tables

  3. arXiv:2312.06099 [pdf]

    cs.CL

    Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need

    Authors: Cheng Peng, Xi Yang, Aokun Chen, Zehao Yu, Kaleb E Smith, Anthony B Costa, Mona G Flores, Jiang Bian, Yonghui Wu

    Abstract: Objective: To solve major clinical natural language processing (NLP) tasks using a unified text-to-text learning architecture based on a generative large language model (LLM) via prompt tuning. Methods: We formulated 7 key clinical NLP tasks as text-to-text learning and solved them using one unified generative clinical LLM, GatorTronGPT, developed using GPT-3 architecture and trained with up to 20 b…

    Submitted 10 December, 2023; originally announced December 2023.

  4. arXiv:2312.03130 [pdf, other]

    hep-ex physics.ins-det

    The DUNE Far Detector Vertical Drift Technology, Technical Design Report

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, et al. (1304 additional authors not shown)

    Abstract: DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precisi…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 425 pages; 281 figures. Central editing team: A. Heavey, S. Kettell, A. Marchionni, S. Palestini, S. Rajogopalan, R. J. Wilson

    Report number: Fermilab Report no: TM-2813-LBNF

  5. arXiv:2312.01637 [pdf]

    physics.ao-ph physics.geo-ph

    Near-real-time monitoring of global ocean carbon sink

    Authors: Piyu Ke, Xiaofan Gui, Wei Cao, Dezhi Wang, Ce Hou, Lixing Wang, Xuanren Song, Yun Li, Biqing Zhu, Jiang Bian, Stephen Sitch, Philippe Ciais, Pierre Friedlingstein, Zhu Liu

    Abstract: Mitigation of climate change will highly rely on a carbon emission trajectory that achieves carbon neutrality by the 2050s. The ocean plays a critical role in modulating climate change by sequestering CO2 from the atmosphere. Relying on the multidisciplinary cutting-edge methodologies and technologies, the near-real-time monitoring of global ocean carbon sinks from January 2022 to July 2023 aims t…

    Submitted 4 December, 2023; originally announced December 2023.

  6. arXiv:2312.00568 [pdf, ps, other]

    eess.SP

    A WINNER+ Based 3-D Non-Stationary Wideband MIMO Channel Model

    Authors: Ji Bian, Jian Sun, Cheng-Xiang Wang, Rui Feng, Jie Huang, Yang Yang, Minggao Zhang

    Abstract: In this paper, a three-dimensional (3-D) non-stationary wideband multiple-input multiple-output (MIMO) channel model based on the WINNER+ channel model is proposed. The angular distributions of clusters in both the horizontal and vertical planes are jointly considered. The receiver and clusters can be moving, which makes the model more general. Parameters including number of clusters, powers, dela…

    Submitted 1 December, 2023; originally announced December 2023.

  7. arXiv:2311.15230 [pdf, other]

    cs.CV cs.MM

    GAIA: Zero-shot Talking Avatar Generation

    Authors: Tianyu He, Junliang Guo, Runyi Yu, Yuchi Wang, Jialiang Zhu, Kaikai An, Leyi Li, Xu Tan, Chunyu Wang, Han Hu, HsiangTao Wu, Sheng Zhao, Jiang Bian

    Abstract: Zero-shot talking avatar generation aims at synthesizing natural talking videos from speech and a single portrait image. Previous methods have relied on domain-specific heuristics such as warping-based motion representation and 3D Morphable Models, which limit the naturalness and diversity of the generated avatars. In this work, we introduce GAIA (Generative AI for Avatar), which eliminates the do…

    Submitted 14 March, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: ICLR 2024. Project page: https://microsoft.github.io/GAIA/

  8. arXiv:2311.13208 [pdf, other]

    physics.app-ph cond-mat.mtrl-sci

    Electrified Fracture of Nanotube Films

    Authors: Jinbo Bian, Shijun Wang, Zhaokuan Yu, Zhong Zhang, Zhiping Xu

    Abstract: Strong and conductive carbon nanotube films are ideal candidates for lightning-strike protection. Understanding their failure mechanisms by considering the anisotropic and single-fiber nature is essential to improve performance. Our experimental studies show that the single-layer, nanometer-thick films fail under electrification by crack nucleation and propagation, reminiscent of brittle and ducti…

    Submitted 22 November, 2023; originally announced November 2023.

    Journal ref: Physical Review Materials 8 (2), 026001, 2024

  9. arXiv:2311.08896 [pdf, other]

    cs.CL

    HeLM: Highlighted Evidence augmented Language Model for Enhanced Table-to-Text Generation

    Authors: Junyi Bian, Xiaolei Qin, Wuhe Zou, Mengzuo Huang, Congyi Luo, Ke Zhang, Weidong Zhang

    Abstract: Large models have demonstrated significant progress across various domains, particularly in tasks related to text generation. In the domain of Table to Text, many Large Language Model (LLM)-based methods currently resort to modifying prompts to invoke public APIs, incurring potential costs and information leaks. With the advent of open-source large models, fine-tuning LLMs has become feasible. In…

    Submitted 27 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  10. arXiv:2311.07835 [pdf, other]

    hep-ex hep-ph

    Expanding neutrino oscillation parameter measurements in NOvA using a Bayesian approach

    Authors: NOvA Collaboration, M. A. Acero, B. Acharya, P. Adamson, N. Anfimov, A. Antoshkin, E. Arrieta-Diaz, L. Asquith, A. Aurisano, A. Back, N. Balashov, P. Baldi, B. A. Bambah, A. Bat, K. Bays, R. Bernstein, T. J. C. Bezerra, V. Bhatnagar, D. Bhattarai, B. Bhuyan, J. Bian, A. C. Booth, R. Bowles, B. Brahma, C. Bromberg, et al. (174 additional authors not shown)

    Abstract: NOvA is a long-baseline neutrino oscillation experiment that measures oscillations in charged-current $ν_μ \rightarrow ν_μ$ (disappearance) and $ν_μ \rightarrow ν_{e}$ (appearance) channels, and their antineutrino counterparts, using neutrinos of energies around 2 GeV over a distance of 810 km. In this work we reanalyze the dataset first examined in our previous paper [Phys. Rev. D 106, 032004 (20…

    Submitted 27 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 20 pages, 17 figures; version accepted by Phys. Rev. D. Data associated with this paper is available at https://doi.org/10.15484/2349444

    Report number: FERMILAB-PUB-23-667-AD-CSAID-ND

  11. Atmospheric neutrino oscillation analysis with neutron tagging and an expanded fiducial volume in Super-Kamiokande I-V

    Authors: Super-Kamiokande Collaboration, T. Wester, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Sato, H. Sekiya, et al. (212 additional authors not shown)

    Abstract: We present a measurement of neutrino oscillation parameters with the Super-Kamiokande detector using atmospheric neutrinos from the complete pure-water SK I-V (April 1996-July 2020) data set, including events from an expanded fiducial volume. The data set corresponds to 6511.3 live days and an exposure of 484.2 kiloton-years. Measurements of the neutrino oscillation parameters $Δm^2_{32}$,…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 24 pages, 18 figures

  12. arXiv:2311.03842 [pdf, ps, other]

    hep-ex

    Measurement of the neutrino-oxygen neutral-current quasielastic cross section using atmospheric neutrinos in the SK-Gd experiment

    Authors: S. Sakai, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Sato, H. Sekiya, H. Shiba, K. Shimizu, et al. (211 additional authors not shown)

    Abstract: We report the first measurement of the atmospheric neutrino-oxygen neutral-current quasielastic (NCQE) cross section in the gadolinium-loaded Super-Kamiokande (SK) water Cherenkov detector. In June 2020, SK began a new experimental phase, named SK-Gd, by loading 0.011% by mass of gadolinium into the ultrapure water of the SK detector. The introduction of gadolinium to ultrapure water has the effec…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 8 pages, 3 figures

  13. arXiv:2311.03615 [pdf, other]

    cs.LG cs.DC

    CAFE: Carbon-Aware Federated Learning in Geographically Distributed Data Centers

    Authors: Jieming Bian, Lei Wang, Shaolei Ren, Jie Xu

    Abstract: Training large-scale artificial intelligence (AI) models demands significant computational power and energy, leading to increased carbon footprint with potential environmental repercussions. This paper delves into the challenges of training AI models across geographically distributed (geo-distributed) data centers, emphasizing the balance between learning performance and carbon footprint. We consi…

    Submitted 5 February, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Preprint, Experiments Updated

  14. arXiv:2311.01797 [pdf, other]

    cs.LG stat.ML

    On the Generalization Properties of Diffusion Models

    Authors: Puheng Li, Zhong Li, Huishuai Zhang, Jiang Bian

    Abstract: Diffusion models are a class of generative models that serve to establish a stochastic transport map between an empirically observed, yet unknown, target distribution and a known prior. Despite their remarkable success in real-world applications, a theoretical understanding of their generalization capabilities remains underdeveloped. This work embarks on a comprehensive theoretical exploration of…

    Submitted 12 January, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 42 pages, 11 figures

  15. arXiv:2311.01159 [pdf, other]

    hep-ex

    Search for Periodic Time Variations of the Solar $^8$B Neutrino Flux between 1996 and 2018 in Super-Kamiokande

    Authors: K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Sato, H. Sekiya, H. Shiba, K. Shimizu, M. Shiozawa, et al. (211 additional authors not shown)

    Abstract: We report a search for time variations of the solar $^8$B neutrino flux using 5804 live days of Super-Kamiokande data collected between May 31, 1996, and May 30, 2018. Super-Kamiokande measured the precise time of each solar neutrino interaction over 22 calendar years to search for solar neutrino flux modulations with unprecedented precision. Periodic modulations are searched for in a dataset comp…

    Submitted 6 June, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 8 pages, 5 figures, 2 tables, and data file: "sksolartimevariation5804d.txt" (the data file updated with additional 3 columns -- R^2 correction, upper-error, lower-error)

    Journal ref: Phys.Rev.Lett 132, 241803 (2024)

  16. arXiv:2310.14714 [pdf, other]

    cs.LG cs.AI

    BatteryML:An Open-source platform for Machine Learning on Battery Degradation

    Authors: Han Zhang, Xiaofan Gui, Shun Zheng, Ziheng Lu, Yuqi Li, Jiang Bian

    Abstract: Battery degradation remains a pivotal concern in the energy storage domain, with machine learning emerging as a potent tool to drive forward insights and solutions. However, this intersection of electrochemical science and machine learning poses complex challenges. Machine learning experts often grapple with the intricacies of battery science, while battery researchers face hurdles in adapting int…

    Submitted 3 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    MSC Class: 68T05

    Journal ref: International Conference on Learning Representations (ICLR) 2024

  17. arXiv:2310.11954 [pdf, other]

    cs.CL cs.MM eess.AS

    MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

    Authors: Dingyao Yu, Kaitao Song, Peiling Lu, Tianyu He, Xu Tan, Wei Ye, Shikun Zhang, Jiang Bian

    Abstract: AI-empowered music processing is a diverse field that encompasses dozens of tasks, ranging from generation tasks (e.g., timbre synthesis) to comprehension tasks (e.g., music classification). For developers and amateurs, it is very difficult to grasp all of these tasks to satisfy their requirements in music processing, especially considering the huge differences in the representations of music data…

    Submitted 25 October, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  18. arXiv:2310.11249 [pdf, other]

    cs.AI q-fin.GN

    Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle

    Authors: Xu Yang, Xiao Yang, Weiqing Liu, Jinhui Li, Peng Yu, Zeqi Ye, Jiang Bian

    Abstract: In the wake of relentless digital transformation, data-driven solutions are emerging as powerful tools to address multifarious industrial tasks such as forecasting, anomaly detection, planning, and even complex decision-making. Although data-centric R&D has been pivotal in harnessing these solutions, it often comes with significant costs in terms of human, computational, and time resources. This p…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 29 pages, 11 figures

  19. arXiv:2310.07449 [pdf, other]

    cs.CV

    PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction

    Authors: Jia-Wang Bian, Wenjing Bian, Victor Adrian Prisacariu, Philip Torr

    Abstract: Neural surface reconstruction is sensitive to the camera pose noise, even if state-of-the-art pose estimators like COLMAP or ARKit are used. More importantly, existing Pose-NeRF joint optimisation methods have struggled to improve pose accuracy in challenging real-world scenarios. To overcome the challenges, we introduce the pose residual field (PoRF), a novel implicit representation that uses an…

    Submitted 12 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024. Find the project page at https://porf.active.vision/

  20. arXiv:2310.07446 [pdf, other]

    cs.LG

    ProbTS: Benchmarking Point and Distributional Forecasting across Diverse Prediction Horizons

    Authors: Jiawen Zhang, Xumeng Wen, Zhenwei Zhang, Shun Zheng, Jia Li, Jiang Bian

    Abstract: Delivering precise point and distributional forecasts across a spectrum of prediction horizons represents a significant and enduring challenge in the application of time-series forecasting within various industries. Prior research on developing deep learning models for time-series forecasting has often concentrated on isolated aspects, such as long-term point forecasting or short-term probabilisti…

    Submitted 17 June, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Preprint

  21. arXiv:2310.07402 [pdf, other]

    cs.LG cs.AI

    NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time Series Pretraining

    Authors: Chenguo Lin, Xumeng Wen, Wei Cao, Congrui Huang, Jiang Bian, Stephen Lin, Zhirong Wu

    Abstract: Recent research on time-series self-supervised models shows great promise in learning semantic representations. However, it has been limited to small-scale datasets, e.g., thousands of temporal sequences. In this work, we make key technical contributions that are tailored to the numerical properties of time-series data and allow the model to scale to large datasets, e.g., millions of temporal sequ…

    Submitted 12 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  22. arXiv:2310.07338 [pdf, other]

    cs.LG

    Towards Foundation Models for Learning on Tabular Data

    Authors: Han Zhang, Xumeng Wen, Shun Zheng, Wei Xu, Jiang Bian

    Abstract: Learning on tabular data underpins numerous real-world applications. Despite considerable efforts in developing effective learning models for tabular data, current transferable tabular models remain in their infancy, limited by either the lack of support for direct instruction following in new tasks or the neglect of acquiring foundational knowledge and capabilities from diverse tabular datasets.…

    Submitted 22 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  23. arXiv:2310.07321 [pdf, other]

    cs.CL cs.AI cs.LG

    On the Impact of Cross-Domain Data on German Language Models

    Authors: Amin Dada, Aokun Chen, Cheng Peng, Kaleb E Smith, Ahmad Idrissi-Yaghir, Constantin Marc Seibold, Jianning Li, Lars Heiliger, Xi Yang, Christoph M. Friedrich, Daniel Truhn, Jan Egger, Jiang Bian, Jens Kleesiek, Yonghui Wu

    Abstract: Traditionally, large language models have been either trained on general web crawls or domain-specific data. However, recent successes of generative large language models have shed light on the benefits of cross-domain datasets. To examine the significance of prioritizing data diversity over quality, we present a German dataset comprising texts from five domains, along with another dataset aimed…

    Submitted 13 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 13 pages, 1 figure, accepted at Findings of the Association for Computational Linguistics: EMNLP 2023

  24. Model Tuning or Prompt Tuning? A Study of Large Language Models for Clinical Concept and Relation Extraction

    Authors: Cheng Peng, Xi Yang, Kaleb E Smith, Zehao Yu, Aokun Chen, Jiang Bian, Yonghui Wu

    Abstract: Objective: To develop soft prompt-based learning algorithms for large language models (LLMs), examine the shape of prompts, prompt-tuning using frozen/unfrozen LLMs, transfer learning, and few-shot learning abilities. Methods: We developed a soft prompt-based LLM model and compared 4 training strategies including (1) fine-tuning without prompts; (2) hard-prompt with unfrozen LLMs; (3) soft-prompt wi…

    Submitted 9 October, 2023; originally announced October 2023.

    Journal ref: Journal of Biomedical Informatics. Volume 153, May 2024, 104630

  25. arXiv:2310.05052 [pdf, other]

    eess.SP cs.AI cs.LG

    Accurate battery lifetime prediction across diverse aging conditions with deep learning

    Authors: Han Zhang, Yuqi Li, Shun Zheng, Ziheng Lu, Xiaofan Gui, Wei Xu, Jiang Bian

    Abstract: Accurately predicting the lifetime of battery cells in early cycles holds tremendous value for battery research and development as well as numerous downstream applications. This task is rather challenging because diverse conditions, such as electrode materials, operating conditions, and working environments, collectively determine complex capacity-degradation behaviors. However, current prediction…

    Submitted 24 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  26. arXiv:2310.04134 [pdf, other]

    cs.CV

    TiC: Exploring Vision Transformer in Convolution

    Authors: Song Zhang, Qingzhong Wang, Jiang Bian, Haoyi Xiong

    Abstract: While models derived from Vision Transformers (ViTs) have been surging phenomenally, pre-trained models cannot seamlessly adapt to arbitrary resolution images without altering the architecture and configuration, such as sampling the positional encoding, limiting their flexibility for various vision tasks. For instance, the Segment Anything Model (SAM) based on ViT-Huge requires all input images to…

    Submitted 27 May, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  27. arXiv:2310.00704 [pdf, other]

    cs.SD eess.AS

    UniAudio: An Audio Foundation Model Toward Universal Audio Generation

    Authors: Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen Meng

    Abstract: Large Language models (LLM) have demonstrated the capability to handle a variety of generative tasks. This paper presents the UniAudio system, which, unlike prior task-specific approaches, leverages LLM techniques to generate multiple types of audio (including speech, sounds, music, and singing) with given input conditions. UniAudio 1) first tokenizes all types of target audio along with other con…

    Submitted 11 December, 2023; v1 submitted 1 October, 2023; originally announced October 2023.

  28. arXiv:2309.15074 [pdf, other]

    cs.CL cs.AI cs.HC cs.NI

    Natural Language based Context Modeling and Reasoning for Ubiquitous Computing with Large Language Models: A Tutorial

    Authors: Haoyi Xiong, Jiang Bian, Sijia Yang, Xiaofei Zhang, Linghe Kong, Daqing Zhang

    Abstract: Large language models (LLMs) have surged phenomenally since 2018, two decades after context-awareness was introduced into computing systems. By taking into account the situations of ubiquitous devices, users and the societies, context-aware computing has enabled a wide spectrum of innovative applications, such as assisted living, location-based social network services and so on. To reco…

    Submitted 26 December, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Under review

  29. arXiv:2309.12278 [pdf, other]

    cs.CL

    Inspire the Large Language Model by External Knowledge on BioMedical Named Entity Recognition

    Authors: Junyi Bian, Jiaxuan Zheng, Yuyi Zhang, Shanfeng Zhu

    Abstract: Large language models (LLMs) have demonstrated dominating performance in many NLP tasks, especially on generative tasks. However, they often fall short in some information extraction tasks, particularly those requiring domain-specific knowledge, such as Biomedical Named Entity Recognition (NER). In this paper, inspired by Chain-of-thought, we leverage the LLM to solve the Biomedical NER step-by-st…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 10 pages, 5 figures

  30. arXiv:2309.08532 [pdf, other]

    cs.CL cs.AI

    Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

    Authors: Qingyan Guo, Rui Wang, Junliang Guo, Bei Li, Kaitao Song, Xu Tan, Guoqing Liu, Jiang Bian, Yujiu Yang

    Abstract: Large Language Models (LLMs) excel in various tasks, but they rely on carefully crafted prompts that often demand substantial human effort. To automate this process, in this paper, we propose a novel framework for discrete prompt optimization, called EvoPrompt, which borrows the idea of evolutionary algorithms (EAs) as they exhibit good performance and fast convergence. To enable EAs to work on di…

    Submitted 27 February, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: International Conference on Learning Representations (ICLR) 2024

  31. Experimental demonstration of enhanced violations of Leggett-Garg inequalities in a $\mathcal{PT}$-symmetric trapped-ion qubit

    Authors: Pengfei Lu, Xinxin Rao, Teng Liu, Yang Liu, Ji Bian, Feng Zhu, Le Luo

    Abstract: The Leggett-Garg inequality (LGI) places a bound for the distinction between quantum systems and classical systems. Although tests of temporal quantum correlations on LGIs have been studied in the Hermitian realm, there are still unknowns for LGIs in non-Hermitian conditions due to the interplay between dissipation and coherence. For example, a theoretical hypothesis to be experimentally valid…

    Submitted 13 September, 2023; originally announced September 2023.

    Journal ref: Physical Review A 109, 042205 (2024)

  32. arXiv:2309.02467 [pdf]

    cs.LG cs.CY

    Developing A Fair Individualized Polysocial Risk Score (iPsRS) for Identifying Increased Social Risk of Hospitalizations in Patients with Type 2 Diabetes (T2D)

    Authors: Yu Huang, Jingchuan Guo, William T Donahoo, Zhengkang Fan, Ying Lu, Wei-Han Chen, Huilin Tang, Lori Bilello, Elizabeth A Shenkman, Jiang Bian

    Abstract: Background: Racial and ethnic minority groups and individuals facing social disadvantages, which often stem from their social determinants of health (SDoH), bear a disproportionate burden of type 2 diabetes (T2D) and its complications. It is therefore crucial to implement effective social risk management strategies at the point of care. Objective: To develop an EHR-based machine learning (ML) anal…

    Submitted 4 September, 2023; originally announced September 2023.

  33. arXiv:2309.02285 [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    PromptTTS 2: Describing and Generating Voices with Text Prompt

    Authors: Yichong Leng, Zhifang Guo, Kai Shen, Xu Tan, Zeqian Ju, Yanqing Liu, Yufei Liu, Dongchao Yang, Leying Zhang, Kaitao Song, Lei He, Xiang-Yang Li, Sheng Zhao, Tao Qin, Jiang Bian

    Abstract: Speech conveys more information than text, as the same word can be uttered in various voices to convey diverse information. Compared to traditional text-to-speech (TTS) methods relying on speech prompts (reference speech) for voice variability, using text prompts (descriptions) is more user-friendly since speech prompts can be hard to find or may not exist at all. TTS approaches based on the text…

    Submitted 11 October, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Demo page: https://speechresearch.github.io/prompttts2

  34. arXiv:2309.01935 [pdf]

    stat.AP

    The impact of electronic health records (EHR) data continuity on prediction model fairness and racial-ethnic disparities

    Authors: Yu Huang, Jingchuan Guo, Zhaoyi Chen, Jie Xu, William T Donahoo, Olveen Carasquillo, Hrushyang Adloori, Jiang Bian, Elizabeth A Shenkman

    Abstract: Electronic health records (EHR) data have considerable variability in data completeness across sites and patients. Lack of "EHR data-continuity" or "EHR data-discontinuity", defined as "having medical information recorded outside the reach of an EHR system" can lead to a substantial amount of information bias. The objective of this study was to comprehensively evaluate (1) how EHR data-discontinui…

    Submitted 4 September, 2023; originally announced September 2023.

  35. arXiv:2308.12575 [pdf, other]

    cs.LG

    Hypergraph Convolutional Networks for Fine-grained ICU Patient Similarity Analysis and Risk Prediction

    Authors: Yuxi Liu, Zhenhao Zhang, Shaowen Qin, Flora D. Salim, Antonio Jimeno Yepes, Jun Shen, Jiang Bian

    Abstract: The Intensive Care Unit (ICU) is one of the most important parts of a hospital, which admits critically ill patients and provides continuous monitoring and treatment. Various patient outcome prediction methods have been attempted to assist healthcare professionals in clinical decision-making. Existing methods focus on measuring the similarity between patients using deep neural networks to capture…

    Submitted 21 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 16 pages, 2 figures

  36. arXiv:2308.08135 [pdf, other]

    q-fin.ST cs.LG

    Microstructure-Empowered Stock Factor Extraction and Utilization

    Authors: Xianfeng Jiao, Zizhong Li, Chang Xu, Yang Liu, Weiqing Liu, Jiang Bian

    Abstract: High-frequency quantitative investment is a crucial aspect of stock investment. Notably, order flow data plays a critical role as it provides the most detailed level of information among high-frequency trading data, including comprehensive data from the order book and transaction records at the tick level. The order flow data is extremely valuable for market analysis as it equips traders with esse…

    Submitted 15 August, 2023; originally announced August 2023.

  37. arXiv:2308.04313 [pdf]

    cs.AI cs.GR cs.HC

    Apple Vision Pro for Healthcare: "The Ultimate Display"? -- Entering the Wonderland of Precision Medicine

    Authors: Jan Egger, Christina Gsaxner, Xiaojun Chen, Jiang Bian, Jens Kleesiek, Behrus Puladi

    Abstract: At the Worldwide Developers Conference (WWDC) in June 2023, Apple introduced the Vision Pro. The Vision Pro is a Mixed Reality (MR) headset, more specifically it is a Virtual Reality (VR) device with an additional Video See-Through (VST) capability. The VST capability turns the Vision Pro also into an Augmented Reality (AR) device. The AR feature is enabled by streaming the real world via cameras…

    Submitted 10 October, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: This is a Preprint under CC BY. This work was supported by NIH/NIAID R01AI172875, NIH/NCATS UL1 TR001427, the REACT-EU project KITE and enFaced 2.0 (FWF KLI 1044). B. Puladi was funded by the Medical Faculty of the RWTH Aachen University as part of the Clinician Scientist Program. C. Gsaxner was funded by the Advanced Research Opportunities Program from the RWTH Aachen University

  38. arXiv:2308.03028 [pdf, other]

    cs.AI

    Pre-Trained Large Language Models for Industrial Control

    Authors: Lei Song, Chuheng Zhang, Li Zhao, Jiang Bian

    Abstract: For industrial control, developing high-performance controllers with few samples and low technical debt is appealing. Foundation models, possessing rich prior knowledge obtained from pre-training with Internet-scale corpus, have the potential to be a good controller with proper prompts. In this paper, we take HVAC (Heating, Ventilation, and Air Conditioning) building control as an example to exami…

    Submitted 6 August, 2023; originally announced August 2023.

  39. arXiv:2307.15715  [pdf

    cs.CL cs.AI

    Improving Primary Healthcare Workflow Using Extreme Summarization of Scientific Literature Based on Generative AI

    Authors: Gregor Stiglic, Leon Kopitar, Lucija Gosak, Primoz Kocbek, Zhe He, Prithwish Chakraborty, Pablo Meyer, Jiang Bian

    Abstract: Primary care professionals struggle to keep up to date with the latest scientific literature critical in guiding evidence-based practice related to their daily work. To help solve the above-mentioned problem, we employed generative artificial intelligence techniques based on large-scale language models to summarize abstracts of scientific papers. Our objective is to investigate the potential of ge… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 5 pages, 5 figures

    MSC Class: 68T50 ACM Class: I.2.7

  40. arXiv:2307.12987  [pdf, other

    cs.MA

    Efficient Behavior-consistent Calibration for Multi-agent Market Simulation

    Authors: Tianlang He, Keyan Lu, Chang Xu, Yang Liu, Weiqing Liu, S. -H. Gary Chan, Jiang Bian

    Abstract: Order-driven market simulation mimics the trader behaviors to generate order streams to support interactive studies of financial strategies. In market simulator, the multi-agent approach is commonly adopted due to its explainability. Existing multi-agent systems employ heuristic search to generate order streams, which is inefficient for large-scale simulation. Furthermore, the search-based behavio… ▽ More

    Submitted 5 June, 2023; originally announced July 2023.

  41. arXiv:2307.03119  [pdf, other

    cs.AI cs.LG cs.MA

    Learning Multi-Agent Intention-Aware Communication for Optimal Multi-Order Execution in Finance

    Authors: Yuchen Fang, Zhenggang Tang, Kan Ren, Weiqing Liu, Li Zhao, Jiang Bian, Dongsheng Li, Weinan Zhang, Yong Yu, Tie-Yan Liu

    Abstract: Order execution is a fundamental task in quantitative finance, aiming at finishing acquisition or liquidation for a number of trading orders of the specific assets. Recent advance in model-free reinforcement learning (RL) provides a data-driven solution to the order execution problem. However, the existing works always optimize execution for an individual order, overlooking the practice that multi… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted in KDD 2023; The website is at https://seqml.github.io/marl4fin

  42. arXiv:2307.01229  [pdf, other

    cs.SD cs.AI cs.LG cs.MM eess.AS

    EmoGen: Eliminating Subjective Bias in Emotional Music Generation

    Authors: Chenfei Kang, Peiling Lu, Botao Yu, Xu Tan, Wei Ye, Shikun Zhang, Jiang Bian

    Abstract: Music is used to convey emotions, and thus generating emotional music is important in automatic music generation. Previous work on emotional music generation directly uses annotated emotion labels as control signals, which suffers from subjective bias: different people may annotate different emotions on the same music, and one person may feel different emotions under different situations. Therefor… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 12 pages, 7 pages

  43. arXiv:2306.15736  [pdf, other

    cs.CL

    DMNER: Biomedical Entity Recognition by Detection and Matching

    Authors: Junyi Bian, Rongze Jiang, Weiqi Zhai, Tianyang Huang, Hong Zhou, Shanfeng Zhu

    Abstract: Biomedical named entity recognition (BNER) serves as the foundation for numerous biomedical text mining tasks. Unlike general NER, BNER requires a comprehensive grasp of the domain, and incorporating external knowledge beyond training data poses a significant challenge. In this study, we propose a novel BNER framework called DMNER. By leveraging existing entity representation models SAPBERT, we tac… ▽ More

    Submitted 5 July, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 9 pages content, 2 pages appendix

  44. Warpformer: A Multi-scale Modeling Approach for Irregular Clinical Time Series

    Authors: Jiawen Zhang, Shun Zheng, Wei Cao, Jiang Bian, Jia Li

    Abstract: Irregularly sampled multivariate time series are ubiquitous in various fields, particularly in healthcare, and exhibit two key characteristics: intra-series irregularity and inter-series discrepancy. Intra-series irregularity refers to the fact that time-series signals are often recorded at irregular intervals, while inter-series discrepancy refers to the significant variability in sampling rates… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: KDD23 Research Track

  45. arXiv:2306.07542  [pdf, other

    cs.AI

    A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management

    Authors: Xianliang Yang, Zhihao Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Jiang Bian

    Abstract: Multi-agent reinforcement learning (MARL) models multiple agents that interact and learn within a shared environment. This paradigm is applicable to various industrial scenarios such as autonomous driving, quantitative trading, and inventory management. However, applying MARL to these real-world scenarios is impeded by many challenges such as scaling up, complex agent interactions, and non-station… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  46. arXiv:2306.04212  [pdf, other

    cs.LG cs.CY

    Migrate Demographic Group For Fair GNNs

    Authors: YanMing Hu, TianChi Liao, JiaLong Chen, Jing Bian, ZiBin Zheng, Chuan Chen

    Abstract: Graph Neural networks (GNNs) have been applied in many scenarios due to the superior performance of graph learning. However, fairness is always ignored when designing GNNs. As a consequence, biased information in training data can easily affect vanilla GNNs, causing biased results toward particular demographic groups (divided by sensitive attributes, such as race and age). There have been efforts… ▽ More

    Submitted 23 March, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  47. arXiv:2306.03680  [pdf, other

    cs.LG

    Mildly Constrained Evaluation Policy for Offline Reinforcement Learning

    Authors: Linjie Xu, Zhengyao Jiang, Jinyu Wang, Lei Song, Jiang Bian

    Abstract: Offline reinforcement learning (RL) methodologies enforce constraints on the policy to adhere closely to the behavior policy, thereby stabilizing value learning and mitigating the selection of out-of-distribution (OOD) actions during test time. Conventional approaches apply identical constraints for both value learning and test time inference. However, our findings indicate that the constraints su… ▽ More

    Submitted 15 June, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

  48. arXiv:2306.01997  [pdf, other

    cs.LG

    UADB: Unsupervised Anomaly Detection Booster

    Authors: Hangting Ye, Zhining Liu, Xinyi Shen, Wei Cao, Shun Zheng, Xiaofan Gui, Huishuai Zhang, Yi Chang, Jiang Bian

    Abstract: Unsupervised Anomaly Detection (UAD) is a key data mining problem owing to its wide real-world applications. Due to the complete absence of supervision signals, UAD methods rely on implicit assumptions about anomalous patterns (e.g., scattered/sparsely/densely clustered) to detect anomalies. However, real-world data are complex and vary significantly across different domains. No single assumption… ▽ More

    Submitted 26 December, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: IEEE 39th International Conference on Data Engineering (ICDE 2023)

  49. arXiv:2306.00110  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    MuseCoco: Generating Symbolic Music from Text

    Authors: Peiling Lu, Xin Xu, Chenfei Kang, Botao Yu, Chengyi Xing, Xu Tan, Jiang Bian

    Abstract: Generating music from text descriptions is a user-friendly mode since the text is a relatively easy interface for user engagement. While some approaches utilize texts to control music audio generation, editing musical elements in generated audio is challenging for users. In contrast, symbolic music offers ease of editing, making it more accessible for users to manipulate specific musical elements.… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  50. arXiv:2305.19835  [pdf, ps, other

    cs.CL cs.AI

    Deliberate then Generate: Enhanced Prompting Framework for Text Generation

    Authors: Bei Li, Rui Wang, Junliang Guo, Kaitao Song, Xu Tan, Hany Hassan, Arul Menezes, Tong Xiao, Jiang Bian, JingBo Zhu

    Abstract: Large language models (LLMs) have shown remarkable success across a wide range of natural language generation tasks, where proper prompt designs make great impacts. While existing prompting methods are normally restricted to providing correct information, in this paper, we encourage the model to deliberate by proposing a novel Deliberate then Generate (DTG) prompting framework, which consists of e… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.