Showing 101–150 of 648 results for author: Zhu, D

  1. arXiv:2401.08252  [pdf, other]

    hep-ex

    Observation of $ψ(3686) \to Ω^- K^+ \bar{Ξ}^0$ + c.c.

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (630 additional authors not shown)

    Abstract: Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decay of $ψ(3686) \to Ω^- K^+ \bar{Ξ}^0 +c.c.$ is observed for the first time. The branching fraction of this decay is measured to be $\mathcal{B}_{ψ(3686) \to Ω^- K^+ \bar{Ξ}^0 +c.c.}=(2.78 \pm 0.40 \pm 0.18 ) \times 10^{-6}$, where the first uncertainty is statistical and the second is systemati…

    Submitted 15 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  2. arXiv:2401.08199  [pdf]

    physics.optics physics.comp-ph

    Photonic Modes Prediction via Multi-Modal Diffusion Model

    Authors: **yang Sun, Xi Chen, Xiumei Wang, Dandan Zhu, ** Zhou

    Abstract: The concept of photonic modes is the cornerstone in optics and photonics, which can describe the propagation of the light. The Maxwell's equations play the role in calculating the mode field based on the structure information, while this process needs a great deal of computations, especially in the handle with a three-dimensional model. To overcome this obstacle, we introduce the Multi-Modal Diffu…

    Submitted 22 February, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  3. First observation of the decay $Λ^+_c\to nK^{0}_{S}π^+π^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (630 additional authors not shown)

    Abstract: Based on 4.5 fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated at center-of-mass energies between $4599.53$ MeV and $4698.82$ MeV with the BESIII detector, the decay $Λ_{c}^{+}\to nK_{S}^{0}π^+π^0$ is observed for the first time with a significance of $9.2σ$. The branching fraction is measured to be $(0.85\pm0.13\pm0.03)\%$, where the first uncertainty is statistical and the second systematic,…

    Submitted 28 March, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Journal ref: Phys.Rev.D,109,053005 (2024)

  4. arXiv:2401.05521  [pdf, other]

    cs.RO cs.AI eess.SY

    Current Effect-eliminated Optimal Target Assignment and Motion Planning for a Multi-UUV System

    Authors: Danjie Zhu, Simon X. Yang

    Abstract: The paper presents an innovative approach (CBNNTAP) that addresses the complexities and challenges introduced by ocean currents when optimizing target assignment and motion planning for a multi-unmanned underwater vehicle (UUV) system. The core of the proposed algorithm involves the integration of several key components. Firstly, it incorporates a bio-inspired neural network-based (BINN) approach…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: This paper was accepted by IEEE Transactions on Intelligent Transportation Systems

  5. Search for a massless particle beyond the Standard Model in the $Σ^+\rightarrow p+{\rm invisible}$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

    Abstract: A massless particle beyond the Standard Model is searched for in the two-body decay $Σ^+\rightarrow p+{\rm invisible}$ using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097$ GeV with the BESIII detector at the BEPCII collider. No significant signal is observed, and the upper limit on the branching fraction $B(Σ^+\rightarrow p+{\rm invisible})$…

    Submitted 5 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 11 pages, 5 figures

    Journal ref: Phys. Lett. B 852 (2024) 138614

  6. arXiv:2312.16453  [pdf, other]

    cond-mat.mtrl-sci

    Hard X-ray Generation and Detection of Nanometer-Scale Localized Coherent Acoustic Wave Packets in SrTiO$_3$ and KTaO$_3$

    Authors: Yi**g Huang, Peihao Sun, Samuel W. Teitelbaum, Haoyuan Li, Yanwen Sun, Nan Wang, Sanghoon Song, Takahiro Sato, Matthieu Chollet, Taito Osaka, Ichiro Inoue, Ryan A. Duncan, Hyun D. Shin, Johann Haber, **jian Zhou, Marco Bernardi, Mingqiang Gu, James M. Rondinelli, Mariano Trigo, Makina Yabashi, Alexei A. Maznev, Keith A. Nelson, Diling Zhu, David A. Reis

    Abstract: We demonstrate that the absorption of femtosecond x-ray pulses can excite quasi-spherical high-wavevector coherent acoustic phonon wavepackets using an all x-ray pump and probe scattering experiment. The time- and momentum-resolved diffuse scattering signal is consistent with strain pulses induced by the rapid electron cascade dynamics following photoionization at uncorrelated excitation centers.…

    Submitted 2 January, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

  7. arXiv:2312.16405  [pdf, ps, other]

    hep-ex

    Observation of $χ_{cJ}\to 3(K^+K^-)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (632 additional authors not shown)

    Abstract: By analyzing $(27.12\pm0.14)\times10^8$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decay processes $χ_{cJ} \to 3(K^+K^-)$ ($J=0,1,2$) are observed for the first time with statistical significances of 8.2$σ$, 8.1$σ$, and 12.4$σ$, respectively. The product branching fractions of $ψ(3686)\toγχ_{cJ}$, $χ_{cJ}\to 3(K^+K^-)$ are presented and the branching…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 8 pages, 2 figures

  8. arXiv:2312.13630  [pdf, other]

    cs.CV cs.LG

    MFABA: A More Faithful and Accelerated Boundary-based Attribution Method for Deep Neural Networks

    Authors: Zhiyu Zhu, Huaming Chen, Jiayu Zhang, Xinyi Wang, Zhibo **, Minhui Xue, Dongxiao Zhu, Kim-Kwang Raymond Choo

    Abstract: To better understand the output of deep neural networks (DNN), attribution based methods have been an important approach for model interpretability, which assign a score for each input dimension to indicate its importance towards the model outcome. Notably, the attribution methods use the axioms of sensitivity and implementation invariance to ensure the validity and reliability of attribution resu…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted by The 38th Annual AAAI Conference on Artificial Intelligence (AAAI-24)

  9. Joint Trading and Scheduling among Coupled Carbon-Electricity-Heat-Gas Industrial Clusters

    Authors: Dafeng Zhu, Bo Yang, Yu Wu, Haoran Deng, Zhaoyang Dong, Kai Ma, ** Guan

    Abstract: This paper presents a carbon-energy coupling management framework for an industrial park, where the carbon flow model accompanying multi-energy flows is adopted to track and suppress carbon emissions on the user side. To deal with the quadratic constraint of gas flows, a bound tightening algorithm for constraints relaxation is adopted. The synergies among the carbon capture, energy storage, power-…

    Submitted 20 December, 2023; originally announced December 2023.

    Journal ref: IEEE Transactions on Smart Grid, 2023

  10. arXiv:2312.12743  [pdf, other]

    cs.CV

    PointeNet: A Lightweight Framework for Effective and Efficient Point Cloud Analysis

    Authors: Lipeng Gu, Xuefeng Yan, Liangliang Nan, Dingkun Zhu, Honghua Chen, Weiming Wang, Mingqiang Wei

    Abstract: Current methodologies in point cloud analysis predominantly explore 3D geometries, often achieved through the introduction of intricate learnable geometric extractors in the encoder or by deepening networks with repeated blocks. However, these approaches inevitably lead to a significant number of learnable parameters, resulting in substantial computational costs and imposing memory burdens on CPU/…

    Submitted 19 December, 2023; originally announced December 2023.

  11. arXiv:2312.12107  [pdf, other]

    cs.DC cs.DB

    GraphScope Flex: LEGO-like Graph Computing Stack

    Authors: Tao He, Shuxian Hu, Longbin Lai, Dongze Li, Neng Li, Xue Li, Lexiao Liu, Xiaojian Luo, Binqing Lyu, Ke Meng, Sijie Shen, Li Su, Lei Wang, **gbo Xu, Wenyuan Yu, Weibin Zeng, Lei Zhang, Siyuan Zhang, **gren Zhou, Xiaoli Zhou, Diwen Zhu

    Abstract: Graph computing has become increasingly crucial in processing large-scale graph data, with numerous systems developed for this purpose. Two years ago, we introduced GraphScope as a system addressing a wide array of graph computing needs, including graph traversal, analytics, and learning in one system. Since its inception, GraphScope has achieved significant technological advancements and gained w…

    Submitted 19 December, 2023; originally announced December 2023.

  12. arXiv:2312.09577  [pdf, other]

    cs.DB

    Enhancing Data Lakes with GraphAr: Efficient Graph Data Management with a Specialized Storage Scheme

    Authors: Xue Li, Weibin Zeng, Zhibin Wang, Diwen Zhu, **gbo Xu, Wenyuan Yu, **gren Zhou

    Abstract: Data lakes, increasingly adopted for their ability to store and analyze diverse types of data, commonly use columnar storage formats like Parquet and ORC for handling relational tables. However, these traditional setups fall short when it comes to efficiently managing graph data, particularly those conforming to the Labeled Property Graph (LPG) model. To address this gap, this paper introduces Gra…

    Submitted 21 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: 15 pages, 10 figures

    ACM Class: E.5; E.2; H.2.4; H.2.1

  13. arXiv:2312.08613  [pdf, other]

    cond-mat.mtrl-sci

    Directly observing atomic-scale relaxations of a glass forming liquid using femtosecond X-ray photon correlation spectroscopy

    Authors: Tomoki Fujita, Yanwen Sun, Haoyuan Li, Thies J. Albert, Sanghoon Song, Takahiro Sato, Jens Moesgaard, Antoine Cornet, Peihao Sun, Ying Chen, Mianzhen Mo, Narges Amini, Fan Yang, Arune Makareviciute, Garrett Coleman, Pierre Lucas, Jan Peter Embs, Vincent Esposito, Joan Vila-Comamala, Nan Wang, Talgat Mamyrbayev, Christian David, Jerome Hastings, Beatrice Ruta, Paul Fuoss, et al. (3 additional authors not shown)

    Abstract: Glass forming liquids exhibit structural relaxation behaviors, reflecting underlying atomic rearrangements on a wide range of timescales. These behaviors play a crucial role in determining many material properties. However, the relaxation processes on the atomic scale are not well understood due to the experimental difficulties in directly characterizing the evolving correlations of atomic order i…

    Submitted 8 June, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  14. arXiv:2312.05256  [pdf, other]

    eess.IV cs.AI

    Holistic Evaluation of GPT-4V for Biomedical Imaging

    Authors: Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, **gyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang , et al. (25 additional authors not shown)

    Abstract: In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain. We assess GPT-4V's performance across 16 medical imaging categories, including radiology, oncology, ophthalmology, pathology, and mor…

    Submitted 10 November, 2023; originally announced December 2023.

  15. arXiv:2311.14226  [pdf, other]

    cs.HC cs.CY

    Uncovering Gender Stereotypes in Video Game Character Designs: A Multi-Modal Analysis of Honor of Kings

    Authors: Bingqing Liu, Kyrie Zhixuan Zhou, Danlei Zhu, Jaihyun Park

    Abstract: In this paper, we conduct a comprehensive analysis of gender stereotypes in the character design of Honor of Kings, a popular multiplayer online battle arena (MOBA) game in China. We probe gender stereotypes through the lens of role assignments, visual designs, spoken lines, and background stories, combining qualitative analysis and text mining based on the moral foundation theory. Male heroes are…

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 3rd International Conference on Natural Language Processing for Digital Humanities (NLP4DH)

  16. arXiv:2311.12652  [pdf, other]

    cs.LG math.OC

    FedDRO: Federated Compositional Optimization for Distributionally Robust Learning

    Authors: Prashant Khanduri, Chengyin Li, Rafi Ibn Sultan, Yao Qiang, Joerg Kliewer, Dongxiao Zhu

    Abstract: Recently, compositional optimization (CO) has gained popularity because of its applications in distributionally robust optimization (DRO) and many other machine learning problems. Large-scale and distributed availability of data demands the development of efficient federated learning (FL) algorithms for solving CO problems. Developing FL algorithms for CO is particularly challenging because of the…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 38 Pages, 6 Figures

  17. arXiv:2311.11913  [pdf, other]

    cs.LG q-fin.CP stat.ML

    Deep Calibration of Market Simulations using Neural Density Estimators and Embedding Networks

    Authors: Namid R. Stillman, Rory Baggott, Justin Lyon, Jianfei Zhang, Dingqiu Zhu, Tao Chen, Perukrishnen Vytelingum

    Abstract: The ability to construct a realistic simulator of financial exchanges, including reproducing the dynamics of the limit order book, can give insight into many counterfactual scenarios, such as a flash crash, a margin call, or changes in macroeconomic outlook. In recent years, agent-based models have been developed that reproduce many features of an exchange, as summarised by a set of stylised facts…

    Submitted 27 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 4th ACM International Conference on AI in Finance (ICAIF 2023)

  18. arXiv:2311.11319  [pdf, other]

    cs.CV cs.AI

    GeoSAM: Fine-tuning SAM with Sparse and Dense Visual Prompting for Automated Segmentation of Mobility Infrastructure

    Authors: Rafi Ibn Sultan, Chengyin Li, Hui Zhu, Prashant Khanduri, Marco Brocanelli, Dongxiao Zhu

    Abstract: The Segment Anything Model (SAM) has shown impressive performance when applied to natural image segmentation. However, it struggles with geographical images like aerial and satellite imagery, especially when segmenting mobility infrastructure including roads, sidewalks, and crosswalks. This inferior performance stems from the narrow features of these objects, their textures blending into the surro…

    Submitted 30 January, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  19. arXiv:2311.10288   

    physics.app-ph

    Current manipulation of Giant tunneling altermagnetic resistance in collinear Antiferromagnetic RuO2/MgO/RuO2 sandwich structure

    Authors: Shijie Xu, Yan Huang, Farzad Mahfouzi, Zhizhong Zhang, Houyi Cheng, Bingqian Dai, **woong Kim, Wenlong Cai, Kewen Shi, Daoqian Zhu, Zongxia Guo, Caihua Cao, Kun Zhang, Albert Fert, Yue Zhang, Kang L. Wang, Nicholas Kioussis, Weisheng Zhao

    Abstract: As an emerging non-volatile memory technology, magnetic random access memory (MRAM) has key features and advantages including non-volatility, high speed, endurance, low power consumption and radiation tolerance. Conventional MRAM utilizes magnetic tunnel junctions (MTJs), which consist of two ferromagnetic layers separated by an insulating tunnel barrier. The orientation of the magnetic layers rep…

    Submitted 24 November, 2023; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Modification required

  20. arXiv:2311.09948  [pdf, other]

    cs.LG cs.CL cs.CR

    Hijacking Large Language Models via Adversarial In-Context Learning

    Authors: Yao Qiang, Xiangyu Zhou, Dongxiao Zhu

    Abstract: In-context learning (ICL) has emerged as a powerful paradigm leveraging LLMs for specific downstream tasks by utilizing labeled examples as demonstrations (demos) in the precondition prompts. Despite its promising performance, ICL suffers from instability with the choice and arrangement of examples. Additionally, crafted adversarial attacks pose a notable threat to the robustness of ICL. However,…

    Submitted 15 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  21. arXiv:2311.08827  [pdf, other]

    math.OC

    A Deep Reinforcement Learning Approach to Efficient Distributed Optimization

    Authors: Daokuan Zhu, Tianqi Xu, Jie Lu

    Abstract: In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a given specific problem. In this paper, we propose a learning-based method to achieve efficient distributed optimization over networked systems. Specifically, a…

    Submitted 3 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  22. arXiv:2311.07377  [pdf, other]

    cs.SE cs.AI cs.DC cs.RO

    Testing learning-enabled cyber-physical systems with Large-Language Models: A Formal Approach

    Authors: Xi Zheng, Aloysius K. Mok, Ruzica Piskac, Yong Jae Lee, Bhaskar Krishnamachari, Dakai Zhu, Oleg Sokolsky, Insup Lee

    Abstract: The integration of machine learning (ML) into cyber-physical systems (CPS) offers significant benefits, including enhanced efficiency, predictive capabilities, real-time responsiveness, and the enabling of autonomous operations. This convergence has accelerated the development and deployment of a range of real-world applications, such as autonomous vehicles, delivery drones, service robots, and te…

    Submitted 16 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  23. Discordance Minimization-based Imputation Algorithms for Missing Values in Rating Data

    Authors: Young Woong Park, **hak Kim, Dan Zhu

    Abstract: Ratings are frequently used to evaluate and compare subjects in various applications, from education to healthcare, because ratings provide succinct yet credible measures for comparing subjects. However, when multiple rating lists are combined or considered together, subjects often have missing ratings, because most rating lists do not rate every subject in the combined list. In this study, we pro…

    Submitted 7 November, 2023; originally announced November 2023.

  24. arXiv:2311.02458  [pdf]

    physics.app-ph

    Spin-flop magnetoresistance in a collinear antiferromagnetic tunnel junction

    Authors: Shijie Xu, Zhizhong Zhang, Farzad Mahfouzi, Yan Huang, Houyi Cheng, Bingqian Dai, Wenlong Cai, Kewen Shi, Daoqian Zhu, Zongxia Guo, Caihua Cao, Yongshan Liu, Albert Fert, Nicholas Kioussis, Kang L. Wang, Yue Zhang, Weisheng Zhao

    Abstract: Collinear antiferromagnetic (AFM) materials have unique promise of no stray fields, display ultrafast dynamics, and being robust against perturbation field which motivates the extensive research of antiferromagnetic spintronics. However, the manipulation and detection of antiferromagnetic order remain formidable challenges. Here, we report the electrical detection of collinear antiferromagnetism in…

    Submitted 4 November, 2023; originally announced November 2023.

  25. arXiv:2310.16155  [pdf, other]

    quant-ph

    Coherent control of a superconducting qubit using light

    Authors: Hana K. Warner, Jeffrey Holzgrafe, Beatriz Yankelevich, David Barton, Stefano Poletto, C. J. Xin, Neil Sinclair, Di Zhu, Eyob Sete, Brandon Langley, Emma Batson, Marco Colangelo, Amirhassan Shams-Ansari, Graham Joe, Karl K. Berggren, Liang Jiang, Matthew Reagor, Marko Loncar

    Abstract: Quantum science and technology promise the realization of a powerful computational resource that relies on a network of quantum processors connected with low loss and low noise communication channels capable of distributing entangled states [1,2]. While superconducting microwave qubits (3-8 GHz) operating in cryogenic environments have emerged as promising candidates for quantum processor nodes du…

    Submitted 30 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

  26. arXiv:2310.14211  [pdf, other]

    cs.LG cs.AI cs.CL cs.CR cs.SE

    LUNA: A Model-Based Universal Analysis Framework for Large Language Models

    Authors: Da Song, Xuan Xie, Jiayang Song, Derui Zhu, Yuheng Huang, Felix Juefei-Xu, Lei Ma

    Abstract: Over the past decade, Artificial Intelligence (AI) has had great success recently and is being used in a wide range of academic and industrial fields. More recently, LLMs have made rapid advancements that have propelled AI to a new level, enabling even more diverse applications and industrial domains with intelligence, particularly in areas like software engineering and natural language processing…

    Submitted 13 June, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: 34 pages, 13 figures, To appear in Transactions on Software Engineering (Journal First)

  27. arXiv:2310.09547  [pdf, other]

    math.OC

    A Distributed Buffering Drift-Plus-Penalty Algorithm for Coupling Constrained Optimization

    Authors: Dandan Wang, Daokuan Zhu, Zichong Ou, Jie Lu

    Abstract: This paper focuses on distributed constrained optimization over time-varying directed networks, where all agents cooperate to optimize the sum of their locally accessible objective functions subject to a coupled inequality constraint consisting of all their local constraint functions. To address this problem, we develop a buffering drift-plus-penalty algorithm, referred to as B-DPP. The proposed B…

    Submitted 14 October, 2023; originally announced October 2023.

  28. arXiv:2310.09478  [pdf, other]

    cs.CV

    MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning

    Authors: Jun Chen, Deyao Zhu, Xiaoqian Shen, Xiang Li, Zechun Liu, Pengchuan Zhang, Raghuraman Krishnamoorthi, Vikas Chandra, Yunyang Xiong, Mohamed Elhoseiny

    Abstract: Large language models have shown their remarkable capabilities as a general interface for various language-related applications. Motivated by this, we target to build a unified interface for completing many vision-language tasks including image description, visual question answering, and visual grounding, among others. The challenge is to use a single model for performing diverse vision-language t…

    Submitted 7 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

  29. arXiv:2310.06362  [pdf, other]

    cs.CL

    InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspective

    Authors: Yifan Song, Peiyi Wang, Weimin Xiong, Dawei Zhu, Tianyu Liu, Zhifang Sui, Sujian Li

    Abstract: Continual learning (CL) aims to constantly learn new knowledge over time while avoiding catastrophic forgetting on old tasks. We focus on continual text classification under the class-incremental setting. Recent CL studies have identified the severe performance decrease on analogous classes as a key factor for catastrophic forgetting. In this paper, through an in-depth exploration of the represent…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023. An improved version of arXiv:2305.07289

  30. arXiv:2310.06162  [pdf]

    eess.IV

    Empirical Evaluation of the Segment Anything Model (SAM) for Brain Tumor Segmentation

    Authors: Mohammad Peivandi, Jason Zhang, Michael Lu, Dongxiao Zhu, Zhifeng Kou

    Abstract: Brain tumor segmentation presents a formidable challenge in the field of Medical Image Segmentation. While deep-learning models have been useful, human expert segmentation remains the most accurate method. The recently released Segment Anything Model (SAM) has opened up the opportunity to apply foundation models to this difficult task. However, SAM was primarily trained on diverse natural images.…

    Submitted 9 October, 2023; originally announced October 2023.

  31. arXiv:2310.05242  [pdf, other]

    cs.CL cs.AI

    ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

    Authors: Tianyang Zhong, Wei Zhao, Yutong Zhang, Yi Pan, Peixin Dong, Zuowei Jiang, Xiaoyan Kui, Youlan Shang, Li Yang, Yaonai Wei, Longtao Yang, Hao Chen, Huan Zhao, Yuxiao Liu, Ning Zhu, Yiwei Li, Yisong Wang, Jiaqi Yao, Jiaqi Wang, Ying Zeng, Lei He, Chao Zheng, Zhixue Zhang, Ming Li, Zhengliang Liu, et al. (17 additional authors not shown)

    Abstract: Radiology report generation, as a key step in medical image analysis, is critical to the quantitative analysis of clinically informed decision-making levels. However, complex and diverse radiology reports with cross-source heterogeneity pose a huge generalizability challenge to the current methods under massive data volume, mainly because the style and normativity of radiology reports are obviousl…

    Submitted 9 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  32. arXiv:2310.03234  [pdf, other]

    math.OC cs.AI cs.LG stat.ML

    Non-Smooth Weakly-Convex Finite-sum Coupled Compositional Optimization

    Authors: Quanqi Hu, Dixian Zhu, Tianbao Yang

    Abstract: This paper investigates new families of compositional optimization problems, called non-smooth weakly-convex finite-sum coupled compositional optimization (NSWC FCCO). There has been a growing interest in FCCO due to its wide-ranging applications in machin…

    Submitted 3 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  33. arXiv:2309.16289  [pdf, other]

    cs.CL cs.AI cs.LG

    LawBench: Benchmarking Legal Knowledge of Large Language Models

    Authors: Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Songyang Zhang, Kai Chen, Zongwen Shen, Jidong Ge

    Abstract: Large language models (LLMs) have demonstrated strong capabilities in various aspects. However, when applying them to the highly specialized, safe-critical legal domain, it is unclear how much legal knowledge they possess and whether they can reliably perform legal-related tasks. To address this gap, we propose a comprehensive evaluation benchmark LawBench. LawBench has been meticulously crafted t…

    Submitted 28 September, 2023; originally announced September 2023.

  34. arXiv:2309.10625  [pdf, other]

    cs.AI cs.CV

    NoisyNN: Exploring the Influence of Information Entropy Change in Learning Systems

    Authors: Xiaowei Yu, Zhe Huang, Yao Xue, Lu Zhang, Li Wang, Tianming Liu, Dajiang Zhu

    Abstract: We explore the impact of entropy change in deep learning systems via noise injection at different levels, i.e., the latent space and input image. The series of models that employ our methodology are collectively known as Noisy Neural Networks (NoisyNN), with examples such as NoisyViT and NoisyCNN. Noise is conventionally viewed as a harmful perturbation in various deep learning architectures, such…

    Submitted 2 February, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Information Entropy, NoisyNN, ViT, CNN

  35. arXiv:2309.10400  [pdf, other]

    cs.CL cs.LG

    PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

    Authors: Dawei Zhu, Nan Yang, Liang Wang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li

    Abstract: Large Language Models (LLMs) are trained with a pre-defined context length, restricting their use in scenarios requiring long inputs. Previous efforts for adapting LLMs to a longer length usually requires fine-tuning with this target length (Full-length fine-tuning), suffering intensive training cost. To decouple train length from target length for efficient context window extension, we propose Po…

    Submitted 21 February, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: ICLR 2024

  36. arXiv:2309.10238  [pdf, other]

    cs.CL

    PolicyGPT: Automated Analysis of Privacy Policies with Large Language Models

    Authors: Chenhao Tang, Zhengliang Liu, Chong Ma, Zihao Wu, Yiwei Li, Wei Liu, Dajiang Zhu, Quanzheng Li, Xiang Li, Tianming Liu, Lei Fan

    Abstract: Privacy policies serve as the primary conduit through which online service providers inform users about their data collection and usage procedures. However, in a bid to be comprehensive and mitigate legal risks, these policy documents are often quite verbose. In practical use, users tend to click the Agree button directly rather than reading them carefully. This practice exposes users to risks of…

    Submitted 18 September, 2023; originally announced September 2023.

  37. arXiv:2309.10160  [pdf, other]

    physics.med-ph cs.AI

    RadOnc-GPT: A Large Language Model for Radiation Oncology

    Authors: Zhengliang Liu, Peilong Wang, Yiwei Li, Jason Holmes, Peng Shu, Lian Zhang, Chenbin Liu, Ninghao Liu, Dajiang Zhu, Xiang Li, Quanzheng Li, Samir H. Patel, Terence T. Sio, Tianming Liu, Wei Liu

    Abstract: This paper presents RadOnc-GPT, a large language model specialized for radiation oncology through advanced tuning methods. RadOnc-GPT was finetuned on a large dataset of radiation oncology patient records from the Mayo Clinic in Arizona. The model employs instruction tuning on three key tasks - generating radiotherapy treatment regimens, determining optimal radiation modalities, and providing diag…

    Submitted 5 November, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  38. arXiv:2309.08035  [pdf, other]

    cs.CV

    Interpretability-Aware Vision Transformer

    Authors: Yao Qiang, Chengyin Li, Prashant Khanduri, Dongxiao Zhu

    Abstract: Vision Transformers (ViTs) have become prominent models for solving various vision tasks. However, the interpretability of ViTs has not kept pace with their promising performance. While there has been a surge of interest in developing post hoc solutions to explain ViTs' outputs, these methods do not generalize to different downstream tasks and various transformer architectures. Furthermore,…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 10 pages, 4 figures, 5 tables

  39. arXiv:2309.06419  [pdf, other]

    cs.CL

    Radiology-Llama2: Best-in-Class Large Language Model for Radiology

    Authors: Zhengliang Liu, Yiwei Li, Peng Shu, Aoxiao Zhong, Longtao Yang, Chao Ju, Zihao Wu, Chong Ma, Jie Luo, Cheng Chen, Sekeun Kim, Jiang Hu, Haixing Dai, Lin Zhao, Dajiang Zhu, Jun Liu, Wei Liu, Dinggang Shen, Tianming Liu, Quanzheng Li, Xiang Li

    Abstract: This paper introduces Radiology-Llama2, a large language model specialized for radiology through a process known as instruction tuning. Radiology-Llama2 is based on the Llama2 architecture and further trained on a large dataset of radiology reports to generate coherent and clinically useful impressions from radiological findings. Quantitative evaluations using ROUGE metrics on the MIMIC-CXR and Op…

    Submitted 29 August, 2023; originally announced September 2023.

  40. arXiv:2309.05088  [pdf]

    cs.CY q-bio.OT

    Towards Trustworthy Artificial Intelligence for Equitable Global Health

    Authors: Hong Qin, Jude Kong, Wandi Ding, Ramneek Ahluwalia, Christo El Morr, Zeynep Engin, Jake Okechukwu Effoduh, Rebecca Hwa, Serena Jingchuan Guo, Laleh Seyyed-Kalantari, Sylvia Kiwuwa Muyingo, Candace Makeda Moore, Ravi Parikh, Reva Schwartz, Dongxiao Zhu, Xiaoqian Wang, Yiye Zhang

    Abstract: Artificial intelligence (AI) can potentially transform global health, but algorithmic bias can exacerbate social inequities and disparity. Trustworthy AI entails the intentional design to ensure equity and mitigate potential biases. To advance trustworthy AI in global health, we convened a workshop on Fairness in Machine Intelligence for Global Health (FairMI4GH). The event brought together a glob…

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: 7 pages

  41. arXiv:2309.04482  [pdf]

    cond-mat.mtrl-sci cs.LG

    Addressing the Accuracy-Cost Tradeoff in Material Property Prediction: A Teacher-Student Strategy

    Authors: Dong Zhu, Zhikuang xin, Siming Zheng, Yangang Wang, Xiaoyu Yang

    Abstract: Deep learning has revolutionized the process of new material discovery, with state-of-the-art models now able to predict material properties based solely on chemical compositions, thus eliminating the necessity for material structures. However, this cost-effective method has led to a trade-off in model accuracy. Specifically, the accuracy of Chemical Composition-based Property Prediction Models (C…

    Submitted 22 August, 2023; originally announced September 2023.

  42. arXiv:2309.04104  [pdf]

    physics.optics physics.app-ph

    Design of multifunctional color routers with Kerker switching using generative adversarial networks

    Authors: Jiahao Yan, Dayu Zhu, Yanjun Bao, Qin Chen, Baojun Li, Wenshan Cai

    Abstract: To achieve optoelectronic devices with high resolution and efficiency, there is a pressing need for optical structural units that possess an ultrasmall footprint yet exhibit strong controllability in both the frequency and spatial domains. For dielectric nanoparticles, the overlap of electric and magnetic dipole moments can scatter light completely forward or backward, which is called Kerker theor…

    Submitted 7 September, 2023; originally announced September 2023.

  43. arXiv:2309.02590  [pdf, other]

    physics.med-ph

    Artificial General Intelligence for Radiation Oncology

    Authors: Chenbin Liu, Zhengliang Liu, Jason Holmes, Lu Zhang, Lian Zhang, Yuzhen Ding, Peng Shu, Zihao Wu, Haixing Dai, Yiwei Li, Dinggang Shen, Ninghao Liu, Quanzheng Li, Xiang Li, Dajiang Zhu, Tianming Liu, Wei Liu

    Abstract: The emergence of artificial general intelligence (AGI) is transforming radiation oncology. As prominent vanguards of AGI, large language models (LLMs) such as GPT-4 and PaLM 2 can process extensive texts and large vision models (LVMs) such as the Segment Anything Model (SAM) can process extensive imaging data to enhance the efficiency and precision of radiation therapy. This paper explores full-sp…

    Submitted 5 September, 2023; originally announced September 2023.

  44. arXiv:2309.01509  [pdf, other]

    math.OC

    Distributed Online Optimization with Coupled Inequality Constraints over Unbalanced Directed Networks

    Authors: Dandan Wang, Daokuan Zhu, Kin Cheong Sou, Jie Lu

    Abstract: This paper studies a distributed online convex optimization problem, where agents in an unbalanced network cooperatively minimize the sum of their time-varying local cost functions subject to a coupled inequality constraint. To solve this problem, we propose a distributed dual subgradient tracking algorithm, called DUST, which attempts to optimize a dual objective by means of tracking the primal c…

    Submitted 4 September, 2023; originally announced September 2023.

  45. arXiv:2308.16362  [pdf, ps, other]

    math.OC cs.LG

    A Unified Analysis for the Subgradient Methods Minimizing Composite Nonconvex, Nonsmooth and Non-Lipschitz Functions

    Authors: Daoli Zhu, Lei Zhao, Shuzhong Zhang

    Abstract: In this paper we propose a proximal subgradient method (Prox-SubGrad) for solving nonconvex and nonsmooth optimization problems without assuming Lipschitz continuity conditions. A number of subgradient upper bounds and their relationships are presented. By means of these upper bounding conditions, we establish some uniform recursive relations for the Moreau envelopes for weakly convex optimization…

    Submitted 30 August, 2023; originally announced August 2023.

  46. arXiv:2308.14936  [pdf, other]

    cs.CV cs.AI

    AutoProSAM: Automated Prompting SAM for 3D Multi-Organ Segmentation

    Authors: Chengyin Li, Prashant Khanduri, Yao Qiang, Rafi Ibn Sultan, Indrin Chetty, Dongxiao Zhu

    Abstract: Segment Anything Model (SAM) is one of the pioneering prompt-based foundation models for image segmentation and has been rapidly adopted for various medical imaging applications. However, in clinical settings, creating effective prompts is notably challenging and time-consuming, requiring the expertise of domain specialists such as physicians. This requirement significantly diminishes SAM's primar…

    Submitted 26 June, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

  47. arXiv:2308.13515  [pdf, other]

    q-bio.NC

    Robust Core-Periphery Constrained Transformer for Domain Adaptation

    Authors: Xiaowei Yu, Dajiang Zhu, Tianming Liu

    Abstract: Unsupervised domain adaptation (UDA) aims to learn transferable representation across domains. Recently a few UDA works have successfully applied Transformer-based methods and achieved state-of-the-art (SOTA) results. However, it remains challenging when there exists a large domain gap between the source and target domain. Inspired by humans' exceptional transferability abilities to adapt knowledg…

    Submitted 10 February, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Core-Periphery, ViT, Unsupervised domain adaptation

  48. arXiv:2308.08449  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Improving CTC-AED model with integrated-CTC and auxiliary loss regularization

    Authors: Daobin Zhu, Xiangdong Su, Hongbin Zhang

    Abstract: Connectionist temporal classification (CTC) and attention-based encoder decoder (AED) joint training has been widely applied in automatic speech recognition (ASR). Unlike most hybrid models that separately calculate the CTC and AED losses, our proposed integrated-CTC utilizes the attention mechanism of AED to guide the output of CTC. In this paper, we employ two fusion methods, namely direct addit…

    Submitted 14 August, 2023; originally announced August 2023.

  49. arXiv:2308.05511  [pdf, ps, other]

    quant-ph

    Fast quantum state transfer and entanglement preparation in strongly coupled bosonic systems

    Authors: Yilun Xu, Daoquan Zhu, Feng-Xiao Sun, Qiongyi He, Wei Zhang

    Abstract: Continuous U(1) gauge symmetry, which guarantees the conservation of the total excitations in linear bosonic systems, will be broken when it comes to the strong-coupling regime where the rotating wave approximation (RWA) fails. Here we develop analytic solutions for multi-mode bosonic systems with XX-type couplings beyond RWA, and propose a novel scheme to implement high-fidelity quantum state tr…

    Submitted 29 October, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: 30 pages, 8 figures

  50. arXiv:2308.05486  [pdf, ps, other]

    econ.EM

    Money Growth and Inflation: A Quantile Sensitivity Approach

    Authors: Matteo Iacopini, Aubrey Poon, Luca Rossini, Dan Zhu

    Abstract: An innovative method is proposed to construct a quantile dependence system for inflation and money growth. By considering all quantiles and leveraging a novel notion of quantile sensitivity, the method allows the assessment of changes in the entire distribution of a variable of interest in response to a perturbation in another variable's quantile. The construction of this relationship is demonstra…

    Submitted 17 November, 2023; v1 submitted 10 August, 2023; originally announced August 2023.