-
Observation of the Antimatter Hypernucleus $^4_{\barΛ}\overline{\hbox{H}}$
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (342 additional authors not shown)
Abstract:
At the origin of the Universe, asymmetry between the amount of created matter and antimatter led to the matter-dominated Universe as we know it today. The origins of this asymmetry are not yet completely understood. High-energy nuclear collisions create conditions similar to the Universe microseconds after the Big Bang, with comparable amounts of matter and antimatter. Much of the created antimatter escapes the rapidly expanding fireball without annihilating, making such collisions an effective experimental tool to create heavy antimatter nuclear objects and study their properties, hoping to shed some light on existing questions on the asymmetry between matter and antimatter. Here we report the first observation of the antimatter hypernucleus \hbox{$^4_{\barΛ}\overline{\hbox{H}}$}, composed of a $\barΛ$, an antiproton and two antineutrons. The discovery was made through its two-body decay after production in ultrarelativistic heavy-ion collisions by the STAR experiment at the Relativistic Heavy Ion Collider. In total, 15.6 candidate \hbox{$^4_{\barΛ}\overline{\hbox{H}}$} antimatter hypernuclei are obtained with an estimated background count of 6.4. The lifetimes of the antihypernuclei \hbox{$^3_{\barΛ}\overline{\hbox{H}}$} and \hbox{$^4_{\barΛ}\overline{\hbox{H}}$} are measured and compared with the lifetimes of their corresponding hypernuclei, testing the symmetry between matter and antimatter. Various production yield ratios among (anti)hypernuclei and (anti)nuclei are also measured and compared with theoretical model predictions, shedding light on their production mechanisms.
Submitted 8 June, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
3D Structure-guided Network for Tooth Alignment in 2D Photograph
Authors:
Yulong Dou,
Lanzhuju Mei,
Dinggang Shen,
Zhiming Cui
Abstract:
Orthodontics focuses on rectifying misaligned teeth (i.e., malocclusions), affecting both masticatory function and aesthetics. However, orthodontic treatment often involves complex, lengthy procedures. As such, generating a 2D photograph depicting aligned teeth prior to orthodontic treatment is crucial for effective dentist-patient communication and, more importantly, for encouraging patients to accept orthodontic intervention. In this paper, we propose a 3D structure-guided tooth alignment network that takes 2D photographs as input (e.g., photos captured by smartphones) and aligns the teeth within the 2D image space to generate an orthodontic comparison photograph featuring aesthetically pleasing, aligned teeth. Notably, while the process operates within a 2D image space, our method employs 3D intra-oral scanning models collected in clinics to learn about orthodontic treatment, i.e., projecting the pre- and post-orthodontic 3D tooth structures onto 2D tooth contours, followed by a diffusion model to learn the mapping relationship. Ultimately, the aligned tooth contours are leveraged to guide the generation of a 2D photograph with aesthetically pleasing, aligned teeth and realistic textures. We evaluate our network on various facial photographs, demonstrating its exceptional performance and strong applicability within the orthodontic industry.
Submitted 17 October, 2023;
originally announced October 2023.
-
Path Following Control of Automated Vehicle Considering Uncertainties and Disturbances with Parametric Varying
Authors:
Dan Shen
Abstract:
Automated Vehicle Path Following Control (PFC) is an advanced control system that can regulate the vehicle into a collision-free region in the presence of other objects on the road. Common collision avoidance functions, such as forward collision warning and automatic emergency braking, have recently been developed and equipped on production vehicles. However, it is impossible to develop a perfectly precise vehicle model when the vehicle is driving. Most PFCs did not consider uncertainties in the vehicle model, external disturbances, and parameter variations at the same time. To address the issues associated with this important feature and function in autonomous driving, a new vehicle PFC is proposed using a robust model predictive control (MPC) design technique based on matrix inequality and the theoretical approach of the hybrid $\&$ switched system. The proposed methodology requires a combination of continuous and discrete states, e.g. regulating the continuous states of the AV (e.g., velocity and yaw angle) and discrete switching of the control strategy that affects the dynamic behaviors of the AV under different driving speeds. Firstly, considering bounded model uncertainties, and norm-bounded external disturbances, the system states and control matrices are modified.
Submitted 19 October, 2023; v1 submitted 16 October, 2023;
originally announced October 2023.
-
Global stability of Minkowski spacetime with minimal decay
Authors:
Dawei Shen
Abstract:
The global stability of Minkowski spacetime, a milestone in the field, was proven in the celebrated work of Christodoulou and Klainerman \cite{Ch-Kl} in 1993. In 2007, Bieri \cite{Bieri} extended the result of \cite{Ch-Kl} under lower decay and regularity assumptions on the initial data. In this paper, we extend the result of \cite{Bieri} to minimal decay assumptions. Also, concerning the treatment of curvature estimates, we replace the vectorfield method used in \cite{Ch-Kl,Bieri} by the $r^p$-weighted estimates of Dafermos and Rodnianski \cite{Da-Ro}.
Submitted 11 October, 2023;
originally announced October 2023.
-
ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data
Authors:
Tianyang Zhong,
Wei Zhao,
Yutong Zhang,
Yi Pan,
Peixin Dong,
Zuowei Jiang,
Xiaoyan Kui,
Youlan Shang,
Li Yang,
Yaonai Wei,
Longtao Yang,
Hao Chen,
Huan Zhao,
Yuxiao Liu,
Ning Zhu,
Yiwei Li,
Yisong Wang,
Jiaqi Yao,
Jiaqi Wang,
Ying Zeng,
Lei He,
Chao Zheng,
Zhixue Zhang,
Ming Li,
Zhengliang Liu
, et al. (17 additional authors not shown)
Abstract:
Radiology report generation, as a key step in medical image analysis, is critical to the quantitative analysis of clinically informed decision-making levels. However, complex and diverse radiology reports with cross-source heterogeneity pose a huge generalizability challenge to the current methods under massive data volume, mainly because the style and normativity of radiology reports are obviously distinctive among institutions, body regions inspected and radiologists. Recently, the advent of large language models (LLMs) offers great potential for recognizing signs of health conditions. To resolve the above problem, we collaborate with the Second Xiangya Hospital in China and propose ChatRadio-Valuer based on the LLM, a tailored model for automatic radiology report generation that learns generalizable representations and provides a basis pattern for model adaptation in sophisticated analysts' cases. Specifically, ChatRadio-Valuer is trained based on the radiology reports from a single institution by means of supervised fine-tuning, and then adapted to disease diagnosis tasks for human multi-system evaluation (i.e., chest, abdomen, muscle-skeleton, head, and maxillofacial $\&$ neck) from six different institutions in clinical-level events. The clinical dataset utilized in this study encompasses a remarkable total of \textbf{332,673} observations. From the comprehensive results on engineering indicators, clinical efficacy and deployment cost metrics, it can be shown that ChatRadio-Valuer consistently outperforms state-of-the-art models, especially ChatGPT (GPT-3.5-Turbo) and GPT-4, among others, in terms of disease diagnosis from radiology reports. ChatRadio-Valuer provides an effective avenue to boost model generalization performance and alleviate the annotation workload of experts to enable the promotion of clinical AI applications in radiology reports.
Submitted 9 October, 2023; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Results on Elastic Cross Sections in Proton-Proton Collisions at $\sqrt{s} = 510$ GeV with the STAR Detector at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (343 additional authors not shown)
Abstract:
We report results on an elastic cross section measurement in proton-proton collisions at a center-of-mass energy $\sqrt{s}=510$ GeV, obtained with the Roman Pot setup of the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The elastic differential cross section is measured in the four-momentum transfer squared range $0.23 \leq -t \leq 0.67$ GeV$^2$. We find that a constant slope $B$ does not fit the data in the aforementioned $t$ range, and we obtain a much better fit using a second-order polynomial for $B(t)$. The $t$ dependence of $B$ is determined using six subintervals of $t$ in the STAR measured $t$ range, and is in good agreement with the phenomenological models. The measured elastic differential cross section $\mathrm{d}σ/\mathrm{d}t$ agrees well with the results obtained at $\sqrt{s} = 546$ GeV for proton-antiproton collisions by the UA4 experiment. We also determine that the integrated elastic cross section within the STAR $t$-range is $σ^\mathrm{fid}_\mathrm{el} = 462.1 \pm 0.9 (\mathrm{stat.}) \pm 1.1 (\mathrm{syst.}) \pm 11.6 (\mathrm{scale})$~$μ\mathrm{b}$.
Submitted 6 May, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Algebraic and Statistical Properties of the Ordinary Least Squares Interpolator
Authors:
Dennis Shen,
Dogyoon Song,
Peng Ding,
Jasjeet S. Sekhon
Abstract:
Deep learning research has uncovered the phenomenon of benign overfitting for overparameterized statistical models, which has drawn significant theoretical interest in recent years. Given its simplicity and practicality, the ordinary least squares (OLS) interpolator has become essential to gain foundational insights into this phenomenon. While properties of OLS are well established in classical, underparameterized settings, its behavior in high-dimensional, overparameterized regimes is less explored (unlike for ridge or lasso regression) though significant progress has been made of late. We contribute to this growing literature by providing fundamental algebraic and statistical results for the minimum $\ell_2$-norm OLS interpolator. In particular, we provide algebraic equivalents of (i) the leave-$k$-out residual formula, (ii) Cochran's formula, and (iii) the Frisch-Waugh-Lovell theorem in the overparameterized regime. These results aid in understanding the OLS interpolator's ability to generalize and have substantive implications for causal inference. Under the Gauss-Markov model, we present statistical results such as an extension of the Gauss-Markov theorem and an analysis of variance estimation under homoskedastic errors for the overparameterized regime. To substantiate our theoretical contributions, we conduct simulations that further explore the stochastic properties of the OLS interpolator.
Submitted 30 May, 2024; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Longitudinal and transverse spin transfer to $Λ$ and $\overlineΛ$ hyperons in polarized $p$+$p$ collisions at $\sqrt{s} = 200$ GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (357 additional authors not shown)
Abstract:
The longitudinal and transverse spin transfers to $Λ$ ($\overlineΛ$) hyperons in polarized proton-proton collisions are expected to be sensitive to the helicity and transversity distributions, respectively, of (anti-)strange quarks in the proton, and to the corresponding polarized fragmentation functions. We report improved measurements of the longitudinal spin transfer coefficient, $D_{LL}$, and the transverse spin transfer coefficient, $D_{TT}$, to $Λ$ and $\overlineΛ$ in polarized proton-proton collisions at $\sqrt{s}$ = 200 GeV by the STAR experiment at RHIC. The data set includes longitudinally polarized proton-proton collisions with an integrated luminosity of 52 pb$^{-1}$, and transversely polarized proton-proton collisions with a similar integrated luminosity. Both data sets have about twice the statistics of previous results and cover a kinematic range of $|η_{Λ(\overlineΛ)}|$ $<$ 1.2 and transverse momentum $p_{T,{Λ(\overlineΛ)}}$ up to 8 GeV/$c$. We also report the first measurements of the hyperon spin transfer coefficients $D_{LL}$ and $D_{TT}$ as a function of the fractional jet momentum $z$ carried by the hyperon, which can provide more direct constraints on the polarized fragmentation functions.
Submitted 7 December, 2023; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Reaction plane correlated triangular flow in Au+Au collisions at $\sqrt{s_{NN}}=3$ GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (341 additional authors not shown)
Abstract:
We measure triangular flow relative to the reaction plane at 3 GeV center-of-mass energy in Au+Au collisions at the BNL Relativistic Heavy Ion Collider. A significant $v_3$ signal for protons is observed, which increases for higher rapidity, higher transverse momentum, and more peripheral collisions. The triangular flow is essentially rapidity-odd with a slope at mid-rapidity, $dv_3/dy|_{(y=0)}$, opposite in sign compared to the slope for directed flow. No significant $v_3$ signal is observed for charged pions and kaons. Comparisons with models suggest that a mean field potential is required to describe these results, and that the triangular shape of the participant nucleons is the result of stopping and nuclear geometry.
Submitted 19 April, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Radiology-Llama2: Best-in-Class Large Language Model for Radiology
Authors:
Zhengliang Liu,
Yiwei Li,
Peng Shu,
Aoxiao Zhong,
Longtao Yang,
Chao Ju,
Zihao Wu,
Chong Ma,
Jie Luo,
Cheng Chen,
Sekeun Kim,
Jiang Hu,
Haixing Dai,
Lin Zhao,
Dajiang Zhu,
Jun Liu,
Wei Liu,
Dinggang Shen,
Tianming Liu,
Quanzheng Li,
Xiang Li
Abstract:
This paper introduces Radiology-Llama2, a large language model specialized for radiology through a process known as instruction tuning. Radiology-Llama2 is based on the Llama2 architecture and further trained on a large dataset of radiology reports to generate coherent and clinically useful impressions from radiological findings. Quantitative evaluations using ROUGE metrics on the MIMIC-CXR and OpenI datasets demonstrate that Radiology-Llama2 achieves state-of-the-art performance compared to other generative language models, with a Rouge-1 score of 0.4834 on MIMIC-CXR and 0.4185 on OpenI. Additional assessments by radiology experts highlight the model's strengths in understandability, coherence, relevance, conciseness, and clinical utility. The work illustrates the potential of localized language models designed and tuned for specialized domains like radiology. When properly evaluated and deployed, such models can transform fields like radiology by automating rote tasks and enhancing human expertise.
Submitted 29 August, 2023;
originally announced September 2023.
-
Artificial General Intelligence for Radiation Oncology
Authors:
Chenbin Liu,
Zhengliang Liu,
Jason Holmes,
Lu Zhang,
Lian Zhang,
Yuzhen Ding,
Peng Shu,
Zihao Wu,
Haixing Dai,
Yiwei Li,
Dinggang Shen,
Ninghao Liu,
Quanzheng Li,
Xiang Li,
Dajiang Zhu,
Tianming Liu,
Wei Liu
Abstract:
The emergence of artificial general intelligence (AGI) is transforming radiation oncology. As prominent vanguards of AGI, large language models (LLMs) such as GPT-4 and PaLM 2 can process extensive texts and large vision models (LVMs) such as the Segment Anything Model (SAM) can process extensive imaging data to enhance the efficiency and precision of radiation therapy. This paper explores full-spectrum applications of AGI across radiation oncology including initial consultation, simulation, treatment planning, treatment delivery, treatment verification, and patient follow-up. The fusion of vision data with LLMs also creates powerful multimodal models that elucidate nuanced clinical patterns. Together, AGI promises to catalyze a shift towards data-driven, personalized radiation therapy. However, these models should complement human expertise and care. This paper provides an overview of how AGI can transform radiation oncology to elevate the standard of patient care in radiation oncology, with the key insight being AGI's ability to exploit multimodal clinical data at scale.
Submitted 5 September, 2023;
originally announced September 2023.
-
Direct observation of topological surface states in the layered kagome lattice with broken time-reversal symmetry
Authors:
Zhicheng Jiang,
Tongrui Li,
Jian Yuan,
Zhengtai Liu,
Zhipeng Cao,
Soohyun Cho,
Mingfang Shu,
Yichen Yang,
Jianyang Ding,
Zhikai Li,
Jiayu Liu,
Zhonghao Liu,
Jishan Liu,
Jie Ma,
Zhe Sun,
Yanfeng Guo,
Dawei Shen
Abstract:
Magnetic topological quantum materials display a diverse range of fascinating physical properties which arise from their intrinsic magnetism and the breaking of time-reversal symmetry. However, so far, few examples of intrinsic magnetic topological materials have been confirmed experimentally, which significantly hinders our comprehensive understanding of the abundant physical properties in this system. The kagome lattices, which host a diversity of electronic structure signatures such as Dirac nodes, flat bands, and saddle points, provide an alternative and promising platform for in-depth investigations into correlations and band topology. In this article, drawing inspiration from the stacking configuration of MnBi$_2$Te$_4$, we conceive and then synthesize a high-quality single crystal EuTi$_3$Bi$_4$, which is a unique natural heterostructure consisting of both topological kagome layers and magnetic interlayers. We investigate the electronic structure of EuTi$_3$Bi$_4$ and uncover distinct features of anisotropic multiple Van Hove singularities (VHSs) that might prevent Fermi surface nesting, leading to the absence of a charge density wave (CDW). In addition, we identify the topologically nontrivial surface states that serve as connections between different saddle bands in the vicinity of the Fermi level. Combined with calculations, we establish that the effective time-reversal symmetry $S=θτ_{1/2}$ plays a crucial role in the antiferromagnetic ground state of EuTi$_3$Bi$_4$, which ensures the stability of the topological surface states and gives rise to their intriguing topological nature. Therefore, EuTi$_3$Bi$_4$ offers the rare opportunity to investigate correlated topological states in magnetic kagome materials.
Submitted 4 September, 2023;
originally announced September 2023.
-
Electronic band reconstruction across the insulator-metal transition in colossal magnetoresistive EuCd2P2
Authors:
Huali Zhang,
Feng Du,
Xiaoying Zheng,
Shuaishuai Luo,
Yi Wu,
Hao Zheng,
Shengtao Cui,
Zhe Sun,
Zhengtai Liu,
Dawei Shen,
Michael Smidman,
Yu Song,
Ming Shi,
Zhicheng Zhong,
Chao Cao,
Huiqiu Yuan,
Yang Liu
Abstract:
While colossal magnetoresistance (CMR) in Eu-based compounds is often associated with strong spin-carrier interactions, the underlying reconstruction of the electronic bands is much less understood from spectroscopic experiments. Here using angle-resolved photoemission, we directly observe an electronic band reconstruction across the insulator-metal (and magnetic) transition in the recently discovered CMR compound EuCd2P2. This transition is manifested by a large magnetic band splitting associated with the magnetic order, as well as unusual energy shifts of the valence bands: both the large ordered moment of Eu and carrier localization in the paramagnetic phase are crucial. Our results provide spectroscopic evidence for an electronic structure reconstruction underlying the enormous CMR observed in EuCd2P2, which could be important for understanding Eu-based CMR materials, as well as designing CMR materials based on large-moment rare-earth magnets.
Submitted 31 August, 2023;
originally announced August 2023.
-
Variability of magnetic hot stars from the TESS observations
Authors:
Dong-Xiang Shen,
Gang Li,
Iskandar Abdusamatjan,
Jian-Ning Fu,
Chun-Hua Zhu,
Jin-Long Yu,
Yu Zhang,
Guo-Liang Lv,
Nan-Nan Zhai,
Jin-Zhong Liu
Abstract:
Magnetic hot stars are stars with effective temperatures approximately in the range from 7,000 to 50,000 K and with large-scale, globally organized magnetic fields. These magnetic fields exhibit strengths ranging from tens of gauss to tens of kilogauss. They are key to understanding the effects of magnetic fields on stellar evolution. However, only three magnetic hot stars have been studied via a combination of spectropolarimetric and asteroseismic modeling. Using data from sectors 1-56 of the $Transiting\;Exoplanet\;Survey\;Satellite\;(TESS)$, we provide a study of the photometric variability and stochastic low frequency (SLF) variability of 118 magnetic hot stars. Nine new rotating variable stars are identified. Using the Bayesian Markov Chain Monte Carlo (MCMC) framework, we fitted the morphologies of SLF variability for magnetic hot stars. Our analysis reveals that the magnetic hot stars in our sample have $γ< 5.5$, with the vast majority having $1 \leq γ\leq 3$. The $ν_{\rm char}$ is primarily in the range $0\;\text{d}^{-1} < ν_{\rm char} < 6.3\;\text{d}^{-1}$. The amplitude of SLF variability, $\log α_{0}$, shows a dominant distribution ranging from 0.8 to 3. No significant correlations are observed between the luminosity and fitting parameters, suggesting no clear dependence of SLF variability on stellar mass for our sample of magnetic hot stars with masses in the range of approximately $1.5 M_{\odot}< M < 20 M_{\odot}$. We found a significant negative correlation between $B_{\rm p}$ and $ν_{\rm char}$. This suppression effect of magnetic fields on $ν_{\rm char}$ may be a result of their inhibition of macroturbulence.
Submitted 29 August, 2023;
originally announced August 2023.
-
Magnetic kagome materials RETi3Bi4 family with weak interlayer interactions
Authors:
Jingwen Guo,
Liqin Zhou,
Jianyang Ding,
Gexing Qu,
Zhengtai Liu,
Yu Du,
Heng Zhang,
Jiajun Li,
Yiying Zhang,
Fuwei Zhou,
Wuyi Qi,
Fengyi Guo,
Tianqi Wang,
Fucong Fei,
Yaobo Huang,
Tian Qian,
Dawei Shen,
Hongming Weng,
Fengqi Song
Abstract:
Kagome materials have attracted a surge of research interest recently, especially those that combine with magnetism and those with weak interlayer interactions, which can be fabricated into thin devices. However, kagome materials combining both magnetism and weak interlayer interactions are rare. Here we investigate a new family of titanium-based kagome materials RETi3Bi4 (RE = Eu, Gd and Sm). Flakes of RETi3Bi4 of nanometer thickness can be obtained by exfoliation due to the weak interlayer interactions. According to magnetic measurements, out-of-plane ferromagnetism, out-of-plane antiferromagnetism, and in-plane ferromagnetism are formed for RE = Eu, Gd, and Sm, respectively. The magnetic orders are simple and the saturation magnetizations can be relatively large since the rare earth elements solely provide the magnetic moments. Further, the electronic structures of RETi3Bi4 are investigated by angle-resolved photoemission spectroscopy (ARPES) and first-principles calculations. The ARPES results are consistent with the calculations, indicating band characteristics of the kagome sublattice in RETi3Bi4. We expect these materials to be promising candidates for observation of the exotic magnetic topological phases and the related topological quantum transport studies.
Submitted 28 August, 2023;
originally announced August 2023.
-
Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction
Authors:
Zeyu Han,
Yuhan Wang,
Luping Zhou,
Peng Wang,
Binyu Yan,
Jiliu Zhou,
Yan Wang,
Dinggang Shen
Abstract:
To obtain high-quality positron emission tomography (PET) scans while reducing radiation exposure to the human body, various approaches have been proposed to reconstruct standard-dose PET (SPET) images from low-dose PET (LPET) images. One widely adopted technique is the generative adversarial networks (GANs), yet recently, diffusion probabilistic models (DPMs) have emerged as a compelling alternative due to their improved sample quality and higher log-likelihood scores compared to GANs. Despite this, DPMs suffer from two major drawbacks in real clinical settings, i.e., the computationally expensive sampling process and the insufficient preservation of correspondence between the conditioning LPET image and the reconstructed PET (RPET) image. To address the above limitations, this paper presents a coarse-to-fine PET reconstruction framework that consists of a coarse prediction module (CPM) and an iterative refinement module (IRM). The CPM generates a coarse PET image via a deterministic process, and the IRM samples the residual iteratively. By delegating most of the computational overhead to the CPM, the overall sampling speed of our method can be significantly improved. Furthermore, two additional strategies, i.e., an auxiliary guidance strategy and a contrastive diffusion strategy, are proposed and integrated into the reconstruction process, which can enhance the correspondence between the LPET image and the RPET image, further improving clinical reliability. Extensive experiments on two human brain PET datasets demonstrate that our method outperforms the state-of-the-art PET reconstruction methods. The source code is available at \url{https://github.com/Show-han/PET-Reconstruction}.
Submitted 20 August, 2023;
originally announced August 2023.
-
Photoemission Evidence of a Novel Charge Order in Kagome Metal FeGe
Authors:
Zhisheng Zhao,
Tongrui Li,
Peng Li,
Xueliang Wu,
Jianghao Yao,
Ziyuan Chen,
Shengtao Cui,
Zhe Sun,
Yichen Yang,
Zhicheng Jiang,
Zhengtai Liu,
Alex Louat,
Timur Kim,
Cephise Cacho,
Aifeng Wang,
Yilin Wang,
Dawei Shen,
Juan Jiang,
Donglai Feng
Abstract:
A charge order has been discovered to emerge deep in the antiferromagnetic phase of the kagome metal FeGe. To study its origin, the evolution of the low-lying electronic structure across the charge order phase transition is investigated with angle-resolved photoemission spectroscopy. We do not find signatures of nesting between Fermi surface sections or van Hove singularities in the zero-frequency joint density of states, and there are no obvious energy gaps at the Fermi level, which excludes the nesting mechanism for the charge order formation in FeGe. However, two obvious changes in the band structure have been detected, i.e., one electron-like band around the K point and another one around the A point move upward in energy when the charge order forms. These features can be well reproduced by our density-functional theory calculations, where the charge order is primarily driven by magnetic energy saving via large dimerizations of a quarter of Ge1-sites (in the kagome plane) along the c-axis. Our results provide strong support for this novel charge order formation mechanism in FeGe, in contrast to the conventional nesting mechanism.
Submitted 16 August, 2023;
originally announced August 2023.
-
Tissue Segmentation of Thick-Slice Fetal Brain MR Scans with Guidance from High-Quality Isotropic Volumes
Authors:
Shijie Huang,
Xukun Zhang,
Zhiming Cui,
He Zhang,
Geng Chen,
Dinggang Shen
Abstract:
Accurate tissue segmentation of thick-slice fetal brain magnetic resonance (MR) scans is crucial for both reconstruction of isotropic brain MR volumes and the quantification of fetal brain development. However, this task is challenging due to the use of thick-slice scans in clinically-acquired fetal brain data. To address this issue, we propose to leverage high-quality isotropic fetal brain MR volumes (and also their corresponding annotations) as guidance for segmentation of thick-slice scans. Due to the existence of a significant domain gap between high-quality isotropic volumes (i.e., source data) and thick-slice scans (i.e., target data), we employ a domain adaptation technique to achieve the associated knowledge transfer (from high-quality <source> volumes to thick-slice <target> scans). Specifically, we first register the available high-quality isotropic fetal brain MR volumes across different gestational weeks to construct longitudinally-complete source data. To capture domain-invariant information, we then perform Fourier decomposition to extract image content and style codes. Finally, we propose a novel Cycle-Consistent Domain Adaptation Network (C2DA-Net) to efficiently transfer the knowledge learned from high-quality isotropic volumes for accurate tissue segmentation of thick-slice scans. Our C2DA-Net can fully utilize a small set of annotated isotropic volumes to guide tissue segmentation on unannotated thick-slice scans. Extensive experiments on a large-scale dataset of 372 clinically acquired thick-slice MR scans demonstrate that our C2DA-Net achieves much better performance than cutting-edge methods quantitatively and qualitatively.
Submitted 4 December, 2023; v1 submitted 13 August, 2023;
originally announced August 2023.
-
TriDo-Former: A Triple-Domain Transformer for Direct PET Reconstruction from Low-Dose Sinograms
Authors:
Jiaqi Cui,
Pinxian Zeng,
Xinyi Zeng,
Peng Wang,
Xi Wu,
Jiliu Zhou,
Yan Wang,
Dinggang Shen
Abstract:
To obtain high-quality positron emission tomography (PET) images while minimizing radiation exposure, various methods have been proposed for reconstructing standard-dose PET (SPET) images from low-dose PET (LPET) sinograms directly. However, current methods often neglect boundaries during sinogram-to-image reconstruction, resulting in high-frequency distortion in the frequency domain and diminished or fuzzy edges in the reconstructed images. Furthermore, the convolutional architectures, which are commonly used, lack the ability to model long-range non-local interactions, potentially leading to inaccurate representations of global structures. To alleviate these problems, we propose a transformer-based model that unites triple domains of sinogram, image, and frequency for direct PET reconstruction, namely TriDo-Former. Specifically, the TriDo-Former consists of two cascaded networks, i.e., a sinogram enhancement transformer (SE-Former) for denoising the input LPET sinograms and a spatial-spectral reconstruction transformer (SSR-Former) for reconstructing SPET images from the denoised sinograms. Different from the vanilla transformer that splits an image into 2D patches, based specifically on the PET imaging mechanism, our SE-Former divides the sinogram into 1D projection view angles to maintain its inner structure while denoising, preventing the noise in the sinogram from propagating into the image domain. Moreover, to mitigate high-frequency distortion and improve reconstruction details, we integrate global frequency parsers (GFPs) into SSR-Former. The GFP serves as a learnable frequency filter that globally adjusts the frequency components in the frequency domain, forcing the network to restore high-frequency details resembling real SPET images. Validations on a clinical dataset demonstrate that our TriDo-Former outperforms the state-of-the-art methods qualitatively and quantitatively.
Submitted 10 August, 2023;
originally announced August 2023.
-
Jet-hadron correlations with respect to the event plane in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions in STAR
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai,
H. Caines
, et al. (340 additional authors not shown)
Abstract:
Angular distributions of charged particles relative to jet axes are studied in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions as a function of the jet orientation with respect to the event plane. This differential study tests the expected path-length dependence of energy loss experienced by a hard-scattered parton as it traverses the hot and dense medium formed in heavy-ion collisions. A second-order event plane is used in the analysis as an experimental estimate of the reaction plane formed by the collision impact parameter and the beam direction. Charged-particle jets with $15 < p_{\rm T, jet} < 20$ and $20 < p_{\rm T, jet} < 40$ GeV/$c$ were reconstructed with the anti-$k_{\rm T}$ algorithm with a radius parameter of $R = 0.4$ in the 20-50% centrality bin to maximize the initial-state eccentricity of the interaction region. The reaction plane fit method is implemented to remove the flow-modulated background with better precision than prior methods. Yields and widths of jet-associated charged-hadron distributions are extracted in three angular bins between the jet axis and the event plane. The event-plane (EP) dependence is further quantified by ratios of the associated yields in different EP bins. No dependence on orientation of the jet axis with respect to the event plane is seen within the uncertainties in the kinematic regime studied. This finding is consistent with a similar experimental observation by ALICE in $\sqrt{s_{\mathrm{NN}}}$ = 2.76 TeV Pb+Pb collision data.
Submitted 20 March, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Evaluating Large Language Models for Radiology Natural Language Processing
Authors:
Zhengliang Liu,
Tianyang Zhong,
Yiwei Li,
Yutong Zhang,
Yi Pan,
Zihao Zhao,
Peixin Dong,
Chao Cao,
Yuxiao Liu,
Peng Shu,
Yaonai Wei,
Zihao Wu,
Chong Ma,
Jiaqi Wang,
Sheng Wang,
Mengyue Zhou,
Zuowei Jiang,
Chunlin Li,
Jason Holmes,
Shaochen Xu,
Lu Zhang,
Haixing Dai,
Kai Zhang,
Lin Zhao,
Yuanhao Chen
, et al. (20 additional authors not shown)
Abstract:
The rise of large language models (LLMs) has marked a pivotal shift in the field of natural language processing (NLP). LLMs have revolutionized a multitude of domains, and they have made a significant impact in the medical field. Large language models are now more abundant than ever, and many of these models exhibit bilingual capabilities, being proficient in both English and Chinese. However, a comprehensive evaluation of these models remains to be conducted. This lack of assessment is especially apparent within the context of radiology NLP. This study seeks to bridge this gap by critically evaluating thirty-two LLMs in interpreting radiology reports, a crucial component of radiology NLP. Specifically, the ability to derive impressions from radiologic findings is assessed. The outcomes of this evaluation provide key insights into the performance, strengths, and weaknesses of these LLMs, informing their practical applications within the medical domain.
Submitted 27 July, 2023; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Multi-View Vertebra Localization and Identification from CT Images
Authors:
Han Wu,
Jiadong Zhang,
Yu Fang,
Zhentao Liu,
Nizhuan Wang,
Zhiming Cui,
Dinggang Shen
Abstract:
Accurately localizing and identifying vertebrae from CT images is crucial for various clinical applications. However, most existing efforts operate in 3D with cropped patches, suffering from large computation costs and limited global information. In this paper, we propose a multi-view vertebra localization and identification method for CT images, converting the 3D problem into a 2D localization and identification task on different views. Without the limitation of the 3D cropped patch, our method can learn the multi-view global information naturally. Moreover, to better capture the anatomical structure information from different view perspectives, a multi-view contrastive learning strategy is developed to pre-train the backbone. Additionally, we further propose a Sequence Loss to maintain the sequential structure embedded along the vertebrae. Evaluation results demonstrate that, with only two 2D networks, our method can localize and identify vertebrae in CT images accurately, and outperforms the state-of-the-art methods consistently. Our code is available at https://github.com/ShanghaiTech-IMPACT/Multi-View-Vertebra-Localization-and-Identification-from-CT-Images.
Submitted 24 July, 2023;
originally announced July 2023.
-
A catalogue and statistical analysis for magnetic stars
Authors:
Abdurepqet Rustem,
Guoliang Lv,
**zhong Liu,
Chunhua Zhu,
Yu Zhang,
Dongxiang Shen,
Yuhao Zhang,
Xiaolong He
Abstract:
Magnetic fields are significant in the structure and evolution of stars. We present a comprehensive catalogue of 1784 known magnetic stars, detailing their identifications, HD numbers, precise locations, spectral types, and averaged quadratic effective magnetic fields among other important information. The group comprises 177 O-type stars, 551 B-type stars, 520 A-type stars, 91 F-type stars, 53 G-type stars, 61 K-type stars, 31 M-type stars, and an additional 300 stars whose spectral classification remains indeterminate. Our analysis examines the statistical properties of these magnetic stars. The relative integrated distribution function and number distribution function for all magnetic stars of the same spectral type can be effectively approximated using an exponential function of the averaged quadratic effective magnetic field. The analysis further reveals that A and B-type stars possess the strongest mean magnetic fields, indicating an easier detection of their magnetic fields.
Submitted 23 July, 2023;
originally announced July 2023.
-
Accurate 3D Prediction of Missing Teeth in Diverse Patterns for Precise Dental Implant Planning
Authors:
Lei Ma,
Peng Xue,
Yuning Gu,
Yue Zhao,
Min Zhu,
Zhongxiang Ding,
Dinggang Shen
Abstract:
In recent years, the demand for dental implants has surged, driven by their high success rates and esthetic advantages. However, accurate prediction of missing teeth for precise digital implant planning remains a challenge due to the intricate nature of dental structures and the variability in tooth loss patterns. This study presents a novel framework for accurate prediction of missing teeth in different patterns, facilitating digital implant planning. The proposed framework begins by estimating point-to-point correspondence among a dataset of dental mesh models reconstructed from CBCT images of healthy subjects. Subsequently, tooth dictionaries are constructed for each tooth type, encoding their position and shape information based on the established point-to-point correspondence. To predict missing teeth in a given dental mesh model, sparse coefficients are learned by sparsely representing the teeth adjacent to the missing teeth using the corresponding tooth dictionaries. These coefficients are then applied to the dictionaries of the missing teeth to generate accurate predictions of their positions and shapes. The evaluation results on real subjects show that our proposed framework achieves an average prediction error of 1.04 mm for the prediction of a single missing tooth and an average prediction error of 1.33 mm for the prediction of 14 missing teeth, which demonstrates its capability of accurately predicting missing teeth in various patterns. By accurately predicting missing teeth, dental professionals can improve the planning and placement of dental implants, leading to better esthetic and functional outcomes for patients undergoing dental implant procedures.
Submitted 16 July, 2023;
originally announced July 2023.
-
A Comprehensive Survey of Artificial Intelligence Techniques for Talent Analytics
Authors:
Chuan Qin,
Le Zhang,
Yihang Cheng,
Rui Zha,
Dazhong Shen,
Qi Zhang,
Xi Chen,
Ying Sun,
Chen Zhu,
Hengshu Zhu,
Hui Xiong
Abstract:
In today's competitive and fast-evolving business environment, it is a critical time for organizations to rethink how to make talent-related decisions in a quantitative manner. Indeed, the recent development of Big Data and Artificial Intelligence (AI) techniques has revolutionized human resource management. The availability of large-scale talent and management-related data provides unparalleled opportunities for business leaders to comprehend organizational behaviors and gain tangible knowledge from a data science perspective, which in turn delivers intelligence for real-time decision-making and effective talent management at work for their organizations. In the last decade, talent analytics has emerged as a promising field in applied data science for human resource management, garnering significant attention from AI communities and inspiring numerous research efforts. To this end, we present an up-to-date and comprehensive survey on AI technologies used for talent analytics in the field of human resource management. Specifically, we first provide the background knowledge of talent analytics and categorize various pertinent data. Subsequently, we offer a comprehensive taxonomy of relevant research efforts, categorized based on three distinct application-driven scenarios: talent management, organization management, and labor market analysis. In conclusion, we summarize the open challenges and potential prospects for future research directions in the domain of AI-driven talent analytics.
Submitted 5 May, 2024; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Review of Large Vision Models and Visual Prompt Engineering
Authors:
Jiaqi Wang,
Zhengliang Liu,
Lin Zhao,
Zihao Wu,
Chong Ma,
Sigang Yu,
Haixing Dai,
Qiushi Yang,
Yiheng Liu,
Songyao Zhang,
Enze Shi,
Yi Pan,
Tuo Zhang,
Dajiang Zhu,
Xiang Li,
Xi Jiang,
Bao Ge,
Yixuan Yuan,
Dinggang Shen,
Tianming Liu,
Shu Zhang
Abstract:
Visual prompt engineering is a fundamental technology in the field of visual and image Artificial General Intelligence, serving as a key component for achieving zero-shot capabilities. As the development of large vision models progresses, the importance of prompt engineering becomes increasingly evident. Designing suitable prompts for specific visual tasks has emerged as a meaningful research direction. This review aims to summarize the methods employed in the computer vision domain for large vision models and visual prompt engineering, exploring the latest advancements in visual prompt engineering. We present influential large models in the visual domain and a range of prompt engineering methods employed on these models. It is our hope that this review provides a comprehensive and systematic description of prompt engineering methods based on large visual models, offering valuable insights for future researchers in their exploration of this field.
Submitted 3 July, 2023;
originally announced July 2023.
-
Kagome surface states and weak electronic correlation in vanadium-kagome metals
Authors:
Jianyang Ding,
Ningning Zhao,
Zicheng Tao,
Zhe Huang,
Zhicheng Jiang,
Yichen Yang,
Soohyun Cho,
Zhengtai Liu,
Jishan Liu,
Yanfeng Guo,
Kai Liu,
Zhonghao Liu,
Dawei Shen
Abstract:
RV6Sn6 (R = Y and lanthanides) with two-dimensional vanadium-kagome surface states is an ideal platform to investigate kagome physics and manipulate the kagome features to realize novel phenomena. Utilizing micron-scale spatially resolved angle-resolved photoemission spectroscopy (ARPES) and first-principles calculations, we report a systematic study of the electronic structures of RV6Sn6 (R = Gd, Tb, and Lu) on the two cleaved surfaces, i.e., the V- and RSn1-terminated (001) surfaces. The calculated bands without any renormalization match well with the main ARPES dispersive features, indicating the weak electronic correlation in this system. We observe 'W'-like kagome surface states around the Brillouin zone corners showing R-element-dependent intensities, which is probably due to various coupling strengths between V and RSn1 layers. Our finding suggests an avenue for tuning electronic states by interlayer coupling based on two-dimensional kagome lattices.
Submitted 29 June, 2023;
originally announced June 2023.
-
ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer
Authors:
Jiaxin Deng,
Dong Shen,
Shiyao Wang,
Xiangyu Wu,
Fan Yang,
Guorui Zhou,
Gaofeng Meng
Abstract:
In recent years, live streaming platforms have gained immense popularity as they allow users to broadcast their videos and interact in real-time with hosts and peers. Due to the dynamic changes of live content, accurate recommendation models are crucial for enhancing user experience. However, most previous works treat the live stream as a whole item and explore the Click-Through-Rate (CTR) prediction framework at the item level, neglecting the dynamic changes that occur even within the same live room. In this paper, we propose a ContentCTR model that leverages a multimodal transformer for frame-level CTR prediction. First, we present an end-to-end framework that can make full use of multimodal information, including visual frames, audio, and comments, to identify the most attractive live frames. Second, to prevent the model from collapsing into a mediocre solution, a novel pairwise loss function with first-order difference constraints is proposed to utilize the contrastive information existing in the highlight and non-highlight frames. Additionally, we design a temporal text-video alignment module based on Dynamic Time Warping to eliminate noise caused by the ambiguity and non-sequential alignment of visual and textual information. We conduct extensive experiments on both real-world scenarios and public datasets, and our ContentCTR model outperforms traditional recommendation models in capturing real-time content changes. Moreover, we deploy the proposed method on our company platform, and the results of online A/B testing further validate its practical significance.
Submitted 25 June, 2023;
originally announced June 2023.
-
Radiology-GPT: A Large Language Model for Radiology
Authors:
Zhengliang Liu,
Aoxiao Zhong,
Yiwei Li,
Longtao Yang,
Chao Ju,
Zihao Wu,
Chong Ma,
Peng Shu,
Cheng Chen,
Sekeun Kim,
Haixing Dai,
Lin Zhao,
Lichao Sun,
Dajiang Zhu,
Jun Liu,
Wei Liu,
Dinggang Shen,
Xiang Li,
Quanzheng Li,
Tianming Liu
Abstract:
We introduce Radiology-GPT, a large language model for radiology. Using an instruction tuning approach on an extensive dataset of radiology domain knowledge, Radiology-GPT demonstrates superior performance compared to general language models such as StableLM, Dolly and LLaMA. It exhibits significant versatility in radiological diagnosis, research, and communication. This work serves as a catalyst for future developments in clinical NLP. The successful implementation of Radiology-GPT is indicative of the potential of localizing generative large language models, specifically tailored for distinctive medical specialties, while ensuring adherence to privacy standards such as HIPAA. The prospect of developing individualized, large-scale language models that cater to specific needs of various hospitals presents a promising direction. The fusion of conversational competence and domain-specific knowledge in these models is set to foster future development in healthcare AI. A demo of Radiology-GPT is available at https://huggingface.co/spaces/allen-eric/radiology-gpt.
Submitted 19 March, 2024; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Artificial General Intelligence for Medical Imaging
Authors:
Xiang Li,
Lu Zhang,
Zihao Wu,
Zhengliang Liu,
Lin Zhao,
Yixuan Yuan,
Jun Liu,
Gang Li,
Dajiang Zhu,
**kun Yan,
Quanzheng Li,
Wei Liu,
Tianming Liu,
Dinggang Shen
Abstract:
In this review, we explore the potential applications of Artificial General Intelligence (AGI) models in healthcare, focusing on foundational Large Language Models (LLMs), Large Vision Models, and Large Multimodal Models. We emphasize the importance of integrating clinical expertise, domain knowledge, and multimodal capabilities into AGI models. In addition, we lay out key roadmaps that guide the development and deployment of healthcare AGI models. Throughout the review, we provide critical perspectives on the potential challenges and pitfalls associated with deploying large-scale AGI models in the medical field. This comprehensive review aims to offer insights into the future implications of AGI in medical imaging, healthcare and beyond.
Submitted 2 July, 2023; v1 submitted 8 June, 2023;
originally announced June 2023.
-
A New Scoring Method for the Evaluation of Vehicle Road Departure Detection Systems
Authors:
Dan Shen,
Lingxi Li,
Stanley Chien,
Yaobin Chen,
Rini Sherony
Abstract:
Road departure detection systems (RDDSs) for eliminating unintentional road departure collisions have been developed and equipped on some commercial vehicles in recent years. In order to provide a standardized and objective performance evaluation of RDDSs that is not affected by the complex nature of RDDSs and their design requirements, this paper proposes a scoring method for evaluating vehicle RDDSs. Two main criteria, road departure warning (RDW) and road keeping assistance (RKA), are used to describe the performance of RDDSs. Both flat road edges and vertical road edges are considered in the proposed scoring method, which combines two key variables: 1) the lateral distance of the vehicle from the road edge when RDW triggers; 2) the lateral distance of the vehicle from the road edge when RKA triggers.
Submitted 8 June, 2023;
originally announced June 2023.
-
Hyperspectral Target Detection Based on Low-Rank Background Subspace Learning and Graph Laplacian Regularization
Authors:
Dunbin Shen,
Xiaorui Ma,
Wenfeng Kong,
Jiacheng Tian,
Hongyu Wang
Abstract:
Hyperspectral target detection is good at finding dim and small objects based on spectral characteristics. However, existing representation-based methods are hindered by the problem of the unknown background dictionary and insufficient utilization of spatial information. To address these issues, this paper proposes an efficient optimization approach based on low-rank representation (LRR) and graph Laplacian regularization (GLR). Firstly, to obtain a complete and pure background dictionary, we propose an LRR-based background subspace learning method by jointly mining the low-dimensional structure of all pixels. Secondly, to fully exploit local spatial relationships and capture the underlying geometric structure, a local region-based GLR is employed to estimate the coefficients. Finally, the desired detection map is generated by computing the ratio of representation errors from binary hypothesis testing. The experiments conducted on two benchmark datasets validate the effectiveness and superiority of the approach. For reproduction, the accompanying code is available at https://github.com/shendb2022/LRBSL-GLR.
Submitted 1 June, 2023;
originally announced June 2023.
-
ChatCAD+: Towards a Universal and Reliable Interactive CAD using LLMs
Authors:
Zihao Zhao,
Sheng Wang,
**chen Gu,
Yitao Zhu,
Lanzhuju Mei,
Zixu Zhuang,
Zhiming Cui,
Qian Wang,
Dinggang Shen
Abstract:
The integration of Computer-Aided Diagnosis (CAD) with Large Language Models (LLMs) presents a promising frontier in clinical applications, notably in automating diagnostic processes akin to those performed by radiologists and providing consultations similar to a virtual family doctor. Despite the promising potential of this integration, current works face at least two limitations: (1) From the perspective of a radiologist, existing studies typically have a restricted scope of applicable imaging domains, failing to meet the diagnostic needs of different patients. Also, the insufficient diagnostic capability of LLMs further undermines the quality and reliability of the generated medical reports. (2) Current LLMs lack the requisite depth in medical expertise, rendering them less effective as virtual family doctors due to the potential unreliability of the advice provided during patient consultations. To address these limitations, we introduce ChatCAD+, designed to be universal and reliable. Specifically, it features two main modules: (1) Reliable Report Generation and (2) Reliable Interaction. The Reliable Report Generation module is capable of interpreting medical images from diverse domains and generating high-quality medical reports via our proposed hierarchical in-context learning. Concurrently, the interaction module leverages up-to-date information from reputable medical websites to provide reliable medical advice. Together, these designed modules synergize to closely align with the expertise of human medical professionals, offering enhanced consistency and reliability for interpretation and advice. The source code is available at https://github.com/zhaozh10/ChatCAD.
Submitted 17 April, 2024; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Optimal Control of Logically Constrained Partially Observable and Multi-Agent Markov Decision Processes
Authors:
Krishna C. Kalagarla,
Dhruva Kartik,
Dongming Shen,
Rahul Jain,
Ashutosh Nayyar,
Pierluigi Nuzzo
Abstract:
Autonomous systems often have logical constraints arising, for example, from safety, operational, or regulatory requirements. Such constraints can be expressed using temporal logic specifications. The system state is often partially observable. Moreover, it could encompass a team of multiple agents with a common objective but disparate information structures and constraints. In this paper, we first introduce an optimal control theory for partially observable Markov decision processes (POMDPs) with finite linear temporal logic constraints. We provide a structured methodology for synthesizing policies that maximize a cumulative reward while ensuring that the probability of satisfying a temporal logic constraint is sufficiently high. Our approach comes with guarantees on approximate reward optimality and constraint satisfaction. We then build on this approach to design an optimal control framework for logically constrained multi-agent settings with information asymmetry. We illustrate the effectiveness of our approach by implementing it on several case studies.
Submitted 19 June, 2024; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Preference or Intent? Double Disentangled Collaborative Filtering
Authors:
Chao Wang,
Hengshu Zhu,
Dazhong Shen,
Wei wu,
Hui Xiong
Abstract:
People usually have different intents for choosing items, while their preferences under the same intent may also differ. In traditional collaborative filtering approaches, both intent and preference factors are usually entangled in the modeling process, which significantly limits the robustness and interpretability of recommendation performance. For example, low-rating items are always treated as negative feedback, although they could actually provide positive information about user intent. To this end, in this paper, we propose a two-fold representation learning approach, namely Double Disentangled Collaborative Filtering (DDCF), for personalized recommendations. The first-level disentanglement separates the influence factors of intent and preference, while the second-level disentanglement builds independent sparse preference representations under individual intent with limited computational complexity. Specifically, we employ two variational autoencoder networks, an intent recognition network and a preference decomposition network, to learn the intent and preference factors, respectively. In this way, low-rating items are treated as positive samples for modeling intents but as negative samples for modeling preferences. Finally, extensive experiments on three real-world datasets and four evaluation metrics clearly validate the effectiveness and interpretability of DDCF.
Submitted 18 May, 2023;
originally announced May 2023.
-
Learning Better Contrastive View from Radiologist's Gaze
Authors:
Sheng Wang,
Zixu Zhuang,
Xi Ouyang,
Lichi Zhang,
Zheren Li,
Chong Ma,
Tianming Liu,
Dinggang Shen,
Qian Wang
Abstract:
Recent self-supervised contrastive learning methods greatly benefit from the Siamese structure, which aims to minimize distances between positive pairs. These methods usually apply random data augmentation to input images, expecting the augmented views of the same images to be similar and positively paired. However, random augmentation may overlook image semantic information and degrade the quality of augmented views in contrastive learning. This issue becomes more challenging in medical images, since the abnormalities related to diseases can be tiny and are easily corrupted (e.g., cropped out) in the current scheme of random augmentation. In this work, we first demonstrate that, for widely used X-ray images, the conventional augmentation prevalent in contrastive pre-training can affect the performance of downstream diagnosis or classification tasks. Then, we propose a novel augmentation method, i.e., FocusContrast, to learn from radiologists' gaze in diagnosis and generate contrastive views for medical images with guidance from radiologists' visual attention. Specifically, we track the gaze movement of radiologists and model their visual attention when they read X-ray images for diagnosis. The learned model can predict the visual attention of radiologists given a new input image, and further guide an attention-aware augmentation that rarely neglects disease-related abnormalities. As a plug-and-play and framework-agnostic module, FocusContrast consistently improves the state-of-the-art contrastive learning methods SimCLR, MoCo, and BYOL by 4.0% to 7.0% in classification accuracy on a knee X-ray dataset.
Submitted 15 May, 2023;
originally announced May 2023.
-
Mining fMRI Dynamics with Parcellation Prior for Brain Disease Diagnosis
Authors:
Xiaozhao Liu,
Mianxin Liu,
Lang Mei,
Yuyao Zhang,
Feng Shi,
Han Zhang,
Dinggang Shen
Abstract:
To characterize atypical brain dynamics under diseases, prevalent studies investigate functional magnetic resonance imaging (fMRI). However, most of the existing analyses compress rich spatial-temporal information as the brain functional networks (BFNs) and directly investigate the whole-brain network without neurological priors about functional subnetworks. We thus propose a novel graph learning framework to mine fMRI signals with topological priors from brain parcellation for disease diagnosis. Specifically, we 1) detect diagnosis-related temporal features using a "Transformer" for a higher-level BFN construction, and process it with a following graph convolutional network, and 2) apply an attention-based multiple instance learning strategy to emphasize the disease-affected subnetworks to further enhance the diagnosis performance and interpretability. Experiments demonstrate higher effectiveness of our method than compared methods in the diagnosis of early mild cognitive impairment. More importantly, our method is capable of localizing crucial brain subnetworks during the diagnosis, providing insights into the pathogenic source of mild cognitive impairment.
Submitted 4 May, 2023;
originally announced May 2023.
-
Spatial and Modal Optimal Transport for Fast Cross-Modal MRI Reconstruction
Authors:
Qi Wang,
Zhijie Wen,
Jun Shi,
Qian Wang,
Dinggang Shen,
Shihui Ying
Abstract:
Multi-modal magnetic resonance imaging (MRI) plays a crucial role in comprehensive disease diagnosis in clinical medicine. However, acquiring certain modalities, such as T2-weighted images (T2WIs), is time-consuming and prone to motion artifacts, which negatively impacts subsequent multi-modal image analysis. To address this issue, we propose an end-to-end deep learning framework that utilizes T1-weighted images (T1WIs) as auxiliary modalities to expedite the acquisition of T2WIs. While image pre-processing is capable of mitigating misalignment, improper parameter selection leads to adverse pre-processing effects, requiring iterative experimentation and adjustment. To overcome this shortcoming, we employ Optimal Transport (OT) to synthesize T2WIs by aligning T1WIs and performing cross-modal synthesis, effectively mitigating spatial misalignment effects. Furthermore, we adopt an alternating iteration framework between the reconstruction task and the cross-modal synthesis task to optimize the final results. Then, we prove that the reconstructed T2WIs and the synthetic T2WIs become closer on the T2 image manifold as the iterations increase, and further illustrate that the improved reconstruction result enhances the synthesis process, whereas the enhanced synthesis result improves the reconstruction process. Finally, experimental results from FastMRI and internal datasets confirm the effectiveness of our method, demonstrating significant improvements in image reconstruction quality even at low sampling rates.
Submitted 21 May, 2024; v1 submitted 4 May, 2023;
originally announced May 2023.
-
Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT
Authors:
Zhenxiang Xiao,
Yuzhong Chen,
Lu Zhang,
Junjie Yao,
Zihao Wu,
Xiaowei Yu,
Yi Pan,
Lin Zhao,
Chong Ma,
Xinyu Liu,
Wei Liu,
Xiang Li,
Yixuan Yuan,
Dinggang Shen,
Dajiang Zhu,
Tianming Liu,
Xi Jiang
Abstract:
Prompts have been proven to play a crucial role in large language models, and in recent years, vision models have also been using prompts to improve scalability for multiple downstream tasks. In this paper, we focus on adapting prompt design based on instruction tuning into a visual transformer model for image classification, which we call Instruction-ViT. The key idea is to implement multi-modal prompts (text or image prompts) related to category information to guide the fine-tuning of the model. Based on experiments on several image captioning tasks, the performance and domain adaptability were improved. Our work provides an innovative strategy to fuse multi-modal prompts with better performance and faster adaptability for visual classification models.
Submitted 29 April, 2023;
originally announced May 2023.
-
Prompt Engineering for Healthcare: Methodologies and Applications
Authors:
Jiaqi Wang,
Enze Shi,
Sigang Yu,
Zihao Wu,
Chong Ma,
Haixing Dai,
Qiushi Yang,
Yanqing Kang,
**ru Wu,
Huawen Hu,
Chenxi Yue,
Haiyang Zhang,
Yiheng Liu,
Yi Pan,
Zhengliang Liu,
Lichao Sun,
Xiang Li,
Bao Ge,
Xi Jiang,
Dajiang Zhu,
Yixuan Yuan,
Dinggang Shen,
Tianming Liu,
Shu Zhang
Abstract:
Prompt engineering is a critical technique in the field of natural language processing that involves designing and optimizing the prompts used to input information into models, aiming to enhance their performance on specific tasks. With the recent advancements in large language models, prompt engineering has shown significant superiority across various domains and has become increasingly important in the healthcare domain. However, there is a lack of comprehensive reviews specifically focusing on prompt engineering in the medical field. This review introduces the latest advances in prompt engineering in natural language processing for the medical field. First, we describe the development of prompt engineering and emphasize its significant contributions to healthcare natural language processing applications such as question-answering systems, text summarization, and machine translation. With the continuous improvement of general large language models, the importance of prompt engineering in the healthcare domain is becoming increasingly prominent. The aim of this article is to provide useful resources and bridges for healthcare natural language processing researchers to better explore the application of prompt engineering in this field. We hope that this review can provide new ideas and inspiration for research and application in medical natural language processing.
Submitted 23 March, 2024; v1 submitted 28 April, 2023;
originally announced April 2023.
-
ChatABL: Abductive Learning via Natural Language Interaction with ChatGPT
Authors:
Tianyang Zhong,
Yaonai Wei,
Li Yang,
Zihao Wu,
Zhengliang Liu,
Xiaozheng Wei,
Wenjun Li,
Junjie Yao,
Chong Ma,
Xiang Li,
Dajiang Zhu,
Xi Jiang,
Junwei Han,
Dinggang Shen,
Tianming Liu,
Tuo Zhang
Abstract:
Large language models (LLMs) such as ChatGPT have recently demonstrated significant potential in mathematical abilities, providing a valuable reasoning paradigm consistent with human natural language. However, LLMs currently have difficulty in bridging perception, language understanding and reasoning capabilities due to incompatibility of the underlying information flow among them, making it challenging to accomplish tasks autonomously. On the other hand, abductive learning (ABL) frameworks for integrating the two abilities of perception and reasoning have seen significant success in inverse decipherment of incomplete facts, but they are limited by the lack of semantic understanding of logical reasoning rules and the dependence on complicated domain knowledge representation. This paper presents a novel method (ChatABL) for integrating LLMs into the ABL framework, aiming at unifying the three abilities in a more user-friendly and understandable manner. The proposed method uses the strengths of LLMs in understanding and logical reasoning to correct the incomplete logical facts for optimizing the performance of the perceptual module, by summarizing and reorganizing reasoning rules represented in natural language format. Similarly, the perceptual module provides necessary reasoning examples for LLMs in natural language format. The variable-length handwritten equation deciphering task, an abstract expression of the Mayan calendar decoding, is used as a testbed to demonstrate that ChatABL has reasoning ability beyond most existing state-of-the-art methods, which is well supported by comparative studies. To the best of our knowledge, the proposed ChatABL is the first attempt to explore a new pattern for further approaching human-level cognitive ability via natural language interaction with ChatGPT.
Submitted 21 April, 2023;
originally announced April 2023.
-
Collision-energy Dependence of Deuteron Cumulants and Proton-deuteron Correlations in Au+Au collisions at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (343 additional authors not shown)
Abstract:
We report the first measurements of cumulants, up to $4^{th}$ order, of deuteron number distributions and proton-deuteron correlations in Au+Au collisions recorded by the STAR experiment in phase I of the Beam Energy Scan (BES) program at the Relativistic Heavy Ion Collider. Deuteron cumulants, their ratios, and proton-deuteron mixed cumulants are presented for different collision centralities covering a range of center-of-mass energy per nucleon pair $\sqrt{s_{NN}}$~=~7.7 to 200~GeV. It is found that the cumulant ratios at lower collision energies favor a canonical ensemble over a grand canonical ensemble in thermal models. An anti-correlation between proton and deuteron multiplicity is observed across all collision energies and centralities, consistent with the expectation from global baryon number conservation. The UrQMD model coupled with a phase-space coalescence mechanism qualitatively reproduces the collision-energy dependence of cumulant ratios and proton-deuteron correlations.
Submitted 28 June, 2024; v1 submitted 21 April, 2023;
originally announced April 2023.
-
Domain Generalization for Mammographic Image Analysis with Contrastive Learning
Authors:
Zheren Li,
Zhiming Cui,
Lichi Zhang,
Sheng Wang,
Chen** Lei,
Xi Ouyang,
Dongdong Chen,
Xiangyu Zhao,
Yajia Gu,
Zaiyi Liu,
Chunling Liu,
Dinggang Shen,
Jie-Zhi Cheng
Abstract:
The deep learning technique has been shown to effectively address several image analysis tasks in the computer-aided diagnosis scheme for mammography. The training of an efficacious deep learning model requires large data with diverse styles and qualities. The diversity of data often comes from the use of scanners from various vendors. In practice, however, it is impractical to collect a sufficient amount of diverse data for training. To this end, a novel contrastive learning scheme is developed to equip deep learning models with better style generalization capability. Specifically, a multi-style and multi-view unsupervised self-learning scheme is carried out to seek feature embeddings that are robust against style diversity, yielding a pretrained model. Afterward, the pretrained network is further fine-tuned to the downstream tasks, e.g., mass detection, matching, BI-RADS rating, and breast density classification. The proposed method has been evaluated extensively and rigorously with mammograms from various vendor style domains and several public datasets. The experimental results suggest that the proposed domain generalization method can effectively improve the performance of four mammographic image tasks on data from both seen and unseen domains, and outperform many state-of-the-art (SOTA) generalization methods.
Submitted 7 September, 2023; v1 submitted 20 April, 2023;
originally announced April 2023.
-
Event-by-event correlations between $Λ$ ($\barΛ$) hyperon global polarization and handedness with charged hadron azimuthal separation in Au+Au collisions at $\sqrt{s_{\text{NN}}} = 27 \text{ GeV}$ from STAR
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
D. M. Anderson,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
Global polarizations ($P$) of $Λ$ ($\barΛ$) hyperons have been observed in non-central heavy-ion collisions. The strong magnetic field primarily created by the spectator protons in such collisions would split the $Λ$ and $\barΛ$ global polarizations ($ΔP = P_Λ - P_{\barΛ} < 0$). Additionally, quantum chromodynamics (QCD) predicts topological charge fluctuations in vacuum, resulting in a chirality imbalance or parity violation in a local domain. This would give rise to an imbalance ($Δn = \frac{N_{\text{L}} - N_{\text{R}}}{\langle N_{\text{L}} + N_{\text{R}} \rangle} \neq 0$) between left- and right-handed $Λ$ ($\barΛ$) as well as a charge separation along the magnetic field, referred to as the chiral magnetic effect (CME). This charge separation can be characterized by the parity-even azimuthal correlator ($Δγ$) and parity-odd azimuthal harmonic observable ($Δa_{1}$). Measurements of $ΔP$, $Δγ$, and $Δa_{1}$ have not led to definitive conclusions concerning the CME or the magnetic field, and $Δn$ has not been measured previously. Correlations among these observables may reveal new insights. This paper reports measurements of correlation between $Δn$ and $Δa_{1}$, which is sensitive to chirality fluctuations, and correlation between $ΔP$ and $Δγ$ sensitive to magnetic field in Au+Au collisions at 27 GeV. For both measurements, no correlations have been observed beyond statistical fluctuations.
Submitted 22 July, 2023; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task
Authors:
Zihao Wu,
Lu Zhang,
Chao Cao,
Xiaowei Yu,
Haixing Dai,
Chong Ma,
Zhengliang Liu,
Lin Zhao,
Gang Li,
Wei Liu,
Quanzheng Li,
Dinggang Shen,
Xiang Li,
Dajiang Zhu,
Tianming Liu
Abstract:
Recently, ChatGPT and GPT-4 have emerged and gained immense global attention due to their unparalleled performance in language processing. Despite demonstrating impressive capability in various open-domain tasks, their adequacy in highly specific fields like radiology remains untested. Radiology presents unique linguistic phenomena distinct from open-domain data due to its specificity and complexity. Assessing the performance of large language models (LLMs) in such specific domains is crucial not only for a thorough evaluation of their overall performance but also for providing valuable insights into future model design directions: whether model design should be generic or domain-specific. To this end, in this study, we evaluate the performance of ChatGPT/GPT-4 on a radiology NLI task and compare it to other models fine-tuned specifically on task-related data samples. We also conduct a comprehensive investigation on ChatGPT/GPT-4's reasoning ability by introducing varying levels of inference difficulty. Our results show that 1) GPT-4 outperforms ChatGPT in the radiology NLI task; 2) other specifically fine-tuned models require significant amounts of data samples to achieve comparable performance to ChatGPT/GPT-4. These findings demonstrate that constructing a generic model that is capable of solving various tasks across different domains is feasible.
Submitted 18 April, 2023;
originally announced April 2023.
-
An Iterative Optimizing Framework for Radiology Report Summarization with ChatGPT
Authors:
Chong Ma,
Zihao Wu,
Jiaqi Wang,
Shaochen Xu,
Yaonai Wei,
Fang Zeng,
Zhengliang Liu,
Xi Jiang,
Lei Guo,
Xiaoyan Cai,
Shu Zhang,
Tuo Zhang,
Dajiang Zhu,
Dinggang Shen,
Tianming Liu,
Xiang Li
Abstract:
The 'Impression' section of a radiology report is a critical basis for communication between radiologists and other physicians, and it is typically written by radiologists based on the 'Findings' section. However, writing numerous impressions can be laborious and error-prone for radiologists. Although recent studies have achieved promising results in automatic impression generation using large-scale medical text data for pre-training and fine-tuning pre-trained language models, such models often require substantial amounts of medical text data and have poor generalization performance. While large language models (LLMs) like ChatGPT have shown strong generalization capabilities and performance, their performance in specific domains, such as radiology, remains under-investigated and potentially limited. To address this limitation, we propose ImpressionGPT, which leverages the in-context learning capability of LLMs by constructing dynamic contexts using domain-specific, individualized data. This dynamic prompt approach enables the model to learn contextual knowledge from semantically similar examples from existing data. Additionally, we design an iterative optimization algorithm that performs automatic evaluation on the generated impression results and composes the corresponding instruction prompts to further optimize the model. The proposed ImpressionGPT model achieves state-of-the-art performance on both MIMIC-CXR and OpenI datasets without requiring additional training data or fine-tuning the LLMs. This work presents a paradigm for localizing LLMs that can be applied in a wide range of similar application scenarios, bridging the gap between general-purpose LLMs and the specific language processing needs of various domains.
Submitted 8 May, 2024; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Construction of unbiased dental template and parametric dental model for precision digital dentistry
Authors:
Lei Ma,
**gyang Zhang,
Ke Deng,
Peng Xue,
Zhiming Cui,
Yu Fang,
Minhui Tang,
Yue Zhao,
Min Zhu,
Zhongxiang Ding,
Dinggang Shen
Abstract:
Dental templates and parametric dental models are important tools for various applications in digital dentistry. However, constructing an unbiased dental template and accurate parametric dental models remains a challenging task due to the complex anatomical and morphological dental structures and the low volume ratio of the teeth. In this study, we develop an unbiased dental template by constructing an accurate dental atlas from CBCT images with the guidance of teeth segmentation. First, to address these challenges, we propose to enhance the CBCT images and their segmentation images through image cropping, image masking and segmentation intensity reassigning. Then, we use the segmentation images to perform co-registration with the CBCT images to generate an accurate dental atlas, from which an unbiased dental template can be generated. By leveraging the unbiased dental template, we construct parametric dental models by estimating point-to-point correspondences between the dental models and employing Principal Component Analysis to determine the shape subspaces of the parametric dental models. A total of 159 CBCT images of real subjects are collected to perform the constructions. Experimental results demonstrate the effectiveness of our proposed method in constructing an unbiased dental template and parametric dental models. The developed dental template and parametric dental models are available at https://github.com/Marvin0724/Teeth_template.
Submitted 7 April, 2023;
originally announced April 2023.
-
Observation of the electromagnetic field effect via charge-dependent directed flow in heavy-ion collisions at the Relativistic Heavy Ion Collider
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
E. Alpatov,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (331 additional authors not shown)
Abstract:
The deconfined quark-gluon plasma (QGP) created in relativistic heavy-ion collisions enables the exploration of the fundamental properties of matter under extreme conditions. Non-central collisions can produce strong magnetic fields on the order of $10^{18}$ Gauss, which offers a probe into the electrical conductivity of the QGP. In particular, quarks and anti-quarks carry opposite charges and receive contrary electromagnetic forces that alter their momenta. This phenomenon can be manifested in the collective motion of final-state particles, specifically in the rapidity-odd directed flow, denoted as $v_1(\mathsf{y})$. Here we present the charge-dependent measurements of $dv_1/d\mathsf{y}$ near midrapidities for $π^{\pm}$, $K^{\pm}$, and $p(\bar{p})$ in Au+Au and isobar ($_{44}^{96}$Ru+$_{44}^{96}$Ru and $_{40}^{96}$Zr+$_{40}^{96}$Zr) collisions at $\sqrt{s_{\rm NN}}=$ 200 GeV, and in Au+Au collisions at 27 GeV, recorded by the STAR detector at the Relativistic Heavy Ion Collider. The combined dependence of the $v_1$ signal on collision system, particle species, and collision centrality can be qualitatively and semi-quantitatively understood as several effects on constituent quarks. While the results in central events can be explained by the $u$ and $d$ quarks transported from initial-state nuclei, those in peripheral events reveal the impacts of the electromagnetic field on the QGP. Our data put valuable constraints on the electrical conductivity of the QGP in theoretical calculations.
Submitted 22 February, 2024; v1 submitted 6 April, 2023;
originally announced April 2023.
-
Summary of ChatGPT-Related Research and Perspective Towards the Future of Large Language Models
Authors:
Yiheng Liu,
Tianle Han,
Siyuan Ma,
Jiayue Zhang,
Yuanyuan Yang,
Jiaming Tian,
Hao He,
Antong Li,
Mengshen He,
Zhengliang Liu,
Zihao Wu,
Lin Zhao,
Dajiang Zhu,
Xiang Li,
Ning Qiang,
Dingang Shen,
Tianming Liu,
Bao Ge
Abstract:
This paper presents a comprehensive survey of ChatGPT-related (GPT-3.5 and GPT-4) research, state-of-the-art large language models (LLMs) from the GPT series, and their prospective applications across diverse domains. Indeed, key innovations such as large-scale pre-training that captures knowledge across the entire world wide web, instruction fine-tuning and Reinforcement Learning from Human Feedback (RLHF) have played significant roles in enhancing LLMs' adaptability and performance. We performed an in-depth analysis of 194 relevant papers on arXiv, encompassing trend analysis, word cloud representation, and distribution analysis across various application domains. The findings reveal a significant and increasing interest in ChatGPT-related research, predominantly centered on direct natural language processing applications, while also demonstrating considerable potential in areas ranging from education and history to mathematics, medicine, and physics. This study endeavors to furnish insights into ChatGPT's capabilities, potential implications, and ethical concerns, and to offer direction for future advancements in this field.
Submitted 21 August, 2023; v1 submitted 4 April, 2023;
originally announced April 2023.
-
DoctorGLM: Fine-tuning your Chinese Doctor is not a Herculean Task
Authors:
Honglin Xiong,
Sheng Wang,
Yitao Zhu,
Zihao Zhao,
Yuxiao Liu,
Linlin Huang,
Qian Wang,
Dinggang Shen
Abstract:
The recent progress of large language models (LLMs), including ChatGPT and GPT-4, in comprehending and responding to human instructions has been remarkable. Nevertheless, these models typically perform better in English and have not been explicitly trained for the medical domain, resulting in suboptimal precision in diagnoses, drug recommendations, and other medical advice. Additionally, training and deploying a dialogue model is still believed to be impossible for hospitals, hindering the promotion of LLMs. To tackle these challenges, we have collected databases of medical dialogues in Chinese with ChatGPT's help and adopted several techniques to train an easy-to-deploy LLM. Remarkably, we were able to fine-tune ChatGLM-6B on a single A100 80G in 13 hours, which means having a healthcare-purpose LLM can be very affordable. DoctorGLM is currently an early-stage engineering attempt and contains various mistakes. We are sharing it with the broader community to invite feedback and suggestions to improve its healthcare-focused capabilities: https://github.com/xionghonglin/DoctorGLM.
Submitted 17 April, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.