-
Atomic-scale investigation of $γ$-Ga$_2$O$_3$ deposited on MgAl$_2$O$_4$ and its relationship with $β$-Ga$_2$O$_3$
Authors:
J. Tang,
K. Jiang,
C. Xu,
M. J. Cabral,
K. Xiao,
L. M. Porter,
R. F. Davis
Abstract:
Nominally phase-pure $γ$-$Ga_2O_3$ was deposited on (100) $MgAl_2O_4$ within a narrow temperature window centered at $\sim$470 $^{\circ}$C using metal-organic chemical vapor deposition (MOCVD). The film deposited at 440 $^{\circ}$C exhibited either poor crystallization or an amorphous structure; the film grown at 500 $^{\circ}$C contained both $β$-$Ga_2O_3$ and $γ$-$Ga_2O_3$. A nominally phase-pur…
▽ More
Nominally phase-pure $γ$-$Ga_2O_3$ was deposited on (100) $MgAl_2O_4$ within a narrow temperature window centered at $\sim$470 $^{\circ}$C using metal-organic chemical vapor deposition (MOCVD). The film deposited at 440 $^{\circ}$C exhibited either poor crystallization or an amorphous structure; the film grown at 500 $^{\circ}$C contained both $β$-$Ga_2O_3$ and $γ$-$Ga_2O_3$. A nominally phase-pure $β$-$Ga_2O_3$ film was obtained at 530 $^{\circ}$C. Atomic-resolution scanning transmission electron microscopy (STEM) investigations of the $γ$-$Ga_2O_3$ film grown at 470 $^{\circ}$C revealed a high density of antiphase boundaries. A planar defect model developed for $γ$-$Al_2O_3$ was extended to explain the stacking sequences of the Ga sublattice observed in the STEM images of $γ$-$Ga_2O_3$. The presence of the 180$^{\circ}$ rotational domains and 90$^{\circ}$ rotational domains of $β$-$Ga_2O_3$ inclusions within the $γ$-$Ga_2O_3$ matrix is discussed within the context of a comprehensive investigation of the epitaxial relationship between those two phases in the as-grown film at 470 $^{\circ}$C and the same film annealed at 600 $^{\circ}$C. The results led to the hypotheses that (i) incorporation of certain dopants including Si, Ge, Sn, Mg, Al, and Sc, into $β$-$Ga_2O_3$, locally stabilizes the "$γ$-phase" and (ii) the site preference(s) for these dopants promotes the formation of the "$γ$-phase" and/or $γ$-$Ga_2O_3$ solid solutions. However, in the absence of such dopants, pure $γ$-$Ga_2O_3$ remains the least stable $Ga_2O_3$ polymorph, as indicated by its very narrow growth window, lower growth temperatures relative to other $Ga_2O_3$ polymorphs, and the largest calculated difference in Helmholtz free energy per formula unit between $γ$-$Ga_2O_3$ and $β$-$Ga_2O_3$ than all other polymorphs.
△ Less
Submitted 20 October, 2023; v1 submitted 19 October, 2023;
originally announced October 2023.
-
When the atoms dance: exploring mechanisms of electron-beam induced modifications of materials with machine-learning assisted high temporal resolution electron microscopy
Authors:
Matthew G. Boebinger,
Ayana Ghosh,
Kevin M. Roccapriore,
Sudhajit Misra,
Kai Xiao,
Stephen Jesse,
Maxim Ziatdinov,
Sergei V. Kalinin,
Raymond R. Unocic
Abstract:
Directed atomic fabrication using an aberration-corrected scanning transmission electron microscope (STEM) opens new pathways for atomic engineering of functional materials. In this approach, the electron beam is used to actively alter the atomic structure through electron beam induced irradiation processes. One of the impediments that has limited widespread use thus far has been the ability to un…
▽ More
Directed atomic fabrication using an aberration-corrected scanning transmission electron microscope (STEM) opens new pathways for atomic engineering of functional materials. In this approach, the electron beam is used to actively alter the atomic structure through electron beam induced irradiation processes. One of the impediments that has limited widespread use thus far has been the ability to understand the fundamental mechanisms of atomic transformation pathways at high spatiotemporal resolution. Here, we develop a workflow for obtaining and analyzing high-speed spiral scan STEM data, up to 100 fps, to track the atomic fabrication process during nanopore milling in monolayer MoS2. An automated feedback-controlled electron beam positioning system combined with deep convolution neural network (DCNN) was used to decipher fast but low signal-to-noise datasets and classify time-resolved atom positions and nature of their evolving atomic defect configurations. Through this automated decoding, the initial atomic disordering and reordering processes leading to nanopore formation was able to be studied across various timescales. Using these experimental workflows a greater degree of speed and information can be extracted from small datasets without compromising spatial resolution. This approach can be adapted to other 2D materials systems to gain further insights into the defect formation necessary to inform future automated fabrication techniques utilizing the STEM electron beam.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
Machine Learning for Automated Mitral Regurgitation Detection from Cardiac Imaging
Authors:
Ke Xiao,
Erik Learned-Miller,
Evangelos Kalogerakis,
James Priest,
Madalina Fiterau
Abstract:
Mitral regurgitation (MR) is a heart valve disease with potentially fatal consequences that can only be forestalled through timely diagnosis and treatment. Traditional diagnosis methods are expensive, labor-intensive and require clinical expertise, posing a barrier to screening for MR. To overcome this impediment, we propose a new semi-supervised model for MR classification called CUSSP. CUSSP ope…
▽ More
Mitral regurgitation (MR) is a heart valve disease with potentially fatal consequences that can only be forestalled through timely diagnosis and treatment. Traditional diagnosis methods are expensive, labor-intensive and require clinical expertise, posing a barrier to screening for MR. To overcome this impediment, we propose a new semi-supervised model for MR classification called CUSSP. CUSSP operates on cardiac imaging slices of the 4-chamber view of the heart. It uses standard computer vision techniques and contrastive models to learn from large amounts of unlabeled data, in conjunction with specialized classifiers to establish the first ever automated MR classification system. Evaluated on a test set of 179 labeled -- 154 non-MR and 25 MR -- sequences, CUSSP attains an F1 score of 0.69 and a ROC-AUC score of 0.88, setting the first benchmark result for this new task.
△ Less
Submitted 7 October, 2023;
originally announced October 2023.
-
Coulomb potential screening via charged carriers and charge-neutral dipoles/excitons in two-dimensional case
Authors:
Ke Xiao,
Chiming Kan,
Xiaodong Cui
Abstract:
With the shrink of dimensionality, Coulomb interaction displays a distinct role owing to the reduced dielectric screening in out-of-plane direction. Apart from the dielectric screening, the free charge carriers and/or dipoles can also make nonnegligible contribution to Coulomb interaction. While the Thomas Fermi model is effective in describing charge carrier screening in three dimensions, the ext…
▽ More
With the shrink of dimensionality, Coulomb interaction displays a distinct role owing to the reduced dielectric screening in out-of-plane direction. Apart from the dielectric screening, the free charge carriers and/or dipoles can also make nonnegligible contribution to Coulomb interaction. While the Thomas Fermi model is effective in describing charge carrier screening in three dimensions, the extent of screening to two-dimension resulting from charge-neutral dipoles and carriers remains quantitatively unclear. To address this gap, we present a simple analytical solution based on linear response theory, offering a comprehensive depiction of the Coulomb screened potential in both 2D and 3D systems, where screening effects from both charge carriers and charge-neutral dipoles are addressed. Our work provides a handy tool for directly analysing and evaluating Coulomb interaction strength in atomically thin materials and particularly in the context of electronic and optoelectronic engineering. As a demonstration, we utilize the derived modified Coulomb potential for the exciton system to estimate the exciton binding energy variation arising from exciton density fluctuation and the temperature dependent exciton polarizability, yielding excellent agreement with the experimental and computational findings.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
S-PLUS: Photometric Re-calibration with the Stellar Color Regression Method and an Improved Gaia XP Synthetic Photometry Method
Authors:
Kai Xiao,
Yang Huang,
Haibo Yuan,
Timothy C. Beers,
Bowen Huang,
Shuai Xu,
Lin Yang,
Felipe Almeida-Fernandes,
Helio D. Perottoni,
Guilherme Limberg,
William Schoenell,
Tiago Ribeiro,
Antonio Kanaan,
Natanael Gomes de Olivira
Abstract:
We present a comprehensive re-calibration of medium- and broad-band photometry from the Southern Photometric Local Universe Survey (S-PLUS) by leveraging two approaches: an improved Gaia XP Synthetic Photometry (XPSP) method with corrected Gaia XP spectra, the Stellar Color Regression (SCR) method with corrected Gaia EDR3 photometric data and spectroscopic data from LAMOST DR7. Through the use of…
▽ More
We present a comprehensive re-calibration of medium- and broad-band photometry from the Southern Photometric Local Universe Survey (S-PLUS) by leveraging two approaches: an improved Gaia XP Synthetic Photometry (XPSP) method with corrected Gaia XP spectra, the Stellar Color Regression (SCR) method with corrected Gaia EDR3 photometric data and spectroscopic data from LAMOST DR7. Through the use of millions of stars as standards per band, we demonstrate the existence of position-dependent systematic errors, up to 23 mmag for the Main Survey region, in the S-PLUS DR4 photometric data. A comparison between the XPSP and SCR methods reveals minor differences in zero-point offsets, typically within the range of 1 to 6 mmag, indicating the accuracy of the re-calibration, and a two- to three-fold improvement in the zero-point precision. During this process, we also verified and corrected for the systematic errors related to CCD position. The corrected S-PLUS DR4 photometric data will provide a solid data foundation for conducting scientific research that relies on high-calibration precision. Our results underscore the power of the XPSP method in combination with the SCR method, showcasing their effectiveness in enhancing calibration precision for wide-field surveys when combined with Gaia photometry and XP spectra, to be applied for other S-PLUS sub-surveys.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
J-PLUS: Photometric Re-calibration with the Stellar Color Regression Method and an Improved Gaia XP Synthetic Photometry Method
Authors:
Kai Xiao,
Haibo Yuan,
C. Lopez-Sanjuan,
Yang Huang,
Bowen Huang,
Timothy C. Beers,
Shuai Xu,
Yuanchang Wang,
Lin Yang,
J. Alcaniz,
Carlos Andrés Galarza,
R. E. Angulo,
A. J. Cenarro,
D. Cristobal-Hornillos,
R. A. Dupke,
A. Ederoclite,
C. Hernandez-Monteagudo,
A. Marn-Franch,
M. Moles,
L. Sodre Jr.,
H. Vazquez Ramio,
J. Varela
Abstract:
We employ the corrected Gaia Early Data Release 3 (EDR3) photometric data and spectroscopic data from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) DR7 to assemble a sample of approximately 0.25 million FGK dwarf photometric standard stars for the 12 J-PLUS filters using the Stellar Color Regression (SCR) method. We then independently validated the J-PLUS DR3 photometry, a…
▽ More
We employ the corrected Gaia Early Data Release 3 (EDR3) photometric data and spectroscopic data from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) DR7 to assemble a sample of approximately 0.25 million FGK dwarf photometric standard stars for the 12 J-PLUS filters using the Stellar Color Regression (SCR) method. We then independently validated the J-PLUS DR3 photometry, and uncovered significant systematic errors: up to 15 mmag in the results of Stellar Locus (SL) method, and up to 10 mmag mainly caused by magnitude-, color-, and extinction-dependent errors of the Gaia XP spectra with the Gaia BP/RP (XP) Synthetic Photometry (XPSP) method. We have also further developed the XPSP method using the corrected Gaia XP spectra by Huang et al. (2023) and applied it to the J-PLUS DR3 photometry. This resulted in an agreement of 1-5 mmag with the SCR method, and a two-fold improvement in the J-PLUS zero-point precision. Finally, the zero-point calibration for around 91% of the tiles within the LAMOST observation footprint is determined through the SCR method, with the remaining approximately 9% of tiles outside this footprint relying on the improved XPSP method. The re-calibrated J-PLUS DR3 photometric data establishes a solid data foundation for conducting research that depends on high-precision photometric calibration.
△ Less
Submitted 22 October, 2023; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Exciton-exciton Interaction in Monolayer MoSe$_2$ from Mutual Screening of Coulomb Binding
Authors:
Ke Xiao,
Tengfei Yan,
Chengxin Xiao,
Feng-ren Fan,
Ruihuan Duan,
Zheng Liu,
Kenji Watanabe,
Takashi Taniguchi,
Wang Yao,
Xiaodong Cui
Abstract:
The potential for low-threshold optical nonlinearity has received significant attention in the fields of photonics and conceptual optical neuron networks. Excitons in two-dimensional (2D) semiconductors are particularly promising in this regard as reduced screening and dimensional confinement foster their pronounced many-body interactions towards nonlinearity. However, experimental determination o…
▽ More
The potential for low-threshold optical nonlinearity has received significant attention in the fields of photonics and conceptual optical neuron networks. Excitons in two-dimensional (2D) semiconductors are particularly promising in this regard as reduced screening and dimensional confinement foster their pronounced many-body interactions towards nonlinearity. However, experimental determination of the interactions remains ambiguous, as optical pum** in general creates a mixture of excitons and unbound carriers, where the impacts of band gap renormalization and carrier screening on exciton energy counteract each other. Here by comparing the influences on exciton ground and excited states energies in the photoluminescence spectroscopy of monolayer MoSe$_2$, we are able to identify separately the screening of Coulomb binding by the neutral excitons and by charge carriers. The energy difference between exciton ground state (A-1s) and excited state (A-2s) red-shifts by 5.5 meV when the neutral exciton density increases from 0 to $4\times 10^{11}$ cm$^{-2}$, in contrast to the blue shifts with the increase of either electron or hole density. This energy difference change is attributed to the mutual screening of Coulomb binding of neutral excitons, from which we extract an exciton polarizability of $α_{2D}^{\rm exciton} = 2.55\times 10^{-17}$ eV(m/V)$^2$. Our finding uncovers a new mechanism that dominates the repulsive part of many-body interaction between neutral excitons.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Autonomous synthesis of thin film materials with pulsed laser deposition enabled by in situ spectroscopy and automation
Authors:
Sumner B. Harris,
Arpan Biswas,
Seok Joon Yun,
Christopher M. Rouleau,
Alexander A. Puretzky,
Rama K. Vasudevan,
David B. Geohegan,
Kai Xiao
Abstract:
Synthesis of thin films has traditionally relied upon slow, sequential processes carried out with substantial human intervention, frequently utilizing a mix of experience and serendipity to optimize material structure and properties. With recent advances in autonomous systems which combine synthesis, characterization, and decision making with artificial intelligence (AI), large parameter spaces ca…
▽ More
Synthesis of thin films has traditionally relied upon slow, sequential processes carried out with substantial human intervention, frequently utilizing a mix of experience and serendipity to optimize material structure and properties. With recent advances in autonomous systems which combine synthesis, characterization, and decision making with artificial intelligence (AI), large parameter spaces can be explored autonomously at rates beyond what is possible by human experimentalists, greatly accelerating discovery, optimization, and understanding in materials synthesis which directly address the grand challenges in synthesis science. Here, we demonstrate autonomous synthesis of a contemporary 2D material by combining the highly versatile pulsed laser deposition (PLD) technique with automation and machine learning (ML). We incorporated in situ and real-time spectroscopy, a high-throughput methodology, and cloud connectivity to enable autonomous synthesis workflows with PLD. Ultrathin WSe2 films were grown using co-ablation of two targets and showed a 10x increase in throughput over traditional PLD workflows. Gaussian process regression and Bayesian optimization were used with in situ Raman spectroscopy to autonomously discover two distinct growth windows and the process-property relationship after sampling only 0.25% of a large 4D parameter space. Any material that can be grown with PLD could be autonomously synthesized with our platform and workflows, enabling accelerated discovery and optimization of a vast number of materials.
△ Less
Submitted 16 August, 2023;
originally announced August 2023.
-
Improvement of the Pan-STARRS Photometric Calibration with LAMOST and Gaia
Authors:
Kai Xiao,
Haibo Yuan,
Bowen Huang,
Ruoyi Zhang,
Lin Yang,
Shuai Xu
Abstract:
In this work, we perform the re-calibration of PS1 photometry by correcting for position-dependent systematic errors using the spectroscopy-based Stellar Color Regression method (SCR), the photometry-based SCR method (SCR$'$), and the Gaia XP synthetic photometry method (XPSP). We confirm the significant large-scale and small-scale spatial variation of magnitude offsets for all the $grizy$ filters…
▽ More
In this work, we perform the re-calibration of PS1 photometry by correcting for position-dependent systematic errors using the spectroscopy-based Stellar Color Regression method (SCR), the photometry-based SCR method (SCR$'$), and the Gaia XP synthetic photometry method (XPSP). We confirm the significant large-scale and small-scale spatial variation of magnitude offsets for all the $grizy$ filters. We show that the PS1 photometric calibration precisions in the $grizy$ filters are around 5--7\,mmag when averaged over 14$'$ regions. We note a much larger calibration error up to 0.04 mag in the Galactic plane, which is probably caused by the systematic errors of the PS1 magnitudes in crowded fields. The results of the three methods are consistent with each other within 1--2\,mmag or better for all the filters. We provide two-dimensional maps and a python package ({\url{https://doi.org/10.12149/101283}}) to correct for position-dependent magnitude offsets of PS1, which can be used for high-precision investigations and as a reference to calibrate other surveys.
△ Less
Submitted 10 August, 2023;
originally announced August 2023.
-
Framework for additive manufacturing of porous Inconel 718 for electrochemical applications
Authors:
Ahmad Zafari,
Kiran Kiran,
Inmaculada Gimenez-Garcia,
Antoni Forner-Cuenca,
Kenong Xia,
Ian Gibson,
Davoud Jafari
Abstract:
Porous electrodes were developed using laser powder bed fusion of Inconel 718 lattice structures and electrodeposition of a porous nickel catalytic layer. Laser energy densities of ~83-333 J/m were used to fabricate ~500 um thick electrodes made of body centered cubic unit cells of 200-500 um and strut thicknesses of 100-200 um. Unit cells of 500 um and strut thickness of 200 um were identified as…
▽ More
Porous electrodes were developed using laser powder bed fusion of Inconel 718 lattice structures and electrodeposition of a porous nickel catalytic layer. Laser energy densities of ~83-333 J/m were used to fabricate ~500 um thick electrodes made of body centered cubic unit cells of 200-500 um and strut thicknesses of 100-200 um. Unit cells of 500 um and strut thickness of 200 um were identified as optimum. Despite small changes in feature sizes by the energy input, the porosity of >50% and pore size of ~100 um did not change. In a subsequent step, we used nickel electrodeposition to create smaller scale pores on the electrode. The electrochemical performance of the electrodes for hydrogen/oxygen evolution reaction (HER/OER) was evaluated in a three-electrode setup. For HER, a much larger maximum current density of ~ -372 mA/cm2 at a less negative potential of ~-0.4 V vs RHE (potential against reversible hydrogen electrode) was obtained in the nickel-coated samples, as compared to -240 mA/cm2 at ~-0.6 V in the bare one, indicating superior performance of the coated sample. Conversely, OER exhibited minor performance differences upon application of the coating, indicating insignificant dependence of OER to surface composition and available surface.
△ Less
Submitted 14 January, 2024; v1 submitted 7 August, 2023;
originally announced August 2023.
-
Flow states and heat transport in liquid metal convection
Authors:
Lei Ren,
Xin Tao,
Lu Zhang,
Ming-Jiu Ni,
Ke-Qing Xia,
Yi-Chao Xie
Abstract:
We present an experimental study of Rayleigh-Bénard convection using liquid metal alloy gallium-indium-tin as the working fluid with a Prandtl number of $Pr=0.029$. The flow state and the heat transport were measured in a Rayleigh number range of $1.2\times10^{4} \le Ra \le 1.3\times10^{7}$. The temperature fluctuation at the cell centre is used as a proxy for the flow state. It is found that, as…
▽ More
We present an experimental study of Rayleigh-Bénard convection using liquid metal alloy gallium-indium-tin as the working fluid with a Prandtl number of $Pr=0.029$. The flow state and the heat transport were measured in a Rayleigh number range of $1.2\times10^{4} \le Ra \le 1.3\times10^{7}$. The temperature fluctuation at the cell centre is used as a proxy for the flow state. It is found that, as $Ra$ increases from the lower end of the parameter range, the flow evolves from a convection state to an oscillation state, a chaotic state, and finally a turbulent state for $Ra>10^5$. The study suggests that the large-scale circulation in the turbulent state is a residual of the cell structures near the onset of convection, which is in contrast with the case of $Pr\sim1$, where the cell structure is replaced by high-order flow modes transiently before the emergence of the large-scale circulation in the turbulent state. The evolution of the flow state is also reflected by the heat transport characterised by the Nusselt number $Nu$ and the probability density function (PDF) of the temperature fluctuation at the cell centre. It is found that the effective local heat transport scaling exponent $γ$, i.e., $Nu\sim Ra^γ$, changes continuously from $γ=0.49$ at $Ra\sim 10^4$ to $γ=0.25$ for $Ra>10^6$. Meanwhile, the PDF at the cell centre gradually evolves from a Gaussian-like shape before the transition to turbulence to an exponential-like shape in the turbulent state. For $Ra>10^6$, the flow shows self-similar behaviour, which is revealed by the universal shape of the PDF of the temperature fluctuation at the cell centre and a $Nu=0.19Ra^{0.25}$ scaling for the heat transport.
△ Less
Submitted 29 July, 2023;
originally announced July 2023.
-
Photometric calibration of the Stellar Abundance and Galactic Evolution Survey (SAGES): Nanshan One-meter Wide-field Telescope g, r, and i band imaging data
Authors:
Kai Xiao,
Haibo Yuan,
Bowen Huang,
Shuai Xu,
Jie Zheng,
Chun Li,
Zhou Fan,
Wei Wang,
Gang Zhao,
Guojie Feng,
Xuan Zhang,
**zhong Liu,
Ruoyi Zhang,
Lin Yang,
Yu Zhang,
Chunhai Bai,
Hubiao Niu,
Esamdin Ali,
Lu Ma
Abstract:
In this paper, a total of approximately 2.6 million dwarfs were constructed as standard stars, with an accuracy of about 0.01-0.02 mag for each band, by combining spectroscopic data from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope Data Release 7, photometric data from the corrected Gaia Early Data Release 3, and photometric metallicities. Using the spectroscopy based stellar colo…
▽ More
In this paper, a total of approximately 2.6 million dwarfs were constructed as standard stars, with an accuracy of about 0.01-0.02 mag for each band, by combining spectroscopic data from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope Data Release 7, photometric data from the corrected Gaia Early Data Release 3, and photometric metallicities. Using the spectroscopy based stellar color regression method (SCR method) and the photometric-based SCR method (SCR' method), we performed the relative calibration of the Nanshan One-meter Wide-field Telescope imaging data. Based on the corrected Pan-STARRS DR1 photometry, the absolute calibration was also performed. In the photometric calibration process, we analyzed the dependence of the calibration zero points on different images (observation time), different gates of the CCD detector, and different CCD positions. We found that the stellar flat and the relative gain between different gates depend on time. The amplitude of gain variation in three channels is approximately 0.5%-0.7% relative to the other channel, with a maximum value of 4%. In addition, significant spatial variations of the stellar flat fitting residual are found and corrected. Using repeated sources in the adjacent images, we checked and discovered internal consistency of about 1-2 mmag in all the filters. Using the PS1 magnitudes synthesized by Gaia DR3 BP/RP spectra by the synthetic photometry method, we found that the photometric calibration uniformity is about 1-2 mmag for all the bands, at a spatial resolution of 1.3 degree. A detailed comparison between the spectroscopy-based SCR and photometric-based SCR method magnitude offsets was performed, and we achieved an internal consistency precision of about 2 mmag or better with resolutions of 1.3 degree for all the filters. Which is mainly from the position-dependent errors of the E(B-V) used in SCR' method.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Anisotropic in-plane heat transport of Kitaev magnet Na$_2$Co$_2$TeO$_6$
Authors:
Shuangkui Guang,
Na Li,
Qing Huang,
Ke Xia,
Yiyan Wang,
Hui Liang,
Yan Sun,
Qiuju Li,
Xia Zhao,
Rui Leonard Luo,
Gang Chen,
Haidong Zhou,
Xuefeng Sun
Abstract:
We report a study on low-temperature heat transport of Kitaev magnet Na$_2$Co$_2$TeO$_6$, with the heat current and magnetic fields along the honeycomb spin layer (the $ab$ plane). The zero-field thermal conductivity of $κ^a_{xx}$ and $κ^{a*}_{xx}$ display similar temperature dependence and small difference in their magnitudes; whereas, their magnetic field (parallel to the heat current) dependenc…
▽ More
We report a study on low-temperature heat transport of Kitaev magnet Na$_2$Co$_2$TeO$_6$, with the heat current and magnetic fields along the honeycomb spin layer (the $ab$ plane). The zero-field thermal conductivity of $κ^a_{xx}$ and $κ^{a*}_{xx}$ display similar temperature dependence and small difference in their magnitudes; whereas, their magnetic field (parallel to the heat current) dependence are quite different and are related to the field-induced magnetic transitions. The $κ^a_{xx}(B)$ data for $B \parallel a$ at very low temperatures have an anomaly at 10.25--10.5 T, which reveals an unexplored magnetic transition. The planar thermal Hall conductivity $κ^a_{xy}$ and $κ^{a*}_{xy}$ show very weak signals at low fields and rather large values with sign change at high fields. This may point to a possible magnetic structure transition or the change of the magnon band topology that induces a radical change of magnon Berry curvature distribution before entering the spin polarized state. These results put clear constraints on the high-field phase and the theoretical models for Na$_2$Co$_2$TeO$_6$.
△ Less
Submitted 12 July, 2023;
originally announced July 2023.
-
RLTF: Reinforcement Learning from Unit Test Feedback
Authors:
Jiate Liu,
Yiqin Zhu,
Kaiwen Xiao,
Qiang Fu,
Xiao Han,
Wei Yang,
Deheng Ye
Abstract:
The goal of program synthesis, or code generation, is to generate executable code based on given descriptions. Recently, there has been an increasing number of studies employing reinforcement learning (RL) to improve the performance of large language models (LLMs) for code. However, current representative works either rely solely on offline frameworks, limiting the exploration of new sample spaces…
▽ More
The goal of program synthesis, or code generation, is to generate executable code based on given descriptions. Recently, there has been an increasing number of studies employing reinforcement learning (RL) to improve the performance of large language models (LLMs) for code. However, current representative works either rely solely on offline frameworks, limiting the exploration of new sample spaces, or fall short in the utilization of unit test signals, not accounting for specific error locations within the code. To address these issues, we propose RLTF, i.e., Reinforcement Learning from Unit Test Feedback, a novel online RL framework with unit test feedback of multi-granularity for refining code LLMs. Our approach generates data in real-time during training and simultaneously utilizes fine-grained feedback signals to guide the model towards producing higher-quality code. Extensive experiments show that RLTF achieves state-of-the-art performance on the APPS and the MBPP benchmarks. Our code is available at: https://github.com/Zyq-scut/RLTF.
△ Less
Submitted 12 November, 2023; v1 submitted 10 July, 2023;
originally announced July 2023.
-
Token-Event-Role Structure-based Multi-Channel Document-Level Event Extraction
Authors:
Qizhi Wan,
Changxuan Wan,
Keli Xiao,
Hui Xiong,
Dexi Liu,
Xi** Liu
Abstract:
Document-level event extraction is a long-standing challenging information retrieval problem involving a sequence of sub-tasks: entity extraction, event type judgment, and event type-specific multi-event extraction. However, addressing the problem as multiple learning tasks leads to increased model complexity. Also, existing methods insufficiently utilize the correlation of entities crossing diffe…
▽ More
Document-level event extraction is a long-standing challenging information retrieval problem involving a sequence of sub-tasks: entity extraction, event type judgment, and event type-specific multi-event extraction. However, addressing the problem as multiple learning tasks leads to increased model complexity. Also, existing methods insufficiently utilize the correlation of entities crossing different events, resulting in limited event extraction performance. This paper introduces a novel framework for document-level event extraction, incorporating a new data structure called token-event-role and a multi-channel argument role prediction module. The proposed data structure enables our model to uncover the primary role of tokens in multiple events, facilitating a more comprehensive understanding of event relationships. By leveraging the multi-channel prediction module, we transform entity and multi-event extraction into a single task of predicting token-event pairs, thereby reducing the overall parameter size and enhancing model efficiency. The results demonstrate that our approach outperforms the state-of-the-art method by 9.5 percentage points in terms of the F1 score, highlighting its superior performance in event extraction. Furthermore, an ablation study confirms the significant value of the proposed data structure in improving event extraction tasks, further validating its importance in enhancing the overall performance of the framework.
△ Less
Submitted 30 June, 2023;
originally announced June 2023.
-
Molecular geometric deep learning
Authors:
Cong Shen,
Jiawei Luo,
Kelin Xia
Abstract:
Geometric deep learning (GDL) has demonstrated huge power and enormous potential in molecular data analysis. However, a great challenge still remains for highly efficient molecular representations. Currently, covalent-bond-based molecular graphs are the de facto standard for representing molecular topology at the atomic level. Here we demonstrate, for the first time, that molecular graphs construc…
▽ More
Geometric deep learning (GDL) has demonstrated huge power and enormous potential in molecular data analysis. However, a great challenge still remains for highly efficient molecular representations. Currently, covalent-bond-based molecular graphs are the de facto standard for representing molecular topology at the atomic level. Here we demonstrate, for the first time, that molecular graphs constructed only from non-covalent bonds can achieve similar or even better results than covalent-bond-based models in molecular property prediction. This demonstrates the great potential of novel molecular representations beyond the de facto standard of covalent-bond-based molecular graphs. Based on the finding, we propose molecular geometric deep learning (Mol-GDL). The essential idea is to incorporate a more general molecular representation into GDL models. In our Mol-GDL, molecular topology is modeled as a series of molecular graphs, each focusing on a different scale of atomic interactions. In this way, both covalent interactions and non-covalent interactions are incorporated into the molecular representation on an equal footing. We systematically test Mol-GDL on fourteen commonly-used benchmark datasets. The results show that our Mol-GDL can achieve a better performance than state-of-the-art (SOTA) methods. Source code and data are available at https://github.com/CS-BIO/Mol-GDL.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
Curvature-enhanced Graph Convolutional Network for Biomolecular Interaction Prediction
Authors:
Cong Shen,
**jian Ding,
Junjie Wee,
Jialin Bi,
Jiawei Luo,
Kelin Xia
Abstract:
Geometric deep learning has demonstrated a great potential in non-Euclidean data analysis. The incorporation of geometric insights into learning architecture is vital to its success. Here we propose a curvature-enhanced graph convolutional network (CGCN) for biomolecular interaction prediction, for the first time. Our CGCN employs Ollivier-Ricci curvature (ORC) to characterize network local struct…
▽ More
Geometric deep learning has demonstrated a great potential in non-Euclidean data analysis. The incorporation of geometric insights into learning architecture is vital to its success. Here we propose a curvature-enhanced graph convolutional network (CGCN) for biomolecular interaction prediction, for the first time. Our CGCN employs Ollivier-Ricci curvature (ORC) to characterize network local structures and to enhance the learning capability of GCNs. More specifically, ORCs are evaluated based on the local topology from node neighborhoods, and further used as weights for the feature aggregation in message-passing procedure. Our CGCN model is extensively validated on fourteen real-world bimolecular interaction networks and a series of simulated data. It has been found that our CGCN can achieve the state-of-the-art results. It outperforms all existing models, as far as we know, in thirteen out of the fourteen real-world datasets and ranks as the second in the rest one. The results from the simulated data show that our CGCN model is superior to the traditional GCN models regardless of the positive-to-negativecurvature ratios, network densities, and network sizes (when larger than 500).
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
Torsion Graph Neural Networks
Authors:
Cong Shen,
Xiang Liu,
Jiawei Luo,
Kelin Xia
Abstract:
Geometric deep learning (GDL) models have demonstrated a great potential for the analysis of non-Euclidian data. They are developed to incorporate the geometric and topological information of non-Euclidian data into the end-to-end deep learning architectures. Motivated by the recent success of discrete Ricci curvature in graph neural network (GNNs), we propose TorGNN, an analytic Torsion enhanced…
▽ More
Geometric deep learning (GDL) models have demonstrated a great potential for the analysis of non-Euclidian data. They are developed to incorporate the geometric and topological information of non-Euclidian data into the end-to-end deep learning architectures. Motivated by the recent success of discrete Ricci curvature in graph neural network (GNNs), we propose TorGNN, an analytic Torsion enhanced Graph Neural Network model. The essential idea is to characterize graph local structures with an analytic torsion based weight formula. Mathematically, analytic torsion is a topological invariant that can distinguish spaces which are homotopy equivalent but not homeomorphic. In our TorGNN, for each edge, a corresponding local simplicial complex is identified, then the analytic torsion (for this local simplicial complex) is calculated, and further used as a weight (for this edge) in message-passing process. Our TorGNN model is validated on link prediction tasks from sixteen different types of networks and node classification tasks from three types of networks. It has been found that our TorGNN can achieve superior performance on both tasks, and outperform various state-of-the-art models. This demonstrates that analytic torsion is a highly efficient topological invariant in the characterization of graph structures and can significantly boost the performance of GNNs.
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
Non-Hermitian Topological Magnonics
Authors:
Tao Yu,
Ji Zou,
Bowen Zeng,
J. W. Rao,
Ke Xia
Abstract:
Dissipation in mechanics, optics, acoustics, and electronic circuits is nowadays recognized to be not always detrimental but can be exploited to achieve non-Hermitian topological phases or properties with functionalities for potential device applications. As elementary excitations of ordered magnetic moments that exist in various magnetic materials, magnons are the information carriers in magnonic…
▽ More
Dissipation in mechanics, optics, acoustics, and electronic circuits is nowadays recognized to be not always detrimental but can be exploited to achieve non-Hermitian topological phases or properties with functionalities for potential device applications. As elementary excitations of ordered magnetic moments that exist in various magnetic materials, magnons are the information carriers in magnonic devices with low-energy consumption for reprogrammable logic, non-reciprocal communication, and non-volatile memory functionalities. Non-Hermitian topological magnonics deals with the engineering of dissipation and/or gain for non-Hermitian topological phases or properties in magnets that are not achievable in the conventional Hermitian scenario, with associated functionalities cross-fertilized with their electronic, acoustic, optic, and mechanic counterparts, such as giant enhancement of magnonic frequency combs, magnon amplification, (quantum) sensing of the magnetic field with unprecedented sensitivity, magnon accumulation, and perfect absorption of microwaves. In this review article, we address the unified approach in constructing magnonic non-Hermitian Hamiltonian, introduce the basic non-Hermitian topological physics, and provide a comprehensive overview of the recent theoretical and experimental progress towards achieving distinct non-Hermitian topological phases or properties in magnonic devices, including exceptional points, exceptional nodal phases, non-Hermitian magnonic SSH model, and non-Hermitian skin effect. We emphasize the non-Hermitian Hamiltonian approach based on the Lindbladian or self-energy of the magnonic subsystem but address the physics beyond it as well, such as the crucial quantum jump effect in the quantum regime and non-Markovian dynamics. We provide a perspective for future opportunities and challenges before concluding this article.
△ Less
Submitted 9 November, 2023; v1 submitted 7 June, 2023;
originally announced June 2023.
-
Effects of sequential decay on collective flows and nuclear stop** power in heavy-ion collisions at intermediate energies
Authors:
Kui Xiao,
PengCheng Li,
YongJia Wang,
FuHu Liu,
QingFeng Li
Abstract:
In this study, the rapidity distribution, collective flows, and nuclear stop** power in $^{197}\mathrm{Au}+^{197}\mathrm{Au}$ collisions at intermediate energies were investigated using the ultrarelativistic quantum molecular dynamics (UrQMD) model with GEMINI++ code. The UrQMD model was adopted to simulate the dynamic evolution of heavy-ion collisions, whereas the GEMINI++ code was used to simu…
▽ More
In this study, the rapidity distribution, collective flows, and nuclear stop** power in $^{197}\mathrm{Au}+^{197}\mathrm{Au}$ collisions at intermediate energies were investigated using the ultrarelativistic quantum molecular dynamics (UrQMD) model with GEMINI++ code. The UrQMD model was adopted to simulate the dynamic evolution of heavy-ion collisions, whereas the GEMINI++ code was used to simulate the decay of primary fragments produced by UrQMD. The calculated results were compared with the INDRA and FOPI experimental data. It was found that the rapidity distribution, collective flows, and nuclear stop** power were affected to a certain extent by the decay of primary fragments, especially at lower beam energies. Furthermore, the experimental data of the collective flows and nuclear stop** power at the investigated beam energies were better reproduced when the sequential decay effect was included.
△ Less
Submitted 1 June, 2023;
originally announced June 2023.
-
Vortex Dynamics in Rotating Rayleigh-Bénard Convection
Authors:
Shan-Shan Ding,
Guang-Yu Ding,
Kai Leong Chong,
Wen-Tao Wu,
Ke-Qing Xia,
**-Qiang Zhong
Abstract:
We investigate the spatial distribution and dynamics of the vortices in rotating Rayleigh-Bénard convection in a reduced Rayleigh-number range $1.3{\le}Ra/Ra_{c}{\le}166$. Under slow rotations ($Ra{\gtrsim}10Ra_{c}$), the vortices are randomly distributed. The size-distribution of the Voronoi cells of the vortex centers is well described by the standard $Γ$ distribution. In this flow regime the vo…
▽ More
We investigate the spatial distribution and dynamics of the vortices in rotating Rayleigh-Bénard convection in a reduced Rayleigh-number range $1.3{\le}Ra/Ra_{c}{\le}166$. Under slow rotations ($Ra{\gtrsim}10Ra_{c}$), the vortices are randomly distributed. The size-distribution of the Voronoi cells of the vortex centers is well described by the standard $Γ$ distribution. In this flow regime the vortices exhibit Brownian-type horizontal motion. The probability density functions of the vortex displacements are, however, non-Gaussian at short time scales. At modest rotating rates ($4Ra_{c}{\le}Ra{\lesssim}10Ra_{c}$) the centrifugal force leads to radial vortex motions, i.e., warm cyclones (cold anticyclones) moving towards (outward from) the rotation axis. The mean-square-displacements of the vortices increase faster than linearly at large time. This super-diffusive behavior can be satisfactorily explained by a Langevin model incorporating the centrifugal force. In the rapidly rotating regime ($1.6Ra_{c}{\le}Ra{\le}4Ra_{c}$) the vortices are densely distributed, with the size-distribution of their Voronoi cells differing significantly from the standard $Γ$ distribution. The hydrodynamic interaction of neighboring vortices results in formation of vortex clusters. Inside clusters the correlation of the vortex velocity fluctuations is scale free, with the correlation length being approximately $30\%$ of the cluster length. We examine the influence of cluster forming on the dynamics of individual vortex. Within clusters, cyclones exhibit inverse-centrifugal motion as they submit to the motion of strong anticyclones, while the velocity for outward motion of the anticyclones is increased. Our analysis show that the mobility of isolated vortices, scaled by their vorticity strength, is a simple power function of the Froude number.
△ Less
Submitted 27 May, 2023;
originally announced May 2023.
-
PaLM 2 Technical Report
Authors:
Rohan Anil,
Andrew M. Dai,
Orhan Firat,
Melvin Johnson,
Dmitry Lepikhin,
Alexandre Passos,
Siamak Shakeri,
Emanuel Taropa,
Paige Bailey,
Zhifeng Chen,
Eric Chu,
Jonathan H. Clark,
Laurent El Shafey,
Yan** Huang,
Kathy Meier-Hellstern,
Gaurav Mishra,
Erica Moreira,
Mark Omernick,
Kevin Robinson,
Sebastian Ruder,
Yi Tay,
Kefan Xiao,
Yuanzhong Xu,
Yu**g Zhang,
Gustavo Hernandez Abrego
, et al. (103 additional authors not shown)
Abstract:
We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on…
▽ More
We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on downstream tasks across different model sizes, while simultaneously exhibiting faster and more efficient inference compared to PaLM. This improved efficiency enables broader deployment while also allowing the model to respond faster, for a more natural pace of interaction. PaLM 2 demonstrates robust reasoning capabilities exemplified by large improvements over PaLM on BIG-Bench and other reasoning tasks. PaLM 2 exhibits stable performance on a suite of responsible AI evaluations, and enables inference-time control over toxicity without additional overhead or impact on other capabilities. Overall, PaLM 2 achieves state-of-the-art performance across a diverse set of tasks and capabilities.
When discussing the PaLM 2 family, it is important to distinguish between pre-trained models (of various sizes), fine-tuned variants of these models, and the user-facing products that use these models. In particular, user-facing products typically include additional pre- and post-processing steps. Additionally, the underlying models may evolve over time. Therefore, one should not expect the performance of user-facing products to exactly match the results reported in this report.
△ Less
Submitted 13 September, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Quantifying and Defending against Privacy Threats on Federated Knowledge Graph Embedding
Authors:
Yuke Hu,
Wei Liang,
Ruofan Wu,
Kai Xiao,
Weiqiang Wang,
Xiaochen Li,
**fei Liu,
Zhan Qin
Abstract:
Knowledge Graph Embedding (KGE) is a fundamental technique that extracts expressive representation from knowledge graph (KG) to facilitate diverse downstream tasks. The emerging federated KGE (FKGE) collaboratively trains from distributed KGs held among clients while avoiding exchanging clients' sensitive raw KGs, which can still suffer from privacy threats as evidenced in other federated model tr…
▽ More
Knowledge Graph Embedding (KGE) is a fundamental technique that extracts expressive representation from knowledge graph (KG) to facilitate diverse downstream tasks. The emerging federated KGE (FKGE) collaboratively trains from distributed KGs held among clients while avoiding exchanging clients' sensitive raw KGs, which can still suffer from privacy threats as evidenced in other federated model trainings (e.g., neural networks). However, quantifying and defending against such privacy threats remain unexplored for FKGE which possesses unique properties not shared by previously studied models. In this paper, we conduct the first holistic study of the privacy threat on FKGE from both attack and defense perspectives. For the attack, we quantify the privacy threat by proposing three new inference attacks, which reveal substantial privacy risk by successfully inferring the existence of the KG triple from victim clients. For the defense, we propose DP-Flames, a novel differentially private FKGE with private selection, which offers a better privacy-utility tradeoff by exploiting the entity-binding sparse gradient property of FKGE and comes with a tight privacy accountant by incorporating the state-of-the-art private selection technique. We further propose an adaptive privacy budget allocation policy to dynamically adjust defense magnitude across the training procedure. Comprehensive evaluations demonstrate that the proposed defense can successfully mitigate the privacy threat by effectively reducing the success rate of inference attacks from $83.1\%$ to $59.4\%$ on average with only a modest utility decrease.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
GPT-4 Technical Report
Authors:
OpenAI,
Josh Achiam,
Steven Adler,
Sandhini Agarwal,
Lama Ahmad,
Ilge Akkaya,
Florencia Leoni Aleman,
Diogo Almeida,
Janko Altenschmidt,
Sam Altman,
Shyamal Anadkat,
Red Avila,
Igor Babuschkin,
Suchir Balaji,
Valerie Balcom,
Paul Baltescu,
Haiming Bao,
Mohammad Bavarian,
Jeff Belgum,
Irwan Bello,
Jake Berdine,
Gabriel Bernadett-Shapiro,
Christopher Berner,
Lenny Bogdonoff,
Oleg Boiko
, et al. (256 additional authors not shown)
Abstract:
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo…
▽ More
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was develo** infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4's performance based on models trained with no more than 1/1,000th the compute of GPT-4.
△ Less
Submitted 4 March, 2024; v1 submitted 15 March, 2023;
originally announced March 2023.
-
In-Plane Electric Field Induced Orbital Hybridization of Excitonic States In Monolayer WSe2
Authors:
Bairen Zhu,
Ke Xiao,
Siyuan Yang,
Kenji Watanabe,
Takashi Taniguchi,
Xiaodong Cui
Abstract:
The giant exciton binding energy and the richness of degrees of freedom make monolayer transition metal dichalcogenide an unprecedented playground for exploring exciton physics in 2D systems. Thanks to the well energetically separated excitonic states, the response of the discrete excitonic states to the electric field could be precisely examined. Here we utilize the photocurrent spectroscopy to p…
▽ More
The giant exciton binding energy and the richness of degrees of freedom make monolayer transition metal dichalcogenide an unprecedented playground for exploring exciton physics in 2D systems. Thanks to the well energetically separated excitonic states, the response of the discrete excitonic states to the electric field could be precisely examined. Here we utilize the photocurrent spectroscopy to probe excitonic states under a static in-plane electric field. We demonstrate that the in-plane electric field leads to a significant orbital hybridization of Rydberg excitonic states with different angular momentum (especially orbital hybridization of 2s and 2p) and consequently optically actives 2p-state exciton. Besides, the electric-field controlled mixing of the high lying exciton state and continuum band enhances the oscillator strength of the discrete excited exciton states. This electric field modulation of the excitonic states in monolayer TMDs provides a paradigm of the manipulation of 2D excitons for potential applications of the electro-optical modulation in 2D semiconductors.
△ Less
Submitted 22 February, 2023;
originally announced February 2023.
-
Low-temperature specific heat and heat transport of Tb$_2$Ti$_{2-x}$Zr$_x$O$_7$ single crystals
Authors:
H. L. Che,
S. J. Li,
J. C. Wu,
N. Li,
S. K. Guang,
K. Xia,
X. Y. Yue,
Y. Y. Wang,
X. Zhao,
Q. J. Li,
X. F. Sun
Abstract:
We report a study on the specific heat and heat transport of Tb$_2$Ti$_{2-x}$Zr$_x$O$_7$ ($x =$ 0, 0.02, 0.1, 0.2, and 0.4) single crystals at low temperatures and in high magnetic fields. The magnetic specific heat can be described by the Schottky contribution from the crystal-electric-field (CEF) levels of Tb$^{3+}$, with introducing Gaussian distributions of the energy split of the ground-state…
▽ More
We report a study on the specific heat and heat transport of Tb$_2$Ti$_{2-x}$Zr$_x$O$_7$ ($x =$ 0, 0.02, 0.1, 0.2, and 0.4) single crystals at low temperatures and in high magnetic fields. The magnetic specific heat can be described by the Schottky contribution from the crystal-electric-field (CEF) levels of Tb$^{3+}$, with introducing Gaussian distributions of the energy split of the ground-state doublet and the gap between the ground state and first excited level. These crystals has an extremely low phonon thermal conductivity in a broad temperature range that can be attributed to the scattering by the magnetic excitations, which are mainly associated with the CEF levels. There is strong magnetic field dependence of thermal conductivity, which is more likely related to the field-induced changes of phonon scattering by the CEF levels than magnetic transitions or spin excitations. For magnetic field along the [111] direction, there is large thermal Hall conductivity at low temperatures which displays a broad peak around 8 T. At high fields up to 14 T, the thermal Hall conductivity decreases to zero, which supports its origin from either the spinon transport or the phonon skew scattering by CEF levels. The thermal Hall effect is rather robust with Zr do** up to 0.2 but is strongly weakened in higher Zr-doped sample.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
Hysteresis and training effect in the electric control of spin current in Pt/Y3Fe5O12 heterostructures
Authors:
Y. D. Sun,
Lei Wang,
Lili Lang,
Ke Xia,
S. M. Zhou
Abstract:
We have reported on the hysteresis and training effect of spin current in Pt/Y3Fe5O12 heterostructures during subsequent cycles of ionic liquid gate voltage Vg. The inverse spin Hall effect voltage in spin pum** and spin Hall magnetoresistance exhibit diode-like behaviors in the first half cycle of Vg andalsoshowhysteresisinthe first cycle of Vg. Both the diode-like behavior and the hysteresis b…
▽ More
We have reported on the hysteresis and training effect of spin current in Pt/Y3Fe5O12 heterostructures during subsequent cycles of ionic liquid gate voltage Vg. The inverse spin Hall effect voltage in spin pum** and spin Hall magnetoresistance exhibit diode-like behaviors in the first half cycle of Vg andalsoshowhysteresisinthe first cycle of Vg. Both the diode-like behavior and the hysteresis become weak and even vanish in the second cycle of Vg due to the training effect. The above experimental results can be well explained by the screening charge do** model, in which the charge and the local magnetic moment are asymmetrically distributed in the Pt layer. The applicability of this model is further confirmed by measurements of anisotropic magnetoresistance and ferromagnetic resonance. The diode-like behavior is attributed to interplay between the asymmetrically distributed local magnetic moment and the spin current relaxation in the Pt layer. The hysteresis and the training effect arise from the incompletely reversible process between oxidation and reduction of Pt atoms and the evolution of the surface morphology at the ionic liquid/Pt interface under electric gating. This work provides new insights to improve the functional performance of electrically controlled spin current devices.
△ Less
Submitted 19 February, 2023;
originally announced February 2023.
-
Principle of learning sign rules by neural networks in qubit lattice models
Authors:
** Cao,
Shijie Hu,
Zhi** Yin,
Ke Xia
Abstract:
A neural network is a powerful tool that can uncover hidden laws beyond human intuition. However, it often appears as a black box due to its complicated nonlinear structures. By drawing upon the Gutzwiller mean-field theory, we can showcase a principle of sign rules for ordered states in qubit lattice models. We introduce a shallow feed-forward neural network with a single hidden neuron to present…
▽ More
A neural network is a powerful tool that can uncover hidden laws beyond human intuition. However, it often appears as a black box due to its complicated nonlinear structures. By drawing upon the Gutzwiller mean-field theory, we can showcase a principle of sign rules for ordered states in qubit lattice models. We introduce a shallow feed-forward neural network with a single hidden neuron to present these sign rules. We conduct systematical benchmarks in various models, including the generalized Ising, spin-$1/2$ XY, (frustrated) Heisenberg rings, triangular XY antiferromagnet on a torus, and the Fermi-Hubbard ring at an arbitrary filling. These benchmarks show that all the leading-order sign rule characteristics can be visualized in classical forms, such as pitch angles. Besides, quantum fluctuations can result in an imperfect accuracy rate quantitatively.
△ Less
Submitted 22 December, 2023; v1 submitted 5 February, 2023;
originally announced February 2023.
-
Persistent Dirac for molecular representation
Authors:
JunJie Wee,
Ginestra Bianconi,
Kelin Xia
Abstract:
Molecular representations are of fundamental importance for the modeling and analysis of molecular systems. Representation models and in general approaches based on topological data analysis (TDA) have demonstrated great success in various steps of drug design and materials discovery. Here we develop a mathematically rigorous computational framework for molecular representation based on the persis…
▽ More
Molecular representations are of fundamental importance for the modeling and analysis of molecular systems. Representation models and in general approaches based on topological data analysis (TDA) have demonstrated great success in various steps of drug design and materials discovery. Here we develop a mathematically rigorous computational framework for molecular representation based on the persistent Dirac operator. The properties of the spectrum of the discrete weighted and unweighted Dirac matrices are systemically discussed and used to demonstrate the geometric and topological properties of both non-homology and homology eigenvectors of real molecular structures. This allows us to asses the influence of weighting schemes on the information encoded in the Dirac eigenspectrum. A series of physical persistent attributes, which characterize the spectrum of the Dirac matrices across a filtration, are proposed and used as efficient molecular fingerprints. Finally, our persistent Dirac-based model is used for clustering molecular configurations from nine types of organic-inorganic halide perovskites. We found that our model can cluster the structures very well, demonstrating the representation and featurization power of the current approach.
△ Less
Submitted 5 February, 2023;
originally announced February 2023.
-
Measuring The Impact Of Programming Language Distribution
Authors:
Gabriel Orlanski,
Kefan Xiao,
Xavier Garcia,
Jeffrey Hui,
Joshua Howland,
Jonathan Malmaud,
Jacob Austin,
Rishabh Singh,
Michele Catasta
Abstract:
Current benchmarks for evaluating neural code models focus on only a small subset of programming languages, excluding many popular languages such as Go or Rust. To ameliorate this issue, we present the BabelCode framework for execution-based evaluation of any benchmark in any language. BabelCode enables new investigations into the qualitative performance of models' memory, runtime, and individual…
▽ More
Current benchmarks for evaluating neural code models focus on only a small subset of programming languages, excluding many popular languages such as Go or Rust. To ameliorate this issue, we present the BabelCode framework for execution-based evaluation of any benchmark in any language. BabelCode enables new investigations into the qualitative performance of models' memory, runtime, and individual test case results. Additionally, we present a new code translation dataset called Translating Python Programming Puzzles (TP3) from the Python Programming Puzzles (Schuster et al. 2021) benchmark that involves translating expert-level python functions to any language. With both BabelCode and the TP3 benchmark, we investigate if balancing the distributions of 14 languages in a training dataset improves a large language model's performance on low-resource languages. Training a model on a balanced corpus results in, on average, 12.34% higher $pass@k$ across all tasks and languages compared to the baseline. We find that this strategy achieves 66.48% better $pass@k$ on low-resource languages at the cost of only a 12.94% decrease to high-resource languages. In our three translation tasks, this strategy yields, on average, 30.77% better low-resource $pass@k$ while having 19.58% worse high-resource $pass@k$.
△ Less
Submitted 24 May, 2023; v1 submitted 3 February, 2023;
originally announced February 2023.
-
J-PLUS: Towards an homogeneous photometric calibration using Gaia BP/RP low-resolution spectra
Authors:
C. López-Sanjuan,
H. Vázquez Ramió,
K. Xiao,
H. Yuan,
J. M. Carrasco,
J. Varela,
D. Cristóbal-Hornillos,
P. -E. Tremblay,
A. Ederoclite,
A. Marín-Franch,
A. J. Cenarro,
P. R. T. Coelho,
S. Daflon,
A. del Pino,
H. Domínguez Sánchez,
J. A. Fernández-Ontiveros,
A. Hernán-Caballero,
F. M. Jiménez-Esteban,
J. Alcaniz,
R. E. Angulo,
R. A. Dupke,
C. Hernández-Monteagudo,
M. Moles,
L. Sodré Jr
Abstract:
We present the photometric calibration of the twelve optical passbands for the Javalambre Photometric Local Universe Survey (J-PLUS) third data release (DR3), comprising 1642 pointings of two square degrees each. We selected nearly 1.5 million main sequence stars with a signal-to-noise ratio larger than ten in the twelve J-PLUS passbands and available low-resolution (R = 20-80) spectrum from the b…
▽ More
We present the photometric calibration of the twelve optical passbands for the Javalambre Photometric Local Universe Survey (J-PLUS) third data release (DR3), comprising 1642 pointings of two square degrees each. We selected nearly 1.5 million main sequence stars with a signal-to-noise ratio larger than ten in the twelve J-PLUS passbands and available low-resolution (R = 20-80) spectrum from the blue and red photometers (BP/RP) in Gaia DR3. We compared the synthetic photometry from BP/RP spectra with the J-PLUS instrumental magnitudes, after correcting for the magnitude and color terms between both systems, to obtain an homogeneous photometric solution for J-PLUS. To circumvent the current limitations in the absolute calibration of the BP/RP spectra, the absolute color scale was derived using the locus of 109 white dwarfs closer than 100 pc with a negligible interstellar extinction. Finally, the absolute flux scale was anchored to the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS) photometry in the r band. The precision of the J-PLUS photometric calibration, estimated from duplicated objects observed in adjacent pointings and by comparison with the spectro-photometric standard star GD 153, is ~12 mmag in u, J0378, and J0395; and ~7 mmag in J0410, J0430, g, J0515, r, J0660, i, J0861, and z. The estimated accuracy in the calibration along the surveyed area is better than 1% for all the passbands. The Gaia BP/RP spectra provide a high-quality, homogeneous photometric reference in the optical range across the full-sky, in spite of their current limitations as an absolute reference. The calibration method for J-PLUS DR3 reaches an absolute precision and accuracy of 1% in the twelve optical filters within an area of 3284 square degrees.
△ Less
Submitted 29 January, 2023;
originally announced January 2023.
-
Wrap** dynamics and full uptake conditions for nonspherical active nanoparticles
Authors:
Ke Xiao,
Rui Ma,
Chen-Xu Wu
Abstract:
The cellular uptake of self-propelled nanoparticles (NPs) or viruses, usually nonspherical, by cell membrane is crucial in many biological processes. In this study, using Onsager variational principle, we obtain a general wrap** equation for nonspherical self-propelled nanoparticles. Two analytical critical conditions are theoretically derived, one for the continuous full uptake of prolate parti…
▽ More
The cellular uptake of self-propelled nanoparticles (NPs) or viruses, usually nonspherical, by cell membrane is crucial in many biological processes. In this study, using Onsager variational principle, we obtain a general wrap** equation for nonspherical self-propelled nanoparticles. Two analytical critical conditions are theoretically derived, one for the continuous full uptake of prolate particles and the other for snapthrough full wrap** of oblate particles. They capture considerably well the full uptake critical boundaries in the phase diagrams constructed in terms of active force, aspect ratio, adhesion energy density, and membrane tension based on numerical calculations. It is found that enhancing activity (active force), reducing effective dynamic viscosity, increasing adhesion energy density, and decreasing membrane tension, can significantly improve the wrap** efficiency for the self-propelled particles. These results elucidate some of the previous specific investigations conclusively and may offer novel possibilities for designing an effective active NP-based vehicle for controlled drug delivery.
△ Less
Submitted 13 January, 2023;
originally announced January 2023.
-
Natural Language to Code Generation in Interactive Data Science Notebooks
Authors:
Pengcheng Yin,
Wen-Ding Li,
Kefan Xiao,
Abhishek Rao,
Yeming Wen,
Kensen Shi,
Joshua Howland,
Paige Bailey,
Michele Catasta,
Henryk Michalewski,
Alex Polozov,
Charles Sutton
Abstract:
Computational notebooks, such as Jupyter notebooks, are interactive computing environments that are ubiquitous among data scientists to perform data wrangling and analytic tasks. To measure the performance of AI pair programmers that automatically synthesize programs for those tasks given natural language (NL) intents from users, we build ARCADE, a benchmark of 1082 code generation problems using…
▽ More
Computational notebooks, such as Jupyter notebooks, are interactive computing environments that are ubiquitous among data scientists to perform data wrangling and analytic tasks. To measure the performance of AI pair programmers that automatically synthesize programs for those tasks given natural language (NL) intents from users, we build ARCADE, a benchmark of 1082 code generation problems using the pandas data analysis framework in data science notebooks. ARCADE features multiple rounds of NL-to-code problems from the same notebook. It requires a model to understand rich multi-modal contexts, such as existing notebook cells and their execution states as well as previous turns of interaction. To establish a strong baseline on this challenging task, we develop PaChiNCo, a 62B code language model (LM) for Python computational notebooks, which significantly outperforms public code LMs. Finally, we explore few-shot prompting strategies to elicit better code with step-by-step decomposition and NL explanation, showing the potential to improve the diversity and explainability of model predictions.
△ Less
Submitted 19 December, 2022;
originally announced December 2022.
-
A passive bias-free ultrabroadband optical isolator based on unidirectional self-induced transparency
Authors:
Haodong Wu,
Jiangshan Tang,
Mingyuan Chen,
Min Xiao,
Franco Nori,
Keyu Xia,
Yanqing Lu
Abstract:
Achieving a broadband nonreciprocal device without gain and any external bias is very challenging and highly desirable for modern photonic technologies and quantum networks. Here, we theoretically propose a passive and bias-free all-optical isolator for a femtosecond laser pulse by exploiting a new mechanism of unidirectional self-induced transparency, obtained with a nonlinear medium followed by…
▽ More
Achieving a broadband nonreciprocal device without gain and any external bias is very challenging and highly desirable for modern photonic technologies and quantum networks. Here, we theoretically propose a passive and bias-free all-optical isolator for a femtosecond laser pulse by exploiting a new mechanism of unidirectional self-induced transparency, obtained with a nonlinear medium followed by a normal absorbing medium at one side. The transmission contrast between the forward and backward directions can reach ~14.3 dB for a 2π5 fs laser pulse, implying isolation of a signal with an ultrabroad bandwidth of 200 THz. The 20 dB bandwidth is about 57 nm, already comparable with a magneto-optical isolator. This cavity-free optical isolator may pave the way to integrated nonmagnetic isolation of ultrashort laser pulses.
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
Vesiculation mechanisms mediated by anisotropic proteins
Authors:
Ke Xiao,
Chen-Xu Wu,
Rui Ma
Abstract:
Endocytosis is an essential biological process for the trafficking of macromolecules (cargo) and membrane proteins in cells. In yeast cells, this involves the invagination of a tubular structure on the membrane and the formation of endocytic vesicles. Bin/Amphiphysin/Rvs (BAR) proteins holding a crescent-shape are generally assumed to be the active player to squeeze the tubular structure and pinch…
▽ More
Endocytosis is an essential biological process for the trafficking of macromolecules (cargo) and membrane proteins in cells. In yeast cells, this involves the invagination of a tubular structure on the membrane and the formation of endocytic vesicles. Bin/Amphiphysin/Rvs (BAR) proteins holding a crescent-shape are generally assumed to be the active player to squeeze the tubular structure and pinch off the vesicle by forming a scaffold on the side of the tubular membrane. Here we use the extended Helfrich model to theoretically investigate how BAR proteins help drive the formation of vesicles via generating anisotropic curvatures. Our results show that, within the classical Helfrich model, increasing the spontaneous curvature at the side of a tubular membrane is unable to reduce the tube radius to a critical size to induce membrane fission. However, membranes coated with proteins that generate anisotropic curvatures are prone to experience an hourglass-shaped necking or a tube-shaped necking process, an important step leading to membrane fission and vesicle formation. In addition, our study shows that depending on the type of anisotropic curvatures generated by a protein, the force to maintain the protein coated membrane at a tubular shape exhibits qualitatively different relationship with the spontaneous curvature. This result provides an experimental guidance to determine the type of anisotropic curvatures of a protein.
△ Less
Submitted 17 November, 2022;
originally announced November 2022.
-
Thermal Transport of Fractionalized Antiferromagnetic and Field Induced States in the Kitaev Material Na$_2$Co$_2$TeO$_6$
Authors:
S. K. Guang,
N. Li,
R. L. Luo,
Q. Huang,
Y. Y. Wang,
X. Y. Yue,
K. Xia,
Q. J. Li,
X. Zhao,
G. Chen,
H. D. Zhou,
X. F. Sun
Abstract:
We report an in-plane thermal transport study of the honeycomb Kitaev material Na$_2$Co$_2$TeO$_6$ at subKelvin temperatures. In zero field, the $κ(T)$ displays a rather weak $T$-dependence but has a non-zero residual term $κ_0/T$, indicating strong phonon scattering by magnetic excitation and the possibility of itinerant spinon-like excitations coexisting with an antiferromagnetic order below 27…
▽ More
We report an in-plane thermal transport study of the honeycomb Kitaev material Na$_2$Co$_2$TeO$_6$ at subKelvin temperatures. In zero field, the $κ(T)$ displays a rather weak $T$-dependence but has a non-zero residual term $κ_0/T$, indicating strong phonon scattering by magnetic excitation and the possibility of itinerant spinon-like excitations coexisting with an antiferromagnetic order below 27 K. We propose the zero-field ground state is a novel fractionalized antiferromagnetic (AF*) state with both magnetic order and fractionalized excitations. With both the heat current and external field along the $a*$ (Co-Co bond) direction, the $κ_{a*}$ exhibits two sharp minima at 7.5 T and 10 T, and its value at 8.5 T is almost the same as the pure phononic transport for the high-field polarized state. This confirms the phase boundaries of the reported field-induced intermediate state and suggest its gapless continuum excitations possibly transport heat. No such intermediate phase was found in the $κ_a$ for the current and field along the $a$ (zigzag chain) direction. Finally, Na$_2$Co$_2$TeO$_6$ displays a strongly anisotropic magneto-thermal conductivity since the in-plane (out-of-plane) field strongly enhances (suppresses) the $κ_{a*}$ and $κ_a$.
△ Less
Submitted 15 November, 2022;
originally announced November 2022.
-
Efficiently Scaling Transformer Inference
Authors:
Reiner Pope,
Sholto Douglas,
Aakanksha Chowdhery,
Jacob Devlin,
James Bradbury,
Anselm Levskaya,
Jonathan Heek,
Kefan Xiao,
Shivani Agrawal,
Jeff Dean
Abstract:
We study the problem of efficient generative inference for Transformer models, in one of its most challenging settings: large deep models, with tight latency targets and long sequence lengths. Better understanding of the engineering tradeoffs for inference for large Transformer-based models is important as use cases of these models are growing rapidly throughout application areas. We develop a sim…
▽ More
We study the problem of efficient generative inference for Transformer models, in one of its most challenging settings: large deep models, with tight latency targets and long sequence lengths. Better understanding of the engineering tradeoffs for inference for large Transformer-based models is important as use cases of these models are growing rapidly throughout application areas. We develop a simple analytical model for inference efficiency to select the best multi-dimensional partitioning techniques optimized for TPU v4 slices based on the application requirements. We combine these with a suite of low-level optimizations to achieve a new Pareto frontier on the latency and model FLOPS utilization (MFU) tradeoffs on 500B+ parameter models that outperforms the FasterTransformer suite of benchmarks. We further show that with appropriate partitioning, the lower memory requirements of multiquery attention (i.e. multiple query heads share single key/value head) enables scaling up to 32x larger context lengths. Finally, we achieve a low-batch-size latency of 29ms per token during generation (using int8 weight quantization) and a 76% MFU during large-batch-size processing of input tokens, while supporting a long 2048-token context length on the PaLM 540B parameter model.
△ Less
Submitted 9 November, 2022;
originally announced November 2022.
-
Quality-Cost Trade-off on Constructing Logical Views for Vehicular Cyber-Physical Systems: A Deep Reinforcement Learning Approach
Authors:
Junyuan Wu,
Xincao Xu,
Chuzhao Li,
Hao Zhang,
Ke Xiao,
Kai Liu
Abstract:
With the development of sensing technologies, vehicle-to-everything (V2X) communications, edge computing paradigm, vehicular cyber-physical systems (VCPS) are emerging as the most fundamental platform for realizing future intelligent transportation systems (ITSs). In particular, the construction of logical views at the edge nodes based on heterogeneous information sensing and uploading are critica…
▽ More
With the development of sensing technologies, vehicle-to-everything (V2X) communications, edge computing paradigm, vehicular cyber-physical systems (VCPS) are emerging as the most fundamental platform for realizing future intelligent transportation systems (ITSs). In particular, the construction of logical views at the edge nodes based on heterogeneous information sensing and uploading are critical to the realization of VCPS. However, a higher-quality view in terms of timeliness and accuracy may require higher cost on sensing and uploading. In view of this, this paper is dedicated to striking a balance between the quality and the cost for constructing logical views of VCPS. Specifically, we first derive an information sensing model based on multi-class M/G/1 priority queue and a data uploading model based on reliability-guaranteed vehicle-to-infrastructure (V2I) communications. On this basis, we design two metrics, namely, age of view (AoV) and cost of view (CoV), simultaneously. Then, we formulate a bi-objective problem to maximize the AoV and minimize the CoV. Further, we propose a distributed distributional deep deterministic policy gradient (D4PG) solution to determine sensing information, frequency, uploading priority, transmission power, and V2I bandwidth. Finally, we build a simulation model and give a comprehensive performance evaluation, and the simulation results conclusively demonstrate the superiority of the proposed solution.
△ Less
Submitted 19 September, 2023; v1 submitted 31 October, 2022;
originally announced November 2022.
-
Entanglement-enhanced optomechanical sensor array for dark matter searches
Authors:
Anthony J. Brady,
Xin Chen,
Kewen Xiao,
Yi Xia,
Jack Manley,
Mitul Dey Chowdhury,
Zhen Liu,
Roni Harnik,
Dalziel J. Wilson,
Zheshen Zhang,
Quntao Zhuang
Abstract:
The nature of dark matter is one of the most important open questions in modern physics. The search for dark matter is challenging since, besides gravitational interaction, it feebly interacts with ordinary matter. Mechanical sensors are one of the leading candidates for dark matter searches in the low frequency region. Here, we propose entanglement-enhanced optomechanical sensing systems to assis…
▽ More
The nature of dark matter is one of the most important open questions in modern physics. The search for dark matter is challenging since, besides gravitational interaction, it feebly interacts with ordinary matter. Mechanical sensors are one of the leading candidates for dark matter searches in the low frequency region. Here, we propose entanglement-enhanced optomechanical sensing systems to assist the search for DM with mechanical sensing devices. To assess the performance of our setup, we adopt the integrated sensitivity, which is particularly suitable for broadband sensing as it precisely quantifies the bandwidth-sensitivity tradeoff of the system. We then show that, by coherently operating the optomechanical sensor array and utilizing continuous-variable multi-partite entanglement between the optical fields, the array of sensors has a scaling advantage over independent sensors (i.e., $\sqrt{M}\rightarrow M$, where $M$ is the number of sensors) as well as a performance boost due to entanglement. Such an advantage is robust to imhomogeneities of the mechanical sensors and is achievable with off-the-shelf experimental components.
△ Less
Submitted 7 December, 2022; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Neural Causal Models for Counterfactual Identification and Estimation
Authors:
Kevin Xia,
Yushu Pan,
Elias Bareinboim
Abstract:
Evaluating hypothetical statements about how the world would be had a different course of action been taken is arguably one key capability expected from modern AI systems. Counterfactual reasoning underpins discussions in fairness, the determination of blame and responsibility, credit assignment, and regret. In this paper, we study the evaluation of counterfactual statements through neural models.…
▽ More
Evaluating hypothetical statements about how the world would be had a different course of action been taken is arguably one key capability expected from modern AI systems. Counterfactual reasoning underpins discussions in fairness, the determination of blame and responsibility, credit assignment, and regret. In this paper, we study the evaluation of counterfactual statements through neural models. Specifically, we tackle two causal problems required to make such evaluations, i.e., counterfactual identification and estimation from an arbitrary combination of observational and experimental data. First, we show that neural causal models (NCMs) are expressive enough and encode the structural constraints necessary for performing counterfactual reasoning. Second, we develop an algorithm for simultaneously identifying and estimating counterfactual distributions. We show that this algorithm is sound and complete for deciding counterfactual identification in general settings. Third, considering the practical implications of these results, we introduce a new strategy for modeling NCMs using generative adversarial networks. Simulations corroborate with the proposed methodology.
△ Less
Submitted 30 September, 2022;
originally announced October 2022.
-
Dynamic Nonreciprocity with a Kerr Nonlinear Resonator
Authors:
Rui-Kai Pan,
Lei Tang,
Keyu Xia,
Franco Nori
Abstract:
On-chip optical nonreciprocal devices are vital components for integrated photonic systems and scalable quantum information processing. Nonlinear optical isolators and circulators have attracted considerable attention because of their fundamental interest and their important advantages in integrated photonic circuits. However, optical nonreciprocal devices based on Kerr or Kerr-like nonlinearity a…
▽ More
On-chip optical nonreciprocal devices are vital components for integrated photonic systems and scalable quantum information processing. Nonlinear optical isolators and circulators have attracted considerable attention because of their fundamental interest and their important advantages in integrated photonic circuits. However, optical nonreciprocal devices based on Kerr or Kerr-like nonlinearity are subject to dynamical reciprocity when the forward and backward signals coexist simultaneously in a nonlinear system. Here, we theoretically propose a method for realizing on-chip nonlinear isolators and circulators with dynamic nonreciprocity. Dynamic nonreciprocity is achieved via the chiral modulation on the resonance frequency due to coexisting self- and cross-Kerr nonlinearities in an optical ring resonator. This work showing dynamic nonreciprocity with a Kerr nonlinear resonator can be an essential step toward integrated optical isolation.
△ Less
Submitted 10 November, 2022; v1 submitted 19 August, 2022;
originally announced August 2022.
-
Age of View: A New Metric for Evaluating Heterogeneous Information Fusion in Vehicular Cyber-Physical Systems
Authors:
Xincao Xu,
Kai Liu,
Qisen Zhang,
Hao Jiang,
Ke Xiao,
Jiangtao Luo
Abstract:
Heterogeneous information fusion is one of the most critical issues for realizing vehicular cyber-physical systems (VCPSs). This work makes the first attempt at quantitatively measuring the quality of heterogeneous information fusion in VCPS by designing a new metric called Age of View (AoV). Specifically, we derive a sensing model based on a multi-class M/G/1 priority queue and a transmission mod…
▽ More
Heterogeneous information fusion is one of the most critical issues for realizing vehicular cyber-physical systems (VCPSs). This work makes the first attempt at quantitatively measuring the quality of heterogeneous information fusion in VCPS by designing a new metric called Age of View (AoV). Specifically, we derive a sensing model based on a multi-class M/G/1 priority queue and a transmission model based on Shannon theory. On this basis, we formally define AoV by modeling the timeliness, completeness, and consistency of the heterogeneous information fusion in VCPS and formulate the problem aiming to minimize the system's average AoV. Further, we propose a new solution called Multi-agent Difference-Reward-based deep reinforcement learning with a Greedy Bandwidth Allocation (MDR-GBA) to solve the problem. In particular, each vehicle acts as an independent agent and decides the sensing frequencies and uploading priorities of heterogeneous information. Meanwhile, the roadside unit (RSU) decides the Vehicle-to-Infrastructure (V2I) bandwidth allocation for each vehicle based on a greedy scheme. Finally, we build the simulation model and compare the performance of the proposed solution with state-of-the-art algorithms. The experimental results conclusively demonstrate the significance of the new metric and the superiority of the proposed solution.
△ Less
Submitted 31 July, 2022;
originally announced August 2022.
-
Low-temperature transport properties of intermetallic compound HoAgGe with kagome spin ice state
Authors:
N. Li,
Q. Huang,
X. Y. Yue,
S. K. Guang,
K. Xia,
Y. Y. Wang,
Q. J. Li,
X. Zhao,
H. D. Zhou,
X. F. Sun
Abstract:
We study the magnetic susceptibility, magnetization, resistivity and thermal conductivity of intermetallic HoAgGe single crystals at low temperatures and in magnetic fields along the $a$ and $c$ axis, while the electric and heat currents are along the $c$ axis. The magnetization curves show a series of metamagnetic transitions and small hysteresis at low field for $B \parallel a$, and a weak metam…
▽ More
We study the magnetic susceptibility, magnetization, resistivity and thermal conductivity of intermetallic HoAgGe single crystals at low temperatures and in magnetic fields along the $a$ and $c$ axis, while the electric and heat currents are along the $c$ axis. The magnetization curves show a series of metamagnetic transitions and small hysteresis at low field for $B \parallel a$, and a weak metamagnetic transition for $B \parallel c$, respectively. Both the magnetic susceptibility and $ρ(T)$ curve show anomalies at the antiferromagnetic transition ($T\rm_N \sim$ 11.3 K) and spin reorientation transition ($\sim$ 7 K). In zero field and at very low temperatures, the electrons are found to be the main heat carriers. For $B \parallel a$, the $ρ(B)$ curves display large and positive transverse magnetoresistance (MR) with extraordinary field dependence between $B^2$ and $B$-linear, accompanied with anomalies at the metamagnetic transitions and low-field hysteresis; meanwhile, the $κ(B)$ mainly decrease with increasing field and display some anomalies at the metamagnetic transitions. For $B \parallel c$, there is weak and negative longitudinal MR while the $κ(B)$ show rather strong field dependence, indicating the role of phonon heat transport.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
Magneto-thermomechanically triggered active mechanical metamaterials -- untethered, reversible, reprogrammable transformations with shape locking
Authors:
Bihui Zou,
Zihe Liang,
Zhiming Cui,
Kai Xiao,
Shuang Shao,
Jaehyung Ju
Abstract:
Future active metamaterials for reconfigurable structural applications require fast, untethered, reversible, and reprogrammable (multimodal) transformability with shape locking. Herein, we aim to construct and demonstrate a magneto-thermomechanical tool that enables a single material system to transform with untethered, reversible, low-powered reprogrammable deformations and shape locking via the…
▽ More
Future active metamaterials for reconfigurable structural applications require fast, untethered, reversible, and reprogrammable (multimodal) transformability with shape locking. Herein, we aim to construct and demonstrate a magneto-thermomechanical tool that enables a single material system to transform with untethered, reversible, low-powered reprogrammable deformations and shape locking via the application of magneto-thermomechanically triggered prestress on a shape memory polymer and structural instability with asymmetric magnetic torque. We demonstrate the mutual assistance of two physics concepts - magnetic control combined with the thermomechanical behavior of shape memory polymers, without requiring new materials synthesis and high-power energy for reprogramming. Our approach can open a new path of active metamaterials, flexible yet stiff soft robots, and multimodal morphing structures, where we can design them in reversible and reprogrammable ways.
△ Less
Submitted 7 July, 2022;
originally announced July 2022.
-
Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization
Authors:
Kun Xia,
Le Wang,
San** Zhou,
Nanning Zheng,
Wei Tang
Abstract:
The main challenge of Temporal Action Localization is to retrieve subtle human actions from various co-occurring ingredients, e.g., context and background, in an untrimmed video. While prior approaches have achieved substantial progress through devising advanced action detectors, they still suffer from these co-occurring ingredients which often dominate the actual action content in videos. In this…
▽ More
The main challenge of Temporal Action Localization is to retrieve subtle human actions from various co-occurring ingredients, e.g., context and background, in an untrimmed video. While prior approaches have achieved substantial progress through devising advanced action detectors, they still suffer from these co-occurring ingredients which often dominate the actual action content in videos. In this paper, we explore two orthogonal but complementary aspects of a video snippet, i.e., the action features and the co-occurrence features. Especially, we develop a novel auxiliary task by decoupling these two types of features within a video snippet and recombining them to generate a new feature representation with more salient action information for accurate action localization. We term our method RefactorNet, which first explicitly factorizes the action content and regularizes its co-occurrence features, and then synthesizes a new action-dominated video representation. Extensive experimental results and ablation studies on THUMOS14 and ActivityNet v1.3 demonstrate that our new representation, combined with a simple action detector, can significantly improve the action localization performance.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
Sign-reversed anomalous Nernst effect in the ferromagnetic Weyl-semimetal Fe$_{3-x}$GeTe$_2$: the role of Fe vacancies
Authors:
Haiyang Yang,
Qi Wang,
Junwu Huang,
Zhouliang Wang,
Keqi Xia,
Chao Cao,
Mingliang Tian,
Zhuan Xu,
Jianhui Dai,
Yuke Li
Abstract:
Anomalous Nernst effect, as a thermal partner of anomalous Hall effect, is particularly sensitive to the Berry curvature anomaly near the Fermi level, and has been used to probe the topological nature of quantum materials. In this work, we report the observation of both effects in the ferromagnetic Weyl-semimetal Fe$_{3-x}$GeTe$_2$ with tunable Fe vacancies. With decreasing Fe vacancies, the anoma…
▽ More
Anomalous Nernst effect, as a thermal partner of anomalous Hall effect, is particularly sensitive to the Berry curvature anomaly near the Fermi level, and has been used to probe the topological nature of quantum materials. In this work, we report the observation of both effects in the ferromagnetic Weyl-semimetal Fe$_{3-x}$GeTe$_2$ with tunable Fe vacancies. With decreasing Fe vacancies, the anomalous Hall conductivity evolves as a function of the longitudinal conductivity from the hop** region to the region where the intrinsic Berry curvature contribution dominates. Concomitant evolutions in the anomalous Nernst signal and the anomalous off-diagonal thermoelectric coefficient are observed below the Curie temperature, displaying a unique sign change caused by the Fe vacancies. Combining these results with first-principles calculations, we argue that the Fe-vacancy concentration plays a unique role in simultaneously tuning the chemical potential and ferromagnetism, which in turn controls the Berry curvature contribution in this family of ferromagnetic topological semimetals.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
Photometric calibration methods for wide-field photometric surveys
Authors:
Bowen Huang,
Kai Xiao,
Haibo Yuan
Abstract:
Uniform and accurate photometric calibration plays an important role in the current and next-generation wide-field imaging surveys. Herein, we review the modern photometric calibration methods, including the classic standard star method, "hardware/observation-driven" methods (such as the Ubercalibration, Hypercalibration, and Forward Global Calibration Methods), and "software/physics-driven" metho…
▽ More
Uniform and accurate photometric calibration plays an important role in the current and next-generation wide-field imaging surveys. Herein, we review the modern photometric calibration methods, including the classic standard star method, "hardware/observation-driven" methods (such as the Ubercalibration, Hypercalibration, and Forward Global Calibration Methods), and "software/physics-driven" methods (e.g., the Stellar Locus Regression, Stellar Locus, and Stellar Color Regression Methods). Further, we discuss their advantages, limitations, and future developments toward millimagnitude precision calibration.
△ Less
Submitted 2 June, 2022;
originally announced June 2022.
-
Volumetric-map**-based inverse design of 3D architected materials and mobility control by topology reconstruction
Authors:
Kai Xiao,
Xiang Zhou,
Jaehyung Ju
Abstract:
The recent development of modular origami structures has ushered in a new era for active metamaterials with multiple degrees of freedom (multi-DOF). Notably, no systematic inverse design approach for volumetric modular origami structures has been reported. Moreover, very few topologies of modular origami have been studied for the design of active metamaterials with multi-DOF. Herein, we develop an…
▽ More
The recent development of modular origami structures has ushered in a new era for active metamaterials with multiple degrees of freedom (multi-DOF). Notably, no systematic inverse design approach for volumetric modular origami structures has been reported. Moreover, very few topologies of modular origami have been studied for the design of active metamaterials with multi-DOF. Herein, we develop an inverse design method and reconfigurable algorithm for constructing 3D active architected structures - we synthesize modular origami structures that can be volumetrically mapped to a target 3D shape. We can control the reconfigurability by reconstructing the topology of the architected structures. Our inverse design based on volumetric map** with mobility control by topology reconstruction can be used to construct architected metamaterials with any 3D complex shape that are also transformable with multi-DOF. Our work opens a new path toward 3D reconfigurable structures based on volumetric inverse design. This work is significant for the design of 3D active metamaterials and 3D morphing devices for automotive, aerospace, and biomedical engineering applications.
△ Less
Submitted 10 May, 2022;
originally announced May 2022.
-
Causal Transportability for Visual Recognition
Authors:
Chengzhi Mao,
Kevin Xia,
James Wang,
Hao Wang,
Junfeng Yang,
Elias Bareinboim,
Carl Vondrick
Abstract:
Visual representations underlie object recognition tasks, but they often contain both robust and non-robust features. Our main observation is that image classifiers may perform poorly on out-of-distribution samples because spurious correlations between non-robust features and labels can be changed in a new environment. By analyzing procedures for out-of-distribution generalization with a causal gr…
▽ More
Visual representations underlie object recognition tasks, but they often contain both robust and non-robust features. Our main observation is that image classifiers may perform poorly on out-of-distribution samples because spurious correlations between non-robust features and labels can be changed in a new environment. By analyzing procedures for out-of-distribution generalization with a causal graph, we show that standard classifiers fail because the association between images and labels is not transportable across settings. However, we then show that the causal effect, which severs all sources of confounding, remains invariant across domains. This motivates us to develop an algorithm to estimate the causal effect for image classification, which is transportable (i.e., invariant) across source and target environments. Without observing additional variables, we show that we can derive an estimand for the causal effect under empirical assumptions using representations in deep models as proxies. Theoretical analysis, empirical results, and visualizations show that our approach captures causal invariances and improves overall generalization.
△ Less
Submitted 26 April, 2022;
originally announced April 2022.
-
Nonlinear dissipation induced photon blockade
Authors:
Xin Su,
Jiang-Shan Tang,
Keyu Xia
Abstract:
We theoretically propose a scheme for photon blockade in a cavity quantum electrodynamical system consisting of an N-type atomic medium interacting with a single-mode Fabry-Perot cavity. In contrast to inefficient nonlinear-dispersion-induced photon blockade suppressed by a large detuning, the photon blockade in our scheme is induced by a large nonlinear dissipation of the cavity created by the N-…
▽ More
We theoretically propose a scheme for photon blockade in a cavity quantum electrodynamical system consisting of an N-type atomic medium interacting with a single-mode Fabry-Perot cavity. In contrast to inefficient nonlinear-dispersion-induced photon blockade suppressed by a large detuning, the photon blockade in our scheme is induced by a large nonlinear dissipation of the cavity created by the N-type atomic system. A deep photon blockade is manifested with a vanishing equal-time second-order correlation function within the cavity linewidth. This work provides an efficient photon blockade because it work in the near-resonance case.
△ Less
Submitted 6 April, 2022;
originally announced April 2022.