Search | arXiv e-print repository

Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision

Authors: Zhaoqing Wang, Xiaobo Xia, Ziye Chen, Xiao He, Yandong Guo, Mingming Gong, Tongliang Liu

Abstract: Current state-of-the-art open-vocabulary segmentation methods typically rely on image-mask-text triplet annotations for supervision. However, acquiring such detailed annotations is labour-intensive and poses scalability challenges in complex real-world scenarios. While existing weakly-supervised approaches leverage image-text pairs to reduce the expansive annotation cost, the lack of mask supervis… ▽ More Current state-of-the-art open-vocabulary segmentation methods typically rely on image-mask-text triplet annotations for supervision. However, acquiring such detailed annotations is labour-intensive and poses scalability challenges in complex real-world scenarios. While existing weakly-supervised approaches leverage image-text pairs to reduce the expansive annotation cost, the lack of mask supervision makes it difficult for the model to locate multiple instances and accurately group pixels with similar semantics, significantly hampering versatility and performance. In this paper, we introduce Unpair-Seg, a novel weakly-supervised open-vocabulary segmentation framework that learns from unpaired image-mask and image-text pairs, which can be independently and efficiently collected. Unpair-Seg initially predicts a set of binary masks and generates pseudo labels by identifying confident pairs of masks and text entities. We then train a feature adapter to align region embeddings with text embeddings based on these pseudo labels, achieving open-vocabulary segmentation. However, the inherent noise in the mask-entity correspondence poses a challenge to obtaining reliable pairs. To address this, we employ a vision-language large model to re-caption the input images and extract precise entities, and we design a multi-scale matching strategy to reduce noisy mask-entity pairs. Our Unpair-Seg framework demonstrates impressive performance, achieving 14.6\% and 19.5\% mIoU on the ADE-847 and PASCAL Context-459 datasets, significantly narrowing the gap between fully-supervised and weakly-supervised methods. △ Less

Submitted 11 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

Comments: 27 pages, 18 figures, 10 tables

arXiv:2402.08919 [pdf, other]

Interpretable Measures of Conceptual Similarity by Complexity-Constrained Descriptive Auto-Encoding

Authors: Alessandro Achille, Greg Ver Steeg, Tian Yu Liu, Matthew Trager, Carson Klingenberg, Stefano Soatto

Abstract: Quantifying the degree of similarity between images is a key copyright issue for image-based machine learning. In legal doctrine however, determining the degree of similarity between works requires subjective analysis, and fact-finders (judges and juries) can demonstrate considerable variability in these subjective judgement calls. Images that are structurally similar can be deemed dissimilar, whe… ▽ More Quantifying the degree of similarity between images is a key copyright issue for image-based machine learning. In legal doctrine however, determining the degree of similarity between works requires subjective analysis, and fact-finders (judges and juries) can demonstrate considerable variability in these subjective judgement calls. Images that are structurally similar can be deemed dissimilar, whereas images of completely different scenes can be deemed similar enough to support a claim of copying. We seek to define and compute a notion of "conceptual similarity" among images that captures high-level relations even among images that do not share repeated elements or visually similar components. The idea is to use a base multi-modal model to generate "explanations" (captions) of visual data at increasing levels of complexity. Then, similarity can be measured by the length of the caption needed to discriminate between the two images: Two highly dissimilar images can be discriminated early in their description, whereas conceptually dissimilar ones will need more detail to be distinguished. We operationalize this definition and show that it correlates with subjective (averaged human evaluation) assessment, and beats existing baselines on both image-to-image and text-to-text similarity benchmarks. Beyond just providing a number, our method also offers interpretability by pointing to the specific level of granularity of the description where the source data are differentiated. △ Less

Submitted 13 February, 2024; originally announced February 2024.

arXiv:2402.08901 [pdf]

Characterization of the ATLAS Liquid Argon Front-End ASIC ALFE2 for the HL-LHC upgrade

Authors: D. Matakias, G. Carini, H. Chen, M. Dabrowski, G. Deptuch, L. Duflot, J. Kierstead, T. Liu, H. Ma, N. Morange, S. Rescia, S. Tang, H. Xu

Abstract: ALFE2 is an ATLAS Liquid Argon Calorimeter (LAr) Front-End ASIC designed for the HL-LHC upgrade. ALFE2 comprises four channels of pre-amplifiers and CR-(RC)2 shapers with adjustable input impedance. ALFE2 features two separate gain outputs to provide 16-bit dynamic-range coverage and an optimum resolution. ALFE2 is characterized using a Front-End Test Board (FETB) based on a Zynq UltraScale+ MPSoC… ▽ More ALFE2 is an ATLAS Liquid Argon Calorimeter (LAr) Front-End ASIC designed for the HL-LHC upgrade. ALFE2 comprises four channels of pre-amplifiers and CR-(RC)2 shapers with adjustable input impedance. ALFE2 features two separate gain outputs to provide 16-bit dynamic-range coverage and an optimum resolution. ALFE2 is characterized using a Front-End Test Board (FETB) based on a Zynq UltraScale+ MPSoC and two octal-channel 16-bit high-speed ADCs. The test results indicate that ALFE2 fulfills or greatly exceeds all specifications on gain, noise, linearity, uniformity, and radiation tolerance. △ Less

Submitted 13 February, 2024; originally announced February 2024.

Comments: 7 pages, 4 figures

arXiv:2402.08458 [pdf, other]

The SRG/eROSITA All-Sky Survey: Cosmology Constraints from Cluster Abundances in the Western Galactic Hemisphere

Authors: V. Ghirardini, E. Bulbul, E. Artis, N. Clerc, C. Garrel, S. Grandis, M. Kluge, A. Liu, Y. E. Bahar, F. Balzer, I. Chiu, J. Comparat, D. Gruen, F. Kleinebreil, S. Krippendorf, A. Merloni, K. Nandra, N. Okabe, F. Pacaud, P. Predehl, M. E. Ramos-Ceja, T. H. Reiprich, J. S. Sanders, T. Schrabback, R. Seppi , et al. (24 additional authors not shown)

Abstract: The cluster mass function traces the growth of linear density perturbations and provides valuable insights into the growth of structures, the nature of dark matter, and the cosmological parameters governing the Universe. The primary science goal of eROSITA, on board the {\it Spectrum Roentgen Gamma (SRG)} mission, launched in 2019, is to constrain cosmology through the evolution of cluster mass fu… ▽ More The cluster mass function traces the growth of linear density perturbations and provides valuable insights into the growth of structures, the nature of dark matter, and the cosmological parameters governing the Universe. The primary science goal of eROSITA, on board the {\it Spectrum Roentgen Gamma (SRG)} mission, launched in 2019, is to constrain cosmology through the evolution of cluster mass function. In this paper, we present the cosmological constraints obtained from 5259 clusters of galaxies detected over an area of 12791~deg$^2$ in the Western Galactic Hemisphere of the eROSITA's first All-Sky Survey (eRASS1). The common footprint region between the eROSITA Survey and DES, KiDS, and HSC surveys is used for calibration of the scaling between X-ray count rate and their total mass through measurements of their weak gravitational lensing signal. eRASS1 cluster abundances constrain the $Λ$CDM parameters, which are the energy density of the total matter to $Ω_{\mathrm{m}}=0.29^{+0.01}_{-0.02}$, and the normalization of the density fluctuations to $σ_8=0.88\pm0.02$ and their combination yields $S_8=σ_8 (Ω_\mathrm{m} / 0.3)^{0.5}=0.86\pm0.01$, consistent and at a similar precision with the state-of-the-art CMB measurements. eRASS1 cosmological experiment places a most stringent upper limit on the summed masses of left-handed light neutrinos to $\sum m_ν< 0.22\mathrm{~eV}$ (95\% confidence interval). Combining eRASS1 cluster abundance measurements with CMB and ground-based neutrino oscillation experiments, we measure the summed neutrino masses to be $\sum m_ν=0.08_{-0.02}^{+0.03}\mathrm{~eV}$ or $\sum m_ν=0.12_{-0.01}^{+0.03}\mathrm{~eV}$ depending on the mass hierarchy scenario for neutrino eigenstates. eRASS1 cluster abundances significantly improve the constraints on the dark energy equation of state parameter to $w=-1.12\pm0.12$. (ABRIDGED) △ Less

Submitted 13 February, 2024; originally announced February 2024.

Comments: 43 pages, 22 figures, submitted to A&A

arXiv:2402.07429 [pdf]

doi 10.5281/zenodo.10635489

Particle Filter SLAM for Vehicle Localization

Authors: Tianrui Liu, Changxin Xu, Yuxin Qiao, Chufeng Jiang, Jiqiang Yu

Abstract: Simultaneous Localization and Map** (SLAM) presents a formidable challenge in robotics, involving the dynamic construction of a map while concurrently determining the precise location of the robotic agent within an unfamiliar environment. This intricate task is further compounded by the inherent "chicken-and-egg" dilemma, where accurate map** relies on a dependable estimation of the robot's lo… ▽ More Simultaneous Localization and Map** (SLAM) presents a formidable challenge in robotics, involving the dynamic construction of a map while concurrently determining the precise location of the robotic agent within an unfamiliar environment. This intricate task is further compounded by the inherent "chicken-and-egg" dilemma, where accurate map** relies on a dependable estimation of the robot's location, and vice versa. Moreover, the computational intensity of SLAM adds an additional layer of complexity, making it a crucial yet demanding topic in the field. In our research, we address the challenges of SLAM by adopting the Particle Filter SLAM method. Our approach leverages encoded data and fiber optic gyro (FOG) information to enable precise estimation of vehicle motion, while lidar technology contributes to environmental perception by providing detailed insights into surrounding obstacles. The integration of these data streams culminates in the establishment of a Particle Filter SLAM framework, representing a key endeavor in this paper to effectively navigate and overcome the complexities associated with simultaneous localization and map** in robotic systems. △ Less

Submitted 19 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

Comments: 6 pages, Journal of Industrial Engineering and Applied Science

Journal ref: Journal of Industrial Engineering and Applied Science 2024

arXiv:2402.07422 [pdf]

doi 10.5281/zenodo.10635481

News Recommendation with Attention Mechanism

Authors: Tianrui Liu, Changxin Xu, Yuxin Qiao, Chufeng Jiang, Weisheng Chen

Abstract: This paper explores the area of news recommendation, a key component of online information sharing. Initially, we provide a clear introduction to news recommendation, defining the core problem and summarizing current methods and notable recent algorithms. We then present our work on implementing the NRAM (News Recommendation with Attention Mechanism), an attention-based approach for news recommend… ▽ More This paper explores the area of news recommendation, a key component of online information sharing. Initially, we provide a clear introduction to news recommendation, defining the core problem and summarizing current methods and notable recent algorithms. We then present our work on implementing the NRAM (News Recommendation with Attention Mechanism), an attention-based approach for news recommendation, and assess its effectiveness. Our evaluation shows that NRAM has the potential to significantly improve how news content is personalized for users on digital news platforms. △ Less

Submitted 19 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

Comments: 7 pages, Journal of Industrial Engineering and Applied Science

Journal ref: Journal of Industrial Engineering and Applied Science 2024

arXiv:2402.07394 [pdf, other]

Cloud-cloud collision and cluster formation in the W5-NW complex

Authors: Namitha Issac, Anindya Saha, Saanika Choudhary, Aakash Chaudhary, Anandmayee Tej, Hong-Li Liu, Tie Liu, Maheswar Gopinathan

Abstract: We present a detailed structural and gas kinematic study of the star-forming complex W5-NW. A cloud-cloud collision scenario unravels with evidences of collision induced star and cluster formation. Various signatures of cloud-cloud collision such as "complementary distribution" and "bridging-features" are explored. At the colliding region, the two clouds have complementary morphologies, where W5-N… ▽ More We present a detailed structural and gas kinematic study of the star-forming complex W5-NW. A cloud-cloud collision scenario unravels with evidences of collision induced star and cluster formation. Various signatures of cloud-cloud collision such as "complementary distribution" and "bridging-features" are explored. At the colliding region, the two clouds have complementary morphologies, where W5-NWb has a filamentary key-like shape which fits into the U-shaped cavity in W5-NWa that behaves like a keyhole. The interaction region between the two clouds is characterised by bridging features with intermediate velocities connecting the two clouds. A skewed V-shaped bridging feature is also detected at the site of collision. A robust picture of the molecular gas distribution highlighting the bridges is seen in the position-position-velocity diagram obtained using the SCOUSEPY algorithm. Star cluster formation with an over-density of Class I and Class II young stellar objects is also seen towards this cloud complex, likely triggered by the cloud collision event. △ Less

Submitted 11 February, 2024; originally announced February 2024.

Comments: 16 pages, 9 figures, 2 appendices, Accepted for publication in AJ

arXiv:2402.07224 [pdf, other]

Arbitrarily configurable nonlinear topological modes

Authors: Kai Bai, Jia-Zheng Li, Tian-Rui Liu, Liang Fang, Duanduan Wan, Meng Xiao

Abstract: Topological modes (TMs) are typically localized at boundaries, interfaces and dislocations, and exponentially decay into the bulk of a large enough lattice. Recently, the non-Hermitian skin effect has been leveraged to delocalize the wavefunctions of TMs from the boundary and thus to increase the capacity of TMs dramatically. Here, we explore the capability of nonlinearity in designing and reconfi… ▽ More Topological modes (TMs) are typically localized at boundaries, interfaces and dislocations, and exponentially decay into the bulk of a large enough lattice. Recently, the non-Hermitian skin effect has been leveraged to delocalize the wavefunctions of TMs from the boundary and thus to increase the capacity of TMs dramatically. Here, we explore the capability of nonlinearity in designing and reconfiguring the wavefunctions of TMs. With growing intensity, wavefunctions of these in-gap nonlinear TMs undergo an initial deviation from exponential decay, gradually merge into arbitrarily designable plateaus, then encompass the entire nonlinear domain, and eventually concentrate at the nonlinear boundary. Intriguingly, such extended nonlinear TMs are still robust against defects and disorders, and stable in dynamics under external excitation. Advancing the conceptual understanding of the nonlinear TMs, our results open new avenues for increasing the capacity of TMs and develo** compact and reconfigurable topological devices. △ Less

Submitted 11 February, 2024; originally announced February 2024.

Comments: 3 figures in the main text

arXiv:2402.07011 [pdf, other]

FedImpro: Measuring and Improving Client Update in Federated Learning

Authors: Zhenheng Tang, Yonggang Zhang, Shaohuai Shi, Xinmei Tian, Tongliang Liu, Bo Han, Xiaowen Chu

Abstract: Federated Learning (FL) models often experience client drift caused by heterogeneous data, where the distribution of data differs across clients. To address this issue, advanced research primarily focuses on manipulating the existing gradients to achieve more consistent client models. In this paper, we present an alternative perspective on client drift and aim to mitigate it by generating improved… ▽ More Federated Learning (FL) models often experience client drift caused by heterogeneous data, where the distribution of data differs across clients. To address this issue, advanced research primarily focuses on manipulating the existing gradients to achieve more consistent client models. In this paper, we present an alternative perspective on client drift and aim to mitigate it by generating improved local models. First, we analyze the generalization contribution of local training and conclude that this generalization contribution is bounded by the conditional Wasserstein distance between the data distribution of different clients. Then, we propose FedImpro, to construct similar conditional distributions for local training. Specifically, FedImpro decouples the model into high-level and low-level components, and trains the high-level portion on reconstructed feature distributions. This approach enhances the generalization contribution and reduces the dissimilarity of gradients in FL. Experimental results show that FedImpro can help FL defend against data heterogeneity and enhance the generalization performance of the model. △ Less

Submitted 14 March, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

arXiv:2402.06116 [pdf, other]

LLMs for Coding and Robotics Education

Authors: Peng Shu, Huaqin Zhao, Hanqi Jiang, Yiwei Li, Shaochen Xu, Yi Pan, Zihao Wu, Zhengliang Liu, Guoyu Lu, Le Guan, Gong Chen, Xianqiao Wang Tianming Liu

Abstract: Large language models and multimodal large language models have revolutionized artificial intelligence recently. An increasing number of regions are now embracing these advanced technologies. Within this context, robot coding education is garnering increasing attention. To teach young children how to code and compete in robot challenges, large language models are being utilized for robot code expl… ▽ More Large language models and multimodal large language models have revolutionized artificial intelligence recently. An increasing number of regions are now embracing these advanced technologies. Within this context, robot coding education is garnering increasing attention. To teach young children how to code and compete in robot challenges, large language models are being utilized for robot code explanation, generation, and modification. In this paper, we highlight an important trend in robot coding education. We test several mainstream large language models on both traditional coding tasks and the more challenging task of robot code generation, which includes block diagrams. Our results show that GPT-4V outperforms other models in all of our tests but struggles with generating block diagram images. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: 20 pages, 6 figures, 1 table

arXiv:2402.04792 [pdf, other]

Direct Language Model Alignment from Online AI Feedback

Authors: Shangmin Guo, Biao Zhang, Tianlin Liu, Tianqi Liu, Misha Khalman, Felipe Llinares, Alexandre Rame, Thomas Mesnard, Yao Zhao, Bilal Piot, Johan Ferret, Mathieu Blondel

Abstract: Direct alignment from preferences (DAP) methods, such as DPO, have recently emerged as efficient alternatives to reinforcement learning from human feedback (RLHF), that do not require a separate reward model. However, the preference datasets used in DAP methods are usually collected ahead of training and never updated, thus the feedback is purely offline. Moreover, responses in these datasets are… ▽ More Direct alignment from preferences (DAP) methods, such as DPO, have recently emerged as efficient alternatives to reinforcement learning from human feedback (RLHF), that do not require a separate reward model. However, the preference datasets used in DAP methods are usually collected ahead of training and never updated, thus the feedback is purely offline. Moreover, responses in these datasets are often sampled from a language model distinct from the one being aligned, and since the model evolves over training, the alignment phase is inevitably off-policy. In this study, we posit that online feedback is key and improves DAP methods. Our method, online AI feedback (OAIF), uses an LLM as annotator: on each training iteration, we sample two responses from the current model and prompt the LLM annotator to choose which one is preferred, thus providing online feedback. Despite its simplicity, we demonstrate via human evaluation in several tasks that OAIF outperforms both offline DAP and RLHF methods. We further show that the feedback leveraged in OAIF is easily controllable, via instruction prompts to the LLM annotator. △ Less

Submitted 29 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

Comments: 18 pages, 9 figures, 4 tables

arXiv:2402.04033 [pdf, other]

On provable privacy vulnerabilities of graph representations

Authors: Ruofan Wu, Guanhua Fang, Qiying Pan, Mingyang Zhang, Tengfei Liu, Weiqiang Wang

Abstract: Graph representation learning (GRL) is critical for extracting insights from complex network structures, but it also raises security concerns due to potential privacy vulnerabilities in these representations. This paper investigates the structural vulnerabilities in graph neural models where sensitive topological information can be inferred through edge reconstruction attacks. Our research primari… ▽ More Graph representation learning (GRL) is critical for extracting insights from complex network structures, but it also raises security concerns due to potential privacy vulnerabilities in these representations. This paper investigates the structural vulnerabilities in graph neural models where sensitive topological information can be inferred through edge reconstruction attacks. Our research primarily addresses the theoretical underpinnings of similarity-based edge reconstruction attacks (SERA), furnishing a non-asymptotic analysis of their reconstruction capacities. Moreover, we present empirical corroboration indicating that such attacks can perfectly reconstruct sparse graphs as graph size increases. Conversely, we establish that sparsity is a critical factor for SERA's effectiveness, as demonstrated through analysis and experiments on (dense) stochastic block models. Finally, we explore the resilience of private graph representations produced via noisy aggregation (NAG) mechanism against SERA. Through theoretical analysis and empirical assessments, we affirm the mitigation of SERA using NAG . In parallel, we also empirically delineate instances wherein SERA demonstrates both efficacy and deficiency in its capacity to function as an instrument for elucidating the trade-off between privacy and utility. △ Less

Submitted 23 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

arXiv:2402.03941 [pdf, other]

Discovery of the Hidden World with Large Language Models

Authors: Chenxi Liu, Yongqiang Chen, Tongliang Liu, Mingming Gong, James Cheng, Bo Han, Kun Zhang

Abstract: Science originates with discovering new causal knowledge from a combination of known facts and observations. Traditional causal discovery approaches mainly rely on high-quality measured variables, usually given by human experts, to find causal relations. However, the causal variables are usually unavailable in a wide range of real-world applications. The rise of large language models (LLMs) that a… ▽ More Science originates with discovering new causal knowledge from a combination of known facts and observations. Traditional causal discovery approaches mainly rely on high-quality measured variables, usually given by human experts, to find causal relations. However, the causal variables are usually unavailable in a wide range of real-world applications. The rise of large language models (LLMs) that are trained to learn rich knowledge from the massive observations of the world, provides a new opportunity to assist with discovering high-level hidden variables from the raw observational data. Therefore, we introduce COAT: Causal representatiOn AssistanT. COAT incorporates LLMs as a factor proposer that extracts the potential causal factors from unstructured data. Moreover, LLMs can also be instructed to provide additional information used to collect data values (e.g., annotation criteria) and to further parse the raw unstructured data into structured data. The annotated data will be fed to a causal learning module (e.g., the FCI algorithm) that provides both rigorous explanations of the data, as well as useful feedback to further improve the extraction of causal factors by LLMs. We verify the effectiveness of COAT in uncovering the underlying causal system with two case studies of review rating analysis and neuropathic diagnosis. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: Preliminary version of an ongoing project; Chenxi and Yongqiang contributed equally; 26 pages, 41 figures; Project page: https://causalcoat.github.io/

arXiv:2402.03921 [pdf, other]

Large Language Models to Enhance Bayesian Optimization

Authors: Tennison Liu, Nicolás Astorga, Nabeel Seedat, Mihaela van der Schaar

Abstract: Bayesian optimization (BO) is a powerful approach for optimizing complex and expensive-to-evaluate black-box functions. Its importance is underscored in many applications, notably including hyperparameter tuning, but its efficacy depends on efficiently balancing exploration and exploitation. While there has been substantial progress in BO methods, striking this balance remains a delicate process.… ▽ More Bayesian optimization (BO) is a powerful approach for optimizing complex and expensive-to-evaluate black-box functions. Its importance is underscored in many applications, notably including hyperparameter tuning, but its efficacy depends on efficiently balancing exploration and exploitation. While there has been substantial progress in BO methods, striking this balance remains a delicate process. In this light, we present LLAMBO, a novel approach that integrates the capabilities of Large Language Models (LLM) within BO. At a high level, we frame the BO problem in natural language, enabling LLMs to iteratively propose and evaluate promising solutions conditioned on historical evaluations. More specifically, we explore how combining contextual understanding, few-shot learning proficiency, and domain knowledge of LLMs can improve model-based BO. Our findings illustrate that LLAMBO is effective at zero-shot warmstarting, and enhances surrogate modeling and candidate sampling, especially in the early stages of search when observations are sparse. Our approach is performed in context and does not require LLM finetuning. Additionally, it is modular by design, allowing individual components to be integrated into existing BO frameworks, or function cohesively as an end-to-end method. We empirically validate LLAMBO's efficacy on the problem of hyperparameter tuning, highlighting strong empirical performance across a range of diverse benchmarks, proprietary, and synthetic tasks. △ Less

Submitted 8 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

Comments: Accepted as Poster at ICLR2024

arXiv:2402.03829 [pdf, ps, other]

Precise Measurement of Born Cross Sections for $e^+e^-\to D\bar{D}$ and Observation of One Structure between $\sqrt{s} = 3.80-4.95$ GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

Abstract: Using data samples collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 3.80 to 4.95 GeV, corresponding to an integrated luminosity of 20 fb$^{-1}$, a measurement of Born cross sections for the $e^+e^-\to D^{0}\bar{D}^{0}$ and $D^{+}D^{-}$ processes is presented with unprecedented precision. By performing a simultaneous fit to the dressed cross sections… ▽ More Using data samples collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 3.80 to 4.95 GeV, corresponding to an integrated luminosity of 20 fb$^{-1}$, a measurement of Born cross sections for the $e^+e^-\to D^{0}\bar{D}^{0}$ and $D^{+}D^{-}$ processes is presented with unprecedented precision. By performing a simultaneous fit to the dressed cross sections for both processes, one possible new structure around 3.9 GeV/$c^2$ is observed for the first time, in addition to seven known resonances $ψ(3770)$, $ψ(4040)$, $ψ(4160)$, $Y(4230)$, $Y(4360)$, $ψ(4415)$, and $Y(4660)$. These results offer crucial experimental insights into the nature of hadron production in the open charm region. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: 9 pages, 4 figures, 1 tables, 1 Supplemental_Material

arXiv:2402.03094 [pdf, other]

Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector

Authors: Yuqian Fu, Yu Wang, Yixuan Pan, Lian Huai, Xingyu Qiu, Zeyu Shangguan, Tong Liu, Yanwei Fu, Luc Van Gool, Xingqun Jiang

Abstract: This paper studies the challenging cross-domain few-shot object detection (CD-FSOD), aiming to develop an accurate object detector for novel domains with minimal labeled examples. While transformer-based open-set detectors, such as DE-ViT, show promise in traditional few-shot object detection, their generalization to CD-FSOD remains unclear: 1) can such open-set detection methods easily generalize… ▽ More This paper studies the challenging cross-domain few-shot object detection (CD-FSOD), aiming to develop an accurate object detector for novel domains with minimal labeled examples. While transformer-based open-set detectors, such as DE-ViT, show promise in traditional few-shot object detection, their generalization to CD-FSOD remains unclear: 1) can such open-set detection methods easily generalize to CD-FSOD? 2) If not, how can models be enhanced when facing huge domain gaps? To answer the first question, we employ measures including style, inter-class variance (ICV), and indefinable boundaries (IB) to understand the domain gap. Based on these measures, we establish a new benchmark named CD-FSOD to evaluate object detection methods, revealing that most of the current approaches fail to generalize across domains. Technically, we observe that the performance decline is associated with our proposed measures: style, ICV, and IB. Consequently, we propose several novel modules to address these issues. First, the learnable instance features align initial fixed instances with target categories, enhancing feature distinctiveness. Second, the instance reweighting module assigns higher importance to high-quality instances with slight IB. Third, the domain prompter encourages features resilient to different styles by synthesizing imaginary domains without altering semantic contents. These techniques collectively contribute to the development of the Cross-Domain Vision Transformer for CD-FSOD (CD-ViTO), significantly improving upon the base DE-ViT. Experimental results validate the efficacy of our model. All datasets, codes, and models will be released to the community. △ Less

Submitted 19 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

arXiv:2402.03046 [pdf, other]

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

Authors: Shengyi Huang, Quentin Gallouédec, Florian Felten, Antonin Raffin, Rousslan Fernand Julien Dossa, Yanxiao Zhao, Ryan Sullivan, Viktor Makoviychuk, Denys Makoviichuk, Mohamad H. Danesh, Cyril Roumégous, Jiayi Weng, Chufan Chen, Md Masudur Rahman, João G. M. Araújo, Guorui Quan, Daniel Tan, Timo Klein, Rujikorn Charakorn, Mark Towers, Yann Berthelot, Kinal Mehta, Dipam Chakraborty, Arjun KG, Valentin Charraut , et al. (8 additional authors not shown)

Abstract: In many Reinforcement Learning (RL) papers, learning curves are useful indicators to measure the effectiveness of RL algorithms. However, the complete raw data of the learning curves are rarely available. As a result, it is usually necessary to reproduce the experiments from scratch, which can be time-consuming and error-prone. We present Open RL Benchmark, a set of fully tracked RL experiments, i… ▽ More In many Reinforcement Learning (RL) papers, learning curves are useful indicators to measure the effectiveness of RL algorithms. However, the complete raw data of the learning curves are rarely available. As a result, it is usually necessary to reproduce the experiments from scratch, which can be time-consuming and error-prone. We present Open RL Benchmark, a set of fully tracked RL experiments, including not only the usual data such as episodic return, but also all algorithm-specific and system metrics. Open RL Benchmark is community-driven: anyone can download, use, and contribute to the data. At the time of writing, more than 25,000 runs have been tracked, for a cumulative duration of more than 8 years. Open RL Benchmark covers a wide range of RL libraries and reference implementations. Special care is taken to ensure that each experiment is precisely reproducible by providing not only the full parameters, but also the versions of the dependencies used to generate it. In addition, Open RL Benchmark comes with a command-line interface (CLI) for easy fetching and generating figures to present the results. In this document, we include two case studies to demonstrate the usefulness of Open RL Benchmark in practice. To the best of our knowledge, Open RL Benchmark is the first RL benchmark of its kind, and the authors hope that it will improve and facilitate the work of researchers in the field. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: Under review

arXiv:2402.02992 [pdf, other]

Decoding-time Realignment of Language Models

Authors: Tianlin Liu, Shangmin Guo, Leonardo Bianco, Daniele Calandriello, Quentin Berthet, Felipe Llinares, Jessica Hoffmann, Lucas Dixon, Michal Valko, Mathieu Blondel

Abstract: Aligning language models with human preferences is crucial for reducing errors and biases in these models. Alignment techniques, such as reinforcement learning from human feedback (RLHF), are typically cast as optimizing a tradeoff between human preference rewards and a proximity regularization term that encourages staying close to the unaligned model. Selecting an appropriate level of regularizat… ▽ More Aligning language models with human preferences is crucial for reducing errors and biases in these models. Alignment techniques, such as reinforcement learning from human feedback (RLHF), are typically cast as optimizing a tradeoff between human preference rewards and a proximity regularization term that encourages staying close to the unaligned model. Selecting an appropriate level of regularization is critical: insufficient regularization can lead to reduced model capabilities due to reward hacking, whereas excessive regularization hinders alignment. Traditional methods for finding the optimal regularization level require retraining multiple models with varying regularization strengths. This process, however, is resource-intensive, especially for large models. To address this challenge, we propose decoding-time realignment (DeRa), a simple method to explore and evaluate different regularization strengths in aligned models without retraining. DeRa enables control over the degree of alignment, allowing users to smoothly transition between unaligned and aligned models. It also enhances the efficiency of hyperparameter tuning by enabling the identification of effective regularization strengths using a validation dataset. △ Less

Submitted 24 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

Comments: In Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

arXiv:2402.02153 [pdf]

Talbot-like pattern evolution in complex structured light from unitary transformation

Authors: Zheng-Xiao Cao, Ting-Ting Liu, Bo-Zhao, Carmelo Rosales-Guzmán, Jun Liu, Zhi-Han Zhu

Abstract: Astigmatic unitary transformations allow for the adiabatic connections of all feasible states of paraxial Gaussian beams on the same modal sphere, i.e., Hermite-Laguerre-Gaussian (HLG) modes. Here, we present a comprehensive investigation into the unitary modal evolution of complex structured Gaussian beams, comprised by HLG modes from disparate modal spheres, via astigmatic transformation. The no… ▽ More Astigmatic unitary transformations allow for the adiabatic connections of all feasible states of paraxial Gaussian beams on the same modal sphere, i.e., Hermite-Laguerre-Gaussian (HLG) modes. Here, we present a comprehensive investigation into the unitary modal evolution of complex structured Gaussian beams, comprised by HLG modes from disparate modal spheres, via astigmatic transformation. The non-synchronized higher-order geometric phases in cyclic transformations originates a Talbot-effect-like modal evolution in the superposition state of these HLG modes, resulting in pattern variations and revivals in transformations with specific geodesic loops. Using Ince-Gaussian modes as an illustrative example, we systematically analyze and experimentally corroborate the beamforming mechanism behind the pattern evolution. Our results outline a generic modal conversion theory of structured Gaussian beams via astigmatic unitary transformation, offering a new approach for sha** spatial modal structure. These findings may inspire a wide variety of applications based on structured light. △ Less

Submitted 21 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

Comments: 7 pages, 5 figures

arXiv:2402.01993 [pdf, other]

Measurement of the Electromagnetic Transition Form-factors in the decays $η'\rightarrowπ^+π^-l^+l^-$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (618 additional authors not shown)

Abstract: With a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η'\rightarrowπ^+π^-l^+l^-(l=e,$ $μ)$ via the process $J/ψ\rightarrowγη'$. The branching fractions are measured to be $\mathcal{B}(η'\rightarrowπ^+π^-e^+e^-)=(2.45\pm0.02(\rm{stat.})\pm0.08(\rm{syst.})) \times10^{-3}$ and… ▽ More With a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η'\rightarrowπ^+π^-l^+l^-(l=e,$ $μ)$ via the process $J/ψ\rightarrowγη'$. The branching fractions are measured to be $\mathcal{B}(η'\rightarrowπ^+π^-e^+e^-)=(2.45\pm0.02(\rm{stat.})\pm0.08(\rm{syst.})) \times10^{-3}$ and $\mathcal{B}(η'\rightarrowπ^+π^-μ^+μ^-)=(2.16\pm0.12(\rm{stat.})\pm0.06(\rm{syst.}))\times10^{-5}$, and the ratio is $\frac{\mathcal{B}(η'\rightarrowπ^{+}π^{-}e^{+}e^{-})}{\mathcal{B}(η'\rightarrowπ^{+}π^{-}μ^{+}μ^{-})} = 113.4\pm0.9(\rm{stat.})\pm3.7(\rm{syst.})$. In addition, by combining the $η'\rightarrowπ^+π^-e^+e^-$ and $η'\rightarrowπ^+π^-μ^+μ^-$ decays, the slope parameter of the electromagnetic transition form factor is measured to be $b_{η'}=1.30\pm0.19\ (\mathrm{GeV}/c^{2})^{-2}$, which is consistent with previous measurements from BESIII and theoretical predictions from the VMD model. The asymmetry in the angle between the $π^+π^-$ and $l^+l^-$ decay planes, which has the potential to reveal the $CP$-violation originating from an unconventional electric dipole transition, is also investigated. The asymmetry parameters are determined to be $\mathcal{A}_{CP}(η'\rightarrowπ^+π^-e^+e^-)=(-0.21\pm0.73(\rm{stat.})\pm0.01(\rm{syst.}))\%$ and $\mathcal{A}_{CP}(η'\rightarrowπ^+π^-μ^+μ^-)=(0.62\pm4.71(\rm{stat.})\pm0.08(\rm{syst.}))\%$, implying that no evidence of $CP$-violation is observed at the present statistics. Finally, an axion-like particle is searched for via the decay $η'\rightarrowπ^+π^-a, a\rightarrow e^+e^-$, and upper limits of the branching fractions are presented for the mass assumptions of the axion-like particle in the range of $0-500\ \mathrm{MeV}/c^{2}$. △ Less

Submitted 2 February, 2024; originally announced February 2024.

arXiv:2402.01978 [pdf]

Magnetosheath ion field-aligned anisotropy and implications for ion leakage to the foreshock

Authors: Terry Zixu Liu, Vassilis Angelopoulos, Hui Zhang, Andrew Vu, Joachim Raeder

Abstract: The ion foreshock is highly dynamic, disturbing the bow shock and the magnetosphere-ionosphere system. To forecast foreshock-driven space weather effects, it is necessary to model foreshock ions as a function of upstream shock parameters. Case studies in the accompanying paper show that magnetosheath ions sometimes exhibit strong field-aligned anisotropy towards the upstream direction, which may b… ▽ More The ion foreshock is highly dynamic, disturbing the bow shock and the magnetosphere-ionosphere system. To forecast foreshock-driven space weather effects, it is necessary to model foreshock ions as a function of upstream shock parameters. Case studies in the accompanying paper show that magnetosheath ions sometimes exhibit strong field-aligned anisotropy towards the upstream direction, which may be responsible for enhancing magnetosheath leakage and therefore foreshock ion density. To understand the conditions leading to such an anisotropy and the potential for enhanced leakage, we perform case studies and a statistical study of magnetosheath and foreshock region data surrounding ~500 THEMIS bow shock crossings. We quantify the anisotropy using the heat flux along the field-aligned direction. We show that the strong field-aligned heat flux persists across the entire magnetosheath from the magnetopause to the bow shock. Ion distribution functions reveal that the strong heat flux is caused by a secondary thermal population. We find that stronger anisotropy events exhibit heat flux preferentially towards the upstream direction near the bow shock and occur under larger IMF strength and larger solar wind dynamic pressure and/or energy flux. Additionally, we show that near the bow shock, magnetosheath leakage is a significant contributor to foreshock ions, and through enhancing the leakage the magnetosheath ion anisotropy can modulate the foreshock ion velocity and density. Our results imply that likely due to field line dra** and compression against the magnetopause that leads to a directional mirror force, modeling the foreshock ions necessitates a more global accounting of downstream conditions. △ Less

Submitted 2 February, 2024; originally announced February 2024.

Comments: Under review by JGR Space Physics

arXiv:2402.01878 [pdf, other]

LiPO: Listwise Preference Optimization through Learning-to-Rank

Authors: Tianqi Liu, Zhen Qin, Junru Wu, Jiaming Shen, Misha Khalman, Rishabh Joshi, Yao Zhao, Mohammad Saleh, Simon Baumgartner, Jialu Liu, Peter J. Liu, Xuanhui Wang

Abstract: Aligning language models (LMs) with curated human feedback is critical to control their behaviors in real-world applications. Several recent policy optimization methods, such as DPO and SLiC, serve as promising alternatives to the traditional Reinforcement Learning from Human Feedback (RLHF) approach. In practice, human feedback often comes in a format of a ranked list over multiple responses to a… ▽ More Aligning language models (LMs) with curated human feedback is critical to control their behaviors in real-world applications. Several recent policy optimization methods, such as DPO and SLiC, serve as promising alternatives to the traditional Reinforcement Learning from Human Feedback (RLHF) approach. In practice, human feedback often comes in a format of a ranked list over multiple responses to amortize the cost of reading prompt. Multiple responses can also be ranked by reward models or AI feedback. There lacks such a thorough study on directly fitting upon a list of responses. In this work, we formulate the LM alignment as a \textit{listwise} ranking problem and describe the LiPO framework, where the policy can potentially learn more effectively from a ranked list of plausible responses given the prompt. This view draws an explicit connection to Learning-to-Rank (LTR), where most existing preference optimization work can be mapped to existing ranking objectives. Following this connection, we provide an examination of ranking objectives that are not well studied for LM alignment with DPO and SLiC as special cases when list size is two. In particular, we highlight a specific method, LiPO-$λ$, which leverages a state-of-the-art \textit{listwise} ranking objective and weights each preference pair in a more advanced manner. We show that LiPO-$λ$ can outperform DPO variants and SLiC by a clear margin on several preference alignment tasks with both curated and real rankwise preference data. △ Less

Submitted 22 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

arXiv:2402.01349 [pdf, other]

Beyond the Answers: Reviewing the Rationality of Multiple Choice Question Answering for the Evaluation of Large Language Models

Authors: Haochun Wang, Sendong Zhao, Zewen Qiang, Nuwa Xi, Bing Qin, Ting Liu

Abstract: In the field of natural language processing (NLP), Large Language Models (LLMs) have precipitated a paradigm shift, markedly enhancing performance in natural language generation tasks. Despite these advancements, the comprehensive evaluation of LLMs remains an inevitable challenge for the community. Recently, the utilization of Multiple Choice Question Answering (MCQA) as a benchmark for LLMs has… ▽ More In the field of natural language processing (NLP), Large Language Models (LLMs) have precipitated a paradigm shift, markedly enhancing performance in natural language generation tasks. Despite these advancements, the comprehensive evaluation of LLMs remains an inevitable challenge for the community. Recently, the utilization of Multiple Choice Question Answering (MCQA) as a benchmark for LLMs has gained considerable traction. This study first investigates the limitations of MCQA as an evaluation method for LLMs and then analyzes the fundamental reason for the limitations of MCQA, that while LLMs may select the correct answers, it is possible that they also recognize other wrong options as correct. Finally, we propose a dataset augmenting method for Multiple-Choice Questions (MCQs), MCQA+, that can more accurately reflect the performance of the model, which underscores the need for more robust evaluation mechanisms in assessing the performance of LLMs. △ Less

Submitted 29 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: 17 pages, 8 figures

arXiv:2402.01060 [pdf, other]

doi 10.1103/PhysRevApplied.21.L041002

High-volume tunable resonator for axion searches above 7 GHz

Authors: Taj A. Dyson, Chelsea L. Bartram, Ashley Davidson, Jonah B. Ezekiel, Laura M. Futamura, Tongtian Liu, Chao-Lin Kuo

Abstract: We present results from the first experimental demonstration of a tunable thin-shell axion haloscope. This novel geometry decouples the overall volume of the cavity-based resonator from its resonant frequency, thereby evading the steep sensitivity degradation at high-frequencies. An aluminum $2.6$ L ($41$ $λ^3$) prototype which tunes from $7.1$ to $8.0$ GHz was fabricated and characterized at room… ▽ More We present results from the first experimental demonstration of a tunable thin-shell axion haloscope. This novel geometry decouples the overall volume of the cavity-based resonator from its resonant frequency, thereby evading the steep sensitivity degradation at high-frequencies. An aluminum $2.6$ L ($41$ $λ^3$) prototype which tunes from $7.1$ to $8.0$ GHz was fabricated and characterized at room temperature. An axion-sensitive, straightforwardly tunable $\mathrm{TM}$$_{010}$ mode is clearly identified with a room temperature quality factor, $Q$, of $\sim$$5,000$. The on-resonance $E$-field distribution is mapped and found to agree with numerical calculations. Anticipating future cryogenic operation, we develop an alignment protocol relying only on rf measurements of the cavity, maintaining a form factor of $0.57$ across the full tuning range. These measurements demonstrate the feasibility of cavity-based haloscopes with operating volume $V\ggλ^3$. We discuss plans for future development and the parameters required for a thin-shell haloscope exploring the post-inflationary axion parameter space ($\sim$$4$ to $\sim$$30$ GHz) at DFSZ sensitivity. △ Less

Submitted 23 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: 6 pages, 7 figures; references added, Table 2 updated, acknowledgments made more descriptive, grammar copy edits, and title updated to published version

Journal ref: PhysRevApplied 21 (2024) L041002

arXiv:2402.00795 [pdf, other]

LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law

Authors: Toni J. B. Liu, Nicolas Boullé, Raphaël Sarfati, Christopher J. Earls

Abstract: Pretrained large language models (LLMs) are surprisingly effective at performing zero-shot tasks, including time-series forecasting. However, understanding the mechanisms behind such capabilities remains highly challenging due to the complexity of the models. We study LLMs' ability to extrapolate the behavior of dynamical systems whose evolution is governed by principles of physical interest. Our… ▽ More Pretrained large language models (LLMs) are surprisingly effective at performing zero-shot tasks, including time-series forecasting. However, understanding the mechanisms behind such capabilities remains highly challenging due to the complexity of the models. We study LLMs' ability to extrapolate the behavior of dynamical systems whose evolution is governed by principles of physical interest. Our results show that LLaMA 2, a language model trained primarily on texts, achieves accurate predictions of dynamical system time series without fine-tuning or prompt engineering. Moreover, the accuracy of the learned physical rules increases with the length of the input context window, revealing an in-context version of neural scaling law. Along the way, we present a flexible and efficient algorithm for extracting probability density functions of multi-digit numbers directly from LLMs. △ Less

Submitted 20 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

arXiv:2401.17873 [pdf, other]

Measurements of Normalized Differential Cross Sections of Inclusive $η$ Production in $e^{+}e^{-}$ Annihilation at Energy from 2.0000 to 3.6710 GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, D. Anderle, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (641 additional authors not shown)

Abstract: Using data samples collected with the BESIII detector operating at the BEPCII storage ring, the cross section of the inclusive process $e^{+}e^{-} \to η+ X$, normalized by the total cross section of $e^{+}e^{-} \to \text{hadrons}$, is measured at eight center-of-mass energy points from 2.0000 GeV to 3.6710 GeV. These are the first measurements with momentum dependence in this energy region. Our me… ▽ More Using data samples collected with the BESIII detector operating at the BEPCII storage ring, the cross section of the inclusive process $e^{+}e^{-} \to η+ X$, normalized by the total cross section of $e^{+}e^{-} \to \text{hadrons}$, is measured at eight center-of-mass energy points from 2.0000 GeV to 3.6710 GeV. These are the first measurements with momentum dependence in this energy region. Our measurement shows a significant discrepancy from calculations with the existing fragmentation functions. To address this discrepancy, a new QCD analysis is performed at the next-to-next-to-leading order with hadron mass corrections and higher twist effects, which can explain both the established high-energy data and our measurements reasonably well. △ Less

Submitted 31 January, 2024; originally announced January 2024.

Comments: 9 pages, 2 figures

arXiv:2401.17538 [pdf, other]

Post-Quantum Cryptography for Internet of Things: A Survey on Performance and Optimization

Authors: Tao Liu, Gowri Ramachandran, Raja Jurdak

Abstract: Due to recent development in quantum computing, the invention of a large quantum computer is no longer a distant future. Quantum computing severely threatens modern cryptography, as the hard mathematical problems beneath classic public-key cryptosystems can be solved easily by a sufficiently large quantum computer. As such, researchers have proposed PQC based on problems that even quantum computer… ▽ More Due to recent development in quantum computing, the invention of a large quantum computer is no longer a distant future. Quantum computing severely threatens modern cryptography, as the hard mathematical problems beneath classic public-key cryptosystems can be solved easily by a sufficiently large quantum computer. As such, researchers have proposed PQC based on problems that even quantum computers cannot efficiently solve. Generally, post-quantum encryption and signatures can be hard to compute. This could potentially be a problem for IoT, which usually consist lightweight devices with limited computational power. In this paper, we survey existing literature on the performance for PQC in resource-constrained devices to understand the severeness of this problem. We also review recent proposals to optimize PQC algorithms for resource-constrained devices. Overall, we find that whilst PQC may be feasible for reasonably lightweight IoT, proposals for their optimization seem to lack standardization. As such, we suggest future research to seek coordination, in order to ensure an efficient and safe migration toward IoT for the post-quantum era. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 13 pages, 3 figures and 7 tables. Formatted version submitted to ACM Computer Surveys

arXiv:2401.17526 [pdf, other]

Power Characterization of Noisy Quantum Kernels

Authors: Yabo Wang, Bo Qi, Xin Wang, Tongliang Liu, Daoyi Dong

Abstract: Quantum kernel methods have been widely recognized as one of promising quantum machine learning algorithms that have potential to achieve quantum advantages. In this paper, we theoretically characterize the power of noisy quantum kernels and demonstrate that under global depolarization noise, for different input data the predictions of the optimal hypothesis inferred by the noisy quantum kernel ap… ▽ More Quantum kernel methods have been widely recognized as one of promising quantum machine learning algorithms that have potential to achieve quantum advantages. In this paper, we theoretically characterize the power of noisy quantum kernels and demonstrate that under global depolarization noise, for different input data the predictions of the optimal hypothesis inferred by the noisy quantum kernel approximately concentrate towards some fixed value. In particular, we depict the convergence rate in terms of the strength of quantum noise, the size of training samples, the number of qubits, the number of layers affected by quantum noises, as well as the number of measurement shots. Our results show that noises may make quantum kernel methods to only have poor prediction capability, even when the generalization error is small. Thus, we provide a crucial warning to employ noisy quantum kernel methods for quantum computation and the theoretical results can also serve as guidelines when develo** practical quantum kernel algorithms for achieving quantum advantages. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 3 figures

arXiv:2401.17487 [pdf]

doi 10.1088/1748-0221/9/02/C02007

The 120Gbps VCSEL Array Based Optical Transmitter (ATx) Development for the High-Luminosity LHC (HL-LHC) Experiments

Authors: Di Guo, Chonghan Liu, **ghong Chen, John Chramowicz, Binwei Deng, Datao Gong, Suen Hou, Ge **, Simon Kwan, Futian Liang, Xiaoting Li, Gang Liu, Tiankuan Liu, Alan Prosser, Da-Shung Su, **-Kun Teng, Tongye Xu, **gbo Ye, Xiandong Zhao, Annie C. Xiang, Hao Liang

Abstract: The integration of a Verticle Cavity Surface-Emitting Laser (VCSEL) array and a driving Application-Specific Integrated Circuit (ASIC) in a custom optical array transmitter module (ATx) for operation in the detector front-end is constructed, assembled and tested. The ATx provides 12 parallel channels with each channel operating at 10 Gbps. The optical transmitter eye diagram passes the eye mask an… ▽ More The integration of a Verticle Cavity Surface-Emitting Laser (VCSEL) array and a driving Application-Specific Integrated Circuit (ASIC) in a custom optical array transmitter module (ATx) for operation in the detector front-end is constructed, assembled and tested. The ATx provides 12 parallel channels with each channel operating at 10 Gbps. The optical transmitter eye diagram passes the eye mask and the bit-error rate (BER) less than 1E-12 transmission is achieved at 10 Gbps/ch. The overall insertion loss including the radiation induced attenuation is sufficiently low to meet the proposed link budget requirement. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 10 pages, 9 figures

arXiv:2401.17471 [pdf]

doi 10.1088/1748-0221/9/03/C03007

Optical Data Transmission ASICs for the High-Luminosity LHC (HL-LHC) Experiments

Authors: Xiaoting Li, Gang Liu, **ghong Chen, Binwei Deng, Datao Gong, Di Guo, Mengxun He, Suen Hou, Guangming Huang, Ge **, Hao Liang, Futian Liang, Chonghan Liu, Tiankuan Liu, Xiangming Sun, **-Kun Teng, Annie C. Xiang, **gbo Ye, Yang You, Xiandong Zhao

Abstract: We present the design and test results of two optical data transmission ASICs for the High-Luminosity LHC (HL-LHC) experiments. These ASICs include a two-channel serializer (LOCs2) and a single-channel Vertical Cavity Surface Emitting Laser (VCSEL) driver (LOCld1V2). Both ASICs are fabricated in a commercial 0.25-um Silicon-on-Sapphire (SoS) CMOS technology and operate at a data rate up to 8 Gbps… ▽ More We present the design and test results of two optical data transmission ASICs for the High-Luminosity LHC (HL-LHC) experiments. These ASICs include a two-channel serializer (LOCs2) and a single-channel Vertical Cavity Surface Emitting Laser (VCSEL) driver (LOCld1V2). Both ASICs are fabricated in a commercial 0.25-um Silicon-on-Sapphire (SoS) CMOS technology and operate at a data rate up to 8 Gbps per channel. The power consumption of LOCs2 and LOCld1V2 are 1.25 W and 0.27 W at 8-Gbps data rate, respectively. LOCld1V2 has been verified meeting the radiation-tolerance requirements for HL-LHC experiments. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 9 pages, 12 figures

arXiv:2401.17309 [pdf, other]

The Hot Circum-Galactic Medium in the eROSITA All Sky Survey II. Scaling Relations between X-ray Luminosity and Galaxies' Mass

Authors: Yi Zhang, Johan Comparat, Gabriele Ponti, Andrea Merloni, Kirpal Nandra, Frank Haberl, Nhut Truong, Annalisa Pillepich, Nicola Locatelli, Xiaoyuan Zhang, Jeremy Sanders, Xueying Zheng, Ang Liu, Paola Popesso, Teng Liu, Peter Predehl, Mara Salvato

Abstract: Understanding how the hot circum-galactic medium (CGM) properties relate to the galaxy's properties can constrain galaxy evolution models. We aim to measure the scaling relations between the X-ray luminosity of the hot CGM and the fundamental properties of a galaxy, i.e., its stellar mass and halo mass. We calculate the X-ray luminosity of the hot CGM based on the surface brightness profiles of ce… ▽ More Understanding how the hot circum-galactic medium (CGM) properties relate to the galaxy's properties can constrain galaxy evolution models. We aim to measure the scaling relations between the X-ray luminosity of the hot CGM and the fundamental properties of a galaxy, i.e., its stellar mass and halo mass. We calculate the X-ray luminosity of the hot CGM based on the surface brightness profiles of central galaxy samples measured in Zhang et al. (2024a, submitted) from Spectrum Roentgen Gamma (SRG)/eROSITA all-sky survey data. We relate the X-ray luminosity to the galaxies' stellar and halo mass. We compare the observed relations to the TNG, EAGLE, and SIMBA simulations. The hot CGM X-ray luminosity correlates with the galaxy's stellar mass ($M_*$). It increases from $2.1 \pm 1.3\times10^{39} \rm erg/s$ to $2.0 \pm 0.1\times10^{41} \rm erg/s$, when $\log(M_*)$ increases from 10.0 to 11.5. A double power law describes the correlation, with a break at $\log(M_*)=11.28\pm 0.03$ and a power-law index or $1.9\pm 0.2$ ($4.2\pm0.1$) below (above) the break. The hot CGM X-ray luminosity as a function of halo mass is measured within $\log(M_{\rm 500c})=11.3-13.7$, extending our knowledge of the scaling relation by more than two orders of magnitude. $L_{\rm X,CGM}$ increases with $M_{\rm 500c}$ from $2.7 \pm 0.9\times10^{39}\ \rm erg/s$ at $\log(M_{\rm 500c})=11.3$ to $9.2 \pm 0.4\times10^{41}\ \rm erg/s$ at $\log(M_{\rm 500c})=13.7$. The relation follows a power law of $\log(L_{\rm X,CGM})= (1.35\pm 0.04)\times \log(M_{\rm 500c})+(23.8\pm0.5)$. We find a general agreement between simulations and observation. We explore, at the low mass end, the average scaling relations between the CGM X-ray luminosity and the galaxy's stellar mass or halo mass, which constitutes a new benchmark for galaxy evolution models and feedback processes. △ Less

Submitted 29 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: 8 pages, 4 figures, submitted to A&A

arXiv:2401.17308 [pdf, other]

The Hot Circum-Galactic Medium in the eROSITA All Sky Survey I. X-ray Surface Brightness Profiles

Authors: Yi Zhang, Johan Comparat, Gabriele Ponti, Andrea Merloni, Kirpal Nandra, Frank Haberl, Nicola Locatelli, Xiaoyuan Zhang, Jeremy Sanders, Xueying Zheng, Ang Liu, Paola Popesso, Teng Liu, Nhut Truong, Annalisa Pillepich, Peter Predehl, Mara Salvato

Abstract: We apply a stacking technique to gain enough statistics to detect the hot CGM around galaxies. We use the X-ray data from the first four SRG/eROSITA all-sky surveys (eRASS:4). We discuss how the satellite galaxies could bias the stacking and carefully build the central galaxy samples based on the SDSS spectroscopic survey and halo-based group finder algorithm and isolated galaxy sample from the ni… ▽ More We apply a stacking technique to gain enough statistics to detect the hot CGM around galaxies. We use the X-ray data from the first four SRG/eROSITA all-sky surveys (eRASS:4). We discuss how the satellite galaxies could bias the stacking and carefully build the central galaxy samples based on the SDSS spectroscopic survey and halo-based group finder algorithm and isolated galaxy sample from the ninth data release of the DESI Legacy survey (LS DR9, photometric). We mask the detected X-ray point sources and carefully model the X-ray emission from the unresolved active galactic nuclei (AGN) and X-ray binaries (XRB) to obtain the X-ray emission from the hot CGM. The X-ray surface brightness profiles are measured for $\log(M_*)>10.0$ or $\log(M_{\rm 200m})>11.5$ central galaxies, and $\log(M_*)>9.5$ isolated galaxies. We detect the X-ray emission around MW-mass ($\log(M_*)=10.5-11.0$) and more massive central galaxies extending up to the virial radius ($R_{\rm vir}$). The signal-to-noise ratio of the extended emission around the MW-mass galaxy is about $8.1σ$ within $R_{\rm vir}$. We use the $β$ model to describe the X-ray surface brightness profiles of the hot CGM. We obtain a central surface brightness of $S_{\rm X,0}\approx 3.1\times10^{35}\rm erg/s/kpc^2$ and $β\approx0.42$ for MW-mass galaxy. We estimate the baryon budget of the hot CGM and obtain a value lower than the $ΛCDM$ cosmology predicted. Our results set a firm footing for the presence of a hot CGM around MW-mass and more massive central galaxies. These measurements constitute a new benchmark for galaxy evolution models and possible implementations of feedback processes therein. △ Less

Submitted 29 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: 21 pages, 12 figures, submitted to A&A

arXiv:2401.17306 [pdf, other]

The SRG/eROSITA all-sky survey: Hard X-ray selected Active Galactic Nuclei

Authors: Sophia G. H. Waddell, J. Buchner, K. Nandra, M. Salvato, A. Merloni, I. Gauger, Th. Boller, R. Seppi, J. Wolf, T. Liu, M. Brusa, J. Comparat, T. Dwelly, Z. Igo, B. Musiimenta

Abstract: The eROSITA instrument aboard the Spectrum Roentgen Gamma (SRG) satellite has performed its first all-sky survey between December 2019 and June 2020. This paper presents the resulting hard X-ray (2.3-5 keV) sample, the first created from an all-sky imaging survey in the 2-8 keV band, for sources within western galactic sky. The 5466 hard X-ray selected sources detected with eROSITA are presented a… ▽ More The eROSITA instrument aboard the Spectrum Roentgen Gamma (SRG) satellite has performed its first all-sky survey between December 2019 and June 2020. This paper presents the resulting hard X-ray (2.3-5 keV) sample, the first created from an all-sky imaging survey in the 2-8 keV band, for sources within western galactic sky. The 5466 hard X-ray selected sources detected with eROSITA are presented and discussed. The Bayesian statistics-based code NWAY is used to identify the counterparts for the X-ray sources. These sources are classified based on their multiwavelength properties, and the literature is searched to identify spectroscopic redshifts, which further inform the source classification. A total of 2547 sources are found to have good-quality counterparts, and 111 of these are detected only in the hard band. Comparing with other hard X-ray selected surveys, the eROSITA hard sample covers a larger redshift range and probes dimmer sources, providing a complementary and expanded sample as compared to Swift-BAT. Examining the column density distribution of missed and detected eROSITA sources present in the follow-up catalog of Swift BAT 70 month sources, it is demonstrated that eROSITA can detect obscured sources with column densities $>10^{24}$ cm$^{-2}$, but that the completeness drops rapidly after $10^{23}$ cm$^{-2}$. A sample of hard-only sources, many of which are likely to be heavily obscured AGN, is also presented and discussed. X-ray spectral fitting reveals that these sources have extremely faint soft X-ray emission and their optical images suggest that they are found in more edge-on galaxies with lower b/a. The resulting X-ray catalog is demonstrated to be a powerful tool for understanding AGN, in particular heavily obscured AGN found in the hard-only sample. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 20 pages, 21 figures. Submitted to A&A. Associated with eROSITA data release. Associated catalogs will be released after journal publication

arXiv:2401.17300 [pdf, other]

The eROSITA Final Equatorial Depth Survey (eFEDS): the hard X-ray selected sample

Authors: K. Nandra, S. G. H. Waddell, T. Liu, J. Buchner, T. Dwelly, M. Salvato, Y. Shen, Q. Wu, R. Arcodia, Th. Boller, H. Brunner, M. Brusa, W. Collmar, J. Comparat, A. Georgakakis, M. Grau, S. Hämmerich, H. Ibarra-Medel, Z. Igo, M. Krumpe, G. Lamer, A. Merloni, B. Musiimenta, J. Wolf, R. J. Assef , et al. (3 additional authors not shown)

Abstract: During its calibration and performance verification phase, the eROSITA instrument aboard the SRG satellite performed a uniform wide--area X-ray survey of approximately 140 deg$^{2}$ in a region of the sky known as the eROSITA Final Equatorial Depth Survey (eFEDS). The primary aim of eFEDS is to demonstrate the scientific performance to be expected at the end of the 8-pass eROSITA all sky survey. T… ▽ More During its calibration and performance verification phase, the eROSITA instrument aboard the SRG satellite performed a uniform wide--area X-ray survey of approximately 140 deg$^{2}$ in a region of the sky known as the eROSITA Final Equatorial Depth Survey (eFEDS). The primary aim of eFEDS is to demonstrate the scientific performance to be expected at the end of the 8-pass eROSITA all sky survey. This will provide the first focussed image of the whole sky in the hard X-ray ($>2$~keV) bandpass. The expected source population in this energy range is thus of great interest, particularly for AGN studies. We use the 2.3--5 keV selection presented by Brunner et al. (2022) to construct a sample of 246 point-like hard X-ray sources for further study and characterization. These are classified as either extragalactic ($\sim 90$~\%) or Galactic ($\sim 10$~\%), with the former consisting overwhelmingly of AGN and the latter active stars. We concentrate our further analysis on the extragalactic/AGN sample, describing their X-ray and multiwavelength properties and comparing them to the eFEDS main AGN sample selected in the softer 0.2-2.3 keV band. The eROSITA hard band selects a subsample of sources that is a factor $>10$ brighter than the eFEDS main sample. The AGN within the hard population reach up to $z=3.2$ but on the whole are relatively nearby, with median $z$=0.34 compared to $z$=0.94 for the main sample. The hard survey probes typical luminosities in the range $\log L_{\rm X} = 43-46$. X-ray spectral analysis shows significant intrinsic absorption (with $\log N_{\rm H}>21$) in $\sim 20$~\% of the sources, with a hard X-ray power law continuum with mean $<Γ>=1.83\pm0.04$, typical of AGN, but slightly harder than the soft-selected eROSITA sample. (abridged) △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: Submitted to A&A as part of eROSITA Data Release 1

arXiv:2401.17289 [pdf, other]

A new understanding of the Gemini-Monoceros X-ray enhancement from discoveries with eROSITA

Authors: Jonathan R. Knies, Manami Sasaki, Werner Becker, Teng Liu, Gabriele Ponti, Paul P. Plucinsky

Abstract: The Gemini-Monoceros X-ray enhancement is a rich field for studying diffuse X-ray emission and supernova remnants (SNRs). With the launch of eROSITA onboard the SRG platform in 2019, we are now able to fully study these sources. Many of the SNRs in the vicinity are suspected to be very old remnants, which are severely understudied in X-rays due to numerous observational challenges. In addition, id… ▽ More The Gemini-Monoceros X-ray enhancement is a rich field for studying diffuse X-ray emission and supernova remnants (SNRs). With the launch of eROSITA onboard the SRG platform in 2019, we are now able to fully study these sources. Many of the SNRs in the vicinity are suspected to be very old remnants, which are severely understudied in X-rays due to numerous observational challenges. In addition, identification of new faint large SNRs might help to solve the long-standing discrepancy of observed and expected number of Galactic SNRs. We performed a detailed X-ray spectral analysis of the entire diffuse structure and a detailed background analysis of the vicinity. We also made use of multi-wavelength data to better understand the morphology and to constrain the distances to the different sources. We estimated the plasma properties of the sources and calculated a grid of model SNRs to determine the individual SNR properties. Most of the diffuse plasma of the Monogem Ring SNR is well described by a single non-equilibrium ionization (NEI) component with an average temperature of $kT = 0.14\pm 0.03$ keV. We obtain an age of $\approx 1.2\cdot 10^5$ yr - consistent with PSR B0656+14 - for the Monogem Ring. In the south-east, we found evidence for a hotter second plasma component and a possible new SNR candidate at $\approx 300$ pc, with the new candidate having an age of $\approx 50,000$ yr. We were also able to improve on previous studies on the more distant Monoceros Loop and PKS 0646+06 SNRs. We obtained significantly higher temperatures than previous studies, and for PKS 0646+06 a much lower estimated age of the SNR. We also found a new SNR candidate G190.4+12.5 which most likely is located at $D > 1.5 $ kpc, expanding into a low density medium at a high distance from the Galactic plane, with an estimated age of $40,000-60,000$ yr. △ Less

Submitted 30 May, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: Accepted by A&A - 02/04/2024

arXiv:2401.17279 [pdf, other]

doi 10.1051/0004-6361/202449415

Characterisation of the X-ray point source variability in the eROSITA south ecliptic pole field

Authors: David Bogensberger, Kirpal Nandra, Mara Salvato, Teng Liu, Julien Wolf, Scott Croom, Hattie Starck, Johannes Buchner, Gabriele Ponti, Jacob Ider Chitham, Chandreyee Maitra, Jan Robrade, Andrea Merloni, Mirko Krumpe

Abstract: Aims: During the Spectrum Roentgen Gamma (SRG)/ eROSITA all-sky surveys, X-ray sources close to the South Ecliptic Pole (SEP) are observed almost every 4 hours. We aim to identify the sources exhibiting the most significant long-term X-ray variability within 3 degrees of the SEP in the first three surveys, and investigate their properties. Methods: We determined the variability significance of a… ▽ More Aims: During the Spectrum Roentgen Gamma (SRG)/ eROSITA all-sky surveys, X-ray sources close to the South Ecliptic Pole (SEP) are observed almost every 4 hours. We aim to identify the sources exhibiting the most significant long-term X-ray variability within 3 degrees of the SEP in the first three surveys, and investigate their properties. Methods: We determined the variability significance of all sources observed by eROSITA within 3 degrees of the SEP by using thresholds on the Bayesian excess variance (SCATT_LO) and the maximum amplitude deviation (AMPL_SIG). Sources exhibiting a variability significance above $3σ$ were subdivided into likely Galactic and extragalactic sources, by using spectral and photometric information of their optical counterparts. We quantified the X-ray normalised excess variances of all variable sources, and also calculated the periodograms of the brightest ones. Results: Out of more than $10^4$ X-ray sources detected by eROSITA within 3 degrees of the SEP, we identified 453 that exhibit significant X-ray variability. SCATT_LO is significantly more sensitive to detecting variable sources in this field, but AMPL_SIG helps provide a more complete variability sample. Of those variable sources, 168 were classified as likely extragalactic, and 235 as likely Galactic. The periodograms of most bright and variable extragalactic sources are approximately described by an aliased power law ($P\proptoν^{-α}$) with an index of $α\approx 1$. We identified a potential tidal disruption event, and long-term transient sources. The stellar X-ray variability was predominantly caused by bright X-ray flares from coronally active stars. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 22 pages, 29 figures

Journal ref: A&A 687, A37 (2024)

arXiv:2401.17274 [pdf, other]

doi 10.1051/0004-6361/202347165

The SRG/eROSITA all-sky survey: First X-ray catalogues and data release of the western Galactic hemisphere

Authors: A. Merloni, G. Lamer, T. Liu, M. E. Ramos-Ceja, H. Brunner, E. Bulbul, K. Dennerl, V. Doroshenko, M. J. Freyberg, S. Friedrich, E. Gatuzz, A. Georgakakis, F. Haberl, Z. Igo, I. Kreykenbohm, A. Liu, C. Maitra, A. Malyali, M. G. F. Mayer, K. Nandra, P. Predehl, J. Robrade, M. Salvato, J. S. Sanders, I. Stewart , et al. (120 additional authors not shown)

Abstract: The eROSITA telescope array aboard the Spektrum Roentgen Gamma (SRG) satellite began surveying the sky in December 2019, with the aim of producing all-sky X-ray source lists and sky maps of an unprecedented depth. Here we present catalogues of both point-like and extended sources using the data acquired in the first six months of survey operations (eRASS1; completed June 2020) over the half sky wh… ▽ More The eROSITA telescope array aboard the Spektrum Roentgen Gamma (SRG) satellite began surveying the sky in December 2019, with the aim of producing all-sky X-ray source lists and sky maps of an unprecedented depth. Here we present catalogues of both point-like and extended sources using the data acquired in the first six months of survey operations (eRASS1; completed June 2020) over the half sky whose proprietary data rights lie with the German eROSITA Consortium. We describe the observation process, the data analysis pipelines, and the characteristics of the X-ray sources. With nearly 930000 entries detected in the most sensitive 0.2-2.3 keV energy range, the eRASS1 main catalogue presented here increases the number of known X-ray sources in the published literature by more than 60%, and provides a comprehensive inventory of all classes of X-ray celestial objects, covering a wide range of physical processes. A smaller catalogue of 5466 sources detected in the less sensitive but harder 2.3-5 keV band is the result of the first true imaging survey of the entire sky above 2 keV. We show that the number counts of X-ray sources in eRASS1 are consistent with those derived over narrower fields by past X-ray surveys of a similar depth, and we explore the number counts variation as a function of the location in the sky. Adopting a uniform all-sky flux limit (at 50% completeness) of F_{0.5-2 keV} > 5 \times 10^{-14}$ erg\,s$^{-1}$\,cm$^{-2}$, we estimate that the eROSITA all-sky survey resolves into individual sources about 20% of the cosmic X-ray background in the 1-2 keV range. The catalogues presented here form part of the first data release (DR1) of the SRG/eROSITA all-sky survey. Beyond the X-ray catalogues, DR1 contains all detected and calibrated event files, source products (light curves and spectra), and all-sky maps. Illustrative examples of these are provided. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 39 pages, 23 figures. Accepted for publication in A&A. Accompanying eROSITA-DE Data Release 1

Journal ref: A&A, vol. 682, A34 (2024)

arXiv:2401.17038 [pdf, other]

Towards Assessing the Synthetic-to-Measured Adversarial Vulnerability of SAR ATR

Authors: Bowen Peng, Bo Peng, **gyuan Xia, Tianpeng Liu, Yongxiang Liu, Li Liu

Abstract: Recently, there has been increasing concern about the vulnerability of deep neural network (DNN)-based synthetic aperture radar (SAR) automatic target recognition (ATR) to adversarial attacks, where a DNN could be easily deceived by clean input with imperceptible but aggressive perturbations. This paper studies the synthetic-to-measured (S2M) transfer setting, where an attacker generates adversari… ▽ More Recently, there has been increasing concern about the vulnerability of deep neural network (DNN)-based synthetic aperture radar (SAR) automatic target recognition (ATR) to adversarial attacks, where a DNN could be easily deceived by clean input with imperceptible but aggressive perturbations. This paper studies the synthetic-to-measured (S2M) transfer setting, where an attacker generates adversarial perturbation based solely on synthetic data and transfers it against victim models trained with measured data. Compared with the current measured-to-measured (M2M) transfer setting, our approach does not need direct access to the victim model or the measured SAR data. We also propose the transferability estimation attack (TEA) to uncover the adversarial risks in this more challenging and practical scenario. The TEA makes full use of the limited similarity between the synthetic and measured data pairs for blind estimation and optimization of S2M transferability, leading to feasible surrogate model enhancement without mastering the victim model and data. Comprehensive evaluations based on the publicly available synthetic and measured paired labeled experiment (SAMPLE) dataset demonstrate that the TEA outperforms state-of-the-art methods and can significantly enhance various attack algorithms in computer vision and remote sensing applications. Codes and data are available at https://github.com/scenarri/S2M-TEA. △ Less

Submitted 30 January, 2024; originally announced January 2024.

arXiv:2401.16681 [pdf]

doi 10.5170/CERN-2009-006.476

The Design of a High Speed Low Power Phase Locked Loop

Authors: Tiankuan Liu, Datao Gong, Suen Hou, Zhihua Liang, Chonghan Liu, Da-Shung Su, **-Kun Teng, Annie C. Xiang, **gbo Ye

Abstract: The upgrade of the ATLAS Liquid Argon Calorimeter readout system calls for the development of radiation tolerant, high speed and low power serializer ASIC. We have designed a phase locked loop using a commercial 0.25 um Silicon-on-Sapphire (SoS) CMOS technology. Post-layout simulation indicates that tuning range is 3.79-5.01 GHz and power consumption is 104 mW. The PLL has been submitted for fabri… ▽ More The upgrade of the ATLAS Liquid Argon Calorimeter readout system calls for the development of radiation tolerant, high speed and low power serializer ASIC. We have designed a phase locked loop using a commercial 0.25 um Silicon-on-Sapphire (SoS) CMOS technology. Post-layout simulation indicates that tuning range is 3.79-5.01 GHz and power consumption is 104 mW. The PLL has been submitted for fabrication. The design and simulation results are presented. △ Less

Submitted 29 January, 2024; originally announced January 2024.

Comments: 5 pages, 13 figures, Proceedings of the Topical Workshop on Electronics for Particle Physics 2009

arXiv:2401.16278 [pdf, other]

Characteristics of the MTx optical transmitter in Total Ionizing Dose

Authors: D. Gong, S. Hou, B. J. Juang, J. -H. Li, C. Liu, T. Liu, M. Qi, J. Ye, Lei Zhang, Li Zhang, H. P. Zhu

Abstract: The dual-channel multi-mode 850 nm optical Miniature Transmitter (MTx) is developed for data transmission of the ATLAS LAr calorimeter readout at LHC. The MTx's are exposed to the radiation field of proton-proton collisions, therefore, the tolerance in Total Ionizing Dose (TID) is required. The TID effects in the MTx are investigated with X-rays and Co-60 gamma-rays for the active components of VC… ▽ More The dual-channel multi-mode 850 nm optical Miniature Transmitter (MTx) is developed for data transmission of the ATLAS LAr calorimeter readout at LHC. The MTx's are exposed to the radiation field of proton-proton collisions, therefore, the tolerance in Total Ionizing Dose (TID) is required. The TID effects in the MTx are investigated with X-rays and Co-60 gamma-rays for the active components of VCSEL diodes and the customized Link-on-Chip laser driver (LOCld) developed in 0.25 um Silicon-on-Sapphire CMOS technology. The irradiation tests were conducted at various dose rates. The responses to TID are observed with degradation of laser currents at initial dose of 10 to 100 Gy(SiO2), and partial recovery with additional TID to a stable output about 90 % of the original. The optical eye diagrams of irradiated samples show slightly increased jittering, and are suitable for the ATLAS requirement of 5 Gbps applications. △ Less

Submitted 28 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

Comments: 7 pages, 10 figures

arXiv:2401.16107 [pdf, other]

Beyond Direct Diagnosis: LLM-based Multi-Specialist Agent Consultation for Automatic Diagnosis

Authors: Haochun Wang, Sendong Zhao, Zewen Qiang, Nuwa Xi, Bing Qin, Ting Liu

Abstract: Automatic diagnosis is a significant application of AI in healthcare, where diagnoses are generated based on the symptom description of patients. Previous works have approached this task directly by modeling the relationship between the normalized symptoms and all possible diseases. However, in the clinical diagnostic process, patients are initially consulted by a general practitioner and, if nece… ▽ More Automatic diagnosis is a significant application of AI in healthcare, where diagnoses are generated based on the symptom description of patients. Previous works have approached this task directly by modeling the relationship between the normalized symptoms and all possible diseases. However, in the clinical diagnostic process, patients are initially consulted by a general practitioner and, if necessary, referred to specialists in specific domains for a more comprehensive evaluation. The final diagnosis often emerges from a collaborative consultation among medical specialist groups. Recently, large language models have shown impressive capabilities in natural language understanding. In this study, we adopt tuning-free LLM-based agents as medical practitioners and propose the Agent-derived Multi-Specialist Consultation (AMSC) framework to model the diagnosis process in the real world by adaptively fusing probability distributions of agents over potential diseases. Experimental results demonstrate the superiority of our approach compared with baselines. Notably, our approach requires significantly less parameter updating and training time, enhancing efficiency and practical utility. Furthermore, we delve into a novel perspective on the role of implicit symptoms within the context of automatic diagnosis. △ Less

Submitted 29 January, 2024; originally announced January 2024.

arXiv:2401.15969 [pdf, other]

Routers in Vision Mixture of Experts: An Empirical Study

Authors: Tianlin Liu, Mathieu Blondel, Carlos Riquelme, Joan Puigcerver

Abstract: Mixture-of-Experts (MoE) models are a promising way to scale up model capacity without significantly increasing computational cost. A key component of MoEs is the router, which decides which subset of parameters (experts) process which feature embeddings (tokens). In this paper, we present a comprehensive study of routers in MoEs for computer vision tasks. We introduce a unified MoE formulation th… ▽ More Mixture-of-Experts (MoE) models are a promising way to scale up model capacity without significantly increasing computational cost. A key component of MoEs is the router, which decides which subset of parameters (experts) process which feature embeddings (tokens). In this paper, we present a comprehensive study of routers in MoEs for computer vision tasks. We introduce a unified MoE formulation that subsumes different MoEs with two parametric routing tensors. This formulation covers both sparse MoE, which uses a binary or hard assignment between experts and tokens, and soft MoE, which uses a soft assignment between experts and weighted combinations of tokens. Routers for sparse MoEs can be further grouped into two variants: Token Choice, which matches experts to each token, and Expert Choice, which matches tokens to each expert. We conduct head-to-head experiments with 6 different routers, including existing routers from prior work and new ones we introduce. We show that (i) many routers originally developed for language modeling can be adapted to perform strongly in vision tasks, (ii) in sparse MoE, Expert Choice routers generally outperform Token Choice routers, and (iii) soft MoEs generally outperform sparse MoEs with a fixed compute budget. These results provide new insights regarding the crucial role of routers in vision MoE models. △ Less

Submitted 18 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

arXiv:2401.15885 [pdf, other]

Rectify the Regression Bias in Long-Tailed Object Detection

Authors: Ke Zhu, Minghao Fu, Jie Shao, Tianyu Liu, Jianxin Wu

Abstract: Long-tailed object detection faces great challenges because of its extremely imbalanced class distribution. Recent methods mainly focus on the classification bias and its loss function design, while ignoring the subtle influence of the regression branch. This paper shows that the regression bias exists and does adversely and seriously impact the detection accuracy. While existing methods fail to h… ▽ More Long-tailed object detection faces great challenges because of its extremely imbalanced class distribution. Recent methods mainly focus on the classification bias and its loss function design, while ignoring the subtle influence of the regression branch. This paper shows that the regression bias exists and does adversely and seriously impact the detection accuracy. While existing methods fail to handle the regression bias, the class-specific regression head for rare classes is hypothesized to be the main cause of it in this paper. As a result, three kinds of viable solutions to cater for the rare categories are proposed, including adding a class-agnostic branch, clustering heads and merging heads. The proposed methods brings in consistent and significant improvements over existing long-tailed detection methods, especially in rare and common classes. The proposed method achieves state-of-the-art performance in the large vocabulary LVIS dataset with different backbones and architectures. It generalizes well to more difficult evaluation metrics, relatively balanced datasets, and the mask branch. This is the first attempt to reveal and explore rectifying of the regression bias in long-tailed object detection. △ Less

Submitted 31 January, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

arXiv:2401.15845 [pdf]

doi 10.1103/PhysRevB.109.205403

Extremely intrinsic chirality in two-dimensional planar waveguide grating induced by quasi-bound states in the continuum

Authors: Dandan Zhang, Tingting Liu, Linlin Lei, Weimin Deng, Tongbiao Wang, Qinghua Liao, Wenxing Liu, Shuyuan Xiao, Tianbao Yu

Abstract: The strong chiral light-matter interaction is crucial for various important fields such as chiral optics, quantum optics, and biomedical optics, driving a quest for the extreme intrinsic chirality assisted by ultrahigh quality ($Q$-) factor resonances. In this quest, we propose a straightforward method to achieve extreme intrinsic chirality in lossless planar structures by manipulating the quasi-B… ▽ More The strong chiral light-matter interaction is crucial for various important fields such as chiral optics, quantum optics, and biomedical optics, driving a quest for the extreme intrinsic chirality assisted by ultrahigh quality ($Q$-) factor resonances. In this quest, we propose a straightforward method to achieve extreme intrinsic chirality in lossless planar structures by manipulating the quasi-BIC through in-plane perturbation. The temporal coupled-mode theory is employed to derive the conditions necessary for achieving maximal intrinsic chirality. The quasi-BIC should be excited within the transparent spectral range of the structure and couple with $x$- and $y$-polarized waves with the same intensity but a phase difference of $π$/2. For an illustration, a planar chiral dielectric dimeric waveguide grating is designed that strong interacts with left circularly polarized (LCP) light while decouples from right circularly polarized (RCP) light through in-plane symmetry engineering. Furthermore, by adjusting the magnitude of the in-plane asymmetry, we can independently manipulate the $Q$-factors of the chiral quasi-BIC while maintaining nearly unity circular dichroism. Our results provide a simple yet powerful paradigm for achieving extreme intrinsic chirality on an easily manufacturable platform, which may have potential applications in chiral emission, chiral sensing, and enantiomer separation. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Journal ref: Physical Review B 109 (20), 205403 (2024)

arXiv:2401.15755 [pdf]

doi 10.5170/CERN-2009-006.481

Development of A 16:1 serializer for data transmission at 5 Gbps

Authors: Datao Gong, Suen Hou, Zhihua Liang, Chonghan Liu, Tiankuan Liu, Da-Shun Su, **-Kun Teng, Annie C. Xiang, **gbo Ye

Abstract: Radiation tolerant, high speed and low power serializer ASIC is critical for optical link systems in particle physics experiments. Based on a commercial 0.25 um silicon-on-sapphire CMOS technology, we design a 16:1 serializer with 5 Gbps serial data rate. This ASIC has been submitted for fabrication. The post-layout simulation indicates the deterministic jitter is 54 ps (pk-pk) and random jitter i… ▽ More Radiation tolerant, high speed and low power serializer ASIC is critical for optical link systems in particle physics experiments. Based on a commercial 0.25 um silicon-on-sapphire CMOS technology, we design a 16:1 serializer with 5 Gbps serial data rate. This ASIC has been submitted for fabrication. The post-layout simulation indicates the deterministic jitter is 54 ps (pk-pk) and random jitter is 3 ps (rms). The power consumption of the serializer is 500 mW. The design details and post layout simulation results are presented in this paper. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Comments: 5 pages, 12 figures, Proceedings of the Topical Workshop on Electronics for Particle Physics 2009

arXiv:2401.15754 [pdf]

doi 10.5170/CERN-2009-006.471

High-Speed Serial Optical Link Test Bench Using FPGA with Embedded Transceivers

Authors: Annie C. Xiang, Tingting Cao, Datao Gong, Suen Hou, Chonghan Liu, Tiankuan Liu, Da-Shung Su, **-Kun Teng, **gbo Ye

Abstract: We develop a custom Bit Error Rate test bench based on Altera's Stratix II GX transceiver signal integrity development kit, demonstrate it on point-to-point serial optical link with data rate up to 5 Gbps, and compare it with commercial stand alone tester. The 8B/10B protocol is implemented and its effects studied. A variable optical attenuator is inserted in the fibre loop to induce transmission… ▽ More We develop a custom Bit Error Rate test bench based on Altera's Stratix II GX transceiver signal integrity development kit, demonstrate it on point-to-point serial optical link with data rate up to 5 Gbps, and compare it with commercial stand alone tester. The 8B/10B protocol is implemented and its effects studied. A variable optical attenuator is inserted in the fibre loop to induce transmission degradation and to measure receiver sensitivity. We report comparable receiver sensitivity results using the FPGA based tester and commercial tester. The results of the FPGA also shows that there are more one-to-zero bit flips than zero-to-one bit flips at lower error rate. In 8B/10B coded transmission, there are more word errors than bit flips, and the total error rate is less than two times that of non-coded transmission. Total error rate measured complies with simulation results, according to the protocol setup. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Comments: 5 pages, 8 figures, Proceedings of the Topical Workshop on Electronics for Particle Physics 2009

arXiv:2401.15750 [pdf]

doi 10.1088/1748-0221/10/01/C01004

The Clock Distribution System for the ATLAS Liquid Argon Calorimeter Phase-I Upgrade Demonstrator

Authors: Binwei Deng, Hucheng Chen, Kai Chen, **ghong Chen, Datao Gong, Di Guo, Xueye Hu, De** Huang, James Kierstead, Xiaoting Li, Chonghan Liu, Tiankuan Liu, Annie C. Xiang, Hao Xu, Tongye Xu, Yang You, **gbo Ye

Abstract: A prototype Liquid-argon Trigger Digitizer Board (LTDB), called the LTDB Demonstrator, has been developed to demonstrate the functions of the ATLAS Liquid Argon Calorimeter Phase-I trigger electronics upgrade. Forty Analog-to-Digital converters and four FPGAs with embedded multi-gigabit-transceivers on each Demonstrator need high quality clocks. A clock distribution system based on commercial comp… ▽ More A prototype Liquid-argon Trigger Digitizer Board (LTDB), called the LTDB Demonstrator, has been developed to demonstrate the functions of the ATLAS Liquid Argon Calorimeter Phase-I trigger electronics upgrade. Forty Analog-to-Digital converters and four FPGAs with embedded multi-gigabit-transceivers on each Demonstrator need high quality clocks. A clock distribution system based on commercial components has been developed for the Demonstrator. The design of the clock distribution system is presented. The performance of the clock distribution system has been evaluated. The components used in the clock distribution system have been qualified to meet radiation tolerance requirements of the Demonstrator. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Comments: 7 pages, 4 figures, Proceedings of the Topical Workshop on Electronics for Particle Physics 2014

arXiv:2401.15749 [pdf]

doi 10.1088/1748-0221/10/01/C01034

The VCSEL-based Array Optical Transmitter (ATx) Development Towards 120-Gbps Link for Collider Detector: Development Update

Authors: Di Guo, Chonghan Liu, **ghong Chen, John Chramowicz, Datao Gong, Suen Hou, De** Huang, Ge **, Xiaoting Li, Tiankuan Liu, Alan Prosser, **-Kun Teng, **gbo Ye, Yongzhao Zhou, Yang You, Annie C. Xiang, Hao Liang

Abstract: A compact radiation-tolerant array optical transmitter module (ATx) is developed to provide data transmission up to 10Gbps per channel with 12 parallel channels for collider detector applications. The ATx integrates a Vertical Cavity Surface-Emitting Laser (VCSEL) array and driver circuitry for electrical to optical conversion, an edge warp substrate for the electrical interface and a micro-lens a… ▽ More A compact radiation-tolerant array optical transmitter module (ATx) is developed to provide data transmission up to 10Gbps per channel with 12 parallel channels for collider detector applications. The ATx integrates a Vertical Cavity Surface-Emitting Laser (VCSEL) array and driver circuitry for electrical to optical conversion, an edge warp substrate for the electrical interface and a micro-lens array for the optical interface. This paper reports the continuing development of the ATx custom package. A simple, high-accuracy and reliable active-alignment method for the optical coupling is introduced. The radiation-resistance of the optoelectronic components is evaluated and the inclusion of a custom-designed array driver is discussed. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Comments: 8 pages, 8 figures

arXiv:2401.15746 [pdf]

doi 10.1088/1748-0221/10/01/C01050

JTAG-based Remote Configuration of FPGAs over Optical Fibers

Authors: Binwei Deng, Chonghan Liu, **ghong Chen, Kai Chen, Datao Gong, Di Guo, Suen Hou, De** Huang, Xiaoting Li, Tiankuan Liu, **-Kun Teng, Annie C. Xiang, Hao Xu, Yang You, **gbo Ye

Abstract: In this paper, a remote FPGA-configuration method based on JTAG extension over optical fibers is presented. The method takes advantage of commercial components and ready-to-use software such as iMPACT and does not require any hardware or software development. The method combines the advantages of the slow remote JTAG configuration and the fast local flash memory configuration. The method has been… ▽ More In this paper, a remote FPGA-configuration method based on JTAG extension over optical fibers is presented. The method takes advantage of commercial components and ready-to-use software such as iMPACT and does not require any hardware or software development. The method combines the advantages of the slow remote JTAG configuration and the fast local flash memory configuration. The method has been verified successfully and used in the Demonstrator of Liquid-Argon Trigger Digitization Board (LTDB) for the ATLAS liquid argon calorimeter Phase-I trigger upgrade. All components on the FPGA side are verified to meet the radiation tolerance requirements. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Comments: 8 pages, 4 figures

arXiv:2401.15283 [pdf]

Isotope engineering of carrier mobility via Fröhlich electron-phonon interaction

Authors: Wenjiang Zhou, Te-Huan Liu, Bai Song

Abstract: Isotope effects on phonon properties and transport have been predicted and observed for decades. However, despite the crucial impact of electron-phonon interactions, the effect of isotopes on electron transport remains largely unexplored. Here, by using first-principles calculations, we theoretically predict that the electron mobility of lithium hydride (LiH) can increase by up to ~100% as… ▽ More Isotope effects on phonon properties and transport have been predicted and observed for decades. However, despite the crucial impact of electron-phonon interactions, the effect of isotopes on electron transport remains largely unexplored. Here, by using first-principles calculations, we theoretically predict that the electron mobility of lithium hydride (LiH) can increase by up to ~100% as $^3\rm{H}$ is replaced with $^1\rm{H}$. This remarkable phenomenon is primarily attributed to the isotope engineering of the Fröhlich interaction by the mass-induced line shift of the longitudinal optical (LO) phonons. Notably, the isotope-dependent absorption of LO phonons dominates while the isotope-insensitive emission process is mostly suppressed due to energy conservation. We further propose general guidelines for evaluating isotope effects on carrier transport in different materials. △ Less

Submitted 26 January, 2024; originally announced January 2024.

Showing 301–350 of 4,055 results for author: Liu, T