Search | arXiv e-print repository

Assessment of the Role and Origin of S* in Orange Carotenoid Protein Photoconversion

Authors: James P. Pidgeon, George A. Sutherland, Matthew S. Proctor, Shuangqing Wang, Dimitri Chekulaev, Sayantan Bhattacharya, Rahul Jayaprakash, Andrew Hitchcock, Ravi Kumar Venkatraman, Matthew P. Johnson, C. Neil Hunter, Jenny Clark

Abstract: The orange carotenoid protein (OCP) is the water-soluble mediator of non-photochemical quenching in cyanobacteria, a crucial photoprotective mechanism in response to excess illumination. OCP converts from a globular, inactive state (OCPo) to an extended, active conformation (OCPr) under high-light conditions, resulting in a concomitant redshift in the absorption of the bound carotenoid. Here, OCP was trapped in either the active or inactive state by fixing each protein conformation in trehalose-sucrose glass. Glass-encapsulated OCPo did not convert under intense illumination and OCPr did not convert in darkness, allowing the optical properties of each conformation to be determined at room temperature. We measured pump wavelength-dependent transient absorption of OCPo in glass films and found that initial OCP photoproducts are still formed, despite the glass preventing completion of the photocycle. By comparison to the pump wavelength dependence of the OCPo to OCPr photoconversion yield in buffer, we show that the long-lived carotenoid singlet-like feature (S*) is associated with ground-state heterogeneity within OCPo, rather than triggering OCP photoconversion.

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.14365

JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models

Authors: Kun Zhou, Beichen Zhang, Jiapeng Wang, Zhipeng Chen, Wayne Xin Zhao, Jing Sha, Zhichao Sheng, Shijin Wang, Ji-Rong Wen

Abstract: Mathematical reasoning is an important capability of large language models (LLMs) for real-world applications. To enhance this capability, existing work either collects large-scale math-related texts for pre-training, or relies on stronger LLMs (e.g., GPT-4) to synthesize massive math problems. Both types of work generally lead to large costs in training or synthesis. To reduce the cost, based on open-source available texts, we propose an efficient way that trains a small LLM for math problem synthesis, to efficiently generate sufficient high-quality pre-training data. To achieve it, we create a dataset using GPT-4 to distill its data synthesis capability into the small LLM. Concretely, we craft a set of prompts based on human education stages to guide GPT-4, to synthesize problems covering diverse math knowledge and difficulty levels. Besides, we adopt the gradient-based influence estimation method to select the most valuable math-related texts. Both are fed into GPT-4 to create the knowledge distillation dataset used to train the small LLM. We leverage it to synthesize 6 million math problems for pre-training our JiuZhang3.0 model, which only needs to invoke GPT-4 API 9.3k times and pre-train on 4.6B data. Experimental results have shown that JiuZhang3.0 achieves state-of-the-art performance on several mathematical reasoning datasets, under both natural language reasoning and tool manipulation settings. Our code and data will be publicly released at https://github.com/RUCAIBox/JiuZhang3.0.

Submitted 23 May, 2024; originally announced May 2024.

Comments: 28 pages, SOTA math LLM using Well-trained Data Synthesis LLM

arXiv:2405.14091

Peas-in-a-Pod Across the Radius Valley: Rocky Systems are Less Uniform in Mass but More Uniform in Size and Spacing

Authors: Armaan V. Goyal, Songhu Wang

Abstract: The ubiquity of "peas-in-a-pod" architectural patterns and the existence of the radius valley each present a striking population-level trend for planets with $R_{p} \leq 4 R_{\oplus}$ that serves to place powerful constraints on the formation and evolution of these subgiant worlds. As it has yet to be determined whether the strength of this peas-in-a-pod uniformity differs on either side of the radius valley, we separately assess the architectures of systems containing only small ($R_{p} \leq 1.6 R_{\oplus}$), rocky planets from those harboring only intermediate-size ($1.6 R_{\oplus} < R_{p} \leq 4 R_{\oplus}$), volatile-rich worlds to perform a novel statistical comparison of intra-system planetary uniformity across compositionally distinct regimes. We find that, compared to their volatile-rich counterparts, rocky systems are less uniform in mass ($2.6σ$), but more uniform in size ($4.0σ$) and spacing ($3.0σ$). We provide further statistical validation for these results, demonstrating that they are not substantially influenced by the presence of mean motion resonances, low-mass host stars, alternative bulk compositional assumptions, sample size effects, or detection biases. We also obtain tentative evidence ($>2 σ$ significance) that the enhanced size uniformity of rocky systems is dominated by the presence of super-Earths ($1 R_{\oplus} \leq R_{p} \leq 1.6 R_{\oplus}$), while their enhanced mass diversity is driven by the presence of sub-Earth ($R_{p} < 1 R_{\oplus}$) worlds.

Submitted 22 May, 2024; originally announced May 2024.

Comments: Accepted to ApJ Letters (May 2024). 17 pages (including 3 for Appendix), 4 figures, 3 tables

arXiv:2405.14079

Advancing Transportation Mode Share Analysis with Built Environment: Deep Hybrid Models with Urban Road Network

Authors: Dingyi Zhuang, Qingyi Wang, Yunhan Zheng, Xiaotong Guo, Shenhao Wang, Haris N Koutsopoulos, Jinhua Zhao

Abstract: Transportation mode share analysis is important to various real-world transportation tasks as it helps researchers understand the travel behaviors and choices of passengers. A typical example is the prediction of communities' travel mode share by accounting for their sociodemographics like age, income, etc., and travel modes' attributes (e.g. travel cost and time). However, there exist only limited efforts in integrating the structure of the urban built environment, e.g., road networks, into the mode share models to capture the impacts of the built environment. This task usually requires manual feature engineering or prior knowledge of the urban design features. In this study, we propose deep hybrid models (DHM), which directly combine road networks and sociodemographic features as inputs for travel mode share analysis. Using graph embedding (GE) techniques, we enhance travel demand models with a more powerful representation of urban structures. In mode share prediction experiments in Chicago, results demonstrate that DHM can provide valuable spatial insights into the sociodemographic structure, improving the performance of travel demand models in estimating different mode shares at the city level. Specifically, DHM improves the results by more than 20% while retaining the interpretation power of the choice models, demonstrating its superiority in interpretability, prediction accuracy, and geographical insights.

Submitted 22 May, 2024; originally announced May 2024.

Comments: 29 pages

arXiv:2405.13998

Bridging Operator Learning and Conditioned Neural Fields: A Unifying Perspective

Authors: Sifan Wang, Jacob H Seidman, Shyam Sankaran, Hanwen Wang, George J. Pappas, Paris Perdikaris

Abstract: Operator learning is an emerging area of machine learning which aims to learn mappings between infinite dimensional function spaces. Here we uncover a connection between operator learning architectures and conditioned neural fields from computer vision, providing a unified perspective for examining differences between popular operator learning models. We find that many commonly used operator learning models can be viewed as neural fields with conditioning mechanisms restricted to point-wise and/or global information. Motivated by this, we propose the Continuous Vision Transformer (CViT), a novel neural operator architecture that employs a vision transformer encoder and uses cross-attention to modulate a base field constructed with a trainable grid-based positional encoding of query coordinates. Despite its simplicity, CViT achieves state-of-the-art results across challenging benchmarks in climate modeling and fluid dynamics. Our contributions can be viewed as a first step towards adapting advanced computer vision architectures for building more flexible and accurate machine learning models in physical sciences.

Submitted 22 May, 2024; originally announced May 2024.

Comments: 23 pages, 13 figures

arXiv:2405.13859

QGait: Toward Accurate Quantization for Gait Recognition with Binarized Input

Authors: Senmao Tian, Haoyu Gao, Gangyi Hong, Shuyun Wang, JingJie Wang, Xin Yu, Shunli Zhang

Abstract: Existing deep learning methods have made significant progress in gait recognition. Typically, appearance-based models binarize inputs into silhouette sequences. However, mainstream quantization methods prioritize minimizing task loss over quantization error, which is detrimental to gait recognition with binarized inputs. Minor variations in silhouette sequences can be diminished in the network's intermediate layers due to the accumulation of quantization errors. To address this, we propose a differentiable soft quantizer, which better simulates the gradient of the round function during backpropagation. This enables the network to learn from subtle input perturbations. However, our theoretical analysis and empirical studies reveal that directly applying the soft quantizer can hinder network convergence. We further refine the training strategy to ensure convergence while simulating quantization errors. Additionally, we visualize the distribution of outputs from different samples in the feature space and observe significant changes compared to the full precision network, which harms performance. Based on this, we propose an Inter-class Distance-guided Distillation (IDD) strategy to preserve the relative distance between the embeddings of samples with different labels. Extensive experiments validate the effectiveness of our approach, demonstrating state-of-the-art accuracy across various settings and datasets. The code will be made publicly available.

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.13804

Guarding Multiple Secrets: Enhanced Summary Statistic Privacy for Data Sharing

Authors: Shuaiqi Wang, Rongzhe Wei, Mohsen Ghassemi, Eleonora Kreacic, Vamsi K. Potluru

Abstract: Data sharing enables critical advances in many research areas and business applications, but it may lead to inadvertent disclosure of sensitive summary statistics (e.g., means or quantiles). Existing literature only focuses on protecting a single confidential quantity, while in practice, data sharing involves multiple sensitive statistics. We propose a novel framework to define, analyze, and protect multi-secret summary statistics privacy in data sharing. Specifically, we measure the privacy risk of any data release mechanism by the worst-case probability of an attacker successfully inferring summary statistic secrets. Given an attacker's objective spanning from inferring a subset to the entirety of summary statistic secrets, we systematically design and analyze tailored privacy metrics. Defining the distortion as the worst-case distance between the original and released data distribution, we analyze the tradeoff between privacy and distortion. Our contribution also includes designing and analyzing data release mechanisms tailored for different data distributions and secret types. Evaluations on real-world data demonstrate the effectiveness of our mechanisms in practical applications.

Submitted 12 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.13372

Ada-HGNN: Adaptive Sampling for Scalable Hypergraph Neural Networks

Authors: Shuai Wang, David W. Zhang, Jia-Hong Huang, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring

Abstract: Hypergraphs serve as an effective model for depicting complex connections in various real-world scenarios, from social to biological networks. The development of Hypergraph Neural Networks (HGNNs) has emerged as a valuable method to manage the intricate associations in data, though scalability is a notable challenge due to memory limitations. In this study, we introduce a new adaptive sampling strategy specifically designed for hypergraphs, which tackles their unique complexities in an efficient manner. We also present a Random Hyperedge Augmentation (RHA) technique and an additional Multilayer Perceptron (MLP) module to improve the robustness and generalization capabilities of our approach. Thorough experiments with real-world datasets have proven the effectiveness of our method, markedly reducing computational and memory demands while maintaining performance levels akin to conventional HGNNs and other baseline models. This research paves the way for improving both the scalability and efficacy of HGNNs in extensive applications. We will also make our codebase publicly accessible.

Submitted 14 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.13315

Study of the decays $χ_{cJ}\toΛ\barΛω$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, we present the first observation of the decays $χ_{cJ}\toΛ\barΛω$, where $J=0, 1, 2$, with statistical significances of $11.7 σ, 11.2 σ$, and $11.8 σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\toΛ\barΛω)=({2.37 \pm 0.22 \pm 0.23}) \times 10^{-4}$, $\mathcal{B}(χ_{c1}\toΛ\barΛω)=({1.01 \pm 0.10 \pm 0.11}) \times 10^{-4}$, and $\mathcal{B}(χ_{c2}\toΛ\barΛω)=({1.40 \pm 0.13 \pm 0.17}) \times 10^{-4}$, where the first uncertainties are statistical and the second are systematic. We observe no clear intermediate structures.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 11 pages, 10 figures

arXiv:2405.13118

Gliese 12 b, A Temperate Earth-sized Planet at 12 Parsecs Discovered with TESS and CHEOPS

Authors: Shishir Dholakia, Larissa Palethorpe, Alexander Venner, Annelies Mortier, Thomas G. Wilson, Chelsea X. Huang, Ken Rice, Vincent Van Eylen, Emma Nabbie, Ryan Cloutier, Walter Boschin, David Ciardi, Laetitia Delrez, Georgina Dransfield, Elsa Ducrot, Zahra Essack, Mark E. Everett, Michaël Gillon, Matthew J. Hooton, Michelle Kunimoto, David W. Latham, Mercedes López-Morales, Bin Li, Fan Li, Scott McDermott , et al. (11 additional authors not shown)

Abstract: We report on the discovery of Gliese 12 b, the nearest transiting temperate, Earth-sized planet found to date. Gliese 12 is a bright ($V=12.6$ mag, $K=7.8$ mag) metal-poor M4V star only $12.162\pm0.005$ pc away from the Solar System with one of the lowest stellar activity levels known for an M-dwarf. A planet candidate was detected by TESS based on only 3 transits in sectors 42, 43, and 57, with an ambiguity in the orbital period due to observational gaps. We performed follow-up transit observations with CHEOPS and ground-based photometry with MINERVA-Australis, SPECULOOS, and Purple Mountain Observatory, as well as further TESS observations in sector 70. We statistically validate Gliese 12 b as a planet with an orbital period of $12.76144\pm0.00006$ days and a radius of $1.0\pm0.1$ R$_\oplus$, resulting in an equilibrium temperature of $\sim$315 K. Gliese 12 b has excellent future prospects for precise mass measurement, which may inform how planetary internal structure is affected by the stellar compositional environment. Gliese 12 b also represents one of the best targets to study whether Earth-like planets orbiting cool stars can retain their atmospheres, a crucial step to advance our understanding of habitability on Earth and across the Galaxy.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 19 pages, 7 figures, Accepted for publication in MNRAS, Authors Shishir Dholakia and Larissa Palethorpe contributed equally

arXiv:2405.13094

KPG: Key Propagation Graph Generator for Rumor Detection based on Reinforcement Learning

Authors: Yusong Zhang, Kun Xie, Xingyi Zhang, Xiangyu Dong, Sibo Wang

Abstract: The proliferation of rumors on social media platforms during significant events, such as the US elections and the COVID-19 pandemic, has a profound impact on social stability and public health. Existing approaches for rumor detection primarily rely on propagation graphs to enhance model effectiveness. However, the presence of noisy and irrelevant structures during the propagation process limits the efficacy of these approaches. To tackle this issue, techniques such as weight adjustment and data augmentation have been proposed. However, these techniques heavily depend on rich original propagation structures, thus hindering performance when dealing with rumors that lack sufficient propagation information in the early propagation stages. In this paper, we propose Key Propagation Graph Generator (KPG), a novel reinforcement learning-based rumor detection framework that generates contextually coherent and informative propagation patterns for events with insufficient topology information, while also identifying indicative substructures for events with redundant and noisy propagation structures. KPG consists of two key components: the Candidate Response Generator (CRG) and the Ending Node Selector (ENS). CRG learns the latent distribution from refined propagation patterns, filtering out noise and generating new candidates for ENS. Simultaneously, ENS identifies the most influential substructures within propagation graphs and generates training data for CRG. Moreover, we introduce an end-to-end framework that utilizes rewards to guide the entire training process via a pre-trained graph neural network. Extensive experiments conducted on four datasets demonstrate the superiority of our KPG compared to the state-of-the-art approaches.

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2405.13075

Score-CDM: Score-Weighted Convolutional Diffusion Model for Multivariate Time Series Imputation

Authors: S. Zhang, S. Wang, H. Miao, H. Chen, C. Fan, J. Zhang

Abstract: Multivariate time series (MTS) data are usually incomplete in real scenarios, and imputing the incomplete MTS is practically important to facilitate various time series mining tasks. Recently, diffusion model-based MTS imputation methods have achieved promising results by utilizing CNN or attention mechanisms for temporal feature learning. However, it is hard to adaptively trade off the diverse effects of local and global temporal features by simply combining CNN and attention. To address this issue, we propose a Score-weighted Convolutional Diffusion Model (Score-CDM for short), whose backbone consists of a Score-weighted Convolution Module (SCM) and an Adaptive Reception Module (ARM). SCM adopts a score map to capture the global temporal features in the time domain, while ARM uses a Spectral2Time Window Block (S2TWB) to convolve the local time series data in the spectral domain. Benefiting from the time convolution properties of Fast Fourier Transformation, ARM can adaptively change the receptive field of the score map, and thus effectively balance the local and global temporal features. We conduct extensive evaluations on three real MTS datasets of different domains, and the results verify the effectiveness of the proposed Score-CDM.

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.12971

BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

Authors: Theodore Zhao, Yu Gu, Jianwei Yang, Naoto Usuyama, Ho Hin Lee, Tristan Naumann, Jianfeng Gao, Angela Crabtree, Jacob Abel, Christine Moung-Wen, Brian Piening, Carlo Bifulco, Mu Wei, Hoifung Poon, Sheng Wang

Abstract: Biomedical image analysis is fundamental for biomedical discovery in cell biology, pathology, radiology, and many other biomedical domains. Holistic image analysis comprises interdependent subtasks such as segmentation, detection, and recognition of relevant objects. Here, we propose BiomedParse, a biomedical foundation model for image parsing that can jointly conduct segmentation, detection, and recognition for 82 object types across 9 imaging modalities. Through joint learning, we can improve accuracy for individual tasks and enable novel applications such as segmenting all relevant objects in an image through a text prompt, rather than requiring users to laboriously specify the bounding box for each object. We leveraged readily available natural-language labels or descriptions accompanying those datasets and used GPT-4 to harmonize the noisy, unstructured text information with established biomedical object ontologies. We created a large dataset comprising over six million triples of image, segmentation mask, and textual description. On image segmentation, we showed that BiomedParse is broadly applicable, outperforming state-of-the-art methods on 102,855 test image-mask-label triples across 9 imaging modalities (everything). On object detection, which aims to locate a specific object of interest, BiomedParse again attained state-of-the-art performance, especially on objects with irregular shapes (everywhere). On object recognition, which aims to identify all objects in a given image along with their semantic types, we showed that BiomedParse can simultaneously segment and label all biomedical objects in an image (all at once). In summary, BiomedParse is an all-in-one tool for biomedical image analysis by jointly solving segmentation, detection, and recognition for all major biomedical image modalities, paving the path for efficient and accurate image-based biomedical discovery.

Submitted 4 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

Comments: Project page: https://aka.ms/biomedparse-project

arXiv:2405.12809

Precision measurement of the branching fraction of $J/ψ\rightarrow K^+K^-$ via $ψ(2S)\rightarrow π^+π^-J/ψ$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (604 additional authors not shown)

Abstract: Using a sample of $448.1 \times 10^6$ $ψ(2S)$ events collected with the BESIII detector, we perform a study of the decay $J/ψ\rightarrow K^+K^-$ via $ψ(2S)\rightarrow π^+π^-J/ψ$. The branching fraction of $J/ψ\rightarrow K^+K^-$ is determined to be $\mathcal{B}_{K^+K^-}=(3.072\pm 0.023({\rm stat.})\pm 0.050({\rm syst.}))\times 10^{-4}$, which is consistent with previous measurements but with significantly improved precision.

Submitted 21 May, 2024; originally announced May 2024.

Comments: to be submitted to PRD

arXiv:2405.12676

Experimental investigation of trans-scale displacement responses of wrinkle defects in fiber reinforced composite laminates

Authors: Li Ma, Shoulong Wang, Changchen Liu, Ange Wen, Kaidi Ying, Jing Guo

Abstract: Wrinkle defects are widely found in industrial products, e.g., wind turbine blades and filament-wound composite pressure vessels. The wrinkle wavelength varies from several millimeters to over one hundred millimeters. Locating wrinkle defects and measuring their responses are very important to the assessment of structures containing such defects. A meso-mechanical model based on the homogenization method is presented to obtain the effective stiffness of a graded wrinkle. The finite element simulation predicts the trans-scale response of out-of-plane displacement of wrinkled laminates, where the maximum displacement ranges from nanoscale to millimeter scale. Such a trans-scale effect requires different measurement approaches to observe the displacement responses. Here we employed shearography (speckle pattern shearing interferometry) and fringe projection profilometry (FPP), respectively, according to the magnitude of displacement. In the FPP method, a displacement extraction algorithm is presented to obtain the out-of-plane displacement. The measurement sensitivity and accuracy of shearography and FPP are compared, which provides a quantitative reference for industrial non-destructive testing.

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2405.12592

Spin-polarized p-wave superconductivity in the kagome material RbV$_3$Sb$_5$

Authors: Shuo Wang, Xilin Feng, Jing-Zhi Fang, Jia-Peng Peng, Zi-Ting Sun, Jia-Jie Yang, Jingchao Liu, Jia-Ji Zhao, Jian-Kun Wang, Xin-Jie Liu, Ze-Nan Wu, Shengbiao Sun, Ning Kang, Xiao-Song Wu, Zhensheng Zhang, Xuewen Fu, Kam Tuen Law, Ben-Chuan Lin, Dapeng Yu

Abstract: The study of kagome materials has attracted much attention in the past few years due to the presence of many electron-electron interaction-driven phases in a single material. In this work, we report the discovery of intrinsic spin-polarized p-wave superconductivity in the thin-flake kagome material RbV$_3$Sb$_5$. Firstly, when an in-plane magnetic field is swept in opposite directions, we observe a unique form of hysteresis in magnetoresistance which is different from the hysteresis induced by extrinsic mechanisms such as flux-trapping or superheating and supercooling effects. The unconventional hysteresis indicates the emergence of an intrinsic time-reversal symmetry-breaking superconducting phase. Strikingly, at a fixed magnetic field, the finite-resistance state can be quenched to the zero-resistance state by applying a large current. Secondly, at temperatures around 400 mK, the re-entrance of superconductivity occurs during an in-plane field-sweeping process with a fixed sweeping direction. This kind of re-entrance is asymmetric about the zero field axis and observed in all field directions for a fixed current direction, which is different from the re-entrance observed in conventional superconductors. Moreover, the angle-dependent in-plane critical field measurements reveal a two-fold symmetry that deviates from the original, centrosymmetric D$_{6h}$ point group symmetry of the crystal. These findings put very strong constraints on the possible superconducting pairing symmetry of RbV$_3$Sb$_5$. We point out that the pairing symmetry, which is consistent with the crystal symmetry and all the observed novel properties, is a time-reversal symmetry-breaking, p-wave pairing with net spin polarization. Importantly, this p-wave pairing gives rise to a nodal topological superconducting state with Majorana flat bands on the sample edges.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 21 pages, 4 figures

arXiv:2405.12099 [pdf]

Chemical control of self-assembly by the electrosolvation force

Authors: Sida Wang, Rowan Walker-Gibbons, Bethany Watkins, Binghui Lin, Madhavi Krishnan

Abstract: Self-assembly of matter in solution generally relies on attractive interactions that overcome entropy and drive the formation of higher-order molecular and particulate structures. Such interactions play key roles in a variety of contexts, e.g., crystallisation, biomolecular folding and condensation, pathological protein aggregation, pharmaceuticals and fine chemicals. The electrosolvation force entails a new conceptual paradigm in the known palette of interactions that drive the spontaneous accretion and organisation of matter. However, an understanding of the underlying physical chemistry, and therefore the ability to exert control over and tune the interaction, remains incomplete. Here we demonstrate that this force arises from the structure of the interfacial electrolyte. Neutral molecules such as a different solvent, osmolytes or surfactants, can - even at very low concentrations in the medium - disrupt or reinforce pre-existing interfacial solvent structure, thereby furnishing unanticipated chemical tuning of the ability of matter to self-assemble. The observations further present unexpected mechanistic elements that may explain the impact of co-solvents and osmolytes on protein structure, stability and pathological protein condensation. Our findings shed new light on microscopic mechanisms that drive the emergence of order and structure from molecular to macroscopic scales in the solution phase.

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.11838 [pdf, ps, other]

Sweedler duality for Hom-algebras and Hom-modules

Authors: Jiacheng Sun, Shuanhong Wang, Chi Zhang, Haoran Zhu

Abstract: The construction of Sweedler duality is an important tool in the theory of Hopf algebras over a field, which is a right adjoint to the dual algebra functor. In this paper, we study the Sweedler duality of Hom-algebras and their Hom-modules. We delve into the structure of Hom-coalgebras and derive the linear morphisms associated with them. Additionally, as an application, we present the (right) Hom-(co)module morphisms under the Sweedler duality.

Submitted 20 May, 2024; originally announced May 2024.

Comments: 11 pages

MSC Class: 17A30 (Primary); 17D30; 17A60 (Secondary)

arXiv:2405.11806 [pdf, ps, other]

Global stability and period-doubling bifurcations of a discrete Kolmogorov predator-prey model with Ricker-type prey growth

Authors: Lei Niu, Susu Wang

Abstract: In this paper, we study the dynamics of a discrete Kolmogorov predator-prey model with Ricker-type prey growth. We give the sufficient and necessary condition to guarantee the existence and uniqueness of the positive fixed point. Using the center manifold theory, we prove that the period-doubling bifurcations can occur at the positive fixed point. Furthermore, our numerical simulations reveal that the model can exhibit cascades of period-doubling bifurcations leading to chaos, which is a significant difference from the behavior of continuous predator-prey models. Despite the complexities of the model dynamics, we are able to provide a criterion for the global stability of the positive fixed point by using a geometric analysis of the nullclines.

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.11750 [pdf, other]

The Intermediate-Mass Black Hole Reverberation Mapping Project: Initial Results for a candidate IMBH in a nearby Seyfert 1 Galaxy

Authors: Wenwen Zuo, Hengxiao Guo, Jingbo Sun, Qi Yuan, Paulina Lira, Minfeng Gu, Philip G. Edwards, Alok C. Gupta, Shubham Kishore, Jamie Stevens, Tao An, Zhen-Yi Cai, Haicheng Feng, Luis C. Ho, Dragana Ilić, Andjelka B. Kovačević, ShaSha Li, Mar Mezcua, Luka Č. Popović, Mouyuan Sun, Tushar Tripathi, Vivian U., Oliver Vince, Jianguo Wang, Junxian Wang, et al. (3 additional authors not shown)

Abstract: To investigate the short-term variability and determine the size of the optical continuum emitting region of intermediate-mass black holes (IMBHs), we carried out high-cadence, multi-band photometric monitoring of a Seyfert 1 galaxy J0249-0815 across two nights, together with a one-night single-band preliminary test. The presence of the broad Hα component in our target was confirmed by recent Palomar/P200 spectroscopic observations, 23 years after the Sloan Digital Sky Survey, ruling out a supernova origin of the broad Hα line. The photometric experiment was primarily conducted utilizing four-channel imagers MuSCAT 3 & 4 mounted on 2-meter telescopes within the Las Cumbres Observatory Global Telescope Network. Despite the expectation of variability, we observed no significant variation (<1.4%) on timescales of 6-10 hours. This non-detection is likely due to substantial host galaxy light diluting the subtle AGN variability. Dual-band preliminary tests and tailored simulations may enhance the possibility of detecting variability and lag in future IMBH reverberation campaigns.

Submitted 19 May, 2024; originally announced May 2024.

Comments: 14 pages, 6 figures, submitted to ApJ, comments welcome

arXiv:2405.11585 [pdf, other]

Improved measurement of the branching fraction of $h_{c}\rightarrowγη^\prime/η$ and search for $h_{c}\rightarrowγπ^0$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (645 additional authors not shown)

Abstract: The processes $h_c\rightarrowγP~(P = η^\prime,~η,~π^{0})$ are studied with a sample of $(27.12\pm0.14)\times10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider. The branching fractions of $h_c\rightarrowγη^\prime$ and $h_c\rightarrowγη$ are measured to be $(1.40\pm0.11\pm0.04\pm0.10)\times10^{-3}$ and $(3.77\pm0.55\pm0.13\pm0.26)\times10^{-4}$, respectively, where the first uncertainties are statistical, the second systematic, and the third from the branching fraction of $ψ(3686)\rightarrowπ^{0}h_c$. The ratio $R_{h_c}=\frac{\mathscr{B}(h_c\rightarrowγη)}{\mathscr{B}(h_c\rightarrowγη^\prime)}$ is calculated to be $(27.0\pm4.4\pm1.0)\%$. The measurements are consistent with the previous results with improved precision by a factor of 2. The results are valuable for gaining a deeper understanding of $η-η^\prime$ mixing, and its manifestation within quantum chromodynamics. No significant signal is found for the decay $h_c\rightarrowγπ^{0}$, and an upper limit is placed on its branching fraction of $\mathscr{B}(h_c\rightarrowγπ^{0})<5.0\times10^{-5}$, at the 90\% confidence level.

Submitted 19 May, 2024; originally announced May 2024.

arXiv:2405.11394 [pdf, other]

Online Mental Stress Detection Using Frontal-channel EEG Recordings in a Classroom Scenario

Authors: Chi-Yuan Chang, Chieh Hsu, Ying Choon Wu, Siwen Wang, Darin Tsui, Tzyy-Ping Jung

Abstract: Objective: To investigate the effects of different approaches to EEG preprocessing, channel montage selection, and model architecture on the performance of an online-capable stress detection algorithm in a classroom scenario. Methods: This analysis used EEG data from a longitudinal stress and fatigue study conducted among university students. Their self-reported stress ratings during each class session were the basis for classifying EEG recordings into either normal or elevated stress states. We used a data-processing pipeline that combined Artifact Subspace Reconstruction (ASR) and an Independent Component Analysis (ICA)-based method to achieve online artifact removal. We compared the performance of a Linear Discriminant Analysis (LDA) and a 4-layer neural network as classifiers. We opted for accuracy, balanced accuracy, and F1 score as the metrics for assessing performance. We examined the impact of varying numbers of input channels using different channel montages. Additionally, we explored different window lengths and step sizes during online evaluation. Results: Our online artifact removal method achieved performance comparable to the offline ICA method in both offline and online evaluations. Balanced accuracies of 77% and 78% in an imbalanced binary classification were observed when using the 11-frontal-channel LDA model with the proposed artifact removal method. Moreover, the model performance remained intact when changing the channel montage from 30 full-scalp channels to just 11 frontal channels. During the online evaluation, we achieved the highest balanced accuracy (78%) with a window length of 20 seconds and a step size of 1 second. Significance: This study comprehensively investigates the deployment of stress detection in real-world scenarios. The findings of this study provide insight into the development of daily mental stress monitoring.
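
Note: The abstract above reports balanced accuracy for an imbalanced binary task. As a minimal sketch of that metric (using its standard definition, the mean of per-class recall, and not the authors' code), the following compares it with plain accuracy. The function names and the toy labels are hypothetical and chosen only for illustration.

```python
def plain_accuracy(y_true, y_pred):
    """Fraction of predictions that match the labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall for binary labels 0 (normal) and 1 (elevated)."""
    pos = sum(1 for t in y_true if t == 1)
    neg = sum(1 for t in y_true if t == 0)
    if pos == 0 or neg == 0:
        raise ValueError("both classes must be present in y_true")
    true_pos = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    true_neg = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return 0.5 * (true_pos / pos + true_neg / neg)


# Toy, made-up labels: 8 "normal" windows and 2 "elevated" windows.
y_true = [0] * 8 + [1] * 2
# A trivial classifier that always predicts "normal".
y_pred = [0] * 10

print(plain_accuracy(y_true, y_pred))     # high despite missing every elevated case
print(balanced_accuracy(y_true, y_pred))  # penalizes the missed minority class
```

On imbalanced data a classifier that ignores the minority class can still score well on plain accuracy, which is presumably why balanced accuracy is reported alongside it.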

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2405.11221 [pdf, other]

Real-time equilibrium reconstruction by neural network based on HL-3 tokamak

Authors: Guohui Zheng, Songfen Liu, Zongyu Yang, Rui Ma, Xinwen Gong, Ao Wang, Shuo Wang, Wulyu Zhong

Abstract: A neural network model, EFITNN, has been developed that is capable of real-time magnetic equilibrium reconstruction based on HL-3 tokamak magnetic measurement signals. The model processes inputs from 68 channels of magnetic measurement data gathered from 1159 HL-3 experimental discharges, including plasma current, loop voltage, and the poloidal magnetic fields measured by equilibrium probes. The outputs of the model feature eight key plasma parameters, alongside high-resolution ($129\times129$) reconstructions of the toroidal current density $J_{\text P}$ and poloidal magnetic flux profiles $Ψ_{rz}$. Moreover, the network's architecture employs a multi-task learning structure, which enables the sharing of weights and mutual correction among different outputs, and increases the model's accuracy by up to 32%. The performance of EFITNN demonstrates remarkable consistency with the offline EFIT, achieving average $R^2 = 0.941, 0.997$ and $0.959$ for eight plasma parameters, $Ψ_{rz}$ and $J_{\text P}$, respectively. The model's robust generalization capabilities are particularly evident in its successful predictions of quasi-snowflake (QSF) divertor configurations and its adept handling of data from shot numbers or plasma current intervals not previously encountered during training. Compared to numerical methods, EFITNN significantly enhances computational efficiency with average computation time ranging from 0.08ms to 0.45ms, indicating its potential utility in real-time isoflux control and plasma profile management.

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2405.11196 [pdf, other]

Natural Is The Best: Model-Agnostic Code Simplification for Pre-trained Large Language Models

Authors: Yan Wang, Xiaoning Li, Tien Nguyen, Shaohua Wang, Chao Ni, Ling Ding

Abstract: Pre-trained Large Language Models (LLM) have achieved remarkable successes in several domains. However, code-oriented LLMs are computationally heavy, with complexity growing quadratically with the length of the input. Toward simplifying the input program of an LLM, the state-of-the-art approach filters the input code tokens based on the attention scores given by the LLM. The decision to simplify the input should not rely on the attention patterns of an LLM, as these patterns are influenced by both the model architecture and the pre-training dataset. Since the model and dataset are part of the solution domain, not the problem domain where the input belongs, the outcome may differ when the model is pre-trained on a different dataset. We propose SlimCode, a model-agnostic code simplification solution for LLMs that depends on the nature of input code tokens. In an empirical study on LLMs including CodeBERT, CodeT5, and GPT-4 for two main tasks, code search and summarization, we report that 1) the removal ratio of code has a linear-like relation with the saving ratio on training time, 2) the impact of categorized tokens on code simplification can vary significantly, 3) the impact of categorized tokens on code simplification is task-specific but model-agnostic, and 4) the above findings hold for the paradigm-prompt engineering and interactive in-context learning. The empirical results showed that SlimCode can improve the state-of-the-art technique by 9.46% and 5.15% in terms of MRR and BLEU score on code search and summarization. Moreover, SlimCode is 133 times faster than the state-of-the-art approach. Additionally, SlimCode can reduce the cost of invoking GPT-4 by up to 24% per API query, while still producing comparable results to those with the original code.

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2405.11034 [pdf, other]

Safety in Graph Machine Learning: Threats and Safeguards

Authors: Song Wang, Yushun Dong, Binchi Zhang, Zihan Chen, Xingbo Fu, Yinhan He, Cong Shen, Chuxu Zhang, Nitesh V. Chawla, Jundong Li

Abstract: Graph Machine Learning (Graph ML) has witnessed substantial advancements in recent years. With their remarkable ability to process graph-structured data, Graph ML techniques have been extensively utilized across diverse applications, including critical domains like finance, healthcare, and transportation. Despite their societal benefits, recent research highlights significant safety concerns associated with the widespread use of Graph ML models. Lacking safety-focused designs, these models can produce unreliable predictions, demonstrate poor generalizability, and compromise data confidentiality. In high-stakes scenarios such as financial fraud detection, these vulnerabilities could jeopardize both individuals and society at large. Therefore, it is imperative to prioritize the development of safety-oriented Graph ML models to mitigate these risks and enhance public confidence in their applications. In this survey paper, we explore three critical aspects vital for enhancing safety in Graph ML: reliability, generalizability, and confidentiality. We categorize and analyze threats to each aspect under three headings: model threats, data threats, and attack threats. This novel taxonomy guides our review of effective strategies to protect against these threats. Our systematic review lays a groundwork for future research aimed at developing practical, safety-centered Graph ML models. Furthermore, we highlight the significance of safe Graph ML practices and suggest promising avenues for further investigation in this crucial area.

Submitted 17 May, 2024; originally announced May 2024.

Comments: 20 pages

arXiv:2405.10943 [pdf, other]

Efficient photon-pair generation in layer-poled lithium niobate nanophotonic waveguides

Authors: Xiaodong Shi, Sakthi Sanjeev Mohanraj, Veerendra Dhyani, Angela Anna Baiju, Sihao Wang, Jiapeng Sun, Lin Zhou, Anna Paterova, Victor Leong, Di Zhu

Abstract: Integrated photon-pair sources are crucial for scalable photonic quantum systems. Thin-film lithium niobate is a promising platform for on-chip photon-pair generation through spontaneous parametric down-conversion (SPDC). However, the device implementation faces practical challenges. Periodically poled lithium niobate (PPLN), despite enabling flexible quasi-phase matching, suffers from poor fabrication reliability and device repeatability, while conventional modal phase matching (MPM) methods yield limited efficiencies due to inadequate mode overlaps. Here, we introduce a layer-poled lithium niobate (LPLN) nanophotonic waveguide for efficient photon-pair generation. It leverages layer-wise polarity inversion through electrical poling to break spatial symmetry and significantly enhance nonlinear interactions for MPM, achieving a notable normalized second-harmonic generation (SHG) conversion efficiency of 4615% W$^{-1}$cm$^{-2}$. Through a cascaded SHG and SPDC process, we demonstrate photon-pair generation with a normalized brightness of $3.1\times10^{6}$ Hz nm$^{-1}$ mW$^{-2}$ in a 3.3 mm long LPLN waveguide, surpassing existing on-chip sources under similar operating configurations. Crucially, our LPLN waveguides offer enhanced fabrication reliability and reduced sensitivity to geometric variations and temperature fluctuations compared to PPLN devices. We expect LPLN to become a promising solution for on-chip nonlinear wavelength conversion and non-classical light generation, with immediate applications in quantum communication, networking, and on-chip photonic quantum information processing.

Submitted 17 May, 2024; originally announced May 2024.

arXiv:2405.10758 [pdf, other]

Seeing is (Not) Believing: Practical Phishing Attacks Targeting Social Media Sharing Cards

Authors: Wangchenlu Huang, Shenao Wang, Yanjie Zhao, Guosheng Xu, Haoyu Wang

Abstract: In the digital era, Online Social Networks (OSNs) play a crucial role in information dissemination, with sharing cards for link previews emerging as a key feature. These cards offer snapshots of shared content, including titles, descriptions, and images. In this study, we investigate the construction and dissemination mechanisms of these cards, focusing on two primary server-side generation methods based on Share-SDK and HTML meta tags. Our investigation reveals a novel type of attack, i.e., Sharing Card Forgery (SCF) attack that can be exploited to create forged benign sharing cards for malicious links. We demonstrate the feasibility of these attacks through practical implementations and evaluate their effectiveness across 13 various online social networks. Our findings indicate a significant risk, as the deceptive cards can evade detection and persist on social platforms, thus posing a substantial threat to user security. We also delve into countermeasures and discuss the challenges in effectively mitigating these types of attacks. This study not only sheds light on a novel phishing technique but also calls for heightened awareness and improved defensive strategies in the OSN ecosystem.

Submitted 17 May, 2024; originally announced May 2024.

arXiv:2405.10757 [pdf, other]

doi 10.1145/3637528.3671910

Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective

Authors: Zhiwei Zhang, Minhua Lin, Enyan Dai, Suhang Wang

Abstract: Graph Neural Networks (GNNs) have shown remarkable performance in various tasks. However, recent works reveal that GNNs are vulnerable to backdoor attacks. Generally, a backdoor attack poisons the graph by attaching backdoor triggers and the target class label to a set of nodes in the training graph. A GNN trained on the poisoned graph will then be misled to predict test nodes attached with the trigger to the target class. Despite their effectiveness, our empirical analysis shows that triggers generated by existing methods tend to be out-of-distribution (OOD), which significantly differ from the clean data. Hence, these injected triggers can be easily detected and pruned with widely used outlier detection methods in real-world applications. Therefore, in this paper, we study a novel problem of unnoticeable graph backdoor attacks with in-distribution (ID) triggers. To generate ID triggers, we introduce an OOD detector in conjunction with an adversarial learning strategy to generate the attributes of the triggers within distribution. To ensure a high attack success rate with ID triggers, we introduce novel modules designed to enhance trigger memorization by the victim model trained on the poisoned graph. Extensive experiments on real-world datasets demonstrate the effectiveness of the proposed method in generating in-distribution triggers that can bypass various defense strategies while maintaining a high attack success rate.

Submitted 11 July, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

Comments: Accepted by KDD 2024

arXiv:2405.10583 [pdf, other]

doi 10.1073/pnas.2322270121

Large Fermi surface in pristine kagome metal CsV$_3$Sb$_5$ and enhanced quasiparticle effective masses

Authors: Wei Zhang, Tsz Fung Poon, Chun Wai Tsang, Wenyan Wang, X. Liu, J. Xie, S. T. Lam, Shanmin Wang, Kwing To Lai, A. Pourret, G. Seyfarth, G. Knebel, Wing Chi Yu, Swee K. Goh

Abstract: The kagome metal CsV$_3$Sb$_5$ is an ideal platform to study the interplay between topology and electron correlation. To understand the fermiology of CsV$_3$Sb$_5$, intensive quantum oscillation (QO) studies at ambient pressure have been conducted. However, due to the Fermi surface reconstruction by the complicated charge density wave (CDW) order, the QO spectrum is exceedingly complex, hindering a complete understanding of the fermiology. Here, we directly map the Fermi surface of the pristine CsV$_3$Sb$_5$ by measuring Shubnikov-de Haas QOs up to 29 T under pressure, where the CDW order is completely suppressed. The QO spectrum of the pristine CsV$_3$Sb$_5$ is significantly simpler than the one in the CDW phase, and the detected oscillation frequencies agree well with our density functional theory calculations. In particular, a frequency as large as 8,200 T is detected. Pressure-dependent QO studies further reveal a weak but noticeable enhancement of the quasiparticle effective masses on approaching the critical pressure where the CDW order disappears, hinting at the presence of quantum fluctuations. Our high-pressure QO results reveal the large, unreconstructed Fermi surface of CsV$_3$Sb$_5$, paving the way to understanding the parent state of this intriguing metal in which the electrons can be organized into different ordered states.

Submitted 17 May, 2024; originally announced May 2024.

Comments: 4 figures, 1 table. This is the preprint of a published paper in PNAS

Journal ref: Proc. Natl. Acad. Sci. U.S.A. 121, e2322270121 (2024)

arXiv:2405.10573 [pdf, other]

A method for reversing the laser modulation in a Storage ring

Authors: Weihang Liu, Yu Zhao, Yi Jiao, Sheng Wang, Chao Feng

Abstract: The pursuit of coherent radiation generation remains a central focus in advancing storage ring light sources. Despite the promise of laser modulation in achieving this goal, it brings about a noticeable decline in beam quality. Efforts to mitigate this decline have resulted in the proposal of demodulation schemes. However, implementing modulation and demodulation within the storage ring presents significant challenges due to dynamical and spatial constraints within straight sections. In this study, we propose a straightforward and easily implementable method for achieving reversible laser modulation in a storage ring. Notably, our approach circumvents the need for special storage ring requirements, such as lengthy straight sections or bypass sections. Simulation results demonstrate a substantial restoration of beam quality following demodulation. This innovative scheme holds great promise for the realization of high repetition rate coherent storage ring light sources.

Submitted 17 May, 2024; originally announced May 2024.

Comments: 7 pages, 7 figures

arXiv:2405.10210 [pdf, other]

GPT Store Mining and Analysis

Authors: Dongxun Su, Yanjie Zhao, Xinyi Hou, Shenao Wang, Haoyu Wang

Abstract: As a pivotal extension of the renowned ChatGPT, the GPT Store serves as a dynamic marketplace for various Generative Pre-trained Transformer (GPT) models, shaping the frontier of conversational AI. This paper presents an in-depth measurement study of the GPT Store, with a focus on the categorization of GPTs by topic, factors influencing GPT popularity, and the potential security risks. Our investigation starts with assessing the categorization of GPTs in the GPT Store, analyzing how they are organized by topics, and evaluating the effectiveness of the classification system. We then examine the factors that affect the popularity of specific GPTs, looking into user preferences, algorithmic influences, and market trends. Finally, the study delves into the security risks of the GPT Store, identifying potential threats and evaluating the robustness of existing security measures. This study offers a detailed overview of the GPT Store's current state, shedding light on its operational dynamics and user interaction patterns. Our findings aim to enhance understanding of the GPT ecosystem, providing valuable insights for future research, development, and policy-making in generative AI.

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.10197 [pdf, other]

Forte: A Suite of Advanced Multireference Quantum Chemistry Methods

Authors: Francesco A. Evangelista, Chenyang Li, Prakash Verma, Kevin P. Hannon, Jeffrey B. Schriber, Tianyuan Zhang, Chenxi Cai, Shuhe Wang, Nan He, Nicholas H. Stair, Meng Huang, Renke Huang, Jonathon P. Misiewicz, Shuhang Li, Kevin Marin, Zijun Zhao, Lori A. Burns

Abstract: Forte is an open-source library specialized in multireference electronic structure theories for molecular systems and the rapid prototyping of new methods. This paper gives an overview of the capabilities of Forte, its software architecture, and examples of applications enabled by the methods it implements.

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.09828 [pdf, other]

PillarNeXt: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features

Authors: Xusheng Li, Chengliang Wang, Shumao Wang, Zhuo Zeng, Ji Liu

Abstract: The multi-line LiDAR is widely used in autonomous vehicles, so point cloud-based 3D detectors are essential for autonomous driving. Extracting rich multi-scale features is crucial for point cloud-based 3D detectors in autonomous driving due to significant differences in the size of different types of objects. However, because of the real-time requirements, large-size convolution kernels are rarely used to extract large-scale features in the backbone. Current 3D detectors commonly use feature pyramid networks to obtain large-scale features; however, some objects containing fewer point clouds are further lost during down-sampling, resulting in degraded performance. Since pillar-based schemes require much less computation than voxel-based schemes, they are more suitable for constructing real-time 3D detectors. Hence, we propose the PillarNeXt, a pillar-based scheme. We redesigned the feature encoding, the backbone, and the neck of the 3D detector. We propose the Voxel2Pillar feature encoding, which uses a sparse convolution constructor to construct pillars with richer point cloud features, especially height features. The Voxel2Pillar adds more learnable parameters to the feature encoding, enabling the initial pillars to have higher performance ability. We extract multi-scale and large-scale features in the proposed fully sparse backbone, which does not utilize large-size convolutional kernels; the backbone consists of the proposed multi-scale feature extraction module. The neck consists of the proposed sparse ConvNeXt, whose simple structure significantly improves the performance. We validate the effectiveness of the proposed PillarNeXt on the Waymo Open Dataset, and the object detection accuracy for vehicles, pedestrians, and cyclists is improved. We also verify the effectiveness of each proposed module in detail through ablation studies.

Submitted 19 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.09768 [pdf, other]

Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model

Authors: Siyang Wang, Éva Székely

Abstract: Recent advances in generative language modeling applied to discrete speech tokens presented a new avenue for text-to-speech (TTS) synthesis. These speech language models (SLMs), similarly to their textual counterparts, are scalable, probabilistic, and context-aware. While they can produce diverse and natural outputs, they sometimes face issues such as unintelligibility and the inclusion of non-speech noises or hallucination. As the adoption of this innovative paradigm in speech synthesis increases, there is a clear need for an in-depth evaluation of its capabilities and limitations. In this paper, we evaluate TTS from a discrete token-based SLM, through both automatic metrics and listening tests. We examine five key dimensions: speaking style, intelligibility, speaker consistency, prosodic variation, spontaneous behaviour. Our results highlight the model's strength in generating varied prosody and spontaneous outputs. It is also rated higher in naturalness and context appropriateness in listening tests compared to a conventional TTS. However, the model's performance in intelligibility and speaker consistency lags behind traditional TTS. Additionally, we show that increasing the scale of SLMs offers a modest boost in robustness. Our findings aim to serve as a benchmark for future advancements in generative SLMs for speech synthesis.

Submitted 15 May, 2024; originally announced May 2024.

Comments: 11 pages, 4 figures. Language Resources and Evaluation Conference (LREC) 2024. demo: https://swatsw.github.io/lrec24_eval_slm/

arXiv:2405.09672 [pdf, other]

doi 10.1145/3658180

Eulerian-Lagrangian Fluid Simulation on Particle Flow Maps

Authors: Junwei Zhou, Duowen Chen, Molin Deng, Yitong Deng, Yuchen Sun, Sinan Wang, Shiying Xiong, Bo Zhu

Abstract: We propose a novel Particle Flow Map (PFM) method to enable accurate long-range advection for incompressible fluid simulation. The foundation of our method is the observation that a particle trajectory generated in a forward simulation naturally embodies a perfect flow map. Centered on this concept, we have developed an Eulerian-Lagrangian framework comprising four essential components: Lagrangian particles for a natural and precise representation of bidirectional flow maps; a dual-scale map representation to accommodate the mapping of various flow quantities; a particle-to-grid interpolation scheme for accurate quantity transfer from particles to grid nodes; and a hybrid impulse-based solver to enforce incompressibility on the grid. The efficacy of PFM has been demonstrated through various simulation scenarios, highlighting the evolution of complex vortical structures and the details of turbulent flows. Notably, compared to NFM, PFM reduces computing time by up to 49 times and memory consumption by up to 41%, while enhancing vorticity preservation as evidenced in various tests like leapfrog, vortex tube, and turbulent flow.

Submitted 15 May, 2024; originally announced May 2024.

arXiv:2405.09539 [pdf, ps, other]

MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer

Authors: Chengyu Wu, Chengkai Wang, Yaqi Wang, Huiyu Zhou, Yatao Zhang, Qifeng Wang, Shuai Wang

Abstract: Esophageal cancer is one of the most common types of cancer worldwide and ranks sixth in cancer-related mortality. Accurate computer-assisted diagnosis of cancer progression can help physicians effectively customize personalized treatment plans. Currently, CT-based cancer diagnosis methods have received much attention for their comprehensive ability to examine patients' conditions. However, multi-modal based methods may likely introduce information redundancy, leading to underperformance. In addition, efficient and effective interactions between multi-modal representations need to be further explored, lacking insightful exploration of prognostic correlation in multi-modality features. In this work, we introduce a multi-modal heterogeneous graph-based conditional feature-guided diffusion model for lymph node metastasis diagnosis based on CT images as well as clinical measurements and radiomics data. To explore the intricate relationships between multi-modal features, we construct a heterogeneous graph. Following this, a conditional feature-guided diffusion approach is applied to eliminate information redundancy. Moreover, we propose a masked relational representation learning strategy, aiming to uncover the latent prognostic correlations and priorities of primary tumor and lymph node image representations. Various experimental results validate the effectiveness of our proposed method. The code is available at https://github.com/wuchengyu123/MMFusion.

Submitted 16 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

Comments: Early accepted to MICCAI 2024 (6/6/5)

arXiv:2405.09477 [pdf, other]

Harmonizing Human Insights and AI Precision: Hand in Hand for Advancing Knowledge Graph Task

Authors: Shurong Wang, Yufei Zhang, Xuliang Huang, Hongwei Wang

Abstract: Knowledge graph embedding (KGE) has caught significant interest for its effectiveness in knowledge graph completion (KGC), specifically link prediction (LP), with recent KGE models cracking the LP benchmarks. Despite the rapidly growing literature, insufficient attention has been paid to the cooperation between humans and AI on KG. However, humans' capability to analyze graphs conceptually may further improve the efficacy of KGE models with semantic information. To this effect, we carefully designed a human-AI team (HAIT) system dubbed KG-HAIT, which harnesses the human insights on KG by leveraging fully human-designed ad-hoc dynamic programming (DP) on KG to produce human insightful feature (HIF) vectors that capture the subgraph structural feature and semantic similarities. By integrating HIF vectors into the training of KGE models, notable improvements are observed across various benchmarks and metrics, accompanied by accelerated model convergence. Our results underscore the effectiveness of human-designed DP in the task of LP, emphasizing the pivotal role of collaboration between humans and AI on KG. We open avenues for further exploration and innovation through KG-HAIT, paving the way towards more effective and insightful KG analysis techniques.

Submitted 15 May, 2024; originally announced May 2024.

arXiv:2405.09463 [pdf, other]

Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal Candidiasis Screening

Authors: Yan Kong, Sheng Wang, Jiangdong Cai, Zihao Zhao, Zhenrong Shen, Yonghao Li, Manman Fei, Qian Wang

Abstract: Accurate detection of vulvovaginal candidiasis is critical for women's health, yet its sparse distribution and visually ambiguous characteristics pose significant challenges for accurate identification by pathologists and neural networks alike. Our eye-tracking data reveals that areas garnering sustained attention - yet not marked by experts after deliberation - are often aligned with false positives of neural networks. Leveraging this finding, we introduce Gaze-DETR, a pioneering method that integrates gaze data to enhance neural network precision by diminishing false positives. Gaze-DETR incorporates a universal gaze-guided warm-up protocol applicable across various detection methods and a gaze-guided rectification strategy specifically designed for DETR-based models. Our comprehensive tests confirm that Gaze-DETR surpasses existing leading methods, showcasing remarkable improvements in detection accuracy and generalizability.

Submitted 15 May, 2024; originally announced May 2024.

Comments: MICCAI-2024 early accept. Our code is available at https://github.com/YanKong0408/Gaze-DETR

arXiv:2405.09220 [pdf, other]

ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models

Authors: Siwei Wang, Yifei Shen, Shi Feng, Haoran Sun, Shang-Hua Teng, Wei Chen

Abstract: In this paper, we present the findings of our Project ALPINE which stands for "Autoregressive Learning for Planning In NEtworks." Project ALPINE initiates a theoretical investigation into the development of planning capabilities in Transformer-based language models through their autoregressive learning mechanisms, aiming to identify any potential limitations in their planning abilities. We abstract planning as a network path-finding task where the objective is to generate a valid path from a specified source node to a designated target node. In terms of expressiveness, we show that the Transformer is capable of executing path-finding by embedding the adjacency and reachability matrices within its weights. Our theoretical analysis of the gradient-based learning dynamic of the Transformer reveals that the Transformer is capable of learning both the adjacency matrix and a limited form of the reachability matrix. These theoretical insights are then validated through experiments, which demonstrate that the Transformer indeed learns the adjacency matrix and an incomplete reachability matrix, which aligns with the predictions made in our theoretical analysis. Additionally, when applying our methodology to a real-world planning benchmark, called Blocksworld, our observations remain consistent. Our theoretical and empirical analyses further unveil a potential limitation of Transformer in path-finding: it cannot identify reachability relationships through transitivity, and thus would fail when path concatenation is needed to generate a path. In summary, our findings shed new light on how the internal mechanisms of autoregressive learning enable planning in networks. This study may contribute to our understanding of the general planning capabilities in other related domains.

Submitted 27 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

arXiv:2405.09171 [pdf, other]

doi 10.1109/ICASSP48485.2024.10445996

Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis

Authors: Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li

Abstract: It remains a challenge to effectively control the emotion rendering in text-to-speech (TTS) synthesis. Prior studies have primarily focused on learning a global prosodic representation at the utterance level, which strongly correlates with linguistic prosody. Our goal is to construct a hierarchical emotion distribution (ED) that effectively encapsulates intensity variations of emotions at various levels of granularity, encompassing phonemes, words, and utterances. During TTS training, the hierarchical ED is extracted from the ground-truth audio and guides the predictor to establish a connection between emotional and linguistic prosody. At run-time inference, the TTS model generates emotional speech and, at the same time, provides quantitative control of emotion over the speech constituents. Both objective and subjective evaluations validate the effectiveness of the proposed framework in terms of emotion prediction and control.

Submitted 15 May, 2024; originally announced May 2024.

Comments: This is accepted to IEEE ICASSP 2024

arXiv:2405.09163 [pdf, other]

DVS-RG: Differential Variable Speed Limits Control using Deep Reinforcement Learning with Graph State Representation

Authors: Jingwen Yang, ** Wang, Fatemeh Golpayegani, Shen Wang

Abstract: Variable speed limit (VSL) control is an established yet challenging problem to improve freeway traffic mobility and alleviate bottlenecks by customizing speed limits at proper locations based on traffic conditions. Recent advances in deep reinforcement learning (DRL) have shown promising results in solving VSL control problems by interacting with sophisticated environments. However, the modeling of these methods ignores the inherent graph structure of the traffic state which can be a key factor for more efficient VSL control. Graph structure can capture not only the static spatial features but also the dynamic temporal features of traffic. Therefore, we propose the DVS-RG: DRL-based differential variable speed limit controller with graph state representation. DVS-RG provides distinct speed limits per lane in different locations dynamically. The road network topology and traffic information (e.g., occupancy, speed) are integrated as the state space of DVS-RG so that the spatial features can be learned. The normalization reward which combines efficiency and safety is used to train the VSL controller to avoid excessive inefficiencies or low safety. The results obtained from the simulation study on SUMO show that DRL-RG achieves higher traffic efficiency (the average waiting time reduced to 68.44\%) and improves the safety measures (the number of potential collisions reduced by 15.93\%) compared to state-of-the-art DRL methods.

Submitted 15 May, 2024; originally announced May 2024.

arXiv:2405.09066 [pdf, other]

Search for the leptonic decays $D^{*+}\to e^+ν_e$ and $D^{*+}\to μ^+ν_μ$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, M. Albrecht, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, V. Batozskaya, D. Becker, K. Begzsuren, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, J. Bloms, A. Bortone, I. Boyko , et al. (559 additional authors not shown)

Abstract: We present the first search for the leptonic decays $D^{*+}\to e^+ν_e$ and $D^{*+}\to μ^+ν_μ$ by analyzing a data sample of electron-positron collisions recorded with the BESIII detector at center-of-mass energies between 4.178 and 4.226 GeV, corresponding to an integrated luminosity of 6.32~fb$^{-1}$. No significant signal is observed. The upper limits on the branching fractions for $D^{*+}\to e^+ν_e$ and $D^{*+}\to μ^+ν_μ$ are set to be $1.1 \times 10^{-5}$ and $4.3 \times 10^{-6}$ at 90\% confidence level, respectively.

Submitted 14 May, 2024; originally announced May 2024.

Comments: 14 pages, 7 figures

arXiv:2405.08729 [pdf, other]

Targeted Augmentation for Low-Resource Event Extraction

Authors: Sijia Wang, Lifu Huang

Abstract: Addressing the challenge of low-resource information extraction remains an ongoing issue due to the inherent information scarcity within limited training examples. Existing data augmentation methods, considered potential solutions, struggle to strike a balance between weak augmentation (e.g., synonym augmentation) and drastic augmentation (e.g., conditional generation without proper guidance). This paper introduces a novel paradigm that employs targeted augmentation and back validation to produce augmented examples with enhanced diversity, polarity, accuracy, and coherence. Extensive experimental results demonstrate the effectiveness of the proposed paradigm. Furthermore, identified limitations are discussed, shedding light on areas for future improvement.

Submitted 14 May, 2024; originally announced May 2024.

Comments: 15 pages, NAACL 2024

arXiv:2405.08437 [pdf, other]

Stabilization and dynamics of magnetic antivortices in a nanodisk with anisotropic Dzyaloshinskii-Moriya interaction

Authors: Xin Hu, X. S. Wang, Zhenyu Wang

Abstract: We theoretically investigate the antivortex stabilized by anisotropic Dzyaloshinskii-Moriya interaction (DMI) in nanodisks. Remarkably, we find that the antivortex remains stable even when the nanodisk radius is reduced to 15 nm, owing to the short-range nature of the DMI. We also investigate the antivortex dynamics under a static in-plane magnetic field, which shows that the displacement of the antivortex core depends on its vorticity and helicity, providing a fundamental basis for distinguishing different vortex types. Additionally, spin-polarized currents can trigger a self-sustained gyration of the antivortex at low current densities, while inducing polarity switching at high current densities. Our findings offer valuable insights into the DMI role in stabilizing topological solitons and their potential applications in spin-torque nano-oscillators and magnetic memories.

Submitted 14 May, 2024; originally announced May 2024.

Comments: 8 pages and 7 figures

arXiv:2405.07892 [pdf, other]

All Nodes are created Not Equal: Node-Specific Layer Aggregation and Filtration for GNN

Authors: Shilong Wang, Hao Wu, Yifan Duan, Guibin Zhang, Guohao Li, Yuxuan Liang, Shirui Pan, Kun Wang, Yang Wang

Abstract: The ever-designed Graph Neural Networks, though opening a promising path for the modeling of graph-structured data, unfortunately introduce two daunting obstacles to their deployment on devices. (I) Most existing GNNs are shallow, due mostly to the over-smoothing and gradient-vanishing problems that arise as they go deeper as convolutional architectures. (II) The vast majority of GNNs adhere to the homophily assumption, where the central node and its adjacent nodes share the same label. This assumption often poses challenges for many GNNs working with heterophilic graphs. Addressing the aforementioned issues has become a looming challenge in enhancing the robustness and scalability of GNN applications. In this paper, we take a comprehensive and systematic approach to overcoming the two aforementioned challenges for the first time. We propose a Node-Specific Layer Aggregation and Filtration architecture, termed NoSAF, a framework capable of filtering and processing information from each individual node. NoSAF introduces the concept of "All Nodes are Created Not Equal" into every layer of deep networks, aiming to provide a reliable information filter for each layer's nodes to sieve out information beneficial for the subsequent layer. By incorporating a dynamically updated codebank, NoSAF dynamically optimizes the information passed downwards at each layer. This effectively overcomes heterophilic issues and aids in deepening the network. To compensate for the information loss caused by the continuous filtering in NoSAF, we also propose NoSAF-D (Deep), which incorporates a compensation mechanism that replenishes information in every layer of the model, allowing NoSAF to perform meaningful computations even in very deep layers.

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.07741 [pdf, other]

Search for the radiative transition $χ_{c1}(3872)\toγψ_2(3823)$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (635 additional authors not shown)

Abstract: Using 9.0 $\rm fb^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies from 4.178 to 4.278 GeV with the BESIII detector at the BEPCII collider, we perform the first search for the radiative transition $χ_{c1}(3872)\toγψ_2(3823)$. No $χ_{c1}(3872)\toγψ_2(3823)$ signal is observed. The upper limit on the ratio of branching fractions $\mathcal{B}(χ_{c1}(3872)\toγψ_2(3823), ψ_2(3823)\toγχ_{c1})/\mathcal{B}(χ_{c1}(3872)\toπ^+π^- J/ψ)$ is set as 0.075 at the 90\% confidence level. Our result contradicts theoretical predictions under the assumption that the $χ_{c1}(3872)$ is the pure charmonium state $χ_{c1}(2P)$.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 8 pages, 2 figures

arXiv:2405.07559 [pdf, other]

doi 10.1007/s10909-024-03131-z

Preliminary Design of Detector Assembly for DIXE

Authors: Jiejia Liu, Sifan Wang, Hai **, Qian Wang, Wei Cui

Abstract: Diffuse X-ray Explorer (DIXE) is a proposed X-ray spectroscopic survey experiment for the China Space Station. Its detector assembly (DA) contains the transition edge sensor (TES) microcalorimeter and readout electronics based on the superconducting quantum interference device (SQUID) on the cold stage. The cold stage is thermally connected to the ADR stage, and a Kevlar suspension is used to stabilize and isolate it from the 4 K environment. TES and SQUID are both sensitive to the magnetic field, so a hybrid shielding structure consisting of an outer Cryoperm shield and an inner niobium shield is used to attenuate the magnetic field. In addition, IR/optical/UV photons can produce shot noise and thus degrade the energy resolution of the TES microcalorimeter. A blocking filter assembly is designed to minimize the effects. In it, five filters are mounted at different temperature stages, reducing the probability of IR/optical/UV photons reaching the detector through multiple reflections between filters and absorption. This paper will describe the preliminary design of the detector assembly and its optimization.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 13 pages, 6 figures. Submitted version, the full version is published by Journal of Low Temperature Physics

arXiv:2405.07556 [pdf]

Safety-Aware Human-Lead Vehicle Platooning by Proactively Reacting to Uncertain Human Behaving

Authors: Jia Hu, Shuhan Wang, Yiming Zhang, Haoran Wang

Abstract: Human-Lead Cooperative Adaptive Cruise Control (HL-CACC) is regarded as a promising vehicle platooning technology in real-world implementation. By utilizing a Human-driven Vehicle (HV) as the platoon leader, HL-CACC reduces the cost and enhances the reliability of perception and decision-making. However, state-of-the-art HL-CACC technology still has a great limitation on driving safety because it does not consider the leading human driver's uncertain behavior. In this study, an HL-CACC controller is designed based on Stochastic Model Predictive Control (SMPC). It is enabled to predict the driving intention of the leading Connected Human-Driven Vehicle (CHV). The proposed controller has the following features: i) enhanced perceived safety in oscillating traffic; ii) guaranteed safety against hard brakes; iii) computational efficiency for real-time implementation. The proposed controller is evaluated on a PreScan&Simulink simulation platform. Real vehicle trajectory data is collected for the calibration of simulation. Results reveal that the proposed controller: i) improves perceived safety by 19.17% in oscillating traffic; ii) enhances actual safety by 7.76% against hard brake; iii) is confirmed with string stability. The computation time is approximately 3 milliseconds when running on a laptop equipped with an Intel i5-13500H CPU. This indicates the proposed controller is ready for real-time implementation.

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.07485 [pdf, other]

The Energy Sources, the Physical Properties, and the Mass-loss History of SN 2017dio

Authors: Deng-Wang Shi, Shan-Qin Wang, Wen-Pei Gan, En-Wei Liang

Abstract: We study the energy sources, the physical properties of the ejecta and the circumstellar medium (CSM), as well as the mass-loss history of the progenitor of SN 2017dio which is a broad-lined Ic (Ic-BL) supernova (SN) having unusual light curves (LCs) and signatures of hydrogen-rich CSM in its early spectrum. We find that the temperature of SN 2017dio began to increase linearly about 20 days after the explosion. We use the $^{56}$Ni plus the ejecta-CSM interaction (CSI) model to fit the LCs of SN 2017dio, finding that the masses of the ejecta, the $^{56}$Ni, and the CSM are $\sim$ 12.41 M$_\odot$, $\sim$ 0.17 M$_\odot$, and $\sim$ 5.82 M$_\odot$, respectively. The early-time photosphere velocity and the kinetic energy of the SN are respectively $\sim$ 1.89 $\times 10^4$ km s$^{-1}$ and $\sim$ 2.66 $\times 10^{52}$ erg, which are respectively comparable to those of SNe Ic-BL and hypernovae (HNe). We suggest that the CSM of SN 2017dio might be from a luminous-blue-variable-like outburst or pulsational pair instability $\sim$ 1.2$-$11.4 yr prior to the SN explosion, or binary mass transfer. Moreover, we find that its ejecta mass is larger than those of many SNe Ic-BL, and that its $^{56}$Ni mass ($M_{\rm Ni}$) is approximately equal to the mean (or median) value of $M_{\rm Ni}$ of SNe Ic-BL in the literature, but lower than $M_{\rm Ni}$ of prototype HNe (e.g., SN 1998bw and SN 2003dh).

Submitted 13 May, 2024; originally announced May 2024.

Comments: Accepted for publication in ApJ, 17 pages, 4 figures, 3 tables

arXiv:2405.07367

TOI-2447 b / NGTS-29 b: a 69-day Saturn around a Solar analogue

Authors: Samuel Gill, Daniel Bayliss, Solène Ulmer-Moll, Peter J. Wheatley, Rafael Brahm, David R. Anderson, David Armstrong, Ioannis Apergis, Douglas R. Alves, Matthew R. Burleigh, R. P. Butler, François Bouchy, Matthew P. Battley, Edward M. Bryant, Allyson Bieryla, Jeffrey D. Crane, Karen A. Collins, Sarah L. Casewell, Ilaria Carleo, Alastair B. Claringbold, Paul A. Dalba, Diana Dragomir, Philipp Eigmüller, Jan Eberhardt, Michael Fausnaugh, et al. (41 additional authors not shown)

Abstract: Discovering transiting exoplanets with relatively long orbital periods ($>$10 days) is crucial to facilitate the study of cool exoplanet atmospheres ($T_{\rm eq} < 700$ K) and to understand exoplanet formation and inward migration further out than typical transiting exoplanets. In order to discover these longer period transiting exoplanets, long-term photometric and radial velocity campaigns are required. We report the discovery of TOI-2447 b ($=$ NGTS-29 b), a Saturn-mass transiting exoplanet orbiting a bright (T=10.0) Solar-type star (T$_{\rm eff}$=5730 K). TOI-2447 b was identified as a transiting exoplanet candidate from a single transit event of 1.3% depth and 7.29 h duration in $TESS$ Sector 31 and a prior transit event from 2017 in NGTS data. Four further transit events were observed with NGTS photometry, which revealed an orbital period of P=69.34 days. The transit events establish a radius for TOI-2447 b of $0.865 \pm 0.010\rm R_{\rm J}$, while radial velocity measurements give a mass of $0.386 \pm 0.025 \rm M_{\rm J}$. The equilibrium temperature of the planet is $414$ K, making it much cooler than the majority of $TESS$ planet discoveries. We also detect a transit signal in NGTS data not caused by TOI-2447 b, along with transit timing variations and evidence for a $\sim$150 day signal in radial velocity measurements. It is likely that the system hosts additional planets, but further photometry and radial velocity campaigns will be needed to determine their parameters with confidence. TOI-2447 b/NGTS-29 b joins a small but growing population of cool giants that will provide crucial insights into giant planet composition and formation mechanisms.

Submitted 12 May, 2024; originally announced May 2024.

Comments: 16 pages, 12 figures. Accepted for publication in MNRAS
