Search | arXiv e-print repository

Non-contact excitation of multi-GHz lithium niobate electromechanical resonators

Authors: Danqing Wang, Jiacheng Xie, Yu Guo, Mohan Shen, Hong X. Tang

Abstract: The demand for high-performance electromechanical resonators is ever-growing across diverse applications, ranging from sensing and time-kee** to advanced communication devices. Among the electromechanical materials being explored, thin-film lithium niobate stands out for its strong piezoelectric properties and low acoustic loss. However, in nearly all existing lithium niobate electromechanical d… ▽ More The demand for high-performance electromechanical resonators is ever-growing across diverse applications, ranging from sensing and time-kee** to advanced communication devices. Among the electromechanical materials being explored, thin-film lithium niobate stands out for its strong piezoelectric properties and low acoustic loss. However, in nearly all existing lithium niobate electromechanical devices, the configuration is such that the electrodes are in direct contact with the mechanical resonator. This configuration introduces an undesirable mass-loading effect, giving rise to spurious modes and additional dam**. Here, we present an electromechanical platform that mitigates this challenge by leveraging a flip-chip bonding technique to separate the electrodes from the mechanical resonator. By offloading the electrodes from the resonator, our approach yields a substantial increase in the quality factor of these resonators, paving the way for enhanced performance and reliability for their device applications. △ Less

Submitted 7 July, 2024; originally announced July 2024.

Comments: 6 pages, 4 figures

arXiv:2407.04188 [pdf]

Kappa-tail technique: Modeling and application to Solar Energetic Particles observed by Parker Solar Probe

Authors: G. Livadiotis, A. T. Cummings, M. E. Cuesta, R. Bandyopadhyay, H. A. Farooki, L. Y. Khoo, D. J. McComas, J. S. Rankin, T. Sharma, M. M. Shen, C. M. S. Cohen, G. D. Muro, Z. Xu

Abstract: We develop the kappa-tail fitting technique, which analyzes observations of power-law tails of distributions and energy-flux spectra and connects them to theoretical modeling of kappa distributions, to determine the thermodynamics of the examined space plasma. In particular, we (i) construct the associated mathematical formulation, (ii) prove its decisive lead for determining whether the observed… ▽ More We develop the kappa-tail fitting technique, which analyzes observations of power-law tails of distributions and energy-flux spectra and connects them to theoretical modeling of kappa distributions, to determine the thermodynamics of the examined space plasma. In particular, we (i) construct the associated mathematical formulation, (ii) prove its decisive lead for determining whether the observed power-law is associated with kappa distributions; and (iii) provide a validation of the technique using pseudo-observations of typical input plasma parameters. Then, we apply this technique to a case-study by determining the thermodynamics of solar energetic particle (SEP) protons, for a SEP event observed on April 17, 2021, by the PSP/ISOIS instrument suite onboard PSP. The results show SEP temperatures and densities of the order of $\sim 1$ MeV and $ \sim 5 \cdot 10^{-7} $ cm$^{-3}$, respectively. △ Less

Submitted 4 July, 2024; originally announced July 2024.

arXiv:2406.17960 [pdf, other]

MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation

Authors: Liuyi Wang, Zongtao He, Mengjiao Shen, **gwei Yang, Chengju Liu, Qijun Chen

Abstract: Despite the remarkable developments of recent large models in Embodied Artificial Intelligence (E-AI), their integration into robotics is hampered by their excessive parameter sizes and computational demands. Towards the Vision-and-Language Navigation (VLN) task, a core task in E-AI, this paper reveals the great potential of using knowledge distillation for obtaining lightweight student models by… ▽ More Despite the remarkable developments of recent large models in Embodied Artificial Intelligence (E-AI), their integration into robotics is hampered by their excessive parameter sizes and computational demands. Towards the Vision-and-Language Navigation (VLN) task, a core task in E-AI, this paper reveals the great potential of using knowledge distillation for obtaining lightweight student models by proposing a Meta-Ability Guided Interactive Chain-of-distillation (MAGIC) method. Specifically, a Meta-Ability Knowledge Distillation (MAKD) framework is proposed for decoupling and refining the necessary meta-abilities of VLN agents. A Meta-Knowledge Randomization Weighting (MKRW) and a Meta-Knowledge Transferable Determination (MKTD) module are incorporated to dynamically adjust aggregation weights at the meta-ability and sample levels, respectively. Move beyond the traditional one-step unidirectional distillation, an Interactive Chain-of-Distillation (ICoD) learning strategy is proposed to allow students to give feedback to teachers, forming a new multi-step teacher-student co-evolution pipeline. Remarkably, on the R2R test unseen public leaderboard, our smallest model, MAGIC-S, with only 5% (11M) of the teacher's size, outperforms all previous methods under the same training data. Additionally, our largest model, MAGIC-L, surpasses the previous state-of-the-art by 5.84% in SPL and 3.18% in SR. Furthermore, a new dataset was collected and annotated from our living environments, where MAGIC-S demonstrated superior performance and real-time efficiency. Our code is publicly available on https://github.com/CrystalSixone/VLN-MAGIC. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.12079 [pdf, other]

Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint

Authors: Xinglong Sun, Barath Lakshmanan, Maying Shen, Shiyi Lan, **gde Chen, Jose Alvarez

Abstract: As we push the boundaries of performance in various vision tasks, the models grow in size correspondingly. To keep up with this growth, we need very aggressive pruning techniques for efficient inference and deployment on edge devices. Existing pruning approaches are limited to channel pruning and struggle with aggressive parameter reductions. In this paper, we propose a novel multi-dimensional pru… ▽ More As we push the boundaries of performance in various vision tasks, the models grow in size correspondingly. To keep up with this growth, we need very aggressive pruning techniques for efficient inference and deployment on edge devices. Existing pruning approaches are limited to channel pruning and struggle with aggressive parameter reductions. In this paper, we propose a novel multi-dimensional pruning framework that jointly optimizes pruning across channels, layers, and blocks while adhering to latency constraints. We develop a latency modeling technique that accurately captures model-wide latency variations during pruning, which is crucial for achieving an optimal latency-accuracy trade-offs at high pruning ratio. We reformulate pruning as a Mixed-Integer Nonlinear Program (MINLP) to efficiently determine the optimal pruned structure with only a single pass. Our extensive results demonstrate substantial improvements over previous methods, particularly at large pruning ratios. In classification, our method significantly outperforms prior art HALP with a Top-1 accuracy of 70.0(v.s. 68.6) and an FPS of 5262 im/s(v.s. 4101 im/s). In 3D object detection, we establish a new state-of-the-art by pruning StreamPETR at a 45% pruning ratio, achieving higher FPS (37.3 vs. 31.7) and mAP (0.451 vs. 0.449) than the dense baseline. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: Under Review

arXiv:2406.04484 [pdf, ps, other]

Step Out and Seek Around: On Warm-Start Training with Incremental Data

Authors: Maying Shen, Hongxu Yin, Pavlo Molchanov, Lei Mao, Jose M. Alvarez

Abstract: Data often arrives in sequence over time in real-world deep learning applications such as autonomous driving. When new training data is available, training the model from scratch undermines the benefit of leveraging the learned knowledge, leading to significant training costs. Warm-starting from a previously trained checkpoint is the most intuitive way to retain knowledge and advance learning. How… ▽ More Data often arrives in sequence over time in real-world deep learning applications such as autonomous driving. When new training data is available, training the model from scratch undermines the benefit of leveraging the learned knowledge, leading to significant training costs. Warm-starting from a previously trained checkpoint is the most intuitive way to retain knowledge and advance learning. However, existing literature suggests that this warm-starting degrades generalization. In this paper, we advocate for warm-starting but step** out of the previous converging point, thus allowing a better adaptation to new data without compromising previous knowledge. We propose Knowledge Consolidation and Acquisition (CKCA), a continuous model improvement algorithm with two novel components. First, a novel feature regularization (FeatReg) to retain and refine knowledge from existing checkpoints; Second, we propose adaptive knowledge distillation (AdaKD), a novel approach to forget mitigation and knowledge transfer. We tested our method on ImageNet using multiple splits of the training data. Our approach achieves up to $8.39\%$ higher top1 accuracy than the vanilla warm-starting and consistently outperforms the prior art with a large margin. △ Less

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2406.03596 [pdf]

A Multivariate Equivalence Test Based on Mahalanobis Distance with a Data-Driven Margin

Authors: Chao Wang, Yu-Ting Weng, Shaobo Liu, Tengfei Li, Meiyu Shen, Yi Tsong

Abstract: Multivariate equivalence testing is needed in a variety of scenarios for drug development. For example, drug products obtained from natural sources may contain many components for which the individual effects and/or their interactions on clinical efficacy and safety cannot be completely characterized. Such lack of sufficient characterization poses a challenge for both generic drug developers to de… ▽ More Multivariate equivalence testing is needed in a variety of scenarios for drug development. For example, drug products obtained from natural sources may contain many components for which the individual effects and/or their interactions on clinical efficacy and safety cannot be completely characterized. Such lack of sufficient characterization poses a challenge for both generic drug developers to demonstrate and regulatory authorities to determine the sameness of a proposed generic product to its reference product. Another case is to ensure batch-to-batch consistency of naturally derived products containing a vast number of components, such as botanical products. The equivalence or sameness between products containing many components that cannot be individually evaluated needs to be studied in a holistic manner. Multivariate equivalence test based on Mahalanobis distance may be suitable to evaluate many variables holistically. Existing studies based on such method assumed either a predetermined constant margin, for which a consensus is difficult to achieve, or a margin derived from the data, where, however, the randomness is ignored during the testing. In this study, we propose a multivariate equivalence test based on Mahalanobis distance with a data-drive margin with the randomness in the margin considered. Several possible implementations are compared with existing approaches via extensive simulation studies. △ Less

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2406.01125 [pdf, other]

$Δ$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers

Authors: Pengtao Chen, Mingzhu Shen, Peng Ye, Jianjian Cao, Chongjun Tu, Christos-Savvas Bouganis, Yiren Zhao, Tao Chen

Abstract: Diffusion models are widely recognized for generating high-quality and diverse images, but their poor real-time performance has led to numerous acceleration works, primarily focusing on UNet-based structures. With the more successful results achieved by diffusion transformers (DiT), there is still a lack of exploration regarding the impact of DiT structure on generation, as well as the absence of… ▽ More Diffusion models are widely recognized for generating high-quality and diverse images, but their poor real-time performance has led to numerous acceleration works, primarily focusing on UNet-based structures. With the more successful results achieved by diffusion transformers (DiT), there is still a lack of exploration regarding the impact of DiT structure on generation, as well as the absence of an acceleration framework tailored to the DiT architecture. To tackle these challenges, we conduct an investigation into the correlation between DiT blocks and image generation. Our findings reveal that the front blocks of DiT are associated with the outline of the generated images, while the rear blocks are linked to the details. Based on this insight, we propose an overall training-free inference acceleration framework $Δ$-DiT: using a designed cache mechanism to accelerate the rear DiT blocks in the early sampling stages and the front DiT blocks in the later stages. Specifically, a DiT-specific cache mechanism called $Δ$-Cache is proposed, which considers the inputs of the previous sampling image and reduces the bias in the inference. Extensive experiments on PIXART-$α$ and DiT-XL demonstrate that the $Δ$-DiT can achieve a $1.6\times$ speedup on the 20-step generation and even improves performance in most cases. In the scenario of 4-step consistent model generation and the more challenging $1.12\times$ acceleration, our method significantly outperforms existing methods. Our code will be publicly available. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 12 pages, 6 figures, 6 tables

arXiv:2405.19626 [pdf, other]

Position: CXL Shared Memory Programming: Barely Distributed and Almost Persistent

Authors: Yi Xu, Suyash Mahar, Ziheng Liu, Mingyao Shen, Steven Swanson

Abstract: While Compute Express Link (CXL) enables support for cache-coherent shared memory among multiple nodes, it also introduces new types of failures--processes can fail before data does, or data might fail before a process does. The lack of a failure model for CXL-based shared memory makes it challenging to understand and mitigate these failures. To solve these challenges, in this paper, we describe… ▽ More While Compute Express Link (CXL) enables support for cache-coherent shared memory among multiple nodes, it also introduces new types of failures--processes can fail before data does, or data might fail before a process does. The lack of a failure model for CXL-based shared memory makes it challenging to understand and mitigate these failures. To solve these challenges, in this paper, we describe a model categorizing and handling the CXL-based shared memory's failures: data and process failures. Data failures in CXL-based shared memory render data inaccessible or inconsistent for a currently running application. We argue that such failures are unlike data failures in distributed storage systems and require CXL-specific handling. To address this, we look into traditional data failure mitigation techniques like erasure coding and replication and propose new solutions to better handle data failures in CXL-based shared memory systems. Next, we look into process failures and compare the failures and potential solutions with PMEM's failure model and programming solutions. We argue that although PMEM shares some of CXL's characteristics, it does not fully address CXL's volatile nature and low access latencies. Finally, taking inspiration from PMEM programming solutions, we propose techniques to handle these new failures. Thus, this paper is the first work to define the CXL-based shared memory failure model and propose tailored solutions that address challenges specific to CXL-based systems. △ Less

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.16590 [pdf, other]

Parker Solar Probe Observations of Energetic Particles in the Flank of a Coronal Mass Ejection Close to the Sun

Authors: N. A. Schwadron, Stuart D. Bale, J. Bonnell, A. Case, M. Shen, E. R. Christian, C. M. S. Cohen, A. J. Davis, M. I. Desai, K. Goetz, J. Giacalone, M. E. Hill, J. C. Kasper, K. Korreck, D. Larson, R. Livi, T. Lim, R. A. Leske, O. Malandraki, D. Malaspina, W. H. Matthaeus, D. J. McComas, R. L. McNutt Jr., R. A. Mewaldt, D. G. Mitchell , et al. (10 additional authors not shown)

Abstract: We present an event observed by Parker Solar Probe at $\sim$0.2 au on March 2, 2022 in which imaging and \emph{in situ} measurements coincide. During this event, PSP passed through structures on the flank of a streamer blowout CME including an isolated flux tube in front of the CME, a turbulent sheath, and the CME itself. Imaging observations and \emph{in situ} helicity and principal variance sign… ▽ More We present an event observed by Parker Solar Probe at $\sim$0.2 au on March 2, 2022 in which imaging and \emph{in situ} measurements coincide. During this event, PSP passed through structures on the flank of a streamer blowout CME including an isolated flux tube in front of the CME, a turbulent sheath, and the CME itself. Imaging observations and \emph{in situ} helicity and principal variance signatures consistently show the presence of flux ropes internal to the CME. In both the sheath, and the CME interval, the distributions are more isotropic, the spectra are softer, and the abundance ratios of Fe/O and He/H are lower than those in the isolated flux tube, and yet elevated relative to typical plasma and SEP abundances. These signatures in the sheath and the CME indicate that both flare populations and those from the plasma are accelerated to form the observed energetic particle enhancements. In contrast, the isolated flux tube shows large streaming, hard spectra and large Fe/O and He/H ratios, indicating flare sources. Energetic particle fluxes are most enhanced within the CME interval from suprathermal through energetic particle energies ($\sim$ keV to $>10$ MeV), indicating particle acceleration, and confinement local to the closed magnetic structure. The flux-rope morphology of the CME helps to enable local modulation and trap** of energetic particles, particularly along helicity channels and other plasma boundaries. Thus, the CME acts to build-up energetic particle populations, allowing them to be fed into subsequent higher energy particle acceleration throughout the inner heliosphere where a compression or shock forms on the CME front. △ Less

Submitted 26 May, 2024; originally announced May 2024.

Comments: 41 pages, 19 figures, In Press

Journal ref: Astrophysical Journal, 2024

arXiv:2405.16395 [pdf, other]

Daily Physical Activity Monitoring -- Adaptive Learning from Multi-source Motion Sensor Data

Authors: Haoting Zhang, Donglin Zhan, Yunduan Lin, **ghai He, Qing Zhu, Zuo-Jun Max Shen, Zeyu Zheng

Abstract: In healthcare applications, there is a growing need to develop machine learning models that use data from a single source, such as that from a wrist wearable device, to monitor physical activities, assess health risks, and provide immediate health recommendations or interventions. However, the limitation of using single-source data often compromises the model's accuracy, as it fails to capture the… ▽ More In healthcare applications, there is a growing need to develop machine learning models that use data from a single source, such as that from a wrist wearable device, to monitor physical activities, assess health risks, and provide immediate health recommendations or interventions. However, the limitation of using single-source data often compromises the model's accuracy, as it fails to capture the full scope of human activities. While a more comprehensive dataset can be gathered in a lab setting using multiple sensors attached to various body parts, this approach is not practical for everyday use due to the impracticality of wearing multiple sensors. To address this challenge, we introduce a transfer learning framework that optimizes machine learning models for everyday applications by leveraging multi-source data collected in a laboratory setting. We introduce a novel metric to leverage the inherent relationship between these multiple data sources, as they are all paired to capture aspects of the same physical activity. Through numerical experiments, our framework outperforms existing methods in classification accuracy and robustness to noise, offering a promising avenue for the enhancement of daily activity monitoring. △ Less

Submitted 25 May, 2024; originally announced May 2024.

arXiv:2405.13380 [pdf, other]

The Illusion of Anonymity: Uncovering the Impact of User Actions on Privacy in Web3 Social Ecosystems

Authors: Bin Wang, Tianjian Liu, Wenqi Wang, Yuan Weng, Chao Li, Guangquan Xu, Meng Shen, Sencun Zhu, Wei Wang

Abstract: The rise of Web3 social ecosystems signifies the dawn of a new chapter in digital interaction, offering significant prospects for user engagement and financial advancement. Nonetheless, this progress is shadowed by potential privacy concessions, especially as these platforms frequently merge with existing Web2.0 social media accounts, amplifying data privacy risks for users. In this study, we in… ▽ More The rise of Web3 social ecosystems signifies the dawn of a new chapter in digital interaction, offering significant prospects for user engagement and financial advancement. Nonetheless, this progress is shadowed by potential privacy concessions, especially as these platforms frequently merge with existing Web2.0 social media accounts, amplifying data privacy risks for users. In this study, we investigate the nuanced dynamics between user engagement on Web3 social platforms and the consequent privacy concerns. We scrutinize the widespread phenomenon of fabricated activities, which encompasses the establishment of bogus accounts aimed at mimicking popularity and the deliberate distortion of social interactions by some individuals to gain financial rewards. Such deceptive maneuvers not only distort the true measure of the active user base but also amplify privacy threats for all members of the user community. We also find that, notwithstanding their attempts to limit social exposure, users remain entangled in privacy vulnerabilities. The actions of those highly engaged users, albeit often a minority group, can inadvertently breach the privacy of the larger collective. By casting light on the delicate interplay between user engagement, financial motives, and privacy issues, we offer a comprehensive examination of the intrinsic challenges and hazards present in the Web3 social milieu. We highlight the urgent need for more stringent privacy measures and ethical protocols to navigate the complex web of social exchanges and financial ambitions in the rapidly evolving Web3. △ Less

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.07978 [pdf, other]

Unveiling the Pockels Coefficient of Ferroelectric Nitride ScAlN

Authors: Guangcanlan Yang, Haochen Wang, Sai Mu, Hao Xie, Tyler Wang, Chengxing He, Mohan Shen, Mengxia Liu, Chris G. Van de Walle, Hong X. Tang

Abstract: Nitride ferroelectrics have recently emerged as promising alternatives to oxide ferroelectrics due to their compatibility with mainstream semiconductor processing. ScAlN, in particular, has exhibited remarkable piezoelectric coupling strength ($K^2$) comparable to that of lithium niobate (LN), making it a valuable choice for RF filters in wireless communications. Recently, ScAlN has sparked intere… ▽ More Nitride ferroelectrics have recently emerged as promising alternatives to oxide ferroelectrics due to their compatibility with mainstream semiconductor processing. ScAlN, in particular, has exhibited remarkable piezoelectric coupling strength ($K^2$) comparable to that of lithium niobate (LN), making it a valuable choice for RF filters in wireless communications. Recently, ScAlN has sparked interest in its use for nanophotonic devices, chiefly due to its large bandgap facilitating operation in blue wavelengths coupled with promises of enhanced nonlinear optical properties such as a large second-order susceptibility ($χ^{(2)}$). It is still an open question whether ScAlN can outperform oxide ferroelectrics concerning the Pockels effect -- an electro-optic coupling extensively utilized in optical communications devices. In this paper, we present a comprehensive theoretical analysis and experimental demonstration of ScAlN's Pockels effect. Our findings reveal that the electro-optic coupling of ScAlN, despite being weak at low Sc concentration, may be significantly enhanced at high levels of Sc do**, which points the direction of continued research efforts to unlock the full potential of ScAlN. △ Less

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.07804 [pdf]

Multiple stochastic resonances and inverse stochastic resonances in asymmetric bistable system under the ultra-high frequency excitation

Authors: Cong Wang, Zhongqiu Wang, Jianhua Yang, Miguel A. F. Sanjuán, Gong Tao, Zhen Shan, Mengen Shen

Abstract: Ultra-high frequency linear frequency modulation (UHF-LFM) signal, as a kind of typical non-stationary signal, has been widely used in microwave radar and other fields, with advantages such as long transmission distance, strong anti-interference ability, and wide bandwidth. Utilizing optimal dynamics response has unique advantages in weak feature identification under strong background noise. We pr… ▽ More Ultra-high frequency linear frequency modulation (UHF-LFM) signal, as a kind of typical non-stationary signal, has been widely used in microwave radar and other fields, with advantages such as long transmission distance, strong anti-interference ability, and wide bandwidth. Utilizing optimal dynamics response has unique advantages in weak feature identification under strong background noise. We propose a new stochastic resonance method in an asymmetric bistable system with the time-varying parameter to handle this special non-stationary signal. Interestingly, the nonlinear response exhibits multiple stochastic resonances (MSR) and inverse stochastic resonances (ISR) under UHF-LFM signal excitation, and some resonance regions may deviate or collapse due to the influence of system asymmetry. In addition, we analyze the responses of each resonance region and the mechanism and evolution law of each resonance region in detail. Finally, we significantly expand the resonance region within the parameter range by optimizing the time scale, which verifies the effectiveness of the proposed time-varying scale method. The mechanism and evolution law of MSR and ISR will provide references for researchers in related fields. △ Less

Submitted 13 May, 2024; originally announced May 2024.

Comments: 23 pages, 13 figures

arXiv:2405.07303 [pdf, other]

Search for solar axions by Primakoff effect with the full dataset of the CDEX-1B Experiment

Authors: L. T. Yang, S. K. Liu, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (61 additional authors not shown)

Abstract: We present the first limit on $g_{Aγ}$ coupling constant using the Bragg-Primakoff conversion based on an exposure of 1107.5 kg days of data from the CDEX-1B experiment at the China **** Underground Laboratory. The data are consistent with the null signal hypothesis, and no excess signals are observed. Limits of the coupling $g_{Aγ}<2.08\times10^{-9}$ GeV$^{-1}$ (95\% C.L.) are derived for axio… ▽ More We present the first limit on $g_{Aγ}$ coupling constant using the Bragg-Primakoff conversion based on an exposure of 1107.5 kg days of data from the CDEX-1B experiment at the China **** Underground Laboratory. The data are consistent with the null signal hypothesis, and no excess signals are observed. Limits of the coupling $g_{Aγ}<2.08\times10^{-9}$ GeV$^{-1}$ (95\% C.L.) are derived for axions with mass up to 100 eV/$c^2$. Within the hadronic model of KSVZ, our results exclude axion mass $>5.3~\rm{eV}/c^2$ at 95\% C.L. △ Less

Submitted 12 May, 2024; originally announced May 2024.

Comments: 7 pages, 5 figures

arXiv:2405.05578 [pdf]

Sha** the Future of Urban Mobility: Insights into Autonomous Vehicle Acceptance in Shanghai Through TAM and Perceived Risk Analysis

Authors: Miaomiao Shen, Linxuan Yu, **g Xu, Zihao Sang, Ruijia Li, Xiang Yuan

Abstract: Autonomous vehicles (AVs) have begun experimental commercialization initiatives in places such as Shanghai, China, and it is a valuable research question whether people's willingness to use AVs has changed from the prior. This study explores Shanghai residents' attitudes towards AVs by applying the Technology Acceptance Model (TAM), the Perceived Risk (BAR) model, and introducing perceived externa… ▽ More Autonomous vehicles (AVs) have begun experimental commercialization initiatives in places such as Shanghai, China, and it is a valuable research question whether people's willingness to use AVs has changed from the prior. This study explores Shanghai residents' attitudes towards AVs by applying the Technology Acceptance Model (TAM), the Perceived Risk (BAR) model, and introducing perceived externalities as a new psychological variable. Through a survey in Shanghai, where AVs are operational, and structural equation modeling, it was found that perceived usefulness and ease of use positively influence willingness to use AVs, with perceived usefulness being the most significant factor. Perceived externalities have a positive impact, while perceived risk negatively affects willingness to use. Interestingly, ease of use increases perceived risk, but this is mitigated by the benefits perceived in usefulness. This research, differing significantly from previous studies, aims to guide government policy and industry strategies to enhance design, marketing, and popularization. △ Less

Submitted 9 May, 2024; originally announced May 2024.

arXiv:2405.01626 [pdf, other]

Mineral Detection of Neutrinos and Dark Matter 2024. Proceedings

Authors: Sebastian Baum, Patrick Huber, Patrick Stengel, Natsue Abe, Daniel G. Ang, Lorenzo Apollonio, Gabriela R. Araujo, Levente Balogh, Pranshu Bhaumik Yilda Boukhtouchen, Joseph Bramante, Lorenzo Caccianiga, Andrew Calabrese-Day, Qing Chang, Juan I. Collar, Reza Ebadi, Alexey Elykov, Katherine Freese, Audrey Fung, Claudio Galelli, Arianna E. Gleason, Mariano Guerrero Perez, Janina Hakenmüller, Takeshi Hanyu, Noriko Hasebe, Shigenobu Hirose , et al. (35 additional authors not shown)

Abstract: The second "Mineral Detection of Neutrinos and Dark Matter" (MDvDM'24) meeting was held January 8-11, 2024 in Arlington, VA, USA, hosted by Virginia Tech's Center for Neutrino Physics. This document collects contributions from this workshop, providing an overview of activities in the field. MDvDM'24 was the second topical workshop dedicated to the emerging field of mineral detection of neutrinos a… ▽ More The second "Mineral Detection of Neutrinos and Dark Matter" (MDvDM'24) meeting was held January 8-11, 2024 in Arlington, VA, USA, hosted by Virginia Tech's Center for Neutrino Physics. This document collects contributions from this workshop, providing an overview of activities in the field. MDvDM'24 was the second topical workshop dedicated to the emerging field of mineral detection of neutrinos and dark matter, following a meeting hosted by IFPU in Trieste, Italy in October 2022. Mineral detectors have been proposed for a wide variety of applications, including searching for dark matter, measuring various fluxes of astrophysical neutrinos over gigayear timescales, monitoring nuclear reactors, and nuclear disarmament protocols; both as paleo-detectors using natural minerals that could have recorded the traces of nuclear recoils for timescales as long as a billion years and as detectors recording nuclear recoil events on laboratory timescales using natural or artificial minerals. Contributions to this proceedings discuss the vast physics potential, the progress in experimental studies, and the numerous challenges lying ahead on the path towards mineral detection. These include a better understanding of the formation and annealing of recoil defects in crystals; identifying the best classes of minerals and, for paleo-detectors, understanding their geology; modeling and control of the relevant backgrounds; develo**, combining, and scaling up imaging and data analysis techniques; and many others. During the last years, MDvDM has grown rapidly and gained attention. Small-scale experimental efforts focused on establishing various microscopic readout techniques are underway at institutions in North America, Europe and Asia. We are looking ahead to an exciting future full of challenges to overcome, surprises to be encountered, and discoveries lying ahead of us. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: Summary and proceedings of the MDvDM'24 conference, Jan 8-11 2024

arXiv:2405.00284 [pdf, other]

Sub-terahertz optomechanics

Authors: Jiacheng Xie, Mohan Shen, Hong X. Tang

Abstract: We demonstrate optomechanics in the sub-terahertz regime. An optical racetrack resonator, patterned from thin-film lithium niobate, is suspended to support mechanical structures oscillating at these extremely high frequencies, which are read out through cavity optomechanical coupling. Our hybrid platform paves the way for advancing mechanical systems in the quantum regime at elevated temperatures. We demonstrate optomechanics in the sub-terahertz regime. An optical racetrack resonator, patterned from thin-film lithium niobate, is suspended to support mechanical structures oscillating at these extremely high frequencies, which are read out through cavity optomechanical coupling. Our hybrid platform paves the way for advancing mechanical systems in the quantum regime at elevated temperatures. △ Less

Submitted 30 April, 2024; originally announced May 2024.

arXiv:2404.18096 [pdf, other]

Snake with Shifted Window: Learning to Adapt Vessel Pattern for OCTA Segmentation

Authors: Xinrun Chen, Mei Shen, Haojian Ning, Mengzhan Zhang, Chengliang Wang, Shiying Li

Abstract: Segmenting specific targets or structures in optical coherence tomography angiography (OCTA) images is fundamental for conducting further pathological studies. The retinal vascular layers are rich and intricate, and such vascular with complex shapes can be captured by the widely-studied OCTA images. In this paper, we thus study how to use OCTA images with projection vascular layers to segment reti… ▽ More Segmenting specific targets or structures in optical coherence tomography angiography (OCTA) images is fundamental for conducting further pathological studies. The retinal vascular layers are rich and intricate, and such vascular with complex shapes can be captured by the widely-studied OCTA images. In this paper, we thus study how to use OCTA images with projection vascular layers to segment retinal structures. To this end, we propose the SSW-OCTA model, which integrates the advantages of deformable convolutions suited for tubular structures and the swin-transformer for global feature extraction, adapting to the characteristics of OCTA modality images. Our model underwent testing and comparison on the OCTA-500 dataset, achieving state-of-the-art performance. The code is available at: https://github.com/ShellRedia/Snake-SWin-OCTA. △ Less

Submitted 28 April, 2024; originally announced April 2024.

arXiv:2404.10241 [pdf, other]

Vision-and-Language Navigation via Causal Learning

Authors: Liuyi Wang, Zongtao He, Ronghao Dang, Mengjiao Shen, Chengju Liu, Qijun Chen

Abstract: In the pursuit of robust and generalizable environment perception and language understanding, the ubiquitous challenge of dataset bias continues to plague vision-and-language navigation (VLN) agents, hindering their performance in unseen environments. This paper introduces the generalized cross-modal causal transformer (GOAT), a pioneering solution rooted in the paradigm of causal inference. By de… ▽ More In the pursuit of robust and generalizable environment perception and language understanding, the ubiquitous challenge of dataset bias continues to plague vision-and-language navigation (VLN) agents, hindering their performance in unseen environments. This paper introduces the generalized cross-modal causal transformer (GOAT), a pioneering solution rooted in the paradigm of causal inference. By delving into both observable and unobservable confounders within vision, language, and history, we propose the back-door and front-door adjustment causal learning (BACL and FACL) modules to promote unbiased learning by comprehensively mitigating potential spurious correlations. Additionally, to capture global confounder features, we propose a cross-modal feature pooling (CFP) module supervised by contrastive learning, which is also shown to be effective in improving cross-modal representations during pre-training. Extensive experiments across multiple VLN datasets (R2R, REVERIE, RxR, and SOON) underscore the superiority of our proposed method over previous state-of-the-art approaches. Code is available at https://github.com/CrystalSixone/VLN-GOAT. △ Less

Submitted 15 April, 2024; originally announced April 2024.

arXiv:2404.09793 [pdf, other]

First Search for Light Fermionic Dark Matter Absorption on Electrons Using Germanium Detector in CDEX-10 Experiment

Authors: J. X. Liu, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar , et al. (61 additional authors not shown)

Abstract: We present the first results of the search for sub-MeV fermionic dark matter absorbed by electron targets of Germanium using the 205.4~kg$\cdot$day data collected by the CDEX-10 experiment, with the analysis threshold of 160~eVee. No significant dark matter (DM) signals over the background are observed. Results are presented as limits on the cross section of DM--electron interaction. We present ne… ▽ More We present the first results of the search for sub-MeV fermionic dark matter absorbed by electron targets of Germanium using the 205.4~kg$\cdot$day data collected by the CDEX-10 experiment, with the analysis threshold of 160~eVee. No significant dark matter (DM) signals over the background are observed. Results are presented as limits on the cross section of DM--electron interaction. We present new constraints of cross section in the DM range of 0.1--10 keV/$c^2$ for vector and axial-vector interaction. The upper limit on the cross section is set to be $\rm 5.5\times10^{-46}~cm^2$ for vector interaction, and $\rm 1.8\times10^{-46}~cm^2$ for axial-vector interaction at DM mass of 5 keV/$c^2$. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: 6 pages, 4 figures

arXiv:2404.09644 [pdf, ps, other]

doi 10.1109/LRA.2020.2972833

An Origami-Inspired Variable Friction Surface for Increasing the Dexterity of Robotic Grippers

Authors: Qiujie Lu, Angus B. Clark, Matthew Shen, Nicolas Rojas

Abstract: While the gras** capability of robotic grippers has shown significant development, the ability to manipulate objects within the hand is still limited. One explanation for this limitation is the lack of controlled contact variation between the grasped object and the gripper. For instance, human hands have the ability to firmly grip object surfaces, as well as slide over object faces, an aspect th… ▽ More While the gras** capability of robotic grippers has shown significant development, the ability to manipulate objects within the hand is still limited. One explanation for this limitation is the lack of controlled contact variation between the grasped object and the gripper. For instance, human hands have the ability to firmly grip object surfaces, as well as slide over object faces, an aspect that aids the enhanced manipulation of objects within the hand without losing contact. In this letter, we present a parametric, origami-inspired thin surface capable of transitioning between a high friction and a low friction state, suitable for implementation as an epidermis in robotic fingers. A numerical analysis of the proposed surface based on its design parameters, force analysis, and performance in in-hand manipulation tasks is presented. Through the development of a simple two-fingered two-degree-of-freedom gripper utilizing the proposed variable-friction surfaces with different parameters, we experimentally demonstrate the improved manipulation capabilities of the hand when compared to the same gripper without changeable friction. Results show that the pattern density and valley gap are the main parameters that effect the in-hand manipulation performance. The origami-inspired thin surface with a higher pattern density generated a smaller valley gap and smaller height change, producing a more stable improvement of the manipulation capabilities of the hand. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: 8 pages, 11 figures

Journal ref: IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 2538-2545, April 2020

arXiv:2404.04661 [pdf, other]

Transform then Explore: a Simple and Effective Technique for Exploratory Combinatorial Optimization with Reinforcement Learning

Authors: Tianle Pu, Changjun Fan, Mutian Shen, Yizhou Lu, Li Zeng, Zohar Nussinov, Chao Chen, Zhong Liu

Abstract: Many complex problems encountered in both production and daily life can be conceptualized as combinatorial optimization problems (COPs) over graphs. Recent years, reinforcement learning (RL) based models have emerged as a promising direction, which treat the COPs solving as a heuristic learning problem. However, current finite-horizon-MDP based RL models have inherent limitations. They are not all… ▽ More Many complex problems encountered in both production and daily life can be conceptualized as combinatorial optimization problems (COPs) over graphs. Recent years, reinforcement learning (RL) based models have emerged as a promising direction, which treat the COPs solving as a heuristic learning problem. However, current finite-horizon-MDP based RL models have inherent limitations. They are not allowed to explore adquately for improving solutions at test time, which may be necessary given the complexity of NP-hard optimization tasks. Some recent attempts solve this issue by focusing on reward design and state feature engineering, which are tedious and ad-hoc. In this work, we instead propose a much simpler but more effective technique, named gauge transformation (GT). The technique is originated from physics, but is very effective in enabling RL agents to explore to continuously improve the solutions during test. Morever, GT is very simple, which can be implemented with less than 10 lines of Python codes, and can be applied to a vast majority of RL models. Experimentally, we show that traditional RL models with GT technique produce the state-of-the-art performances on the MaxCut problem. Furthermore, since GT is independent of any RL models, it can be seamlessly integrated into various RL frameworks, paving the way of these models for more effective explorations in the solving of general COPs. △ Less

Submitted 6 April, 2024; originally announced April 2024.

arXiv:2404.03604 [pdf, other]

A Unified Algorithmic Framework for Dynamic Assortment Optimization under MNL Choice

Authors: Shuo Sun, Rajan Udwani, Zuo-Jun Max Shen

Abstract: We consider assortment and inventory planning problems with dynamic stockout-based substitution effects and no replenishment. We consider two settings: 1. Customers can see all available products when they arrive, which is commonly seen in physical stores. 2. The seller can choose to offer a subset of available products to each customer, which is typical on online platforms. Both settings are know… ▽ More We consider assortment and inventory planning problems with dynamic stockout-based substitution effects and no replenishment. We consider two settings: 1. Customers can see all available products when they arrive, which is commonly seen in physical stores. 2. The seller can choose to offer a subset of available products to each customer, which is typical on online platforms. Both settings are known to be computationally challenging, and the current approximation algorithms for the two settings are quite different. We develop a unified algorithm framework under the MNL choice model for both settings. Our algorithms improve on the state-of-the-art algorithms in terms of approximation guarantee, runtime, and the ability to manage uncertainty in the total number of customers and handle more complex constraints. In the process, we establish various novel properties of dynamic assortment planning (under the MNL choice) that may be useful more broadly. △ Less

Submitted 4 April, 2024; originally announced April 2024.

arXiv:2403.17152 [pdf, other]

Local magnetic response of superconducting Sr$\mathrm{_2}$RuO$\mathrm{_4}$ thin films and rings

Authors: G. M. Ferguson, Hari P. Nair, Nathaniel J. Schreiber, Ludi Miao, Kyle M. Shen, Darrell G. Schlom, Katja C. Nowack

Abstract: We conduct local magnetic measurements on superconducting thin-film samples of Sr$\mathrm{_2}$RuO$\mathrm{_4}$ using scanning Superconducting Quantum Interference Device (SQUID) susceptometry. From the diamagnetic response, we extract the magnetic penetration depth, $λ$, which exhibits a quadratic temperature dependence at low temperatures. Although a quadratic dependence in high-purity bulk sampl… ▽ More We conduct local magnetic measurements on superconducting thin-film samples of Sr$\mathrm{_2}$RuO$\mathrm{_4}$ using scanning Superconducting Quantum Interference Device (SQUID) susceptometry. From the diamagnetic response, we extract the magnetic penetration depth, $λ$, which exhibits a quadratic temperature dependence at low temperatures. Although a quadratic dependence in high-purity bulk samples has been attributed to non-local electrodynamics, our analysis suggests that in our thin-film samples the presence of scattering is the origin of the quadratic dependence. While we observe micron-scale variations in the diamagnetic response and superconducting transition temperature, the form of the temperature dependence of $λ$ is independent of position. Finally, we characterize flux trap** in superconducting rings lithographically fabricated from the thin films, paving the way to systematic device-based tests of the superconducting order parameter in Sr$\mathrm{_2}$RuO$\mathrm{_4}$. △ Less

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2403.15791 [pdf, other]

DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation

Authors: Mu-Yi Shen, Chia-Chi Hsu, Hao-Yu Hou, Yu-Chen Huang, Wei-Fang Sun, Chia-Che Chang, Yu-Lun Liu, Chun-Yi Lee

Abstract: In this study, we introduce the DriveEnv-NeRF framework, which leverages Neural Radiance Fields (NeRF) to enable the validation and faithful forecasting of the efficacy of autonomous driving agents in a targeted real-world scene. Standard simulator-based rendering often fails to accurately reflect real-world performance due to the sim-to-real gap, which represents the disparity between virtual sim… ▽ More In this study, we introduce the DriveEnv-NeRF framework, which leverages Neural Radiance Fields (NeRF) to enable the validation and faithful forecasting of the efficacy of autonomous driving agents in a targeted real-world scene. Standard simulator-based rendering often fails to accurately reflect real-world performance due to the sim-to-real gap, which represents the disparity between virtual simulations and real-world conditions. To mitigate this gap, we propose a workflow for building a high-fidelity simulation environment of the targeted real-world scene using NeRF. This approach is capable of rendering realistic images from novel viewpoints and constructing 3D meshes for emulating collisions. The validation of these capabilities through the comparison of success rates in both simulated and real environments demonstrates the benefits of using DriveEnv-NeRF as a real-world performance indicator. Furthermore, the DriveEnv-NeRF framework can serve as a training environment for autonomous driving agents under various lighting conditions. This approach enhances the robustness of the agents and reduces performance degradation when deployed to the target real scene, compared to agents fully trained using the standard simulator rendering pipeline. △ Less

Submitted 30 May, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

Comments: Project page: https://github.com/muyishen2040/DriveEnvNeRF

arXiv:2403.12807 [pdf, ps, other]

Freshness-aware Block Propagation Optimization in 6G-based Web 3.0: An Evolutionary Game Approach

Authors: **bo Wen, Jiawen Kang, Zehui Xiong, Hongyang Du, Zhaohui Yang, Dusit Niyato, Meng Shen, Yutao Jiao, Yang Zhang

Abstract: Driven by the aspiration to establish a decentralized digital economy, Web 3.0 is emerging as the fundamental technology for digital transformation. Incorporating the promising sixth-generation (6G) technology with large bandwidth and space-air-ground integrated coverage, 6G-based Web 3.0 holds great potential in empowering users with enhanced data control and facilitating secure peer-to-peer tran… ▽ More Driven by the aspiration to establish a decentralized digital economy, Web 3.0 is emerging as the fundamental technology for digital transformation. Incorporating the promising sixth-generation (6G) technology with large bandwidth and space-air-ground integrated coverage, 6G-based Web 3.0 holds great potential in empowering users with enhanced data control and facilitating secure peer-to-peer transactions, especially in consumer electronics, through the utilization of blockchain technologies. However, 6G-based Web 3.0 is still in its infancy, such as ensuring block freshness and optimizing block propagation to improve blockchain performance. In this paper, we develop a freshness-aware block propagation optimization framework for 6G-based Web 3.0. We first propose a novel metric called Age of Block Information (AoBI) based on the concept of age of information to quantify block freshness. To make block propagation optimization tractable, we classify miners into five different states and propose a block propagation model for public blockchains inspired by epidemic models. Moreover, considering that the miners are bounded rational, we propose an incentive mechanism based on the evolutionary game for block propagation to improve block propagation efficiency. Numerical results demonstrate that compared with other block propagation mechanisms, the proposed scheme has a higher block forwarding probability, which improves block propagation efficiency. △ Less

Submitted 19 March, 2024; originally announced March 2024.

arXiv:2403.08819 [pdf, other]

Thermometer: Towards Universal Calibration for Large Language Models

Authors: Maohao Shen, Subhro Das, Kristjan Greenewald, Prasanna Sattigeri, Gregory Wornell, Soumya Ghosh

Abstract: We consider the issue of calibration in large language models (LLM). Recent studies have found that common interventions such as instruction tuning often result in poorly calibrated LLMs. Although calibration is well-explored in traditional applications, calibrating LLMs is uniquely challenging. These challenges stem as much from the severe computational requirements of LLMs as from their versatil… ▽ More We consider the issue of calibration in large language models (LLM). Recent studies have found that common interventions such as instruction tuning often result in poorly calibrated LLMs. Although calibration is well-explored in traditional applications, calibrating LLMs is uniquely challenging. These challenges stem as much from the severe computational requirements of LLMs as from their versatility, which allows them to be applied to diverse tasks. Addressing these challenges, we propose THERMOMETER, a calibration approach tailored to LLMs. THERMOMETER learns an auxiliary model, given data from multiple tasks, for calibrating a LLM. It is computationally efficient, preserves the accuracy of the LLM, and produces better-calibrated responses for new tasks. Extensive empirical evaluations across various benchmarks demonstrate the effectiveness of the proposed method. △ Less

Submitted 27 June, 2024; v1 submitted 19 February, 2024; originally announced March 2024.

Comments: Camera ready version for ICML 2024

arXiv:2403.08230 [pdf, other]

A vicious cycle along busy bus corridors and how to abate it

Authors: Minyu Shen, Weihua Gu, Michael J. Cassidy, Yongjie Lin, Wei Ni

Abstract: We unveil that a previously-unreported vicious cycle can be created when bus queues form at curbside stops along a corridor. Buses caught in this cycle exhibit growing variation in headways as they travel from stop to stop. Bus (and patron) delays accumulate in like fashion and can grow large on long, busy corridors. We show that this damaging cycle can be abated in simple ways. Present solutions… ▽ More We unveil that a previously-unreported vicious cycle can be created when bus queues form at curbside stops along a corridor. Buses caught in this cycle exhibit growing variation in headways as they travel from stop to stop. Bus (and patron) delays accumulate in like fashion and can grow large on long, busy corridors. We show that this damaging cycle can be abated in simple ways. Present solutions entail holding buses at a corridor entrance and releasing them as per various strategies proposed in the literature. We introduce a modest variant to the simplest of these strategies. It releases buses at headways that are slightly less than, or equal to, the scheduled values. It turns out that periodically releasing buses at slightly smaller headways can substantially reduce bus delays caused by holding so that benefits can more readily outweigh costs in corridors that contain a sufficient number of serial bus stops. The simple variant is shown to perform about as well as, or better than, other bus-holding strategies in terms of saving delays, and is more effective than other strategies in regularizing bus headways. We also show that grou** buses from across multiple lines and holding them by group can be effective when patrons have the flexibility to choose buses from across all lines in a group. Findings come by formulating select models of bus-corridor dynamics and using these to simulate part of the Bus Rapid Transit corridor in Guangzhou, China. △ Less

Submitted 13 March, 2024; originally announced March 2024.

arXiv:2402.19454 [pdf, ps, other]

Controllable suppression of the unconventional superconductivity in bulk and thin-film Sr$_{2}$RuO$_{4}$ via high-energy electron irradiation

Authors: Jacob P. Ruf, Hilary M. L. Noad, Romain Grasset, Ludi Miao, Elina Zhakina, Philippa H. McGuinness, Hari P. Nair, Nathaniel J. Schreiber, Naoki Kikugawa, Dmitry Sokolov, Marcin Konczykowski, Darrell G. Schlom, Kyle M. Shen, Andrew P. Mackenzie

Abstract: In bulk Sr$_{2}$RuO$_{4}$, the strong sensitivity of the superconducting transition temperature $T_{\text{c}}$ to nonmagnetic impurities provides robust evidence for a superconducting order parameter that changes sign around the Fermi surface. In superconducting epitaxial thin-film Sr$_{2}$RuO$_{4}$, the relationship between $T_{\text{c}}$ and the residual resistivity $ρ_0$, which in bulk samples… ▽ More In bulk Sr$_{2}$RuO$_{4}$, the strong sensitivity of the superconducting transition temperature $T_{\text{c}}$ to nonmagnetic impurities provides robust evidence for a superconducting order parameter that changes sign around the Fermi surface. In superconducting epitaxial thin-film Sr$_{2}$RuO$_{4}$, the relationship between $T_{\text{c}}$ and the residual resistivity $ρ_0$, which in bulk samples is taken to be a proxy for the low-temperature elastic scattering rate, is far less clear. Using high-energy electron irradiation to controllably introduce point disorder into bulk single-crystal and thin-film Sr$_{2}$RuO$_{4}$, we show that $T_{\text{c}}$ is suppressed in both systems at nearly identical rates. This suggests that part of $ρ_0$ in films comes from defects that do not contribute to superconducting pairbreaking, and establishes a quantitative link between the superconductivity of bulk and thin-film samples. △ Less

Submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.19208 [pdf, other]

Deciphering the Belle II data on $B\to K ν\barν$ decay in the (dark) SMEFT with minimal flavour violation

Authors: Biao-Feng Hou, Xin-Qiang Li, Meng Shen, Ya-Dong Yang, Xing-Bo Yuan

Abstract: Recently, the Belle II collaboration announced the first measurement of $\mathcal B(B^+\to K^+ν\barν)$, which is found to be about $2.7σ$ higher than the SM prediction. We decipher the data with two new physics scenarios: the underlying $b\to s ν\barν$ transition is, besides the SM contribution, further affected by heavy new mediators that are much heavier than the EW scale, or amended by an addit… ▽ More Recently, the Belle II collaboration announced the first measurement of $\mathcal B(B^+\to K^+ν\barν)$, which is found to be about $2.7σ$ higher than the SM prediction. We decipher the data with two new physics scenarios: the underlying $b\to s ν\barν$ transition is, besides the SM contribution, further affected by heavy new mediators that are much heavier than the EW scale, or amended by an additional decay channel with undetected light final states like dark matter or axion-like particles. These two scenarios can be most conveniently analyzed in the SMEFT and the dark SMEFT (DSMEFT) framework, respectively. We consider the flavour structures of the resulting effective operators to be either generic or satisfy the minimal flavour violation (MFV) hypothesis, both for the quark and lepton sectors. In the first scenario, once the MFV is assumed, only one SM-like low-energy effective operator induced by the SMEFT dim-6 operators can account for the Belle II excess, the parameter space of which is, however, excluded by the Belle upper bound on $\mathcal B(B^0\to K^{*0}ν\barν)$. In the second scenario, it is found that the Belle II excess can be accommodated by 22 of the DSMEFT operators involving one or two scalar, fermionic, or vector dark matters as well as ALPs. These operators also receive dominant constraints from the $B^0\to K^{*0}+$inv and $B_s\to$inv decays. In the MFV hypothesis, the number of viable operators is reduced to 14, and the $B^+\toπ^+ +$inv and $K^+\toπ^++$inv decays start to put further constraints. Within the parameter space allowed by all the current experimental data, the $q^2$ distributions (and $F_L$) of the $B\to K^{(*)}+$inv decays are studied for each viable operator. In addition, we, for the first time, calculate systematically the longitudinal polarization fraction $F_L$ of $K^*$ in the $B\to K^*+$inv decays within the DLEFT. △ Less

Submitted 14 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

Comments: 51 pages, 13 figures, comments welcome; v2: discussions on $F_L$ of the $B \to K^* + \text{inv}$ decay in the DSMEFT added, errors in the $B \to V + \text{inv}$ decays for some operators involving two scalar or vector DM fields fixed, the related discussions and figures 2, 6, 7, 11 and 12 corrected, main conclusion unchanged, a few comments and refs added

arXiv:2402.13033 [pdf, other]

Enhancing Real-World Complex Network Representations with Hyperedge Augmentation

Authors: Xiangyu Zhao, Zehui Li, Mingzhu Shen, Guy-Bart Stan, Pietro Liò, Yiren Zhao

Abstract: Graph augmentation methods play a crucial role in improving the performance and enhancing generalisation capabilities in Graph Neural Networks (GNNs). Existing graph augmentation methods mainly perturb the graph structures and are usually limited to pairwise node relations. These methods cannot fully address the complexities of real-world large-scale networks that often involve higher-order node r… ▽ More Graph augmentation methods play a crucial role in improving the performance and enhancing generalisation capabilities in Graph Neural Networks (GNNs). Existing graph augmentation methods mainly perturb the graph structures and are usually limited to pairwise node relations. These methods cannot fully address the complexities of real-world large-scale networks that often involve higher-order node relations beyond only being pairwise. Meanwhile, real-world graph datasets are predominantly modelled as simple graphs, due to the scarcity of data that can be used to form higher-order edges. Therefore, reconfiguring the higher-order edges as an integration into graph augmentation strategies lights up a promising research path to address the aforementioned issues. In this paper, we present Hyperedge Augmentation (HyperAug), a novel graph augmentation method that constructs virtual hyperedges directly form the raw data, and produces auxiliary node features by extracting from the virtual hyperedge information, which are used for enhancing GNN performances on downstream tasks. We design three diverse virtual hyperedge construction strategies to accompany the augmentation scheme: (1) via graph statistics, (2) from multiple data perspectives, and (3) utilising multi-modality. Furthermore, to facilitate HyperAug evaluation, we provide 23 novel real-world graph datasets across various domains including social media, biology, and e-commerce. Our empirical study shows that HyperAug consistently and significantly outperforms GNN baselines and other graph augmentation methods, across a variety of application contexts, which clearly indicates that it can effectively incorporate higher-order node relations into graph augmentation methods for real-world complex networks. △ Less

Submitted 20 February, 2024; originally announced February 2024.

Comments: Preprint. Under review. 17 pages, 4 figures, 14 tables. arXiv admin note: text overlap with arXiv:2306.05108

arXiv:2402.06160 [pdf, other]

Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?

Authors: Maohao Shen, J. Jon Ryu, Soumya Ghosh, Yuheng Bu, Prasanna Sattigeri, Subhro Das, Gregory W. Wornell

Abstract: This paper questions the effectiveness of a modern predictive uncertainty quantification approach, called \emph{evidential deep learning} (EDL), in which a single neural network model is trained to learn a meta distribution over the predictive distribution by minimizing a specific objective function. Despite their perceived strong empirical performance on downstream tasks, a line of recent studies… ▽ More This paper questions the effectiveness of a modern predictive uncertainty quantification approach, called \emph{evidential deep learning} (EDL), in which a single neural network model is trained to learn a meta distribution over the predictive distribution by minimizing a specific objective function. Despite their perceived strong empirical performance on downstream tasks, a line of recent studies by Bengs et al. identify limitations of the existing methods to conclude their learned epistemic uncertainties are unreliable, e.g., in that they are non-vanishing even with infinite data. Building on and sharpening such analysis, we 1) provide a sharper understanding of the asymptotic behavior of a wide class of EDL methods by unifying various objective functions; 2) reveal that the EDL methods can be better interpreted as an out-of-distribution detection algorithm based on energy-based-models; and 3) conduct extensive ablation studies to better assess their empirical effectiveness with real-world datasets. Through all these analyses, we conclude that even when EDL methods are empirically effective on downstream tasks, this occurs despite their poor uncertainty quantification capabilities. Our investigation suggests that incorporating model uncertainty can help EDL methods faithfully quantify uncertainties and further improve performance on representative downstream tasks, albeit at the cost of additional computational complexity. △ Less

Submitted 12 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: 29 pages, 12 figures

arXiv:2402.06094 [pdf, other]

Rethinking Data Selection for Supervised Fine-Tuning

Authors: Ming Shen

Abstract: Although supervised finetuning (SFT) has emerged as an essential technique to align large language models with humans, it is considered superficial, with style learning being its nature. At the same time, recent works indicate the importance of data selection for SFT, showing that finetuning with high-quality and diverse subsets of the original dataset leads to superior downstream performance. In… ▽ More Although supervised finetuning (SFT) has emerged as an essential technique to align large language models with humans, it is considered superficial, with style learning being its nature. At the same time, recent works indicate the importance of data selection for SFT, showing that finetuning with high-quality and diverse subsets of the original dataset leads to superior downstream performance. In this work, we rethink the intuition behind data selection for SFT. Considering SFT is superficial, we propose that essential demonstrations for SFT should focus on reflecting human-like interactions instead of data quality or diversity. However, it is not straightforward to directly assess to what extent a demonstration reflects human styles. Towards an initial attempt in this direction, we find selecting instances with long responses is surprisingly more effective for SFT than utilizing full datasets or instances selected based on quality and diversity. We hypothesize that such a simple heuristic implicitly mimics a crucial aspect of human-style conversation: detailed responses are usually more helpful. △ Less

Submitted 8 February, 2024; originally announced February 2024.

arXiv:2402.02381 [pdf, other]

Empowering Computing and Networks Convergence System with Distributed Cooperative Routing

Authors: Yujiao Hu, Qingmin Jia, Meng Shen, Renchao Xie, Tao Huang, F. Richard Yu

Abstract: The emergence of intelligent applications and recent advances in the fields of computing and networks are driving the development of computing and networks convergence (CNC) system. However, existing researches failed to achieve comprehensive scheduling optimization of computing and network resources. This shortfall results in some requirements of computing requests unable to be guaranteed in an e… ▽ More The emergence of intelligent applications and recent advances in the fields of computing and networks are driving the development of computing and networks convergence (CNC) system. However, existing researches failed to achieve comprehensive scheduling optimization of computing and network resources. This shortfall results in some requirements of computing requests unable to be guaranteed in an end-to-end service pattern, negatively impacting the development of CNC systems. In this article, we propose a distributed cooperative routing framework for the CNC system to ensure the deadline requirements and minimize the computation cost of requests. The framework includes trading plane, management plane, control plane and forwarding plane. The cross-plane cooperative end-to-end routing schemes consider both computation efficiency of heterogeneous servers and the network congestion degrees while making routing plan, thereby determining where to execute requests and corresponding routing paths. Simulations results substantiates the performance of our routing schemes in scheduling computing requests in the CNC system. △ Less

Submitted 4 February, 2024; originally announced February 2024.

Comments: Submit to IEEE Network

arXiv:2402.00210 [pdf, other]

Correlation of Coronal Mass Ejection Shock Temperature with Solar Energetic Particle Intensity

Authors: Manuel Enrique Cuesta, D. J. McComas, L. Y. Khoo, R. Bandyopadhyay, T. Sharma, M. M. Shen, J. S. Rankin, A. T. Cummings, J. R. Szalay, C. M. S. Cohen, N. A. Schwadron, R. Chhiber, F. Pecora, W. H. Matthaeus, R. A. Leske, M. L. Stevens

Abstract: Solar energetic particle (SEP) events have been observed by the Parker Solar Probe (PSP) spacecraft since its launch in 2018. These events include sources from solar flares and coronal mass ejections (CMEs). Onboard PSP is the IS$\odot$IS instrument suite measuring ions over energies from ~ 20 keV/nucleon to 200 MeV/nucleon and electrons from ~ 20 keV to 6 MeV. Previous studies sought to group C… ▽ More Solar energetic particle (SEP) events have been observed by the Parker Solar Probe (PSP) spacecraft since its launch in 2018. These events include sources from solar flares and coronal mass ejections (CMEs). Onboard PSP is the IS$\odot$IS instrument suite measuring ions over energies from ~ 20 keV/nucleon to 200 MeV/nucleon and electrons from ~ 20 keV to 6 MeV. Previous studies sought to group CME characteristics based on their plasma conditions and arrived at general descriptions with large statistical errors, leaving open questions on how to properly group CMEs based solely on their plasma conditions. To help resolve these open questions, plasma properties of CMEs have been examined in relation to SEPs. Here we reexamine one plasma property, the solar wind proton temperature, and compare it to the proton SEP intensity in a region immediately downstream of a CME-driven shock for seven CMEs observed at radial distances within 1 au. We find a statistically strong correlation between proton SEP intensity and bulk proton temperature, indicating a clear relationship between SEPs and the conditions in the solar wind. Furthermore, we propose that an indirect coupling of SEP intensity to the level of turbulence and the amount of energy dissipation that results is mainly responsible for the observed correlation between SEP intensity and proton temperature. These results are key to understanding the interaction of SEPs with the bulk solar wind in CME-driven shocks and will improve our ability to model the interplay of shock evolution and particle acceleration. △ Less

Submitted 31 January, 2024; originally announced February 2024.

Comments: 12 pages, 4 figures, and 2 tables

arXiv:2401.15984 [pdf]

Choroidal thinning assessment through facial video analysis

Authors: Qinghua He, Yi Zhang, Mengxi Shen, Giovanni Gregori, Philip J. Rosenfeld, Ruikang K. Wang

Abstract: Different features of skin are associated with various medical conditions and provide opportunities to evaluate and monitor body health. This study created a strategy to assess choroidal thinning through the video analysis of facial skin. Videos capturing the entire facial skin were collected from 48 participants with age-related macular degeneration (AMD) and 12 healthy individuals. These facial… ▽ More Different features of skin are associated with various medical conditions and provide opportunities to evaluate and monitor body health. This study created a strategy to assess choroidal thinning through the video analysis of facial skin. Videos capturing the entire facial skin were collected from 48 participants with age-related macular degeneration (AMD) and 12 healthy individuals. These facial videos were analyzed using video-based trans-angiosomes imaging photoplethysmography (TaiPPG) to generate facial imaging biomarkers that were correlated with choroidal thickness (CT) measurements. The CT of all patients was determined using swept-source optical coherence tomography (SS-OCT). The results revealed the relationship between relative blood pulsation amplitude (BPA) in three typical facial angiosomes (cheek, side-forehead and mid-forehead) and the average macular CT (r = 0.48, p < 0.001; r = -0.56, p < 0.001; r = -0.40, p < 0.01). When considering a diagnostic threshold of 200μm, the newly developed facial video analysis tool effectively distinguished between cases of choroidal thinning and normal cases, yielding areas under the curve of 0.75, 0.79 and 0.69. These findings shed light on the connection between choroidal blood flow and facial skin hemodynamics, which suggests the potential for predicting vascular diseases through widely accessible skin imaging data. △ Less

Submitted 29 January, 2024; originally announced January 2024.

Comments: 8 pages, 4 figures

arXiv:2401.07129 [pdf, ps, other]

doi 10.1063/5.0197304

Synthesis of thin film infinite-layer nickelates by atomic hydrogen reduction: clarifying the role of the cap** layer

Authors: Christopher T. Parzyck, Vivek Anil, Yi Wu, Berit H. Goodge, Matthew Roddy, Lena F. Kourkoutis, Darrell G. Schlom, Kyle M. Shen

Abstract: We present an integrated procedure for the synthesis of infinite-layer nickelates using molecular-beam epitaxy with gas-phase reduction by atomic hydrogen. We first discuss challenges in the growth and characterization of perovskite NdNiO$_3$/SrTiO$_3$, arising from post growth crack formation in stoichiometric films. We then detail a procedure for fully reducing NdNiO$_3$ films to the infinite-la… ▽ More We present an integrated procedure for the synthesis of infinite-layer nickelates using molecular-beam epitaxy with gas-phase reduction by atomic hydrogen. We first discuss challenges in the growth and characterization of perovskite NdNiO$_3$/SrTiO$_3$, arising from post growth crack formation in stoichiometric films. We then detail a procedure for fully reducing NdNiO$_3$ films to the infinite-layer phase, NdNiO$_2$, using atomic hydrogen; the resulting films display excellent structural quality, smooth surfaces, and lower residual resistivities than films reduced by other methods. We utilize the in situ nature of this technique to investigate of the role that SrTiO$_3$ cap** layers play in the reduction process, illustrating their importance in preventing the formation of secondary phases at the exposed nickelate surface. A comparative bulk- and surface-sensitive study indicates formation of a polycrystalline crust on the film surface serves to limit the reduction process. △ Less

Submitted 13 January, 2024; originally announced January 2024.

Comments: Main text: 12 pages, 7 figures. Supplemental Materials: 11 pages, 11 figures

arXiv:2401.07045 [pdf, other]

Measurement of Solar $pp$ Neutrino Flux using Electron Recoil Data from PandaX-4T Commissioning Run

Authors: PandaX Collaboration, Xiaoying Lu, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Lisheng Geng, Karl Giboni, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Junting Huang, Zhou Huang, Ruquan Hou, Yu Hou, Xiangdong Ji , et al. (67 additional authors not shown)

Abstract: The proton-proton ($pp$) fusion chain dominates the neutrino production from the Sun. The uncertainty of the predicted $pp$ neutrino flux is at the sub-percent level, whereas that of the best measurement is $\mathcal{O}(10\%)$. In this paper, we present the first result to measure the solar $pp$ neutrinos in the electron recoil energy range from 24 to 144 keV, using the PandaX-4T commissioning dat… ▽ More The proton-proton ($pp$) fusion chain dominates the neutrino production from the Sun. The uncertainty of the predicted $pp$ neutrino flux is at the sub-percent level, whereas that of the best measurement is $\mathcal{O}(10\%)$. In this paper, we present the first result to measure the solar $pp$ neutrinos in the electron recoil energy range from 24 to 144 keV, using the PandaX-4T commissioning data with 0.63 tonne$\times$year exposure. The $pp$ neutrino flux is determined to be $(8.0 \pm 3.9 \,{\rm{(stat)}} \pm 10.0 \,{\rm{(syst)}} )\times 10^{10}\, $$\rm{s}^{-1} \rm{cm}^{-2}$, consistent with Standard Solar Model and existing measurements, corresponding to a flux upper limit of $23.3\times 10^{10}\, $$\rm{s}^{-1} \rm{cm}^{-2}$ at 90\% C.L.. △ Less

Submitted 2 July, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

Comments: 6 pages, 5 figures

arXiv:2401.05746 [pdf, other]

Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection

Authors: Heqing Zou, Meng Shen, Yuchen Hu, Chen Chen, Eng Siong Chng, Deepu Rajan

Abstract: Audio-visual deepfake detection scrutinizes manipulations in public video using complementary multimodal cues. Current methods, which train on fused multimodal data for multimodal targets face challenges due to uncertainties and inconsistencies in learned representations caused by independent modality manipulations in deepfake videos. To address this, we propose cross-modality and within-modality… ▽ More Audio-visual deepfake detection scrutinizes manipulations in public video using complementary multimodal cues. Current methods, which train on fused multimodal data for multimodal targets face challenges due to uncertainties and inconsistencies in learned representations caused by independent modality manipulations in deepfake videos. To address this, we propose cross-modality and within-modality regularization to preserve modality distinctions during multimodal representation learning. Our approach includes an audio-visual transformer module for modality correspondence and a cross-modality regularization module to align paired audio-visual signals, preserving modality distinctions. Simultaneously, a within-modality regularization module refines unimodal representations with modality-specific targets to retain modal-specific details. Experimental results on the public audio-visual dataset, FakeAVCeleb, demonstrate the effectiveness and competitiveness of our approach. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: Accepted by ICASSP 2024

arXiv:2312.16240 [pdf, other]

Merging Vision Transformers from Different Tasks and Domains

Authors: Peng Ye, Chenyu Huang, Mingzhu Shen, Tao Chen, Yongqi Huang, Yuning Zhang, Wanli Ouyang

Abstract: This work targets to merge various Vision Transformers (ViTs) trained on different tasks (i.e., datasets with different object categories) or domains (i.e., datasets with the same categories but different environments) into one unified model, yielding still good performance on each task or domain. Previous model merging works focus on either CNNs or NLP models, leaving the ViTs merging research un… ▽ More This work targets to merge various Vision Transformers (ViTs) trained on different tasks (i.e., datasets with different object categories) or domains (i.e., datasets with the same categories but different environments) into one unified model, yielding still good performance on each task or domain. Previous model merging works focus on either CNNs or NLP models, leaving the ViTs merging research untouched. To fill this gap, we first explore and find that existing model merging methods cannot well handle the merging of the whole ViT models and still have improvement space. To enable the merging of the whole ViT, we propose a simple-but-effective gating network that can both merge all kinds of layers (e.g., Embedding, Norm, Attention, and MLP) and select the suitable classifier. Specifically, the gating network is trained by unlabeled datasets from all the tasks (domains), and predicts the probability of which task (domain) the input belongs to for merging the models during inference. To further boost the performance of the merged model, especially when the difficulty of merging tasks increases, we design a novel metric of model weight similarity, and utilize it to realize controllable and combined weight merging. Comprehensive experiments on kinds of newly established benchmarks, validate the superiority of the proposed ViT merging framework for different tasks and domains. Our method can even merge beyond 10 ViT models from different vision tasks with a negligible effect on the performance of each task. △ Less

Submitted 25 December, 2023; originally announced December 2023.

arXiv:2312.14249 [pdf, other]

GenoCraft: A Comprehensive, User-Friendly Web-Based Platform for High-Throughput Omics Data Analysis and Visualization

Authors: Yingzhou Lu, Minjie Shen, Yue Zhao, Chenhao Li, Fan Meng, Xiao Wang, David Herrington, Yue Wang, Tim Fu, Capucine Van Rechem

Abstract: The surge in high-throughput omics data has reshaped the landscape of biological research, underlining the need for powerful, user-friendly data analysis and interpretation tools. This paper presents GenoCraft, a web-based comprehensive software solution designed to handle the entire pipeline of omics data processing. GenoCraft offers a unified platform featuring advanced bioinformatics tools, cov… ▽ More The surge in high-throughput omics data has reshaped the landscape of biological research, underlining the need for powerful, user-friendly data analysis and interpretation tools. This paper presents GenoCraft, a web-based comprehensive software solution designed to handle the entire pipeline of omics data processing. GenoCraft offers a unified platform featuring advanced bioinformatics tools, covering all aspects of omics data analysis. It encompasses a range of functionalities, such as normalization, quality control, differential analysis, network analysis, pathway analysis, and diverse visualization techniques. This software makes state-of-the-art omics data analysis more accessible to a wider range of users. With GenoCraft, researchers and data scientists have access to an array of cutting-edge bioinformatics tools under a user-friendly interface, making it a valuable resource for managing and analyzing large-scale omics data. The API with an interactive web interface is publicly available at https://genocraft.stanford. edu/. We also release all the codes in https://github.com/futianfan/GenoCraft. △ Less

Submitted 21 December, 2023; originally announced December 2023.

arXiv:2312.13502 [pdf, other]

Energy Relaxation and dynamics in the correlated metal Sr$_2$RuO$_4$ via THz two-dimensional coherent spectroscopy

Authors: David Barbalas, Ralph Romero III, Dipanjan Chaudhuri, Fahad Mahmood, Hari P. Nair, Nathaniel J. Schreiber, Darrel G. Schlom, K. M. Shen, N. P. Armitage

Abstract: Separating out the contributions of different scattering channels in strongly interacting metals is crucial in identifying the mechanisms that govern their properties. While momentum or current relaxation rates can be readily probed via \textit{dc} resistivity or optical/THz spectroscopy, distinguishing different kinds of inelastic scattering can be more challenging. Using nonlinear THz 2D coheren… ▽ More Separating out the contributions of different scattering channels in strongly interacting metals is crucial in identifying the mechanisms that govern their properties. While momentum or current relaxation rates can be readily probed via \textit{dc} resistivity or optical/THz spectroscopy, distinguishing different kinds of inelastic scattering can be more challenging. Using nonlinear THz 2D coherent spectroscopy, we measure the rates of energy relaxation after THz excitation in the strongly interacting Fermi liquid, Sr$_2$RuO$_4$. Energy relaxation is a bound on the total scattering and specifically a measure of contributions to the electron self-energy that arise from {\it inelastic} coupling to a bath. We observe two distinct energy relaxation channels: a fast process that we interpret as energy loss to the phonon system and a much slower relaxation that we interpret as arising from a non-equilibrium phonon effects and subsequent heat loss through diffusion. Interestingly, even the faster energy relaxation rate is at least an order of magnitude slower than the overall momentum relaxation rate, consistent with strong electron interactions and the dominance of energy-conserving umklapp or interband electron-electron scattering in momentum relaxation. The slowest energy relaxation rate decays on a sub-GHz scale, consistent with the relaxation dynamics of non-equilibrium phonons. Our observations reveal the versatility of nonlinear THz spectroscopy to measure the energy relaxation dynamics in correlated metals. Our work also highlights the need for improved theoretical understanding of such processes in interacting metals. △ Less

Submitted 20 December, 2023; originally announced December 2023.

Comments: 6 pages, 3 figures. SM included

arXiv:2312.07871 [pdf, other]

MLNet: Mutual Learning Network with Neighborhood Invariance for Universal Domain Adaptation

Authors: Yanzuo Lu, Meng Shen, Andy J Ma, Xiaohua Xie, Jian-Huang Lai

Abstract: Universal domain adaptation (UniDA) is a practical but challenging problem, in which information about the relation between the source and the target domains is not given for knowledge transfer. Existing UniDA methods may suffer from the problems of overlooking intra-domain variations in the target domain and difficulty in separating between the similar known and unknown class. To address these is… ▽ More Universal domain adaptation (UniDA) is a practical but challenging problem, in which information about the relation between the source and the target domains is not given for knowledge transfer. Existing UniDA methods may suffer from the problems of overlooking intra-domain variations in the target domain and difficulty in separating between the similar known and unknown class. To address these issues, we propose a novel Mutual Learning Network (MLNet) with neighborhood invariance for UniDA. In our method, confidence-guided invariant feature learning with self-adaptive neighbor selection is designed to reduce the intra-domain variations for more generalizable feature representation. By using the cross-domain mixup scheme for better unknown-class identification, the proposed method compensates for the misidentified known-class errors by mutual learning between the closed-set and open-set classifiers. Extensive experiments on three publicly available benchmarks demonstrate that our method achieves the best results compared to the state-of-the-arts in most cases and significantly outperforms the baseline across all the four settings in UniDA. Code is available at https://github.com/YanzuoLu/MLNet. △ Less

Submitted 27 February, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

Comments: Accepted by AAAI 2024 (Poster)

arXiv:2311.16553 [pdf, other]

doi 10.1038/s41467-024-49714-y

Magnon interactions in a moderately correlated Mott insulator

Authors: Qisi Wang, S. Mustafi, E. Fogh, N. Astrakhantsev, Z. He, I. Biało, Ying Chan, L. Martinelli, M. Horio, O. Ivashko, N. E. Shaik, K. von Arx, Y. Sassa, E. Paris, M. H. Fischer, Y. Tseng, N. B. Christensen, A. Galdi, D. G. Schlom, K. M. Shen, T. Schmitt, H. M. Rønnow, J. Chang

Abstract: Quantum fluctuations in low-dimensional systems and near quantum phase transitions have significant influences on material properties. Yet, it is difficult to experimentally gauge the strength and importance of quantum fluctuations. Here we provide a resonant inelastic x-ray scattering study of magnon excitations in Mott insulating cuprates. From the thin film of SrCuO$_2$, single- and bi-magnon d… ▽ More Quantum fluctuations in low-dimensional systems and near quantum phase transitions have significant influences on material properties. Yet, it is difficult to experimentally gauge the strength and importance of quantum fluctuations. Here we provide a resonant inelastic x-ray scattering study of magnon excitations in Mott insulating cuprates. From the thin film of SrCuO$_2$, single- and bi-magnon dispersions are derived. Using an effective Heisenberg Hamiltonian generated from the Hubbard model, we show that the single-magnon dispersion is only described satisfactorily when including significant quantum corrections stemming from magnon-magnon interactions. Comparative results on La$_2$CuO$_4$ indicate that quantum fluctuations are much stronger in SrCuO$_2$ suggesting closer proximity to a magnetic quantum critical point. Monte Carlo calculations reveal that other magnetic orders may compete with the antiferromagnetic Néel order as the ground state. Our results indicate that SrCuO$_2$ - due to strong quantum fluctuations - is a unique starting point for the exploration of novel magnetic ground states. △ Less

Submitted 26 June, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

Journal ref: Nature Communications 15, 5348 (2024)

arXiv:2311.11842 [pdf]

Spontaneous supercrystal formation during a strain-engineered metal-insulator transition

Authors: O. Yu. Gorobtsov, L. Miao, Z. Shao, Y. Tan, N. I. Schnitzer, B. H. Goodge, J. Ruf, D. Weinstock, M. Cherukara, M. V. Holt, H. Nair, L. -Q. Chen, L. F. Kourkoutis, D. G. Schlom, K. M. Shen, A. Singer

Abstract: Mott metal-insulator transitions possess electronic, magnetic, and structural degrees of freedom promising next generation energy-efficient electronics. We report a previously unknown, hierarchically ordered state during a Mott transition and demonstrate correlated switching of functional electronic properties. We elucidate in-situ formation of an intrinsic supercrystal in a Ca2RuO4 thin film. Mac… ▽ More Mott metal-insulator transitions possess electronic, magnetic, and structural degrees of freedom promising next generation energy-efficient electronics. We report a previously unknown, hierarchically ordered state during a Mott transition and demonstrate correlated switching of functional electronic properties. We elucidate in-situ formation of an intrinsic supercrystal in a Ca2RuO4 thin film. Machine learning-assisted X-ray nanodiffraction together with electron microscopy reveal multi-scale periodic domain formation at and below the film transition temperature (TFilm ~ 200-250 K) and a separate anisotropic spatial structure at and above TFilm. Local resistivity measurements imply an intrinsic coupling of the supercrystal orientation to the material's anisotropic conductivity. Our findings add an additional degree of complexity to the physical understanding of Mott transitions, opening opportunities for designing materials with tunable electronic properties. △ Less

Submitted 20 November, 2023; originally announced November 2023.

arXiv:2311.02303 [pdf, other]

MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning

Authors: Bingchang Liu, Chaoyu Chen, Cong Liao, Zi Gong, Huan Wang, Zhichao Lei, Ming Liang, Dajun Chen, Min Shen, Hailian Zhou, Hang Yu, Jianguo Li

Abstract: Code LLMs have emerged as a specialized research field, with remarkable studies dedicated to enhancing model's coding capabilities through fine-tuning on pre-trained models. Previous fine-tuning approaches were typically tailored to specific downstream tasks or scenarios, which meant separate fine-tuning for each task, requiring extensive training resources and posing challenges in terms of deploy… ▽ More Code LLMs have emerged as a specialized research field, with remarkable studies dedicated to enhancing model's coding capabilities through fine-tuning on pre-trained models. Previous fine-tuning approaches were typically tailored to specific downstream tasks or scenarios, which meant separate fine-tuning for each task, requiring extensive training resources and posing challenges in terms of deployment and maintenance. Furthermore, these approaches failed to leverage the inherent interconnectedness among different code-related tasks. To overcome these limitations, we present a multi-task fine-tuning framework, MFTcoder, that enables simultaneous and parallel fine-tuning on multiple tasks. By incorporating various loss functions, we effectively address common challenges in multi-task learning, such as data imbalance, varying difficulty levels, and inconsistent convergence speeds. Extensive experiments have conclusively demonstrated that our multi-task fine-tuning approach outperforms both individual fine-tuning on single tasks and fine-tuning on a mixed ensemble of tasks. Moreover, MFTcoder offers efficient training capabilities, including efficient data tokenization modes and PEFT fine-tuning, resulting in significantly improved speed compared to traditional fine-tuning methods. MFTcoder seamlessly integrates with several mainstream open-source LLMs, such as CodeLLama and Qwen. Leveraging the CodeLLama foundation, our MFTcoder fine-tuned model, \textsc{CodeFuse-CodeLLama-34B}, achieves an impressive pass@1 score of 74.4\% on the HumaneEval benchmark, surpassing GPT-4 performance (67\%, zero-shot). MFTCoder is open-sourced at \url{https://github.com/codefuse-ai/MFTCOder} △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2310.16300 [pdf, other]

Snapshot: Fast, Userspace Crash Consistency for CXL and PM Using msync

Authors: Suyash Mahar, Mingyao Shen, Terence Kelly, Steven Swanson

Abstract: Crash consistency using persistent memory programming libraries requires programmers to use complex transactions and manual annotations. In contrast, the failure-atomic msync() (FAMS) interface is much simpler as it transparently tracks updates and guarantees that modified data is atomically durable on a call to the failure-atomic variant of msync(). However, FAMS suffers from several drawbacks, l… ▽ More Crash consistency using persistent memory programming libraries requires programmers to use complex transactions and manual annotations. In contrast, the failure-atomic msync() (FAMS) interface is much simpler as it transparently tracks updates and guarantees that modified data is atomically durable on a call to the failure-atomic variant of msync(). However, FAMS suffers from several drawbacks, like the overhead of msync() and the write amplification from page-level dirty data tracking. To address these drawbacks while preserving the advantages of FAMS, we propose Snapshot, an efficient userspace implementation of FAMS. Snapshot uses compiler-based annotation to transparently track updates in userspace and syncs them with the backing byte-addressable storage copy on a call to msync(). By kee** a copy of application data in DRAM, Snapshot improves access latency. Moreover, with automatic tracking and syncing changes only on a call to msync(), Snapshot provides crash-consistency guarantees, unlike the POSIX msync() system call. For a KV-Store backed by Intel Optane running the YCSB benchmark, Snapshot achieves at least 1.2$\times$ speedup over PMDK while significantly outperforming conventional (non-crash-consistent) msync(). On an emulated CXL memory semantic SSD, Snapshot outperforms PMDK by up to 10.9$\times$ on all but one YCSB workload, where PMDK is 1.2$\times$ faster than Snapshot. Further, Kyoto Cabinet commits perform up to 8.0$\times$ faster with Snapshot than its built-in, msync()-based crash-consistency mechanism. △ Less

Submitted 24 October, 2023; originally announced October 2023.

Comments: A shorter version of this paper appeared in the Proceedings of ICCD 2023

arXiv:2310.14112 [pdf, other]

Continuous Entanglement Distribution from an AlGaAs-on-Insulator Microcomb for Quantum Communications

Authors: Trevor J. Steiner, Maximilian Shen, Joshua E. Castro, John E. Bowers, Galan Moody

Abstract: Using an aluminum gallium arsenide microring resonator, we demonstrate a bright quantum optical microcomb with $>300$ nm bandwidth and more than 20 sets of time-energy entangled modes, enabling spectral demultiplexing with simple, off-the-shelf commercial telecom components. We report high-rate continuous entanglement distribution for two sets of entangled-photon pair frequency modes exhibiting up… ▽ More Using an aluminum gallium arsenide microring resonator, we demonstrate a bright quantum optical microcomb with $>300$ nm bandwidth and more than 20 sets of time-energy entangled modes, enabling spectral demultiplexing with simple, off-the-shelf commercial telecom components. We report high-rate continuous entanglement distribution for two sets of entangled-photon pair frequency modes exhibiting up to $20$ GHz/mW$^2$ pair generation rate. As an illustrative example of entanglement distribution, we perform a continuous-wave time-bin quantum key distribution protocol with 8 kbps raw key rates while maintaining less than 10$\%$ error rate and sufficient two-photon visibility to ensure security of the channel. When the $>$20 frequency modes are multiplexed, we estimate $>$100 kbps entanglement-based key rates or the creation of a multi-user quantum communications network. The entire system requires less than 110 $μ$W of on-chip optical power, demonstrating an efficient source of entangled frequency modes for quantum communications. As a proof of principle, a quantum key is distributed across 12 km of deployed fiber on the UCSB campus and used to transmit a 21 kB image with $<9\%$ error. △ Less

Submitted 21 October, 2023; originally announced October 2023.

Comments: 14 pages, 8 figures

arXiv:2310.07183 [pdf, other]

SAM-OCTA: Prompting Segment-Anything for OCTA Image Segmentation

Authors: Xinrun Chen, Chengliang Wang, Haojian Ning, Shiying Li, Mei Shen

Abstract: Segmenting specific targets or biomarkers is necessary to analyze optical coherence tomography angiography (OCTA) images. Previous methods typically segment all the targets in an OCTA sample, such as retinal vessels (RVs). Although these methods perform well in accuracy and precision, OCTA analyses often focusing local information within the images which has not been fulfilled. In this paper, we p… ▽ More Segmenting specific targets or biomarkers is necessary to analyze optical coherence tomography angiography (OCTA) images. Previous methods typically segment all the targets in an OCTA sample, such as retinal vessels (RVs). Although these methods perform well in accuracy and precision, OCTA analyses often focusing local information within the images which has not been fulfilled. In this paper, we propose a method called SAM-OCTA for local segmentation in OCTA images. The method fine-tunes a pre-trained segment anything model (SAM) using low-rank adaptation (LoRA) and utilizes prompt points for local RVs, arteries, and veins segmentation in OCTA. To explore the effect and mechanism of prompt points, we set up global and local segmentation modes with two prompt point generation strategies, namely random selection and special annotation. Considering practical usage, we conducted extended experiments with different model scales and analyzed the model performance before and after fine-tuning besides the general segmentation task. From comprehensive experimental results with the OCTA-500 dataset, our SAM-OCTA method has achieved state-of-the-art performance in common OCTA segmentation tasks related to RV and FAZ, and it also performs accurate segmentation of artery-vein and local vessels. The code is available at https://github.com/ShellRedia/SAM-OCTA-extend. △ Less

Submitted 20 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

Comments: arXiv admin note: text overlap with arXiv:2309.11758

arXiv:2310.06266 [pdf, other]

doi 10.1145/3639477.3639719

CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model

Authors: Peng Di, Jianguo Li, Hang Yu, Wei Jiang, Wenting Cai, Yang Cao, Chaoyu Chen, Dajun Chen, Hongwei Chen, Liang Chen, Gang Fan, Jie Gong, Zi Gong, Wen Hu, Tingting Guo, Zhichao Lei, Ting Li, Zheng Li, Ming Liang, Cong Liao, Bingchang Liu, Jiachen Liu, Zhiwei Liu, Shaojun Lu, Min Shen , et al. (13 additional authors not shown)

Abstract: Code Large Language Models (Code LLMs) have gained significant attention in the industry due to their wide applications in the full lifecycle of software engineering. However, the effectiveness of existing models in understanding non-English inputs for multi-lingual code-related tasks is still far from well studied. This paper introduces CodeFuse-13B, an open-sourced pre-trained code LLM. It is sp… ▽ More Code Large Language Models (Code LLMs) have gained significant attention in the industry due to their wide applications in the full lifecycle of software engineering. However, the effectiveness of existing models in understanding non-English inputs for multi-lingual code-related tasks is still far from well studied. This paper introduces CodeFuse-13B, an open-sourced pre-trained code LLM. It is specifically designed for code-related tasks with both English and Chinese prompts and supports over 40 programming languages. CodeFuse achieves its effectiveness by utilizing a high quality pre-training dataset that is carefully filtered by program analyzers and optimized during the training process. Extensive experiments are conducted using real-world usage scenarios, the industry-standard benchmark HumanEval-x, and the specially designed CodeFuseEval for Chinese prompts. To assess the effectiveness of CodeFuse, we actively collected valuable human feedback from the AntGroup's software development process where CodeFuse has been successfully deployed. The results demonstrate that CodeFuse-13B achieves a HumanEval pass@1 score of 37.10%, positioning it as one of the top multi-lingual code LLMs with similar parameter sizes. In practical scenarios, such as code generation, code translation, code comments, and testcase generation, CodeFuse performs better than other models when confronted with Chinese prompts. △ Less

Submitted 10 January, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

Comments: Accepted by ICSE-SEIP 2024

Showing 1–50 of 381 results for author: shen, M