Skip to main content

Showing 1–50 of 1,451 results for author: Yang, G

.
  1. arXiv:2407.01512  [pdf, other

    cs.RO cs.HC cs.LG

    Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

    Authors: Xuxin Cheng, Jialong Li, Shiqi Yang, Ge Yang, Xiaolong Wang

    Abstract: Teleoperation serves as a powerful method for collecting on-robot data essential for robot learning from demonstrations. The intuitiveness and ease of use of the teleoperation system are crucial for ensuring high-quality, diverse, and scalable data. To achieve this, we propose an immersive teleoperation system Open-TeleVision that allows operators to actively perceive the robot's surroundings in a… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Website: https://robot-tv.github.io/

  2. arXiv:2407.01281  [pdf, other

    cs.LG cs.AI math.FA

    Bridging Smoothness and Approximation: Theoretical Insights into Over-Smoothing in Graph Neural Networks

    Authors: Guangrui Yang, Jianfei Li, Ming Li, Han Feng, Ding-Xuan Zhou

    Abstract: In this paper, we explore the approximation theory of functions defined on graphs. Our study builds upon the approximation results derived from the $K$-functional. We establish a theoretical framework to assess the lower bounds of approximation for target functions using Graph Convolutional Networks (GCNs) and examine the over-smoothing phenomenon commonly observed in these networks. Initially, we… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.19939  [pdf, other

    physics.flu-dyn

    Data-driven methods for flow and transport in porous media: a review

    Authors: Guang Yang, Ran Xu, Yusong Tian, Songyuan Guo, **gyi Wu, Xu Chu

    Abstract: This review examined the current advancements in data-driven methods for analyzing flow and transport in porous media, which has various applications in energy, chemical engineering, environmental science, and beyond. Although there has been progress in recent years, the challenges of current experimental and high-fidelity numerical simulations, such as high computational costs and difficulties in… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  4. arXiv:2406.19043  [pdf

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Ya**g Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, **g Qin, Xiahai Zhuang, Claudia Prieto , et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  5. arXiv:2406.18605  [pdf, other

    physics.ins-det nucl-ex

    The neutron array of the compact spectrometer for heavy ion experiments in Fermi energy region

    Authors: Dawei Si, Sheng Xiao, Yuhao Qin, Yijie Wang, Junhuai Xu, Baiting Tian, Boyuan Zhang, Dong Guo, Qin Zhi, Xiaobao Wei, Yibo Hao, Zengxiang Wang, Tianren Zhuo, Yuansheng Yang, Xianglun Wei, Herun Yang, Peng Ma, Limin Duan, Fangfang Duan, Junbing Ma, Shiwei Xu, Zhen Bai, Guo Yang, Yanyun Yang, Zhigang Xiao

    Abstract: The emission of neutrons from heavy ion reactions is an important observable for studying the asymmetric nuclear equation of state and the reaction dynamics. A 20-unit neutron array has been developed and mounted on the compact spectrometer for heavy ion experiments (CSHINE) to measure the neutron spectra, neutron-neutron and neutron-proton correlation functions. Each unit consists of a… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 8 pages, 11 figures

  6. arXiv:2406.18576  [pdf, other

    cs.CV cs.AI

    Negative Prototypes Guided Contrastive Learning for WSOD

    Authors: Yu Zhang, Chuang Zhu, Guoqing Yang, Siqi Chen

    Abstract: Weakly Supervised Object Detection (WSOD) with only image-level annotation has recently attracted wide attention. Many existing methods ignore the inter-image relationship of instances which share similar characteristics while can certainly be determined not to belong to the same category. Therefore, in order to make full use of the weak label, we propose the Negative Prototypes Guided Contrastive… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  7. arXiv:2406.18552  [pdf, other

    cs.CV cs.AI

    Decoding Decision Reasoning: A Counterfactual-Powered Model for Knowledge Discovery

    Authors: Yingying Fang, Zihao **, Xiaodan Xing, Simon Walsh, Guang Yang

    Abstract: In medical imaging, particularly in early disease detection and prognosis tasks, discerning the rationale behind an AI model's predictions is crucial for evaluating the reliability of its decisions. Conventional explanation methods face challenges in identifying discernible decisive features in medical image classifications, where discriminative features are subtle or not immediately apparent. To… ▽ More

    Submitted 23 May, 2024; originally announced June 2024.

  8. arXiv:2406.17962  [pdf, other

    cs.CL

    SimsChat: A Customisable Persona-Driven Role-Playing Agent

    Authors: Bohao Yang, Dong Liu, Chen Tang, Chenghao Xiao, Kun Zhao, Chao Li, Lin Yuan, Guang Yang, Lanxiao Huang, Chenghua Lin

    Abstract: Large Language Models (LLMs) possess the remarkable capability to understand human instructions and generate high-quality text, enabling them to act as agents that simulate human behaviours. This capability allows LLMs to emulate human beings in a more advanced manner, beyond merely replicating simple human behaviours. However, there is a lack of exploring into leveraging LLMs to craft characters… ▽ More

    Submitted 30 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  9. arXiv:2406.17763  [pdf, other

    cs.LG cs.AI cs.CV math.NA

    DiffusionPDE: Generative PDE-Solving Under Partial Observation

    Authors: Jiahe Huang, Guandao Yang, Zichen Wang, Jeong Joon Park

    Abstract: We introduce a general framework for solving partial differential equations (PDEs) using generative diffusion models. In particular, we focus on the scenarios where we do not have the full knowledge of the scene necessary to apply classical solvers. Most existing forward or inverse PDE approaches perform poorly when the observations on the data or the underlying coefficients are incomplete, which… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Project page: https://jhhuangchloe.github.io/Diffusion-PDE/

  10. arXiv:2406.17173  [pdf, other

    eess.IV cs.CV cs.LG

    Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks

    Authors: Zihao **, Yingying Fang, Jiahao Huang, Caiwen Xu, Simon Walsh, Guang Yang

    Abstract: The manifestation of symptoms associated with lung diseases can vary in different depths for individual patients, highlighting the significance of 3D information in CT scans for medical image classification. While Vision Transformer has shown superior performance over convolutional neural networks in image classification tasks, their effectiveness is often demonstrated on sufficiently large 2D dat… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: conference

  11. arXiv:2406.16189  [pdf, other

    eess.IV cs.CV

    Fuzzy Attention-based Border Rendering Network for Lung Organ Segmentation

    Authors: Sheng Zhang, Yang Nan, Yingying Fang, Shiyi Wang, Xiaodan Xing, Zhifan Gao, Guang Yang

    Abstract: Automatic lung organ segmentation on CT images is crucial for lung disease diagnosis. However, the unlimited voxel values and class imbalance of lung organs can lead to false-negative/positive and leakage issues in advanced methods. Additionally, some slender lung organs are easily lost during the recycled down/up-sample procedure, e.g., bronchioles & arterioles, causing severe discontinuity issue… ▽ More

    Submitted 1 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

    Comments: MICCAI 2024

  12. arXiv:2406.15752  [pdf, other

    eess.AS cs.AI cs.CL

    TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers

    Authors: Yakun Song, Zhuo Chen, Xiaofei Wang, Ziyang Ma, Guanrou Yang, Xie Chen

    Abstract: Neural codec language model (LM) has demonstrated strong capability in zero-shot text-to-speech (TTS) synthesis. However, the codec LM often suffers from limitations in inference speed and stability, due to its auto-regressive nature and implicit alignment between text and audio. In this work, to handle these challenges, we introduce a new variant of neural codec LM, namely TacoLM. Specifically, T… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: INTERSPEECH 2024

  13. arXiv:2406.15182  [pdf, other

    cs.CV

    DiffExplainer: Unveiling Black Box Models Via Counterfactual Generation

    Authors: Yingying Fang, Shuang Wu, Zihao **, Caiwen Xu, Shiyi Wang, Simon Walsh, Guang Yang

    Abstract: In the field of medical imaging, particularly in tasks related to early disease detection and prognosis, understanding the reasoning behind AI model predictions is imperative for assessing their reliability. Conventional explanation methods encounter challenges in identifying decisive features in medical image classifications, especially when discriminative features are subtle or not immediately e… ▽ More

    Submitted 26 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: MICCAI 2024

  14. arXiv:2406.14207  [pdf, other

    cs.LG

    LayerMatch: Do Pseudo-labels Benefit All Layers?

    Authors: Chaoqi Liang, Guanglei Yang, Lifeng Qiao, Zitong Huang, Hongliang Yan, Yunchao Wei, Wangmeng Zuo

    Abstract: Deep neural networks have achieved remarkable performance across various tasks when supplied with large-scale labeled data. However, the collection of labeled data can be time-consuming and labor-intensive. Semi-supervised learning (SSL), particularly through pseudo-labeling algorithms that iteratively assign pseudo-labels for self-training, offers a promising solution to mitigate the dependency o… ▽ More

    Submitted 27 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  15. arXiv:2406.13788  [pdf, other

    eess.SP

    Groupwise Deformable Registration of Diffusion Tensor Cardiovascular Magnetic Resonance: Disentangling Diffusion Contrast, Respiratory and Cardiac Motions

    Authors: Fanwen Wang, Yihao Luo, Ke Wen, Jiahao Huang, Pedro F. Ferreira, Yaqing Luo, Yinzhe Wu, Camila Munoz, Dudley J. Pennell, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Diffusion tensor based cardiovascular magnetic resonance (DT-CMR) offers a non-invasive method to visualize the myocardial microstructure. With the assumption that the heart is stationary, frames are acquired with multiple repetitions for different diffusion encoding directions. However, motion from poor breath-holding and imprecise cardiac triggering complicates DT-CMR analysis, further challenge… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted by MICCAI 2024

  16. arXiv:2406.13708  [pdf

    eess.IV physics.med-ph

    Low-rank based motion correction followed by automatic frame selection in DT-CMR

    Authors: Fanwen Wang, Pedro F. Ferreira, Camila Munoz, Ke Wen, Yaqing Luo, Jiahao Huang, Yinzhe Wu, Dudley J. Pennell, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Motivation: Post-processing of in-vivo diffusion tensor CMR (DT-CMR) is challenging due to the low SNR and variation in contrast between frames which makes image registration difficult, and the need to manually reject frames corrupted by motion. Goals: To develop a semi-automatic post-processing pipeline for robust DT-CMR registration and automatic frame selection. Approach: We used low intrinsic… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted as ISMRM 2024 Digital poster 2141

    Journal ref: ISMRM 2024 Digital poster 2141

  17. arXiv:2406.13284  [pdf

    physics.med-ph q-bio.QM

    The association of domain-specific physical activity and sedentary activity with stroke: A prospective cohort study

    Authors: Xinyi He, Shidi Wang, Yi Li, Jiucun Wang, Guangrui Yang, Jun Chen, Zixin Hu

    Abstract: Background The incidence of stroke places a heavy burden on both society and individuals. Activity is closely related to cardiovascular health. This study aimed to investigate the relationship between the varying domains of PA, like occupation-related Physical Activity (OPA), transportation-related Physical Activity (TPA), leisure-time Physical Activity (LTPA), and Sedentary Activity (SA) with str… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  18. arXiv:2406.12496  [pdf, other

    cs.CV

    Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation

    Authors: Guoyu Yang, Yuan Wang, Daming Shi

    Abstract: Semantic segmentation plays a key role in applications such as autonomous driving and medical image. Although existing real-time semantic segmentation models achieve a commendable balance between accuracy and speed, their multi-path blocks still affect overall speed. To address this issue, this study proposes a Reparameterizable Dual-Resolution Network (RDRNet) dedicated to real-time semantic segm… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  19. arXiv:2406.11819  [pdf, other

    cs.CV

    MegaScenes: Scene-Level View Synthesis at Scale

    Authors: Joseph Tung, Gene Chou, Ruo** Cai, Guandao Yang, Kai Zhang, Gordon Wetzstein, Bharath Hariharan, Noah Snavely

    Abstract: Scene-level novel view synthesis (NVS) is fundamental to many vision and graphics applications. Recently, pose-conditioned diffusion models have led to significant progress by extracting 3D information from 2D foundation models, but these methods are limited by the lack of scene-level training data. Common dataset choices either consist of isolated objects (Objaverse), or of object-centric scenes… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Our project page is at https://megascenes.github.io

  20. arXiv:2406.11576  [pdf, other

    cs.CV

    Harmonizing Feature Maps: A Graph Convolutional Approach for Enhancing Adversarial Robustness

    Authors: Kejia Zhang, Juanjuan Weng, Junwei Wu, Guoqing Yang, Shaozi Li, Zhiming Luo

    Abstract: The vulnerability of Deep Neural Networks to adversarial perturbations presents significant security concerns, as the imperceptible perturbations can contaminate the feature space and lead to incorrect predictions. Recent studies have attempted to calibrate contaminated features by either suppressing or over-activating particular channels. Despite these efforts, we claim that adversarial attacks e… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  21. arXiv:2406.10222  [pdf, other

    astro-ph.IM

    Ultra-low noise laser and optical frequency comb-based timing system for the Black Hole Explorer (BHEX) mission

    Authors: Hannah Tomio, Guangning Yang, Holly F. Leopardi, Kenji Numata, Anthony W. Yu, Andrew Attar, Xiaozhen Xu, Wei Lu, Cheryl Gramling, T. K. Sridharan, Peter Kurczynski

    Abstract: In this effort, we demonstrate the performance of a highly stable time reference for the proposed Black Hole Explorer (BHEX) mission, a space-based extension to the Event Horizon Telescope (EHT) Very Long Baseline Interferometry (VLBI) project. This precision timing system is based on the use of a space-qualified, ultra-low noise laser developed as part of the Laser Interferometer Space Antenna (L… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: To be published in the proceedings of SPIE Astronomical Telescopes + Instrumentation 2024

  22. arXiv:2406.08887  [pdf, other

    eess.SP

    Low-Overhead Channel Estimation via 3D Extrapolation for TDD mmWave Massive MIMO Systems Under High-Mobility Scenarios

    Authors: Binggui Zhou, Xi Yang, Shaodan Ma, Feifei Gao, Guanghua Yang

    Abstract: In TDD mmWave massive MIMO systems, the downlink CSI can be attained through uplink channel estimation thanks to the uplink-downlink channel reciprocity. However, the channel aging issue is significant under high-mobility scenarios and thus necessitates frequent uplink channel estimation. In addition, large amounts of antennas and subcarriers lead to high-dimensional CSI matrices, aggravating the… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 13 pages, 11 figures, 3 tables. This paper has been submitted to IEEE journal for possible publication

  23. arXiv:2406.08645  [pdf, other

    astro-ph.GA astro-ph.CO

    ODIN: Identifying Protoclusters and Cosmic Filaments Traced by Ly$α$-emitting Galaxies

    Authors: Vandana Ramakrishnan, Kyoung-Soo Lee, Maria Celeste Artale, Eric Gawiser. Yu** Yang, Changbom Park, Robin Ciardullo, Lucia Guaita, Sang Hyeok Im, Seongjae Kim, Ankit Kumar, Jaehyun Lee, Seong-Kook Lee, Byeongha Moon, Nelson Padilla, Alexandra Pope, Roxana Popescu, Hyunmi Song, Paulina Troncoso, Francisco Valdes, Ann Zabludoff

    Abstract: To understand the formation and evolution of massive cosmic structures, studying them at high redshift, in the epoch when they formed the majority of their mass is essential. The One-hundred-deg$^2$ DECam Imaging in Narrowbands (ODIN) survey is undertaking the widest-area narrowband program to date, to use Ly$α$-emitting galaxies (LAEs) to trace the large-scale structure (LSS) of the Universe at t… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 26 pages, 18 figures; submitted to ApJ

  24. arXiv:2406.06902  [pdf

    cs.SE

    CodeScore-R: An Automated Robustness Metric for Assessing the FunctionalCorrectness of Code Synthesis

    Authors: Guang Yang, Yu Zhou, Xiang Chen, Xiangyu Zhang

    Abstract: Evaluation metrics are crucial in the field of code synthesis. Commonly used code evaluation metrics canbe classified into three types: match-based, semantic-based, and execution-based. Among them, the execution-basedPass@k metric accurately assesses the functionality of predicted code by executing test cases. However, calculatingthis metric requires a significant amount of overhead, necessitating… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: in Chinese language, Journal of Computer Research and Development

  25. arXiv:2406.06847  [pdf, other

    cs.CV

    Generalized W-Net: Arbitrary-style Chinese Character Synthesization

    Authors: Haochuan Jiang, Guanyu Yang, Fei Cheng, Kaizhu Huang

    Abstract: Synthesizing Chinese characters with consistent style using few stylized examples is challenging. Existing models struggle to generate arbitrary style characters with limited examples. In this paper, we propose the Generalized W-Net, a novel class of W-shaped architectures that addresses this. By incorporating Adaptive Instance Normalization and introducing multi-content, our approach can synthesi… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Journal ref: International Conference on Brain Inspired Cognitive Systems 2023

  26. arXiv:2406.06475  [pdf, other

    cs.IR cs.AI

    Survey for Landing Generative AI in Social and E-commerce Recsys -- the Industry Perspectives

    Authors: Da Xu, Danqing Zhang, Guangyu Yang, Bo Yang, Shuyuan Xu, Lingling Zheng, Cindy Liang

    Abstract: Recently, generative AI (GAI), with their emerging capabilities, have presented unique opportunities for augmenting and revolutionizing industrial recommender systems (Recsys). Despite growing research efforts at the intersection of these fields, the integration of GAI into industrial Recsys remains in its infancy, largely due to the intricate nature of modern industrial Recsys infrastructure, ope… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  27. arXiv:2406.06265  [pdf, other

    cond-mat.mtrl-sci

    Large Out-of-Plane Piezoelectric Effect in Janus Ferromagnetic Semiconductor Monolayer of CrOFBr

    Authors: Qiuyue Ma, Guochun Yang, Busheng Wang, Yong Liu

    Abstract: The exploitation of piezoelectric ferromagnetism (PFM) in two-dimensional (2D) materials with large out-of-plane piezoelectric response is motivated not only by technological applications but also scientific interest. In this study, the CrONM monolayer family (N=F, Cl; M=Br, Cl) was investigated using first-principles calculations, revealing that the Janus CrOFBr monolayer exhibits intrinsic ferro… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 21 pages, 6 figures

  28. arXiv:2406.06122  [pdf

    cs.CV

    W-Net: One-Shot Arbitrary-Style Chinese Character Generation with Deep Neural Networks

    Authors: Haochuan Jiang, Guanyu Yang, Kaizhu Huang, Rui Zhang

    Abstract: Due to the huge category number, the sophisticated combinations of various strokes and radicals, and the free writing or printing styles, generating Chinese characters with diverse styles is always considered as a difficult task. In this paper, an efficient and generalized deep framework, namely, the W-Net, is introduced for the one-shot arbitrary-style Chinese character generation task. Specifica… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Journal ref: 2018, Neural Information Processing - 25th International Conference, ICONIP

  29. arXiv:2406.05897  [pdf, other

    cs.CV

    InfoGaussian: Structure-Aware Dynamic Gaussians through Lightweight Information Sha**

    Authors: Yunchao Zhang, Guandao Yang, Leonidas Guibas, Yanchao Yang

    Abstract: 3D Gaussians, as a low-level scene representation, typically involve thousands to millions of Gaussians. This makes it difficult to control the scene in ways that reflect the underlying dynamic structure, where the number of independent entities is typically much smaller. In particular, it can be challenging to animate and move objects in the scene, which requires coordination among many Gaussians… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  30. arXiv:2406.05839  [pdf, other

    eess.AS cs.AI

    MaLa-ASR: Multimedia-Assisted LLM-Based ASR

    Authors: Guanrou Yang, Ziyang Ma, Fan Yu, Zhifu Gao, Shiliang Zhang, Xie Chen

    Abstract: As more and more information-rich data like video become available, utilizing multi-modal auxiliary information to enhance audio tasks has sparked widespread research interest. The recent surge in research on LLM-based audio models provides fresh perspectives for tackling audio tasks. Given that LLM can flexibly ingest multiple inputs, we propose MaLa-ASR, an LLM-based ASR model that can integrate… ▽ More

    Submitted 13 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  31. arXiv:2406.05766  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment

    Authors: Zijia Song, Zelin Zang, Yelin Wang, Guozheng Yang, Jiangbin Zheng, Kaicheng yu, Wanyu Chen, Stan Z. Li

    Abstract: Multimodal fusion breaks through the barriers between diverse modalities and has already yielded numerous impressive performances. However, in various specialized fields, it is struggling to obtain sufficient alignment data for the training process, which seriously limits the use of previously elegant models. Thus, semi-supervised learning attempts to achieve multimodal alignment with fewer matche… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  32. arXiv:2406.05356  [pdf

    physics.optics

    Thermalization dynamics in photonic lattices of different geometries

    Authors: Guowen Yang, Domenico Bongiovanni, Daohong Song, Roberto Morandotti, Zhigang Chen, Nikolaos K. Efremidis

    Abstract: The statistical mechanical behavior of weakly nonlinear multimoded optical settings is attracting increased interest during the last few years. The main purpose of this work is to numerically investigate the main factors that affect the thermalization process in photonic lattices. In particular, we find that lattices with identically selected properties (such as temperature, coupling coefficient,… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 11 pages, 4 figures

  33. arXiv:2406.04019  [pdf, other

    physics.flu-dyn

    An investigation of anisotropy in the bubbly turbulent flow via direct numerical simulations

    Authors: Xuanwei Zhang, Yanchao Liu, Wenkang Wang, Guang Yang, Xu Chu

    Abstract: This study explores the dynamics of dispersed bubbly turbulent flow in a channel using interface-resolved direct numerical simulation (DNS) with an efficient Coupled Level-Set Volume-of-Fluid (CLSVOF) solver. The influence of number of bubbles (96 and 192), flow direction, and Eotvos number was examined across eight distinct cases. The results indicate that in upward flows, bubbles tend to accumul… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  34. arXiv:2406.02023  [pdf, other

    cs.CR

    ShadowBound: Efficient Heap Memory Protection Through Advanced Metadata Management and Customized Compiler Optimization

    Authors: Zheng Yu, Ganxiang Yang, Xinyu Xing

    Abstract: In software development, the prevalence of unsafe languages such as C and C++ introduces potential vulnerabilities, especially within the heap, a pivotal component for dynamic memory allocation. Despite its significance, heap management complexities have made heap corruption pervasive, posing severe threats to system security. While prior solutions aiming for temporal and spatial memory safety exh… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  35. arXiv:2406.01815  [pdf

    cs.CV

    Deep asymmetric mixture model for unsupervised cell segmentation

    Authors: Yang Nan, Guang Yang

    Abstract: Automated cell segmentation has become increasingly crucial for disease diagnosis and drug discovery, as manual delineation is excessively laborious and subjective. To address this issue with limited manual annotation, researchers have developed semi/unsupervised segmentation approaches. Among these approaches, the Deep Gaussian mixture model plays a vital role due to its capacity to facilitate co… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures

  36. arXiv:2406.01604  [pdf, other

    cs.IR cs.AI cs.CV cs.MM

    An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval

    Authors: Xiaolun **g, Genke Yang, Jian Chu

    Abstract: CLIP4Clip model transferred from the CLIP has been the de-factor standard to solve the video clip retrieval task from frame-level input, triggering the surge of CLIP4Clip-based models in the video-text retrieval domain. In this work, we rethink the inherent limitation of widely-used mean pooling operation in the frame features aggregation and investigate the adaptions of excitation and aggregation… ▽ More

    Submitted 8 June, 2024; v1 submitted 25 May, 2024; originally announced June 2024.

    Comments: 20 pages

  37. arXiv:2406.01007  [pdf, other

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng , et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  38. arXiv:2406.00959  [pdf, other

    cond-mat.mtrl-sci

    Ta2Pd3Te5 topological thermometer

    Authors: Yupeng Li, Anqi Wang, Senyang Pan, Dayu Yan, Guang Yang, Xingchen Guo, Yu Hong, Guangtong Liu, Fanming Qu, Zhijun Wang, Tian Qian, **glei Zhang, Youguo Shi, Li Lu, Jie Shen

    Abstract: In recent decades, there has been a persistent pursuit of applications for surface/edge states in topological systems, driven by their dissipationless transport effects. However, there have been limited tangible breakthroughs in this field. This work demonstrates the remarkable properties of the topological insulator Ta2Pd3Te5, as a thermometer. This material exhibits a power-law correlation in te… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 15 pages, 9 figures

  39. arXiv:2405.18897  [pdf, other

    cs.CV

    MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning

    Authors: Junjie Wang, Guang**g Yang, Wentao Chen, Huahui Yi, Xiaohu Wu, Qicheng Lao

    Abstract: In response to the challenges posed by the extensive parameter updates required for full fine-tuning of large-scale pre-trained models, parameter-efficient fine-tuning (PEFT) methods, exemplified by Low-Rank Adaptation (LoRA), have emerged. LoRA simplifies the fine-tuning process but may still struggle with a certain level of redundancy in low-rank matrices and limited effectiveness from merely in… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Tech report

  40. arXiv:2405.18353  [pdf, other

    cs.LG stat.ML

    Simulating infinite-dimensional nonlinear diffusion bridges

    Authors: Gefan Yang, Elizabeth Louise Baker, Michael L. Severinsen, Christy Anna Hipsley, Stefan Sommer

    Abstract: The diffusion bridge is a type of diffusion process that conditions on hitting a specific state within a finite time period. It has broad applications in fields such as Bayesian inference, financial mathematics, control theory, and shape analysis. However, simulating the diffusion bridge for natural data can be challenging due to both the intractability of the drift term and continuous representat… ▽ More

    Submitted 6 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  41. arXiv:2405.17659  [pdf, other

    eess.IV cs.CV

    Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba

    Authors: Jiahao Huang, Liutao Yang, Fanwen Wang, Yang Nan, Weiwen Wu, Chengyan Wang, Kuangyu Shi, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang

    Abstract: Deep learning has been extensively applied in medical image reconstruction, where Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) represent the predominant paradigms, each possessing distinct advantages and inherent limitations: CNNs exhibit linear complexity with local sensitivity, whereas ViTs demonstrate quadratic complexity with global sensitivity. The emerging Mamba has sh… ▽ More

    Submitted 25 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  42. arXiv:2405.15241  [pdf, other

    eess.IV cs.CV

    Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving

    Authors: Jia He, Bonan Li, Ge Yang, Ziwen Liu

    Abstract: Solving 3D medical inverse problems such as image restoration and reconstruction is crucial in modern medical field. However, the curse of dimensionality in 3D medical data leads mainstream volume-wise methods to suffer from high resource consumption and challenges models to successfully capture the natural distribution, resulting in inevitable volume inconsistency and artifacts. Some recent works… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  43. arXiv:2405.12575  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    Three-dimensional map** and electronic origin of large altermagnetic splitting near Fermi level in CrSb

    Authors: Guowei Yang, Zhanghuan Li, Sai Yang, Jiyuan Li, Hao Zheng, Weifan Zhu, Saizheng Cao, Wenxuan Zhao, Jiawen Zhang, Mao Ye, Yu Song, Lun-Hui Hu, Lexian Yang, Ming Shi, Huiqiu Yuan, Yongjun Zhang, Yuanfeng Xu, Yang Liu

    Abstract: Recently, a new kind of collinear magnetism, dubbed altermagnetism, has attracted considerable interests. A key characteristic of altermagnet is the momentum-dependent band and spin splitting without net magnetization. However, finding altermagnetic materials with large splitting near the Fermi level, which necessarily requires three-dimensional k-space map** and is crucial for spintronic applic… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 16 pages, 4 figures and 1 table

  44. arXiv:2405.12488  [pdf, other

    hep-ex

    First joint oscillation analysis of Super-Kamiokande atmospheric and T2K accelerator neutrino data

    Authors: Super-Kamiokande, T2K collaborations, :, S. Abe, K. Abe, N. Akhlaq, R. Akutsu, H. Alarakia-Charles, A. Ali, Y. I. Alj Hakim, S. Alonso Monsalve, S. Amanai, C. Andreopoulos, L. H. V. Anthony, M. Antonova, S. Aoki, K. A. Apte, T. Arai, T. Arihara, S. Arimoto, Y. Asada, R. Asaka, Y. Ashida, E. T. Atkin, N. Babu , et al. (524 additional authors not shown)

    Abstract: The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlap** in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 10 pages, 3 figures

  45. arXiv:2405.10219  [pdf

    physics.med-ph

    Current Views on Mechanisms of the FLASH Effect in Cancer Radiotherapy

    Authors: Yuqi Ma, Ziming Zhao, Wenkang Zhang, Jianfeng Lv, Junyi Chen, Xueqin Yan, XiaoJi Lin, Junlong Zhang, Bingwu Wang, Song Gao, Jie Xiao, Gen Yang

    Abstract: FLASH radiotherapy (FLASH-RT) is a new modality of radiotherapy by delivering doses with ultra-high dose rates. FLASH-RT has the ability to suppress tumor growth while sparing normal tissues, known as the FLASH effect. Although FLASH effect has proved valid in various models by different ionizing radiations, the exact underlying mechanism is still unclear. This article summarizes mainstream hypoth… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 24 pages, 5 figures

  46. arXiv:2405.09597  [pdf

    cs.LG cs.AI

    When AI Eats Itself: On the Caveats of Data Pollution in the Era of Generative AI

    Authors: Xiaodan Xing, Fadong Shi, Jiahao Huang, Yinzhe Wu, Yang Nan, Sheng Zhang, Yingying Fang, Mike Roberts, Carola-Bibiane Schönlieb, Javier Del Ser, Guang Yang

    Abstract: Generative artificial intelligence (AI) technologies and large models are producing realistic outputs across various domains, such as images, text, speech, and music. Creating these advanced generative models requires significant resources, particularly large and high-quality datasets. To minimize training expenses, many algorithm developers use data created by the models themselves as a cost-effe… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  47. arXiv:2405.09443  [pdf, other

    cs.IT eess.SP

    Low-Complexity Joint Azimuth-Range-Velocity Estimation for Integrated Sensing and Communication with OFDM Waveform

    Authors: Jun Zhang, Gang Yang, Qibin Ye, Yixuan Huang, Su Hu

    Abstract: Integrated sensing and communication (ISAC) is a main application scenario of the sixth-generation mobile communication systems. Due to the fast-growing number of antennas and subcarriers in cellular systems, the computational complexity of joint azimuth-range-velocity estimation (JARVE) in ISAC systems is extremely high. This paper studies the JARVE problem for a monostatic ISAC system with ortho… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 16 pages, 12 figures, submitted to IEEE journal

  48. arXiv:2405.09024  [pdf, other

    cs.CV

    Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels

    Authors: Guozhang Liu, Ting Liu, Mengke Yuan, Tao Pang, Guangxing Yang, Hao Fu, Tao Wang, Tongkui Liao

    Abstract: The ambiguous appearance, tiny scale, and fine-grained classes of objects in remote sensing imagery inevitably lead to the noisy annotations in category labels of detection dataset. However, the effects and treatments of the label noises are underexplored in modern oriented remote sensing object detectors. To address this issue, we propose a robust oriented remote sensing object detection method t… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  49. arXiv:2405.07978  [pdf, other

    cond-mat.mtrl-sci physics.app-ph physics.optics

    Unveiling the Pockels Coefficient of Ferroelectric Nitride ScAlN

    Authors: Guangcanlan Yang, Haochen Wang, Sai Mu, Hao Xie, Tyler Wang, Chengxing He, Mohan Shen, Mengxia Liu, Chris G. Van de Walle, Hong X. Tang

    Abstract: Nitride ferroelectrics have recently emerged as promising alternatives to oxide ferroelectrics due to their compatibility with mainstream semiconductor processing. ScAlN, in particular, has exhibited remarkable piezoelectric coupling strength ($K^2$) comparable to that of lithium niobate (LN), making it a valuable choice for RF filters in wireless communications. Recently, ScAlN has sparked intere… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  50. arXiv:2405.07411  [pdf, other

    cs.CV cs.AI

    MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasks

    Authors: Haijiang Tian, **gkun Yue, Xiaohong Liu, Guoxing Yang, Zeyu Jiang, Guangyu Wang

    Abstract: Medical images are often more difficult to acquire than natural images due to the specialism of the equipment and technology, which leads to less medical image datasets. So it is hard to train a strong pretrained medical vision model. How to make the best of natural pretrained vision model and adapt in medical domain still pends. For image classification, a popular method is linear probe (LP). How… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.