Skip to main content

Showing 101–150 of 995 results for author: Zheng, S

.
  1. arXiv:2401.02651  [pdf, other

    cs.CV

    Benchmarking PathCLIP for Pathology Image Analysis

    Authors: Sunyi Zheng, Xiaonan Cui, Yuxuan Sun, **gxiong Li, Honglin Li, Yunlong Zhang, **yi Chen, Xue** **g, Zhaoxiang Ye, Lin Yang

    Abstract: Accurate image classification and retrieval are of importance for clinical diagnosis and treatment decision-making. The recent contrastive language-image pretraining (CLIP) model has shown remarkable proficiency in understanding natural images. Drawing inspiration from CLIP, PathCLIP is specifically designed for pathology image analysis, utilizing over 200,000 image and text pairs in training. Whi… ▽ More

    Submitted 12 June, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

  2. arXiv:2401.01553  [pdf, other

    eess.IV cs.CV

    Multi-modal Learning with Missing Modality in Predicting Axillary Lymph Node Metastasis

    Authors: Shichuan Zhang, Sunyi Zheng, Zhongyi Shui, Honglin Li, Lin Yang

    Abstract: Multi-modal Learning has attracted widespread attention in medical image analysis. Using multi-modal data, whole slide images (WSIs) and clinical information, can improve the performance of deep learning models in the diagnosis of axillary lymph node metastasis. However, clinical information is not easy to collect in clinical practice due to privacy concerns, limited resources, lack of interoperab… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  3. arXiv:2401.00644  [pdf, other

    cs.CE

    DEWP: Deep Expansion Learning for Wind Power Forecasting

    Authors: Wei Fan, Yanjie Fu, Shun Zheng, Jiang Bian, Yuanchun Zhou, Hui Xiong

    Abstract: Wind is one kind of high-efficient, environmentally-friendly and cost-effective energy source. Wind power, as one of the largest renewable energy in the world, has been playing a more and more important role in supplying electricity. Though growing dramatically in recent years, the amount of generated wind power can be directly or latently affected by multiple uncertain factors, such as wind speed… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: Accepted by TKDD

  4. arXiv:2312.17515  [pdf, other

    cs.CL

    Cooperation on the Fly: Exploring Language Agents for Ad Hoc Teamwork in the Avalon Game

    Authors: Zi**g Shi, Meng Fang, Shunfeng Zheng, Shilong Deng, Ling Chen, Yali Du

    Abstract: Multi-agent collaboration with Large Language Models (LLMs) demonstrates proficiency in basic tasks, yet its efficiency in more complex scenarios remains unexplored. In gaming environments, these agents often face situations without established coordination protocols, requiring them to make intelligent inferences about teammates from limited data. This problem motivates the area of ad hoc teamwork… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: Code will release soon

  5. arXiv:2312.16867  [pdf, other

    cs.CV cs.GR

    DualFluidNet: an Attention-based Dual-pipeline Network for FLuid Simulation

    Authors: Yu Chen, Shuai Zheng, Menglong **, Yan Chang, Nianyi Wang

    Abstract: Fluid motion can be considered as a point cloud transformation when using the SPH method. Compared to traditional numerical analysis methods, using machine learning techniques to learn physics simulations can achieve near-accurate results, while significantly increasing efficiency. In this paper, we propose an innovative approach for 3D fluid simulations utilizing an Attention-based Dual-pipeline… ▽ More

    Submitted 18 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 14 pages

  6. arXiv:2312.15993  [pdf

    cs.AI cs.RO eess.SY

    Adaptive Kalman-based hybrid car following strategy using TD3 and CACC

    Authors: Yuqi Zheng, Ruidong Yan, Bin Jia, Rui Jiang, Adriana TAPUS, Xiao**g Chen, Shiteng Zheng, Ying Shang

    Abstract: In autonomous driving, the hybrid strategy of deep reinforcement learning and cooperative adaptive cruise control (CACC) can fully utilize the advantages of the two algorithms and significantly improve the performance of car following. However, it is challenging for the traditional hybrid strategy based on fixed coefficients to adapt to mixed traffic flow scenarios, which may decrease the performa… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 32pages,13figures

  7. arXiv:2312.14414  [pdf, other

    quant-ph

    Critical quantum geometric tensors of parametrically-driven nonlinear resonators

    Authors: Hao-Long Zhang, Jia-Hao Lv, Ken Chen, Xue-Jia Yu, Fan Wu, Zhen-Biao Yang, Shi-Biao Zheng

    Abstract: Parametrically driven nonlinear resonators represent a building block for realizing fault-tolerant quantum computation and are useful for critical quantum sensing. From a fundamental viewpoint, the most intriguing feature of such a system is perhaps the critical phenomena, which can occur without interaction with any other quantum system. The non-analytic behaviors of its eigenspectrum have been s… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Any comments or suggestions are welcome !

  8. arXiv:2312.11820  [pdf, other

    cs.AR

    SoC-Tuner: An Importance-guided Exploration Framework for DNN-targeting SoC Design

    Authors: Shixin Chen, Su Zheng, Chen Bai, Wenqian Zhao, Shuo Yin, Yang Bai, Bei Yu

    Abstract: Designing a system-on-chip (SoC) for deep neural network (DNN) acceleration requires balancing multiple metrics such as latency, power, and area. However, most existing methods ignore the interactions among different SoC components and rely on inaccurate and error-prone evaluation tools, leading to inferior SoC design. In this paper, we present SoC-Tuner, a DNN-targeting exploration framework to f… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: ASP-DAC 2024

  9. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  10. arXiv:2312.11539  [pdf, other

    cs.AI cs.CL cs.LG

    KGLens: A Parameterized Knowledge Graph Solution to Assess What an LLM Does and Doesn't Know

    Authors: Shangshang Zheng, He Bai, Yizhe Zhang, Yi Su, Xiaochuan Niu, Navdeep Jaitly

    Abstract: Measuring the alignment between a Knowledge Graph (KG) and Large Language Models (LLMs) is an effective method to assess the factualness and identify the knowledge blind spots of LLMs. However, this approach encounters two primary challenges including the translation of KGs into natural language and the efficient evaluation of these extensive and complex structures. In this paper, we present KGLen… ▽ More

    Submitted 16 February, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  11. arXiv:2312.09579  [pdf, other

    cs.CV cs.AI

    MobileSAMv2: Faster Segment Anything to Everything

    Authors: Chaoning Zhang, Dongshen Han, Sheng Zheng, **woo Choi, Tae-Ho Kim, Choong Seon Hong

    Abstract: Segment anything model (SAM) addresses two practical yet challenging segmentation tasks: \textbf{segment anything (SegAny)}, which utilizes a certain point to predict the mask for a single object of interest, and \textbf{segment everything (SegEvery)}, which predicts the masks for all objects on the image. What makes SegAny slow for SAM is its heavyweight image encoder, which has been addressed by… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: MobileSAM achieves faster segment anything, while MobileSAMv2 achieves faster segment everything

  12. arXiv:2312.08648  [pdf, other

    cs.CV

    CLIP-guided Federated Learning on Heterogeneous and Long-Tailed Data

    Authors: Jiangming Shi, Shanshan Zheng, Xiangbo Yin, Yang Lu, Yuan Xie, Yanyun Qu

    Abstract: Federated learning (FL) provides a decentralized machine learning paradigm where a server collaborates with a group of clients to learn a global model without accessing the clients' data. User heterogeneity is a significant challenge for FL, which together with the class-distribution imbalance further enhances the difficulty of FL. Great progress has been made in large vision-language models, such… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: This paper has been accepted by AAAI24

  13. arXiv:2312.08538  [pdf, other

    cs.LG cs.AI

    Contractive error feedback for gradient compression

    Authors: Bingcong Li, Shuai Zheng, Parameswaran Raman, Anshumali Shrivastava, Georgios B. Giannakis

    Abstract: On-device memory concerns in distributed deep learning have become severe due to (i) the growth of model size in multi-GPU training, and (ii) the wide adoption of deep neural networks for federated learning on IoT devices which have limited storage. In such settings, communication efficient optimization methods are attractive alternatives, however they still struggle with memory issues. To tackle… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  14. Quantum metric and metrology with parametrically-driven Tavis-Cummings models

    Authors: Jia-Hao Lü, Pei-Rong Han, Wen Ning, Xin Zhu, Fan Wu, Li-Tuo Shen, Zhen-Biao Yang, Shi-Biao Zheng

    Abstract: We study the quantum metric in a driven Tavis-Cummings model, comprised of multiple qubits interacting with a quantized photonic field. The parametrical driving of the photonic field breaks the system's U(1) symmetry down to a ${\rm Z}_2$ symmetry, whose spontaneous breaking initiates a superradiant phase transition. We analytically solved the eigenenergies and eigenstates, and numerically simulat… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 13 pages, 5 figures

    Journal ref: Opt. Express 31, 41669-41683 (2023)

  15. arXiv:2312.06632  [pdf, other

    cs.AI

    Control Risk for Potential Misuse of Artificial Intelligence in Science

    Authors: Jiyan He, Weitao Feng, Yaosen Min, **gwei Yi, Kunsheng Tang, Shuai Li, Jie Zhang, Kejiang Chen, Wenbo Zhou, Xing Xie, Weiming Zhang, Nenghai Yu, Shuxin Zheng

    Abstract: The expanding application of Artificial Intelligence (AI) in scientific fields presents unprecedented opportunities for discovery and innovation. However, this growth is not without risks. AI models in science, if misused, can amplify risks like creation of harmful substances, or circumvention of established regulations. In this study, we aim to raise awareness of the dangers of AI misuse in scien… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  16. arXiv:2312.05486  [pdf, other

    cs.AI cs.LG math.PR

    FreeFlow: A Comprehensive Understanding on Diffusion Probabilistic Models via Optimal Transport

    Authors: Bowen Sun, Shibao Zheng

    Abstract: The blooming diffusion probabilistic models (DPMs) have garnered significant interest due to their impressive performance and the elegant inspiration they draw from physics. While earlier DPMs relied upon the Markovian assumption, recent methods based on differential equations have been rapidly applied to enhance the efficiency and capabilities of these models. However, a theoretical interpretatio… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  17. arXiv:2312.04973  [pdf, other

    cs.GT

    Ex-post Individually Rational Bayesian Persuasion

    Authors: Jiahao Zhang, Shuran Zheng, Renato Paes Leme, Zhiwei Steven Wu

    Abstract: The success of Bayesian persuasion relies on the key assumption that the sender will commit to a predetermined information disclosure policy (signaling scheme). However, in practice, it is usually difficult for the receiver to monitor whether the sender sticks to the disclosure policy, which makes the credibility of the sender's disclosure policy questionable. The sender's credibility is particula… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 21 pages

  18. arXiv:2312.04020  [pdf, ps, other

    math.AP math.CA

    Note on gradient estimate of heat kernel for Schrödinger operators

    Authors: Shijun Zheng

    Abstract: Let $H=-Δ+V$ be a Schrödinger operator on $\mathbb{R}^n$. We show that gradient estimates for the heat kernel of $H$ with upper Gaussian bounds imply polynomial decay for the kernels of certain smooth dyadic spectral operators. The latter decay property has been known to play an important role in the Littlewood-Paley theory for $L^p$ and Sobolev spaces. We are able to establish the result by modif… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    MSC Class: 35J10; 42B37

    Journal ref: Applied Mathematics, Volume 1, No.5, November 2010, pp. 425-430

  19. arXiv:2312.03690  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Inverse Design of Vitrimeric Polymers by Molecular Dynamics and Generative Modeling

    Authors: Yiwen Zheng, Prakash Thakolkaran, Jake A. Smith, Ziheng Lu, Shuxin Zheng, Bichlien H. Nguyen, Siddhant Kumar, Aniruddh Vashisth

    Abstract: Vitrimer is a new class of sustainable polymers with the ability of self-healing through rearrangement of dynamic covalent adaptive networks. However, a limited choice of constituent molecules restricts their property space, prohibiting full realization of their potential applications. Through a combination of molecular dynamics (MD) simulations and machine learning (ML), particularly a novel grap… ▽ More

    Submitted 13 March, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

  20. arXiv:2312.02546  [pdf, other

    cs.CV

    Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning

    Authors: Zhuo Huang, Chang Liu, Yinpeng Dong, Hang Su, Shibao Zheng, Tongliang Liu

    Abstract: Although vision models such as Contrastive Language-Image Pre-Training (CLIP) show impressive generalization performance, their zero-shot robustness is still limited under Out-of-Distribution (OOD) scenarios without fine-tuning. Instead of undesirably providing human supervision as commonly done, it is possible to take advantage of Multi-modal Large Language Models (MLLMs) that hold powerful visua… ▽ More

    Submitted 29 May, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: ICML 2024

  21. arXiv:2312.02155  [pdf, other

    cs.CV

    GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis

    Authors: Shunyuan Zheng, Boyao Zhou, Ruizhi Shao, Boning Liu, Sheng** Zhang, Liqiang Nie, Yebin Liu

    Abstract: We present a new approach, termed GPS-Gaussian, for synthesizing novel views of a character in a real-time manner. The proposed method enables 2K-resolution rendering under a sparse-view camera setting. Unlike the original Gaussian Splatting or neural implicit rendering methods that necessitate per-subject optimizations, we introduce Gaussian parameter maps defined on the source views and regress… ▽ More

    Submitted 16 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted by CVPR 2024 (Highlight). Project page: https://shunyuanzheng.github.io/GPS-Gaussian

  22. arXiv:2312.01367  [pdf, other

    cs.CV cs.AI

    DiFace: Cross-Modal Face Recognition through Controlled Diffusion

    Authors: Bowen Sun, Shibao Zheng

    Abstract: Diffusion probabilistic models (DPMs) have exhibited exceptional proficiency in generating visual media of outstanding quality and realism. Nonetheless, their potential in non-generative domains, such as face recognition, has yet to be thoroughly investigated. Meanwhile, despite the extensive development of multi-modal face recognition methods, their emphasis has predominantly centered on visual m… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  23. arXiv:2312.01292  [pdf, ps, other

    cs.NI eess.SP

    Joint Beam Scheduling and Power Optimization for Beam Hop** LEO Satellite Systems

    Authors: Shuang Zheng, Xing Zhang, Peng Wang, Wenbo Wang

    Abstract: Low earth orbit (LEO) satellite communications can provide ubiquitous and reliable services, making it an essential part of the Internet of Everything network. Beam hop** (BH) is an emerging technology for effectively addressing the issue of low resource utilization caused by the non-uniform spatio-temporal distribution of traffic demands. However, how to allocate multi-dimensional resources in… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  24. arXiv:2312.00186  [pdf, other

    stat.AP cs.AI

    Planning Reliability Assurance Tests for Autonomous Vehicles

    Authors: Simin Zheng, Lu Lu, Yili Hong, Jian Liu

    Abstract: Artificial intelligence (AI) technology has become increasingly prevalent and transforms our everyday life. One important application of AI technology is the development of autonomous vehicles (AV). However, the reliability of an AV needs to be carefully demonstrated via an assurance test so that the product can be used with confidence in the field. To plan for an assurance test, one needs to dete… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 29 pages, 5 figures

  25. arXiv:2311.18220  [pdf, ps, other

    cs.CC

    Lifting query complexity to time-space complexity for two-way finite automata

    Authors: Shenggen Zheng, Yaqiao Li, Minghua Pan, Jozef Gruska, Lvzhou Li

    Abstract: Time-space tradeoff has been studied in a variety of models, such as Turing machines, branching programs, and finite automata, etc. While communication complexity as a technique has been applied to study finite automata, it seems it has not been used to study time-space tradeoffs of finite automata. We design a new technique showing that separations of query complexity can be lifted, via communica… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  26. arXiv:2311.17267  [pdf, other

    cs.CV

    E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer

    Authors: Jacob Zhiyuan Fang, Skyler Zheng, Vasu Sharma, Robinson Piramuthu

    Abstract: To build scalable models for challenging real-world tasks, it is important to learn from diverse, multi-modal data in various forms (e.g., videos, text, and images). Among the existing works, a plethora of them have focused on leveraging large but cumbersome cross-modal architectures. Regardless of their effectiveness, larger architectures unavoidably prevent the models from being extended to real… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  27. arXiv:2311.16480  [pdf, other

    cs.CV cs.AI cs.CL

    WsiCaption: Multiple Instance Generation of Pathology Reports for Gigapixel Whole-Slide Images

    Authors: **yi Chen, Honglin Li, Chenglu Zhu, Sunyi Zheng, Zhongyi Shui, Lin Yang

    Abstract: Whole slide images are the foundation of digital pathology for the diagnosis and treatment of carcinomas. Writing pathology reports is laborious and error-prone for inexperienced pathologists. To reduce the workload and improve clinical automation, we investigate how to generate pathology reports given whole slide images. On the data end, we curated the largest WSI-text dataset (PathText). In spec… ▽ More

    Submitted 27 June, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  28. arXiv:2311.16155  [pdf, other

    eess.SP cs.LG

    Deep Learning-Based Frequency Offset Estimation

    Authors: Tao Chen, Shilian Zheng, Jiawei Zhu, Qi Xuan, Xiaoniu Yang

    Abstract: In wireless communication systems, the asynchronization of the oscillators in the transmitter and the receiver along with the Doppler shift due to relative movement may lead to the presence of carrier frequency offset (CFO) in the received signals. Estimation of CFO is crucial for subsequent processing such as coherent demodulation. In this brief, we demonstrate the utilization of deep learning fo… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  29. arXiv:2311.15939  [pdf, other

    cs.CV

    Unleashing the Power of Prompt-driven Nucleus Instance Segmentation

    Authors: Zhongyi Shui, Yunlong Zhang, Kai Yao, Chenglu Zhu, Sunyi Zheng, **gxiong Li, Honglin Li, Yuxuan Sun, Ruizhe Guo, Lin Yang

    Abstract: Nucleus instance segmentation in histology images is crucial for a broad spectrum of clinical applications. Current dominant algorithms rely on regression of nuclear proxy maps. Distinguishing nucleus instances from the estimated maps requires carefully curated post-processing, which is error-prone and parameter-sensitive. Recently, the Segment Anything Model (SAM) has earned huge attention in med… ▽ More

    Submitted 24 January, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: under review

  30. arXiv:2311.15058  [pdf

    physics.optics

    Controlled generation of Poincaré sphere beams with inverse-designed multimode meta-waveguides

    Authors: **g Luan, Shuang Zheng, Zhenyu Wan, Tiange Wu, Weijie Chang, Deming Liu, Minming Zhang

    Abstract: The angular momentum of light can be described by positions on various Poincaré spheres, where different structured light beams have proven useful for numerous optical applications. However, the dynamic generation and control of arbitrary structured light on different Poincaré spheres is still handled via bulky optics in free space. Here we propose and demonstrate multimode silicon photonic integr… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  31. arXiv:2311.14273  [pdf, other

    hep-ph astro-ph.CO

    Gravitational Dark Matter from Minimal Preheating

    Authors: Ruopeng Zhang, Sibo Zheng

    Abstract: Following our previous work, we continue to explore gravitational dark matter production during the minimal preheating caused by inflaton self-resonance. In this situation there is only one dimensionless index parameter $n$ characterizing the inflation potential after the end of inflation, which leads to a robust prediction on the gravitational dark matter relic abundance. Using lattice method to… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 13 pages, 5 figures

  32. arXiv:2311.12885  [pdf, other

    cs.CV

    Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis

    Authors: Honglin Li, Yunlong Zhang, Chenglu Zhu, Jiatong Cai, Sunyi Zheng, Lin Yang

    Abstract: Histopathology image analysis is the golden standard of clinical diagnosis for Cancers. In doctors daily routine and computer-aided diagnosis, the Whole Slide Image (WSI) of histopathology tissue is used for analysis. Because of the extremely large scale of resolution, previous methods generally divide the WSI into a large number of patches, then aggregate all patches within a WSI by Multi-Instanc… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  33. arXiv:2311.12413  [pdf, ps, other

    math.PR math.OC

    On the calculation of upper variance under multiple probabilities

    Authors: Xinpeng Li, Miao Yu, Shiyi Zheng

    Abstract: The notion of upper variance under multiple probabilities is defined by a corresponding minimax optimization problem. This paper proposes a simple algorithm to solve the related minimax optimization problem exactly. As an application, we provide the probabilistic representation for a class of quadratic programming problems.

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 8 pages

  34. arXiv:2311.12358  [pdf, other

    cs.LG cs.DC

    Federated Learning via Consensus Mechanism on Heterogeneous Data: A New Perspective on Convergence

    Authors: Shu Zheng, Tiandi Ye, Xiang Li, Ming Gao

    Abstract: Federated learning (FL) on heterogeneous data (non-IID data) has recently received great attention. Most existing methods focus on studying the convergence guarantees for the global objective. While these methods can guarantee the decrease of the global objective in each communication round, they fail to ensure risk decrease for each client. In this paper, to address the problem,we propose FedCOME… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  35. arXiv:2311.11534  [pdf, other

    astro-ph.GA astro-ph.SR

    Spatial distribution of NH2D in massive star-forming regions

    Authors: Yuqiang Li, Junzhi Wang, Juan Li, Shu Liu, Kai Yang, Siqi Zheng, Zhe Lu

    Abstract: To understand the relation between NH$_2$D and its physical environment, we mapped ortho-NH$_2$D $1_{11}^s-1_{01}^a$ at 85.9 GHz toward 24 Galactic late-stage massive star-forming regions with Institut de Radioastronomie Millim$ é$trique (IRAM) 30-m telescope. Ortho-NH$_2$D $1_{11}^s-1_{01}^a$ was detected in 18 of 24 sources. Comparing with the distribution of H$^{13}$CN 1-0 as a dense gas tracer… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: 30 pages, 20 figures, 4 tables. Accepted to MNRAS

  36. arXiv:2311.11465  [pdf, other

    cs.CV

    Understanding Segment Anything Model: SAM is Biased Towards Texture Rather than Shape

    Authors: Chaoning Zhang, Yu Qiao, Shehbaz Tariq, Sheng Zheng, Chenshuang Zhang, Chenghao Li, Hyundong Shin, Choong Seon Hong

    Abstract: In contrast to the human vision that mainly depends on the shape for recognizing the objects, deep image recognition models are widely known to be biased toward texture. Recently, Meta research team has released the first foundation model for image segmentation, termed segment anything model (SAM), which has attracted significant attention. In this work, we understand SAM from the perspective of t… ▽ More

    Submitted 3 June, 2023; originally announced November 2023.

  37. arXiv:2311.10463  [pdf, other

    eess.IV cs.CV

    Correlation-Distance Graph Learning for Treatment Response Prediction from rs-fMRI

    Authors: Xiatian Zhang, Sisi Zheng, Hubert P. H. Shum, Haozheng Zhang, Nan Song, Mingkang Song, Hongxiao Jia

    Abstract: Resting-state fMRI (rs-fMRI) functional connectivity (FC) analysis provides valuable insights into the relationships between different brain regions and their potential implications for neurological or psychiatric disorders. However, specific design efforts to predict treatment response from rs-fMRI remain limited due to difficulties in understanding the current brain state and the underlying mech… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Proceedings of the 2023 International Conference on Neural Information Processing (ICONIP)

  38. DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines

    Authors: Chenyu Jiang, Zhen Jia, Shuai Zheng, Yida Wang, Chuan Wu

    Abstract: Multi-task model training has been adopted to enable a single deep neural network model (often a large language model) to handle multiple tasks (e.g., question answering and text summarization). Multi-task training commonly receives input sequences of highly different lengths due to the diverse contexts of different tasks. Padding (to the same sequence length) or packing (short examples into long… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 18 pages, 18 figures

  39. arXiv:2311.10360  [pdf, other

    hep-ph

    Self-interacting dark matter to freeze-in via vector portal

    Authors: Xinyue Yin, Shuai Xu, Sibo Zheng

    Abstract: It is challenging to resolve the small-scale problem for dark matter being a weakly-interacting massive particle. We attempt to address this issue by proposing a self-interacting freeze-in dark matter via dark photon. In this model, the dark matter obtains the observed relic abundance via Standard Model $γ$ and $Z$ boson induced freeze-in processes, whereas the dark matter force mediator has a neg… ▽ More

    Submitted 8 May, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: 18 pages, 6 figures. A refined version to eliminate errors in the previous version, with new model realization and numerical analysis

  40. arXiv:2311.07972  [pdf, other

    stat.ME

    Residual Importance Weighted Transfer Learning For High-dimensional Linear Regression

    Authors: Junlong Zhao, Shengbin Zheng, Chenlei Leng

    Abstract: Transfer learning is an emerging paradigm for leveraging multiple sources to improve the statistical inference on a single target. In this paper, we propose a novel approach named residual importance weighted transfer learning (RIW-TL) for high-dimensional linear models built on penalized likelihood. Compared to existing methods such as Trans-Lasso that selects sources in an all-in-all-out manner,… ▽ More

    Submitted 3 January, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  41. arXiv:2311.07877  [pdf, other

    cs.CV

    Test-Time Training for Semantic Segmentation with Output Contrastive Loss

    Authors: Yunlong Zhang, Yuxuan Sun, Sunyi Zheng, Zhongyi Shui, Chenglu Zhu, Lin Yang

    Abstract: Although deep learning-based segmentation models have achieved impressive performance on public benchmarks, generalizing well to unseen environments remains a major challenge. To improve the model's generalization ability to the new domain during evaluation, the test-time training (TTT) is a challenging paradigm that adapts the source-pretrained model in an online fashion. Early efforts on TTT mai… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  42. arXiv:2311.07125  [pdf, other

    cs.CV

    Attention-Challenging Multiple Instance Learning for Whole Slide Image Classification

    Authors: Yunlong Zhang, Honglin Li, Yuxuan Sun, Sunyi Zheng, Chenglu Zhu, Lin Yang

    Abstract: In the application of Multiple Instance Learning (MIL) methods for Whole Slide Image (WSI) classification, attention mechanisms often focus on a subset of discriminative instances, which are closely linked to overfitting. To mitigate overfitting, we present Attention-Challenging MIL (ACMIL). ACMIL combines two techniques based on separate analyses for attention value concentration. Firstly, UMAP o… ▽ More

    Submitted 28 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Under review

  43. arXiv:2311.06330  [pdf, other

    cs.AI cs.CE cs.CL cs.MA econ.GN

    Smart Agent-Based Modeling: On the Use of Large Language Models in Computer Simulations

    Authors: Zengqing Wu, Run Peng, Xu Han, Shuyuan Zheng, Yixin Zhang, Chuan Xiao

    Abstract: Computer simulations offer a robust toolset for exploring complex systems across various disciplines. A particularly impactful approach within this realm is Agent-Based Modeling (ABM), which harnesses the interactions of individual agents to emulate intricate system dynamics. ABM's strength lies in its bottom-up methodology, illuminating emergent phenomena by modeling the behaviors of individual c… ▽ More

    Submitted 14 December, 2023; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: Source codes are available at https://github.com/Roihn/SABM

  44. arXiv:2311.04534  [pdf, other

    cs.CL cs.SD eess.AS

    Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token-based ASR

    Authors: Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Shiliang Zhang, Chong Deng, Yukun Ma, Hai Yu, Jiaqing Liu, Chong Zhang

    Abstract: Recently, unified speech-text models, such as SpeechGPT, VioLA, and AudioPaLM, have achieved remarkable performance on various speech tasks. These models discretize speech signals into tokens (speech discretization) and use a shared vocabulary for both text and speech tokens. Then they train a single decoder-only Transformer on a mixture of speech tasks. However, these models rely on the Loss Mask… ▽ More

    Submitted 4 February, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 5 pages, accepted by ICASSP 2024

  45. arXiv:2311.03761  [pdf, other

    cs.LG cs.AI eess.SP

    Augmenting Radio Signals with Wavelet Transform for Deep Learning-Based Modulation Recognition

    Authors: Tao Chen, Shilian Zheng, Kunfeng Qiu, Luxin Zhang, Qi Xuan, Xiaoniu Yang

    Abstract: The use of deep learning for radio modulation recognition has become prevalent in recent years. This approach automatically extracts high-dimensional features from large datasets, facilitating the accurate classification of modulation schemes. However, in real-world scenarios, it may not be feasible to gather sufficient training data in advance. Data augmentation is a method used to increase the d… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  46. arXiv:2311.03004  [pdf, other

    cs.IT physics.app-ph

    Breaking the Degrees-of-Freedom Limit of Holographic MIMO Communications: A 3-D Antenna Array Topology

    Authors: Shuai S. A. Yuan, Jie Wu, Hong**g Xu, Tengjiao Wang, Da Li, Xiaoming Chen, Chongwen Huang, Sheng Sun, Shilie Zheng, Xianmin Zhang, Er-** Li, Wei E. I. Sha

    Abstract: The performance of holographic multiple-input multiple-output (MIMO) communications, employing two-dimensional (2-D) planar antenna arrays, is typically compromised by finite degrees-of-freedom (DOF) stemming from limited array size. The DOF constraint becomes significant when the element spacing approaches approximately half a wavelength, thereby restricting the overall performance of MIMO system… ▽ More

    Submitted 27 February, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Journal ref: IEEE Transactions on Vehicular Technology, Volume 73 , Issue 8, 2024

  47. arXiv:2311.01721  [pdf, ps, other

    astro-ph.GA astro-ph.SR

    Tentative detection of cyanoformamide NCCONH2 in space

    Authors: Juan Li, Donghui Quan, Junzhi Wang, Xia Zhang, Xing Lu, Qian Gou, Feng Gao, Yajun Wu, Edwin Bergin, Shanghuo Li, Zhiqiang Shen, Fujun Du, Meng Li, Siqi Zheng, Xingwu Zheng

    Abstract: The peptide-like molecules, cyanoformamide (NCCONH2), is the cyano (CN) derivative of formamide (NH2CHO). It is known to play a role in the synthesis of nucleic acid precursors under prebiotic conditions. In this paper, we present a tentative detection of NCCONH2 in the interstellar medium (ISM) with the Atacama Large Millimeter/submillimeter Array (ALMA) archive data. Ten unblended lines of NCCON… ▽ More

    Submitted 15 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 20 pages, 6 figures, 2 tables, accepted by PASJ

  48. arXiv:2311.00660  [pdf, other

    cs.CV

    TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain

    Authors: Shen Zheng, Changjie Lu, Srinivasa G. Narasimhan

    Abstract: Rain generation algorithms have the potential to improve the generalization of deraining methods and scene understanding in rainy conditions. However, in practice, they produce artifacts and distortions and struggle to control the amount of rain generated due to a lack of proper constraints. In this paper, we propose an unpaired image-to-image translation framework for generating realistic rainy i… ▽ More

    Submitted 7 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: WACV 2024

  49. arXiv:2310.20161  [pdf, ps, other

    astro-ph.GA astro-ph.SR

    Sulphur isotopes toward Sagittarius B2 extended envelope in the Galactic Center

    Authors: Qingxu Li, Juan Li, Siqi Zheng, Junzhi Wang, Feng Gao, Yajun Wu

    Abstract: The isotopic ratios are good tools for probing the stellar nucleosynthesis and chemical evolution. We performed high-sensitivity map** observations of the J=7-6 rotational transitions of OCS, OC34S, O13CS, and OC33S toward the Galactic Center giant molecular cloud, Sagittarius B2 (Sgr B2) with IRAM 30m telescope. Positions with optically thin and uncontaminated lines are chosen to determine the… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 20 pages, 7 figures, accepted by PASJ

  50. arXiv:2310.19102  [pdf, other

    cs.LG

    Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

    Authors: Yilong Zhao, Chien-Yu Lin, Kan Zhu, Zihao Ye, Lequn Chen, Size Zheng, Luis Ceze, Arvind Krishnamurthy, Tianqi Chen, Baris Kasikci

    Abstract: The growing demand for Large Language Models (LLMs) in applications such as content generation, intelligent chatbots, and sentiment analysis poses considerable challenges for LLM service providers. To efficiently use GPU resources and boost throughput, batching multiple requests has emerged as a popular paradigm; to further speed up batching, LLM quantization techniques reduce memory consumption a… ▽ More

    Submitted 16 April, 2024; v1 submitted 29 October, 2023; originally announced October 2023.