Showing 101–150 of 1,131 results for author: Zhou, M

  1. arXiv:2312.02237  [pdf, other]

    cs.CV

    Singular Regularization with Information Bottleneck Improves Model's Adversarial Robustness

    Authors: Guanlin Li, Naishan Zheng, Man Zhou, Jie Zhang, Tianwei Zhang

    Abstract: Adversarial examples are one of the most severe threats to deep learning models. Numerous works have been proposed to study and defend adversarial examples. However, these works lack analysis of adversarial information or perturbation, which cannot reveal the mystery of adversarial examples and lose proper interpretation. In this paper, we aim to fill this gap by studying adversarial information a…

    Submitted 4 December, 2023; originally announced December 2023.

  2. arXiv:2312.01408  [pdf, other]

    cs.CV

    Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts

    Authors: Tianqi Chen, Yongfei Liu, Zhendong Wang, Jianbo Yuan, Quanzeng You, Hongxia Yang, Mingyuan Zhou

    Abstract: In light of the remarkable success of in-context learning in large language models, its potential extension to the vision domain, particularly with visual foundation models like Stable Diffusion, has sparked considerable interest. Existing approaches in visual in-context learning frequently face hurdles such as expensive pretraining, limiting frameworks, inadequate visual comprehension, and limite…

    Submitted 3 December, 2023; originally announced December 2023.

  3. arXiv:2312.00951  [pdf, other]

    cs.RO eess.SY

    AV4EV: Open-Source Modular Autonomous Electric Vehicle Platform for Making Mobility Research Accessible

    Authors: Zhijie Qiao, Mingyan Zhou, Zhijun Zhuang, Tejas Agarwal, Felix Jahncke, Po-Jen Wang, Jason Friedman, Hongyi Lai, Divyanshu Sahu, Tomáš Nagy, Martin Endler, Jason Schlessman, Rahul Mangharam

    Abstract: When academic researchers develop and validate autonomous driving algorithms, there is a challenge in balancing high-performance capabilities with the cost and complexity of the vehicle platform. Much of today's research on autonomous vehicles (AV) is limited to experimentation on expensive commercial vehicles that require large skilled teams to retrofit the vehicles and test them in dedicated fac…

    Submitted 12 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 pages, 5 figures

  4. arXiv:2311.18303  [pdf, other]

    cs.CV

    OmniMotionGPT: Animal Motion Generation with Limited Data

    Authors: Zhangsihao Yang, Mingyuan Zhou, Mengyi Shan, Bingbing Wen, Ziwei Xuan, Mitch Hill, Junjie Bai, Guo-Jun Qi, Yalin Wang

    Abstract: Our paper aims to generate diverse and realistic animal motion sequences from textual descriptions, without a large-scale animal text-motion dataset. While the task of text-driven human motion synthesis is already extensively studied and benchmarked, it remains challenging to transfer this success to other skeleton structures with limited data. In this work, we design a model architecture that imi…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: The project page is at https://zshyang.github.io/omgpt-website/

  5. arXiv:2311.17950  [pdf, other]

    cs.CV cs.AI

    Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching

    Authors: Shitong Shao, Zeyuan Yin, Muxin Zhou, Xindong Zhang, Zhiqiang Shen

    Abstract: The lightweight "local-match-global" matching introduced by SRe2L successfully creates a distilled dataset with comprehensive information on the full 224x224 ImageNet-1k. However, this one-sided approach is limited to a particular backbone, layer, and statistics, which limits the improvement of the generalization of a distilled dataset. We suggest that sufficient and various "local-match-global" m…

    Submitted 16 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted by CVPR2024

  6. arXiv:2311.16293  [pdf, other]

    cs.AR cs.CR

    FHEmem: A Processing In-Memory Accelerator for Fully Homomorphic Encryption

    Authors: Minxuan Zhou, Yujin Nam, Pranav Gangwar, Weihong Xu, Arpan Dutta, Kartikeyan Subramanyam, Chris Wilkerson, Rosario Cammarota, Saransh Gupta, Tajana Rosing

    Abstract: Fully Homomorphic Encryption (FHE) is a technique that allows arbitrary computations to be performed on encrypted data without the need for decryption, making it ideal for securing many emerging applications. However, FHE computation is significantly slower than computation on plain data due to the increase in data size after encryption. Processing In-Memory (PIM) is a promising technology that ca…

    Submitted 27 November, 2023; originally announced November 2023.

  7. arXiv:2311.14940  [pdf, other]

    astro-ph.CO astro-ph.GA astro-ph.IM

    Reconstruction of Cosmological Initial Density Field with Observations from the Epoch of Reionization

    Authors: Meng Zhou, Yi Mao

    Abstract: Initial density distribution provides a basis for understanding the complete evolution of cosmological density fluctuations. While reconstruction in our local Universe exploits the observations of galaxy surveys with large volumes, observations of high-redshift galaxies are performed with a small field of view and therefore can hardly be used for reconstruction. Here we propose to reconstruct the…

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: 12 pages, 8 figures, 2 tables

  8. arXiv:2311.12866  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Modular Blended Attention Network for Video Question Answering

    Authors: Mingjie Zhou

    Abstract: In multimodal machine learning tasks, it is due to the complexity of the assignments that the network structure, in most cases, is assembled in a sophisticated way. The holistic architecture can be separated into several logical parts according to the respective ends that the modules are devised to achieve. As the number of modalities of information representation increases, constructing ad hoc su…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: I will not add others' names since this work has not been published

  9. arXiv:2311.10996  [pdf, other]

    cs.LG

    BrainZ-BP: A Non-invasive Cuff-less Blood Pressure Estimation Approach Leveraging Brain Bio-impedance and Electrocardiogram

    Authors: Bufang Yang, Le Liu, Wenxuan Wu, Mengliang Zhou, Hongxing Liu, Xinbao Ning

    Abstract: Accurate and continuous blood pressure (BP) monitoring is essential to the early prevention of cardiovascular diseases. Non-invasive and cuff-less BP estimation algorithm has gained much attention in recent years. Previous studies have demonstrated that brain bio-impedance (BIOZ) is a promising technique for non-invasive intracranial pressure (ICP) monitoring. Clinically, treatment for patients wi…

    Submitted 23 November, 2023; v1 submitted 18 November, 2023; originally announced November 2023.

  10. arXiv:2311.06475  [pdf, ps, other]

    math.AP

    A Counterexample for the Principal Eigenvalue of An Elliptic Operator with Large Advection

    Authors: Xueli Bai, Xin Xu, Kexin Zhang, Maolin Zhou

    Abstract: There are numerous studies focusing on the convergence of the principal eigenvalue $λ(s)$ as $s\to+\infty$ corresponding to the elliptic eigenvalue problem \begin{align*} -Δ\varphi(x)-2s\mathbf{v}\cdot\nabla\varphi(x)+c(x)\varphi(x)=λ(s)\varphi(x),\quad x\in Ω, \end{align*} where $Ω$ is a bounded domain and the advection term $\mathbf{v}$ under some certain restrictions. In this paper, w…

    Submitted 10 November, 2023; originally announced November 2023.

  11. arXiv:2311.06177  [pdf, ps, other]

    nucl-th

    Quantum Fluctuations Driving the Generation and Strong Correlations of Fission Fragment Angular Momenta

    Authors: M. H. Zhou, S. Y. Chen, Z. Y. Li, M. S. Smith, Z. P. Li

    Abstract: Two critical issues in the study of the fission mechanism are how the fission fragment angular momenta (FFAM) develop dynamically from equilibrium and how they are correlated with each other. To this end, we construct a time-dependent generator coordinate method that incorporates crucial quantum fluctuations -- multiple rotations, vibrations, and their couplings -- based on covariant density funct…

    Submitted 10 March, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

  12. arXiv:2311.05152  [pdf, other]

    cs.LG cs.AI cs.CV cs.MM

    Cross-modal Prompts: Adapting Large Pre-trained Models for Audio-Visual Downstream Tasks

    Authors: Haoyi Duan, Yan Xia, Mingze Zhou, Li Tang, Jieming Zhu, Zhou Zhao

    Abstract: In recent years, the deployment of large-scale pre-trained models in audio-visual downstream tasks has yielded remarkable outcomes. However, these models, primarily trained on single-modality unconstrained datasets, still encounter challenges in feature extraction for multi-modal tasks, leading to suboptimal performance. This limitation arises due to the introduction of irrelevant modality-specifi…

    Submitted 20 December, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS 2023

  13. arXiv:2311.03557  [pdf, other]

    cs.LG cs.CV eess.IV

    Spatio-Temporal Similarity Measure based Multi-Task Learning for Predicting Alzheimer's Disease Progression using MRI Data

    Authors: Xulong Wang, Yu Zhang, Menghui Zhou, Tong Liu, Jun Qi, Po Yang

    Abstract: Identifying and utilising various biomarkers for tracking Alzheimer's disease (AD) progression have received many recent attentions and enable helping clinicians make the prompt decisions. Traditional progression models focus on extracting morphological biomarkers in regions of interest (ROIs) from MRI/PET images, such as regional average cortical thickness and regional volume. They are effective…

    Submitted 6 November, 2023; originally announced November 2023.

  14. arXiv:2311.03120  [pdf, other]

    astro-ph.SR astro-ph.EP

    Near-Infrared Ca II Triplet As An Stellar Activity Indicator: Library and Comparative Study

    Authors: Xin Huang, Yu-JI He, ZhongRui Bai, Hailong Yuan, MingKuan Yang, Ming Zhou, Yiqiao Dong, Mengxin Wang, Han He, Jinghua Zhang, Yao-Quan Chu, Yongheng Zhao, Yong Zhang, Haotong Zhang

    Abstract: We have established and released a new stellar index library of the Ca II Triplet, which serves as an indicator for characterizing the chromospheric activity of stars. The library is based on data from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) Low-Resolution Spectroscopic Survey (LRS) Data Release 9 (DR9). To better reflect the chromospheric activity of stars, we have…

    Submitted 7 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 15 pages, 13 figures, 5 tables, submitted to ApJS

  15. arXiv:2311.02691  [pdf, ps, other]

    cs.IT eess.SP

    Age of Information Analysis for CR-NOMA Aided Uplink Systems with Randomly Arrived Packets

    Authors: Yanshi Sun, Yanglin Ye, Zhiguo Ding, Momiao Zhou, Lei Liu

    Abstract: This paper studies the application of cognitive radio inspired non-orthogonal multiple access (CR-NOMA) to reduce age of information (AoI) for uplink transmission. In particular, a time division multiple access (TDMA) based legacy network is considered, where each user is allocated with a dedicated time slot to transmit its status update information. The CR-NOMA is implemented as an add-on to the…

    Submitted 5 November, 2023; originally announced November 2023.

  16. arXiv:2311.02356  [pdf, other]

    cs.LG

    MATA*: Combining Learnable Node Matching with A* Algorithm for Approximate Graph Edit Distance Computation

    Authors: Junfeng Liu, Min Zhou, Shuai Ma, Lujia Pan

    Abstract: Graph Edit Distance (GED) is a general and domain-agnostic metric to measure graph similarity, widely used in graph search or retrieving tasks. However, the exact GED computation is known to be NP-complete. For instance, the widely used A* algorithms explore the entire search space to find the optimal solution which inevitably suffers scalability issues. Learning-based methods apply graph represen…

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: Accepted by CIKM23

  17. arXiv:2310.19596  [pdf, other]

    cs.CL cs.AI

    LLMaAA: Making Large Language Models as Active Annotators

    Authors: Ruoyu Zhang, Yanzeng Li, Yongliang Ma, Ming Zhou, Lei Zou

    Abstract: Prevalent supervised learning methods in natural language processing (NLP) are notoriously data-hungry, which demand large amounts of high-quality annotated data. In practice, acquiring such data is a costly endeavor. Recently, the superior few-shot performance of large language models (LLMs) has propelled the development of dataset generation, where the training data are solely synthesized from L…

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023 camera ready

  18. arXiv:2310.17082  [pdf, ps, other]

    astro-ph.HE

    Does or did the supernova remnant Cassiopeia A operate as a PeVatron?

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE;…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 11 pages, 3 figures, Accepted by the APJL

  19. arXiv:2310.13266  [pdf, ps, other]

    cs.IT

    Measurement-Based Small-Scale Channel Model for Sub-6 GHz RIS-Assisted Communications

    Authors: Jian Sang, Jifeng Lan, Mingyong Zhou, Boning Gao, Wankai Tang, Xiao Li, Michail Matthaiou, Shi Jin, Marco Di Renzo

    Abstract: Reconfigurable intelligent surfaces (RISs) have attracted increasing interest from both academia and industry, thanks to their unique features on controlling electromagnetic (EM) waves. Although theoretical models for RIS-empowered communications have covered a variety of applications, yet, very few papers have investigated the modeling of real propagation characteristics. In this paper, we fill t…

    Submitted 4 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  20. Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A

    Authors: Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900s after the t…

    Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 49 pages, 11 figures

    Journal ref: Science Advances, 9, eadj2778 (2023) 15 November 2023

  21. arXiv:2310.08774  [pdf, other]

    q-bio.PE cs.LG stat.ML

    PhyloGFN: Phylogenetic inference with generative flow networks

    Authors: Mingyang Zhou, Zichao Yan, Elliot Layne, Nikolay Malkin, Dinghuai Zhang, Moksh Jain, Mathieu Blanchette, Yoshua Bengio

    Abstract: Phylogenetics is a branch of computational biology that studies the evolutionary relationships among biological entities. Its long history and numerous applications notwithstanding, inference of phylogenetic trees from sequence data remains challenging: the high complexity of tree space poses a significant obstacle for the current combinatorial and probabilistic techniques. In this paper, we adopt…

    Submitted 24 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  22. arXiv:2310.08442  [pdf, other]

    cs.CV cs.AI

    Debias the Training of Diffusion Models

    Authors: Hu Yu, Li Shen, Jie Huang, Man Zhou, Hongsheng Li, Feng Zhao

    Abstract: Diffusion models have demonstrated compelling generation quality by optimizing the variational lower bound through a simple denoising score matching loss. In this paper, we provide theoretical evidence that the prevailing practice of using a constant loss weight strategy in diffusion models leads to biased estimation during the training phase. Simply optimizing the denoising network to predict Gau…

    Submitted 3 November, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: University of Science and Technology of China, Alibaba Group, The Chinese University of Hong Kong

  23. arXiv:2310.07453  [pdf, ps, other]

    hep-ph

    Two-Loop QCD Corrections to C even Bottomonium Exclusive Decays to Double $J/ψ$

    Authors: Yu-Dong Zhang, Xiao-Wei Bai, Feng Feng, Wen-Long Sang, Ming-Zhen Zhou

    Abstract: In the framework of nonrelativistic QCD (NRQCD) factorization, we compute both the polarized and the unpolarized decay widths for the processes $η_b(χ_{bJ})\to J/ψJ/ψ$, accurate up to next-to-next-to-leading-order (NNLO) in $α_s$. For the first time, we confirm that the NRQCD factorization does hold at NNLO for the process involving triple quarkonia. We find the radiative corrections are considera…

    Submitted 12 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 17 pages, 3 figures, 4 tables; The affiliation and acknowledgement of the first author are updated

  24. arXiv:2310.06389  [pdf, other]

    cs.CV stat.ML

    Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling

    Authors: Huangjie Zheng, Zhendong Wang, Jianbo Yuan, Guanghan Ning, Pengcheng He, Quanzeng You, Hongxia Yang, Mingyuan Zhou

    Abstract: Diffusion models excel at generating photo-realistic images but come with significant computational costs in both training and sampling. While various techniques address these computational challenges, a less-explored issue is designing an efficient and adaptable network backbone for iterative refinement. Current options like U-Net and Vision Transformer often rely on resource-intensive deep netwo…

    Submitted 27 June, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  25. arXiv:2310.04722  [pdf, other]

    cs.SD cs.AI eess.AS

    A Holistic Evaluation of Piano Sound Quality

    Authors: Monan Zhou, Shangda Wu, Shaohua Ji, Zijin Li, Wei Li

    Abstract: This paper aims to develop a holistic evaluation method for piano sound quality to assist in purchasing decisions. Unlike previous studies that focused on the effect of piano performance techniques on sound quality, this study evaluates the inherent sound quality of different pianos. To derive quality evaluation systems, the study uses subjective questionnaires based on a piano sound quality datas…

    Submitted 7 October, 2023; originally announced October 2023.

  26. arXiv:2310.01251  [pdf, other]

    cs.CV cs.LG

    Generating 3D Brain Tumor Regions in MRI using Vector-Quantization Generative Adversarial Networks

    Authors: Meng Zhou, Matthias W Wagner, Uri Tabori, Cynthia Hawkins, Birgit B Ertl-Wagner, Farzad Khalvati

    Abstract: Medical image analysis has significantly benefited from advancements in deep learning, particularly in the application of Generative Adversarial Networks (GANs) for generating realistic and diverse images that can augment training datasets. However, the effectiveness of such approaches is often limited by the amount of available data in clinical settings. Additionally, the common GAN-based approac…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: Preprint, In Submission

  27. arXiv:2309.17315  [pdf, other]

    eess.SY

    Data-Driven Newton Raphson Controller Based on Koopman Operator Theory

    Authors: Mi Zhou

    Abstract: Newton-Raphson controller is a powerful prediction-based variable gain integral controller. Basically, the classical model-based Newton-Raphson controller requires two elements: the prediction of the system output and the derivative of the predicted output with respect to the control input. In real applications, the model may not be known and it is infeasible to predict the system sometime ahead a…

    Submitted 29 September, 2023; originally announced September 2023.

  28. arXiv:2309.16834  [pdf, other]

    eess.SY

    Energy Optimal Control of a Harmonic Oscillator with a State Inequality Constraint

    Authors: Mi Zhou, Erik I Verriest, Chaouki Abdallah

    Abstract: In this article, the optimal control problem for a harmonic oscillator with an inequality constraint is considered. The applied energy of the oscillator during a fixed final time period is used as the performance criterion. The analytical solution with both small and large terminal time is found for a special case when the undriven oscillator system is initially at rest. For other initial states o…

    Submitted 28 September, 2023; originally announced September 2023.

  29. arXiv:2309.15776  [pdf, other]

    cs.IT

    Time-Domain Channel Measurements and Small-Scale Fading Characterization for RIS-Assisted Wireless Communication Systems

    Authors: Yanqing Ren, Mingyong Zhou, Xiaokun Teng, Shengguo Meng, Wankai Tang, Xiao Li, Shi Jin, Michail Matthaiou

    Abstract: As a potentially revolutionary enabling technology for the sixth generation (6G) mobile communication system, reconfigurable intelligent surfaces (RISs) have attracted extensive attention from industry and academia. In RIS-assisted wireless communication systems, practical channel measurements and modeling serve as the foundation for system design, network optimization, and performance evaluation.…

    Submitted 27 September, 2023; originally announced September 2023.

  30. Interactive Content Diversity and User Exploration in Online Movie Recommenders: A Field Experiment

    Authors: Ruixuan Sun, Avinash Akella, Ruoyan Kong, Moyan Zhou, Joseph A. Konstan

    Abstract: Recommender systems often struggle to strike a balance between matching users' tastes and providing unexpected recommendations. When recommendations are too narrow and fail to cover the full range of users' preferences, the system is perceived as useless. Conversely, when the system suggests too many items that users don't like, it is considered impersonal or ineffective. To better understand user…

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: International Journal of Human Computer Interaction

  31. arXiv:2309.13259  [pdf, other]

    cs.IR cs.AI cs.SD eess.AS

    WikiMT++ Dataset Card

    Authors: Monan Zhou, Shangda Wu, Yuan Wang, Wei Li

    Abstract: WikiMT++ is an expanded and refined version of WikiMusicText (WikiMT), featuring 1010 curated lead sheets in ABC notation. To expand application scenarios of WikiMT, we add both objective (album, lyrics, video) and subjective emotion (12 emotion adjectives) and emo\_4q (Russell 4Q) attributes, enhancing its usability for music information retrieval, conditional music generation, automatic composit…

    Submitted 23 September, 2023; originally announced September 2023.

  32. arXiv:2309.08733  [pdf, other]

    cs.RO

    Optimal path planning of multi-agent cooperative systems with rigid formation

    Authors: Ananda Rangan Narayanan, Mi Zhou, Erik Verriest

    Abstract: In this article, we consider the path-planning problem of a cooperative homogeneous robotic system with rigid formation. An optimal controller is designed for each agent in such rigid systems based on Pontryagin's minimum principle theory. We found that the optimal control for each agent is equivalent to the optimal control for the Center of Mass (CoM). This equivalence is then proved by using som…

    Submitted 15 September, 2023; originally announced September 2023.

  33. arXiv:2309.07867  [pdf, other]

    cs.LG cs.AI stat.CO stat.ME stat.ML

    Beta Diffusion

    Authors: Mingyuan Zhou, Tianqi Chen, Zhendong Wang, Huangjie Zheng

    Abstract: We introduce beta diffusion, a novel generative modeling method that integrates demasking and denoising to generate data within bounded ranges. Using scaled and shifted beta distributions, beta diffusion utilizes multiplicative transitions over time to create both forward and reverse diffusion processes, maintaining beta distributions in both the forward marginals and the reverse conditionals, giv…

    Submitted 24 December, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: NeurIPS 2023

  34. arXiv:2309.06006  [pdf, ps, other]

    cs.CV cs.AI

    SoccerNet 2023 Challenges Results

    Authors: Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim, et al. (77 additional authors not shown)

    Abstract: The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first theme, broadcast video understanding, is composed of three high-level tasks related to describing events occurring in the video broadcasts: (1) action spotting, fo…

    Submitted 12 September, 2023; originally announced September 2023.

  35. arXiv:2309.05201  [pdf, other]

    cs.CL

    Two is Better Than One: Answering Complex Questions by Multiple Knowledge Sources with Generalized Links

    Authors: Minhao Zhang, Yongliang Ma, Yanzeng Li, Ruoyu Zhang, Lei Zou, Ming Zhou

    Abstract: Incorporating multiple knowledge sources is proven to be beneficial for answering complex factoid questions. To utilize multiple knowledge bases (KB), previous works merge all KBs into a single graph via entity alignment and reduce the problem to question-answering (QA) over the fused KB. In reality, various link relations between KBs might be adopted in QA over multi-KBs. In addition to the ident…

    Submitted 10 September, 2023; originally announced September 2023.

  36. arXiv:2309.04867  [pdf, other]

    cs.DM math.PR

    Finite-sample analysis of rotation operator under $l_2$ norm and $l_\infty$ norm

    Authors: Mi Zhou

    Abstract: In this article, we consider a special operator called the two-dimensional rotation operator and analyze its convergence and finite-sample bounds under the $l_2$ norm and $l_\infty$ norm with constant step size. We then consider the same problem with stochastic noise with affine variance. Furthermore, simulations are provided to illustrate our results. Finally, we conclude this article by proposin…

    Submitted 9 September, 2023; originally announced September 2023.

  37. arXiv:2309.03461  [pdf, other]

    nucl-th

    Covariant density functional theory for nuclear fission based on two-center harmonic oscillator basis

    Authors: Zeyu Li, Shengyuan Chen, Minghui Zhou, Yongjing Chen, Zhipan Li

    Abstract: Nowadays, modern microscopic approaches for fission are generally based on the framework of nuclear density functional theory (DFT), which has enabled a self-consistent treatment of both static and dynamic aspects of fission. The key issue is a DFT solver with high precision and efficiency especially for the large elongated configurations. Purpose: To develop a DFT solver with high precision and e…

    Submitted 6 September, 2023; originally announced September 2023.

  38. arXiv:2309.01958  [pdf, other]

    cs.CV eess.IV

    Empowering Low-Light Image Enhancer through Customized Learnable Priors

    Authors: Naishan Zheng, Man Zhou, Yanmeng Dong, Xiangyu Rui, Jie Huang, Chongyi Li, Feng Zhao

    Abstract: Deep neural networks have achieved remarkable progress in enhancing low-light images by improving their brightness and eliminating noise. However, most existing methods construct end-to-end mapping networks heuristically, neglecting the intrinsic prior of image enhancement task and lacking transparency and interpretability. Although some unfolding solutions have been proposed to relieve these issu…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV 2023

  39. arXiv:2308.16083  [pdf, other]

    cs.CV eess.IV

    Learned Image Reasoning Prior Penetrates Deep Unfolding Network for Panchromatic and Multi-Spectral Image Fusion

    Authors: Man Zhou, Jie Huang, Naishan Zheng, Chongyi Li

    Abstract: The success of deep neural networks for pan-sharpening is commonly in a form of black box, lacking transparency and interpretability. To alleviate this issue, we propose a novel model-driven deep unfolding framework with image reasoning prior tailored for the pan-sharpening task. Different from existing unfolding solutions that deliver the proximal operator networks as the uncertain and vague prio…

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 10 pages; Accepted by ICCV 2023

  40. arXiv:2308.13783  [pdf, other]

    cs.CV

    Generalized Lightness Adaptation with Channel Selective Normalization

    Authors: Mingde Yao, Jie Huang, Xin Jin, Ruikang Xu, Shenglong Zhou, Man Zhou, Zhiwei Xiong

    Abstract: Lightness adaptation is vital to the success of image processing to avoid unexpected visual deterioration, which covers multiple aspects, e.g., low-light image enhancement, image retouching, and inverse tone mapping. Existing methods typically work well on their trained lightness conditions but perform poorly in unknown ones due to their limited generalization ability. To address this limitation,…

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023. Code: https://github.com/mdyao/CSNorm/

  41. arXiv:2308.13484  [pdf]

    physics.app-ph cond-mat.mes-hall

    Ultra-clean assembly of van der Waals heterostructures

    Authors: Wendong Wang, Nicholas Clark, Matthew Hamer, Amy Carl, Endre Tovari, Sam Sullivan-Allsop, Evan Tillotson, Yunze Gao, Hugo de Latour, Francisco Selles, James Howarth, Eli G. Castanon, Mingwei Zhou, Haoyu Bai, Xiao Li, Astrid Weston, Kenji Watanabe, Takashi Taniguchi, Cecilia Mattevi, Thomas H. Bointon, Paul V. Wiper, Andrew J. Strudwick, Leonid A. Ponomarenko, Andrey Kretinin, Sarah J. Haigh, et al. (2 additional authors not shown)

    Abstract: Layer-by-layer assembly of van der Waals (vdW) heterostructures underpins new discoveries in solid state physics, material science and chemistry. Despite the successes, all current 2D material (2DM) transfer techniques rely on the use of polymers which limit the cleanliness, ultimate electronic performance, and potential for optoelectronic applications of the heterostructures. In this article, we…

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 23 pages, 4 figures

    Journal ref: Nature Electronics, 2023

  42. arXiv:2308.09038  [pdf, other

    cs.SE

    Personalized First Issue Recommender for Newcomers in Open Source Projects

    Authors: Wenxin Xiao, Jingyue Li, Hao He, Ruiqiao Qiu, Minghui Zhou

    Abstract: Many open source projects provide good first issues (GFIs) to attract and retain newcomers. Although several automated GFI recommenders have been proposed, existing recommenders are limited to recommending generic GFIs without considering differences between individual newcomers. However, we observe mismatches between generic GFIs and the diverse background of newcomers, resulting in failed attemp…

    Submitted 26 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: The 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)

  43. arXiv:2308.08823  [pdf, other

    cs.LG

    Mitigating Semantic Confusion from Hostile Neighborhood for Graph Active Learning

    Authors: Tianmeng Yang, Min Zhou, Yujing Wang, Zhengjie Lin, Lujia Pan, Bin Cui, Yunhai Tong

    Abstract: Graph Active Learning (GAL), which aims to find the most informative nodes in graphs for annotation to maximize the performance of Graph Neural Networks (GNNs), has attracted many research efforts but still poses non-trivial challenges. One major challenge is that existing GAL strategies may introduce semantic confusion to the selected training set, particularly when graphs are noisy. Specifically, most…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted by CIKM 2023

  44. Growth of millimeter-sized high-quality CuFeSe$_2$ single crystals by the molten salt method and study of their semiconducting behavior

    Authors: Mingwei Ma, Binbin Ruan, Menghu Zhou, Yadong Gu, Qingxin Dong, Qingsong Yang, Qiaoyu Wang, Lewei Chen, Yunqing Shi, Junkun Yi, Genfu Chen, Zhian Ren

    Abstract: A eutectic AlCl$_3$/KCl molten salt method in a horizontal configuration was employed to grow millimeter-sized and compositionally homogeneous CuFeSe$_2$ single crystals due to the continuous growth process in a temperature-gradient-induced solution convection. The typical as-grown CuFeSe$_2$ single crystals in cubic forms are nearly 1.6$\times$1.2$\times$1.0 mm$^3$ in size. The chemical composition and…

    Submitted 16 August, 2023; originally announced August 2023.

    Journal ref: Journal of Crystal Growth (2023)

  45. arXiv:2308.06005  [pdf, other

    cs.SE

    How Early Participation Determines Long-Term Sustained Activity in GitHub Projects?

    Authors: Wenxin Xiao, Hao He, Weiwei Xu, Yuxia Zhang, Minghui Zhou

    Abstract: Although the open source model bears many advantages in software development, open source projects are always hard to sustain. Previous research on open source sustainability mainly focuses on projects that have already reached a certain level of maturity (e.g., with communities, releases, and downstream projects). However, limited attention is paid to the development of (sustainable) open source…

    Submitted 28 September, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: The 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023)

  46. arXiv:2308.05993  [pdf, other

    cs.CV cs.RO

    Image-based Geolocalization by Ground-to-2.5D Map Matching

    Authors: Mengjie Zhou, Liu Liu, Yiran Zhong, Andrew Calway

    Abstract: We study the image-based geolocalization problem, aiming to localize ground-view query images on cartographic maps. Current methods often utilize cross-view localization techniques to match ground-view query images with 2D maps. However, the performance of these methods is unsatisfactory due to significant cross-view appearance differences. In this paper, we lift cross-view matching to a 2.5D spac…

    Submitted 3 November, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

  47. arXiv:2308.05942  [pdf, other

    cs.SE

    Understanding and Remediating Open-Source License Incompatibilities in the PyPI Ecosystem

    Authors: Weiwei Xu, Hao He, Kai Gao, Minghui Zhou

    Abstract: The reuse and distribution of open-source software must be in compliance with its accompanying open-source license. In modern packaging ecosystems, maintaining such compliance is challenging because a package may have a complex multi-layered dependency graph with many packages, any of which may have an incompatible license. Although prior research finds that license incompatibilities are prevalent…

    Submitted 11 August, 2023; originally announced August 2023.

  48. TextPainter: Multimodal Text Image Generation with Visual-harmony and Text-comprehension for Poster Design

    Authors: Yifan Gao, Jinpeng Lin, Min Zhou, Chuanbin Liu, Hongtao Xie, Tiezheng Ge, Yuning Jiang

    Abstract: Text design is one of the most critical procedures in poster design, as it relies heavily on the creativity and expertise of humans to design text images considering the visual harmony and text-semantic. This study introduces TextPainter, a novel multimodal approach that leverages contextual visual information and corresponding text semantics to generate text images. Specifically, TextPainter take…

    Submitted 12 August, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM MM 2023. Dataset Link: https://tianchi.aliyun.com/dataset/160034

  49. arXiv:2308.03002  [pdf, ps, other

    quant-ph

    The effect of Quantum Statistics on the sensitivity in an SU(1,1) interferometer

    Authors: Jie Zeng, Yingxing Ding, Mengyao Zhou, Gao-Feng Jiao, Keye Zhang, L. Q. Chen, Weiping Zhang, Chun-Hua Yuan

    Abstract: We theoretically study the effect of quantum statistics of the light field on the quantum enhancement of parameter estimation based on cat-state input to the SU(1,1) interferometer. The phase sensitivity is dependent on the relative phase $θ$ between two coherent states of Schrödinger cat states. The optimal sensitivity is achieved when the relative phase is $π$, i.e., odd coherent states input. Fo…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 figures. arXiv admin note: text overlap with arXiv:2302.09823

  50. arXiv:2308.01924  [pdf, other

    astro-ph.HE astro-ph.CO hep-ph physics.plasm-ph

    Magnetogenesis in a collisionless plasma: from Weibel instability to turbulent dynamo

    Authors: Muni Zhou, Vladimir Zhdankin, Matthew W. Kunz, Nuno F. Loureiro, Dmitri A. Uzdensky

    Abstract: We report on a first-principles numerical and theoretical study of plasma dynamo in a fully kinetic framework. By applying an external mechanical force to an initially unmagnetized plasma, we develop a self-consistent treatment of the generation of ``seed'' magnetic fields, the formation of turbulence, and the inductive amplification of fields by the fluctuation dynamo. Driven large-scale motions…

    Submitted 28 July, 2023; originally announced August 2023.

    Comments: 16 pages, 10 figures