Showing 201–250 of 775 results for author: Huang, R

  1. arXiv:2305.01402  [pdf, other]

    hep-ex physics.ins-det

    The background model of the CUPID-Mo $0νββ$ experiment

    Authors: CUPID-Mo Collaboration: C. Augier, A. S. Barabash, F. Bellini, G. Benato, M. Beretta, L. Bergé, J. Billard, Yu. A. Borovlev, L. Cardani, N. Casali, A. Cazes, E. Celi, M. Chapellier, D. Chiesa, I. Dafinei, F. A. Danevich, M. De Jesus, P. de Marcillac, T. Dixon, L. Dumoulin, K. Eitel, F. Ferri, B. K. Fujikawa, et al. (58 additional authors not shown)

    Abstract: CUPID-Mo, located in the Laboratoire Souterrain de Modane (France), was a demonstrator for the next generation $0νββ$ decay experiment, CUPID. It consisted of an array of 20 enriched Li$_{2}$$^{100}$MoO$_4$ bolometers and 20 Ge light detectors and has demonstrated that the technology of scintillating bolometers with particle identification capabilities is mature. Furthermore, CUPID-Mo can inform…

    Submitted 2 May, 2023; originally announced May 2023.

  2. arXiv:2305.00787  [pdf, other]

    cs.CV

    GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation

    Authors: Zhenhui Ye, Jinzheng He, Ziyue Jiang, Rongjie Huang, Jiawei Huang, Jinglin Liu, Yi Ren, Xiang Yin, Zejun Ma, Zhou Zhao

    Abstract: Generating talking person portraits with arbitrary speech audio is a crucial problem in the field of digital human and metaverse. A modern talking face generation method is expected to achieve the goals of generalized audio-lip synchronization, good video quality, and high system efficiency. Recently, neural radiance field (NeRF) has become a popular rendering technique in this field since it coul…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 18 Pages, 7 figures

  3. $G$-invariant Bergman kernel and geometric quantization on complex manifolds with boundary

    Authors: Chin-Yu Hsiao, Rung-Tzung Huang, Xiaoshan Li, Guokuan Shao

    Abstract: Let $M$ be a complex manifold with boundary $X$, which admits a holomorphic Lie group $G$-action preserving $X$. We establish a full asymptotic expansion for the $G$-invariant Bergman kernel under certain assumptions. As an application, we get $G$-invariant version of Fefferman's result about regularity of biholomorphic maps on strongly pseudoconvex domains of $\mathbb C^n$. Moreover, we show that…

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 36 pages

    Journal ref: Math. Ann. 2024

  4. arXiv:2305.00427  [pdf, other]

    cs.SE

    An overview of Web3.0 Technology: Infrastructure, Applications, and Popularity

    Authors: Renke Huang, Jiachi Chen, Yanlin Wang, Tingting Bi, Zibin Zheng

    Abstract: Web3, the next generation of the Internet, represents a decentralized and democratized web. Although it has garnered significant public interest and found numerous real-world applications, there is a limited understanding of people's perceptions and experiences with Web3. In this study, we conducted an empirical study to investigate the categories of Web3 application and their popularity, as well…

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 25 pages, 5 figures

  5. Lattice dynamics and ferroelectric properties of the nitride perovskite ${\mathrm{LaWN}}_{3}$

    Authors: Yue-Wen Fang, Craig A. J. Fisher, Akihide Kuwabara, Xin-Wei Shen, Takafumi Ogawa, Hiroki Moriwake, Rong Huang, Chun-Gang Duan

    Abstract: Using first-principles calculations we examine the crystal structures and phase transitions of nitride perovskite LaWN$_3$. Lattice dynamics calculations indicate that the ground-state structure belongs to space group $R3c$. Two competitive phase transition pathways are identified which are characterized by symmetry-adapted distortion modes. The results suggest that $R3c$ LaWN$_3$ should be an exc…

    Submitted 28 April, 2023; originally announced May 2023.

    Comments: 13 pages, 8 figures in main text and 5 figures in supplementary

    Journal ref: Phys. Rev. B 95, 014111 (2017)

  6. arXiv:2304.12995  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

    Authors: Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe

    Abstract: Large language models (LLMs) have exhibited remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. Despite the recent success, current LLMs are not capable of processing complex audio information or conducting spoken conversations (like Siri or Alexa). In this work, we propose a multi-modal AI system named AudioGPT, which complements…

    Submitted 25 April, 2023; originally announced April 2023.

  7. arXiv:2304.11053  [pdf, other]

    cs.CL cs.SD eess.AS

    A Comparison of Semi-Supervised Learning Techniques for Streaming ASR at Scale

    Authors: Cal Peyser, Michael Picheny, Kyunghyun Cho, Rohit Prabhavalkar, Ronny Huang, Tara Sainath

    Abstract: Unpaired text and audio injection have emerged as dominant methods for improving ASR performance in the absence of a large labeled corpus. However, little guidance exists on deploying these methods to improve production ASR systems that are trained on very large supervised corpora and with realistic requirements like a constrained model size and CPU budget, streaming capability, and a rich lattice…

    Submitted 19 April, 2023; originally announced April 2023.

    Journal ref: 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  8. arXiv:2304.07999   

    cs.HC cs.AI cs.CV

    Everyone Can Be Picasso? A Computational Framework into the Myth of Human versus AI Painting

    Authors: Yilin Ye, Rong Huang, Kang Zhang, Wei Zeng

    Abstract: The recent advances of AI technology, particularly in AI-Generated Content (AIGC), have enabled everyone to easily generate beautiful paintings with simple text description. With the stunning quality of AI paintings, it is widely questioned whether there still exists difference between human and AI paintings and whether human artists will be replaced by AI. To answer these questions, we develop a…

    Submitted 22 February, 2024; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: The results in Figure 3 in Section 4 have error due to my mistakes in feature calculation. Particularly the error is in the classification accuracy

    ACM Class: I.2.0; J.5; H.5.2

  9. arXiv:2304.07036  [pdf, other]

    eess.IV cs.CV cs.LG

    Hierarchical Agent-based Reinforcement Learning Framework for Automated Quality Assessment of Fetal Ultrasound Video

    Authors: Sijing Liu, Qilong Ying, Shuangchi He, Xin Yang, Dong Ni, Ruobing Huang

    Abstract: Ultrasound is the primary modality to examine fetal growth during pregnancy, while the image quality could be affected by various factors. Quality assessment is essential for controlling the quality of ultrasound images to guarantee both the perceptual and diagnostic values. Existing automated approaches often require heavy structural annotations and the predictions may not necessarily be consiste…

    Submitted 14 April, 2023; originally announced April 2023.

  10. arXiv:2304.06590  [pdf, other]

    quant-ph

    Maximizing temporal quantum correlation by approaching an exceptional point

    Authors: Chun-Wang Wu, Man-Chao Zhang, Yan-Li Zhou, Ting Chen, Ran Huang, Yi Xie, Bao-Quan Ou, Wei Wu, Adam Miranowicz, Jie Zhang, Hui Jing, Ping-Xing Chen

    Abstract: Quantum correlations, both spatial and temporal, are the central pillars of quantum mechanics. Over the last two decades, a big breakthrough in quantum physics is its complex extension to the non-Hermitian realm, and dizzying varieties of novel phenomena and applications beyond the Hermitian framework have been uncovered. However, unique features of non-Hermitian quantum correlations, especially i…

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 4 figures and 8 pages

  11. arXiv:2303.17997  [pdf, other]

    quant-ph

    Switching classical and quantum nonreciprocities with spinning photonics

    Authors: Yonglin Xiang, Yunlan Zuo, Xun-Wei Xu, Ran Huang, Hui Jing

    Abstract: We study how to achieve, manipulate, and switch classical or quantum nonreciprocal effects of light with a spinning Kerr resonator. In particular, we show that even when there is no classical nonreciprocity (i.e., with the same mean number of photons for both clockwise and counterclockwise propagating modes), it is still possible to realize nonreciprocity of quantum correlations of photons in such…

    Submitted 28 August, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

  12. arXiv:2303.17007  [pdf]

    hep-ex hep-ph nucl-th

    Impact of cross-section uncertainties on supernova neutrino spectral parameter fitting in the Deep Underground Neutrino Experiment

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, Z. Ahmad, J. Ahmed, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, P. Amedo, J. Anderson, D. A. Andrade, et al. (1294 additional authors not shown)

    Abstract: A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics…

    Submitted 7 July, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: 25 pages, 21 figures

    Report number: FERMILAB-PUB-23-132-CSAID-LBNF-ND-T

    Journal ref: Phys. Rev. D 107, 112012 (2023)

  13. arXiv:2303.14491  [pdf, other]

    cs.HC

    Is It the End? Guidelines for Cinematic Endings in Data Videos

    Authors: Xian Xu, Aoyu Wu, Leni Yang, Zheng Wei, Rong Huang, David Yip, Huamin Qu

    Abstract: Data videos are becoming increasingly popular in society and academia. Yet little is known about how to create endings that strengthen a lasting impression and persuasion. To fulfill the gap, this work aims to develop guidelines for data video endings by drawing inspiration from cinematic arts. To contextualize cinematic endings in data videos, 111 film endings and 105 data video endings are first…

    Submitted 25 March, 2023; originally announced March 2023.

  14. arXiv:2303.12583  [pdf, other]

    cond-mat.mtrl-sci physics.app-ph

    Enhanced functional reversibility in lead-free ferroelectric material over long cycle pyroelectric energy conversion

    Authors: Chenbo Zhang, Zeyuan Zhu, Ka Hung Chan, Ruhao Huang, Xian Chen

    Abstract: The ferroelectric material usually exhibits temperature dependent spontaneous polarization, known as pyroelectricity, which can be used to directly convert thermal energy to electricity from ambient low-grade waste heat. When utilizing the structural phase transformations of the material, the conversion capability can be magnified, consequently the device performance can be strongly boosted by ord…

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 18 pages, 5 figures, 1 table

  15. arXiv:2303.12270  [pdf, other]

    cs.CV eess.IV

    EBSR: Enhanced Binary Neural Network for Image Super-Resolution

    Authors: Renjie Wei, Shuwen Zhang, Zechun Liu, Meng Li, Yuchen Fan, Runsheng Wang, Ru Huang

    Abstract: While the performance of deep convolutional neural networks for image super-resolution (SR) has improved significantly, the rapid increase of memory and computation requirements hinders their deployment on resource-constrained devices. Quantized networks, especially binary neural networks (BNN) for SR have been proposed to significantly improve the model inference efficiency but suffer from large…

    Submitted 21 March, 2023; originally announced March 2023.

  16. arXiv:2303.10859  [pdf, other]

    cs.LG stat.ML

    Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs

    Authors: Yuan Cheng, Ruiquan Huang, Jing Yang, Yingbin Liang

    Abstract: In reward-free reinforcement learning (RL), an agent explores the environment first without any reward information, in order to achieve certain learning goals afterwards for any given reward. In this paper we focus on reward-free RL under low-rank MDP models, in which both the representation and linear weight vectors are unknown. Although various algorithms have been proposed for reward-free low-r…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted by ICLR 2023

  17. arXiv:2303.05309  [pdf, other]

    cs.CV cs.CL

    MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition

    Authors: Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao

    Abstract: Multi-media communications facilitate global interaction among people. However, despite researchers exploring cross-lingual translation techniques such as machine translation and audio speech translation to overcome language barriers, there is still a shortage of cross-lingual studies on visual speech. This lack of research is mainly due to the absence of datasets containing visual speech and tran…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: https://github.com/Exgc/AVMuST-TED

  18. Qubit Energy Tuner Based on Single Flux Quantum Circuits

    Authors: Xiao Geng, Rutian Huang, Yongcheng He, Kaiyong He, Genting Dai, Liangliang Yang, Xinyu Wu, Qing Yu, Mingjun Cheng, Guodong Chen, Jianshe Liu, Wei Chen

    Abstract: A device called qubit energy tuner (QET) based on single flux quantum (SFQ) circuits is proposed for Z control of superconducting qubits. Created from the improvement of flux digital-to-analog converters (flux DACs), a QET is able to set the energy levels or the frequencies of qubits, especially flux-tunable transmons, and perform gate operations requiring Z control. The circuit structure of QET i…

    Submitted 3 March, 2023; originally announced March 2023.

  19. arXiv:2303.01038  [pdf, other]

    cs.CV cs.AI

    Neural Intrinsic Embedding for Non-rigid Point Cloud Matching

    Authors: Puhua Jiang, Mingze Sun, Ruqi Huang

    Abstract: As a primitive 3D data representation, point clouds are prevailing in 3D sensing, yet short of intrinsic structural information of the underlying objects. Such discrepancy poses great challenges on directly establishing correspondences between point clouds sampled from deformable shapes. In light of this, we propose Neural Intrinsic Embedding (NIE) to embed each vertex into a high-dimensional spac…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: To appear at CVPR 2023

  20. arXiv:2303.00802  [pdf, other]

    cs.CL cs.SD eess.AS

    Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition

    Authors: Philipp Klumpp, Pooja Chitkara, Leda Sarı, Prashant Serai, Jilong Wu, Irina-Elena Veliche, Rongqing Huang, Qing He

    Abstract: The awareness for biased ASR datasets or models has increased notably in recent years. Even for English, despite a vast amount of available training data, systems perform worse for non-native speakers. In this work, we improve an accent-conversion model (ACM) which transforms native US-English speech into accented pronunciation. We include phonetic knowledge in the ACM training to provide accurate…

    Submitted 1 March, 2023; originally announced March 2023.

  21. Demonstrating the wormhole mechanism of the entanglement spectrum via a perturbed boundary

    Authors: Zenan Liu, Rui-Zhen Huang, Zheng Yan, Dao-Xin Yao

    Abstract: The Li-Haldane conjecture is one of the most famous conjectures in physics and opens a new research area in the quantum entanglement and topological phase. Although a lot of theoretical and numerical works have confirmed the conjecture in topological states with bulk-boundary correspondence, the cases with gapped boundary and the systems in high dimension are widely unknown. What is the valid scop…

    Submitted 9 April, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 11 pages, 11 figures

    Journal ref: Phys. Rev. B 109, 094416 (2024)

  22. arXiv:2302.14177  [pdf, other]

    cs.DL cs.SE

    Soft-Search: Two Datasets to Study the Identification and Production of Research Software

    Authors: Eva Maxfield Brown, Lindsey Schwartz, Richard Lewei Huang, Nicholas Weber

    Abstract: Software is an important tool for scholarly work, but software produced for research is in many cases not easily identifiable or discoverable. A potential first step in linking research and software is software identification. In this paper we present two datasets to study the identification and production of research software. The first dataset contains almost 1000 human labeled annotations of so…

    Submitted 27 February, 2023; originally announced February 2023.

  23. arXiv:2302.13737  [pdf, ps, other]

    cs.DS

    On Coresets for Clustering in Small Dimensional Euclidean Spaces

    Authors: Lingxiao Huang, Ruiyuan Huang, Zengfeng Huang, Xuan Wu

    Abstract: We consider the problem of constructing small coresets for $k$-Median in Euclidean spaces. Given a large set of data points $P\subset \mathbb{R}^d$, a coreset is a much smaller set $S\subset \mathbb{R}^d$, so that the $k$-Median costs of any $k$ centers w.r.t. $P$ and $S$ are close. Existing literature mainly focuses on the high-dimension case and there has been great success in obtaining dimensio…

    Submitted 27 February, 2023; originally announced February 2023.

  24. arXiv:2302.12471  [pdf, other]

    physics.app-ph cond-mat.mes-hall

    Cubic singularities in binary linear electromechanical oscillators

    Authors: Xin Zhou, Hui Jing, Xingjing Ren, Jianqi Zhang, Ran Huang, Zhipeng Li, Xiaopeng Sun, Xuezhong Wu, Cheng-Wei Qiu, Franco Nori, Dingbang Xiao

    Abstract: Singularities arise in diverse disciplines and play a key role in both exploring fundamental laws of physics and making highly-sensitive sensors. Higher-order (>3) singularities, with further improved performance, however, usually require exquisite tuning of multiple (>3) coupled degrees of freedom or nonlinear control, thus severely limiting their applications in practice. Here we propose theoret…

    Submitted 24 February, 2023; originally announced February 2023.

  25. arXiv:2302.10463  [pdf, other]

    cs.RO cs.CV cs.LG

    Multimodal Trajectory Prediction: A Survey

    Authors: Renhao Huang, Hao Xue, Maurice Pagnucco, Flora Salim, Yang Song

    Abstract: Trajectory prediction is an important task to support safe and intelligent behaviours in autonomous systems. Many advanced approaches have been proposed over the years with improved spatial and temporal feature extraction. However, human behaviour is naturally multimodal and uncertain: given the past trajectory and surrounding environment information, an agent can have multiple plausible trajector…

    Submitted 21 February, 2023; originally announced February 2023.

  26. arXiv:2301.13662  [pdf, other]

    cs.SD eess.AS

    InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt

    Authors: Dongchao Yang, Songxiang Liu, Rongjie Huang, Chao Weng, Helen Meng

    Abstract: Expressive text-to-speech (TTS) aims to synthesize different speaking style speech according to human's demands. Nowadays, there are two common ways to control speaking styles: (1) Pre-defining a group of speaking style and using categorical index to denote different speaking style. However, there are limitations in the diversity of expressiveness, as these models can only generate the pre-defined…

    Submitted 25 June, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: Submit to TASLP

  27. arXiv:2301.12661  [pdf, other]

    cs.SD cs.LG cs.MM eess.AS

    Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models

    Authors: Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao

    Abstract: Large-scale multimodal generative modeling has created milestones in text-to-image and text-to-video generation. Its application to audio still lags behind for two main reasons: the lack of large-scale datasets with high-quality text-audio pairs, and the complexity of modeling long continuous audio data. In this work, we propose Make-An-Audio with a prompt-enhanced diffusion model that addresses t…

    Submitted 29 January, 2023; originally announced January 2023.

    Comments: Audio samples are available at https://Text-to-Audio.github.io

  28. arXiv:2301.12520  [pdf, other]

    cs.CL

    Producing Usable Taxonomies Cheaply and Rapidly at Pinterest Using Discovered Dynamic $μ$-Topics

    Authors: Abhijit Mahabal, Jiyun Luo, Rui Huang, Michael Ellsworth, Rui Li

    Abstract: Creating a taxonomy of interests is expensive and human-effort intensive: not only do we need to identify nodes and interconnect them, in order to use the taxonomy, we must also connect the nodes to relevant entities such as users, pins, and queries. Connecting to entities is challenging because of ambiguities inherent to language but also because individual interests are dynamic and evolve. Here,…

    Submitted 29 January, 2023; originally announced January 2023.

  29. arXiv:2301.07584  [pdf, other]

    cs.CV cs.AI cs.LG

    Joint Representation Learning for Text and 3D Point Cloud

    Authors: Rui Huang, Xuran Pan, Henry Zheng, Haojun Jiang, Zhifeng Xie, Shiji Song, Gao Huang

    Abstract: Recent advancements in vision-language pre-training (e.g. CLIP) have shown that vision models can benefit from language supervision. While many models using language modality have achieved great success on 2D vision tasks, the joint representation learning of 3D point cloud with text remains under-explored due to the difficulty of 3D-Text data pair acquisition and the irregularity of 3D data struc…

    Submitted 18 January, 2023; originally announced January 2023.

  30. arXiv:2301.04327  [pdf, other]

    cs.CL cs.AI

    Dual Learning for Large Vocabulary On-Device ASR

    Authors: Cal Peyser, Ronny Huang, Tara Sainath, Rohit Prabhavalkar, Michael Picheny, Kyunghyun Cho

    Abstract: Dual learning is a paradigm for semi-supervised machine learning that seeks to leverage unsupervised data by solving two opposite tasks at once. In this scheme, each model is used to generate pseudo-labels for unlabeled examples that are used to train the other model. Dual learning has seen some use in speech processing by pairing ASR and TTS as dual tasks. However, these results mostly address on…

    Submitted 11 January, 2023; originally announced January 2023.

  31. arXiv:2301.03398  [pdf, other]

    cs.RO cs.AI

    Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration

    Authors: Chao Yu, Xinyi Yang, Jiaxuan Gao, Jiayu Chen, Yunfei Li, Jijia Liu, Yunfei Xiang, Ruixin Huang, Huazhong Yang, Yi Wu, Yu Wang

    Abstract: We consider the problem of cooperative exploration where multiple robots need to cooperatively explore an unknown region as fast as possible. Multi-agent reinforcement learning (MARL) has recently become a trending paradigm for solving this challenge. However, existing MARL-based methods adopt action-making steps as the metric for exploration efficiency by assuming all the agents are acting in a f…

    Submitted 11 April, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: This paper is accepted by AAMAS 2023. The source code can be found in https://github.com/yang-xy20/async_mappo

  32. arXiv:2301.02814  [pdf, ps, other]

    cs.LG cs.DS

    Randomized Greedy Algorithms and Composable Coreset for k-Center Clustering with Outliers

    Authors: Hu Ding, Ruomin Huang, Kai Liu, Haikuo Yu, Zixiu Wang

    Abstract: In this paper, we study the problem of $k$-center clustering with outliers. The problem has many important applications in real world, but the presence of outliers can significantly increase the computational complexity. Though a number of methods have been developed in the past decades, it is still quite challenging to design quality guaranteed algorithm with low complexity for this problem…

    Submitted 7 January, 2023; originally announced January 2023.

  33. arXiv:2301.02497  [pdf, ps, other]

    math.AT

    Some asymptotic formulae for torsion in homotopy groups

    Authors: Guy Boyde, Ruizhi Huang

    Abstract: Inspired by a remarkable work of Félix, Halperin and Thomas on the asymptotic estimation of the ranks of rational homotopy groups, and more recent works of Wu and the authors on local hyperbolicity, we prove two asymptotic formulae for torsion rank of homotopy groups, one using ordinary homology and one using $K$-theory. We use these to obtain explicit quantitative asymptotic lower bounds on the t…

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: 19 pages; comments are very welcome

    MSC Class: 55Q52; 55Q05 (Primary) 55Q15; 55P40 (Secondary)

  34. Exploring the Intrinsic Scatter of the Star-Forming Galaxy Main Sequence at redshift 0.5 to 3.0

    Authors: Rongjun Huang, Andrew J. Battisti, Kathryn Grasha, Elisabete da Cunha, Claudia del P Lagos, Sarah K. Leslie, Emily Wisnioski

    Abstract: Previous studies have shown that the normalization and scatter of the galaxy 'main sequence' (MS), the relation between star formation rate (SFR) and stellar mass ($M_*$), evolves over cosmic time. However, such studies often rely on photometric redshifts and/or only rest-frame UV to near-IR data, which may underestimate the SFR and $M_*$ uncertainties. We use MAGPHYS+photo-z to fit the UV to radi…

    Submitted 10 January, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: 16 pages, 15 figures, 3 tables. The paper has been accepted in MNRAS on January 3rd, 2023

  35. arXiv:2212.09807  [pdf, other]

    physics.comp-ph physics.ins-det

    Highly-parallelized simulation of a pixelated LArTPC on a GPU

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, Z. Ahmad, J. Ahmed, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, C. Alt, A. Alton, R. Alvarez, P. Amedo, J. Anderson, et al. (1282 additional authors not shown)

    Abstract: The rapid development of general-purpose computing on graphics processing units (GPGPU) is allowing the implementation of highly-parallelized Monte Carlo simulation chains for particle physics experiments. This technique is particularly suitable for the simulation of a pixelated charge readout for time projection chambers, given the large number of channels that this technology employs. Here we pr…

    Submitted 28 February, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 26 pages, 15 figures

    Report number: FERMILAB-PUB-22-926-LBNF

  36. arXiv:2212.08568  [pdf, other]

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,…

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  37. arXiv:2212.07086  [pdf, other]

    cs.CV

    NLIP: Noise-robust Language-Image Pre-training

    Authors: Runhui Huang, Yanxin Long, Jianhua Han, Hang Xu, Xiwen Liang, Chunjing Xu, Xiaodan Liang

    Abstract: Large-scale cross-modal pre-training paradigms have recently shown ubiquitous success on a wide range of downstream tasks, e.g., zero-shot classification, retrieval and image captioning. However, their successes highly rely on the scale and quality of web-crawled data that naturally contain incomplete and noisy information (e.g., wrong or irrelevant content). Existing works either design manual ru…

    Submitted 4 January, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: AAAI 2023

  38. arXiv:2212.05524  [pdf, other]

    stat.ME stat.AP

    Bayesian inference for partial orders from random linear extensions: power relations from 12th Century Royal Acta

    Authors: Geoff K. Nicholls, Jeong Eun Lee, Nicholas Karn, David Johnson, Rukuang Huang, Alexis Muir-Watt

    Abstract: We give a new class of models for time series data in which actors are listed in order of precedence. We model the lists as a realisation of a queue in which queue-position is constrained by an underlying social hierarchy. We model the hierarchy as a partial order so that the lists are random linear extensions. We account for noise via a random queue-jumping process. We give a marginally consisten…

    Submitted 1 August, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

    Comments: 57 pages, 37 figures and 2 tables including appendices

    MSC Class: 62M05 (Primary) 06A06; 62P25 (Secondary)

  39. arXiv:2212.02715  [pdf, other]

    eess.SY cs.AI cs.LG math.OC

    Efficient Learning of Voltage Control Strategies via Model-based Deep Reinforcement Learning

    Authors: Ramij R. Hossain, Tianzhixi Yin, Yan Du, Renke Huang, Jie Tan, Wenhao Yu, Yuan Liu, Qiuhua Huang

    Abstract: This article proposes a model-based deep reinforcement learning (DRL) method to design emergency control strategies for short-term voltage stability problems in power systems. Recent advances show promising results in model-free DRL-based methods for power systems, but model-free methods suffer from poor sample efficiency and training time, both critical for making state-of-the-art DRL algorithms…

    Submitted 5 December, 2022; originally announced December 2022.

  40. arXiv:2211.15432  [pdf, other]

    cs.CL

    E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model

    Authors: W. Ronny Huang, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, David Rybach, Robert David, Rohit Prabhavalkar, Cyril Allauzen, Cal Peyser, Trevor D. Strohman

    Abstract: We explore unifying a neural segmenter with two-pass cascaded encoder ASR into a single model. A key challenge is allowing the segmenter (which runs in real-time, synchronously with the decoder) to finalize the 2nd pass (which runs 900 ms behind real-time) without introducing user-perceived latency or deletion errors during inference. We propose a design where the neural segmenter is integrated wi…

    Submitted 5 March, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: ICASSP 2023

  41. arXiv:2211.15117  [pdf, other]

    physics.optics

    Analysis and design of transition radiation in layered uniaxial crystals using Tandem neural networks

    Authors: Xiaoke Gao, Xiaoyu Zhao, Ruoyu Huang, Siyuan Ma, Xikui Ma, Tianyu Dong

    Abstract: With the flourishing development of nanophotonics, Cherenkov radiation pattern can be designed to achieve superior performance in particle detection by fine-tuning the properties of metamaterials such as photonic crystals (PCs) surrounding the swift particle. However, the radiation pattern can be sensitive to the geometry and material properties of PCs, such as periodicity, unit thickness, and die…

    Submitted 28 November, 2022; originally announced November 2022.

  42. arXiv:2211.14864  [pdf, other]

    cs.CV cs.RO

    A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition

    Authors: Rui Huang, Ze Huang, Songzhi Su

    Abstract: Visual Place Recognition is an essential component of systems for camera localization and loop closure detection, and it has attracted widespread interest in multiple domains such as computer vision, robotics and AR/VR. In this work, we propose a faster, lighter and stronger approach that can generate models with fewer parameters and can spend less time in the inference stage. We designed RepVGG-l…

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: CCF Conference on Computer Supported Cooperative Work and Social Computing (ChineseCSCW)

  43. arXiv:2211.13955  [pdf, other]

    cs.CR cs.LG

    MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention

    Authors: Wenxuan Zeng, Meng Li, Wenjie Xiong, Tong Tong, Wen-jie Lu, Jin Tan, Runsheng Wang, Ru Huang

    Abstract: Secure multi-party computation (MPC) enables computation directly on encrypted data and protects both data and model privacy in deep learning inference. However, existing neural network architectures, including Vision Transformers (ViTs), are not designed or optimized for MPC and incur significant latency overhead. We observe Softmax accounts for the major latency bottleneck due to a high communic…

    Submitted 19 August, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted by ICCV 2023 conference

  44. arXiv:2211.11478  [pdf, other]

    cs.CV cs.AI

    Background-Mixed Augmentation for Weakly Supervised Change Detection

    Authors: Rui Huang, Ruofei Wang, Qing Guo, Jieda Wei, Yuxiang Zhang, Wei Fan, Yang Liu

    Abstract: Change detection (CD) is to decouple object changes (i.e., object missing or appearing) from background changes (i.e., environment variations) like light and season variations in two images captured in the same scene over a long time span, presenting critical applications in disaster management, urban development, etc. In particular, the endless patterns of background changes require detectors to…

    Submitted 19 June, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: AAAI 2023 Accepted

  45. arXiv:2211.11255  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Diffusion Denoising Process for Perceptron Bias in Out-of-distribution Detection

    Authors: Luping Liu, Yi Ren, Xize Cheng, Rongjie Huang, Chongxuan Li, Zhou Zhao

    Abstract: Out-of-distribution (OOD) detection is a crucial task for ensuring the reliability and safety of deep learning. Currently, discriminator models outperform other methods in this regard. However, the feature extraction process used by discriminator models suffers from the loss of critical information, leaving room for bad cases and malicious attacks. In this paper, we introduce a new perceptron bias…

    Submitted 3 June, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  46. arXiv:2211.10666  [pdf, other]

    cs.MM cs.CV cs.SD eess.AS

    VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement

    Authors: Chenye Cui, Yi Ren, Jinglin Liu, Rongjie Huang, Zhou Zhao

    Abstract: Video to sound generation aims to generate realistic and natural sound given a video input. However, previous video-to-sound generation methods can only generate a random or average timbre without any controls or specializations of the generated sound timbre, leading to the problem that people cannot obtain the desired timbre under these methods sometimes. In this paper, we pose the task of genera…

    Submitted 19 November, 2022; originally announced November 2022.

  47. arXiv:2211.09623  [pdf, other]

    cs.CV cs.AI cs.CL

    Cross-Modal Adapter for Text-Video Retrieval

    Authors: Haojun Jiang, Jianke Zhang, Rui Huang, Chunjiang Ge, Zanlin Ni, Jiwen Lu, Jie Zhou, Shiji Song, Gao Huang

    Abstract: Text-video retrieval is an important multi-modal learning task, where the goal is to retrieve the most relevant video for a given text query. Recently, pre-trained models, e.g., CLIP, show great potential on this task. However, as pre-trained models are scaling up, fully fine-tuning them on text-video retrieval datasets has a high risk of overfitting. Moreover, in practice, it would be costly to t…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: Tech Report

  48. arXiv:2211.08743  [pdf, other]

    cs.CV

    Yield Evaluation of Citrus Fruits based on the YoloV5 compressed by Knowledge Distillation

    Authors: Yuqi Li, Yuting He, Yihang Zhou, Zirui Gong, Renjie Huang

    Abstract: In the field of planting fruit trees, pre-harvest estimation of fruit yield is important for fruit storage and price evaluation. However, considering the cost, the yield of each tree cannot be assessed by directly picking the immature fruit. Therefore, the problem is a very difficult task. In this paper, a fruit counting and yield assessment method based on computer vision is proposed for citrus f…

    Submitted 16 November, 2022; originally announced November 2022.

  49. arXiv:2211.06693  [pdf, other]

    math.AP math-ph

    Smoluchowski coagulation equation with velocity dependence

    Authors: Franco Flandoli, Ruojun Huang, Andrea Papini

    Abstract: In the present article we introduce a variant of Smoluchowski's coagulation equation with both position and velocity variables taking a kinetic viewpoint arising as the scaling limit of a system of second-order (microscopic) coagulating particles. We focus on the rigorous study of the PDE system in the spatially-homogeneous case proving existence and uniqueness under different initial conditions i…

    Submitted 12 November, 2022; originally announced November 2022.

    Comments: 38 pages, single column

    MSC Class: 35Q70; 82C22; 40K05

  50. arXiv:2211.03624  [pdf]

    eess.SP cs.AR

    Extremely-Fast, Energy-Efficient Massive MIMO Precoding with Analog RRAM Matrix Computing

    Authors: Pushen Zuo, Zhong Sun, Ru Huang

    Abstract: Signal processing in wireless communications, such as precoding, detection, and channel estimation, are basically about solving inverse matrix problems, which, however, are slow and inefficient in conventional digital computers, thus requiring a radical paradigm shift to achieve fast, real-time solutions. Here, for the first time, we apply the emerging analog matrix computing (AMC) to the linear p…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Submitted to an IEEE journal for possible publication