Skip to main content

Showing 151–200 of 797 results for author: Cai, J

.
  1. arXiv:2306.03372  [pdf, ps, other

    stat.ML cs.LG

    Online Tensor Learning: Computational and Statistical Trade-offs, Adaptivity and Optimal Regret

    Authors: Jian-Feng Cai, **gyang Li, Dong Xia

    Abstract: We investigate a generalized framework for estimating latent low-rank tensors in an online setting, encompassing both linear and generalized linear models. This framework offers a flexible approach for handling continuous or categorical variables. Additionally, we investigate two specific applications: online tensor completion and online binary tensor learning. To address these challenges, we prop… ▽ More

    Submitted 10 July, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

  2. arXiv:2306.01864  [pdf, other

    cs.LG cs.SD eess.AS

    Discovering COVID-19 Coughing and Breathing Patterns from Unlabeled Data Using Contrastive Learning with Varying Pre-Training Domains

    Authors: **** Cai, Sudip Vhaduri, Xiao Luo

    Abstract: Rapid discovery of new diseases, such as COVID-19 can enable a timely epidemic response, preventing the large-scale spread and protecting public health. However, limited research efforts have been taken on this problem. In this paper, we propose a contrastive learning-based modeling approach for COVID-19 coughing and breathing pattern discovery from non-COVID coughs. To validate our models, extens… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted by Proceedings of INTERSPEECH 2023

    Journal ref: Proceedings of INTERSPEECH 2023

  3. arXiv:2305.17030  [pdf, other

    astro-ph.HE hep-ph

    The First LHAASO Catalog of Gamma-Ray Sources

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022.… ▽ More

    Submitted 27 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 40 pages, 13 figures, 4 tables

    Journal ref: The Astrophysical Journal Supplement Series, 271 (2024) 25

  4. arXiv:2305.16248  [pdf, other

    cs.HC cs.MM cs.SI

    Hate Raids on Twitch: Understanding Real-Time Human-Bot Coordinated Attacks in Live Streaming Communities

    Authors: Jie Cai, Sagnik Chowdhury, Hongyang Zhou, Donghee Yvette Wohn

    Abstract: Online harassment and content moderation have been well-documented in online communities. However, new contexts and systems always bring new ways of harassment and need new moderation mechanisms. This study focuses on hate raids, a form of group attack in real-time in live streaming communities. Through a qualitative analysis of hate raids discussion in the Twitch subreddit (r/Twitch), we found th… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted at CSCW2023

  5. arXiv:2305.12793  [pdf, other

    eess.AS cs.CL cs.MM cs.SD

    Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training

    Authors: Jianfeng He, Julian Salazar, Kaisheng Yao, Haoqi Li, **glun Cai

    Abstract: End-to-end (E2E) spoken language understanding (SLU) is constrained by the cost of collecting speech-semantics pairs, especially when label domains change. Hence, we explore \textit{zero-shot} E2E SLU, which learns E2E SLU without speech-semantics pairs, instead using only speech-text and text-semantics pairs. Previous work achieved zero-shot by pseudolabeling all speech-text transcripts with a na… ▽ More

    Submitted 2 February, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 18 pages, 7 figures

  6. arXiv:2305.07677  [pdf, other

    cs.SD cs.CL cs.LG

    Masked Audio Text Encoders are Effective Multi-Modal Rescorers

    Authors: **glun Cai, Monica Sunkara, Xilai Li, Anshu Bhatia, Xiao Pan, Sravan Bodapati

    Abstract: Masked Language Models (MLMs) have proven to be effective for second-pass rescoring in Automatic Speech Recognition (ASR) systems. In this work, we propose Masked Audio Text Encoder (MATE), a multi-modal masked language model rescorer which incorporates acoustic representations into the input space of MLM. We adopt contrastive learning for effectively aligning the modalities by learning shared rep… ▽ More

    Submitted 24 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  7. arXiv:2305.07039  [pdf, other

    cs.LG cs.AI

    Value Iteration Networks with Gated Summarization Module

    Authors: **yu Cai, Jialong Li, Mingyue Zhang, Kenji Tei

    Abstract: In this paper, we address the challenges faced by Value Iteration Networks (VIN) in handling larger input maps and mitigating the impact of accumulated errors caused by increased iterations. We propose a novel approach, Value Iteration Networks with Gated Summarization Module (GS-VIN), which incorporates two main improvements: (1) employing an Adaptive Iteration Strategy in the Value Iteration mod… ▽ More

    Submitted 16 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: 13 pages,6 figures,submitted to IEEE ACCESS

  8. arXiv:2305.06199  [pdf, ps, other

    math.ST cs.IT stat.ME stat.ML

    Computationally Efficient and Statistically Optimal Robust High-Dimensional Linear Regression

    Authors: Yinan Shen, **gyang Li, Jian-Feng Cai, Dong Xia

    Abstract: High-dimensional linear regression under heavy-tailed noise or outlier corruption is challenging, both computationally and statistically. Convex approaches have been proven statistically optimal but suffer from high computational costs, especially since the robust loss functions are usually non-smooth. More recently, computationally fast non-convex approaches via sub-gradient descent are proposed,… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: This manuscript supersedes an earlier one (arXiv:2203.00953). Two manuscripts share around 60% contents. There will be no further update for the earlier manuscript

  9. Measurement of ultra-high-energy diffuse gamma-ray emission of the Galactic plane from 10 TeV to 1 PeV with LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The diffuse Galactic $γ$-ray emission, mainly produced via interactions between cosmic rays and the interstellar medium and/or radiation field, is a very important probe of the distribution, propagation, and interaction of cosmic rays in the Milky Way. In this work we report the measurements of diffuse $γ$-rays from the Galactic plane between 10 TeV and 1 PeV energies, with the square kilometer ar… ▽ More

    Submitted 19 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 12 pages, 8 figures, 5 tables; accepted for publication in Physical Review Letters; source mask file provided as ancillary file

    Journal ref: Phys. Rev. Lett. 131, 151001 (2023)

  10. arXiv:2305.03837  [pdf, other

    eess.AS cs.LG cs.SD

    Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation

    Authors: Nilaksh Das, Monica Sunkara, Sravan Bodapati, **glun Cai, Devang Kulshreshtha, Jeff Farris, Katrin Kirchhoff

    Abstract: End-to-end ASR models trained on large amount of data tend to be implicitly biased towards language semantics of the training data. Internal language model estimation (ILME) has been proposed to mitigate this bias for autoregressive models such as attention-based encoder-decoder and RNN-T. Typically, ILME is performed by modularizing the acoustic and language components of the model architecture,… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted to ICASSP 2023

  11. arXiv:2305.03688  [pdf, other

    cs.CL

    DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition

    Authors: Zeqi Tan, Shen Huang, Zixia Jia, Jiong Cai, Yinghui Li, Weiming Lu, Yueting Zhuang, Kewei Tu, Pengjun Xie, Fei Huang, Yong Jiang

    Abstract: The MultiCoNER \RNum{2} shared task aims to tackle multilingual named entity recognition (NER) in fine-grained and noisy scenarios, and it inherits the semantic ambiguity and low-context setting of the MultiCoNER \RNum{1} task. To cope with these problems, the previous top systems in the MultiCoNER \RNum{1} either incorporate the knowledge bases or gazetteers. However, they still suffer from insuf… ▽ More

    Submitted 16 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted to SemEval 2023, winners for 9 out of 13 tracks, performance beyond ChatGPT

  12. arXiv:2305.02543  [pdf, other

    math.OC

    A Preconditioned Riemannian Gradient Descent Algorithm for Low-Rank Matrix Recovery

    Authors: Fengmiao Bian, Jian-Feng Cai, Rui Zhang

    Abstract: The low-rank matrix recovery problem often arises in various fields, including signal processing, machine learning, and imaging science. The Riemannian gradient descent (RGD) algorithm has proven to be an efficient algorithm for solving this problem. In this paper, we present a preconditioned Riemannian gradient descent (PRGD) for low-rank matrix recovery. The preconditioner, noted for its simplic… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  13. arXiv:2304.12294  [pdf, other

    cs.CV

    Explicit Correspondence Matching for Generalizable Neural Radiance Fields

    Authors: Yuedong Chen, Haofei Xu, Qianyi Wu, Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai

    Abstract: We present a new generalizable NeRF method that is able to directly generalize to new unseen scenarios and perform novel view synthesis with as few as two source views. The key to our approach lies in the explicitly modeled correspondence matching information, so as to provide the geometry prior to the prediction of NeRF color and density for volume rendering. The explicit correspondence matching… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: Code and pre-trained models: https://github.com/donydchen/matchnerf Project Page: https://donydchen.github.io/matchnerf/

  14. arXiv:2304.12184  [pdf, other

    eess.SP cs.AI cs.IT cs.LG

    Active RIS-aided EH-NOMA Networks: A Deep Reinforcement Learning Approach

    Authors: Zhaoyuan Shi, Huabing Lu, Xianzhong Xie, Helin Yang, Chongwen Huang, Jun Cai, Zhiguo Ding

    Abstract: An active reconfigurable intelligent surface (RIS)-aided multi-user downlink communication system is investigated, where non-orthogonal multiple access (NOMA) is employed to improve spectral efficiency, and the active RIS is powered by energy harvesting (EH). The problem of joint control of the RIS's amplification matrix and phase shift matrix is formulated to maximize the communication success ra… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  15. Engineering artificial atomic systems of giant electric dipole moment

    Authors: Baiyi Yu, Yaoming Chu, Ralf Betzholz, Shaoliang Zhang, Jianming Cai

    Abstract: The electric dipole moment (EDM) plays a crucial role in determining the interaction strength of an atom with electric fields, making it paramount to quantum technologies based on coherent atomic control. We propose a scheme for engineering the potential in a Paul trap to realize a two-level quantum system with a giant EDM formed by the motional states of a trapped electron. We show that, under re… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 7 pages, 4 5 figures + 26 pages Supplemental Material. Comments are welcome

    Journal ref: Phys. Rev. Lett. 132, 073202 (2024)

  16. Accelerated quantum control in a three-level system by jum** along the geodesics

    Authors: Musang Gong, Min Yu, Ralf Betzholz, Yaoming Chu, Pengcheng Yang, Zhenyu Wang, Jianming Cai

    Abstract: In a solid-state spin system, we experimentally demonstrate a protocol for quantum-state population transfer with an improved efficiency compared to traditional stimulated Raman adiabatic passage (STIRAP). Using the ground-state triplet of the nitrogen-vacancy center in diamond, we show that the required evolution time for high-fidelity state transfer can be reduced by almost one order of magnitud… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 8 pages, 6 figures

  17. arXiv:2304.10547  [pdf, ps, other

    cs.AI cs.HC

    The Design Space of Generative Models

    Authors: Meredith Ringel Morris, Carrie J. Cai, Jess Holbrook, Chinmay Kulkarni, Michael Terry

    Abstract: Card et al.'s classic paper "The Design Space of Input Devices" established the value of design spaces as a tool for HCI analysis and invention. We posit that develo** design spaces for emerging pre-trained, generative AI models is necessary for supporting their integration into human-centered systems and practices. We explore what it means to develop an AI model design space by proposing two de… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Journal ref: NeurIps 2022 Human-Centered AI Workshop

  18. arXiv:2304.08470  [pdf

    cond-mat.mes-hall cond-mat.str-el

    Signatures of Fractional Quantum Anomalous Hall States in Twisted MoTe2 Bilayer

    Authors: Jiaqi Cai, Eric Anderson, Chong Wang, Xiaowei Zhang, Xiaoyu Liu, William Holtzmann, Yinong Zhang, Fengren Fan, Takashi Taniguchi, Kenji Watanabe, Ying Ran, Ting Cao, Liang Fu, Di Xiao, Wang Yao, Xiaodong Xu

    Abstract: The interplay between spontaneous symmetry breaking and topology can result in exotic quantum states of matter. A celebrated example is the quantum anomalous Hall (QAH) state, which exhibits an integer quantum Hall effect at zero magnetic field thanks to its intrinsic ferromagnetism. In the presence of strong electron-electron interactions, exotic fractional-QAH (FQAH) states at zero magnetic fiel… ▽ More

    Submitted 18 April, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 15 pages, 4 figures, v2: extended data (6 figures) is added. Comments are welcome

    Journal ref: Nature (2023)

  19. arXiv:2304.07454  [pdf, other

    cs.HC cs.NI

    Realizing Immersive Communications in Human Digital Twin by Edge Computing Empowered Tactile Internet: Visions and Case Study

    Authors: Hao Xiang, Changyan Yi, Kun Wu, Jiayuan Chen, Jun Cai, Dusit Niyato, Xuemin, Shen

    Abstract: Human digital twin (HDT) is expected to revolutionize the future human lifestyle and prompts the development of advanced human-centric applications (e.g., Metaverse) by bridging physical and virtual spaces. However, the fulfillment of HDT poses stringent demands on the pervasive connectivity, real-time feedback, multi-modal data transmission and ultra-high reliability, which urge the need of enabl… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

  20. arXiv:2304.06870  [pdf, other

    cs.CV

    AutoSplice: A Text-prompt Manipulated Image Dataset for Media Forensics

    Authors: Shan Jia, Mingzhen Huang, Zhou Zhou, Yan Ju, Jialing Cai, Siwei Lyu

    Abstract: Recent advancements in language-image models have led to the development of highly realistic images that can be generated from textual descriptions. However, the increased visual quality of these generated images poses a potential threat to the field of media forensics. This paper aims to investigate the level of challenge that language-image generation models pose to media forensics. To achieve t… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

  21. arXiv:2304.03442  [pdf, other

    cs.HC cs.AI cs.LG

    Generative Agents: Interactive Simulacra of Human Behavior

    Authors: Joon Sung Park, Joseph C. O'Brien, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein

    Abstract: Believable proxies of human behavior can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication to prototy** tools. In this paper, we introduce generative agents--computational software agents that simulate believable human behavior. Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; t… ▽ More

    Submitted 5 August, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  22. arXiv:2304.02253  [pdf, other

    cs.RO

    Flipbot: Learning Continuous Paper Flip** via Coarse-to-Fine Exteroceptive-Proprioceptive Exploration

    Authors: Chao Zhao, Chunli Jiang, Junhao Cai, Michael Yu Wang, Hongyu Yu, Qifeng Chen

    Abstract: This paper tackles the task of singulating and gras** paper-like deformable objects. We refer to such tasks as paper-flip**. In contrast to manipulating deformable objects that lack compression strength (such as shirts and ropes), minor variations in the physical properties of the paper-like deformable objects significantly impact the results, making manipulation highly challenging. Here, we p… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to International Conference on Robotics and Automation (ICRA) 2023

  23. Learn to Grasp via Intention Discovery and its Application to Challenging Clutter

    Authors: Chao Zhao, Chunli Jiang, Junhao Cai, Hongyu Yu, Michael Yu Wang, Qifeng Chen

    Abstract: Humans excel in gras** objects through diverse and robust policies, many of which are so probabilistically rare that exploration-based learning methods hardly observe and learn. Inspired by the human learning process, we propose a method to extract and exploit latent intents from demonstrations, and then learn diverse and robust gras** policies through self-exploration. The resulting policy ca… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE Robotics and Automation Letters (RA-L)

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 2, pp. 488-495, Feb. 2023

  24. arXiv:2304.02251  [pdf, other

    cs.RO

    ERRA: An Embodied Representation and Reasoning Architecture for Long-horizon Language-conditioned Manipulation Tasks

    Authors: Chao Zhao, Shuai Yuan, Chunli Jiang, Junhao Cai, Hongyu Yu, Michael Yu Wang, Qifeng Chen

    Abstract: This letter introduces ERRA, an embodied learning architecture that enables robots to jointly obtain three fundamental capabilities (reasoning, planning, and interaction) for solving long-horizon language-conditioned manipulation tasks. ERRA is based on tightly-coupled probabilistic inferences at two granularity levels. Coarse-resolution inference is formulated as sequence generation through a lar… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE Robotics and Automation Letters (RA-L)

  25. arXiv:2303.17038  [pdf

    cond-mat.mes-hall cond-mat.str-el

    Programming Correlated Magnetic States via Gate Controlled Moiré Geometry

    Authors: Eric Anderson, Feng-Ren Fan, Jiaqi Cai, William Holtzmann, Takashi Taniguchi, Kenji Watanabe, Di Xiao, Wang Yao, Xiaodong Xu

    Abstract: Understanding quantum many-body systems is at the heart of condensed matter physics. The ability to control the underlying lattice geometry of a system, and thus its many-body interactions, would enable the realization of and transition between emergent quantum ground states. Here, we report in-situ gate switching between honeycomb and triangular lattice geometries of an electron many-body Hamilto… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: 13 pages, 4 figures, plus supplementary material

  26. arXiv:2303.16705  [pdf, ps, other

    cs.CC

    Planar 3-way Edge Perfect Matching Leads to A Holant Dichotomy

    Authors: **-Yi Cai, Austen Z. Fan

    Abstract: We prove a complexity dichotomy theorem for a class of Holant problems on planar 3-regular bipartite graphs. The complexity dichotomy states that for every weighted constraint function $f$ defining the problem (the weights can even be negative), the problem is either computable in polynomial time if $f$ satisfies a tractability criterion, or \#P-hard otherwise. One particular problem in this probl… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2110.01173

  27. arXiv:2303.11651  [pdf

    physics.chem-ph physics.comp-ph

    AlphaMat: A Material Informatics Hub Connecting Data, Features, Models and Applications

    Authors: Zhilong Wang, Junfei Cai, An Chen, Yanqiang Han, Kehao Tao, Simin Ye, Shiwei Wang, Imran Ali, **** Li

    Abstract: The development of modern civil industry, energy and information technology is inseparable from the rapid explorations of new materials, which are hampered by months to years of painstaking attempts, resulting in only a small fraction of materials being determined in a vast chemical space. Artificial intelligence (AI)-based methods are promising to address this gap, but face many challenges such a… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  28. An in-depth exploration of LAMOST Unknown spectra based on density clustering

    Authors: Haifeng Yang, Xiaona Yin, Jianghui Cai, Yuqing Yang, Ali Luo, Zhongrui Bai, Lichan Zhou, Xujun Zhao, Yaling Xun

    Abstract: LAMOST (Large Sky Area Multi-Object Fiber Spectroscopic Telescope) has completed the observation of nearly 20 million celestial objects, including a class of spectra labeled `Unknown'. Besides low signal-to-noise ratio, these spectra often show some anomalous features that do not work well with current templates. In this paper, a total of 638,000 `Unknown' spectra from LAMOST DR5 are selected, and… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 18 pages, 15 figures

  29. arXiv:2303.08980  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Dynamically tunable moiré Rydberg excitons in a monolayer semiconductor on twisted bilayer graphene

    Authors: Minhao He, Jiaqi Cai, Huiyuan Zheng, Eric Seewald, Takashi Taniguchi, Kenji Watanabe, Jiaqiang Yan, Matthew Yankowitz, Abhay Pasupathy, Wang Yao, Xiaodong Xu

    Abstract: Moiré excitons are emergent optical excitations in 2D semiconductors with deep moiré superlattice potentials. While these excitations have been realized in several platforms, a system with dynamically tunable moiré potential to tailor the moiré exciton properties is yet to be realized. Here, we present a continuously tunable moiré potential in a monolayer WSe2 that is enabled by its proximity to t… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  30. arXiv:2303.08566  [pdf, other

    cs.CV cs.AI cs.LG

    Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning

    Authors: Haoyu He, Jianfei Cai, **g Zhang, Dacheng Tao, Bohan Zhuang

    Abstract: Visual Parameter-Efficient Fine-Tuning (PEFT) has become a powerful alternative for full fine-tuning so as to adapt pre-trained vision models to downstream tasks, which only tunes a small number of parameters while freezing the vast majority ones to ease storage burden and optimization difficulty. However, existing PEFT methods introduce trainable parameters to the same positions across different… ▽ More

    Submitted 31 August, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: ICCV 2023 Oral

  31. arXiv:2303.07984  [pdf, ps, other

    cs.DS math.FA

    Interlacing Polynomial Method for the Column Subset Selection Problem

    Authors: Jian-Feng Cai, Zhiqiang Xu, Zili Xu

    Abstract: This paper investigates the spectral norm version of the column subset selection problem. Given a matrix $\mathbf{A}\in\mathbb{R}^{n\times d}$ and a positive integer $k\leq\text{rank}(\mathbf{A})$, the objective is to select exactly $k$ columns of $\mathbf{A}$ that minimize the spectral norm of the residual matrix after projecting $\mathbf{A}$ onto the space spanned by the selected columns. We use… ▽ More

    Submitted 7 January, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

    MSC Class: 15A60; 90C27

  32. arXiv:2303.05164  [pdf, other

    cs.CV

    Reliability-Adaptive Consistency Regularization for Weakly-Supervised Point Cloud Segmentation

    Authors: Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai

    Abstract: Weakly-supervised point cloud segmentation with extremely limited labels is highly desirable to alleviate the expensive costs of collecting densely annotated 3D points. This paper explores applying the consistency regularization that is commonly used in weakly-supervised learning, for its point cloud counterpart with multiple data-specific augmentations, which has not been well studied. We observe… ▽ More

    Submitted 14 December, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  33. arXiv:2303.04585  [pdf, other

    cs.SD cs.AI eess.AS

    Exploring Efficient-Tuned Learning Audio Representation Method from BriVL

    Authors: Sen Fang, Yangjian Wu, Bowen Gao, **gwen Cai, Teik Toe Teoh

    Abstract: Recently, researchers have gradually realized that in some cases, the self-supervised pre-training on large-scale Internet data is better than that of high-quality/manually labeled data sets, and multimodal/large models are better than single or bimodal/small models. In this paper, we propose a robust audio representation learning method WavBriVL based on Bridging-Vision-and-Language (BriVL). WavB… ▽ More

    Submitted 28 July, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 13 pages, 2023.3 Finished

  34. arXiv:2303.02538  [pdf, other

    cs.GT

    Properties of Position Matrices and Their Elections

    Authors: Niclas Boehmer, **-Yi Cai, Piotr Faliszewski, Austen Z. Fan, Łukasz Janeczko, Andrzej Kaczmarczyk, Tomasz Wąs

    Abstract: We study the properties of elections that have a given position matrix (in such elections each candidate is ranked on each position by a number of voters specified in the matrix). We show that counting elections that generate a given position matrix is #P-complete. Consequently, sampling such elections uniformly at random seems challenging and we propose a simpler algorithm, without hard guarantee… ▽ More

    Submitted 9 March, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: Accepted to AAAI 2023

  35. arXiv:2302.13083  [pdf, other

    cs.LG

    Knowledge Graph Completion with Counterfactual Augmentation

    Authors: Heng Chang, Jie Cai, Jia Li

    Abstract: Graph Neural Networks (GNNs) have demonstrated great success in Knowledge Graph Completion (KGC) by modeling how entities and relations interact in recent years. However, most of them are designed to learn from the observed graph structure, which appears to have imbalanced relation distribution during the training stage. Motivated by the causal relationship among the entities on a knowledge graph,… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: TheWebConf 2023

  36. arXiv:2302.12944  [pdf, other

    cs.CL cs.AI

    Dependency Dialogue Acts -- Annotation Scheme and Case Study

    Authors: Jon Z. Cai, Brendan King, Margaret Perkoff, Shiran Dudy, Jie Cao, Marie Grace, Natalia Wojarnik, Ananya Ganesh, James H. Martin, Martha Palmer, Marilyn Walker, Jeffrey Flanigan

    Abstract: In this paper, we introduce Dependency Dialogue Acts (DDA), a novel framework for capturing the structure of speaker-intentions in multi-party dialogues. DDA combines and adapts features from existing dialogue annotation frameworks, and emphasizes the multi-relational response structure of dialogues in addition to the dialogue acts and rhetorical relations. It represents the functional, discourse,… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: The 13th International Workshop on Spoken Dialogue Systems Technology

    Journal ref: The 13th International Workshop on Spoken Dialogue Systems Technology 2023

  37. The Study of Circumgalactic Medium with Quasar Pairs

    Authors: Zhi-Fu Chen, Huan-Chang Qin, **-Ting Cai, Yu-Tao Zhou, Zhe-Geng Chen, Ting-Ting Pang, Zhi-Wen Wang

    Abstract: We have collected 10025 foreground-background quasar pairs with projected distances $d_p<500$ kpc from the large quasar catalog of the SDSS DR16Q. We investigate the properties of the Mg II absorption lines with $W_r>0.15$ Å around foreground quasars, including both the LOS (line-of-sights of foreground quasars) and transverse (TRA, perpendicular to the LOS) absorptions. Both the equivalent width… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 15 pages, Accepted in ApJS

  38. arXiv:2302.10454  [pdf, other

    cs.CL cs.LG

    KG-ECO: Knowledge Graph Enhanced Entity Correction for Query Rewriting

    Authors: **glun Cai, Mingda Li, Ziyan Jiang, Eunah Cho, Zheng Chen, Yang Liu, Xing Fan, Chenlei Guo

    Abstract: Query Rewriting (QR) plays a critical role in large-scale dialogue systems for reducing frictions. When there is an entity error, it imposes extra challenges for a dialogue system to produce satisfactory responses. In this work, we propose KG-ECO: Knowledge Graph enhanced Entity COrrection for query rewriting, an entity correction system with corrupt entity span detection and entity retrieval/re-r… ▽ More

    Submitted 22 February, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  39. arXiv:2302.09790  [pdf, other

    cs.CV cs.HC cs.LG

    HTNet: Human Topology Aware Network for 3D Human Pose Estimation

    Authors: Jialun Cai, Hong Liu, Runwei Ding, Wenhao Li, Jianbing Wu, Miaoju Ban

    Abstract: 3D human pose estimation errors would propagate along the human body topology and accumulate at the end joints of limbs. Inspired by the backtracking mechanism in automatic control systems, we design an Intra-Part Constraint module that utilizes the parent nodes as the reference to build topological constraints for end joints at the part level. Further considering the hierarchy of the human topolo… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: ICASSP23 Accepted Paper

  40. arXiv:2302.08570  [pdf, ps, other

    cs.CC

    The complexity of counting planar graph homomorphisms of domain size 3

    Authors: **-Yi Cai, Ashwin Maran

    Abstract: We prove a complexity dichotomy theorem for counting planar graph homomorphisms of domain size 3. Given any 3 by 3 real valued symmetric matrix $H$ defining a graph homomorphism from all planar graphs $G \mapsto Z_H(G)$, we completely classify the computational complexity of this problem according to the matrix $H$. We show that for every $H$, the problem is either polynomial time computable or \#… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 32 pages, 2 figures, accepted by STOC 2023

  41. Bayesian-based hybrid method for rapid optimization of NV center sensors

    Authors: Jiazhao Tian, Ressa S. Said, Fedor Jelezko, Jianming Cai, Liantuan Xiao

    Abstract: NV center is one of the most promising platforms in the field of quantum sensing. Magnetometry based on NV center, especially, has achieved a concrete development in regions of biomedicine and medical diagnostics. Improving the sensitivity of NV center sensor under wide inhomogeneous broadening and filed amplitude drift is one crucial issue of continuous concern, which relies on the coherent contr… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Journal ref: Sensors 23, 3244 (2023)

  42. Real-time adaptive sensing of nuclear spins by a single-spin quantum sensor

    Authors: **gcheng Wang, Dongxiao Li, Ralf Betzholz, Jianming Cai

    Abstract: Quantum sensing is considered to be one of the most promising subfields of quantum information to deliver practical quantum advantages in real-world applications. However, its impressive capabilities, including high sensitivity, are often hindered by the limited quantum resources available. Here, we incorporate the expected information gain (EIG) and techniques such as accelerated computation into… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Journal ref: Phys. Rev. Applied 18, 024040 (2022)

  43. Robustness of random-control quantum-state tomography

    Authors: **gcheng Wang, Shaoliang Zhang, Jianming Cai, Zhenyu Liao, Christian Arenz, Ralf Betzholz

    Abstract: In a recently demonstrated quantum-state tomography scheme [Phys. Rev. Lett. 124, 010405 (2020)], a random control field is locally applied to a multipartite system to reconstruct the full quantum state of the system through single-observable measurements. Here, we analyze the robustness of such a tomography scheme against measurement errors. We characterize the sensitivity to measurement errors u… ▽ More

    Submitted 10 August, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 10 pages, 4 figures

    Journal ref: Phys. Rev. A 108, 022408 (2023)

  44. arXiv:2302.06586  [pdf, other

    cs.LG cs.AI cs.CV

    Stitchable Neural Networks

    Authors: Zizheng Pan, Jianfei Cai, Bohan Zhuang

    Abstract: The public model zoo containing enormous powerful pretrained model families (e.g., ResNet/DeiT) has reached an unprecedented scope than ever, which significantly contributes to the success of deep learning. As each model family consists of pretrained models with diverse scales (e.g., DeiT-Ti/S/B), it naturally arises a fundamental question of how to efficiently assemble these readily available mod… ▽ More

    Submitted 28 March, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: CVPR 2023 Highlight; Project is available at https://snnet.github.io/

  45. arXiv:2302.06430  [pdf, other

    cs.LG cs.AI

    Deep Orthogonal Hypersphere Compression for Anomaly Detection

    Authors: Yunhe Zhang, Yan Sun, **yu Cai, Jicong Fan

    Abstract: Many well-known and effective anomaly detection methods assume that a reasonable decision boundary has a hypersphere shape, which however is difficult to obtain in practice and is not sufficiently compact, especially when the data are in high-dimensional spaces. In this paper, we first propose a novel deep anomaly detection model that improves the original hypersphere learning through an orthogona… ▽ More

    Submitted 4 May, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Published in ICLR 2024: https://openreview.net/pdf?id=cJs4oE4m9Q

  46. arXiv:2302.05917  [pdf, other

    cs.LG

    Vector Quantized Wasserstein Auto-Encoder

    Authors: Tung-Long Vuong, Trung Le, He Zhao, Chuanxia Zheng, Mehrtash Harandi, Jianfei Cai, Dinh Phung

    Abstract: Learning deep discrete latent presentations offers a promise of better symbolic and summarized abstractions that are more useful to subsequent downstream tasks. Inspired by the seminal Vector Quantized Variational Auto-Encoder (VQ-VAE), most of work in learning deep discrete representations has mainly focused on improving the original VQ-VAE form and none of them has studied learning deep discrete… ▽ More

    Submitted 17 June, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

  47. arXiv:2302.02369  [pdf, other

    cs.LG cs.AI

    Deep Graph-Level Clustering Using Pseudo-Label-Guided Mutual Information Maximization Network

    Authors: **yu Cai, Yi Han, Wenzhong Guo, Jicong Fan

    Abstract: In this work, we study the problem of partitioning a set of graphs into different groups such that the graphs in the same group are similar while the graphs in different groups are dissimilar. This problem was rarely studied previously, although there have been a lot of work on node clustering and graph classification. The problem is challenging because it is difficult to measure the similarity or… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.

  48. Strong quantum metrological limit from many-body physics

    Authors: Yaoming Chu, Xiangbei Li, Jianming Cai

    Abstract: Surpassing the standard quantum limit and even reaching the Heisenberg limit using quantum entanglement, represents the Holy Grail of quantum metrology. However, quantum entanglement is a valuable resource that does not come without a price. The exceptional time overhead for the preparation of large-scale entangled states raises disconcerting concerns about whether the Heisenberg limit is fundamen… ▽ More

    Submitted 11 April, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: 7 pages, 3 figures + supplementary information (14 pages)

    Journal ref: Phys. Rev. Lett. 130, 170801 (2023)

  49. arXiv:2301.07966  [pdf, ps, other

    cs.LG math.OC

    Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions

    Authors: Junyang Cai, Khai-Nguyen Nguyen, Nishant Shrestha, Aidan Good, Ruisen Tu, Xin Yu, Shandian Zhe, Thiago Serra

    Abstract: One surprising trait of neural networks is the extent to which their connections can be pruned with little to no effect on accuracy. But when we cross a critical level of parameter sparsity, pruning any further leads to a sudden drop in accuracy. This drop plausibly reflects a loss in model complexity, which we aim to avoid. In this work, we explore how sparsity also affects the geometry of the li… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: (Under review)

  50. arXiv:2301.07336  [pdf, other

    cs.CV

    Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic Segmentation

    Authors: Son Duy Dao, Hengcan Shi, Dinh Phung, Jianfei Cai

    Abstract: Recent mask proposal models have significantly improved the performance of zero-shot semantic segmentation. However, the use of a `background' embedding during training in these methods is problematic as the resulting model tends to over-learn and assign all unseen classes as the background class instead of their correct labels. Furthermore, they ignore the semantic relationship of text embeddings… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.