Showing 51–100 of 568 results for author: Shi, M

  1. arXiv:2311.00354  [pdf, ps, other]

    cs.CR

    Butson Hadamard matrices, bent sequences, and spherical codes

    Authors: Minjia Shi, Danni Lu, Andrés Armario, Ronan Egan, Ferruh Özbudak, Patrick Solé

    Abstract: We explore a notion of bent sequence attached to the data consisting of a Hadamard matrix of order $n$ defined over the complex $q^{th}$ roots of unity, an eigenvalue of that matrix, and a Galois automorphism from the cyclotomic field of order $q.$ In particular we construct self-dual bent sequences for various $q\le 60$ and lengths $n\le 21.$ Computational construction methods comprise the resol…

    Submitted 1 November, 2023; originally announced November 2023.

  2. arXiv:2310.12511  [pdf, ps, other]

    cs.IT

    The weight enumerator polynomials of the lifted codes of the projective Solomon-Stiffler codes

    Authors: Minjia Shi, Shitao Li, Tor Helleseth

    Abstract: Determining the weight distribution of a code is an old and fundamental topic in coding theory that has been thoroughly studied. In 1977, Helleseth, Kløve, and Mykkeltveit presented a weight enumerator polynomial of the lifted code over $\mathbb{F}_{q^\ell}$ of a $q$-ary linear code with significant combinatorial properties, which can determine the support weight distribution of this linear code.…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: This manuscript was first submitted on September 9, 2022

  3. arXiv:2310.09183  [pdf, other]

    cs.LG cs.AI cs.DC

    PRIOR: Personalized Prior for Reactivating the Information Overlooked in Federated Learning

    Authors: Mingjia Shi, Yuhao Zhou, Kai Wang, Huaizheng Zhang, Shudong Huang, Qing Ye, Jiangcheng Lv

    Abstract: Classical federated learning (FL) enables training machine learning models without sharing data for privacy preservation, but heterogeneous data characteristics degrade the performance of the localized model. Personalized FL (PFL) addresses this by synthesizing personalized models from a global model via training on local data. Such a global model may overlook the specific information that the cli…

    Submitted 10 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

    MSC Class: 68T07 ACM Class: I.2.11

  4. arXiv:2310.07355  [pdf, other]

    cs.CV cs.LG

    IMITATE: Clinical Prior Guided Hierarchical Vision-Language Pre-training

    Authors: Che Liu, Sibo Cheng, Miaojing Shi, Anand Shah, Wenjia Bai, Rossella Arcucci

    Abstract: In the field of medical Vision-Language Pre-training (VLP), significant efforts have been devoted to deriving text and image features from both clinical reports and associated medical images. However, most existing methods may have overlooked the opportunity in leveraging the inherent hierarchical structure of clinical reports, which are generally split into `findings' for descriptive content and…

    Submitted 1 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Under Review

  5. arXiv:2310.04863  [pdf, other]

    cs.SD eess.AS

    SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR

    Authors: Yangze Li, Fan Yu, Yuhao Liang, Pengcheng Guo, Mohan Shi, Zhihao Du, Shiliang Zhang, Lei Xie

    Abstract: Joint modeling of multi-speaker ASR and speaker diarization has recently shown promising results in speaker-attributed automatic speech recognition (SA-ASR). Although being able to obtain state-of-the-art (SOTA) performance, most of the studies are based on an autoregressive (AR) decoder which generates tokens one-by-one and results in a large real-time factor (RTF). To speed up inference, we intro…

    Submitted 7 October, 2023; originally announced October 2023.

  6. arXiv:2310.02492  [pdf, other]

    cs.CV

    FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling

    Authors: Yan Luo, Muhammad Osama Khan, Yu Tian, Min Shi, Zehao Dou, Tobias Elze, Yi Fang, Mengyu Wang

    Abstract: Equity in AI for healthcare is crucial due to its direct impact on human well-being. Despite advancements in 2D medical imaging fairness, the fairness of 3D models remains underexplored, hindered by the small sizes of 3D fairness datasets. Since 3D imaging surpasses 2D imaging in SOTA clinical care, it is critical to understand the fairness of these 3D models. To address this research gap, we cond…

    Submitted 12 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

  7. arXiv:2309.17218  [pdf, other]

    cs.CV

    When Epipolar Constraint Meets Non-local Operators in Multi-View Stereo

    Authors: Tianqi Liu, Xinyi Ye, Weiyue Zhao, Zhiyu Pan, Min Shi, Zhiguo Cao

    Abstract: Learning-based multi-view stereo (MVS) methods rely heavily on feature matching, which requires distinctive and descriptive representations. An effective solution is to apply non-local feature aggregation, e.g., Transformer. Albeit useful, these techniques introduce heavy computation overheads for MVS. Each pixel densely attends to the whole image. In contrast, we propose to constrain non-local f…

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: ICCV2023

  8. arXiv:2309.13573  [pdf, other]

    cs.SD eess.AS

    The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR

    Authors: Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu

    Abstract: With the success of the first Multi-channel Multi-party Meeting Transcription challenge (M2MeT), the second M2MeT challenge (M2MeT 2.0) held in ASRU2023 particularly aims to tackle the complex task of speaker-attributed ASR (SA-ASR), which directly addresses the practical and challenging problem of ``who spoke what at when'' in typical meeting scenarios. We particularly established two sub-tr…

    Submitted 5 October, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: 8 pages, Accepted by ASRU2023

  9. arXiv:2309.12003  [pdf, ps, other]

    cs.IT cs.CR

    A quaternary analogue of Tang-Ding codes

    Authors: Minjia Shi, Sihui Tao, Jon-Lark Kim, Patrick Solé

    Abstract: In a recent paper, Tang and Ding introduced a class of binary cyclic codes of rate close to one half with a designed lower bound on their minimum distance. The definition involves the base $2$ expansion of the integers in their defining set. In this paper we propose an analogue for quaternary codes. In addition, the performances of the subfield subcode and of the trace code (two binary cyclic code…

    Submitted 21 September, 2023; originally announced September 2023.

  10. arXiv:2309.06732  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci cond-mat.supr-con

    Fermi Surface Nesting with Heavy Quasiparticles in the Locally Noncentrosymmetric Superconductor CeRh$_2$As$_2$

    Authors: Yi Wu, Yongjun Zhang, Sailong Ju, Yong Hu, Yanen Huang, Yanan Zhang, Huali Zhang, Hao Zheng, Guowei Yang, Evrard-Ouicem Eljaouhari, Baopeng Song, Nicholas C. Plumb, Frank Steglich, Ming Shi, Gertrud Zwicknagl, Chao Cao, Huiqiu Yuan, Yang Liu

    Abstract: The locally noncentrosymmetric heavy fermion superconductor CeRh$_2$As$_2$ has attracted considerable interest due to its rich superconducting phases, accompanied by a quadrupole density wave and pronounced antiferromagnetic excitations. To understand the underlying physics, we here report measurements from high-resolution angle-resolved photoemission. Our results reveal fine splittings of the co…

    Submitted 1 June, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: v1 submitted on Sep 13th 2023

  11. arXiv:2309.06497  [pdf, other]

    cs.LG cs.DC cs.MS math.OC

    A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale

    Authors: Hao-Jun Michael Shi, Tsung-Hsien Lee, Shintaro Iwasaki, Jose Gallego-Posada, Zhijing Li, Kaushik Rangadurai, Dheevatsa Mudigere, Michael Rabbat

    Abstract: Shampoo is an online and stochastic optimization algorithm belonging to the AdaGrad family of methods for training neural networks. It constructs a block-diagonal preconditioner where each block consists of a coarse Kronecker product approximation to full-matrix AdaGrad for each parameter of the neural network. In this work, we provide a complete description of the algorithm as well as the perform…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 38 pages, 8 figures, 5 tables

  12. arXiv:2309.05683  [pdf, other]

    cs.LG cs.AI cs.RO

    EANet: Expert Attention Network for Online Trajectory Prediction

    Authors: Pengfei Yao, Tianlu Mao, Min Shi, Jingkai Sun, Zhaoqi Wang

    Abstract: Trajectory prediction plays a crucial role in autonomous driving. Existing mainstream research and continual learning-based methods all require training on complete datasets, leading to poor prediction accuracy when sudden changes in scenarios occur and failing to promptly respond and update the model. Whether these methods can make a prediction in real-time and use data instances to update the…

    Submitted 11 September, 2023; originally announced September 2023.

  13. arXiv:2309.01515  [pdf, other]

    cs.DC cs.LG

    Federated cINN Clustering for Accurate Clustered Federated Learning

    Authors: Yuhao Zhou, Minjia Shi, Yuxin Tian, Yuanxi Li, Qing Ye, Jiancheng Lv

    Abstract: Federated Learning (FL) presents an innovative approach to privacy-preserving distributed machine learning and enables efficient crowd intelligence on a large scale. However, a significant challenge arises when coordinating FL with crowd intelligence in which diverse client groups possess disparate objectives due to data heterogeneity or distinct tasks. To address this challenge, we propose the Feder…

    Submitted 4 September, 2023; originally announced September 2023.

  14. arXiv:2308.16844  [pdf, other]

    cond-mat.mtrl-sci cond-mat.str-el

    Electronic band reconstruction across the insulator-metal transition in colossal magnetoresistive EuCd2P2

    Authors: Huali Zhang, Feng Du, Xiaoying Zheng, Shuaishuai Luo, Yi Wu, Hao Zheng, Shengtao Cui, Zhe Sun, Zhengtai Liu, Dawei Shen, Michael Smidman, Yu Song, Ming Shi, Zhicheng Zhong, Chao Cao, Huiqiu Yuan, Yang Liu

    Abstract: While colossal magnetoresistance (CMR) in Eu-based compounds is often associated with strong spin-carrier interactions, the underlying reconstruction of the electronic bands is much less understood from spectroscopic experiments. Here using angle-resolved photoemission, we directly observe an electronic band reconstruction across the insulator-metal (and magnetic) transition in the recently discov…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 6 pages, 4 figures

    Journal ref: Phys. Rev. B 108, L241115 (2023)

  15. arXiv:2308.16005  [pdf, other]

    quant-ph

    Hybrid Quantum Neural Network Structures for Image Multi-classification

    Authors: Mingrui Shi, Haozhen Situ, Cai Zhang

    Abstract: Image classification is a fundamental computer vision problem, and neural networks offer efficient solutions. With advancing quantum technology, quantum neural networks have gained attention. However, they work only for low-dimensional data and demand dimensionality reduction and quantum encoding. Two recent image classification methods have emerged: one employs PCA dimensionality reduction and an…

    Submitted 30 August, 2023; originally announced August 2023.

  16. arXiv:2308.13415  [pdf, other]

    eess.IV cs.CV cs.LG

    An investigation into the impact of deep learning model choice on sex and race bias in cardiac MR segmentation

    Authors: Tiarna Lee, Esther Puyol-Antón, Bram Ruijsink, Keana Aitcheson, Miaojing Shi, Andrew P. King

    Abstract: In medical imaging, artificial intelligence (AI) is increasingly being used to automate routine tasks. However, these algorithms can exhibit and exacerbate biases which lead to disparate performances between protected groups. We investigate the impact of model choice on how imbalances in subject sex and race in training datasets affect AI-based cine cardiac magnetic resonance image segmentation. W…

    Submitted 25 August, 2023; originally announced August 2023.

  17. arXiv:2308.13411  [pdf, other]

    cs.CV

    Harvard Glaucoma Detection and Progression: A Multimodal Multitask Dataset and Generalization-Reinforced Semi-Supervised Learning

    Authors: Yan Luo, Min Shi, Yu Tian, Tobias Elze, Mengyu Wang

    Abstract: Glaucoma is the number one cause of irreversible blindness globally. A major challenge for accurate glaucoma detection and progression forecasting is the bottleneck of limited labeled patients with the state-of-the-art (SOTA) 3D retinal imaging data of optical coherence tomography (OCT). To address the data scarcity issue, this paper proposes two solutions. First, we develop a novel generalization…

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  18. arXiv:2308.11270  [pdf]

    cond-mat.supr-con

    Green-light p-n Junction Particle Inhomogeneous Phase Enhancement of MgB2 Smart Meta-Superconductor

    Authors: Yao Qi, Duo Chen, Yongbo Li, Chao Sun, Qingyu Hai, Miao Shi, Honggang Chen, Xiaopeng Zhao

    Abstract: Improving the critical temperature (TC), critical magnetic field (HC), and critical current (JC) of superconducting materials has always been one of the most significant challenges in the field of superconductivity, but progress has been slow over the years. Based on the concept of injecting energy to enhance electron pairing states, in this study, we have employed a solid-state sintering method t…

    Submitted 22 August, 2023; originally announced August 2023.

  19. arXiv:2308.05232  [pdf, other]

    cs.CV cs.LG

    SegMatch: A semi-supervised learning method for surgical instrument segmentation

    Authors: Meng Wei, Charlie Budd, Luis C. Garcia-Peraza-Herrera, Reuben Dorent, Miaojing Shi, Tom Vercauteren

    Abstract: Surgical instrument segmentation is recognised as a key enabler to provide advanced surgical assistance and improve computer assisted interventions. In this work, we propose SegMatch, a semi-supervised learning method to reduce the need for expensive annotation for laparoscopic and robotic surgical images. SegMatch builds on FixMatch, a widespread semi-supervised classification pipeline combining…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: preprint under review, 12 pages, 7 figures

  20. arXiv:2308.02916  [pdf, other]

    cs.LG cs.AI

    Adversarial Erasing with Pruned Elements: Towards Better Graph Lottery Ticket

    Authors: Yuwen Wang, Shunyu Liu, Kaixuan Chen, Tongtian Zhu, Ji Qiao, Mengjie Shi, Yuanyu Wan, Mingli Song

    Abstract: Graph Lottery Ticket (GLT), a combination of core subgraph and sparse subnetwork, has been proposed to mitigate the computational cost of deep Graph Neural Networks (GNNs) on large input graphs while preserving original performance. However, the winning GLTs in existing studies are obtained by applying iterative magnitude-based pruning (IMP) without re-evaluating and re-considering the pruned inf…

    Submitted 10 August, 2023; v1 submitted 5 August, 2023; originally announced August 2023.

    Comments: 17 pages, 10 figures, Accepted by ECAI2023

  21. arXiv:2308.02313  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    The fate of quasiparticles at high-temperature

    Authors: A. Hunter, S. Beck, E. Cappelli, F. Margot, M. Straub, Y. Alexanian, G. Gatti, M. D. Watson, T. K. Kim, C. Cacho, N. C. Plumb, M. Shi, M. Radović, D. A. Sokolov, A. P. Mackenzie, M. Zingl, J. Mravlje, A. Georges, F. Baumberger, A. Tamai

    Abstract: We study the temperature evolution of quasiparticles in the correlated metal Sr$_2$RuO$_4$. Our angle resolved photoemission data show that quasiparticles persist up to temperatures above 200~K, far beyond the Fermi liquid regime. Extracting the quasiparticle self-energy we demonstrate that the quasiparticle residue $Z$ increases with increasing temperature. Quasiparticles eventually disappear on…

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Supplemental Material available upon request

    Journal ref: Phys. Rev. Lett. 131, 236502 (2023)

  22. arXiv:2308.01907  [pdf, other]

    cs.CV

    The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

    Authors: Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao

    Abstract: We present the All-Seeing (AS) project: a large-scale data and model for recognizing and understanding everything in the open world. Using a scalable data engine that incorporates human feedback and efficient models in the loop, we create a new dataset (AS-1B) with over 1 billion regions annotated with semantic tags, question-answering pairs, and detailed captions. It covers a wide range of 3.5 mi…

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: Technical Report

  23. arXiv:2307.14382  [pdf, other]

    cs.LG cs.AI cs.CV

    When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review

    Authors: Maxime Fontana, Michael Spratling, Miaojing Shi

    Abstract: Multi-Task Learning (MTL) aims to learn multiple tasks simultaneously while exploiting their mutual relationships. By using shared resources to simultaneously calculate multiple outputs, this learning paradigm has the potential to have lower memory requirements and inference times compared to the traditional approach of using separate methods for each task. Previous work in MTL has mainly focused…

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: 25 pages, 4 figures, 4 tables

  24. arXiv:2307.08695  [pdf, other]

    cs.CV

    Neural Video Depth Stabilizer

    Authors: Yiran Wang, Min Shi, Jiaqi Li, Zihao Huang, Zhiguo Cao, Jianming Zhang, Ke Xian, Guosheng Lin

    Abstract: Video depth estimation aims to infer temporally consistent depth. Some methods achieve temporal consistency by finetuning a single-image depth model during test time using geometry and re-projection constraints, which is inefficient and not robust. An alternative approach is to learn how to enforce temporal consistency from data, but this requires well-designed models and sufficient video depth da…

    Submitted 10 August, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV2023

  25. TreeFormer: a Semi-Supervised Transformer-based Framework for Tree Counting from a Single High Resolution Image

    Authors: Hamed Amini Amirkolaee, Miaojing Shi, Mark Mulligan

    Abstract: Automatic tree density estimation and counting using single aerial and satellite images is a challenging task in photogrammetry and remote sensing, yet has an important role in forest management. In this paper, we propose the first semi-supervised transformer-based framework for tree counting which reduces the expensive tree annotations for remote sensing images. Our method, termed TreeFormer, f…

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted in IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

  26. arXiv:2307.05183  [pdf, other]

    quant-ph

    Quantum-enhanced Electrometer based on Microwave-dressed Rydberg Atoms

    Authors: Shuhe Wu, Dong Zhang, Zhengchun Li, Minwei Shi, Peiyu Yang, Jinxian Guo, Wei Du, Guzhi Bao, Weiping Zhang

    Abstract: Rydberg atoms have shown remarkable performance in sensing microwave fields. The sensitivity of such an electrometer based on optical readout of an atomic ensemble has been demonstrated to approach the photon-shot-noise limit. However, the sensitivity cannot be improved indefinitely by increasing the power of the probe light, due to the increased collision rates and power broadening. Compared with clas…

    Submitted 11 July, 2023; originally announced July 2023.

  27. arXiv:2306.16343  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Electronic Landscape of Kagome Superconductors $\textit{A}$V$_{3}$Sb$_{5}$ ($\textit{A}$ = K, Rb, Cs) from Angle-Resolved Photoemission Spectroscopy

    Authors: Yong Hu, Xianxin Wu, Andreas P. Schnyder, Ming Shi

    Abstract: The recently discovered layered kagome superconductors $\textit{A}$V$_{3}$Sb$_{5}$ ($\textit{A}$ = K, Rb, Cs) have garnered significant attention, as they exhibit an intriguing combination of superconductivity, charge density wave (CDW) order, and nontrivial band topology. As such, these kagome systems serve as an exceptional quantum platform for investigating the intricate interplay between elect…

    Submitted 13 November, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Journal ref: npj Quantum Mater. 8, 67 (2023)

  28. Automated Grading and Feedback Tools for Programming Education: A Systematic Review

    Authors: Marcus Messer, Neil C. C. Brown, Michael Kölling, Miaojing Shi

    Abstract: We conducted a systematic literature review on automated grading and feedback tools for programming education. We analysed 121 research papers from 2017 to 2021 inclusive and categorised them based on skills assessed, approach, language paradigm, degree of automation and evaluation techniques. Most papers assess the correctness of assignments in object-oriented languages. Typically, these to…

    Submitted 5 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted version of the manuscript

  29. Learning-based sound speed estimation and aberration correction in linear-array photoacoustic imaging

    Authors: Mengjie Shi, Tom Vercauteren, Wenfeng Xia

    Abstract: Photoacoustic (PA) image reconstruction involves acoustic inversion that necessitates the specification of the speed of sound (SoS) within the medium of propagation. Due to the lack of information on the spatial distribution of the SoS within heterogeneous soft tissue, a homogeneous SoS distribution (such as 1540 m/s) is typically assumed in PA image reconstruction, similar to that of ultrasound (…

    Submitted 5 March, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

  30. arXiv:2306.09264  [pdf, other]

    cs.CV

    Harvard Glaucoma Fairness: A Retinal Nerve Disease Dataset for Fairness Learning and Fair Identity Normalization

    Authors: Yan Luo, Yu Tian, Min Shi, Louis R. Pasquale, Lucy Q. Shen, Nazlee Zebardast, Tobias Elze, Mengyu Wang

    Abstract: Fairness (used interchangeably with equity) in machine learning is important for societal well-being, but limited public datasets hinder its progress. Currently, no dedicated public medical datasets with imaging data for fairness learning are available, though minority groups suffer from more health issues. To address this gap, we introduce Harvard Glaucoma Fairness (Harvard-GF), a retinal ner…

    Submitted 10 March, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted in IEEE Transactions on Medical Imaging

  31. arXiv:2306.09244  [pdf, other]

    cs.CV

    Text Promptable Surgical Instrument Segmentation with Vision-Language Models

    Authors: Zijian Zhou, Oluwatosin Alabi, Meng Wei, Tom Vercauteren, Miaojing Shi

    Abstract: In this paper, we propose a novel text promptable surgical instrument segmentation approach to overcome challenges associated with diversity and differentiation of surgical instruments in minimally invasive surgeries. We redefine the task as text promptable, thereby enabling a more nuanced comprehension of surgical instruments and adaptability to new instrument types. Inspired by recent advancemen…

    Submitted 8 November, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

    Journal ref: https://proceedings.neurips.cc/paper_files/paper/2023/hash/5af741d487c5f0b08bfe56e11d1883e4-Abstract-Conference.html

  32. arXiv:2306.08762  [pdf, other]

    cs.LG cs.AI

    Theoretical Hardness and Tractability of POMDPs in RL with Partial Online State Information

    Authors: Ming Shi, Yingbin Liang, Ness Shroff

    Abstract: Partially observable Markov decision processes (POMDPs) have been widely applied in various real-world applications. However, existing theoretical results have shown that learning in POMDPs is intractable in the worst case, where the main challenge lies in the lack of latent state information. A key fundamental question here is: how much online state information (OSI) is sufficient to achieve trac…

    Submitted 11 March, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Submitted for publication

  33. arXiv:2306.08736  [pdf, other]

    cs.CV

    LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation

    Authors: Linfeng Yuan, Miaojing Shi, Zijie Yue, Qijun Chen

    Abstract: Referring video object segmentation (RVOS) aims to segment the target instance referred by a given text expression in a video clip. The text expression normally contains sophisticated description of the instance's appearance, action, and relation with others. It is therefore rather difficult for an RVOS model to capture all these attributes correspondingly in the video; in fact, the model often fav…

    Submitted 1 April, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: CVPR2024

  34. arXiv:2305.19563  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Zero-Shot Automatic Pronunciation Assessment

    Authors: Hongfu Liu, Mingqian Shi, Ye Wang

    Abstract: Automatic Pronunciation Assessment (APA) is vital for computer-assisted language learning. Prior methods rely on annotated speech-text data to train Automatic Speech Recognition (ASR) models or speech-score data to train regression models. In this work, we propose a novel zero-shot APA method based on the pre-trained acoustic model, HuBERT. Our method involves encoding speech input and corrupting…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted to Interspeech 2023

  35. arXiv:2305.12507  [pdf, other]

    astro-ph.SR astro-ph.EP astro-ph.GA physics.plasm-ph physics.space-ph

    Small-amplitude Compressible Magnetohydrodynamic Turbulence Modulated by Collisionless Damping in Earth's Magnetosheath: Observation Matches Theory

    Authors: Siqi Zhao, Huirong Yan, Terry Z. Liu, Ka Ho Yuen, Mijie Shi

    Abstract: Plasma turbulence is a ubiquitous dynamical process that transfers energy across many spatial and temporal scales and affects energetic particle transport. Recent advances in the understanding of compressible magnetohydrodynamic (MHD) turbulence demonstrate the important role of damping in shaping energy distributions on small scales, yet its observational evidence is still lacking. This study pro…

    Submitted 8 February, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Main text: 5 pages, 6 figures. Accepted by ApJ on Dec. 5, 2023. Published on Feb. 08, 2024

  36. arXiv:2305.12459  [pdf, other]

    eess.AS cs.SD

    CASA-ASR: Context-Aware Speaker-Attributed ASR

    Authors: Mohan Shi, Zhihao Du, Qian Chen, Fan Yu, Yangze Li, Shiliang Zhang, Jie Zhang, Li-Rong Dai

    Abstract: Recently, speaker-attributed automatic speech recognition (SA-ASR), which aims at answering the question ``who spoke what'', has attracted wide attention. Different from modular systems, end-to-end (E2E) SA-ASR minimizes the speaker-dependent recognition errors directly and shows promising applicability. In this paper, we propose a context-aware SA-ASR (CASA-ASR) model by enhancing the contextu…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted by Interspeech2023

  37. arXiv:2305.12450  [pdf, other]

    eess.AS cs.SD

    Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction

    Authors: Mohan Shi, Yuchun Shu, Lingyun Zuo, Qian Chen, Shiliang Zhang, Jie Zhang, Li-Rong Dai

    Abstract: For speech interaction, voice activity detection (VAD) is often used as a front-end. However, traditional VAD algorithms usually need to wait for a continuous tail silence to reach a preset maximum duration before segmentation, resulting in a large latency that affects user experience. In this paper, we propose a novel semantic VAD for low-latency segmentation. Different from existing methods, a f…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted by Interspeech2023

  38. arXiv:2305.10671  [pdf, ps, other]

    math.NT cs.IT

    A new method for solving the equation $x^d+(x+1)^d=b$ in $\mathbb{F}_{q^4}$ where $d=q^3+q^2+q-1$

    Authors: Liqin Qian, Minjia Shi, Wei Lu

    Abstract: In this paper, we give a new method to answer a recent conjecture proposed by Budaghyan, Calderini, Carlet, Davidova and Kaleyski about the equation $x^d+(x+1)^d=b$ in $\mathbb{F}_{q^4}$, where $n$ is a positive integer, $q=2^n$ and $d=q^3+q^2+q-1$. In particular, we directly determine the differential spectrum of this power function $x^d$ using methods different from those in the literature. Comp…

    Submitted 17 May, 2023; originally announced May 2023.

  39. arXiv:2305.08561  [pdf, ps, other]

    cs.IT math.CO

    Characterization of Plotkin-optimal two-weight codes over finite chain rings and related applications

    Authors: Shitao Li, Minjia Shi

    Abstract: Few-weight codes over finite chain rings are associated with combinatorial objects such as strongly regular graphs (SRGs), strongly walk-regular graphs (SWRGs) and finite geometries, and are also widely used in data storage systems and secret sharing schemes. The first objective of this paper is to characterize all possible parameters of Plotkin-optimal two-homogeneous weight regular projective co…

    Submitted 15 May, 2023; originally announced May 2023.

    MSC Class: 94B05; 05E30

  40. arXiv:2305.06910  [pdf, other]

    physics.soc-ph physics.data-an

    Filtering higher-order datasets

    Authors: Nicholas W. Landry, Ilya Amburg, Mirah Shi, Sinan G. Aksoy

    Abstract: Many complex systems often contain interactions between more than two nodes, known as higher-order interactions, which can change the structure of these systems in significant ways. Researchers often assume that all interactions paint a consistent picture of a higher-order dataset's structure. In contrast, the connection patterns of individuals or entities in empirical systems are often stratified…

    Submitted 1 November, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: 22 pages, 17 figures

  41. arXiv:2305.02735  [pdf, ps, other]

    math.CO cs.DM cs.IT

    Quasi-cyclic perfect codes in Doob graphs and special partitions of Galois rings

    Authors: Minjia Shi, Xiaoxiao Li, Denis S. Krotov, Ferruh Özbudak

    Abstract: The Galois ring GR$(4^Δ)$ is the residue ring $Z_4[x]/(h(x))$, where $h(x)$ is a basic primitive polynomial of degree $Δ$ over $Z_4$. For any odd $Δ$ larger than $1$, we construct a partition of GR$(4^Δ) \backslash \{0\}$ into $6$-subsets of type $\{a,b,-a-b,-a,-b,a+b\}$ and $3$-subsets of type $\{c,-c,2c\}$ such that the partition is invariant under the multiplication by a nonzero element of the…

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted version; 7 IEEE TIT pages

    MSC Class: 94B99

    Journal ref: IEEE Trans. Inf. Theory 69(9) 2023, 5597-5603

  42. arXiv:2305.02360  [pdf, other

    cs.CV cs.AI

    Fashionpedia-Ads: Do Your Favorite Advertisements Reveal Your Fashion Taste?

    Authors: Mengyun Shi, Claire Cardie, Serge Belongie

    Abstract: Consumers are exposed to advertisements across many different domains on the internet, such as fashion, beauty, car, food, and others. On the other hand, fashion represents second highest e-commerce shopping category. Does consumer digital record behavior on various fashion ad images reveal their fashion taste? Does ads from other domains infer their fashion taste as well? In this paper, we study…

    Submitted 3 May, 2023; originally announced May 2023.

  43. arXiv:2305.02307  [pdf, other

    cs.CV cs.AI cs.DB

    Fashionpedia-Taste: A Dataset towards Explaining Human Fashion Taste

    Authors: Mengyun Shi, Serge Belongie, Claire Cardie

    Abstract: Existing fashion datasets do not consider the multi-facts that cause a consumer to like or dislike a fashion image. Even two consumers like a same fashion image, they could like this image for total different reasons. In this paper, we study the reason why a consumer like a certain fashion image. Towards this goal, we introduce an interpretability dataset, Fashionpedia-taste, consist of rich annot…

    Submitted 3 May, 2023; originally announced May 2023.

  44. arXiv:2304.08197  [pdf, ps, other

    cond-mat.str-el cond-mat.mtrl-sci

    Competing charge-density wave instabilities in the kagome metal ScV$_6$Sn$_6$

    Authors: Saizheng Cao, Chenchao Xu, Hiroshi Fukui, Taishun Manjo, Ming Shi, Yang Liu, Chao Cao, Yu Song

    Abstract: Owing to its unique geometry, the kagome lattice hosts various many-body quantum states including frustrated magnetism, superconductivity, and charge-density waves (CDWs), with intense efforts focused on kagome metals exhibiting $2\times2$ CDWs associated with the nesting of van Hove saddle points. Recently, a $\sqrt{3}\times\sqrt{3}$ CDW was discovered in the kagome metal ScV$_6$Sn$_6$ below…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Supplementary Information available upon request

    Journal ref: Nat. Commun. 14, 7671 (2023)

  45. arXiv:2304.06436  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci cond-mat.other

    Hidden magnetism uncovered in charge ordered bilayer kagome material ScV$_6$Sn$_6$

    Authors: Z. Guguchia, D. J. Gawryluk, Soohyeon Shin, Z. Hao, C. Mielke III, D. Das, I. Plokhikh, L. Liborio, K. Shenton, Y. Hu, V. Sazgari, M. Medarde, H. Deng, Y. Cai, C. Chen, Y. Jiang, A. Amato, M. Shi, M. Z. Hasan, J. -X. Yin, R. Khasanov, E. Pomjakushina, H. Luetkens

    Abstract: Charge ordered kagome lattices have been demonstrated to be intriguing platforms for studying the intertwining of topology, correlation, and magnetism. The recently discovered charge ordered kagome material ScV$_6$Sn$_6$ does not feature a magnetic groundstate or excitations, thus it is often regarded as a conventional paramagnet. Here, using advanced muon-spin rotation spectroscopy, we uncover an une…

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 9 pages, 4 figures

  46. arXiv:2304.06431  [pdf

    cond-mat.str-el cond-mat.mtrl-sci cond-mat.supr-con

    Phonon promoted charge density wave in topological kagome metal ScV$_{6}$Sn$_{6}$

    Authors: Yong Hu, Junzhang Ma, Yinxiang Li, Dariusz Jakub Gawryluk, Tianchen Hu, Jérémie Teyssier, Volodymyr Multian, Zhouyi Yin, Yuxiao Jiang, Shuxiang Xu, Soohyeon Shin, Igor Plokhikh, Xinloong Han, Nicholas Clark Plumb, Yang Liu, Jiaxin Yin, Zurab Guguchia, Yue Zhao, Andreas P. Schnyder, Xianxin Wu, Ekaterina Pomjakushina, M. Zahid Hasan, Nanlin Wang, Ming Shi

    Abstract: Charge density wave (CDW) orders in vanadium-based kagome metals have recently received tremendous attention due to their unique properties and intricate interplay with exotic correlated phenomena, topological and symmetry-breaking states. However, the origin of the CDW order remains a topic of debate. The discovery of ScV$_{6}$Sn$_{6}$, a vanadium-based bilayer kagome metal exhibiting an in-plane…

    Submitted 13 April, 2023; originally announced April 2023.

    Journal ref: Nat Commun 15, 1658 (2024)

  47. arXiv:2304.04441  [pdf, other

    cs.CV cs.LG

    Self-training with dual uncertainty for semi-supervised medical image segmentation

    Authors: Zhanhong Qiu, Haitao Gan, Ming Shi, Zhongwei Huang, Zhi Yang

    Abstract: In the field of semi-supervised medical image segmentation, the shortage of labeled data is the fundamental problem. How to effectively learn image features from unlabeled images to improve segmentation accuracy is the main research direction in this field. Traditional self-training methods can partially solve the problem of insufficient labeled data by generating pseudo labels for iterative train…

    Submitted 10 October, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  48. arXiv:2303.17966  [pdf, other

    cs.LG

    HD-GCN: A Hybrid Diffusion Graph Convolutional Network

    Authors: Zhi Yang, Kang Li, Haitao Gan, Zhongwei Huang, Ming Shi

    Abstract: The information diffusion performance of GCN and its variant models is limited by the adjacency matrix, which can lower their performance. Therefore, we introduce a new framework for graph convolutional networks called Hybrid Diffusion-based Graph Convolutional Network (HD-GCN) to address the limitations of information diffusion caused by the adjacency matrix. In the HD-GCN framework, we initially…

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: 16 pages, 4 figures

  49. arXiv:2303.16729  [pdf, ps, other

    cs.IT cs.CR

    Binary self-orthogonal codes which meet the Griesmer bound or have optimal minimum distances

    Authors: Minjia Shi, Shitao Li, Tor Helleseth, Jon-Lark Kim

    Abstract: The purpose of this paper is two-fold. First, we characterize the existence of binary self-orthogonal codes meeting the Griesmer bound by employing Solomon-Stiffler codes and some related residual codes. Second, using such a characterization, we determine the exact value of $d_{so}(n,7)$ except for five special cases and the exact value of $d_{so}(n,8)$ except for 41 special cases, where…

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: Submitted 20 January, 2023

    MSC Class: 94B05

  50. arXiv:2303.16156  [pdf, ps, other

    cs.GR

    Remarks on the derivatives of rational Bézier curves

    Authors: Mao Shi

    Abstract: By studying the existing higher order derivation formulas of rational Bézier curves, we find that they fail when the order of the derivative exceeds the degree of the curves. In this paper, we present a new derivation formula for rational Bézier curves that overcomes this drawback and show that the $k$th degree derivative of a $n$th degree rational Bézier curve can be written in terms of a…

    Submitted 21 February, 2024; v1 submitted 27 March, 2023; originally announced March 2023.