-
Data-driven Nucleus Subclassification on Colon H&E using Style-transferred Digital Pathology
Authors:
Lucas W. Remedios,
Shunxing Bao,
Samuel W. Remedios,
Ho Hin Lee,
Leon Y. Cai,
Thomas Li,
Ruining Deng,
Nancy R. Newlin,
Adam M. Saunders,
Can Cui,
Jia Li,
Qi Liu,
Ken S. Lau,
Joseph T. Roland,
Mary K. Washington,
Lori A. Coburn,
Keith T. Wilson,
Yuankai Huo,
Bennett A. Landman
Abstract:
Understanding the way cells communicate, co-locate, and interrelate is essential to furthering our understanding of how the body functions. H&E is widely available; however, cell subtyping often requires expert knowledge and the use of specialized stains. To reduce the annotation burden, AI has been proposed for the classification of cells on H&E. For example, the recent Colon Nucleus Identification and Classification (CoNIC) Challenge focused on labeling 6 cell types on H&E of the colon. However, the CoNIC Challenge was unable to classify epithelial subtypes (progenitor, enteroendocrine, goblet), lymphocyte subtypes (B, helper T, cytotoxic T), and connective subtypes (fibroblasts). We use inter-modality learning to label previously un-labelable cell types on H&E. We take advantage of multiplexed immunofluorescence (MxIF) histology to label 14 cell subclasses. We performed style transfer on the same MxIF tissues to synthesize realistic virtual H&E which we paired with the MxIF-derived cell subclassification labels. We evaluated the efficacy of using a supervised learning scheme where the input was realistic-quality virtual H&E and the labels were MxIF-derived cell subclasses. We assessed our model on private virtual H&E and public real H&E. On virtual H&E, we were able to classify helper T cells and epithelial progenitors with positive predictive values of $0.34 \pm 0.15$ (prevalence $0.03 \pm 0.01$) and $0.47 \pm 0.1$ (prevalence $0.07 \pm 0.02$) respectively, when using ground truth centroid information. On real H&E we could classify helper T cells and epithelial progenitors with upper bound positive predictive values of $0.43 \pm 0.03$ (parent class prevalence 0.21) and $0.94 \pm 0.02$ (parent class prevalence 0.49) when using ground truth centroid information. This is the first work to provide cell type classification for helper T and epithelial progenitor nuclei on H&E.
Submitted 15 May, 2024;
originally announced July 2024.
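The abstract above reports each positive predictive value (PPV) alongside the class prevalence. The following is a minimal sketch of why that pairing matters, assuming the standard definition PPV = TP / (TP + FP) and the fact that a labeler that ignores the image would, in expectation, achieve a PPV equal to the prevalence. The function names are hypothetical, and the "lift" is simply the ratio of the reported means, not a statistic taken from the paper.

```python
def positive_predictive_value(true_positives: int, false_positives: int) -> float:
    """PPV = TP / (TP + FP): the fraction of predicted positives that are correct."""
    predicted = true_positives + false_positives
    if predicted == 0:
        raise ValueError("PPV is undefined when nothing is predicted positive")
    return true_positives / predicted


def lift_over_prevalence(ppv: float, prevalence: float) -> float:
    """Ratio of PPV to prevalence; 1.0 is the expected value for class-blind guessing."""
    if prevalence <= 0:
        raise ValueError("prevalence must be positive")
    return ppv / prevalence


# Mean values quoted in the abstract for virtual H&E: (PPV, prevalence).
reported = {
    "helper T": (0.34, 0.03),
    "epithelial progenitor": (0.47, 0.07),
}

for name, (ppv, prevalence) in reported.items():
    lift = lift_over_prevalence(ppv, prevalence)
    print(f"{name}: PPV {ppv:.2f} vs prevalence {prevalence:.2f} (about {lift:.1f}x)")
```

Read this way, a PPV of 0.34 for a class that makes up roughly 3% of nuclei is far above what chance would give, even though the absolute number looks modest.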
-
Waterfall: Framework for Robust and Scalable Text Watermarking
Authors:
Gregory Kang Ruey Lau,
Xinyuan Niu,
Hieu Dao,
Jiangwei Chen,
Chuan-Sheng Foo,
Bryan Kian Hsiang Low
Abstract:
Protecting intellectual property (IP) of text such as articles and code is increasingly important, especially as sophisticated attacks become possible, such as paraphrasing by large language models (LLMs) or even unauthorized training of LLMs on copyrighted text to infringe such IP. However, existing text watermarking methods are neither robust enough against such attacks nor scalable to millions of users for practical implementation. In this paper, we propose Waterfall, the first training-free framework for robust and scalable text watermarking applicable across multiple text types (e.g., articles, code) and languages supportable by LLMs, for general text and LLM data provenance. Waterfall comprises several key innovations, such as being the first to use LLMs as paraphrasers for watermarking along with a novel combination of techniques that are surprisingly effective in achieving robust verifiability and scalability. We empirically demonstrate that Waterfall achieves significantly better scalability, robust verifiability, and computational efficiency compared to SOTA article-text watermarking methods, and also show how it can be directly applied to the watermarking of code.
Submitted 5 July, 2024;
originally announced July 2024.
-
Data-Centric AI in the Age of Large Language Models
Authors:
Xinyi Xu,
Zhaoxuan Wu,
Rui Qiao,
Arun Verma,
Yao Shu,
Jingtan Wang,
Xinyuan Niu,
Zhenfeng He,
Jiangwei Chen,
Zijian Zhou,
Gregory Kang Ruey Lau,
Hieu Dao,
Lucas Agussurja,
Rachael Hwee Ling Sim,
Xiaoqiang Lin,
Wenyang Hu,
Zhongxiang Dai,
Pang Wei Koh,
Bryan Kian Hsiang Low
Abstract:
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs). We start by making the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs, and yet it receives disproportionally low attention from the research community. We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization. In each scenario, we underscore the importance of data, highlight promising research directions, and articulate the potential impacts on the research community and, where applicable, the society as a whole. For instance, we advocate for a suite of data-centric benchmarks tailored to the scale and complexity of data for LLMs. These benchmarks can be used to develop new data curation methods and document research efforts and results, which can help promote openness and transparency in AI and LLM research.
Submitted 20 June, 2024;
originally announced June 2024.
-
Save It for the "Hot" Day: An LLM-Empowered Visual Analytics System for Heat Risk Management
Authors:
Haobo Li,
Wong Kam-Kwai,
Yan Luo,
Juntong Chen,
Chengzhong Liu,
Yaxuan Zhang,
Alexis Kai Hon Lau,
Huamin Qu,
Dongyu Liu
Abstract:
The escalating frequency and intensity of heat-related climate events, particularly heatwaves, emphasize the pressing need for advanced heat risk management strategies. Current approaches, primarily relying on numerical models, face challenges in spatial-temporal resolution and in capturing the dynamic interplay of environmental, social, and behavioral factors affecting heat risks. This has led to difficulties in translating risk assessments into effective mitigation actions. Recognizing these problems, we introduce a novel approach leveraging the burgeoning capabilities of Large Language Models (LLMs) to extract rich and contextual insights from news reports. We hence propose an LLM-empowered visual analytics system, Havior, that integrates the precise, data-driven insights of numerical models with nuanced news report information. This hybrid approach enables a more comprehensive assessment of heat risks and better identification, assessment, and mitigation of heat-related threats. The system incorporates novel visualization designs, such as "thermoglyph" and news glyph, enhancing intuitive understanding and analysis of heat risks. The integration of LLM-based techniques also enables advanced information retrieval and semantic knowledge extraction that can be guided by experts' analytics needs. Our case studies on two cities that faced significant heatwave events and interviews with five experts have demonstrated the usefulness of our system in providing in-depth and actionable insights for heat risk management.
Submitted 7 June, 2024; v1 submitted 5 June, 2024;
originally announced June 2024.
-
Constraining Inflation with the BICEP/Keck CMB Polarization Experiments
Authors:
The BICEP/Keck Collaboration,
P. A. R. Ade,
Z. Ahmed,
M. Amiri,
D. Barkats,
R. Basu Thakur,
C. A. Bischoff,
D. Beck,
J. J. Bock,
H. Boenish,
V. Buza,
J. R. Cheshire IV,
J. Connors,
J. Cornelison,
M. Crumrine,
A. Cukierman,
E. V. Denison,
M. Dierickx,
L. Duband,
M. Eiben,
B. Elwood,
S. Fatigoni,
J. P. Filippini,
M. Gao
, et al. (63 additional authors not shown)
Abstract:
The BICEP/$\textit{Keck}$ (BK) series of cosmic microwave background (CMB) polarization experiments has, over the past decade and a half, produced a series of field-leading constraints on cosmic inflation via measurements of the "B-mode" polarization of the CMB. Primordial B modes are directly tied to the amplitude of primordial gravitational waves (PGW), their strength parameterized by the tensor-to-scalar ratio, $r$, and thus the energy scale of inflation. Having set the most sensitive constraints to date on $r$, $\sigma(r)=0.009$ ($r_{0.05}<0.036$, $95\%$ C.L.) using data through the 2018 observing season ("BK18"), the BICEP/$\textit{Keck}$ program has continued to improve its dataset in the years since. We give a brief overview of the BK program and the "BK18" result before discussing the program's ongoing efforts, including the deployment and performance of the $\textit{Keck Array}$'s successor instrument, BICEP Array, improvements to data processing and internal consistency testing, new techniques such as delensing, and how those will ultimately serve to allow BK to reach $\sigma(r) \lesssim 0.003$ using data through the 2027 observing season.
Submitted 11 July, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
Thermalization and Criticality on an Analog-Digital Quantum Simulator
Authors:
Trond I. Andersen,
Nikita Astrakhantsev,
Amir H. Karamlou,
Julia Berndtsson,
Johannes Motruk,
Aaron Szasz,
Jonathan A. Gross,
Alexander Schuckert,
Tom Westerhout,
Yaxing Zhang,
Ebrahim Forati,
Dario Rossi,
Bryce Kobrin,
Agustin Di Paolo,
Andrey R. Klots,
Ilya Drozdov,
Vladislav D. Kurilovich,
Andre Petukhov,
Lev B. Ioffe,
Andreas Elben,
Aniket Rath,
Vittorio Vitale,
Benoit Vermersch,
Rajeev Acharya,
Laleh Aghababaie Beni
, et al. (202 additional authors not shown)
Abstract:
Understanding how interacting particles approach thermal equilibrium is a major challenge of quantum simulators. Unlocking the full potential of such systems toward this goal requires flexible initial state preparation, precise time evolution, and extensive probes for final state characterization. We present a quantum simulator comprising 69 superconducting qubits which supports both universal quantum gates and high-fidelity analog evolution, with performance beyond the reach of classical simulation in cross-entropy benchmarking experiments. Emulating a two-dimensional (2D) XY quantum magnet, we leverage a wide range of measurement techniques to study quantum states after ramps from an antiferromagnetic initial state. We observe signatures of the classical Kosterlitz-Thouless phase transition, as well as strong deviations from Kibble-Zurek scaling predictions attributed to the interplay between quantum and classical coarsening of the correlated domains. This interpretation is corroborated by injecting variable energy density into the initial state, which enables studying the effects of the eigenstate thermalization hypothesis (ETH) in targeted parts of the eigenspectrum. Finally, we digitally prepare the system in pairwise-entangled dimer states and image the transport of energy and vorticity during thermalization. These results establish the efficacy of superconducting analog-digital quantum processors for preparing states across many-body spectra and unveiling their thermalization dynamics.
Submitted 8 July, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
AudioRepInceptionNeXt: A lightweight single-stream architecture for efficient audio recognition
Authors:
Kin Wai Lau,
Yasar Abbas Ur Rehman,
Lai-Man Po
Abstract:
Recent research has successfully adapted vision-based convolutional neural network (CNN) architectures for audio recognition tasks using Mel-Spectrograms. However, these CNNs have high computational costs and memory requirements, limiting their deployment on low-end edge devices. Motivated by the success of efficient vision models like InceptionNeXt and ConvNeXt, we propose AudioRepInceptionNeXt, a single-stream architecture. Its basic building block breaks down the parallel multi-branch depth-wise convolutions with descending scales of k x k kernels into a cascade of two multi-branch depth-wise convolutions. The first multi-branch consists of parallel multi-scale 1 x k depth-wise convolutional layers followed by a similar multi-branch employing parallel multi-scale k x 1 depth-wise convolutional layers. This reduces computational and memory footprint while separating time and frequency processing of Mel-Spectrograms. The large kernels capture global frequencies and long activities, while small kernels get local frequencies and short activities. We also reparameterize the multi-branch design during inference to further boost speed without losing accuracy. Experiments show that AudioRepInceptionNeXt reduces parameters and computations by 50%+ and improves inference speed 1.28x over state-of-the-art CNNs like the Slow-Fast while maintaining comparable accuracy. It also learns robustly across a variety of audio recognition tasks. Codes are available at https://github.com/StevenLauHKHK/AudioRepInceptionNeXt.
Submitted 21 April, 2024;
originally announced April 2024.
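The abstract above says that replacing k x k depth-wise kernels with a cascade of 1 x k and k x 1 depth-wise kernels reduces the computational and memory footprint. A rough sketch of the weight-count arithmetic behind that claim follows. It assumes a single branch, counts only convolution weights (no biases or normalization), and uses illustrative channel and kernel sizes that are not taken from the paper.

```python
def depthwise_kxk_weights(channels: int, k: int) -> int:
    """A depth-wise k x k convolution holds one k x k kernel per channel."""
    return channels * k * k


def depthwise_cascade_weights(channels: int, k: int) -> int:
    """A 1 x k depth-wise layer followed by a k x 1 depth-wise layer: k weights each per channel."""
    return channels * k + channels * k


channels = 64  # hypothetical channel count, for illustration only
for k in (3, 7, 11):  # hypothetical kernel sizes, for illustration only
    full = depthwise_kxk_weights(channels, k)
    split = depthwise_cascade_weights(channels, k)
    print(f"k={k}: k x k uses {full} weights, 1 x k then k x 1 uses {split} "
          f"({split / full:.2f} of the original)")
```

Under these assumptions the cascade needs 2/k of the weights of the full kernel, so the saving grows with kernel size. This is consistent in spirit with the reductions the abstract reports, although the paper's actual figures depend on the full architecture.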
-
PINNACLE: PINN Adaptive ColLocation and Experimental points selection
Authors:
Gregory Kang Ruey Lau,
Apivich Hemachandra,
See-Kiong Ng,
Bryan Kian Hsiang Low
Abstract:
Physics-Informed Neural Networks (PINNs), which incorporate PDEs as soft constraints, train with a composite loss function that contains multiple training point types: different types of collocation points chosen during training to enforce each PDE and initial/boundary conditions, and experimental points which are usually costly to obtain via experiments or simulations. Training PINNs using this loss function is challenging as it typically requires selecting large numbers of points of different types, each with different training dynamics. Unlike past works that focused on the selection of either collocation or experimental points, this work introduces PINN Adaptive ColLocation and Experimental points selection (PINNACLE), the first algorithm that jointly optimizes the selection of all training point types, while automatically adjusting the proportion of collocation point types as training progresses. PINNACLE uses information on the interaction among training point types, which had not been considered before, based on an analysis of PINN training dynamics via the Neural Tangent Kernel (NTK). We theoretically show that the criterion used by PINNACLE is related to the PINN generalization error, and empirically demonstrate that PINNACLE is able to outperform existing point selection methods for forward, inverse, and transfer learning problems.
Submitted 11 April, 2024;
originally announced April 2024.
-
Bayesian Federated Model Compression for Communication and Computation Efficiency
Authors:
Chengyu Xia,
Danny H. K. Tsang,
Vincent K. N. Lau
Abstract:
In this paper, we investigate Bayesian model compression in federated learning (FL) to construct sparse models that can achieve both communication and computation efficiencies. We propose a decentralized Turbo variational Bayesian inference (D-Turbo-VBI) FL framework where we first propose a hierarchical sparse prior to promote a clustered sparse structure in the weight matrix. Then, by carefully integrating message passing and VBI with a decentralized turbo framework, we propose the D-Turbo-VBI algorithm which can (i) reduce both upstream and downstream communication overhead during federated training, and (ii) reduce the computational complexity during local inference. Additionally, we establish the convergence property for the proposed D-Turbo-VBI algorithm. Simulation results show the significant gain of our proposed algorithm over the baselines in reducing communication overhead during federated training and computational complexity of the final model.
Submitted 11 April, 2024;
originally announced April 2024.
-
Doppler Tracking Data of Martian Mission Tianwen-I and Upper Limit of Stochastic Gravitational Wave Background
Authors:
Xiaoming Bi,
Zhongkai Guo,
Xiaobo Zou,
Yong Huang,
Peijia Li,
Jianfeng Cao,
Lue Chen,
Wenlin Tang,
Yun Kau Lau
Abstract:
Two-way ranging data for spacecraft tracking of China's first Martian mission Tianwen-I is analysed. Shortly before the spacecraft entered the Mars parking orbit, the two-way coherent microwave link between the spacecraft and the Earth resembles a long-arm gravitational wave interferometer, with both the spacecraft and the Earth regarded as in an approximate free-falling state. By carefully selecting and analysing data segments of the time series of the two-way ranging data during this time span, a parametric statistical model is built for the data segments and an upper limit for the stochastic gravitational wave background (SGWB) is then estimated within the frequency window 0.1 Hz to 0.1 mHz. The upper bound improves considerably on those obtained before. In particular, around the deci-Hz band, there is a three orders of magnitude improvement on the bound obtained previously by the two-way ranging data of the Chang'e 3 mission. Scientific applications of the upper bound are then considered and a weak upper bound is worked out for axions, which are a promising candidate for ultra-light dark matter.
Submitted 8 February, 2024;
originally announced February 2024.
-
Embedding Large Language Models into Extended Reality: Opportunities and Challenges for Inclusion, Engagement, and Privacy
Authors:
Efe Bozkir,
Süleyman Özdel,
Ka Hei Carrie Lau,
Mengdi Wang,
Hong Gao,
Enkelejda Kasneci
Abstract:
Advances in artificial intelligence and human-computer interaction will likely lead to extended reality (XR) becoming pervasive. While XR can provide users with interactive, engaging, and immersive experiences, non-player characters are often utilized in pre-scripted and conventional ways. This paper argues for using large language models (LLMs) in XR by embedding them in avatars or as narratives to facilitate inclusion through prompt engineering and fine-tuning the LLMs. We argue that this inclusion will promote diversity for XR use. Furthermore, the versatile conversational capabilities of LLMs will likely increase engagement in XR, helping XR become ubiquitous. Lastly, we speculate that combining the information provided to LLM-powered spaces by users and the biometric data obtained might lead to novel privacy invasions. While exploring potential privacy breaches, examining user privacy concerns and preferences is also essential. Therefore, despite challenges, LLM-powered XR is a promising area with several opportunities.
Submitted 20 June, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Exploring Federated Self-Supervised Learning for General Purpose Audio Understanding
Authors:
Yasar Abbas Ur Rehman,
Kin Wai Lau,
Yuyang Xie,
Lan Ma,
Jiajun Shen
Abstract:
The integration of Federated Learning (FL) and Self-supervised Learning (SSL) offers a unique and synergetic combination to exploit audio data for general-purpose audio understanding, without compromising user data privacy. However, few efforts have been made to investigate SSL models in the FL regime for general-purpose audio understanding, especially when the training data is generated by large-scale heterogeneous audio sources. In this paper, we evaluate the performance of feature-matching and predictive audio-SSL techniques when integrated into large-scale FL settings simulated with non-independent and identically distributed (non-iid) data. We propose a novel Federated SSL (F-SSL) framework, dubbed FASSL, that enables learning intermediate feature representations from large-scale decentralized heterogeneous clients, holding unlabelled audio data. Our study has found that audio F-SSL approaches perform on par with the centralized audio-SSL approaches on the audio-retrieval task. Extensive experiments demonstrate the effectiveness and significance of FASSL as it assists in obtaining the optimal global model for state-of-the-art FL aggregation methods.
Submitted 5 February, 2024;
originally announced February 2024.
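The abstract above refers to FL aggregation methods without naming them. As a point of reference only, the sketch below shows federated averaging (FedAvg), the most familiar aggregation rule, in which the server averages client parameters weighted by each client's local sample count. Whether FASSL uses this rule is an assumption, not something the abstract states, and the function name and toy values are hypothetical.

```python
from typing import List, Sequence


def fedavg(client_weights: Sequence[Sequence[float]],
           client_sizes: Sequence[int]) -> List[float]:
    """Average flat parameter vectors, weighting each client by its sample count."""
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client and at least one client")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")
    dim = len(client_weights[0])
    if any(len(w) != dim for w in client_weights):
        raise ValueError("all clients must share the same parameter shape")
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]


# Toy example: two clients with unequal amounts of local (non-iid) data.
clients = [[1.0, 0.0], [3.0, 2.0]]
sizes = [100, 300]
print(fedavg(clients, sizes))  # [2.5, 1.5]
```

The client holding more data pulls the global model toward its parameters, which is one reason heterogeneous (non-iid) clients make federated training harder than centralized training.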
-
Nucleus subtype classification using inter-modality learning
Authors:
Lucas W. Remedios,
Shunxing Bao,
Samuel W. Remedios,
Ho Hin Lee,
Leon Y. Cai,
Thomas Li,
Ruining Deng,
Can Cui,
Jia Li,
Qi Liu,
Ken S. Lau,
Joseph T. Roland,
Mary K. Washington,
Lori A. Coburn,
Keith T. Wilson,
Yuankai Huo,
Bennett A. Landman
Abstract:
Understanding the way cells communicate, co-locate, and interrelate is essential to understanding human physiology. Hematoxylin and eosin (H&E) staining is ubiquitously available both for clinical studies and research. The Colon Nucleus Identification and Classification (CoNIC) Challenge has recently innovated on robust artificial intelligence labeling of six cell types on H&E stains of the colon. However, this is a very small fraction of the number of potential cell classification types. Specifically, the CoNIC Challenge is unable to classify epithelial subtypes (progenitor, endocrine, goblet), lymphocyte subtypes (B, helper T, cytotoxic T), or connective subtypes (fibroblasts, stromal). In this paper, we propose to use inter-modality learning to label previously un-labelable cell types on virtual H&E. We leveraged multiplexed immunofluorescence (MxIF) histology imaging to identify 14 subclasses of cell types. We performed style transfer to synthesize virtual H&E from MxIF and transferred the higher density labels from MxIF to these virtual H&E images. We then evaluated the efficacy of learning in this approach. We identified helper T and progenitor nuclei with positive predictive values of $0.34 \pm 0.15$ (prevalence $0.03 \pm 0.01$) and $0.47 \pm 0.1$ (prevalence $0.07 \pm 0.02$) respectively on virtual H&E. This approach represents a promising step towards automating annotation in digital pathology.
Submitted 28 January, 2024; v1 submitted 10 January, 2024;
originally announced January 2024.
-
Mitigating Nonlinear Algorithmic Bias in Binary Classification
Authors:
Wendy Hui,
Wai Kwong Lau
Abstract:
This paper proposes the use of causal modeling to detect and mitigate algorithmic bias that is nonlinear in the protected attribute. We provide a general overview of our approach. We use the German Credit data set, which is available for download from the UC Irvine Machine Learning Repository, to develop (1) a prediction model, which is treated as a black box, and (2) a causal model for bias mitigation. In this paper, we focus on age bias and the problem of binary classification. We show that the probability of getting correctly classified as "low risk" is lowest among young people. The probability increases with age nonlinearly. To incorporate the nonlinearity into the causal model, we introduce a higher order polynomial term. Based on the fitted causal model, the de-biased probability estimates are computed, showing improved fairness with little impact on overall classification accuracy. Causal modeling is intuitive and, hence, its use can enhance explicability and promote trust among different stakeholders of AI.
Submitted 7 May, 2024; v1 submitted 8 December, 2023;
originally announced December 2023.
-
Validation of Consumer-grade Digital Camera-based Human Activity Evaluation for Upper Limb Exercises and Development of a Therapist-guided, Automated Telerehabilitation Framework and Platform for Stroke Rehabilitation
Authors:
Elton H. L. Yeung,
Yingxian Chen,
Wilton W. T. Fok,
Gary K. K. Lau
Abstract:
Timely and adequate rehabilitation is critical in facilitating post-stroke recovery. However, the organization and delivery of rehabilitation are resource-demanding, and are only available to approximately 25% of stroke survivors in low-to-middle-income countries. Improving access to stroke rehabilitation services through innovative solutions is therefore urgently required. Tele-rehabilitation, which transitions care to home and community settings, has emerged as a promising solution. However, current approaches using video tutorials, teleconferences, or other specialized devices face inherent shortfalls that limit their uptake. In this study, we proposed and validated the use of an open-source, markerless motion capture model with consumer-grade devices to overcome these challenges. Our solution enables reliable measurement of the end range of motion during upper limb exercises with near-perfect waveform similarity and intraclass correlation to that of the gold standard Kinect approach. Our multidisciplinary team developed an automated telerehabilitation framework incorporating the validated markerless technique to facilitate a seamless telerehabilitation process. It enables personalized rehabilitation plans with real-time feedback, and individual progress reports using objective quantitative and qualitative features to improve patient monitoring and management, and home-based rehabilitation service uptake and compliance. This study serves as a proof-of-concept in preparation for the future development of a detailed model of care, and feasibility, usability, and cost-effectiveness studies of an automated telerehabilitation platform and framework in improving the state of post-stroke rehabilitation and functional outcome.
Submitted 10 February, 2024; v1 submitted 21 November, 2023;
originally announced November 2023.
-
Adaptive Uncertainty Estimation via High-Dimensional Testing on Latent Representations
Authors:
Tsai Hor Chan,
Kin Wai Lau,
Jiajun Shen,
Guosheng Yin,
Lequan Yu
Abstract:
Uncertainty estimation aims to evaluate the confidence of a trained deep neural network. However, existing uncertainty estimation approaches rely on low-dimensional distributional assumptions and thus suffer from the high dimensionality of latent features. Existing approaches tend to focus on uncertainty on discrete classification probabilities, which leads to poor generalizability to uncertainty estimation for other tasks. Moreover, most of the literature requires seeing the out-of-distribution (OOD) data in the training for better estimation of uncertainty, which limits the uncertainty estimation performance in practice because the OOD data are typically unseen. To overcome these limitations, we propose a new framework using data-adaptive high-dimensional hypothesis testing for uncertainty estimation, which leverages the statistical properties of the feature representations. Our method directly operates on latent representations and thus does not require retraining the feature encoder under a modified objective. The test statistic relaxes the feature distribution assumptions to high dimensionality, and it is more discriminative to uncertainties in the latent representations. We demonstrate that encoding features with Bayesian neural networks can enhance testing performance and lead to more accurate uncertainty estimation. We further introduce a family-wise testing procedure to determine the optimal threshold of OOD detection, which minimizes the false discovery rate (FDR). Extensive experiments validate the satisfactory performance of our framework on uncertainty estimation and task-specific prediction over a variety of competitors. The experiments on the OOD detection task also show satisfactory performance of our method when the OOD data are unseen in the training. Codes are available at https://github.com/HKU-MedAI/bnn_uncertainty.
Submitted 25 October, 2023;
originally announced October 2023.
-
Resonant Scattering of Gravitational Waves With Electromagnetic Waves
Authors:
Ruodi Yan,
Yun Kau Lau
Abstract:
A certain class of exact solutions of Einstein-Maxwell spacetime in general relativity is discussed, which demonstrates at the level of theory that, when a certain parametric resonance condition is met, the interaction of an electromagnetic field with a gravitational wave will display a certain Liapounov instability and lead to exponential amplification of a gravitational wave train described by a certain Newman-Penrose component of the Weyl curvature. In a way akin to a free electron laser in electromagnetic theory, the coherent conversion of electromagnetic energy into gravitational energy displays the feasibility of generating a pulsed, laser-like intense beam of gravitational waves.
Submitted 23 October, 2023;
originally announced October 2023.
-
Detecting and Mitigating Algorithmic Bias in Binary Classification using Causal Modeling
Authors:
Wendy Hui,
Wai Kwong Lau
Abstract:
This paper proposes the use of causal modeling to detect and mitigate algorithmic bias. We provide a brief description of causal modeling and a general overview of our approach. We then use the Adult dataset, which is available for download from the UC Irvine Machine Learning Repository, to develop (1) a prediction model, which is treated as a black box, and (2) a causal model for bias mitigation. In this paper, we focus on gender bias and the problem of binary classification. We show that gender bias in the prediction model is statistically significant at the 0.05 level. We demonstrate the effectiveness of the causal model in mitigating gender bias by cross-validation. Furthermore, we show that the overall classification accuracy is improved slightly. Our novel approach is intuitive, easy-to-use, and can be implemented using existing statistical software tools such as "lavaan" in R. Hence, it enhances explainability and promotes trust.
Submitted 8 November, 2023; v1 submitted 18 October, 2023;
originally announced October 2023.
-
Results and Limits of Time Division Multiplexing for the BICEP Array High Frequency Receivers
Authors:
S. Fatigoni,
P. A. R. Ade,
Z. Ahmed,
M. Amiri,
D. Barkats,
R. Basu Thakur,
C. A. Bischoff,
D. Beck,
J. J. Bock,
V. Buza,
J. Cheshire,
J. Connors,
J. Cornelison,
M. Crumrine,
A. J. Cukierman,
E. V. Denison,
M. I. Dierickx,
L. Duband,
M. Eiben,
J. P. Filippini,
A. Fortes,
M. Gao,
C. Giannakopoulos,
N. Goeckner-Wald,
D. C. Goldfinger
, et al. (62 additional authors not shown)
Abstract:
Time-Division Multiplexing is the readout architecture of choice for many ground and space experiments, as it is a very mature technology with proven outstanding low-frequency noise stability, which represents a central challenge in multiplexing. Once fully populated, each of the two BICEP Array high frequency receivers, observing at 150 GHz and 220/270 GHz, will have 7776 TES detectors tiled on the focal plane. The constraints set by these two receivers required a redesign of the warm readout electronics. The new version of the standard Multi Channel Electronics, developed and built at the University of British Columbia, is presented here for the first time. BICEP Array operates Time Division Multiplexing readout technology at the limits of its capabilities in terms of multiplexing rate, noise and crosstalk, and applies it in a rigorously demanding scientific application requiring extreme noise performance and systematic error control. Future experiments like CMB-S4 plan to use TES bolometers with Time Division/SQUID-based readout for an even larger number of detectors.
Submitted 24 October, 2023; v1 submitted 16 October, 2023;
originally announced October 2023.
-
A unified framework for STAR-RIS coefficients optimization
Authors:
Hancheng Zhu,
Yuanwei Liu,
Yik Chung Wu,
Vincent K. N. Lau
Abstract:
Simultaneously transmitting and reflecting (STAR) reconfigurable intelligent surface (RIS), which serves users located on both sides of the surface, has recently emerged as a promising enhancement to the traditional reflective-only RIS. Due to the lack of a unified comparison of communication systems equipped with different modes of STAR-RIS and the performance degradation caused by the constraints involving discrete selection, this paper proposes a unified optimization framework for handling the STAR-RIS operating mode and discrete phase constraints. With a judiciously introduced penalty term, this framework transforms the original problem into two iterative subproblems, with one containing the selection-type constraints and the other handling the remaining wireless resources. The convergent point of the whole algorithm is found to be at least a stationary point under mild conditions. As an illustrative example, the proposed framework is applied to a sum-rate maximization problem in the downlink transmission. Simulation results show that the algorithms from the proposed framework outperform other existing algorithms tailored for different STAR-RIS scenarios. Furthermore, it is found that a STAR-RIS with 4 or even 2 discrete phases could achieve almost the same sum-rate performance as the continuous phase setting, showing for the first time that discrete phase is not necessarily a cause of significant performance degradation.
Submitted 13 October, 2023;
originally announced October 2023.
-
Quantum Bayesian Optimization
Authors:
Zhongxiang Dai,
Gregory Kang Ruey Lau,
Arun Verma,
Yao Shu,
Bryan Kian Hsiang Low,
Patrick Jaillet
Abstract:
Kernelized bandits, also known as Bayesian optimization (BO), has been a prevalent method for optimizing complicated black-box reward functions. Various BO algorithms have been theoretically shown to enjoy upper bounds on their cumulative regret which are sub-linear in the number $T$ of iterations, and a regret lower bound of $\Omega(\sqrt{T})$ has been derived which represents the unavoidable regrets for any classical BO algorithm. Recent works on quantum bandits have shown that with the aid of quantum computing, it is possible to achieve tighter regret upper bounds better than their corresponding classical lower bounds. However, these works are restricted to either multi-armed or linear bandits, and are hence not able to solve sophisticated real-world problems with non-linear reward functions. To this end, we introduce the quantum-Gaussian process-upper confidence bound (Q-GP-UCB) algorithm. To the best of our knowledge, our Q-GP-UCB is the first BO algorithm able to achieve a regret upper bound of $O(\mathrm{polylog}\, T)$, which is significantly smaller than its regret lower bound of $\Omega(\sqrt{T})$ in the classical setting. Moreover, thanks to our novel analysis of the confidence ellipsoid, our Q-GP-UCB with the linear kernel achieves a smaller regret than the quantum linear UCB algorithm from the previous work. We use simulations, as well as an experiment using a real quantum computer, to verify that the theoretical quantum speedup achieved by our Q-GP-UCB is also potentially relevant in practice.
Submitted 8 October, 2023;
originally announced October 2023.
-
WTH! Wok the Hydrogen: Measurement of Galactic Neutral Hydrogen in Noisy Urban Environment Using Kitchenware
Authors:
Leo W. H. Fung,
Albert Wai Kit Lau,
Ka Hung Chan,
Ming Tony Shing
Abstract:
Astronomical observation is difficult in urban environments due to the background noise generated by human activities. Consequently, promoting astronomy in metropolitan areas is challenging. In this work, we propose a low-cost, educational experiment called Wok the Hydrogen (WTH) that offers opportunities for scientific observation in urban environments, specifically the observation of the $21$ cm ($f_{21} = 1420.4$ MHz) emission from neutral hydrogen in the Milky Way. We demonstrate how to construct a radio telescope using kitchenware, along with additional electronic equipment that can be easily purchased online. The total system cost is kept within 150 dollars. We also outline the subsequent data analysis procedures for deriving the recession velocity of galactic hydrogen from the raw data. The system was tested on the campus of the Hong Kong University of Science and Technology, which is located approximately 2 km northeast of the nearest residential area with a population of 0.4 million and about 10 km east of the downtown area with a population of 2 million. We show that a precision of $\Delta v \approx \pm 20$ km s$^{-1}$ can be achieved for determining the recession velocity of neutral hydrogen with this relatively simple setup, and the precision can be further improved with longer exposure time.
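As a rough illustration of the velocity-derivation step mentioned in this abstract, the sketch below converts an observed line frequency into a line-of-sight velocity using the non-relativistic Doppler relation $v = c\,(f_{21} - f_{\mathrm{obs}})/f_{21}$. The relation, the function name `recession_velocity_km_s`, and the sample frequencies are assumptions chosen for illustration; the abstract does not describe the authors' actual pipeline, which may apply corrections not shown here.

```python
# Speed of light in km/s (exact SI value).
C_KM_S = 299_792.458
# Rest frequency of the neutral hydrogen line as quoted in the abstract, in MHz.
F21_MHZ = 1420.4


def recession_velocity_km_s(f_obs_mhz: float, f_rest_mhz: float = F21_MHZ) -> float:
    """Non-relativistic Doppler velocity in km/s.

    Positive values mean the source is receding (observed frequency is
    below the rest frequency); negative values mean it is approaching.
    """
    return C_KM_S * (f_rest_mhz - f_obs_mhz) / f_rest_mhz


if __name__ == "__main__":
    # Hypothetical observed peak frequencies, for illustration only.
    for f_obs in (1420.3, 1420.4, 1420.5):
        v = recession_velocity_km_s(f_obs)
        print(f"{f_obs:.1f} MHz -> {v:+.1f} km/s")
```

Under this relation, a shift of 0.1 MHz from the rest frequency corresponds to roughly 21 km/s.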
Submitted 28 September, 2023; v1 submitted 26 September, 2023;
originally announced September 2023.
-
Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNN
Authors:
Kin Wai Lau,
Lai-Man Po,
Yasar Abbas Ur Rehman
Abstract:
Visual Attention Networks (VAN) with Large Kernel Attention (LKA) modules have been shown to provide remarkable performance that surpasses Vision Transformers (ViTs) on a range of vision-based tasks. However, the depth-wise convolutional layer in these LKA modules incurs a quadratic increase in the computational and memory footprints with increasing convolutional kernel size. To mitigate these problems and to enable the use of extremely large convolutional kernels in the attention modules of VAN, we propose a family of Large Separable Kernel Attention modules, termed LSKA. LSKA decomposes the 2D convolutional kernel of the depth-wise convolutional layer into cascaded horizontal and vertical 1D kernels. In contrast to the standard LKA design, the proposed decomposition enables the direct use of the depth-wise convolutional layer with large kernels in the attention module, without requiring any extra blocks. We demonstrate that the proposed LSKA module in VAN can achieve comparable performance with the standard LKA module and incur lower computational complexity and memory footprints. We also find that the proposed LSKA design biases the VAN more toward the shape of the object than the texture with increasing kernel size. Additionally, we benchmark the robustness of the LKA and LSKA in VAN, ViTs, and the recent ConvNeXt on the five corrupted versions of the ImageNet dataset that are largely unexplored in the previous works. Our extensive experimental results show that the proposed LSKA module in VAN provides a significant reduction in computational complexity and memory footprints with increasing kernel size while outperforming ViTs and ConvNeXt, and providing similar performance compared to the LKA module in VAN on object recognition, object detection, semantic segmentation, and robustness tests.
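The idea of replacing a 2D kernel with cascaded horizontal and vertical 1D kernels can be sketched on a single channel with plain NumPy. This is only an illustration of the separable factorization and its parameter count, not the LSKA module itself: the helper `correlate2d_valid`, the kernel size `k = 7`, and the random inputs are all hypothetical, and the rest of the attention module is not modeled. Note that a cascade of a 1 x k and a k x 1 kernel is equivalent to a rank-1 k x k kernel, so it covers a restricted family of 2D kernels.

```python
import numpy as np


def correlate2d_valid(image: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """Naive single-channel 2D cross-correlation without padding."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out


rng = np.random.default_rng(0)
k = 7  # hypothetical kernel size
image = rng.standard_normal((16, 16))
horizontal = rng.standard_normal((1, k))  # 1 x k kernel
vertical = rng.standard_normal((k, 1))    # k x 1 kernel

# Cascade: horizontal pass first, then vertical pass.
cascaded = correlate2d_valid(correlate2d_valid(image, horizontal), vertical)

# Equivalent single pass with the rank-1 k x k kernel (outer product).
full_kernel = vertical @ horizontal
direct = correlate2d_valid(image, full_kernel)

# Weights per channel: k*k for the 2D kernel versus 2*k for the cascade.
params_2d = k * k
params_separable = 2 * k

if __name__ == "__main__":
    print("outputs match:", np.allclose(cascaded, direct))
    print("weights per channel:", params_2d, "vs", params_separable)
```

The weight count per channel grows quadratically in `k` for the 2D kernel and linearly for the cascade, which is the scaling contrast the abstract describes.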
Submitted 19 October, 2023; v1 submitted 4 September, 2023;
originally announced September 2023.
-
Cell Spatial Analysis in Crohn's Disease: Unveiling Local Cell Arrangement Pattern with Graph-based Signatures
Authors:
Shunxing Bao,
Sichen Zhu,
Vasantha L Kolachala,
Lucas W. Remedios,
Yeonjoo Hwang,
Yutong Sun,
Ruining Deng,
Can Cui,
Yike Li,
Jia Li,
Joseph T. Roland,
Qi Liu,
Ken S. Lau,
Subra Kugathasan,
Peng Qiu,
Keith T. Wilson,
Lori A. Coburn,
Bennett A. Landman,
Yuankai Huo
Abstract:
Crohn's disease (CD) is a chronic and relapsing inflammatory condition that affects segments of the gastrointestinal tract. CD activity is determined by histological findings, particularly the density of neutrophils observed on hematoxylin and eosin (H&E) stained imaging. However, understanding the broader morphometry and local cell arrangement beyond cell counting and tissue morphology remains challenging. To address this, we characterize six distinct cell types from H&E images and develop a novel approach for the local spatial signature of each cell. Specifically, we create a 10-cell neighborhood matrix, representing neighboring cell arrangements for each individual cell. Utilizing t-SNE for non-linear spatial projection in scatter-plot and Kernel Density Estimation contour-plot formats, our study examines patterns of differences in the cellular environment associated with the odds ratio of spatial patterns between active CD and control groups. This analysis is based on data collected at two research institutes. The findings reveal heterogeneous nearest-neighbor patterns, signifying distinct tendencies of cell clustering, with a particular focus on the rectum region. These variations underscore the impact of data heterogeneity on cell spatial arrangements in CD patients. Moreover, the spatial distribution disparities between the two research sites highlight the significance of collaborative efforts among healthcare organizations. All research analysis pipeline tools are available at https://github.com/MASILab/cellNN.
Submitted 20 August, 2023;
originally announced August 2023.
-
AudioInceptionNeXt: TCL AI LAB Submission to EPIC-SOUND Audio-Based-Interaction-Recognition Challenge 2023
Authors:
Kin Wai Lau,
Yasar Abbas Ur Rehman,
Yuyang Xie,
Lan Ma
Abstract:
This report presents the technical details of our submission to the 2023 Epic-Kitchen EPIC-SOUNDS Audio-Based Interaction Recognition Challenge. The task is to learn the mapping from audio samples to their corresponding action labels. To achieve this goal, we propose a simple yet effective single-stream CNN-based architecture called AudioInceptionNeXt that operates on the time-frequency log-mel-spectrogram of the audio samples. Motivated by the design of the InceptionNeXt, we propose parallel multi-scale depthwise separable convolutional kernels in the AudioInceptionNeXt block, which enable the model to learn the time and frequency information more effectively. The large-scale separable kernels capture the long duration of activities and the global frequency semantic information, while the small-scale separable kernels capture the short duration of activities and local details of frequency information. Our approach achieved a top-1 accuracy of 55.43% on the challenge test set, ranking 1st on the public leaderboard. Code is available anonymously at https://github.com/StevenLauHKHK/AudioInceptionNeXt.git.
Submitted 14 July, 2023;
originally announced July 2023.
-
Feasibility of Universal Anomaly Detection without Knowing the Abnormality in Medical Images
Authors:
Can Cui,
Yaohong Wang,
Shunxing Bao,
Yucheng Tang,
Ruining Deng,
Lucas W. Remedios,
Zuhayr Asad,
Joseph T. Roland,
Ken S. Lau,
Qi Liu,
Lori A. Coburn,
Keith T. Wilson,
Bennett A. Landman,
Yuankai Huo
Abstract:
Many anomaly detection approaches, especially deep learning methods, have been recently developed to identify abnormal image morphology by only employing normal images during training. Unfortunately, many prior anomaly detection methods were optimized for a specific "known" abnormality (e.g., brain tumor, bone fracture, cell types). Moreover, even though only the normal images were used in the training process, the abnormal images were often employed during the validation process (e.g., epoch selection, hyper-parameter tuning), which might leak the supposed "unknown" abnormality unintentionally. In this study, we investigated these two essential aspects regarding universal anomaly detection in medical images by (1) comparing various anomaly detection methods across four medical datasets, (2) investigating the inevitable but often neglected issue of how to unbiasedly select the optimal anomaly detection model during the validation phase using only normal images, and (3) proposing a simple decision-level ensemble method to leverage the advantage of different kinds of anomaly detection without knowing the abnormality. The results of our experiments indicate that none of the evaluated methods consistently achieved the best performance across all datasets. Our proposed method enhanced the robustness of performance in general (average AUC 0.956).
Submitted 19 August, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Dynamics of magnetization at infinite temperature in a Heisenberg spin chain
Authors:
Eliott Rosenberg,
Trond Andersen,
Rhine Samajdar,
Andre Petukhov,
Jesse Hoke,
Dmitry Abanin,
Andreas Bengtsson,
Ilya Drozdov,
Catherine Erickson,
Paul Klimov,
Xiao Mi,
Alexis Morvan,
Matthew Neeley,
Charles Neill,
Rajeev Acharya,
Richard Allen,
Kyle Anderson,
Markus Ansmann,
Frank Arute,
Kunal Arya,
Abraham Asfaw,
Juan Atalaya,
Joseph Bardin,
A. Bilmes,
Gina Bortoli
, et al. (156 additional authors not shown)
Abstract:
Understanding universal aspects of quantum dynamics is an unresolved problem in statistical mechanics. In particular, the spin dynamics of the 1D Heisenberg model were conjectured to belong to the Kardar-Parisi-Zhang (KPZ) universality class based on the scaling of the infinite-temperature spin-spin correlation function. In a chain of 46 superconducting qubits, we study the probability distribution, $P(\mathcal{M})$, of the magnetization transferred across the chain's center. The first two moments of $P(\mathcal{M})$ show superdiffusive behavior, a hallmark of KPZ universality. However, the third and fourth moments rule out the KPZ conjecture and allow for evaluating other theories. Our results highlight the importance of studying higher moments in determining dynamic universality classes and provide key insights into universal behavior in quantum systems.
Submitted 4 April, 2024; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Synthesizing Speech Test Cases with Text-to-Speech? An Empirical Study on the False Alarms in Automated Speech Recognition Testing
Authors:
Julia Kaiwen Lau,
Kelvin Kai Wen Kong,
Julian Hao Yong,
Per Hoong Tan,
Zhou Yang,
Zi Qian Yong,
Joshua Chern Wey Low,
Chun Yong Chong,
Mei Kuan Lim,
David Lo
Abstract:
Recent studies have proposed the use of Text-To-Speech (TTS) systems to automatically synthesise speech test cases at scale and uncover a large number of failures in ASR systems. However, the failures uncovered by synthetic test cases may not reflect the actual performance of an ASR system when it transcribes human audio, which we refer to as false alarms. Given a failed test case synthesised from TTS systems, which consists of TTS-generated audio and the corresponding ground truth text, we feed the human audio stating the same text to an ASR system. If human audio can be correctly transcribed, an instance of a false alarm is detected. In this study, we investigate false alarm occurrences in five popular ASR systems using synthetic audio generated from four TTS systems and human audio obtained from two commonly used datasets. Our results show that the fewest false alarms are identified when testing Deepspeech, and the number of false alarms is the highest when testing Wav2vec2. On average, false alarm rates range from 21% to 34% in all five ASR systems. Among the four TTS systems used, Google TTS produces the fewest false alarms (17%), and Espeak TTS produces the most (32%). Additionally, we build a false alarm estimator that flags potential false alarms, which achieves promising results: a precision of 98.3%, a recall of 96.4%, an accuracy of 98.5%, and an F1 score of 97.3%. Our study provides insight into the appropriate selection of TTS systems to generate high-quality speech to test ASR systems. Additionally, a false alarm estimator can be a way to minimise the impact of false alarms and help developers choose suitable test inputs when evaluating ASR systems. The source code used in this paper is publicly available on GitHub at https://github.com/julianyonghao/FAinASRtest.
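The false-alarm definition given in this abstract can be sketched as a small check: a synthetic test case that fails is a false alarm if human audio of the same text is transcribed correctly. The function names, the whitespace and case normalization, and the exact-match criterion below are assumptions for illustration; the study may judge correctness differently. The sketch also recomputes the F1 score from the precision and recall quoted in the abstract.

```python
def normalize(text: str) -> str:
    """Lowercase and collapse whitespace (an assumed normalization)."""
    return " ".join(text.lower().split())


def is_false_alarm(reference: str, tts_transcript: str, human_transcript: str) -> bool:
    """True when the TTS-based test case fails but human audio is transcribed correctly."""
    ref = normalize(reference)
    synthetic_failed = normalize(tts_transcript) != ref
    human_correct = normalize(human_transcript) == ref
    return synthetic_failed and human_correct


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical transcripts, for illustration only.
    print(is_false_alarm("turn left", "turn lift", "Turn left"))
    # Precision and recall as reported in the abstract.
    print(f"F1 = {100 * f1_score(0.983, 0.964):.1f}%")
```

With the reported precision of 98.3% and recall of 96.4%, the harmonic mean comes to about 97.3%, which agrees with the F1 score stated in the abstract.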
Submitted 18 July, 2023; v1 submitted 27 May, 2023;
originally announced May 2023.
-
Initial On-Sky Performance testing of the Single-Photon Imager for Nanosecond Astrophysics (SPINA) system
Authors:
Albert Wai Kit Lau,
Nurzhan Shaimoldin,
Zhanat Maksut,
Yan Yan Chan,
Mehdi Shafiee,
Bruce Grossan,
George F. Smoot
Abstract:
This work presents an initial on-sky performance measurement of the Single-Photon Imager for Nanosecond Astrophysics (SPINA) system, part of our Ultra-Fast Astronomy (UFA) program. We developed the SPINA system based on the position-sensitive silicon photomultiplier (PS-SiPM) detector to record both photoelectron (P.E.) temporal and spatial information. The initial on-sky testing of the SPINA system was successfully performed on UT 2022 Jul 10, on the 0.7-meter aperture Nazarbayev University Transient Telescope at the Assy-Turgen Astrophysical Observatory (NUTTelA-TAO). We measured stars with a wide range of brightness and a dark region of the sky without stars $< 18$ mag. We measured the SPINA system's spatial resolution to be $<232\,\mu$m (full-width half-maximum, FWHM), limited by the unstable atmosphere. We measured the total background noise (detector dark counts and sky background) of 1914 counts per second (cps) within this resolution element. We also performed a crosstalk mapping of the detector, obtaining the crosstalk probability of $\sim0.18$ near the detector's center while reaching $\sim 50\%$ at the edges. We derived a $5\sigma$ sensitivity of $17.45$ Gaia-BP magnitude in a 1 s exposure with no atmospheric extinction by comparing the received flux with Gaia-BP band data. For a $10$ ms window and a false alarm rate of once per 100 nights, we derived a transient sensitivity of 14.06 mag. For a $1\,\mu$s or faster time scale, we are limited by crosstalk to a 15 P.E. detection threshold. In addition, we demonstrated that the SPINA system is capable of capturing changes in the stellar profile FWHM of $\pm1.8\%$ and $\pm5\%$ in $20$ ms and $2$ ms exposures, respectively, as well as capturing stellar light curves on the ms and $\mu$s scales.
Submitted 9 May, 2023;
originally announced May 2023.
-
Stable Quantum-Correlated Many Body States through Engineered Dissipation
Authors:
X. Mi,
A. A. Michailidis,
S. Shabani,
K. C. Miao,
P. V. Klimov,
J. Lloyd,
E. Rosenberg,
R. Acharya,
I. Aleiner,
T. I. Andersen,
M. Ansmann,
F. Arute,
K. Arya,
A. Asfaw,
J. Atalaya,
J. C. Bardin,
A. Bengtsson,
G. Bortoli,
A. Bourassa,
J. Bovaird,
L. Brill,
M. Broughton,
B. B. Buckley,
D. A. Buell,
T. Burger
, et al. (142 additional authors not shown)
Abstract:
Engineered dissipative reservoirs have the potential to steer many-body quantum systems toward correlated steady states useful for quantum simulation of high-temperature superconductivity or quantum magnetism. Using up to 49 superconducting qubits, we prepared low-energy states of the transverse-field Ising model through coupling to dissipative auxiliary qubits. In one dimension, we observed long-range quantum correlations and a ground-state fidelity of 0.86 for 18 qubits at the critical point. In two dimensions, we found mutual information that extends beyond nearest neighbors. Lastly, by coupling the system to auxiliaries emulating reservoirs with different chemical potentials, we explored transport in the quantum Heisenberg model. Our results establish engineered dissipation as a scalable alternative to unitary evolution for preparing entangled many-body states on noisy quantum processors.
Submitted 5 April, 2024; v1 submitted 26 April, 2023;
originally announced April 2023.
-
Phase transition in Random Circuit Sampling
Authors:
A. Morvan,
B. Villalonga,
X. Mi,
S. Mandrà,
A. Bengtsson,
P. V. Klimov,
Z. Chen,
S. Hong,
C. Erickson,
I. K. Drozdov,
J. Chau,
G. Laun,
R. Movassagh,
A. Asfaw,
L. T. A. N. Brandão,
R. Peralta,
D. Abanin,
R. Acharya,
R. Allen,
T. I. Andersen,
K. Anderson,
M. Ansmann,
F. Arute,
K. Arya,
J. Atalaya
, et al. (160 additional authors not shown)
Abstract:
Undesired coupling to the surrounding environment destroys long-range correlations on quantum processors and hinders the coherent evolution in the nominally available computational space. This incoherent noise is an outstanding challenge to fully leverage the computation power of near-term quantum processors. It has been shown that benchmarking Random Circuit Sampling (RCS) with Cross-Entropy Benchmarking (XEB) can provide a reliable estimate of the effective size of the Hilbert space coherently available. The extent to which the presence of noise can trivialize the outputs of a given quantum algorithm, i.e. making it spoofable by a classical computation, is an unanswered question. Here, by implementing an RCS algorithm we demonstrate experimentally that there are two phase transitions observable with XEB, which we explain theoretically with a statistical model. The first is a dynamical transition as a function of the number of cycles and is the continuation of the anti-concentration point in the noiseless case. The second is a quantum phase transition controlled by the error per cycle; to identify it analytically and experimentally, we create a weak link model which allows varying the strength of noise versus coherent evolution. Furthermore, by presenting an RCS experiment with 67 qubits at 32 cycles, we demonstrate that the computational cost of our experiment is beyond the capabilities of existing classical supercomputers, even when accounting for the inevitable presence of noise. Our experimental and theoretical work establishes the existence of transitions to a stable computationally complex phase that is reachable with current quantum processors.
Submitted 21 December, 2023; v1 submitted 21 April, 2023;
originally announced April 2023.
-
Cross-scale Multi-instance Learning for Pathological Image Diagnosis
Authors:
Ruining Deng,
Can Cui,
Lucas W. Remedios,
Shunxing Bao,
R. Michael Womick,
Sophie Chiron,
Jia Li,
Joseph T. Roland,
Ken S. Lau,
Qi Liu,
Keith T. Wilson,
Yaohong Wang,
Lori A. Coburn,
Bennett A. Landman,
Yuankai Huo
Abstract:
Analyzing high resolution whole slide images (WSIs) with regard to information across multiple scales poses a significant challenge in digital pathology. Multi-instance learning (MIL) is a common solution for working with high resolution images by classifying bags of objects (i.e. sets of smaller image patches). However, such processing is typically performed at a single scale (e.g., 20x magnification) of WSIs, disregarding the vital inter-scale information that is key to diagnoses by human pathologists. In this study, we propose a novel cross-scale MIL algorithm to explicitly aggregate inter-scale relationships into a single MIL network for pathological image diagnosis. The contribution of this paper is three-fold: (1) A novel cross-scale MIL (CS-MIL) algorithm that integrates the multi-scale information and the inter-scale relationships is proposed; (2) A toy dataset with scale-specific morphological features is created and released to examine and visualize differential cross-scale attention; (3) Superior performance on both in-house and public datasets is demonstrated by our simple cross-scale MIL strategy. The official implementation is publicly available at https://github.com/hrlblab/CS-MIL.
Submitted 16 February, 2024; v1 submitted 31 March, 2023;
originally announced April 2023.
-
Measurement-induced entanglement and teleportation on a noisy quantum processor
Authors:
Jesse C. Hoke,
Matteo Ippoliti,
Eliott Rosenberg,
Dmitry Abanin,
Rajeev Acharya,
Trond I. Andersen,
Markus Ansmann,
Frank Arute,
Kunal Arya,
Abraham Asfaw,
Juan Atalaya,
Joseph C. Bardin,
Andreas Bengtsson,
Gina Bortoli,
Alexandre Bourassa,
Jenna Bovaird,
Leon Brill,
Michael Broughton,
Bob B. Buckley,
David A. Buell,
Tim Burger,
Brian Burkett,
Nicholas Bushnell,
Zijun Chen,
Ben Chiaro
, et al. (138 additional authors not shown)
Abstract:
Measurement has a special role in quantum theory: by collapsing the wavefunction it can enable phenomena such as teleportation and thereby alter the "arrow of time" that constrains unitary evolution. When integrated in many-body dynamics, measurements can lead to emergent patterns of quantum information in space-time that go beyond established paradigms for characterizing phases, either in or out of equilibrium. On present-day NISQ processors, the experimental realization of this physics is challenging due to noise, hardware limitations, and the stochastic nature of quantum measurement. Here we address each of these experimental challenges and investigate measurement-induced quantum information phases on up to 70 superconducting qubits. By leveraging the interchangeability of space and time, we use a duality mapping to avoid mid-circuit measurement and access different manifestations of the underlying phases -- from entanglement scaling to measurement-induced teleportation -- in a unified way. We obtain finite-size signatures of a phase transition with a decoding protocol that correlates the experimental measurement record with classical simulation data. The phases display sharply different sensitivity to noise, which we exploit to turn an inherent hardware limitation into a useful diagnostic. Our work demonstrates an approach to realize measurement-induced physics at scales that are at the limits of current NISQ processors.
Submitted 17 October, 2023; v1 submitted 8 March, 2023;
originally announced March 2023.
-
Structured Bayesian Compression for Deep Neural Networks Based on The Turbo-VBI Approach
Authors:
Chengyu Xia,
Danny H. K. Tsang,
Vincent K. N. Lau
Abstract:
With the growth of neural network size, model compression has attracted increasing interest in recent research. As one of the most common techniques, pruning has been studied for a long time. By exploiting the structured sparsity of the neural network, existing methods can prune neurons instead of individual weights. However, in most existing pruning methods, surviving neurons are randomly connected in the neural network without any structure, and the non-zero weights within each neuron are also randomly distributed. Such irregular sparse structure can cause very high control overhead and irregular memory access for the hardware and even increase the neural network computational complexity. In this paper, we propose a three-layer hierarchical prior to promote a more regular sparse structure during pruning. The proposed three-layer hierarchical prior can achieve per-neuron weight-level structured sparsity and neuron-level structured sparsity. We derive an efficient Turbo-variational Bayesian inferencing (Turbo-VBI) algorithm to solve the resulting model compression problem with the proposed prior. The proposed Turbo-VBI algorithm has low complexity and can support more general priors than existing model compression algorithms. Simulation results show that our proposed algorithm can promote a more regular structure in the pruned neural networks while achieving even better performance in terms of compression rate and inferencing accuracy compared with the baselines.
Submitted 21 February, 2023;
originally announced February 2023.
-
Joint Task Offloading and Cache Placement for Energy-Efficient Mobile Edge Computing Systems
Authors:
Jingxuan Liang,
Hong Xing,
Feng Wang,
Vincent K. N. Lau
Abstract:
This letter investigates a cache-enabled multiuser mobile edge computing (MEC) system with dynamic task arrivals, taking into account the impact of proactive cache placement on the system's overall energy consumption. We consider that an access point (AP) schedules a wireless device (WD) to offload computational tasks while executing the tasks of a finite library in the \emph{task caching} phase, such that the nearby WDs with the same task request arriving later can directly download the task results in the \emph{task arrival and execution} phase. We aim to minimize the system's weighted-sum energy over a finite-time horizon, by jointly optimizing the task caching decision and the MEC execution of the AP, and local computing as well as task offloading of the WDs at each time slot, subject to caching capacity, task causality, and completion deadline constraints. The formulated design problem is a mixed-integer nonlinear program. Under the assumption of fully predictable task arrivals, we first propose a branch-and-bound (BnB) based method to obtain the optimal offline solution. Next, we propose two low-complexity schemes based on convex relaxation and task-popularity, respectively. Finally, numerical results show the benefit of the proposed schemes over existing benchmark schemes.
Submitted 31 January, 2023;
originally announced January 2023.
-
Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images
Authors:
Xindi Wu,
KwunFung Lau,
Francesco Ferroni,
Aljoša Ošep,
Deva Ramanan
Abstract:
Self-driving vehicles rely on urban street maps for autonomous navigation. In this paper, we introduce Pix2Map, a method for inferring urban street map topology directly from ego-view images, as needed to continually update and expand existing maps. This is a challenging task, as we need to infer a complex urban road topology directly from raw image data. The main insight of this paper is that this problem can be posed as cross-modal retrieval by learning a joint, cross-modal embedding space for images and existing maps, represented as discrete graphs that encode the topological layout of the visual surroundings. We conduct our experimental evaluation using the Argoverse dataset and show that it is indeed possible to accurately retrieve street maps corresponding to both seen and unseen roads solely from image data. Moreover, we show that our retrieved maps can be used to update or expand existing maps and even show proof-of-concept results for visual localization and image retrieval from spatial graphs.
Submitted 9 April, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
The photometric observation of the quasi-simultaneous mutual eclipse and occultation between Europa and Ganymede on 22 August 2021
Authors:
Chu Wing So,
Godfrey Ho Ching Luk,
Giann On Ching Chung,
Po Kin Leung,
Kenneith Ho Keung Hui,
Jack Lap Chung Cheung,
Ka Wo Chan,
Edwin Lok Hei Yuen,
Lawrence Wai Kwan Lee,
Patrick Kai Ip Lau,
Gloria Wing Shan Cheung,
Prince Chun Lam Chan,
Jason Chun Shing Pun
Abstract:
Mutual events (MEs) are eclipses and occultations among planetary natural satellites. Most of the time, eclipses and occultations occur separately. However, the same satellite pair will exhibit an eclipse and an occultation quasi-simultaneously under particular orbital configurations. This kind of rare event is termed as a quasi-simultaneous mutual event (QSME). During the 2021 campaign of mutual events of jovian satellites, we observed a QSME between Europa and Ganymede. The present study aims to describe and study the event in detail. We observed the QSME with a CCD camera attached to a 300-mm telescope at the Hong Kong Space Museum Sai Kung iObservatory. We obtained the combined flux of Europa and Ganymede from aperture photometry. A geometric model was developed to explain the light curve observed. Our results are compared with theoretical predictions (O-C). We found that our simple geometric model can explain the QSME fairly accurately, and the QSME light curve is a superposition of the light curves of an eclipse and an occultation. Notably, the observed flux drops are within 2.6% of the theoretical predictions. The size of the event central time O-Cs ranges from -14.4 to 43.2 s. Both O-Cs of flux drop and timing are comparable to other studies adopting more complicated models. Given the event rarity, model simplicity and accuracy, we encourage more observations and analysis on QSMEs to improve Solar System ephemerides.
Submitted 10 December, 2022;
originally announced December 2022.
-
Purification-based quantum error mitigation of pair-correlated electron simulations
Authors:
T. E. O'Brien,
G. Anselmetti,
F. Gkritsis,
V. E. Elfving,
S. Polla,
W. J. Huggins,
O. Oumarou,
K. Kechedzhi,
D. Abanin,
R. Acharya,
I. Aleiner,
R. Allen,
T. I. Andersen,
K. Anderson,
M. Ansmann,
F. Arute,
K. Arya,
A. Asfaw,
J. Atalaya,
D. Bacon,
J. C. Bardin,
A. Bengtsson,
S. Boixo,
G. Bortoli,
A. Bourassa
, et al. (151 additional authors not shown)
Abstract:
An important measure of the development of quantum computing platforms has been the simulation of increasingly complex physical systems. Prior to fault-tolerant quantum computing, robust error mitigation strategies are necessary to continue this growth. Here, we study physical simulation within the seniority-zero electron pairing subspace, which affords both a computational stepping stone to a fully correlated model, and an opportunity to validate recently introduced ``purification-based'' error-mitigation strategies. We compare the performance of error mitigation based on doubling quantum resources in time (echo verification) or in space (virtual distillation), on up to $20$ qubits of a superconducting qubit quantum processor. We observe a reduction of error by one to two orders of magnitude below less sophisticated techniques (e.g. post-selection); the gain from error mitigation is seen to increase with the system size. Employing these error mitigation strategies enables the implementation of the largest variational algorithm for a correlated chemistry system to date. Extrapolating performance from these results allows us to estimate minimum requirements for a beyond-classical simulation of electronic structure. We find that, despite the impressive gains from purification-based error mitigation, significant hardware improvements will be required for classically intractable variational chemistry simulations.
Submitted 19 October, 2022;
originally announced October 2022.
-
Non-Abelian braiding of graph vertices in a superconducting processor
Authors:
Trond I. Andersen,
Yuri D. Lensky,
Kostyantyn Kechedzhi,
Ilya Drozdov,
Andreas Bengtsson,
Sabrina Hong,
Alexis Morvan,
Xiao Mi,
Alex Opremcak,
Rajeev Acharya,
Richard Allen,
Markus Ansmann,
Frank Arute,
Kunal Arya,
Abraham Asfaw,
Juan Atalaya,
Ryan Babbush,
Dave Bacon,
Joseph C. Bardin,
Gina Bortoli,
Alexandre Bourassa,
Jenna Bovaird,
Leon Brill,
Michael Broughton,
Bob B. Buckley
, et al. (144 additional authors not shown)
Abstract:
Indistinguishability of particles is a fundamental principle of quantum mechanics. For all elementary and quasiparticles observed to date - including fermions, bosons, and Abelian anyons - this principle guarantees that the braiding of identical particles leaves the system unchanged. However, in two spatial dimensions, an intriguing possibility exists: braiding of non-Abelian anyons causes rotations in a space of topologically degenerate wavefunctions. Hence, it can change the observables of the system without violating the principle of indistinguishability. Despite the well developed mathematical description of non-Abelian anyons and numerous theoretical proposals, the experimental observation of their exchange statistics has remained elusive for decades. Controllable many-body quantum states generated on quantum processors offer another path for exploring these fundamental phenomena. While efforts on conventional solid-state platforms typically involve Hamiltonian dynamics of quasi-particles, superconducting quantum processors allow for directly manipulating the many-body wavefunction via unitary gates. Building on predictions that stabilizer codes can host projective non-Abelian Ising anyons, we implement a generalized stabilizer code and unitary protocol to create and braid them. This allows us to experimentally verify the fusion rules of the anyons and braid them to realize their statistics. We then study the prospect of employing the anyons for quantum computation and utilize braiding to create an entangled state of anyons encoding three logical qubits. Our work provides new insights about non-Abelian braiding and - through the future inclusion of error correction to achieve topological protection - could open a path toward fault-tolerant quantum computing.
Submitted 31 May, 2023; v1 submitted 18 October, 2022;
originally announced October 2022.
-
BICEP / Keck XVII: Line of Sight Distortion Analysis: Estimates of Gravitational Lensing, Anisotropic Cosmic Birefringence, Patchy Reionization, and Systematic Errors
Authors:
BICEP/Keck Collaboration,
P. A. R. Ade,
Z. Ahmed,
M. Amiri,
D. Barkats,
R. Basu Thakur,
D. Beck,
C. A. Bischoff,
J. J. Bock,
H. Boenish,
E. Bullock,
V. Buza,
J. R. Cheshire IV,
J. Connors,
J. Cornelison,
M. Crumrine,
A. Cukierman,
E. V. Denison,
M. Dierickx,
L. Duband,
M. Eiben,
S. Fatigoni,
J. P. Filippini,
S. Fliescher
, et al. (70 additional authors not shown)
Abstract:
We present estimates of line-of-sight distortion fields derived from the 95 GHz and 150 GHz data taken by BICEP2, BICEP3, and Keck Array up to the 2018 observing season, leading to cosmological constraints and a study of instrumental and astrophysical systematics. Cosmological constraints are derived from three of the distortion fields concerning gravitational lensing from large-scale structure, polarization rotation from magnetic fields or an axion-like field, and the screening effect of patchy reionization. We measure an amplitude of the lensing power spectrum $A_L^{φφ}=0.95 \pm 0.20$. We constrain polarization rotation, expressed as the coupling constant of a Chern-Simons electromagnetic term $g_{aγ} \leq 2.6 \times 10^{-2}/H_I$, where $H_I$ is the inflationary Hubble parameter, and an amplitude of primordial magnetic fields smoothed over 1 Mpc $B_{1\text{Mpc}} \leq 6.6 \;\text{nG}$ at 95 GHz. We constrain the root mean square of optical-depth fluctuations in a simple "crinkly surface" model of patchy reionization, finding $A^τ<0.19$ ($2σ$) for the coherence scale of $L_c=100$. We show that all of the distortion fields of the 95 GHz and 150 GHz polarization maps are consistent with simulations including lensed-$Λ$CDM, dust, and noise, with no evidence for instrumental systematics. In some cases, the EB and TB quadratic estimators presented here are more sensitive than our previous map-based null tests at identifying and rejecting spurious B-modes that might arise from instrumental effects. Finally, we verify that the standard deprojection filtering in the BICEP/Keck data processing is effective at removing temperature to polarization leakage.
Submitted 5 June, 2023; v1 submitted 14 October, 2022;
originally announced October 2022.
-
BICEP / Keck XVI: Characterizing Dust Polarization through Correlations with Neutral Hydrogen
Authors:
BICEP/Keck Collaboration,
P. A. R. Ade,
Z. Ahmed,
M. Amiri,
D. Barkats,
R. Basu Thakur,
D. Beck,
C. A. Bischoff,
J. J. Bock,
H. Boenish,
E. Bullock,
V. Buza,
J. R. Cheshire IV,
S. E. Clark,
J. Connors,
J. Cornelison,
M. Crumrine,
A. Cukierman,
E. V. Denison,
M. Dierickx,
L. Duband,
M. Eiben,
S. Fatigoni,
J. P. Filippini
, et al. (71 additional authors not shown)
Abstract:
We characterize Galactic dust filaments by correlating BICEP/Keck and Planck data with polarization templates based on neutral hydrogen (H I) observations. Dust polarization is important for both our understanding of astrophysical processes in the interstellar medium (ISM) and the search for primordial gravitational waves in the cosmic microwave background (CMB). In the diffuse ISM, H I is strongly correlated with the dust and partly organized into filaments that are aligned with the local magnetic field. We analyze the deep BICEP/Keck data at 95, 150, and 220 GHz, over the low-column-density region of sky where BICEP/Keck has set the best limits on primordial gravitational waves. We separate the H I emission into distinct velocity components and detect dust polarization correlated with the local Galactic H I but not with the H I associated with Magellanic Stream I. We present a robust, multifrequency detection of polarized dust emission correlated with the filamentary H I morphology template down to 95 GHz. For assessing its utility for foreground cleaning, we report that the H I morphology template correlates in B modes at a $\sim$10-65$\%$ level over the multipole range $20 < \ell < 200$ with the BICEP/Keck maps, which contain contributions from dust, CMB, and noise components. We measure the spectral index of the filamentary dust component spectral energy distribution to be $β= 1.54 \pm 0.13$. We find no evidence for decorrelation in this region between the filaments and the rest of the dust field or from the inclusion of dust associated with the intermediate velocity H I. Finally, we explore the morphological parameter space in the H I-based filamentary model.
Submitted 13 March, 2023; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Cross-scale Attention Guided Multi-instance Learning for Crohn's Disease Diagnosis with Pathological Images
Authors:
Ruining Deng,
Can Cui,
Lucas W. Remedios,
Shunxing Bao,
R. Michael Womick,
Sophie Chiron,
Jia Li,
Joseph T. Roland,
Ken S. Lau,
Qi Liu,
Keith T. Wilson,
Yaohong Wang,
Lori A. Coburn,
Bennett A. Landman,
Yuankai Huo
Abstract:
Multi-instance learning (MIL) is widely used in the computer-aided interpretation of pathological Whole Slide Images (WSIs) to solve the lack of pixel-wise or patch-wise annotations. Often, this approach directly applies "natural image driven" MIL algorithms which overlook the multi-scale (i.e. pyramidal) nature of WSIs. Off-the-shelf MIL algorithms are typically deployed on a single-scale of WSIs (e.g., 20x magnification), while human pathologists usually aggregate the global and local patterns in a multi-scale manner (e.g., by zooming in and out between different magnifications). In this study, we propose a novel cross-scale attention mechanism to explicitly aggregate inter-scale interactions into a single MIL network for Crohn's Disease (CD), which is a form of inflammatory bowel disease. The contribution of this paper is two-fold: (1) a cross-scale attention mechanism is proposed to aggregate features from different resolutions with multi-scale interaction; and (2) differential multi-scale attention visualizations are generated to localize explainable lesion patterns. By training ~250,000 H&E-stained Ascending Colon (AC) patches from 20 CD patient and 30 healthy control samples at different scales, our approach achieved a superior Area under the Curve (AUC) score of 0.8924 compared with baseline models. The official implementation is publicly available at https://github.com/hrlblab/CS-MIL.
Submitted 15 August, 2022;
originally announced August 2022.
-
Optimized Design for IRS-Assisted Integrated Sensing and Communication Systems in Clutter Environments
Authors:
Chikun Liao,
Feng Wang,
Vincent K. N. Lau
Abstract:
In this paper, we investigate an intelligent reflecting surface (IRS)-assisted integrated sensing and communication (ISAC) system design in a clutter environment. Assisted by an IRS equipped with a uniform linear array (ULA), a multi-antenna base station (BS) is targeted for communicating with multiple communication users (CUs) and sensing multiple targets simultaneously. We consider the IRS-assisted ISAC design in the case with Type-I or Type-II CUs, where each Type-I and Type-II CU can and cannot cancel the interference from sensing signals, respectively. In particular, we aim to maximize the minimum sensing beampattern gain among multiple targets, by jointly optimizing the BS transmit beamforming vectors and the IRS phase shifting matrix, subject to the signal-to-interference-plus-noise ratio (SINR) constraint for each Type-I/Type-II CU, the interference power constraint per clutter, the transmission power constraint at the BS, and the cross-correlation pattern constraint. Due to the coupling of the BS's transmit design variables and the IRS's phase shifting matrix, the formulated max-min IRS-assisted ISAC design problem in the case with Type-I/Type-II CUs is highly non-convex. As such, we propose an efficient algorithm based on the alternating-optimization and semi-definite relaxation (SDR) techniques. In the case with Type-I CUs, we show that the dedicated sensing signal at the BS is always beneficial to improve the sensing performance. By contrast, the dedicated sensing signal at the BS is not required in the case with Type-II CUs. Numerical results are provided to show that the proposed IRS-assisted ISAC design schemes achieve a significant gain over the existing benchmark schemes.
Submitted 8 August, 2022;
originally announced August 2022.
-
Thermal Testing for Cryogenic CMB Instrument Optical Design
Authors:
D. C. Goldfinger,
P. A. R. Ade,
Z. Ahmed,
M. Amiri,
D. Barkats,
R. Basu Thakur,
D. Beck,
C. A. Bischoff,
J. J. Bock,
V. Buza,
J. Cheshire,
J. Connors,
J. Cornelison,
M. Crumrine,
A. J. Cukierman,
E. V. Denison,
M. I. Dierickx,
L. Duband,
M. Eiben,
S. Fatigoni,
J. P. Filippini,
C. Giannakopoulos,
N. Goeckner-Wald,
J. Grayson,
P. K. Grimes
, et al. (61 additional authors not shown)
Abstract:
Observations of the Cosmic Microwave Background rely on cryogenic instrumentation with cold detectors, readout, and optics providing the low noise performance and instrumental stability required to make more sensitive measurements. It is therefore critical to optimize all aspects of the cryogenic design to achieve the necessary performance, with low temperature components and acceptable system cooling requirements. In particular, we will focus on our use of thermal filters and cold optics, which reduce the thermal load passed along to the cryogenic stages. To test their performance, we have made a series of in situ measurements while integrating the third receiver for the BICEP Array telescope. In addition to characterizing the behavior of this receiver, these measurements continue to refine the models that are being used to inform design choices being made for future instruments.
Submitted 4 August, 2022;
originally announced August 2022.
-
2022 Upgrade and Improved Low Frequency Camera Sensitivity for CMB Observation at the South Pole
Authors:
A. Soliman,
P. A. R. Ade,
Z. Ahmed,
M. Amiri,
D. Barkats,
R. Basu Thakur,
C. A. Bischoff,
D. Beck,
J. J. Bock,
V. Buza,
J. Cheshire,
J. Connors,
J. Cornelison,
M. Crumrine,
A. J. Cukierman,
E. V. Denison,
M. I. Dierickx,
L. Duband,
M. Eiben,
S. Fatigoni,
J. P. Filippini,
C. Giannakopoulos,
N. Goeckner-Wald,
D. C. Goldfinger,
J. Grayson
, et al. (61 additional authors not shown)
Abstract:
Constraining the Galactic foregrounds with multi-frequency Cosmic Microwave Background (CMB) observations is an essential step towards ultimately reaching the sensitivity to measure primordial gravitational waves (PGWs), the sign of inflation after the Big Bang that would be imprinted on the CMB. The BICEP Array telescope is a set of multi-frequency cameras designed to constrain the energy scale of inflation through CMB B-mode searches while also controlling the polarized Galactic foregrounds. The lowest frequency BICEP Array receiver (BA1) has been observing from the South Pole since 2020 and provides 30 GHz and 40 GHz data to characterize the Galactic synchrotron in our CMB maps. In this paper, we present the design of the BA1 detectors and the full optical characterization of the camera including the on-sky performance at the South Pole. The paper also introduces the design challenges during the first observing season including the effect of out-of-band photons on detector performance. It also describes the tests done to diagnose that effect and the new upgrade to minimize these photons, as well as installing more dichroic detectors during the 2022 deployment season to improve the BA1 sensitivity. We finally report background noise measurements of the detectors with the goal of having photon-noise-dominated detectors in both optical channels. BA1 achieves an improvement in mapping speed compared to the previous deployment season.
Submitted 1 August, 2022;
originally announced August 2022.
-
Improved Polarization Calibration of the BICEP3 CMB Polarimeter at the South Pole
Authors:
J. Cornelison,
C. Vergès,
P. A. R. Ade,
Z. Ahmed,
M. Amiri,
D. Barkats,
R. Basu Thakur,
D. Beck,
C. A. Bischoff,
J. J. Bock,
V. Buza,
J. R. Cheshire IV,
J. Connors,
M. Crumrine,
A. J. Cukierman,
E. V. Denison,
M. I. Dierickx,
L. Duband,
M. Eiben,
S. Fatigoni,
J. P. Filippini,
C. Giannakopoulos,
N. Goeckner-Wald,
D. C. Goldfinger,
J. Grayson
, et al. (61 additional authors not shown)
Abstract:
The BICEP3 Polarimeter is a small aperture, refracting telescope, dedicated to the observation of the Cosmic Microwave Background (CMB) at 95GHz. It is designed to target degree angular scale polarization patterns, in particular the very-much-sought-after primordial B-mode signal, which is a unique signature of cosmic inflation. The polarized signal from the sky is reconstructed by differencing co-localized, orthogonally polarized superconducting Transition Edge Sensor (TES) bolometers. In this work, we present absolute measurements of the polarization response of the detectors for more than $\sim 800$ functioning detector pairs of the BICEP3 experiment, out of a total of $\sim 1000$. We use a specifically designed Rotating Polarized Source (RPS) to measure the polarization response at multiple source and telescope boresight rotation angles, to fully map the response over 360 degrees. We present here polarization properties extracted from on-site calibration data taken in January 2022. A similar calibration campaign was performed in 2018, but we found that our constraint was dominated by systematics on the level of $\sim0.5^\circ$. After a number of improvements to the calibration set-up, we are now able to report a significantly lower level of systematic contamination. In the future, such precise measurements will be used to constrain physics beyond the standard cosmological model, namely cosmic birefringence.
Submitted 25 August, 2022; v1 submitted 29 July, 2022;
originally announced July 2022.
-
Suppressing quantum errors by scaling a surface code logical qubit
Authors:
Rajeev Acharya,
Igor Aleiner,
Richard Allen,
Trond I. Andersen,
Markus Ansmann,
Frank Arute,
Kunal Arya,
Abraham Asfaw,
Juan Atalaya,
Ryan Babbush,
Dave Bacon,
Joseph C. Bardin,
Joao Basso,
Andreas Bengtsson,
Sergio Boixo,
Gina Bortoli,
Alexandre Bourassa,
Jenna Bovaird,
Leon Brill,
Michael Broughton,
Bob B. Buckley,
David A. Buell,
Tim Burger,
Brian Burkett,
Nicholas Bushnell
, et al. (132 additional authors not shown)
Abstract:
Practical quantum computing will require error rates that are well below what is achievable with physical qubits. Quantum error correction offers a path to algorithmically-relevant error rates by encoding logical qubits within many physical qubits, where increasing the number of physical qubits enhances protection against physical errors. However, introducing more qubits also increases the number of error sources, so the density of errors must be sufficiently low in order for logical performance to improve with increasing code size. Here, we report the measurement of logical qubit performance scaling across multiple code sizes, and demonstrate that our system of superconducting qubits has sufficient performance to overcome the additional errors from increasing qubit number. We find our distance-5 surface code logical qubit modestly outperforms an ensemble of distance-3 logical qubits on average, both in terms of logical error probability over 25 cycles and logical error per cycle ($2.914\%\pm 0.016\%$ compared to $3.028\%\pm 0.023\%$). To investigate damaging, low-probability error sources, we run a distance-25 repetition code and observe a $1.7\times10^{-6}$ logical error per round floor set by a single high-energy event ($1.6\times10^{-7}$ when excluding this event). We are able to accurately model our experiment, and from this model we can extract error budgets that highlight the biggest challenges for future systems. These results mark the first experimental demonstration where quantum error correction begins to improve performance with increasing qubit number, illuminating the path to reaching the logical error rates required for computation.
Submitted 20 July, 2022; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Field Evaluation of Four Low-cost PM Sensors and Design, Development and Field Evaluation of A Wearable PM Exposure Monitoring System
Authors:
Wei-Ying Yi,
Yu Zhou,
Ya-Fen Chan,
Yee Leung,
Kam-Sang Woo,
Wen-Wei Che,
Kai-Hon Lau,
Jia-Min Chen,
Kwong-Sak Leung
Abstract:
To mitigate the significant biases/errors in research studying the associations between PM and health, which are introduced by the coarse/inadequate assessments of PM exposure from conventional PM monitoring paradigm, a personalized monitoring system consisting of a low-cost wearable PM device is proposed. However, due to the absence of a unifying evaluation protocol for low-cost PM sensors, the evaluation results/performance specifications from existing studies/datasheets are of limited reference values when attempting to determine the best candidate for the proposed system. In this regard, the authors appeal to the research community to develop a standardized evaluation protocol for low-cost PM sensors/devices, and a unifying attempt is established in this manuscript by adopting the definitive terminology from international documents and the evaluation metrics regarded as best practices. Collocated on the rooftop of the HKUST Supersite, four empirically selected PM sensors were compared against each other and calibrated against two reference monitors. They were then evaluated against the reference following the protocol. The PlanTower PMS-A003 sensor was selected for the wearable device as it outperformed the others in terms of affordability, portability, detection capability, data quality, as well as humidity and condensation insusceptibility. An automated approach was proposed to identify and remove the condensation associated abnormal measurements. The proposed device has better affordability and portability as well as similar usability and data accessibility compared to those existing devices recognized. The first 10 devices were also evaluated and calibrated at the Supersite. Additional 120 units were manufactured and delivered to the subjects to acquire their daily PM2.5 exposures for investigating the association with subclinical atherosclerosis.
Submitted 11 July, 2022;
originally announced July 2022.
-
Formation of robust bound states of interacting microwave photons
Authors:
Alexis Morvan,
Trond I. Andersen,
Xiao Mi,
Charles Neill,
Andre Petukhov,
Kostyantyn Kechedzhi,
Dmitry Abanin,
Rajeev Acharya,
Frank Arute,
Kunal Arya,
Abraham Asfaw,
Juan Atalaya,
Ryan Babbush,
Dave Bacon,
Joseph C. Bardin,
Joao Basso,
Andreas Bengtsson,
Gina Bortoli,
Alexandre Bourassa,
Jenna Bovaird,
Leon Brill,
Michael Broughton,
Bob B. Buckley,
David A. Buell,
Tim Burger
, et al. (125 additional authors not shown)
Abstract:
Systems of correlated particles appear in many fields of science and represent some of the most intractable puzzles in nature. The computational challenge in these systems arises when interactions become comparable to other energy scales, which makes the state of each particle depend on all other particles. The lack of general solutions for the 3-body problem and acceptable theory for strongly correlated electrons shows that our understanding of correlated systems fades when the particle number or the interaction strength increases. One of the hallmarks of interacting systems is the formation of multi-particle bound states. In a ring of 24 superconducting qubits, we develop a high fidelity parameterizable fSim gate that we use to implement the periodic quantum circuit of the spin-1/2 XXZ model, an archetypal model of interaction. By placing microwave photons in adjacent qubit sites, we study the propagation of these excitations and observe their bound nature for up to 5 photons. We devise a phase sensitive method for constructing the few-body spectrum of the bound states and extract their pseudo-charge by introducing a synthetic flux. By introducing interactions between the ring and additional qubits, we observe an unexpected resilience of the bound states to integrability breaking. This finding goes against the common wisdom that bound states in non-integrable systems are unstable when their energies overlap with the continuum spectrum. Our work provides experimental evidence for bound states of interacting photons and discovers their stability beyond the integrability limit.
Submitted 21 December, 2022; v1 submitted 10 June, 2022;
originally announced June 2022.
-
Noise-resilient Edge Modes on a Chain of Superconducting Qubits
Authors:
Xiao Mi,
Michael Sonner,
Murphy Yuezhen Niu,
Kenneth W. Lee,
Brooks Foxen,
Rajeev Acharya,
Igor Aleiner,
Trond I. Andersen,
Frank Arute,
Kunal Arya,
Abraham Asfaw,
Juan Atalaya,
Ryan Babbush,
Dave Bacon,
Joseph C. Bardin,
Joao Basso,
Andreas Bengtsson,
Gina Bortoli,
Alexandre Bourassa,
Leon Brill,
Michael Broughton,
Bob B. Buckley,
David A. Buell,
Brian Burkett,
Nicholas Bushnell
, et al. (103 additional authors not shown)
Abstract:
Inherent symmetry of a quantum system may protect its otherwise fragile states. Leveraging such protection requires testing its robustness against uncontrolled environmental interactions. Using 47 superconducting qubits, we implement the one-dimensional kicked Ising model which exhibits non-local Majorana edge modes (MEMs) with $\mathbb{Z}_2$ parity symmetry. Remarkably, we find that any multi-qubit Pauli operator overlapping with the MEMs exhibits a uniform late-time decay rate comparable to single-qubit relaxation rates, irrespective of its size or composition. This characteristic allows us to accurately reconstruct the exponentially localized spatial profiles of the MEMs. Furthermore, the MEMs are found to be resilient against certain symmetry-breaking noise owing to a prethermalization mechanism. Our work elucidates the complex interplay between noise and symmetry-protected edge modes in a solid-state environment.
Submitted 8 December, 2022; v1 submitted 24 April, 2022;
originally announced April 2022.