-
Spatial-Frequency Dual Progressive Attention Network For Medical Image Segmentation
Authors:
Zhenhuan Zhou,
Along He,
Yanlin Wu,
Rui Yao,
Xueshuo Xie,
Tao Li
Abstract:
In medical images, various types of lesions often manifest significant differences in their shape and texture. Accurate medical image segmentation demands deep learning models with robust capabilities in multi-scale and boundary feature learning. However, previous networks still have limitations in addressing the above issues. Firstly, previous networks simultaneously fuse multi-level features or…
▽ More
In medical images, various types of lesions often manifest significant differences in their shape and texture. Accurate medical image segmentation demands deep learning models with robust capabilities in multi-scale and boundary feature learning. However, previous networks still have limitations in addressing the above issues. Firstly, previous networks simultaneously fuse multi-level features or employ deep supervision to enhance multi-scale learning. However, this may lead to feature redundancy and excessive computational overhead, which is not conducive to network training and clinical deployment. Secondly, the majority of medical image segmentation networks exclusively learn features in the spatial domain, disregarding the abundant global information in the frequency domain. This results in a bias towards low-frequency components, neglecting crucial high-frequency information. To address these problems, we introduce SF-UNet, a spatial-frequency dual-domain attention network. It comprises two main components: the Multi-scale Progressive Channel Attention (MPCA) block, which progressively extract multi-scale features across adjacent encoder layers, and the lightweight Frequency-Spatial Attention (FSA) block, with only 0.05M parameters, enabling concurrent learning of texture and boundary features from both spatial and frequency domains. We validate the effectiveness of the proposed SF-UNet on three public datasets. Experimental results show that compared to previous state-of-the-art (SOTA) medical image segmentation networks, SF-UNet achieves the best performance, and achieves up to 9.4\% and 10.78\% improvement in DSC and IOU. Codes will be released at https://github.com/nkicsl/SF-UNet.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
FRCNet Frequency and Region Consistency for Semi-supervised Medical Image Segmentation
Authors:
Along He,
Tao Li,
Yanlin Wu,
Ke Zou,
Huazhu Fu
Abstract:
Limited labeled data hinder the application of deep learning in medical domain. In clinical practice, there are sufficient unlabeled data that are not effectively used, and semi-supervised learning (SSL) is a promising way for leveraging these unlabeled data. However, existing SSL methods ignore frequency domain and region-level information and it is important for lesion regions located at low fre…
▽ More
Limited labeled data hinder the application of deep learning in medical domain. In clinical practice, there are sufficient unlabeled data that are not effectively used, and semi-supervised learning (SSL) is a promising way for leveraging these unlabeled data. However, existing SSL methods ignore frequency domain and region-level information and it is important for lesion regions located at low frequencies and with significant scale changes. In this paper, we introduce two consistency regularization strategies for semi-supervised medical image segmentation, including frequency domain consistency (FDC) to assist the feature learning in frequency domain and multi-granularity region similarity consistency (MRSC) to perform multi-scale region-level local context information feature learning. With the help of the proposed FDC and MRSC, we can leverage the powerful feature representation capability of them in an effective and efficient way. We perform comprehensive experiments on two datasets, and the results show that our method achieves large performance gains and exceeds other state-of-the-art methods.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis
Authors:
Andy He,
Darren Key,
Mason Bulling,
Andrew Chang,
Skyler Shapiro,
Everett Lee
Abstract:
Graphics Processing Units (GPUs) have become the leading hardware accelerator for deep learning applications and are used widely in training and inference of transformers; transformers have achieved state-of-the-art performance in many areas of machine learning and are especially used in most modern Large Language Models (LLMs). However, GPUs require large amounts of energy, which poses environmen…
▽ More
Graphics Processing Units (GPUs) have become the leading hardware accelerator for deep learning applications and are used widely in training and inference of transformers; transformers have achieved state-of-the-art performance in many areas of machine learning and are especially used in most modern Large Language Models (LLMs). However, GPUs require large amounts of energy, which poses environmental concerns, demands high operational costs, and causes GPUs to be unsuitable for edge computing. We develop an accelerator for transformers, namely, Llama 2, an open-source state-of-the-art LLM, using high level synthesis (HLS) on Field Programmable Gate Arrays (FPGAs). HLS allows us to rapidly prototype FPGA designs without writing code at the register-transfer level (RTL). We name our method HLSTransform, and the FPGA designs we synthesize with HLS achieve up to a 12.75x reduction and 8.25x reduction in energy used per token on the Xilinx Virtex UltraScale+ VU9P FPGA compared to an Intel Xeon Broadwell E5-2686 v4 CPU and NVIDIA RTX 3090 GPU respectively, while increasing inference speeds by up to 2.46x compared to CPU and maintaining 0.53x the speed of an RTX 3090 GPU despite the GPU's 4 times higher base clock rate. With the lack of existing open-source FPGA accelerators for transformers, we open-source our code and document our steps for synthesis. We hope this work will serve as a step in democratizing the use of FPGAs in transformer inference and inspire research into energy-efficient inference methods as a whole. The code can be found on https://github.com/HLSTransform/submission.
△ Less
Submitted 29 April, 2024;
originally announced May 2024.
-
Crushing Surfaces of Positive Genus
Authors:
Benjamin A. Burton,
Thiago de Paiva,
Alexander He,
Connie On Yu Hui
Abstract:
The operation of crushing a normal surface has proven to be a powerful tool in computational $3$-manifold topology, with applications both to triangulation complexity and to algorithms. The main difficulty with crushing is that it can drastically change the topology of a triangulation, so applications to date have been limited to relatively simple surfaces: $2$-spheres, discs, annuli, and closed b…
▽ More
The operation of crushing a normal surface has proven to be a powerful tool in computational $3$-manifold topology, with applications both to triangulation complexity and to algorithms. The main difficulty with crushing is that it can drastically change the topology of a triangulation, so applications to date have been limited to relatively simple surfaces: $2$-spheres, discs, annuli, and closed boundary-parallel surfaces. We give the first detailed analysis of the topological effects of crushing closed essential surfaces of positive genus. To showcase the utility of this new analysis, we use it to prove some results about how triangulation complexity interacts with JSJ decompositions and satellite knots; although similar applications can also be obtained using techniques of Matveev, our approach has the advantage that it avoids the machinery of almost simple spines and handle decompositions.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
MetaSplit: Meta-Split Network for Limited-Stock Product Recommendation
Authors:
Wenhao Wu,
Jialiang Zhou,
Ailong He,
Shuguang Han,
Jufeng Chen,
Bo Zheng
Abstract:
Compared to business-to-consumer (B2C) e-commerce systems, consumer-to-consumer (C2C) e-commerce platforms usually encounter the limited-stock problem, that is, a product can only be sold one time in a C2C system. This poses several unique challenges for click-through rate (CTR) prediction. Due to limited user interactions for each product (i.e. item), the corresponding item embedding in the CTR m…
▽ More
Compared to business-to-consumer (B2C) e-commerce systems, consumer-to-consumer (C2C) e-commerce platforms usually encounter the limited-stock problem, that is, a product can only be sold one time in a C2C system. This poses several unique challenges for click-through rate (CTR) prediction. Due to limited user interactions for each product (i.e. item), the corresponding item embedding in the CTR model may not easily converge. This makes the conventional sequence modeling based approaches cannot effectively utilize user history information since historical user behaviors contain a mixture of items with different volume of stocks. Particularly, the attention mechanism in a sequence model tends to assign higher score to products with more accumulated user interactions, making limited-stock products being ignored and contribute less to the final output. To this end, we propose the Meta-Split Network (MSNet) to split user history sequence regarding to the volume of stock for each product, and adopt differentiated modeling approaches for different sequences. As for the limited-stock products, a meta-learning approach is applied to address the problem of inconvergence, which is achieved by designing meta scaling and shifting networks with ID and side information. In addition, traditional approach can hardly update item embedding once the product is consumed. Thereby, we propose an auxiliary loss that makes the parameters updatable even when the product is no longer in distribution. To the best of our knowledge, this is the first solution addressing the recommendation of limited-stock product. Experimental results on the production dataset and online A/B testing demonstrate the effectiveness of our proposed method.
△ Less
Submitted 27 March, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Authors:
Gemini Team,
Petko Georgiev,
Ving Ian Lei,
Ryan Burnell,
Libin Bai,
Anmol Gulati,
Garrett Tanzer,
Damien Vincent,
Zhufeng Pan,
Shibo Wang,
Soroosh Mariooryad,
Yifan Ding,
Xinyang Geng,
Fred Alcober,
Roy Frostig,
Mark Omernick,
Lexi Walker,
Cosmin Paduraru,
Christina Sorokin,
Andrea Tacchetti,
Colin Gaffney,
Samira Daruki,
Olcan Sercinoglu,
Zach Gleicher,
Juliette Love
, et al. (1092 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…
▽ More
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
△ Less
Submitted 14 June, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
Case studies on time-dependent Ginzburg-Landau simulations for superconducting applications
Authors:
Cun Xue,
Qing-Yu Wang,
Han-Xi Ren,
An He,
A. V. Silhanek
Abstract:
The macroscopic electromagnetic properties of type II superconductors are primarily influenced by the behavior of microscopic superconducting flux quantum units. Time-dependent Ginzburg-Landau (TDGL) equations provide an elegant and powerful tool for describing and examining both the statics and dynamics of these superconducting entities. They have been instrumental in replicating and elucidating…
▽ More
The macroscopic electromagnetic properties of type II superconductors are primarily influenced by the behavior of microscopic superconducting flux quantum units. Time-dependent Ginzburg-Landau (TDGL) equations provide an elegant and powerful tool for describing and examining both the statics and dynamics of these superconducting entities. They have been instrumental in replicating and elucidating numerous experimental results over the past decades.This paper provides a comprehensive overview of the progress in TDGL simulations, focusing on three key aspects of superconductor applications. The initial section delves into vortex rectification in superconductors described within the TDGL framework. We specifically highlight the superconducting diode effect achieved through asymmetric pinning landscapes and the reversible manipulation of vortex ratchets with dynamic pinning landscapes. The subsequent section reviews the achievements of TDGL simulations concerning the critical current density of superconductors, emphasizing the optimization of pinning sites, particularly vortex pinning and dynamics in polycrystalline Nb$_3$Sn with grain boundaries. The third part concentrates on numerical modeling of vortex penetration and dynamics in superconducting radio frequency (SRF) cavities, including a discussion of superconductor insulator superconductor multilayer structures. In the last section, we present key findings, insights, and perspectives derived from the discussed simulations.
△ Less
Submitted 4 June, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
Applying Self-supervised Learning to Network Intrusion Detection for Network Flows with Graph Neural Network
Authors:
Renjie Xu,
Guangwei Wu,
Wei** Wang,
Xing Gao,
An He,
Zhengpeng Zhang
Abstract:
Graph Neural Networks (GNNs) have garnered intensive attention for Network Intrusion Detection System (NIDS) due to their suitability for representing the network traffic flows. However, most present GNN-based methods for NIDS are supervised or semi-supervised. Network flows need to be manually annotated as supervisory labels, a process that is time-consuming or even impossible, making NIDS diffic…
▽ More
Graph Neural Networks (GNNs) have garnered intensive attention for Network Intrusion Detection System (NIDS) due to their suitability for representing the network traffic flows. However, most present GNN-based methods for NIDS are supervised or semi-supervised. Network flows need to be manually annotated as supervisory labels, a process that is time-consuming or even impossible, making NIDS difficult to adapt to potentially complex attacks, especially in large-scale real-world scenarios. The existing GNN-based self-supervised methods focus on the binary classification of network flow as benign or not, and thus fail to reveal the types of attack in practice. This paper studies the application of GNNs to identify the specific types of network flows in an unsupervised manner. We first design an encoder to obtain graph embedding, that introduces the graph attention mechanism and considers the edge information as the only essential factor. Then, a self-supervised method based on graph contrastive learning is proposed. The method samples center nodes, and for each center node, generates subgraph by it and its direct neighbor nodes, and corresponding contrastive subgraph from the interpolated graph, and finally constructs positive and negative samples from subgraphs. Furthermore, a structured contrastive loss function based on edge features and graph local topology is introduced. To the best of our knowledge, it is the first GNN-based self-supervised method for the multiclass classification of network flows in NIDS. Detailed experiments conducted on four real-world databases (NF-Bot-IoT, NF-Bot-IoT-v2, NF-CSE-CIC-IDS2018, and NF-CSE-CIC-IDS2018-v2) systematically compare our model with the state-of-the-art supervised and self-supervised models, illustrating the considerable potential of our method. Our code is accessible through https://github.com/renj-xu/NEGSC.
△ Less
Submitted 3 March, 2024;
originally announced March 2024.
-
An algorithm to construct one-vertex triangulations of Heegaard splittings
Authors:
Alexander He,
James Morgan,
Em K. Thompson
Abstract:
Following work of Jaco and Rubinstein (2006), which (non-constructively) proved that any 3-manifold admits a one-vertex layered triangulation, we present an algorithm, with implementation using Regina, that uses a combinatorial presentation of a Heegaard diagram to construct a generalised notion of a layered triangulation. We show that work of Huszár and Spreer (2019) extends to our construction:…
▽ More
Following work of Jaco and Rubinstein (2006), which (non-constructively) proved that any 3-manifold admits a one-vertex layered triangulation, we present an algorithm, with implementation using Regina, that uses a combinatorial presentation of a Heegaard diagram to construct a generalised notion of a layered triangulation. We show that work of Huszár and Spreer (2019) extends to our construction: given a genus-$g$ Heegaard splitting, our algorithm generates a triangulation with cutwidth bounded above by $4g-2$. Beyond Heegaard splittings, our construction actually extends to a natural generalisation of Dehn fillings: given a one-vertex triangulation with a genus-$g$ boundary component $B$, we can construct a one-vertex triangulation of any 3-manifold obtained by filling $B$ with a handlebody. To demonstrate the usefulness of our algorithm, we present findings from preliminary computer searches using this algorithm.
△ Less
Submitted 19 May, 2024; v1 submitted 29 December, 2023;
originally announced December 2023.
-
Gemini: A Family of Highly Capable Multimodal Models
Authors:
Gemini Team,
Rohan Anil,
Sebastian Borgeaud,
Jean-Baptiste Alayrac,
Jiahui Yu,
Radu Soricut,
Johan Schalkwyk,
Andrew M. Dai,
Anja Hauth,
Katie Millican,
David Silver,
Melvin Johnson,
Ioannis Antonoglou,
Julian Schrittwieser,
Amelia Glaese,
Jilin Chen,
Emily Pitler,
Timothy Lillicrap,
Angeliki Lazaridou,
Orhan Firat,
James Molloy,
Michael Isard,
Paul R. Barham,
Tom Hennigan,
Benjamin Lee
, et al. (1325 additional authors not shown)
Abstract:
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…
▽ More
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI.
△ Less
Submitted 17 June, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Resfusion: Denoising Diffusion Probabilistic Models for Image Restoration Based on Prior Residual Noise
Authors:
Zhenning Shi,
Haoshuai Zheng,
Chen Xu,
Changsheng Dong,
Bin Pan,
Xueshuo Xie,
Along He,
Tao Li,
Huazhu Fu
Abstract:
Recently, research on denoising diffusion models has expanded its application to the field of image restoration. Traditional diffusion-based image restoration methods utilize degraded images as conditional input to effectively guide the reverse generation process, without modifying the original denoising diffusion process. However, since the degraded images already include low-frequency informatio…
▽ More
Recently, research on denoising diffusion models has expanded its application to the field of image restoration. Traditional diffusion-based image restoration methods utilize degraded images as conditional input to effectively guide the reverse generation process, without modifying the original denoising diffusion process. However, since the degraded images already include low-frequency information, starting from Gaussian white noise will result in increased sampling steps. We propose Resfusion, a general framework that incorporates the residual term into the diffusion forward process, starting the reverse process directly from the noisy degraded images. The form of our inference process is consistent with the DDPM. We introduced a weighted residual noise, named resnoise, as the prediction target and explicitly provide the quantitative relationship between the residual term and the noise term in resnoise. By leveraging a smooth equivalence transformation, Resfusion determine the optimal acceleration step and maintains the integrity of existing noise schedules, unifying the training and inference processes. The experimental results demonstrate that Resfusion exhibits competitive performance on ISTD dataset, LOL dataset and Raindrop dataset with only five sampling steps. Furthermore, Resfusion can be easily applied to image generation and emerges with strong versatility. Our code and model are available at https://github.com/nkicsl/Resfusion.
△ Less
Submitted 20 May, 2024; v1 submitted 24 November, 2023;
originally announced November 2023.
-
DeepMartNet -- A Martingale Based Deep Neural Network Learning Method for Dirichlet BVPs and Eigenvalue Problems of Elliptic PDEs in R^d
Authors:
Wei Cai,
Andrew He,
Daniel Margolis
Abstract:
In this paper, we propose DeepMartNet - a Martingale based deep neural network learning method for solving Dirichlet boundary value problems (BVPs) and eigenvalue problems for elliptic partial differential equations (PDEs) in high dimensions or domains with complex geometries. The method is based on Varadhan's Martingale problem formulation for the BVPs/eigenvalue problems where a loss function en…
▽ More
In this paper, we propose DeepMartNet - a Martingale based deep neural network learning method for solving Dirichlet boundary value problems (BVPs) and eigenvalue problems for elliptic partial differential equations (PDEs) in high dimensions or domains with complex geometries. The method is based on Varadhan's Martingale problem formulation for the BVPs/eigenvalue problems where a loss function enforcing the Martingale property for the PDE solution is used for an efficient optimization by sampling the stochastic processes associated with corresponding elliptic operators. High dimensional numerical results for BVPs of the linear and nonlinear Poisson-Boltzmann equation and eigenvalue problems of the Laplace equation and a Fokker-Planck equation demonstrate the capability of the proposed DeepMartNet learning method in solving high dimensional PDE problems.
△ Less
Submitted 20 December, 2023; v1 submitted 15 November, 2023;
originally announced November 2023.
-
The Human Behind the Data: Reflections from an Ongoing Co-Design and Deployment of a Data-Navigation Interface for Front-Line Emergency Housing Shelter Staff
Authors:
Teale W Masrani,
Helen Ai He,
Geoffrey Messier
Abstract:
On any night in Canada, at least 35,000 individuals experience homelessness. These individuals use emergency shelters to transition out of homelessness and into permanent housing. We designed and deployed a technology to support front-line staff at the largest emergency housing shelter in Calgary, Canada. Over a period of five months in 2022, we worked closely with front-line staff to co-design an…
▽ More
On any night in Canada, at least 35,000 individuals experience homelessness. These individuals use emergency shelters to transition out of homelessness and into permanent housing. We designed and deployed a technology to support front-line staff at the largest emergency housing shelter in Calgary, Canada. Over a period of five months in 2022, we worked closely with front-line staff to co-design an interface for supporting a holistic understanding of client context and facilitating decision-making. The tool is currently in-use and our collaboration is ongoing. In this paper, we reflect on preliminary findings regarding the second iteration of the tool. We find that supporting shelter staff in understanding the human behind the data was a critical component of design. This work contributes to literature on how data tools may be integrated into homeless shelters in a way that aligns with shelters' values.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Self-Interacting Neutrinos in Light of Large-Scale Structure Data
Authors:
Adam He,
Rui An,
Mikhail M. Ivanov,
Vera Gluscevic
Abstract:
We explore a self-interacting neutrino cosmology in which neutrinos experience a delayed onset of free-streaming. We use the effective field theory of large-scale structure (LSS) to model matter distribution on mildly non-linear scales within the self-interacting neutrino cosmology for the first time. We perform the first combined likelihood analysis of BOSS full-shape galaxy clustering, weak lens…
▽ More
We explore a self-interacting neutrino cosmology in which neutrinos experience a delayed onset of free-streaming. We use the effective field theory of large-scale structure (LSS) to model matter distribution on mildly non-linear scales within the self-interacting neutrino cosmology for the first time. We perform the first combined likelihood analysis of BOSS full-shape galaxy clustering, weak lensing, and Lyman-$α$ forest measurements, together with the cosmic microwave background (CMB) data from Planck. We find that the full data set strongly favors presence of a flavor-universal neutrino self-interaction, with a characteristic energy scale of order $10$ MeV. The preference is at the $\sim 5σ$ level and is primarily driven by the Lyman-$α$ forest measurements and, to a lesser extent, the weak lensing data from DES. The self-interacting neutrino model eases both the Hubble tension and the $S_8$ tension between different cosmological data sets, but it does not resolve either. Finally, we note a preference for a non-zero sum of neutrino masses at the level of $\sim 0.3$ eV under this model, consistent with previous bounds. These results call for further investigation in several directions, and may have significant implications for neutrino physics and for future new-physics searches with galaxy surveys.
△ Less
Submitted 30 May, 2024; v1 submitted 7 September, 2023;
originally announced September 2023.
-
BridgeData V2: A Dataset for Robot Learning at Scale
Authors:
Homer Walke,
Kevin Black,
Abraham Lee,
Moo ** Kim,
Max Du,
Chongyi Zheng,
Tony Zhao,
Philippe Hansen-Estruch,
Quan Vuong,
Andre He,
Vivek Myers,
Kuan Fang,
Chelsea Finn,
Sergey Levine
Abstract:
We introduce BridgeData V2, a large and diverse dataset of robotic manipulation behaviors designed to facilitate research on scalable robot learning. BridgeData V2 contains 60,096 trajectories collected across 24 environments on a publicly available low-cost robot. BridgeData V2 provides extensive task and environment variability, leading to skills that can generalize across environments, domains,…
▽ More
We introduce BridgeData V2, a large and diverse dataset of robotic manipulation behaviors designed to facilitate research on scalable robot learning. BridgeData V2 contains 60,096 trajectories collected across 24 environments on a publicly available low-cost robot. BridgeData V2 provides extensive task and environment variability, leading to skills that can generalize across environments, domains, and institutions, making the dataset a useful resource for a broad range of researchers. Additionally, the dataset is compatible with a wide variety of open-vocabulary, multi-task learning methods conditioned on goal images or natural language instructions. In our experiments, we train 6 state-of-the-art imitation learning and offline reinforcement learning methods on our dataset, and find that they succeed on a suite of tasks requiring varying amounts of generalization. We also demonstrate that the performance of these methods improves with more data and higher capacity models, and that training on a greater variety of skills leads to improved generalization. By publicly sharing BridgeData V2 and our pre-trained models, we aim to accelerate research in scalable robot learning methods. Project page at https://rail-berkeley.github.io/bridgedata
△ Less
Submitted 17 January, 2024; v1 submitted 24 August, 2023;
originally announced August 2023.
-
DVPT: Dynamic Visual Prompt Tuning of Large Pre-trained Models for Medical Image Analysis
Authors:
Along He,
Kai Wang,
Zhihong Wang,
Tao Li,
Huazhu Fu
Abstract:
Limited labeled data makes it hard to train models from scratch in medical domain, and an important paradigm is pre-training and then fine-tuning. Large pre-trained models contain rich representations, which can be adapted to downstream medical tasks. However, existing methods either tune all the parameters or the task-specific layers of the pre-trained models, ignoring the input variations of med…
▽ More
Limited labeled data makes it hard to train models from scratch in medical domain, and an important paradigm is pre-training and then fine-tuning. Large pre-trained models contain rich representations, which can be adapted to downstream medical tasks. However, existing methods either tune all the parameters or the task-specific layers of the pre-trained models, ignoring the input variations of medical images, and thus they are not efficient or effective. In this work, we aim to study parameter-efficient fine-tuning (PEFT) for medical image analysis, and propose a dynamic visual prompt tuning method, named DVPT. It can extract knowledge beneficial to downstream tasks from large models with a few trainable parameters. Firstly, the frozen features are transformed by an lightweight bottleneck layer to learn the domain-specific distribution of downstream medical tasks, and then a few learnable visual prompts are used as dynamic queries and then conduct cross-attention with the transformed features, attempting to acquire sample-specific knowledge that are suitable for each sample. Finally, the features are projected to original feature dimension and aggregated with the frozen features. This DVPT module can be shared between different Transformer layers, further reducing the trainable parameters. To validate DVPT, we conduct extensive experiments with different pre-trained models on medical classification and segmentation tasks. We find such PEFT method can not only efficiently adapt the pre-trained models to the medical domain, but also brings data efficiency with partial labeled data. For example, with 0.5\% extra trainable parameters, our method not only outperforms state-of-the-art PEFT methods, even surpasses the full fine-tuning by more than 2.20\% Kappa score on medical classification task. It can saves up to 60\% labeled data and 99\% storage cost of ViT-B/16.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Authors:
Vivek Myers,
Andre He,
Kuan Fang,
Homer Walke,
Philippe Hansen-Estruch,
Ching-An Cheng,
Mihai Jalobeanu,
Andrey Kolobov,
Anca Dragan,
Sergey Levine
Abstract:
Our goal is for robots to follow natural language instructions like "put the towel next to the microwave." But getting large amounts of labeled data, i.e. data that contains demonstrations of tasks labeled with the language instruction, is prohibitive. In contrast, obtaining policies that respond to image goals is much easier, because any autonomous trial or demonstration can be labeled in hindsig…
▽ More
Our goal is for robots to follow natural language instructions like "put the towel next to the microwave." But getting large amounts of labeled data, i.e. data that contains demonstrations of tasks labeled with the language instruction, is prohibitive. In contrast, obtaining policies that respond to image goals is much easier, because any autonomous trial or demonstration can be labeled in hindsight with its final state as the goal. In this work, we contribute a method that taps into joint image- and goal- conditioned policies with language using only a small amount of language data. Prior work has made progress on this using vision-language models or by jointly training language-goal-conditioned policies, but so far neither method has scaled effectively to real-world robot tasks without significant human annotation. Our method achieves robust performance in the real world by learning an embedding from the labeled data that aligns language not to the goal image, but rather to the desired change between the start and goal images that the instruction corresponds to. We then train a policy on this embedding: the policy benefits from all the unlabeled data, but the aligned embedding provides an interface for language to steer the policy. We show instruction following across a variety of manipulation tasks in different scenes, with generalization to language instructions outside of the labeled data. Videos and code for our approach can be found on our website: https://rail-berkeley.github.io/grif/ .
△ Less
Submitted 17 August, 2023; v1 submitted 30 June, 2023;
originally announced July 2023.
-
A Novel Two-level Causal Inference Framework for On-road Vehicle Quality Issues Diagnosis
Authors:
Qian Wang,
Huanyi Shui,
Thi Tu Trinh Tran,
Milad Zafar Nezhad,
Devesh Upadhyay,
Kamran Paynabar,
Anqi He
Abstract:
In the automotive industry, the full cycle of managing in-use vehicle quality issues can take weeks to investigate. The process involves isolating root causes, defining and implementing appropriate treatments, and refining treatments if needed. The main pain-point is the lack of a systematic method to identify causal relationships, evaluate treatment effectiveness, and direct the next actionable t…
▽ More
In the automotive industry, the full cycle of managing in-use vehicle quality issues can take weeks to investigate. The process involves isolating root causes, defining and implementing appropriate treatments, and refining treatments if needed. The main pain-point is the lack of a systematic method to identify causal relationships, evaluate treatment effectiveness, and direct the next actionable treatment if the current treatment was deemed ineffective. This paper will show how we leverage causal Machine Learning (ML) to speed up such processes. A real-word data set collected from on-road vehicles will be used to demonstrate the proposed framework. Open challenges for vehicle quality applications will also be discussed.
△ Less
Submitted 31 March, 2023;
originally announced April 2023.
-
Lyapunov Exponents and Phase Transitions of Born-Infeld AdS Black Holes
Authors:
Shaojie Yang,
Jun Tao,
Benrong Mu,
Aoyun He
Abstract:
In this paper, we characterize the phase transitons of Born-Infeld AdS black holes in terms of Lyapunov exponents. We calculate the Lyapunov exponents for both null and timelike geodesics. It is found that black hole phase transitions can be described by multiple-valued Lyapunov exponents. And its phase diagram can be characterized by Lyapunov exponents and Hawking temperature. Besides, the change…
▽ More
In this paper, we characterize the phase transitons of Born-Infeld AdS black holes in terms of Lyapunov exponents. We calculate the Lyapunov exponents for both null and timelike geodesics. It is found that black hole phase transitions can be described by multiple-valued Lyapunov exponents. And its phase diagram can be characterized by Lyapunov exponents and Hawking temperature. Besides, the change of Lyapunov exponents can be considered as order parameter, and exists a critical exponent $1/2$ near critical point.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Chinese Intermediate English Learners outdid ChatGPT in deep cohesion: Evidence from English narrative writing
Authors:
Tongquan Zhou,
Siyi Cao,
Siruo Zhou,
Yao Zhang,
Ai**g He
Abstract:
ChatGPT is a publicly available chatbot that can quickly generate texts on given topics, but it is unknown whether the chatbot is really superior to human writers in all aspects of writing and whether its writing quality can be prominently improved on the basis of updating commands. Consequently, this study compared the writing performance on a narrative topic by ChatGPT and Chinese intermediate E…
▽ More
ChatGPT is a publicly available chatbot that can quickly generate texts on given topics, but it is unknown whether the chatbot is really superior to human writers in all aspects of writing and whether its writing quality can be prominently improved on the basis of updating commands. Consequently, this study compared the writing performance on a narrative topic by ChatGPT and Chinese intermediate English (CIE) learners so as to reveal the chatbot's advantage and disadvantage in writing. The data were analyzed in terms of five discourse components using Coh-Metrix (a special instrument for analyzing language discourses), and the results revealed that ChatGPT performed better than human writers in narrativity, word concreteness, and referential cohesion, but worse in syntactic simplicity and deep cohesion in its initial version. After more revision commands were updated, while the resulting version was facilitated in syntactic simplicity, yet it is still lagged far behind CIE learners' writing in deep cohesion. In addition, the correlation analysis of the discourse components suggests that narrativity was correlated with referential cohesion in both ChatGPT and human writers, but the correlations varied within each group.
△ Less
Submitted 21 March, 2023;
originally announced March 2023.
-
Finding large counterexamples by selectively exploring the Pachner graph
Authors:
Benjamin A. Burton,
Alexander He
Abstract:
We often rely on censuses of triangulations to guide our intuition in $3$-manifold topology. However, this can lead to misplaced faith in conjectures if the smallest counterexamples are too large to appear in our census. Since the number of triangulations increases super-exponentially with size, there is no way to expand a census beyond relatively small triangulations; the current census only goes…
▽ More
We often rely on censuses of triangulations to guide our intuition in $3$-manifold topology. However, this can lead to misplaced faith in conjectures if the smallest counterexamples are too large to appear in our census. Since the number of triangulations increases super-exponentially with size, there is no way to expand a census beyond relatively small triangulations; the current census only goes up to $10$ tetrahedra. Here, we show that it is feasible to search for large and hard-to-find counterexamples by using heuristics to selectively (rather than exhaustively) enumerate triangulations. We use this idea to find counterexamples to three conjectures which ask, for certain $3$-manifolds, whether one-vertex triangulations always have a "distinctive" edge that would allow us to recognise the $3$-manifold.
△ Less
Submitted 6 March, 2024; v1 submitted 11 March, 2023;
originally announced March 2023.
-
Pseudo Quantum Random Number Generator with Quantum Permutation Pad
Authors:
Randy Kuang,
Dafu Lou,
Alex He,
Chris McKenzie,
Michael Redding
Abstract:
Cryptographic random number generation is critical for any quantum safe encryption. Based on the natural uncertainty of some quantum processes, variety of quantum random number generators or QRNGs have been created with physical quantum processes. They generally generate random numbers with good unpredictable randomness. Of course, physical QRNGs are costic and require physical integrations with c…
▽ More
Cryptographic random number generation is critical for any quantum safe encryption. Based on the natural uncertainty of some quantum processes, variety of quantum random number generators or QRNGs have been created with physical quantum processes. They generally generate random numbers with good unpredictable randomness. Of course, physical QRNGs are costic and require physical integrations with computing systems. This paper proposes a pseudo quantum random number generator with a quantum algorithm called quantum permutation pad or QPP, leveraging the high entropy of quantum permutation space its bijective transformation. Unlike the Boolean algebra where the size of information space is 2n for an n-bit system, an n-bit quantum permutation space consists of 2n! quantum permutation matrices, representing all quantum permutation gates over an n-bit computational basis. This permutation space holds an equivalent Shannon information entropy log_2(2^n!). A QPP can be used to create a pseudo QRNG or pQRNG capable integrated with any classical computing system or directly with any application for good quality deterministic random number generation. Using a QPP pad with 64 8-bit permuation matrices, pQRNG holds 107,776 bits of entropy for the pseudo random number generation, comparing with 4096 bits of entropy in Linux /dev/random. It can be used as a deterministic PRNG or entropy booster of other PRNGs. It can also be used as a whitening algorithm for any hardware random number generator including QRNG without discarding physical bias bits.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
Delving into Identify-Emphasize Paradigm for Combating Unknown Bias
Authors:
Bowen Zhao,
Chen Chen,
Qian-Wei Wang,
Anfeng He,
Shu-Tao Xia
Abstract:
Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still plagued by two challenges: A, the quality of the identified bias-conflicting samples is far from satisfactory; B, the emphasizing strategies only produce suboptimal performance. In this paper…
▽ More
Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still plagued by two challenges: A, the quality of the identified bias-conflicting samples is far from satisfactory; B, the emphasizing strategies only produce suboptimal performance. In this paper, for challenge A, we propose an effective bias-conflicting scoring method (ECS) to boost the identification accuracy, along with two practical strategies -- peer-picking and epoch-ensemble. For challenge B, we point out that the gradient contribution statistics can be a reliable indicator to inspect whether the optimization is dominated by bias-aligned samples. Then, we propose gradient alignment (GA), which employs gradient statistics to balance the contributions of the mined bias-aligned and bias-conflicting samples dynamically throughout the learning process, forcing models to leverage intrinsic features to make fair decisions. Furthermore, we incorporate self-supervised (SS) pretext tasks into training, which enable models to exploit richer features rather than the simple shortcuts, resulting in more robust models. Experiments are conducted on multiple datasets in various settings, demonstrating that the proposed solution can mitigate the impact of unknown biases and achieve state-of-the-art performance.
△ Less
Submitted 22 February, 2023;
originally announced February 2023.
-
$S_8$ Tension in the Context of Dark Matter-Baryon Scattering
Authors:
Adam He,
Mikhail M. Ivanov,
Rui An,
Vera Gluscevic
Abstract:
We explore an interacting dark matter (IDM) model that allows for a fraction of dark matter (DM) to undergo velocity-independent scattering with baryons. In this scenario, structure on small scales is suppressed relative to the cold DM scenario. Using the effective field theory of large-scale structure, we perform the first systematic analysis of BOSS full-shape galaxy clustering data for the IDM…
▽ More
We explore an interacting dark matter (IDM) model that allows for a fraction of dark matter (DM) to undergo velocity-independent scattering with baryons. In this scenario, structure on small scales is suppressed relative to the cold DM scenario. Using the effective field theory of large-scale structure, we perform the first systematic analysis of BOSS full-shape galaxy clustering data for the IDM scenario, and we find that this model alleviates the $S_8$ tension between large-scale structure and Planck data. Adding the $S_8$ prior from DES to our analysis further leads to a mild $\sim3σ$ preference for a non-vanishing DM-baryon scattering cross-section, assuming $\sim 10\%$ of DM is interacting and has a particle mass of 1 MeV. This result produces a modest $\sim 20$% suppression of the linear power at $k\lesssim 1~h$/Mpc, consistent with other small-scale structure observations. Similar scale-dependent power suppression was previously shown to have the potential to resolve $S_8$ tension between cosmological data sets. The validity of the specific IDM model explored here will be critically tested with upcoming galaxy surveys at the interaction level needed to alleviate the $S_8$ tension.
△ Less
Submitted 30 May, 2023; v1 submitted 19 January, 2023;
originally announced January 2023.
-
FIPS Compliant Quantum Secure Communication using Quantum Permutation Pad
Authors:
Alex He,
Dafu Lou,
Eric She,
Shangjie Guo,
Hareesh Watson,
Sibyl Weng,
Maria Perepechaenko,
Rand Kuang
Abstract:
Quantum computing has entered fast development track since Shor's algorithm was proposed in 1994. Multi-cloud services of quantum computing farms are currently available. One of which, IBM quantum computing, presented a road map showing their Kookaburra system with over 4158 qubits will be available in 2025. For the standardization of Post-Quantum Cryptography or PQC, the National Institute of Sta…
▽ More
Quantum computing has entered fast development track since Shor's algorithm was proposed in 1994. Multi-cloud services of quantum computing farms are currently available. One of which, IBM quantum computing, presented a road map showing their Kookaburra system with over 4158 qubits will be available in 2025. For the standardization of Post-Quantum Cryptography or PQC, the National Institute of Standards and Technology or NIST recently announced the first candidates for standardization with one algorithm for key encapsulation mechanism (KEM), Kyber, and three algorithms for digital signatures. NIST has also issued a new call for quantum-safe digital signature algorithms due June 1, 2023. This timeline shows that FIPS-certified quantum-safe TLS protocol would take a predictably long time. However, "steal now, crack later" tactic requires protecting data against future quantum threat actors today. NIST recommended the use of a hybrid mode of TLS 1.3 with its extensions to support PQC. The hybrid mode works for certain cases but FIPS certification for the hybridized cryptomodule might still be required. This paper proposes to take a nested mode to enable TLS 1.3 protocol with quantum-safe data, which can be made available today and is FIPS compliant. We discussed the performance impacts of the handshaking phase of the nested TLS 1.3 with PQC and the symmetric encryption phase. The major impact on performance using the nested mode is in the data symmetric encryption with AES. To overcome this performance reduction, we suggest using quantum encryption with a quantum permutation pad for the data encryption with a minor performance reduction of less than 10 percent.
△ Less
Submitted 28 December, 2023; v1 submitted 30 December, 2022;
originally announced January 2023.
-
FAF: A novel multimodal emotion recognition approach integrating face, body and text
Authors:
Zhongyu Fang,
Aoyun He,
Qihui Yu,
Baopeng Gao,
Wei** Ding,
Tong Zhang,
Lei Ma
Abstract:
Multimodal emotion analysis performed better in emotion recognition depending on more comprehensive emotional clues and multimodal emotion dataset. In this paper, we developed a large multimodal emotion dataset, named "HED" dataset, to facilitate the emotion recognition task, and accordingly propose a multimodal emotion recognition method. To promote recognition accuracy, "Feature After Feature" f…
▽ More
Multimodal emotion analysis performed better in emotion recognition depending on more comprehensive emotional clues and multimodal emotion dataset. In this paper, we developed a large multimodal emotion dataset, named "HED" dataset, to facilitate the emotion recognition task, and accordingly propose a multimodal emotion recognition method. To promote recognition accuracy, "Feature After Feature" framework was used to explore crucial emotional information from the aligned face, body and text samples. We employ various benchmarks to evaluate the "HED" dataset and compare the performance with our method. The results show that the five classification accuracy of the proposed multimodal fusion method is about 83.75%, and the performance is improved by 1.83%, 9.38%, and 21.62% respectively compared with that of individual modalities. The complementarity between each channel is effectively used to improve the performance of emotion recognition. We had also established a multimodal online emotion prediction platform, aiming to provide free emotion prediction to more users.
△ Less
Submitted 20 November, 2022;
originally announced November 2022.
-
Neural Unsupervised Reconstruction of Protolanguage Word Forms
Authors:
Andre He,
Nicholas Tomlin,
Dan Klein
Abstract:
We present a state-of-the-art neural approach to the unsupervised reconstruction of ancient word forms. Previous work in this domain used expectation-maximization to predict simple phonological changes between ancient word forms and their cognates in modern languages. We extend this work with neural models that can capture more complicated phonological and morphological changes. At the same time,…
▽ More
We present a state-of-the-art neural approach to the unsupervised reconstruction of ancient word forms. Previous work in this domain used expectation-maximization to predict simple phonological changes between ancient word forms and their cognates in modern languages. We extend this work with neural models that can capture more complicated phonological and morphological changes. At the same time, we preserve the inductive biases from classical methods by building monotonic alignment constraints into the model and deliberately underfitting during the maximization step. We evaluate our performance on the task of reconstructing Latin from a dataset of cognates across five Romance languages, achieving a notable reduction in edit distance from the target word forms compared to previous methods.
△ Less
Submitted 16 November, 2022;
originally announced November 2022.
-
gMeta: Template-based Regular Expression Generation over Noisy Examples
Authors:
Shujun Wang,
Yongqiang Tian andDengcheng He
Abstract:
Regular expressions (regexes) are widely used in different fields of computer science, such as programming languages, string processing, and databases. However, existing tools for synthesizing or repairing regexes always assume that the input examples are faultless. In real industrial scenarios, this assumption does not entirely hold. Thus, this paper presents a simple but effective templated-base…
▽ More
Regular expressions (regexes) are widely used in different fields of computer science, such as programming languages, string processing, and databases. However, existing tools for synthesizing or repairing regexes always assume that the input examples are faultless. In real industrial scenarios, this assumption does not entirely hold. Thus, this paper presents a simple but effective templated-based approach to generate regular expressions over noisy examples. Specifically, we present a data model (i.e., MetaParam) to extract features of strings for clustering all examples. Then, we propose a practical dynamic thresholding scheme to filter out anomalous examples via detecting knee points on CDF graphs. Finally, we design a template-based algorithm to translate a finite of positve examples to regular expression, which is efficient, interpretable, and extensible. We performed an experimental evaluation on four different extraction tasks applied to real-world datasets and obtained promising results in terms of F-measure. Moreover, gMeta achieves excellent results in real industrial scenarios.
△ Less
Submitted 31 October, 2022; v1 submitted 30 October, 2022;
originally announced October 2022.
-
Representing Marginalized Populations: Challenges in Anthropographics
Authors:
Priya Dhawka,
Helen Ai He,
Wesley Willett
Abstract:
Anthropographics are human-shaped visualizations that have primarily been used within visualization research and data journalism to show humanitarian and demographic data. However, anthropographics have typically been produced by a small group of designers, researchers, and journalists, and most use homogeneous representations of marginalized populations-representations that might have problematic…
▽ More
Anthropographics are human-shaped visualizations that have primarily been used within visualization research and data journalism to show humanitarian and demographic data. However, anthropographics have typically been produced by a small group of designers, researchers, and journalists, and most use homogeneous representations of marginalized populations-representations that might have problematic implications for how viewers perceive the people they represent. In this paper, we use a critical lens to examine anthropographic visualization practices in projects about marginalized populations. We present critiques that identify three potential challenges related to the use of anthropographics and highlight possible unintended consequences-namely (1) creating homogeneous depictions of marginalized populations, (2) treating marginalization as an inclusion criteria, and (3) insufficiently contextualizing datasets about marginalization. Finally, we highlight opportunities for anthropographics research, including the need to develop techniques for representing demographic differences between marginalized populations and for studies exploring other potential effects of anthropographics.
△ Less
Submitted 6 October, 2022; v1 submitted 5 October, 2022;
originally announced October 2022.
-
Monolithic Integration of Embedded III-V Lasers on SOI
Authors:
Wen Qi Wei,
An He,
Bo Yang,
**g-Zhi Huang,
Dong Han,
Min Ming,
Zi Hao Wang,
Xuhan Guo,
Yikai Su,
Jian Jun Zhang,
Ting Wang
Abstract:
Silicon photonic integration has gained great success in many application fields owing to the excellent optical device properties and complementary metal-oxide semiconductor (CMOS) compatibility. Realizing monolithic integration of III-V lasers and silicon photonic components on single silicon wafer is recognized as a long-standing obstacle for ultra-dense photonic integration, which can provide c…
▽ More
Silicon photonic integration has gained great success in many application fields owing to the excellent optical device properties and complementary metal-oxide semiconductor (CMOS) compatibility. Realizing monolithic integration of III-V lasers and silicon photonic components on single silicon wafer is recognized as a long-standing obstacle for ultra-dense photonic integration, which can provide considerable economical, energy efficient and foundry-scalable on-chip light sources, that has not been reported yet. Here, we demonstrate embedded InAs/GaAs quantum dot (QD) lasers directly grown on trenched silicon-on-insulator (SOI) substrate, enabling monolithic integration with butt-coupled silicon waveguides. By utilizing the patterned grating structures inside pre-defined SOI trenches and unique epitaxial method via molecular beam epitaxy (MBE), high-performance embedded InAs QD lasers with out-coupled silicon waveguide are achieved on such template. By resolving the epitaxy and fabrication challenges in such monolithic integrated architecture, embedded III-V lasers on SOI with continuous-wave lasing up to 85 oC are obtained. The maximum output power of 6.8 mW can be measured from the end tip of the butt-coupled silicon waveguides, with estimated coupling efficiency of approximately -7.35 dB. The results presented here provide a scalable and low-cost epitaxial method for realization of on-chip light sources directly coupling to the silicon photonic components for future high-density photonic integration.
△ Less
Submitted 16 July, 2022;
originally announced July 2022.
-
Progressive Multi-scale Consistent Network for Multi-class Fundus Lesion Segmentation
Authors:
Along He,
Kai Wang,
Tao Li,
Wang Bo,
Hong Kang,
Huazhu Fu
Abstract:
Effectively integrating multi-scale information is of considerable significance for the challenging multi-class segmentation of fundus lesions because different lesions vary significantly in scales and shapes. Several methods have been proposed to successfully handle the multi-scale object segmentation. However, two issues are not considered in previous studies. The first is the lack of interactio…
▽ More
Effectively integrating multi-scale information is of considerable significance for the challenging multi-class segmentation of fundus lesions because different lesions vary significantly in scales and shapes. Several methods have been proposed to successfully handle the multi-scale object segmentation. However, two issues are not considered in previous studies. The first is the lack of interaction between adjacent feature levels, and this will lead to the deviation of high-level features from low-level features and the loss of detailed cues. The second is the conflict between the low-level and high-level features, this occurs because they learn different scales of features, thereby confusing the model and decreasing the accuracy of the final prediction. In this paper, we propose a progressive multi-scale consistent network (PMCNet) that integrates the proposed progressive feature fusion (PFF) block and dynamic attention block (DAB) to address the aforementioned issues. Specifically, PFF block progressively integrates multi-scale features from adjacent encoding layers, facilitating feature learning of each layer by aggregating fine-grained details and high-level semantics. As features at different scales should be consistent, DAB is designed to dynamically learn the attentive cues from the fused features at different scales, thus aiming to smooth the essential conflicts existing in multi-scale features. The two proposed PFF and DAB blocks can be integrated with the off-the-shelf backbone networks to address the two issues of multi-scale and feature inconsistency in the multi-class segmentation of fundus lesions, which will produce better feature representation in the feature space. Experimental results on three public datasets indicate that the proposed method is more effective than recent state-of-the-art methods.
△ Less
Submitted 31 May, 2022;
originally announced May 2022.
-
Impact of strain and field ramp functional form on thermomagnetic instabilities in composite Nb3Sn wires with multi-filaments inside the superconducting coil
Authors:
Q. -Y. Wang,
J. -B. Li,
A. He,
W. Liu,
C. Xue,
Y. -H. Zhou
Abstract:
We theoretically analyze the effects of Cu/SC (superconductor) ratio, strain, and the ramp path on thermomagnetic instabilities of the superconducting coil. By considering the multi-filamentary structures, we find that a lower Cu/SC ratio leads to higher temperature peaks. The strain causes a higher frequency of flux jumps and higher voltage peaks. The temperatures recover to working temperature m…
▽ More
We theoretically analyze the effects of Cu/SC (superconductor) ratio, strain, and the ramp path on thermomagnetic instabilities of the superconducting coil. By considering the multi-filamentary structures, we find that a lower Cu/SC ratio leads to higher temperature peaks. The strain causes a higher frequency of flux jumps and higher voltage peaks. The temperatures recover to working temperature more difficultly and SC wires quench earlier in the presence of strain. For the jagged ramp cases, few flux jumps occur at the decreasing branch, whereas frequent flux jumps can be observed promptly when the applied current exceeds the pre-existing peak. Our simulated results agree well with experimental observations in Nb3Sn coils. Additionally, unlike the pulsed flux jumps observed in linear ramp cases, giant and prolonged flux jumps are observed at the increasing branch with a sinusoidal ramp path when the applied current is sufficiently large.
△ Less
Submitted 25 May, 2022;
originally announced May 2022.
-
Effects of Born-Infeld electrodynamics on black hole shadows
Authors:
Aoyun He,
Jun Tao,
Peng Wang,
Yadong Xue,
Lingkai Zhang
Abstract:
In this work, we study the shadow of Born-Infeld (BI) black holes with magnetic monopoles and Schwarzschild black holes immersed in the BI uniform magnetic field. Illuminated by a celestial sphere, black hole images are obtained by using the backward ray-tracing method. For magnetically charged BI black holes, we find that the shadow radius increases with the increase of nonlinear electromagnetics…
▽ More
In this work, we study the shadow of Born-Infeld (BI) black holes with magnetic monopoles and Schwarzschild black holes immersed in the BI uniform magnetic field. Illuminated by a celestial sphere, black hole images are obtained by using the backward ray-tracing method. For magnetically charged BI black holes, we find that the shadow radius increases with the increase of nonlinear electromagnetics effects. For Schwarzschild black holes immersed in the BI uniform magnetic field, photons tend to move towards the axis of symmetric, resulting in stretched shadows along the equatorial plane.
△ Less
Submitted 29 May, 2022; v1 submitted 25 May, 2022;
originally announced May 2022.
-
High-accuracy three-dimensional surface detection in smoothed particle hydrodynamics for free-surface flows
Authors:
Wen-Bin Liu,
Dong-Jun Ma,
Jian-Zhen Qian,
Ming-Yu Zhang,
An-Min He,
Nan-Sheng Liu,
Pei Wang
Abstract:
In this study, we investigate high-accuracy three-dimensional surface detection in smoothed particle hydrodynamics for free-surface flows. A new geometrical method is first developed to enhance the accuracy of free-surface particle detection in complex flows. This method detects free-surface particles via continuous global scanning inside the sphere of a particle through a cone region whose vertex…
▽ More
In this study, we investigate high-accuracy three-dimensional surface detection in smoothed particle hydrodynamics for free-surface flows. A new geometrical method is first developed to enhance the accuracy of free-surface particle detection in complex flows. This method detects free-surface particles via continuous global scanning inside the sphere of a particle through a cone region whose vertex corresponds to the particle position. The particle is identified as a free-surface particle if there exists a cone region with no neighboring particles. Next, an efficient semi-geometrical method is proposed based on the geometrical method to reduce the computational cost. It consists of finding particles near the free surface via position divergence and then detecting these particles using the geometrical method to identify free-surface particles. The accuracy and robustness of the proposed method are demonstrated by performing tests on several model problems.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
Microstructure of Charged AdS Black Hole with Minimal Length Effects
Authors:
Ningchen Bai,
Aoyun He,
Jun Tao
Abstract:
In this work, the microstructure of charged AdS black holes under minimal length effects is investigated. We study the thermodynamics of black holes in the extended phase space, where the cosmological constant is regarded as the thermodynamic pressure. The modified Hawking temperature and phase transition are obtained based on the generalized uncertainty principle (GUP). Then, using thermodynamic…
▽ More
In this work, the microstructure of charged AdS black holes under minimal length effects is investigated. We study the thermodynamics of black holes in the extended phase space, where the cosmological constant is regarded as the thermodynamic pressure. The modified Hawking temperature and phase transition are obtained based on the generalized uncertainty principle (GUP). Then, using thermodynamic geometry, we show that the microstructure of black holes can be determined by the ratio of GUP parameter to charge. For a small ratio, the black hole exhibits the typical RN-AdS microstructure with van der Waals phase transition and repulsive/attractive interactions. As the ratio increases, the reentrant phase transition takes place, and both the repulsion-attraction coexisted black hole and the attraction dominated black hole can be found in this case. For a large ratio, the black hole behaves like a Schwarzchild-AdS black hole in which neither phase transition nor repulsive interaction exists. These results suggest that the GUP effect will reduce the repulsive interaction presented by the charged AdS black hole, which can also be qualitatively understood from the perspective of black hole molecules.
△ Less
Submitted 11 May, 2022; v1 submitted 27 April, 2022;
originally announced April 2022.
-
Localizing narrow Fe K$α$ emission within bright AGN
Authors:
Carolina Andonie,
Franz E. Bauer,
Rosamaria Carraro,
Patricia Arevalo,
David M. Alexander,
William N. Brandt,
Johannes Buchner,
Adam He,
Michael J. Koss,
Claudio Ricci,
Vicente Salinas,
Manuel Solimano,
Alessia Tortosa,
Ezequiel Treister
Abstract:
The 6.4 keV Fe Ka emission line is a ubiquitous feature in X-ray spectra of AGN, and its properties track the interaction between the variable primary X-ray continuum and the surrounding structure from which it arises. We clarify the nature and origin of the narrow Fe Ka emission using X-ray spectral, timing, and imaging constraints, plus possible correlations to AGN and host galaxy properties, fo…
▽ More
The 6.4 keV Fe Ka emission line is a ubiquitous feature in X-ray spectra of AGN, and its properties track the interaction between the variable primary X-ray continuum and the surrounding structure from which it arises. We clarify the nature and origin of the narrow Fe Ka emission using X-ray spectral, timing, and imaging constraints, plus possible correlations to AGN and host galaxy properties, for 38 bright nearby AGN ($z<0.5$) from the BAT AGN Spectroscopic Survey. Modeling Chandra and XMM-Newton spectra, we computed line full-width half-maxima (FWHMs) and constructed Fe Ka line and 2-10 keV continuum light curves. The FWHM provides one estimate of the Fe Ka emitting region size, RFeKa, assuming virial motion. A second estimate comes from comparing the degree of correlation between the variability of the continuum and line-only light curves, compared to simulated light curves. Finally, we extracted Chandra radial profiles to place upper limits on RFeKa. We found that for 90% (21/24) of AGN with FWHM measurements, RFeKa is smaller than the fiducial dust sublimation radius, Rsub. Despite a wide range of variability properties, the constraints on the Fe Ka photon reprocessor size independently confirm that RFeKa is smaller than Rsub in 83% of AGN. Finally, the imaging analysis yields loose upper limits for all but two sources; notably, the Circinus Galaxy and NGC 1068 show significant but subdominant extended Fe Ka emission out to $\sim$100 and $\sim$800 pc, respectively. Based on independent constraints, we conclude that the majority of the narrow Fe Ka emission in typical AGN predominantly arises from regions smaller than and presumably inside Rsub, and thus it is associated either with the outer broad line region or outer accretion disk. However, the large diversity of continuum and narrow Fe Ka variability properties are not easily accommodated by a universal scenario.
△ Less
Submitted 21 April, 2022; v1 submitted 20 April, 2022;
originally announced April 2022.
-
Understanding Game-Playing Agents with Natural Language Annotations
Authors:
Nicholas Tomlin,
Andre He,
Dan Klein
Abstract:
We present a new dataset containing 10K human-annotated games of Go and show how these natural language annotations can be used as a tool for model interpretability. Given a board state and its associated comment, our approach uses linear probing to predict mentions of domain-specific terms (e.g., ko, atari) from the intermediate state representations of game-playing agents like AlphaGo Zero. We f…
▽ More
We present a new dataset containing 10K human-annotated games of Go and show how these natural language annotations can be used as a tool for model interpretability. Given a board state and its associated comment, our approach uses linear probing to predict mentions of domain-specific terms (e.g., ko, atari) from the intermediate state representations of game-playing agents like AlphaGo Zero. We find these game concepts are nontrivially encoded in two distinct policy networks, one trained via imitation learning and another trained via reinforcement learning. Furthermore, mentions of domain-specific terms are most easily predicted from the later layers of both models, suggesting that these policy networks encode high-level abstractions similar to those used in the natural language annotations.
△ Less
Submitted 15 April, 2022;
originally announced April 2022.
-
Arc diagrams on 3-manifold spines
Authors:
Jack Brand,
Benjamin A. Burton,
Zsuzsanna Dancso,
Alexander He,
Adele Jackson,
Joan Licata
Abstract:
We develop a theory of link projections to trivalent spines of 3-manifolds. We prove a Reidemeister Theorem providing a set of combinatorial moves sufficient to relate the projections of isotopic links. We also show that any link admits a crossingless projection to any special spine and we refine our theorem to provide a set of combinatorial moves sufficient to relate crossingless diagrams. Finall…
▽ More
We develop a theory of link projections to trivalent spines of 3-manifolds. We prove a Reidemeister Theorem providing a set of combinatorial moves sufficient to relate the projections of isotopic links. We also show that any link admits a crossingless projection to any special spine and we refine our theorem to provide a set of combinatorial moves sufficient to relate crossingless diagrams. Finally, we discuss the connection to Turaev's shadow world, interpreting our result as a statement about shadow equivalence of a class of 4-manifolds.
△ Less
Submitted 13 June, 2023; v1 submitted 4 February, 2022;
originally announced February 2022.
-
The h-vector of a Positroid is a pure O-sequence
Authors:
Amy He,
Pierce Lai,
SuHo Oh
Abstract:
A well-known conjecture of Stanley is that the h-vector of any matroid is a pure O-sequence. There have been numerous papers with partial progress on this conjecture, but it is still wide open. Positroids are special class of linear matroids that play a crucial role in the field of total positivity. In this short note, we prove that Stanley's conjecture holds for positroids.
A well-known conjecture of Stanley is that the h-vector of any matroid is a pure O-sequence. There have been numerous papers with partial progress on this conjecture, but it is still wide open. Positroids are special class of linear matroids that play a crucial role in the field of total positivity. In this short note, we prove that Stanley's conjecture holds for positroids.
△ Less
Submitted 9 December, 2021;
originally announced December 2021.
-
Combating Unknown Bias with Effective Bias-Conflicting Scoring and Gradient Alignment
Authors:
Bowen Zhao,
Chen Chen,
Qian-Wei Wang,
Anfeng He,
Shu-Tao Xia
Abstract:
Models notoriously suffer from dataset biases which are detrimental to robustness and generalization. The identify-emphasize paradigm shows a promising effect in dealing with unknown biases. However, we find that it is still plagued by two challenges: A, the quality of the identified bias-conflicting samples is far from satisfactory; B, the emphasizing strategies just yield suboptimal performance.…
▽ More
Models notoriously suffer from dataset biases which are detrimental to robustness and generalization. The identify-emphasize paradigm shows a promising effect in dealing with unknown biases. However, we find that it is still plagued by two challenges: A, the quality of the identified bias-conflicting samples is far from satisfactory; B, the emphasizing strategies just yield suboptimal performance. In this work, for challenge A, we propose an effective bias-conflicting scoring method to boost the identification accuracy with two practical strategies -- peer-picking and epoch-ensemble. For challenge B, we point out that the gradient contribution statistics can be a reliable indicator to inspect whether the optimization is dominated by bias-aligned samples. Then, we propose gradient alignment, which employs gradient statistics to balance the contributions of the mined bias-aligned and bias-conflicting samples dynamically throughout the learning process, forcing models to leverage intrinsic features to make fair decisions. Experiments are conducted on multiple datasets in various settings, demonstrating that the proposed solution can alleviate the impact of unknown biases and achieve state-of-the-art performance.
△ Less
Submitted 27 November, 2022; v1 submitted 25 November, 2021;
originally announced November 2021.
-
Magic-angle Twisted Bilayer Systems with Quadratic-Band-Touching: Exactly Flat Bands with High-Chern Number
Authors:
Ming-Rui Li,
Ai-Lei He,
Hong Yao
Abstract:
Studies of twisted moiré systems have been mainly focused on two-dimensional (2D) materials such as graphene with Dirac points and transition-metal-dichalcogenide so far. Here we propose a twisted bilayer of 2D systems which feature stable quadratic-band-touching points and find exotic physics different from previously studied twisted moiré systems. Specifically, we show that exactly flat bands ca…
▽ More
Studies of twisted moiré systems have been mainly focused on two-dimensional (2D) materials such as graphene with Dirac points and transition-metal-dichalcogenide so far. Here we propose a twisted bilayer of 2D systems which feature stable quadratic-band-touching points and find exotic physics different from previously studied twisted moiré systems. Specifically, we show that exactly flat bands can emerge at magic angles and, more interestingly, each flat band exhibits a high Chern number ($C=\pm 2$). We further consider the effect of Coulomb interactions in such magic-angle twisted systems and find that the ground state supports the quantum anomalous Hall effect with quantized Hall conductivity $2\frac{e^2}{hc}$ at certain filling. Furthermore, the possible physical realization of such twisted bilayer systems will be briefly discussed.
△ Less
Submitted 24 December, 2022; v1 submitted 23 November, 2021;
originally announced November 2021.
-
Computationally Efficient Zero Noise Extrapolation for Quantum Gate Error Mitigation
Authors:
Vincent R. Pascuzzi,
Andre He,
Christian W. Bauer,
Wibe A. de Jong,
Benjamin Nachman
Abstract:
Zero noise extrapolation (ZNE) is a widely used technique for gate error mitigation on near term quantum computers because it can be implemented in software and does not require knowledge of the quantum computer noise parameters. Traditional ZNE requires a significant resource overhead in terms of quantum operations. A recent proposal using a targeted (or random) instead of fixed identity insertio…
▽ More
Zero noise extrapolation (ZNE) is a widely used technique for gate error mitigation on near term quantum computers because it can be implemented in software and does not require knowledge of the quantum computer noise parameters. Traditional ZNE requires a significant resource overhead in terms of quantum operations. A recent proposal using a targeted (or random) instead of fixed identity insertion method (RIIM versus FIIM) requires significantly fewer quantum gates for the same formal precision. We start by showing that RIIM can allow for ZNE to be deployed on deeper circuits than FIIM, but requires many more measurements to maintain the same statistical uncertainty. We develop two extensions to FIIM and RIIM. The List Identity Insertion Method (LIIM) allows to mitigate the error from certain CNOT gates, typically those with the largest error. Set Identity Insertion Method (SIIM) naturally interpolates between the measurement-efficient FIIM and the gate-efficient RIIM, allowing to trade off fewer CNOT gates for more measurements. Finally, we investigate a way to boost the number of measurements, namely to run ZNE in parallel, utilizing as many quantum devices as are available. We explore the performance of RIIM in a parallel setting where there is a non-trivial spread in noise across sets of qubits within or across quantum computers.
△ Less
Submitted 9 March, 2022; v1 submitted 25 October, 2021;
originally announced October 2021.
-
Shadow and Photon Sphere of Black Hole in Clouds of Strings and Quintessence
Authors:
Aoyun He,
Jun Tao,
Yadong Xue,
Lingkai Zhang
Abstract:
In this work, we study the shadow and photon sphere of the black bole in clouds of strings and quintessence with static and infalling spherical accretions. We obtain the geodesics of the photons near a black hole with different impact parameters $b$. The string clouds model and quintessence influence the specific intensity by affecting the geodesic and the average radial position of photons. And t…
▽ More
In this work, we study the shadow and photon sphere of the black bole in clouds of strings and quintessence with static and infalling spherical accretions. We obtain the geodesics of the photons near a black hole with different impact parameters $b$. The string clouds model and quintessence influence the specific intensity by affecting the geodesic and the average radial position of photons. And the range of string clouds parameter $a$ is constrained to ensure that the shadow can be observed. Moreover, we use a model of the photon emissivity $j(ν_e)$ to get the specific intensities. The light sources in the accretion follow a normal distribution with an attenuation factor $γ$. The shadow with static spherical accretion is plotted. The apparent shape of the shadow is a perfect circle, and the value of $γ$ affects the brightness of the photon sphere. We investigate the profile and specific intensity of the shadows with static and infalling spherical accretions respectively. The interior of the shadows with an infalling spherical accretion will be darker than that with the static spherical accretion, and the specific intensity with both static and infalling spherical accretion gradually converges.
△ Less
Submitted 2 April, 2022; v1 submitted 28 September, 2021;
originally announced September 2021.
-
Topological states in a dimerized system with staggered magnetic fluxes
Authors:
Ai-Lei He,
Wei-Wei Luo,
Yuan Zhou,
Yi-Fei Wang,
Hong Yao
Abstract:
The bulk-boundary correspondence is a generic feature of topological states of matter, reflecting the intrinsic relation between topological bulk and boundary states. For example, robust edge states propagate along the edges and corner states gather at corners in the two-dimensional first-order and second-order topological insulators, respectively. Here, we report two kinds of topological states h…
▽ More
The bulk-boundary correspondence is a generic feature of topological states of matter, reflecting the intrinsic relation between topological bulk and boundary states. For example, robust edge states propagate along the edges and corner states gather at corners in the two-dimensional first-order and second-order topological insulators, respectively. Here, we report two kinds of topological states hosting anomalous bulk-boundary correspondence in the extended two-dimensional dimerized lattice with staggered flux threading. At $\frac{1}{2}$-filling, we observe isolated corner states with no fractional charge as well as metallic near-edge states in the $\mathcal{C}=2$ Chern insulator states. At $\frac{1}{4}$-filling, we find a $\mathcal{C}=0$ topologically nontrivial state, where the robust edge states are well localized along edges but bypass corners. These robust topological insulating states significantly differ from both conventional Chern insulators and usual high-order topological insulators.
△ Less
Submitted 13 November, 2022; v1 submitted 16 September, 2021;
originally announced September 2021.
-
Bosonic fractional Chern insulating state at integer fillings in multi-band system
Authors:
Wei-Wei Luo,
Ai-Lei He,
Yi-Fei Wang,
Yuan Zhou,
Chang-De Gong
Abstract:
The integer quantum Hall state occurs when the Landau levels are fully occupied by the fermions, while the fractional quantum Hall state usually emerges when the Landau level is partially filled by the strongly correlated fermions or bosons. Here, we report two fractional Chern insulating states of the hard-core bosons in a multi-band lattice model hosting topological flat bands with high Chern nu…
▽ More
The integer quantum Hall state occurs when the Landau levels are fully occupied by the fermions, while the fractional quantum Hall state usually emerges when the Landau level is partially filled by the strongly correlated fermions or bosons. Here, we report two fractional Chern insulating states of the hard-core bosons in a multi-band lattice model hosting topological flat bands with high Chern number. The previously proposed $ν=1/3$ fractional Chern insulating state inherited from the high Chern number $C=2$ of the lowest topological flat band is revisited by the infinite density matrix renormalization group algorithm. In particular, we numerically identify a bosonic $1/2$-Laughlin-like fractional Chern insulating state at the integer fillings. We show two lower topological flat bands jointly generate an effective $C=1$ Chern band with half-filling. Furthermore, we find a strictly particle-hole-like symmetry between the $ν$ and $3-ν$ filling in our model. These findings extend our understanding of quantum Hall states and offer a new route to realize the novel fractional states in the system with multi-bands and high-Chern numbers.
△ Less
Submitted 13 July, 2021;
originally announced July 2021.
-
Mitigating depolarizing noise on quantum computers with noise-estimation circuits
Authors:
Miroslav Urbanek,
Benjamin Nachman,
Vincent R. Pascuzzi,
Andre He,
Christian W. Bauer,
Wibe A. de Jong
Abstract:
A significant problem for current quantum computers is noise. While there are many distinct noise channels, the depolarizing noise model often appropriately describes average noise for large circuits involving many qubits and gates. We present a method to mitigate the depolarizing noise by first estimating its rate with a noise-estimation circuit and then correcting the output of the target circui…
▽ More
A significant problem for current quantum computers is noise. While there are many distinct noise channels, the depolarizing noise model often appropriately describes average noise for large circuits involving many qubits and gates. We present a method to mitigate the depolarizing noise by first estimating its rate with a noise-estimation circuit and then correcting the output of the target circuit using the estimated rate. The method is experimentally validated on the simulation of the Heisenberg model. We find that our approach in combination with readout-error correction, randomized compiling, and zero-noise extrapolation produces results close to exact results even for circuits containing hundreds of CNOT gates.
△ Less
Submitted 15 March, 2021;
originally announced March 2021.
-
Connecting 3-manifold triangulations with monotonic sequences of elementary moves
Authors:
Benjamin A. Burton,
Alexander He
Abstract:
A key result in computational 3-manifold topology is that any two triangulations of the same 3-manifold are connected by a finite sequence of bistellar flips, also known as Pachner moves. One limitation of this result is that little is known about the structure of this sequence; knowing more about the structure could help both proofs and algorithms. Motivated by this, we consider sequences of move…
▽ More
A key result in computational 3-manifold topology is that any two triangulations of the same 3-manifold are connected by a finite sequence of bistellar flips, also known as Pachner moves. One limitation of this result is that little is known about the structure of this sequence; knowing more about the structure could help both proofs and algorithms. Motivated by this, we consider sequences of moves that are "monotonic" in the sense that they break up into two parts: first, a sequence that monotonically increases the size of the triangulation; and second, a sequence that monotonically decreases the size. We prove that any two one-vertex triangulations of the same 3-manifold, each with at least two tetrahedra, are connected by a monotonic sequence of 2-3 and 2-0 moves. We also study the practical utility of monotonic sequences; specifically, we implement an algorithm to find such sequences, and use this algorithm to perform some detailed computational experiments.
△ Less
Submitted 13 June, 2024; v1 submitted 3 December, 2020;
originally announced December 2020.
-
Reconstruction and interpretation of photon Doppler velocimetry spectrum for ejecta particles from shock-loaded sample in vacuum
Authors:
Xiao-Feng Shi,
Dong-Jun Ma,
Song-lin Dang,
Zong-Qiang Ma,
Hai-Quan Sun,
An-Min He,
Pei Wang
Abstract:
The photon Doppler velocimetry (PDV) spectrum is investigated in an attempt to reveal the particle parameters of ejecta from shock-loaded samples in a vacuum. A GPU-accelerated Monte-Carlo algorithm, which considers the multiple-scattering effects of light, is applied to reconstruct the light field of the ejecta and simulate the corresponding PDV spectrum. The influence of the velocity profile, to…
▽ More
The photon Doppler velocimetry (PDV) spectrum is investigated in an attempt to reveal the particle parameters of ejecta from shock-loaded samples in a vacuum. A GPU-accelerated Monte-Carlo algorithm, which considers the multiple-scattering effects of light, is applied to reconstruct the light field of the ejecta and simulate the corresponding PDV spectrum. The influence of the velocity profile, total area mass, and particle size of the ejecta on the simulated spectra is discussed qualitatively. To facilitate a quantitative discussion, a novel theoretical optical model is proposed in which the single-scattering assumption is applied. With this model, the relationships between the particle parameters of ejecta and the peak information of the PDV spectrum are derived, enabling direct extraction of the particle parameters from the PDV spectrum. The values of the ejecta parameters estimated from the experimental spectrum are in good agreement with those measured by a piezoelectric probe.
△ Less
Submitted 8 January, 2021; v1 submitted 1 December, 2020;
originally announced December 2020.
-
FedSmart: An Auto Updating Federated Learning Optimization Mechanism
Authors:
Anxun He,
Jianzong Wang,
Zhangcheng Huang,
**g Xiao
Abstract:
Federated learning has made an important contribution to data privacy-preserving. Many previous works are based on the assumption that the data are independently identically distributed (IID). As a result, the model performance on non-identically independently distributed (non-IID) data is beyond expectation, which is the concrete situation. Some existing methods of ensuring the model robustness o…
▽ More
Federated learning has made an important contribution to data privacy-preserving. Many previous works are based on the assumption that the data are independently identically distributed (IID). As a result, the model performance on non-identically independently distributed (non-IID) data is beyond expectation, which is the concrete situation. Some existing methods of ensuring the model robustness on non-IID data, like the data-sharing strategy or pretraining, may lead to privacy leaking. In addition, there exist some participants who try to poison the model with low-quality data. In this paper, a performance-based parameter return method for optimization is introduced, we term it FederatedSmart (FedSmart). It optimizes different model for each client through sharing global gradients, and it extracts the data from each client as a local validation set, and the accuracy that model achieves in round t determines the weights of the next round. The experiment results show that FedSmart enables the participants to allocate a greater weight to the ones with similar data distribution.
△ Less
Submitted 15 September, 2020;
originally announced September 2020.
-
Resource Efficient Zero Noise Extrapolation with Identity Insertions
Authors:
Andre He,
Benjamin Nachman,
Wibe A. de Jong,
Christian W. Bauer
Abstract:
In addition to readout errors, two-qubit gate noise is the main challenge for complex quantum algorithms on noisy intermediate-scale quantum (NISQ) computers. These errors are a significant challenge for making accurate calculations for quantum chemistry, nuclear physics, high energy physics, and other emerging scientific and industrial applications. There are two proposals for mitigating two-qubi…
▽ More
In addition to readout errors, two-qubit gate noise is the main challenge for complex quantum algorithms on noisy intermediate-scale quantum (NISQ) computers. These errors are a significant challenge for making accurate calculations for quantum chemistry, nuclear physics, high energy physics, and other emerging scientific and industrial applications. There are two proposals for mitigating two-qubit gate errors: error-correcting codes and zero-noise extrapolation. This paper focuses on the latter, studying it in detail and proposing modifications to existing approaches. In particular, we propose a random identity insertion method (RIIM) that can achieve competitive asymptotic accuracy with far fewer gates than the traditional fixed identity insertion method (FIIM). For example, correcting the leading order depolarizing gate noise requires $n_\text{CNOT}+2$ gates for RIIM instead of $3n_\text{CNOT}$ gates for FIIM. This significant resource saving may enable more accurate results for state-of-the-art calculations on near term quantum hardware.
△ Less
Submitted 10 March, 2020;
originally announced March 2020.