-
SoLar: Sinkhorn Label Refinery for Imbalanced Partial-Label Learning
Authors:
Haobo Wang,
Mingxuan Xia,
Yixuan Li,
Yuren Mao,
Lei Feng,
Gang Chen,
Junbo Zhao
Abstract:
Partial-label learning (PLL) is a peculiar weakly-supervised learning task where the training samples are generally associated with a set of candidate labels instead of a single ground truth. While a variety of label disambiguation methods have been proposed in this domain, they normally assume a class-balanced scenario that may not hold in many real-world applications. Empirically, we observe degraded performance of the prior methods when facing the combined challenge of long-tailed distributions and partial labeling. In this work, we first identify the major reasons that the prior work failed. We subsequently propose SoLar, a novel Optimal Transport-based framework that refines the disambiguated labels towards matching the marginal class prior distribution. SoLar additionally incorporates a new and systematic mechanism for estimating the long-tailed class prior distribution under the PLL setup. Through extensive experiments, SoLar exhibits substantially superior results on standardized benchmarks compared to the previous state-of-the-art PLL methods. Code and data are available at: https://github.com/hbzju/SoLar.
Submitted 21 September, 2022;
originally announced September 2022.
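The idea of refining disambiguated labels so that they match a class prior can be illustrated with a generic Sinkhorn-Knopp iteration. The following is a minimal sketch under stated assumptions, not the SoLar implementation: the function name `sinkhorn_refine`, its parameters, and the uniform per-sample mass are all hypothetical, and the class prior is taken as given rather than estimated.

```python
import numpy as np

def sinkhorn_refine(probs, candidate_mask, class_prior, n_iters=100, eps=1e-12):
    """Rescale candidate-masked predictions so class marginals match a prior.

    probs:          (n, c) predicted class probabilities (hypothetical input)
    candidate_mask: (n, c) 1 where the class is in the sample's candidate set
    class_prior:    (c,)   assumed class distribution, sums to 1
    """
    n, _ = probs.shape
    # Keep mass only on candidate labels; eps avoids all-zero candidate rows.
    q = probs * candidate_mask + eps * candidate_mask
    row_target = np.full(n, 1.0 / n)  # assumption: every sample carries equal mass
    for _ in range(n_iters):
        # Scale columns towards the class prior, then rows towards equal mass.
        q *= (class_prior / np.maximum(q.sum(axis=0), eps))[None, :]
        q *= (row_target / np.maximum(q.sum(axis=1), eps))[:, None]
    return q * n  # rescale so each row is a label distribution summing to 1


probs = np.array([[0.6, 0.4], [0.5, 0.5], [0.7, 0.3], [0.2, 0.8]])
full_mask = np.ones_like(probs)
refined = sinkhorn_refine(probs, full_mask, np.array([0.75, 0.25]))
```

Entries outside the candidate set stay exactly zero because the updates are purely multiplicative, so the refinement never assigns mass to a label that was ruled out.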
-
Designing Biological Sequences via Meta-Reinforcement Learning and Bayesian Optimization
Authors:
Leo Feng,
Padideh Nouri,
Aneri Muni,
Yoshua Bengio,
Pierre-Luc Bacon
Abstract:
The ability to accelerate the design of biological sequences can have a substantial impact on the progress of the medical field. The problem can be framed as a global optimization problem where the objective is an expensive black-box function that can be queried in large batches but only over a small number of rounds. Bayesian Optimization is a principled method for tackling this problem. However, the astronomically large state space of biological sequences renders brute-force iteration over all possible sequences infeasible. In this paper, we propose MetaRLBO, where we train an autoregressive generative model via Meta-Reinforcement Learning to propose promising sequences for selection via Bayesian Optimization. We pose this problem as that of finding an optimal policy over a distribution of MDPs induced by sampling subsets of the data acquired in the previous rounds. Our in-silico experiments show that meta-learning over such ensembles provides robustness against reward misspecification and achieves competitive results compared to existing strong baselines.
Submitted 13 September, 2022;
originally announced September 2022.
-
Demonstration of three- and four-body interactions between trapped-ion spins
Authors:
Or Katz,
Lei Feng,
Andrew Risinger,
Christopher Monroe,
Marko Cetina
Abstract:
Quantum processors use the native interactions between effective spins to simulate Hamiltonians or execute quantum gates. In most processors, the native interactions are pairwise, limiting the efficiency of controlling entanglement between many qubits. Here we experimentally demonstrate a new class of native interactions between trapped-ion qubits, extending conventional pairwise interactions to higher order. We realize three- and four-body spin interactions as examples, showing that high-order spin polynomials may serve as a new toolbox for quantum information applications.
Submitted 12 September, 2022;
originally announced September 2022.
-
Search for relativistic fractionally charged particles in space
Authors:
DAMPE Collaboration,
F. Alemanno,
C. Altomare,
Q. An,
P. Azzarello,
F. C. T. Barbato,
P. Bernardini,
X. J. Bi,
M. S. Cai,
E. Casilli,
E. Catanzani,
J. Chang,
D. Y. Chen,
J. L. Chen,
Z. F. Chen,
M. Y. Cui,
T. S. Cui,
Y. X. Cui,
H. T. Dai,
A. De-Benedittis,
I. De Mitri,
F. de Palma,
M. Deliyergiyev,
A. Di Giovanni,
M. Di Santo
, et al. (126 additional authors not shown)
Abstract:
More than a century after the oil drop experiment was performed, the possible existence of fractionally charged particles (FCPs) remains unsettled. The search for FCPs is crucial for some extensions of the Standard Model in particle physics. Most of the previously conducted searches for FCPs in cosmic rays were based on experiments underground or at high altitudes. However, there have been few searches for FCPs in cosmic rays carried out in orbit other than AMS-01 flown by a space shuttle and BESS by a balloon at the top of the atmosphere. In this study, we conduct an FCP search in space based on on-orbit data obtained using the DArk Matter Particle Explorer (DAMPE) satellite over a period of five years. Unlike underground experiments, which require an FCP energy of the order of hundreds of GeV, our FCP search starts at only a few GeV. An upper limit of $6.2\times 10^{-10}~\mathrm{cm^{-2}\,sr^{-1}\,s^{-1}}$ is obtained for the flux. Our results demonstrate that DAMPE exhibits a sensitivity three orders of magnitude higher than that of experiments of similar types, which more stringently restricts the conditions for the existence of FCPs in primary cosmic rays.
Submitted 9 September, 2022;
originally announced September 2022.
-
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection
Authors:
Xinjiang Wang,
Xingyi Yang,
Shilong Zhang,
Yijiang Li,
Litong Feng,
Shijie Fang,
Chengqi Lyu,
Kai Chen,
Wayne Zhang
Abstract:
In this study, we dive deep into the inconsistency of pseudo-targets in semi-supervised object detection (SSOD). Our core observation is that oscillating pseudo-targets undermine the training of an accurate detector: they inject noise into the student's training, leading to severe overfitting problems. Therefore, we propose a systematic solution, termed ConsistentTeacher, to reduce the inconsistency. First, adaptive anchor assignment (ASA) substitutes the static IoU-based strategy, which enables the student network to be resistant to noisy pseudo-bounding boxes. Then we calibrate the subtask predictions by designing a 3D feature alignment module (FAM-3D). It allows each classification feature to adaptively query the optimal feature vector for the regression task at arbitrary scales and locations. Lastly, a Gaussian Mixture Model (GMM) dynamically revises the score threshold of pseudo-bboxes, which stabilizes the number of ground truths at an early stage and remedies the unreliable supervision signal during training. ConsistentTeacher provides strong results on a large range of SSOD evaluations. It achieves 40.0 mAP with a ResNet-50 backbone given only 10% of annotated MS-COCO data, which surpasses previous baselines using pseudo labels by around 3 mAP. When trained on fully annotated MS-COCO with additional unlabeled data, the performance further increases to 47.7 mAP. Our code is available at https://github.com/Adamdad/ConsistentTeacher.
Submitted 28 March, 2023; v1 submitted 4 September, 2022;
originally announced September 2022.
-
Enjoy the Ride Consciously with CAWA: Context-Aware Advisory Warnings for Automated Driving
Authors:
Erfan Pakdamanian,
Erzhen Hu,
Shili Sheng,
Sarit Kraus,
Seongkook Heo,
Lu Feng
Abstract:
In conditionally automated driving, drivers decoupled from driving while immersed in non-driving-related tasks (NDRTs) could either miss the system-initiated takeover request (TOR) or be startled by a sudden TOR. To better prepare drivers for a safer takeover in an emergency, we propose novel context-aware advisory warnings (CAWA) for automated driving to gently inform drivers. This will help them stay vigilant while engaging in NDRTs. The key innovation is that CAWA adapts warning modalities according to the context of NDRTs. We conducted a user study to investigate the effectiveness of CAWA. The study results show that CAWA has statistically significant effects on safer takeover behavior, improved driver situational awareness, less attention demand, and more positive user feedback, compared with uniformly distributed speech-based warnings across all NDRTs.
Submitted 29 August, 2022;
originally announced August 2022.
-
The missing link between standing- and traveling-wave resonators
Authors:
Qi Zhong,
Haoqi Zhao,
Liang Feng,
Kurt Busch,
Sahin K. Ozdemir,
Ramy El-Ganainy
Abstract:
Optical resonators are structures that utilize wave interference and feedback to confine light in all three dimensions. Depending on the feedback mechanism, resonators can support either standing- or traveling-wave modes. Over the years, the distinction between these two different types of modes has become so prevalent that nowadays it is one of the main characteristics for classifying optical resonators. Here, we show that an intermediate link between these two rather different groups exists. In particular, we introduce a new class of photonic resonators that supports a hybrid optical mode, i.e. at one location along the resonator the electromagnetic fields associated with the mode feature a purely standing-wave pattern, while at a different location, the fields of the same mode represent a pure traveling wave. The proposed concept is general and can be implemented using chip-scale photonics as well as free-space optics. Moreover, it can be extended to other wave phenomena such as microwaves and acoustics.
Submitted 26 August, 2022;
originally announced August 2022.
-
Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation
Authors:
Lihe Yang,
Lei Qi,
Litong Feng,
Wayne Zhang,
Yinghuan Shi
Abstract:
In this work, we revisit the weak-to-strong consistency framework, popularized by FixMatch from semi-supervised classification, where the prediction of a weakly perturbed image serves as supervision for its strongly perturbed version. Intriguingly, we observe that such a simple pipeline already achieves competitive results against recent advanced works when transferred to our segmentation scenario. Its success heavily relies on the manual design of strong data augmentations, however, which may be limited and inadequate to explore a broader perturbation space. Motivated by this, we propose an auxiliary feature perturbation stream as a supplement, leading to an expanded perturbation space. On the other hand, to sufficiently probe original image-level augmentations, we present a dual-stream perturbation technique, enabling two strong views to be simultaneously guided by a common weak view. Consequently, our overall Unified Dual-Stream Perturbations approach (UniMatch) surpasses all existing methods significantly across all evaluation protocols on the Pascal, Cityscapes, and COCO benchmarks. Its superiority is also demonstrated in remote sensing interpretation and medical image analysis. We hope our reproduced FixMatch and our results can inspire more future work. Code and logs are available at https://github.com/LiheYoung/UniMatch.
Submitted 26 March, 2023; v1 submitted 21 August, 2022;
originally announced August 2022.
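The weak-to-strong consistency idea described in this abstract can be sketched as a confidence-masked cross-entropy between pseudo-labels from the weak view and predictions on a strong view. This is a minimal illustration under stated assumptions, not the UniMatch code: the function name `weak_to_strong_loss` and its inputs are hypothetical, the 0.95 threshold is only an assumed default, and samples are treated as flat rows rather than image pixels.

```python
import numpy as np

def weak_to_strong_loss(probs_weak, logits_strong, threshold=0.95):
    """Cross-entropy of strong-view logits against confident weak-view pseudo-labels.

    probs_weak:    (n, c) softmax outputs on the weakly perturbed inputs
    logits_strong: (n, c) raw logits on the strongly perturbed inputs
    threshold:     assumed confidence cutoff for keeping a pseudo-label
    """
    pseudo = probs_weak.argmax(axis=1)            # hard pseudo-labels
    mask = probs_weak.max(axis=1) >= threshold    # keep confident samples only
    # Numerically stable log-softmax of the strong-view logits.
    z = logits_strong - logits_strong.max(axis=1, keepdims=True)
    log_probs = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    ce = -log_probs[np.arange(len(pseudo)), pseudo]
    # Average over all samples; unconfident ones contribute zero.
    return float((ce * mask).mean())


probs_weak = np.array([[0.97, 0.03], [0.60, 0.40]])
logits_strong = np.zeros((2, 2))
loss = weak_to_strong_loss(probs_weak, logits_strong)
```

Under this sketch, a dual-stream variant would call the function once per strong view with the same `probs_weak` and average the two losses, so that both strong views are guided by the common weak view.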
-
A Study on Learning and Simulating Personalized Car-Following Driving Style
Authors:
Shili Sheng,
Erfan Pakdamanian,
Kyungtae Han,
Ziran Wang,
Lu Feng
Abstract:
Automated vehicles are gradually entering people's daily lives to provide a comfortable driving experience for users. Generic, user-agnostic automated vehicles have limited ability to accommodate the different driving styles of different users. This limitation not only impacts users' satisfaction but also causes safety concerns. Learning from user demonstrations can provide direct insights regarding users' driving preferences. However, it is difficult to understand a driver's preference with limited data. In this study, we use a model-free inverse reinforcement learning method to study drivers' characteristics in the car-following scenario from a naturalistic driving dataset, and show that this method is capable of representing users' preferences with reward functions. In order to predict the driving styles for drivers with limited data, we apply Gaussian Mixture Models and compute the similarity of a specific driver to the clusters of drivers. We design a personalized adaptive cruise control (P-ACC) system through a partially observable Markov decision process (POMDP) model. The reward function is integrated with the model to mimic drivers' driving styles, with a constraint on the relative distance to ensure driving safety. Prediction of the driving styles achieves 85.7% accuracy with data from fewer than 10 car-following events. The model-based experimental driving trajectories demonstrate that the P-ACC system can provide a personalized driving experience.
Submitted 16 August, 2022;
originally announced August 2022.
-
AutoShard: Automated Embedding Table Sharding for Recommender Systems
Authors:
Daochen Zha,
Louis Feng,
Bhargav Bhushanam,
Dhruv Choudhary,
Jade Nie,
Yuandong Tian,
Jay Chae,
Yinbin Ma,
Arun Kejariwal,
Xia Hu
Abstract:
Embedding learning is an important technique in deep recommendation models to map categorical features to dense vectors. However, the embedding tables often demand an extremely large number of parameters, which become storage and efficiency bottlenecks. Distributed training solutions have been adopted to partition the embedding tables across multiple devices. However, the embedding tables can easily lead to imbalances if not carefully partitioned. This is a significant design challenge of distributed systems named embedding table sharding, i.e., how we should partition the embedding tables to balance the costs across devices, which is a non-trivial task because 1) it is hard to efficiently and precisely measure the cost, and 2) the partition problem is known to be NP-hard. In this work, we introduce our novel practice at Meta, namely AutoShard, which uses a neural cost model to directly predict the multi-table costs and leverages deep reinforcement learning to solve the partition problem. Experimental results on an open-sourced large-scale synthetic dataset and Meta's production dataset demonstrate the superiority of AutoShard over the heuristics. Moreover, the learned policy of AutoShard can transfer to sharding tasks with various numbers of tables and different ratios of unseen tables without any fine-tuning. Furthermore, AutoShard can efficiently shard hundreds of tables in seconds. The effectiveness, transferability, and efficiency of AutoShard make it desirable for production use. Our algorithms have been deployed in Meta's production environment. A prototype is available at https://github.com/daochenzha/autoshard
Submitted 12 August, 2022;
originally announced August 2022.
-
4D Real-Time GRASP MRI at Sub-Second Temporal Resolution
Authors:
Li Feng
Abstract:
Intra-frame motion blurring, a major challenge in free-breathing dynamic MRI, can be reduced if high temporal resolution can be achieved. To address this challenge, this work proposes a highly-accelerated 4D (3D+time) real-time MRI framework with sub-second temporal resolution combining standard stack-of-stars golden-angle radial sampling and tailored GRASP-Pro (Golden-angle RAdial Sparse Parallel) reconstruction. Specifically, 4D real-time MRI acquisition is performed continuously without motion gating or sorting. The k-space centers in stack-of-stars radial data are organized to guide estimation of a temporal basis, with which GRASP-Pro reconstruction is employed to enforce joint low-rank subspace and sparsity constraints. This basis estimation strategy is the new feature proposed for subspace-based reconstruction in this work to achieve high temporal resolution (e.g., sub-second/3D volume). It does not require sequence modification to acquire additional navigation data, is compatible with commercially available stack-of-stars sequences, and does not need an intermediate reconstruction step. The proposed 4D real-time MRI approach was tested in an abdominal motion phantom, free-breathing abdominal MRI, and dynamic contrast-enhanced MRI (DCE-MRI). With the ability to acquire each 3D image in less than one second, intra-frame respiratory blurring can be intrinsically reduced for body applications with our approach, which also eliminates the need for motion detection and motion compensation.
Submitted 10 August, 2022;
originally announced August 2022.
-
Silicon photonic devices for scalable quantum information applications
Authors:
Lantian Feng,
Ming Zhang,
Jianwei Wang,
Xiaoqi Zhou,
Xiaogang Qiang,
Guangcan Guo,
Xifeng Ren
Abstract:
With high integration density and excellent optical properties, silicon photonics is becoming a promising platform for complete integration and large-scale optical quantum information processing. Scalable quantum information applications need photon generation and detection to be integrated on the same chip, and we have seen that various devices on the silicon photonic chip have been developed for this goal. This paper reviews the relevant research results and state-of-the-art technologies on the silicon photonic chip for scalable quantum applications. Despite the shortcomings, properties of some components have already met the requirements for further expansion. Furthermore, we point out the challenges ahead and further research directions for on-chip scalable quantum information applications.
Submitted 9 August, 2022;
originally announced August 2022.
-
A Uniform Convergent Petrov-Galerkin method for a Class of Turning Point Problems
Authors:
Li Feng,
Zhongyi Huang
Abstract:
In this paper, we propose a numerical method for turning point problems in one dimension based on the Petrov-Galerkin finite element method (PGFEM). We first give an a priori estimate for the turning point problem with a single boundary turning point. Then we use PGFEM to solve it, where test functions are the solutions to piecewise approximate dual problems. We prove that our method has a first-order convergence rate in both the $L^\infty$ norm and an energy norm when we select the exact solutions to dual problems as test functions. Numerical results show that our scheme is efficient for turning point problems with different types of singularities, and the convergence coincides with our theoretical results.
Submitted 8 August, 2022;
originally announced August 2022.
-
GANDSE: Generative Adversarial Network based Design Space Exploration for Neural Network Accelerator Design
Authors:
Lang Feng,
Wenjian Liu,
Chuliang Guo,
Ke Tang,
Cheng Zhuo,
Zhongfeng Wang
Abstract:
With the popularity of deep learning, hardware implementation platforms for deep learning have received increasing interest. Unlike general-purpose devices, e.g., CPUs or GPUs, where the deep learning algorithms are executed at the software level, neural network hardware accelerators directly execute the algorithms to achieve both higher energy efficiency and higher performance. However, as deep learning algorithms evolve frequently, the engineering effort and cost of designing the hardware accelerators are greatly increased. To improve the design quality while saving cost, design automation for neural network accelerators was proposed, where design space exploration algorithms are used to automatically search for an optimized accelerator design within a design space. Nevertheless, the increasing complexity of neural network accelerators adds dimensions to the design space. As a result, the previous design space exploration algorithms are no longer effective enough to find an optimized design. In this work, we propose a neural network accelerator design automation framework named GANDSE, where we rethink the problem of design space exploration and propose a novel approach based on the generative adversarial network (GAN) to support optimized exploration of large, high-dimensional design spaces. The experiments show that GANDSE is able to find more optimized designs in negligible time compared with approaches including multilayer perceptron and deep reinforcement learning.
Submitted 19 November, 2022; v1 submitted 1 August, 2022;
originally announced August 2022.
-
Flux Variations of Cosmic Ray Air Showers Detected by LHAASO-KM2A During a Thunderstorm on 10 June 2021
Authors:
LHAASO Collaboration,
F. Aharonian,
Q. An,
Axikegu,
L. X. Bai,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Zhe Cao,
Zhen Cao,
J. Chang,
J. F. Chang,
E. S. Chen,
Liang Chen,
Liang Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
S. H. Chen,
S. Z. Chen,
T. L. Chen,
X. J. Chen
, et al. (248 additional authors not shown)
Abstract:
The Large High Altitude Air Shower Observatory (LHAASO) has three sub-arrays: KM2A, WCDA and WFCTA. The flux variations of cosmic ray air showers were studied by analyzing the KM2A data during the thunderstorm on 10 June 2021. The number of shower events that meet the trigger conditions increases significantly in atmospheric electric fields, with a maximum fractional increase of 20%. The variations of trigger rates (increases or decreases) are found to be strongly dependent on the primary zenith angle. The flux of secondary particles increases significantly, following a trend similar to that of the shower events. To better understand the observed behavior, Monte Carlo simulations are performed with CORSIKA and G4KM2A (a code based on GEANT4). We find that the experimental data (in saturated negative fields) are in good agreement with simulations, assuming the presence of a uniform upward electric field of 700 V/cm with a thickness of 1500 m in the atmosphere above the observation level. Due to the acceleration/deceleration and deflection by the atmospheric electric field, the number of secondary particles with energy above the detector threshold is modified, resulting in changes in the shower detection rate.
Submitted 6 December, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
The FASER Detector
Authors:
FASER Collaboration,
Henso Abreu,
Elham Amin Mansour,
Claire Antel,
Akitaka Ariga,
Tomoko Ariga,
Florian Bernlochner,
Tobias Boeckh,
Jamie Boyd,
Lydia Brenner,
Franck Cadoux,
David W. Casper,
Charlotte Cavanagh,
Xin Chen,
Andrea Coccaro,
Olivier Crespo-Lopez,
Stephane Debieux,
Monica D'Onofrio,
Liam Dougherty,
Candan Dozen,
Abdallah Ezzat,
Yannick Favre,
Deion Fellers,
Jonathan L. Feng,
Didier Ferrere
, et al. (72 additional authors not shown)
Abstract:
FASER, the ForwArd Search ExpeRiment, is an experiment dedicated to searching for light, extremely weakly-interacting particles at CERN's Large Hadron Collider (LHC). Such particles may be produced in the very forward direction of the LHC's high-energy collisions and then decay to visible particles inside the FASER detector, which is placed 480 m downstream of the ATLAS interaction point, aligned with the beam collision axis. FASER also includes a sub-detector, FASER$ν$, designed to detect neutrinos produced in the LHC collisions and to study their properties. In this paper, each component of the FASER detector is described in detail, as well as the installation of the experiment system and its commissioning using cosmic rays collected in September 2021 and during the LHC pilot beam test carried out in October 2021. FASER will start taking LHC collision data in 2022, and will run throughout LHC Run 3.
Submitted 23 July, 2022;
originally announced July 2022.
-
TaDaa: real time Ticket Assignment Deep learning Auto Advisor for customer support, help desk, and issue ticketing systems
Authors:
Leon Feng,
Jnana Senapati,
Bill Liu
Abstract:
This paper proposes TaDaa: Ticket Assignment Deep learning Auto Advisor, which leverages the latest Transformer models and machine learning techniques to quickly assign issues within an organization, such as in customer support, help desk, and similar issue ticketing systems. The project provides functionality to 1) assign an issue to the correct group, 2) assign an issue to the best resolver, and 3) provide the most relevant previously solved tickets to resolvers. We leverage one ticketing system sample dataset, with over 3k groups and over 10k resolvers, to obtain a 95.2% top-3 accuracy on group suggestions and a 79.0% top-5 accuracy on resolver suggestions. We hope this research will greatly improve average issue resolution time on customer support, help desk, and issue ticketing systems.
Submitted 20 March, 2023; v1 submitted 18 July, 2022;
originally announced July 2022.
-
ProMix: Combating Label Noise via Maximizing Clean Sample Utility
Authors:
Ruixuan Xiao,
Yiwen Dong,
Haobo Wang,
Lei Feng,
Runze Wu,
Gang Chen,
Junbo Zhao
Abstract:
Learning with Noisy Labels (LNL) has become an appealing topic, as imperfectly annotated data are relatively cheaper to obtain. Recent state-of-the-art approaches employ specific selection mechanisms to separate clean and noisy samples and then apply Semi-Supervised Learning (SSL) techniques for improved performance. However, the selection step mostly provides a medium-sized and decent-enough clean subset, which overlooks a rich set of clean samples. To fulfill this, we propose a novel LNL framework ProMix that attempts to maximize the utility of clean samples for boosted performance. Key to our method, we propose a matched high confidence selection technique that selects those examples with high confidence scores and matched predictions with given labels to dynamically expand a base clean sample set. To overcome the potential side effect of excessive clean set selection procedure, we further devise a novel SSL framework that is able to train balanced and unbiased classifiers on the separated clean and noisy samples. Extensive experiments demonstrate that ProMix significantly advances the current state-of-the-art results on multiple benchmarks with different types and levels of noise. It achieves an average improvement of 2.48\% on the CIFAR-N dataset. The code is available at https://github.com/Justherozen/ProMix
Submitted 3 August, 2023; v1 submitted 20 July, 2022;
originally announced July 2022.
-
Direct measurement of vorticity using tracer particles with internal markers
Authors:
Jiaqi Li,
Lei Feng,
Chinmayee Panigrahi,
Jiarong Hong
Abstract:
Current experiment techniques for vorticity measurement suffer from limited spatial and temporal resolution to resolve the small-scale eddy dynamics in turbulence. In this study, we develop a new method for direct vorticity measurement in fluid flows based on digital inline holography (DIH). The DIH system utilizes a collimated laser beam to illuminate the tracers with internal markers and a digital sensor to record the generated holograms. The tracers made of the polydimethylsiloxane (PDMS) prepolymer mixed with internal markers are fabricated using a standard microfluidic droplet generator. A rotation measurement algorithm is developed based on the 3D location reconstruction and tracking of the internal markers and is assessed through synthetic holograms to identify the optimal parameter settings and measurement range (e.g., rotation rate from 0.3 to 0.7 rad/frame under numerical aperture of imaging of 0.25). Our proposed method based on DIH is evaluated by a calibration experiment of single tracer rotation, which yields the same optimal measurement range. Using von Kármán swirling flow setup, we further demonstrate the capability of the approach to simultaneously measure the Lagrangian rotation and translation of multiple tracers. Our method can measure vorticity in a small region on the order of 100 $μ$m or less and can be potentially used to quantify the Kolmogorov-scale vorticity field in turbulent flows.
Submitted 20 July, 2022; v1 submitted 18 July, 2022;
originally announced July 2022.
-
Cosmic ray-driven bioenergetics for Life in Molecular Clouds and the Origin of Chemiosmosis
Authors:
Lei Feng
Abstract:
Some models, such as the Nebula-Relay hypothesis, predict that the ancestors of Earth's life once lived in molecular clouds. Where does the energy come from for creatures in molecular clouds? In this draft, we propose a new bioenergetic mechanism that is driven by the cosmic ray ionization of hydrogen molecules. Protons are naturally produced in this scenario, which may be the origin of chemiosmosis. Based on this bioenergetic mechanism, we speculate that LUCA is one type of biological hydrogen microbe.
Submitted 21 November, 2022; v1 submitted 26 June, 2022;
originally announced June 2022.
-
Neutrino Detection without Neutrino Detectors: Discovering Collider Neutrinos at FASER with Electronic Signals Only
Authors:
Jason Arakawa,
Jonathan L. Feng,
Ahmed Ismail,
Felix Kling,
Michael Waterbury
Abstract:
The detection of collider neutrinos will provide new insights about neutrino production, propagation, and interactions at TeV energies, the highest human-made energies ever observed. During Run 3 of the LHC, the FASER experiment is expected to detect roughly $10^4$ collider neutrinos using its emulsion-based neutrino detector FASER$ν$. In this study, we show that, even without processing the emulsion data, low-level input provided by the electronic detector components of FASER and FASER$ν$ will be able to establish a $5σ$ discovery of collider neutrinos with as little as $5~\text{fb}^{-1}$ of integrated luminosity. These results foreshadow the possible early discovery of collider neutrinos in LHC Run 3.
Submitted 20 June, 2022;
originally announced June 2022.
-
Towards Better Selective Classification
Authors:
Leo Feng,
Mohamed Osama Ahmed,
Hossein Hajimirsadeghi,
Amir Abdi
Abstract:
We tackle the problem of Selective Classification where the objective is to achieve the best performance on a predetermined ratio (coverage) of the dataset. Recent state-of-the-art selective methods come with architectural changes either via introducing a separate selection head or an extra abstention logit. In this paper, we challenge the aforementioned methods. The results suggest that the superior performance of state-of-the-art methods is owed to training a more generalizable classifier rather than their proposed selection mechanisms. We argue that the best performing selection mechanism should instead be rooted in the classifier itself. Our proposed selection strategy uses the classification scores and achieves better results by a significant margin, consistently, across all coverages and all datasets, without any added compute cost. Furthermore, inspired by semi-supervised learning, we propose an entropy-based regularizer that improves the performance of selective classification methods. Our proposed selection mechanism with the proposed entropy-based regularizer achieves new state-of-the-art results.
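The idea of selecting samples from the classifier's own scores at a fixed coverage can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the selection score is the maximum predicted class probability, and the function name `selective_accuracy` and the toy inputs are hypothetical.

```python
import math

def selective_accuracy(probs, labels, coverage):
    """Accuracy on the most confident `coverage` fraction of samples.

    Assumption (not stated in the abstract): confidence is the maximum
    class probability produced by the classifier itself.
    """
    scored = []
    for p, y in zip(probs, labels):
        pred = max(range(len(p)), key=lambda k: p[k])  # predicted class
        scored.append((p[pred], pred == y))            # (confidence, correct?)
    # Keep the highest-confidence samples up to the requested coverage.
    scored.sort(key=lambda t: t[0], reverse=True)
    keep = max(1, math.ceil(coverage * len(scored)))
    kept = scored[:keep]
    return sum(1 for _, ok in kept if ok) / keep

# Toy example: four samples, two classes.
probs = [[0.9, 0.1], [0.6, 0.4], [0.2, 0.8], [0.55, 0.45]]
labels = [0, 1, 1, 1]
half_coverage_accuracy = selective_accuracy(probs, labels, 0.5)
```

Lowering the coverage abstains on the least confident samples, so accuracy on the retained subset typically rises when confidence is well ordered.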
Submitted 1 March, 2023; v1 submitted 17 June, 2022;
originally announced June 2022.
-
Logic-based Reward Shaping for Multi-Agent Reinforcement Learning
Authors:
Ingy ElSayed-Aly,
Lu Feng
Abstract:
Reinforcement learning (RL) relies heavily on exploration to learn from its environment and maximize observed rewards. Therefore, it is essential to design a reward function that guarantees optimal learning from the received experience. Previous work has combined automata and logic based reward shaping with environment assumptions to provide an automatic mechanism to synthesize the reward function based on the task. However, there is limited work on how to expand logic-based reward shaping to Multi-Agent Reinforcement Learning (MARL). The environment will need to consider the joint state in order to keep track of other agents if the task requires cooperation, thus suffering from the curse of dimensionality with respect to the number of agents. This project explores how logic-based reward shaping for MARL can be designed for different scenarios and tasks. We present a novel method for semi-centralized logic-based MARL reward shaping that is scalable in the number of agents and evaluate it in multiple scenarios.
Submitted 17 June, 2022;
originally announced June 2022.
-
Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets
Authors:
Hongxin Wei,
Lue Tao,
Renchunzi Xie,
Lei Feng,
Bo An
Abstract:
Deep neural networks usually perform poorly when the training dataset suffers from extreme class imbalance. Recent studies found that directly training with out-of-distribution data (i.e., open-set samples) in a semi-supervised manner would harm the generalization performance. In this work, we theoretically show that out-of-distribution data can still be leveraged to augment the minority classes from a Bayesian perspective. Based on this motivation, we propose a novel method called Open-sampling, which utilizes open-set noisy labels to re-balance the class priors of the training dataset. For each open-set instance, the label is sampled from our pre-defined distribution that is complementary to the distribution of original class priors. We empirically show that Open-sampling not only re-balances the class priors but also encourages the neural network to learn separable representations. Extensive experiments demonstrate that our proposed method significantly outperforms existing data re-balancing methods and can boost the performance of existing state-of-the-art methods.
Submitted 5 July, 2022; v1 submitted 17 June, 2022;
originally announced June 2022.
-
Infrared Radiation of Graphene Electrothermal Film Triggered Alpha and Theta Brainwaves
Authors:
Yanghua Lu,
Renyu Yang,
Yue Dai,
Deyi Yuan,
Xutao Yu,
Chang Liu,
Lixuan Feng,
Runjiang Shen,
Can Wang,
Shenyi Dai,
Shisheng Lin
Abstract:
The alpha and theta frequency brainwave activity in Electroencephalogram (EEG) signal has been correlated with attention, inhibitory processes, memory, perceptual abilities, and sleep. The enhanced alpha and theta brainwave activity may bring positive behavioral modifications such as promoting creativity and a quick sleep. Herein, we discover that infrared radiation from multilayer graphene electrothermal film can obviously promote the appearance of alpha and theta brainwave in human mind. In particular, the occurrence frequency of the alpha and theta waves in EEG can be effectively enhanced up to 2.3 and 3.0 times, respectively. And the duration time of the alpha and theta waves in EEG can also be effectively extended. The mechanism may be attributed to the efficient infrared radiation caused by graphene mainly focused on the range from 7 to 14 micron, coinciding with the radiation wavelength of natural human body, which can be effectively absorbed by the human skin and speed up the blood microcirculation and metabolism. The comparative effect of different working temperature and heating materials such as water, Cu and even monolayer graphene are systematically investigated, indicating the infrared radiation from the multilayer graphene electrothermal film at 50 degrees has the largest enhancement effect of alpha and theta brainwaves. The multilayer graphene film electrical heater represents a convenient and surprising way for triggering the alpha and theta brainwaves, which has many potential applications in the area of enlarged health cerements.
Submitted 14 June, 2022;
originally announced June 2022.
-
Experiments and Facilities for Accelerator-Based Dark Sector Searches
Authors:
Philip Ilten,
Nhan Tran,
Patrick Achenbach,
Akitaka Ariga,
Tomoko Ariga,
Marco Battaglieri,
Jianming Bian,
Pietro Bisio,
Andrea Celentano,
Matthew Citron,
Paolo Crivelli,
Giovanni de Lellis,
Antonia Di Crescenzo,
Milind Diwan,
Jonathan L. Feng,
Corrado Gatto,
Stefania Gori,
Felix Kling,
Luca Marsicano,
Simone M. Mazza,
Josh McFayden,
Laura Molina-Bueno,
Marco Spreafico,
Natalia Toro,
Matthew Toups
, et al. (5 additional authors not shown)
Abstract:
This paper provides an overview of experiments and facilities for accelerator-based dark matter searches as part of the US Community Study on the Future of Particle Physics (Snowmass 2021). Companion white papers to this paper present the physics drivers: thermal dark matter, visible dark portals, and new flavors and rich dark sectors.
Submitted 8 June, 2022;
originally announced June 2022.
-
Human-AI Shared Control via Policy Dissection
Authors:
Quanyi Li,
Zhenghao Peng,
Haibin Wu,
Lan Feng,
Bolei Zhou
Abstract:
Human-AI shared control allows humans to interact and collaborate with AI to accomplish control tasks in complex environments. Previous Reinforcement Learning (RL) methods attempt the goal-conditioned design to achieve human-controllable policies at the cost of redesigning the reward function and training paradigm. Inspired by the neuroscience approach to investigate the motor cortex in primates, we develop a simple yet effective frequency-based approach called Policy Dissection to align the intermediate representation of the learned neural controller with the kinematic attributes of the agent behavior. Without modifying the neural controller or retraining the model, the proposed approach can convert a given RL-trained policy into a human-interactive policy. We evaluate the proposed approach on the RL tasks of autonomous driving and locomotion. The experiments show that human-AI shared control achieved by Policy Dissection in the driving task can substantially improve the performance and safety in unseen traffic scenes. With a human in the loop, the locomotion robots also exhibit versatile controllable motion skills even though they are only trained to move forward. Our results suggest the promising direction of implementing human-AI shared autonomy through interpreting the learned representation of the autonomous agents. Demo video and code will be made available at https://metadriverse.github.io/policydissect.
Submitted 2 March, 2023; v1 submitted 31 May, 2022;
originally announced June 2022.
-
StarGraph: Knowledge Representation Learning based on Incomplete Two-hop Subgraph
Authors:
Hongzhu Li,
Xiangrui Gao,
Linhui Feng,
Yafeng Deng,
Yuhui Yin
Abstract:
Conventional representation learning algorithms for knowledge graphs (KG) map each entity to a unique embedding vector, ignoring the rich information contained in the neighborhood. We propose a method named StarGraph, which gives a novel way to utilize the neighborhood information for large-scale knowledge graphs to obtain entity representations. An incomplete two-hop neighborhood subgraph for each target node is first generated, then processed by a modified self-attention network to obtain the entity representation, which is used to replace the entity embedding in conventional methods. We achieved SOTA performance on ogbl-wikikg2 and got competitive results on fb15k-237. The experimental results prove that StarGraph is efficient in parameters, and the improvement made on ogbl-wikikg2 demonstrates its effectiveness for representation learning on large-scale knowledge graphs. The code is now available at https://github.com/hzli-ucas/StarGraph.
Submitted 3 January, 2023; v1 submitted 27 May, 2022;
originally announced May 2022.
-
Balancing Exploration and Exploitation for Solving Large-scale Multiobjective Optimization via Attention Mechanism
Authors:
Haokai Hong,
Min Jiang,
Liang Feng,
Qiuzhen Lin,
Kay Chen Tan
Abstract:
Large-scale multiobjective optimization problems (LSMOPs) refer to optimization problems with multiple conflicting optimization objectives and hundreds or even thousands of decision variables. A key point in solving LSMOPs is how to balance exploration and exploitation so that the algorithm can search in a huge decision space efficiently. Large-scale multiobjective evolutionary algorithms consider the balance between exploration and exploitation from the individual's perspective. However, these algorithms ignore the significance of tackling this issue from the perspective of decision variables, which leaves the algorithm unable to search from different dimensions and limits its performance. In this paper, we propose a large-scale multiobjective optimization algorithm based on the attention mechanism, called LMOAM. The attention mechanism will assign a unique weight to each decision variable, and LMOAM will use this weight to strike a balance between exploration and exploitation at the decision variable level. Experiments on nine different sets of LSMOP benchmarks are conducted to verify the algorithm proposed in this paper, and the experimental results validate the effectiveness of our design.
Submitted 20 May, 2022;
originally announced May 2022.
-
Mitigating Neural Network Overconfidence with Logit Normalization
Authors:
Hongxin Wei,
Renchunzi Xie,
Hao Cheng,
Lei Feng,
Bo An,
Yixuan Li
Abstract:
Detecting out-of-distribution inputs is critical for safe deployment of machine learning models in the real world. However, neural networks are known to suffer from the overconfidence issue, where they produce abnormally high confidence for both in- and out-of-distribution inputs. In this work, we show that this issue can be mitigated through Logit Normalization (LogitNorm) -- a simple fix to the cross-entropy loss -- by enforcing a constant vector norm on the logits in training. Our method is motivated by the analysis that the norm of the logit keeps increasing during training, leading to overconfident output. Our key idea behind LogitNorm is thus to decouple the influence of output's norm during network optimization. Trained with LogitNorm, neural networks produce highly distinguishable confidence scores between in- and out-of-distribution data. Extensive experiments demonstrate the superiority of LogitNorm, reducing the average FPR95 by up to 42.30% on common benchmarks.
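The idea of enforcing a constant norm on the logits inside the cross-entropy loss can be sketched as below. This is a minimal single-sample illustration, not the authors' code: the temperature `tau` (which sets the constant norm to `1 / tau`), its default value, the `eps` stabilizer, and the function name are all assumptions introduced here.

```python
import math

def logit_norm_cross_entropy(logits, label, tau=0.04, eps=1e-7):
    """Cross-entropy computed on L2-normalized logits.

    Assumptions (hypothetical, not from the abstract): `tau` is a
    temperature so the rescaled logit vector has norm about 1 / tau,
    and `eps` guards against division by zero.
    """
    norm = math.sqrt(sum(z * z for z in logits)) + eps
    scaled = [z / (norm * tau) for z in logits]
    # Numerically stable log-sum-exp over the rescaled logits.
    m = max(scaled)
    log_sum_exp = m + math.log(sum(math.exp(s - m) for s in scaled))
    return log_sum_exp - scaled[label]

# Rescaling the raw logits leaves the loss (almost) unchanged, so the
# network cannot lower the loss merely by inflating the logit norm.
loss_small = logit_norm_cross_entropy([2.0, 1.0, -1.0], label=0)
loss_large = logit_norm_cross_entropy([20.0, 10.0, -10.0], label=0)
```

Because only the direction of the logit vector affects this loss, growth of the logit magnitude during training no longer translates into higher softmax confidence.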
Submitted 24 June, 2022; v1 submitted 18 May, 2022;
originally announced May 2022.
-
Prospects for Detecting the Diffuse Supernova Neutrino Background with JUNO
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Antonio Bergnoli,
Thilo Birkenfeld,
Sylvie Blin
, et al. (577 additional authors not shown)
Abstract:
We present the detection potential for the diffuse supernova neutrino background (DSNB) at the Jiangmen Underground Neutrino Observatory (JUNO), using the inverse-beta-decay (IBD) detection channel on free protons. We employ the latest information on the DSNB flux predictions, and investigate in detail the background and its reduction for the DSNB search at JUNO. The atmospheric neutrino induced neutral current (NC) background turns out to be the most critical background, whose uncertainty is carefully evaluated from both the spread of model predictions and an envisaged \textit{in situ} measurement. We also make a careful study on the background suppression with the pulse shape discrimination (PSD) and triple coincidence (TC) cuts. With latest DSNB signal predictions, more realistic background evaluation and PSD efficiency optimization, and additional TC cut, JUNO can reach the significance of 3$σ$ for 3 years of data taking, and achieve better than 5$σ$ after 10 years for a reference DSNB model. In the pessimistic scenario of non-observation, JUNO would strongly improve the limits and exclude a significant region of the model parameter space.
Submitted 13 October, 2022; v1 submitted 18 May, 2022;
originally announced May 2022.
-
Mass Testing and Characterization of 20-inch PMTs for JUNO
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
Joao Pedro Athayde Marcondes de Andre,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Antonio Bergnoli
, et al. (541 additional authors not shown)
Abstract:
The main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3% at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and ran for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K.K. (HPK).
Submitted 17 September, 2022; v1 submitted 17 May, 2022;
originally announced May 2022.
-
Learning effective dynamics from data-driven stochastic systems
Authors:
Lingyu Feng,
Ting Gao,
Min Dai,
Jinqiao Duan
Abstract:
Multiscale stochastic dynamical systems have been widely adopted to a variety of scientific and engineering problems due to their capability of depicting complex phenomena in many real world applications. This work is devoted to investigating the effective dynamics for slow-fast stochastic dynamical systems. Given observation data on a short-term period satisfying some unknown slow-fast stochastic systems, we propose a novel algorithm including a neural network called Auto-SDE to learn invariant slow manifold. Our approach captures the evolutionary nature of a series of time-dependent autoencoder neural networks with the loss constructed from a discretized stochastic differential equation. Our algorithm is also validated to be accurate, stable and effective through numerical experiments under various evaluation metrics.
Submitted 29 December, 2023; v1 submitted 9 May, 2022;
originally announced May 2022.
-
Spatially Resolved Moving Radio Burst in Association with an EUV Wave
Authors:
Lei Lu,
Li Feng,
Weiqun Gan
Abstract:
Coronal mass ejections (CMEs) are large clouds of magnetized plasma ejected from the Sun, and are often associated with acceleration of electrons that can result in radio emission via various mechanisms. However, the underlying mechanism relating the CMEs and particle acceleration still remains a subject of heated debate. Here, we report multi-instrument radio and extreme ultraviolet (EUV) imaging of a solar eruption event on 24 September 2011. We determine the emission mechanism of a moving radio burst, identify its three-dimensional (3D) location with respect to a rapidly expanding EUV wave, and find evidence for CME shocks that produce quasiperiodic acceleration of electron beams.
Submitted 6 May, 2022;
originally announced May 2022.
-
Asymptotic Independence of the Sum and Maximum of Dependent Random Variables with Applications to High-Dimensional Tests
Authors:
Long Feng,
Tiefeng Jiang,
Xiaoyun Li,
Binghui Liu
Abstract:
For a set of dependent random variables, without stationarity or strong mixing assumptions, we derive the asymptotic independence between their sums and maxima. Then we apply this result to high-dimensional testing problems, where we combine the sum-type and max-type tests and propose a novel test procedure for the one-sample mean test, the two-sample mean test and the regression coefficient test in the high-dimensional setting. Based on the asymptotic independence between sums and maxima, the asymptotic distributions of test statistics are established. Simulation studies show that our proposed tests perform well regardless of whether the data are sparse. Examples on real data are also presented to demonstrate the advantages of our proposed methods.
Submitted 11 May, 2022; v1 submitted 3 May, 2022;
originally announced May 2022.
-
Every signed planar graph without cycles of length from 4 to 7 is 3-colorable
Authors:
Lan Kaiyang,
Liu Feng
Abstract:
Hu and Li investigate the signed graph version of Erdős' problem: Is there a constant $c$ such that every signed planar graph without $k$-cycles, where $4\leq k\leq c$, is $3$-colorable? They prove that each signed planar graph without cycles of length from 4 to 8 is 3-colorable. We give a very short and simple proof of this result and improve it, based on a recent observation.
Submitted 3 May, 2022; v1 submitted 28 April, 2022;
originally announced May 2022.
-
Computationally efficient and data-adaptive changepoint inference in high dimension
Authors:
Guanghui Wang,
Long Feng
Abstract:
High-dimensional changepoint inference that adapts to various change patterns has received much attention recently. We propose a simple, fast yet effective approach for adaptive changepoint testing. The key observation is that two statistics based on aggregating cumulative sum statistics over all dimensions and possible changepoints by taking their maximum and summation, respectively, are asymptotically independent under some mild conditions. Hence we are able to form a new test by combining the p-values of the maximum- and summation-type statistics according to their limit null distributions. To this end, we develop new tools and techniques to establish asymptotic distribution of the maximum-type statistic under a more relaxed condition on componentwise correlations among all variables than that in existing literature. The proposed method is simple to use and computationally efficient. It is adaptive to different sparsity levels of change signals, and is comparable to or even outperforms existing approaches as revealed by our numerical studies.
Submitted 2 May, 2022;
originally announced May 2022.
-
Sub-percent Precision Measurement of Neutrino Oscillation Parameters with JUNO
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato
, et al. (581 additional authors not shown)
Abstract:
JUNO is a multi-purpose neutrino observatory under construction in the south of China. This publication presents new sensitivity estimates for the measurement of the $Δm^2_{31}$, $Δm^2_{21}$, $\sin^2 θ_{12}$, and $\sin^2 θ_{13}$ oscillation parameters using reactor antineutrinos, which is one of the primary physics goals of the experiment. The sensitivities are obtained using the best knowledge available to date on the location and overburden of the experimental site, the nuclear reactors in the surrounding area and beyond, the detector response uncertainties, and the reactor antineutrino spectral shape constraints expected from the TAO satellite detector. It is found that the $Δm^2_{31}$, $Δm^2_{21}$, and $\sin^2 θ_{12}$ oscillation parameters will be determined to better than 0.5% precision in six years of data collection, which represents approximately an order of magnitude improvement over existing constraints.
Submitted 27 April, 2022;
originally announced April 2022.
-
Toward Policy Explanations for Multi-Agent Reinforcement Learning
Authors:
Kayla Boggess,
Sarit Kraus,
Lu Feng
Abstract:
Advances in multi-agent reinforcement learning (MARL) enable sequential decision making for a range of exciting multi-agent applications such as cooperative AI and autonomous driving. Explaining agent decisions is crucial for improving system transparency, increasing user satisfaction, and facilitating human-agent collaboration. However, existing works on explainable reinforcement learning mostly focus on the single-agent setting and are not suitable for addressing challenges posed by multi-agent environments. We present novel methods to generate two types of policy explanations for MARL: (i) policy summarization about the agent cooperation and task sequence, and (ii) language explanations to answer queries about agent behavior. Experimental results on three MARL domains demonstrate the scalability of our methods. A user study shows that the generated explanations significantly improve user performance and increase subjective ratings on metrics such as user satisfaction.
Submitted 23 May, 2022; v1 submitted 26 April, 2022;
originally announced April 2022.
-
Asymptotic Independence of the Quadratic form and Maximum of Independent Random Variables with Applications to High-Dimensional Tests
Authors:
Dachuan Chen,
Decai Liang,
Long Feng
Abstract:
This paper establishes the asymptotic independence between the quadratic form and maximum of a sequence of independent random variables. Based on this theoretical result, we find the asymptotic joint distribution of the quadratic form and maximum, which can be applied to high-dimensional testing problems. By combining the sum-type test and the max-type test, we propose Fisher's combination tests for the one-sample mean test and the two-sample mean test. Under this novel general framework, several strong assumptions in the existing literature are relaxed. Monte Carlo simulations show that our proposed tests are strongly robust to both sparse and dense data.
Submitted 2 August, 2023; v1 submitted 18 April, 2022;
originally announced April 2022.
-
Rank Based Tests for High Dimensional White Noise
Authors:
Dachuan Chen,
Fengyi Song,
Long Feng
Abstract:
The development of high-dimensional white noise tests is important in both statistical theory and applications, where the dimension of the time series can be comparable to or exceed the length of the time series. This paper proposes several distribution-free tests that use rank based statistics to test for high-dimensional white noise; they are robust to heavy tails and do not require finite-order moment assumptions on the sample distributions. Three families of rank based tests are analyzed in this paper, including the simple linear rank statistics, non-degenerate U-statistics and degenerate U-statistics. The asymptotic null distributions and rate optimality are established for each family of these tests. Among these tests, the test based on degenerate U-statistics can also detect non-linear and non-monotone relationships in the autocorrelations. Moreover, this is the first result on the asymptotic distributions of rank correlation statistics that allows for cross-sectional dependence in high dimensional data.
Submitted 19 July, 2023; v1 submitted 18 April, 2022;
originally announced April 2022.
-
Stability of China's Stock Market: Measure and Forecast by Ricci Curvature on Network
Authors:
Xinyu Wang,
Liang Zhao,
Ning Zhang,
Liu Feng,
Haibo Lin
Abstract:
The systemic stability of a stock market is one of the core issues in the financial field. The market can be regarded as a complex network whose nodes are stocks connected by edges that signify their correlation strength. Since the market is a strongly nonlinear system, it is difficult to measure the macroscopic stability and depict market fluctuations in time. In this paper, we use a geometric measure derived from discrete Ricci curvature to capture the higher-order nonlinear architecture of financial networks. To confirm the effectiveness of our method, we use it to analyze the CSI 300 constituents of China's stock market from 2005 to 2020, and the systemic stability of the market is quantified through the network's Ricci type curvatures. Furthermore, we use a hybrid model to analyze the curvature time series and predict the future trends of the market accurately. As far as we know, this is the first paper to apply Ricci curvature to forecast the systemic stability of the domestic stock market, and our results show that Ricci curvature has good explanatory power for market stability and can be a good indicator of the future risk and volatility of the domestic market.
Submitted 13 April, 2022;
originally announced April 2022.
-
NMSSM neutralino dark matter for CDF II $W$-boson mass and muon $g-2$ and the promising prospect of direct detection
Authors:
Tian-Peng Tang,
Murat Abdughani,
Lei Feng,
Yue-Lin Sming Tsai,
Jian Wu,
Yi-Zhong Fan
Abstract:
Two experiments at Fermilab, E989 and CDF II, have reported anomalies in the muon $g-2$ and the $W$-boson mass that may indicate new physics at the low energy scale. Here we examine the possibility of a common origin of these two anomalies in the Next-to-Minimal Supersymmetric Standard Model. Considering various experimental and astrophysical constraints such as the Higgs mass, collider data, flavor physics, dark matter relic density, and direct detection experiments, we find that lighter electroweakinos and sleptons can generate sufficient contributions to muon $g-2$ and $m_W$. Moreover, the corresponding bino-like neutralino dark matter mass is in the $\sim 180-280$ GeV range. Interestingly, the favored DM mass region can soon be entirely probed by ongoing direct detection experiments like PandaX-4T, XENONnT, LUX-ZEPLIN, and DARWIN.
Submitted 21 February, 2023; v1 submitted 8 April, 2022;
originally announced April 2022.
-
Is the $W$-boson mass enhanced by the axion-like particle, dark photon, or chameleon dark energy?
Authors:
Guan-Wen Yuan,
Lei Zu,
Lei Feng,
Yi-Fu Cai,
Yi-Zhong Fan
Abstract:
The $W$-boson mass ($m_{W}=80.4335 \pm 0.0094 \mathrm{GeV}$) measured by the Collider Detector at Fermilab collaboration is greater than the standard model (SM) prediction at a confidence level of $7σ$, strongly suggesting the presence of new particles or fields. In the literature, various new particles and/or fields have been introduced to explain the astrophysical and experimental data, and their presence, in principle, may also enhance the $W$-boson mass. In this study, we investigate axion-like particle (ALP), dark photon (DP), and chameleon dark energy (DE) models for a solution to the $W$-boson mass excess. We find that the ALP and DP interpretations have been significantly narrowed down by global electroweak fits. The possibility of attributing the $W$-boson mass anomaly to the chameleon DE is ruled out by other experiments.
Submitted 3 October, 2022; v1 submitted 8 April, 2022;
originally announced April 2022.
-
An Instrumented Wheel-On-Limb System of Planetary Rovers for Wheel-Terrain Interactions: System Conception and Preliminary Design
Authors:
Lihang Feng,
Xu Jiang,
Aiguo Song
Abstract:
Understanding the wheel-terrain interaction is of great importance for improving the maneuverability and traversability of rovers. A well-developed sensing device carried by the rover would greatly facilitate complex risk-reducing operations on sandy terrains. In this paper, an instrumented wheel-on-limb (WOL) system of planetary rovers for wheel-terrain interaction characterization is presented. Acting as a passive suspension for the wheel, the WOL system follows the terrain contour and keeps the wheel lowered onto the ground during rover motion, including climbing and descending; it also deploys and places the wheel on the ground before a drive command. The system concept, functional requirements, and pre-design work, as well as the system integration, are presented.
Submitted 6 April, 2022;
originally announced April 2022.
-
Learning Optimal K-space Acquisition and Reconstruction using Physics-Informed Neural Networks
Authors:
Wei Peng,
Li Feng,
Guoying Zhao,
Fang Liu
Abstract:
The inherently slow imaging speed of Magnetic Resonance Imaging (MRI) has spurred the development of various acceleration methods, typically through heuristically undersampling the MRI measurement domain known as k-space. Recently, deep neural networks have been applied to reconstruct undersampled k-space data and have shown improved reconstruction performance. While most of these methods focus on designing novel reconstruction networks or new training strategies for a given undersampling pattern, e.g., Cartesian undersampling or Non-Cartesian sampling, to date there is limited research aiming to learn and optimize k-space sampling strategies using deep neural networks. This work proposes a novel optimization framework to learn k-space sampling trajectories by treating trajectory design as an Ordinary Differential Equation (ODE) problem that can be solved using a neural ODE. In particular, the sampling of k-space data is framed as a dynamic system, in which a neural ODE is formulated to approximate the system with additional constraints on MRI physics. In addition, we demonstrate that trajectory optimization and image reconstruction can be learned collaboratively for improved imaging efficiency and reconstruction performance. Experiments were conducted on different in-vivo datasets (e.g., brain and knee images) acquired with different sequences. Initial results show that our proposed method can generate better image quality in accelerated MRI than conventional undersampling schemes in Cartesian and Non-Cartesian acquisitions.
Submitted 12 April, 2022; v1 submitted 5 April, 2022;
originally announced April 2022.
-
Photometric redshifts and Galaxy Clusters for DES DR2, DESI DR9, and HSC-SSP PDR3 Data
Authors:
Hu Zou,
Jipeng Sui,
Suijian Xue,
Xu Zhou,
Jun Ma,
Zhimin Zhou,
Jundan Nie,
Tianmeng Zhang,
Lu Feng,
Zhixia Shen,
Jiali Wang
Abstract:
Photometric redshift (photo-z) is a fundamental parameter for multi-wavelength photometric surveys, while galaxy clusters are important cosmological probes and ideal objects for exploring the impact of dense environments on galaxy evolution. We extend our previous work on estimating photo-z and detecting galaxy clusters to the latest data releases of the Dark Energy Spectroscopic Instrument (DESI) imaging surveys, Dark Energy Survey (DES), and Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP) imaging surveys and make corresponding catalogs publicly available for more extensive scientific applications. The photo-z catalogs include accurate measurements of photo-z and stellar mass for about 320, 293, and 134 million galaxies with $r<23$, $i<24$, and $i<25$ in DESI DR9, DES DR2, and HSC-SSP PDR3 data, respectively. The photo-z accuracy is about 0.017, 0.024, and 0.029 and the general redshift coverage is $z<1$, $z<1.2$, and $z<1.6$, respectively, for those three surveys. The uncertainty of the logarithmic stellar mass inferred from stellar population synthesis fitting is about 0.2 dex. With the above photo-z catalogs, galaxy clusters are detected using a fast cluster-finding algorithm. A total of 532,810, 86,963, and 36,566 galaxy clusters with more than 10 members are discovered for DESI, DES, and HSC-SSP, respectively. Their photo-z accuracy is at the level of 0.01. The total masses of our clusters are also estimated by using the calibration relations between the optical richness and the mass measurement from X-ray and radio observations. The photo-z and cluster catalogs are available at ScienceDB (https://www.doi.org/10.11922/sciencedb.o00069.00003) and PaperData Repository (https://doi.org/10.12149/101089).
Submitted 11 April, 2022; v1 submitted 31 March, 2022;
originally announced March 2022.
-
Four-Vector Optical Dirac Equation and Spin-Orbit Interaction of Structured Light
Authors:
Longlong Feng,
Qianfan Wu
Abstract:
The spin-orbit interaction of light is a crucial concept for understanding the electromagnetic properties of a material and realizing the spin-controlled manipulation of optical fields. Achieving these goals requires a complete description of spin-dependent optical phenomena in the context of vector-wave mechanics. We develop an extended Dirac theory for optical fields in generic media, which is found to be akin to a non-Hermitian chiral-extension of massive fermions with anomalous magnetic momenta moving in an external pseudo-magnetic field. This similarity allows us to investigate the optical behaviors of a material by effective field theory methods, an approach that may find wide application in metamaterials, photonic topological insulators, etc. We demonstrate this method by studying the spin-orbit interaction of structured light in a spin-degenerate medium and in an inhomogeneous isotropic medium, which leads to both spin-orbital-Hall effects and spin-to-orbital angular momentum conversion. Importantly, our approach provides simple and clear physical insight into the spin-orbit interaction of light in generic media, and could potentially bridge our understanding of topological insulators between electronic and photonic systems.
Submitted 6 November, 2022; v1 submitted 28 March, 2022;
originally announced March 2022.
-
Quantifying the Magnetic Structure of a Coronal Shock Producing a Type II Radio Burst
Authors:
W. Su,
T. M. Li,
X. Cheng,
L. Feng,
P. J. Zhang,
P. F. Chen,
M. D. Ding,
L. J. Chen,
Y. Guo,
Y. Wang,
D. Li,
L. Y. Zhang
Abstract:
Type II radio bursts are thought to be produced by shock waves in the solar atmosphere. However, what magnetic conditions are needed for the generation of type II radio bursts is still a puzzling issue. Here, we quantify the magnetic structure of a coronal shock associated with a type II radio burst. Based on multi-perspective extreme-ultraviolet observations, we reconstruct the three-dimensional (3D) shock surface. By using a magnetic field extrapolation model, we then derive the orientation of the magnetic field relative to the normal of the shock front ($θ_{\rm Bn}$) and the Alfvén Mach number ($M_A$) on the shock front. Combining these with the radio observations from the Nançay Radio Heliograph, we obtain the source region of the type II radio burst on the shock front. It is found that the radio burst is generated by a shock with $M_A \gtrsim 1.5$ and a bimodal distribution of $θ_{\rm Bn}$. We also use the Rankine-Hugoniot relations to quantify the properties of the shock downstream. Our results provide a quantitative 3D magnetic structure condition of a coronal shock that produces a type II radio burst.
Submitted 21 March, 2022;
originally announced March 2022.
-
ViM: Out-Of-Distribution with Virtual-logit Matching
Authors:
Haoqi Wang,
Zhizhong Li,
Litong Feng,
Wayne Zhang
Abstract:
Most of the existing Out-Of-Distribution (OOD) detection algorithms depend on a single input source: the feature, the logit, or the softmax probability. However, the immense diversity of the OOD examples makes such methods fragile. There are OOD samples that are easy to identify in the feature space while hard to distinguish in the logit space and vice versa. Motivated by this observation, we propose a novel OOD scoring method named Virtual-logit Matching (ViM), which combines the class-agnostic score from feature space and the In-Distribution (ID) class-dependent logits. Specifically, an additional logit representing the virtual OOD class is generated from the residual of the feature against the principal space, and then matched with the original logits by a constant scaling. The probability of this virtual logit after softmax is the indicator of OOD-ness. To facilitate the evaluation of large-scale OOD detection in academia, we create a new OOD dataset for ImageNet-1K, which is human-annotated and is 8.8x the size of existing datasets. We conducted extensive experiments, including CNNs and vision transformers, to demonstrate the effectiveness of the proposed ViM score. In particular, using the BiT-S model, our method achieves an average AUROC of 90.91% on four difficult OOD benchmarks, which is 4% ahead of the best baseline. Code and dataset are available at https://github.com/haoqiwang/vim.
Submitted 21 March, 2022;
originally announced March 2022.