-
Learning Global-Local Correspondence with Semantic Bottleneck for Logical Anomaly Detection
Authors:
Haiming Yao,
Wenyong Yu,
Wei Luo,
Zhenfeng Qiang,
Donghao Luo,
Xiaotian Zhang
Abstract:
This paper presents a novel framework, named Global-Local Correspondence Framework (GLCF), for visual anomaly detection with logical constraints. Visual anomaly detection has become an active research area in various real-world applications, such as industrial anomaly detection and medical disease diagnosis. However, most existing methods focus on identifying local structural degeneration anomalies and often fail to detect high-level functional anomalies that involve logical constraints. To address this issue, we propose a two-branch approach that consists of a local branch for detecting structural anomalies and a global branch for detecting logical anomalies. To facilitate local-global feature correspondence, we introduce a novel semantic bottleneck enabled by the visual Transformer. Moreover, we develop feature estimation networks for each branch separately to detect anomalies. Our proposed framework is validated using various benchmarks, including the industrial datasets MVTec AD and MVTec LOCO AD and the Retinal-OCT medical dataset. Experimental results show that our method outperforms existing methods, particularly in detecting logical anomalies.
Submitted 28 March, 2023; v1 submitted 10 March, 2023;
originally announced March 2023.
-
The JUNO experiment Top Tracker
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato
, et al. (592 additional authors not shown)
Abstract:
The main task of the Top Tracker detector of the neutrino reactor experiment Jiangmen Underground Neutrino Observatory (JUNO) is to reconstruct and extrapolate atmospheric muon tracks down to the central detector. This muon tracker will help to evaluate the contribution of the cosmogenic background to the signal. The Top Tracker is located above JUNO's water Cherenkov Detector and Central Detector, covering about 60% of the surface above them. The JUNO Top Tracker is constituted by the decommissioned OPERA experiment Target Tracker modules. The technology used consists in walls of two planes of plastic scintillator strips, one per transverse direction. Wavelength shifting fibres collect the light signal emitted by the scintillator strips and guide it to both ends where it is read by multianode photomultiplier tubes. Compared to the OPERA Target Tracker, the JUNO Top Tracker uses new electronics able to cope with the high rate produced by the high rock radioactivity compared to the one in Gran Sasso underground laboratory. This paper will present the new electronics and mechanical structure developed for the Top Tracker of JUNO along with its expected performance based on the current detector simulation.
Submitted 9 March, 2023;
originally announced March 2023.
-
JUNO sensitivity to $^7$Be, $pep$, and CNO solar neutrinos
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta
, et al. (592 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented levels of precision. In this paper, we provide an estimate of the JUNO sensitivity to 7Be, pep, and CNO solar neutrinos that can be obtained via a spectral analysis above the 0.45 MeV threshold. This study is performed assuming different scenarios of the liquid scintillator radiopurity, ranging from the most optimistic one corresponding to the radiopurity levels obtained by the Borexino experiment, up to the minimum requirements needed to perform the neutrino mass ordering determination with reactor antineutrinos - the main goal of JUNO. Our study shows that in most scenarios, JUNO will be able to improve the current best measurements on 7Be, pep, and CNO solar neutrino fluxes. We also perform a study on the JUNO capability to detect periodical time variations in the solar neutrino flux, such as the day-night modulation induced by neutrino flavor regeneration in Earth, and the modulations induced by temperature changes driven by helioseismic waves.
Submitted 7 March, 2023;
originally announced March 2023.
-
Distributed Consistent Multi-robot Cooperative Localization: A Coordinate Transformation Approach
Authors:
Chungeng Tian,
Ning Hao,
Fenghua He,
Haodi Yao
Abstract:
This paper considers the problem of distributed cooperative localization (CL) via robot-to-robot measurements for a multi-robot system. We propose a distributed consistent CL algorithm. The key idea is to perform the EKF-based state estimation in a transformed coordinate system. Specifically, a coordinate transformation is constructed by decomposing the state-propagation Jacobian, by which the correct observability properties are guaranteed. Moreover, the transformed state-propagation Jacobian becomes an identity matrix, which is more suitable for distribution. In the proposed algorithm, a server-based framework is adopted to estimate the robot poses in a distributed manner: each robot propagates its pose estimates and the server maintains the correlations. To reduce communication costs, the robots and the server exchange information to update the pose estimates and the correlations only when the multi-robot system takes a robot-to-robot relative measurement. In addition, no assumptions are made about the type of robots or relative measurements. The proposed algorithm has been validated by experiments and shown to outperform the state-of-the-art algorithms in terms of consistency and accuracy.
Submitted 2 March, 2023;
originally announced March 2023.
-
Learning cross space mapping via DNN using large scale click-through logs
Authors:
Wei Yu,
Kuiyuan Yang,
Yalong Bai,
Hongxun Yao,
Yong Rui
Abstract:
The gap between low-level visual signals and high-level semantics has been progressively bridged by the continuous development of deep neural networks (DNNs). With recent progress in DNNs, almost all image classification tasks have achieved new records of accuracy. To extend the ability of DNNs to image retrieval tasks, we proposed a unified DNN model for image-query similarity calculation by simultaneously modeling image and query in one network. The unified DNN is named the cross space mapping (CSM) model, which contains two parts, a convolutional part and a query-embedding part. The image and query are mapped to a common vector space via these two parts respectively, and image-query similarity is naturally defined as an inner product of their mappings in the space. To ensure good generalization ability of the DNN, we learn weights of the DNN from a large number of click-through logs, which consist of 23 million clicked image-query pairs between 1 million images and 11.7 million queries. Both the qualitative results and quantitative results on an image retrieval evaluation task with 1000 queries demonstrate the superiority of the proposed method.
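The similarity definition in this abstract (an inner product between the image and query embeddings in a shared vector space) can be illustrated with a minimal sketch. The embedding values, the file names, and the `rank_images` helper below are hypothetical placeholders chosen for illustration; in the paper the vectors would come from the convolutional and query-embedding parts of the trained network, which are not reproduced here.

```python
def inner_product(u, v):
    """Inner product of two equal-length embedding vectors."""
    return sum(a * b for a, b in zip(u, v))


def rank_images(query_vec, image_vecs):
    """Return image names sorted by decreasing inner-product similarity.

    query_vec  : embedding of the text query (hypothetical values)
    image_vecs : dict mapping image name -> embedding (hypothetical values)
    """
    scores = {name: inner_product(query_vec, vec)
              for name, vec in image_vecs.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Toy embeddings, assumed for illustration only.
query = [0.5, 1.0, 0.0]
images = {
    "cat.jpg": [1.0, 1.0, 0.0],
    "car.jpg": [0.0, 0.1, 2.0],
}
ranking = rank_images(query, images)
```

Because the score is a plain inner product, retrieval reduces to ranking images by that score once both sides have been embedded.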
Submitted 26 February, 2023;
originally announced February 2023.
-
Design, Performance, and Complexity of CRC-Aided List Decoding of Convolutional and Polar Codes for Short Messages
Authors:
Jacob King,
Hanwen Yao,
William Ryan,
Richard D. Wesel
Abstract:
Motivated by the need to communicate short control messages in 5G and beyond, this paper carefully designs codes for cyclic redundancy check (CRC)-aided list decoding of tail-biting convolutional codes (TBCCs) and polar codes. Both codes send a 32-bit message using an 11-bit CRC and 512 transmitted bits. We aim to provide a careful, fair comparison of the error performance and decoding complexity of polar and TBCC techniques for a specific case. Specifically, a TBCC is designed to match the rate of a (512, 43) polar code, and optimal 11-bit CRCs for both codes are designed. The paper examines the distance spectra of the polar and TBCC codes, illuminating the different distance structures for the two code types. We consider both adaptive and non-adaptive CRC-aided list decoding schemes. For polar codes, an adaptive decoder must start with a larger list size to avoid an error floor. For rate-32/512 codes with an 11-bit CRC, the optimized CRC-TBCC design achieves a lower total failure rate than the optimized CRC-polar design. Simulations showed that the optimized CRC-TBCC design achieved significantly higher throughput than the optimized CRC-polar design, so that the TBCC solution achieved a lower total failure rate while requiring less computational complexity.
Submitted 15 February, 2023;
originally announced February 2023.
-
Improving Domain Generalization with Domain Relations
Authors:
Huaxiu Yao,
Xinyu Yang,
Xinyi Pan,
Shengchao Liu,
Pang Wei Koh,
Chelsea Finn
Abstract:
Distribution shift presents a significant challenge in machine learning, where models often underperform during the test stage when faced with a different distribution than the one they were trained on. This paper focuses on domain shifts, which occur when the model is applied to new domains that are different from the ones it was trained on, and proposes a new approach called D$^3$G. Unlike previous methods that aim to learn a single model that is domain invariant, D$^3$G leverages domain similarities based on domain metadata to learn domain-specific models. Concretely, D$^3$G learns a set of training-domain-specific functions during the training stage and reweights them based on domain relations during the test stage. These domain relations can be directly obtained and learned from domain metadata. Under mild assumptions, we theoretically prove that using domain relations to reweight training-domain-specific functions achieves stronger out-of-domain generalization compared to the conventional averaging approach. Empirically, we evaluate the effectiveness of D$^3$G using real-world datasets for tasks such as temperature regression, land use classification, and molecule-protein binding affinity prediction. Our results show that D$^3$G consistently outperforms state-of-the-art methods.
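The test-time step described in this abstract (reweighting training-domain-specific functions by domain relations, as opposed to plain averaging) can be sketched as follows. This is only an illustration under stated assumptions: the `reweighted_prediction` helper, the normalization of relations by their sum, and the toy linear models are hypothetical and are not taken from the paper.

```python
def reweighted_prediction(x, domain_models, relations):
    """Combine per-domain predictions using normalized relation weights.

    domain_models : list of callables, one per training domain (toy stand-ins)
    relations     : non-negative relation strengths between the test domain
                    and each training domain (assumed given; the abstract
                    says they are obtained from domain metadata)
    """
    total = sum(relations)
    if total <= 0:
        raise ValueError("relations must have a positive sum")
    weights = [r / total for r in relations]
    return sum(w * f(x) for w, f in zip(weights, domain_models))


# Two toy domain-specific regressors, assumed for illustration only.
models = [lambda x: 2.0 * x, lambda x: 2.0 * x + 4.0]

# Equal relations reproduce the conventional averaging baseline.
uniform = reweighted_prediction(1.0, models, [1.0, 1.0])

# A test domain more closely related to the first training domain.
weighted = reweighted_prediction(1.0, models, [3.0, 1.0])
```

With equal relations the result is the simple average of the two models, which makes the contrast with relation-based weighting easy to see.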
Submitted 16 March, 2024; v1 submitted 6 February, 2023;
originally announced February 2023.
-
New critical states induced by measurement
Authors:
Xinyu Sun,
Hong Yao,
Shao-Kai Jian
Abstract:
Finding new critical states of matter is an important subject in modern many-body physics. Here we study the effect of measurement and postselection on the critical ground state of a Luttinger liquid theory and show that it can lead to qualitatively new critical states. Depending on the Luttinger parameter $K$, the effect of measurement is irrelevant (relevant) at $K>1$ ($K<1$). We reveal that this causes an entanglement transition between two phases, one with logarithmic entanglement entropy for a subregion ($K>1$), and the other with algebraic entanglement entropy ($K<1$). At the critical point $K=1$, the measurement is marginal, and we find new critical states whose entanglement entropy exhibits a logarithmic behavior with a continuous effective central charge as a function of measurement strength. We also performed numerical density matrix renormalization group and fermionic Gaussian state simulations to support our results. We further discuss promising and feasible routes to experimentally realize new critical states in our work.
Submitted 7 May, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
A New WISE Calibration of Stellar Mass
Authors:
T. H. Jarrett,
M. E. Cluver,
Edward N. Taylor,
Sabine Bellstedt,
A. S. G Robotham,
H. F. M. Yao
Abstract:
We derive new empirical scaling relations between WISE mid-infrared galaxy photometry and well-determined stellar masses from SED modeling of a suite of optical-infrared photometry provided by the DR4 Catalogue of the GAMA-KiDS-VIKING survey of the southern G23 field. The mid-infrared source extraction and characterization are drawn from the WISE Extended Source Catalogue (WXSC) and the archival ALLWISE catalog, combining both resolved and compact galaxies in the G23 sample to a redshift of 0.15. Three scaling relations are derived: W1 3.4 micron luminosity versus stellar mass, and WISE W1-W2, W1-W3 colors versus mass-to-light ratio (sensitive to a variety of galaxy types from passive to star-forming). For each galaxy in the sample, we then derive the combined stellar mass from these scaling relations, producing Mstellar estimates with better than $\sim$25-30% accuracy for galaxies with $>$10$^{9}$ Msolar and $<$40 - 50% for lower luminosity dwarf galaxies. We also provide simple prescriptions for rest-frame corrections and estimating stellar masses using only the W1 flux and the W1-W2 color, making stellar masses more accessible to users of the WISE data. Given a redshift or distance, these new scaling relations will enable stellar mass estimates for any galaxy in the sky detected by WISE with high fidelity across a range of mass-to-light.
Submitted 14 January, 2023;
originally announced January 2023.
-
Async-fork: Mitigating Query Latency Spikes Incurred by the Fork-based Snapshot Mechanism from the OS Level
Authors:
Pu Pang,
Gang Deng,
Kaihao Bai,
Quan Chen,
Shixuan Sun,
Bo Liu,
Yu Xu,
Hongbo Yao,
Zhengheng Wang,
Xiyu Wang,
Zheng Liu,
Zhuo Song,
Yong Yang,
Tao Ma,
Minyi Guo
Abstract:
In-memory key-value stores (IMKVSes) serve many online applications because of their efficiency. To support data backup, popular industrial IMKVSes periodically take a point-in-time snapshot of the in-memory data with the system call fork. However, this mechanism can result in latency spikes for queries arriving during the snapshot period because fork leads the engine into the kernel mode in which the engine is out-of-service for queries. In contrast to existing research focusing on optimizing snapshot algorithms, we optimize the fork operation to address the latency spikes problem from the operating system (OS) level, while keeping the data persistence mechanism in IMKVSes unchanged. Specifically, we first conduct an in-depth study to reveal the impact of the fork operation as well as the optimization techniques on query latency. Based on findings in the study, we propose Async-fork to offload the work of copying the page table from the engine (the parent process) to the child process, as copying the page table dominates the execution time of fork. To keep data consistent between the parent and the child, we design the proactive synchronization strategy. Async-fork is implemented in the Linux kernel and deployed into the online Redis database in public clouds. Our experiment results show that compared with the default fork method in OS, Async-fork reduces the tail latency of queries arriving during the snapshot period by 81.76% on an 8GB instance and 99.84% on a 64GB instance.
Submitted 14 January, 2023;
originally announced January 2023.
-
Compactifications of moduli space of (quasi-)trielliptic K3 surfaces
Authors:
Yitao Chen,
Haoyu Wu,
Hanyu Yao
Abstract:
We study the moduli space $\mathcal{F}_{T_1}$ of quasi-trielliptic K3 surfaces of type I, whose general member is a smooth bidegree $(2,3)$-hypersurface of $\mathbb{P}^1\times \mathbb{P}^2$. Such moduli space plays an important role in the study of the Hassett-Keel-Looijenga program of the moduli space of degree $8$ quasi-polarized K3 surfaces.
In this paper, we consider several natural compactifications of $\mathcal{F}_{T_1}$, such as the GIT compactification and arithmetic compactifications. We give a complete analysis of GIT stability of $(2,3)$-hypersurfaces and provide a concrete description of the boundary of the GIT compactification. For the Baily--Borel compactification of the quasi-trielliptic K3 surfaces, we also compute the configurations of the boundary by classifying certain lattice embeddings. As an application, we show that $(\mathbb{P}^1\times \mathbb{P}^2,εS)$ with small $ε$ is K-stable if $S$ is a K3 surface with at worst ADE singularities. This gives a concrete description of the boundary of the K-stability compactification via the identification of the GIT stability and the K-stability. We also discuss the connection between the GIT, Baily--Borel compactification, and Looijenga's compactifications by studying the projective models of quasi-trielliptic K3 surfaces.
Submitted 30 December, 2022;
originally announced December 2022.
-
Efficient Algorithms for the Bee-Identification Problem
Authors:
Han Mao Kiah,
Alexander Vardy,
Hanwen Yao
Abstract:
The bee-identification problem, formally defined by Tandon, Tan and Varshney (2019), requires the receiver to identify "bees" using a set of unordered noisy measurements. In this previous work, Tandon, Tan, and Varshney studied error exponents and showed that decoding the measurements jointly results in a significantly smaller error exponent.
In this work, we study algorithms related to this joint decoder. First, we demonstrate how to perform joint decoding efficiently. By reducing to the problem of finding perfect matching and minimum-cost matchings, we obtain joint decoders that run in time quadratic and cubic in the number of "bees" for the binary erasure (BEC) and binary symmetric channels (BSC), respectively. Next, by studying the matching algorithms in the context of channel coding, we further reduce the running times by using classical tools like peeling decoders and list-decoders. In particular, we show that our identifier algorithms when used with Reed-Muller codes terminate in almost linear and quadratic time for BEC and BSC, respectively.
Finally, for explicit codebooks, we study when these joint decoders fail to identify the "bees" correctly. Specifically, we provide practical methods of estimating the probability of erroneous identification for given codebooks.
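The reduction mentioned in this abstract (joint decoding over the binary symmetric channel as a minimum-cost matching between codewords and unordered measurements) can be illustrated on a tiny instance. The sketch below is a brute-force search over all assignments, not the efficient algorithms the paper describes; it assumes a crossover probability below 1/2, so that minimizing total Hamming distance is the maximum-likelihood choice, and the codebook and received words are hypothetical toy values.

```python
from itertools import permutations


def hamming(a, b):
    """Number of positions where two equal-length bit strings differ."""
    return sum(x != y for x, y in zip(a, b))


def joint_decode(codebook, received):
    """Brute-force minimum-cost assignment of measurements to codewords.

    Returns (assignment, cost), where assignment[j] is the index of the
    codeword ("bee") matched to received[j]. Runs in factorial time, so it
    is only meant for very small examples.
    """
    n = len(codebook)
    best_cost, best_perm = None, None
    for perm in permutations(range(n)):
        cost = sum(hamming(codebook[perm[j]], received[j]) for j in range(n))
        if best_cost is None or cost < best_cost:
            best_cost, best_perm = cost, perm
    return list(best_perm), best_cost


# Toy instance, assumed for illustration: three bees, each measurement is a
# shuffled codeword with one flipped bit.
codebook = ["00000", "11100", "00111"]
received = ["00110", "01000", "11101"]
assignment, cost = joint_decode(codebook, received)
```

In this instance the search matches the three measurements to codewords 2, 0, and 1 respectively, at a total Hamming cost of 3. A polynomial-time matching algorithm would replace the permutation loop in any realistic setting.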
Submitted 19 December, 2022;
originally announced December 2022.
-
Biomedical image analysis competitions: The state of current participation practice
Authors:
Matthias Eisenmann,
Annika Reinke,
Vivienn Weru,
Minu Dietlinde Tizabi,
Fabian Isensee,
Tim J. Adler,
Patrick Godau,
Veronika Cheplygina,
Michal Kozubek,
Sharib Ali,
Anubha Gupta,
Jan Kybic,
Alison Noble,
Carlos Ortiz de Solórzano,
Samiksha Pachade,
Caroline Petitjean,
Daniel Sage,
Donglai Wei,
Elizabeth Wilden,
Deepak Alapatt,
Vincent Andrearczyk,
Ujjwal Baid,
Spyridon Bakas,
Niranjan Balu,
Sophia Bano
, et al. (331 additional authors not shown)
Abstract:
The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.
Submitted 12 September, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
JUNO Sensitivity on Proton Decay $p\to \barνK^+$ Searches
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Antonio Bergnoli,
Thilo Birkenfeld,
Sylvie Blin
, et al. (586 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO) is a large liquid scintillator detector designed to explore many topics in fundamental physics. In this paper, the potential of searching for proton decay in the $p\to \barνK^+$ mode with JUNO is investigated. The kaon and its decay particles feature a clear three-fold coincidence signature that results in a high efficiency for identification. Moreover, the excellent energy resolution of JUNO makes it possible to suppress the sizable background caused by other delayed signals. Based on these advantages, the detection efficiency for the proton decay via $p\to \barνK^+$ is 36.9% with a background level of 0.2 events after 10 years of data taking. The estimated sensitivity based on 200 kton-years exposure is $9.6 \times 10^{33}$ years, competitive with the current best limits on the proton lifetime in this channel.
Submitted 26 October, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Universal KPZ scaling in noisy hybrid quantum circuits
Authors:
Shuo Liu,
Ming-Rui Li,
Shi-Xin Zhang,
Shao-Kai Jian,
Hong Yao
Abstract:
Measurement-induced phase transitions (MIPT) have attracted increasing attention due to the rich phenomenology of entanglement structures and their relation with quantum information processing. Since physical systems are unavoidably coupled to an environment, quantum noise needs to be considered in analyzing a system with MIPT, as it may qualitatively modify or even destroy certain entanglement structures of the system. In this Letter, we investigate the effect on MIPT of quantum noise modeled by a reset quantum channel acting on each site with probability $q$. Based on the numerical results from the Clifford circuits, we show that the quantum noise can qualitatively change the entanglement properties - the entanglement obeys ``area law'' instead of ``volume law'' with projective measurement rate $p<p_{c}$. In the quantum noise induced ``area law'' phase, the entanglement exhibits a novel $q^{-1/3}$ power-law scaling. Using an analytic mapping of the quantum model to a classical statistical model, we further show that the ``area law'' entanglement is the consequence of the noise-driven symmetry-breaking field and the $q^{-1/3}$ scaling can be understood as the result of Kardar-Parisi-Zhang (KPZ) fluctuations of the directed polymer with an effective length scale $L_{\rm{eff}} \sim q^{-1}$ in a random environment.
Submitted 9 December, 2022; v1 submitted 7 December, 2022;
originally announced December 2022.
-
The Vanishing Decision Boundary Complexity and the Strong First Component
Authors:
Hengshuai Yao
Abstract:
We show that unlike machine learning classifiers, there are no complex boundary structures in the decision boundaries for well-trained deep models. However, we found that the complicated structures do appear in training but they vanish shortly after shaping. This is pessimistic news if one seeks to capture different levels of complexity in the decision boundary for understanding generalization, which works well in machine learning. Nonetheless, we found that the decision boundaries of predecessor models on the training data are reflective of the final model's generalization. We show how to use the predecessor decision boundaries for studying the generalization of deep models. We have three major findings. One is on the strength of the first principal component of deep models, another about the singularity of optimizers, and the other on the effects of the skip connections in ResNets. Code is at https://github.com/hengshu1/decision_boundary_github.
Submitted 25 November, 2022;
originally announced November 2022.
-
Precision measurement of reactor antineutrino oscillation at kilometer-scale baselines by Daya Bay
Authors:
Daya Bay collaboration,
F. P. An,
W. D. Bai,
A. B. Balantekin,
M. Bishai,
S. Blyth,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
H. Y. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
Z. Y. Chen,
J. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings,
O. Dalager,
F. S. Deng,
Y. Y. Ding,
X. Y. Ding
, et al. (176 additional authors not shown)
Abstract:
We present a new determination of the smallest neutrino mixing angle $θ_{13}$ and the mass-squared difference $Δ{\rm m}^{2}_{32}$ using a final sample of $5.55 \times 10^{6}$ inverse beta-decay (IBD) candidates with the final-state neutron captured on gadolinium. This sample was selected from the complete data set obtained by the Daya Bay reactor neutrino experiment in 3158 days of operation. Compared to the previous Daya Bay results, selection of IBD candidates has been optimized, energy calibration refined, and treatment of backgrounds further improved. The resulting oscillation parameters are ${\rm sin}^{2}2θ_{13} = 0.0851 \pm 0.0024$, $Δ{\rm m}^{2}_{32} = (2.466 \pm 0.060) \times 10^{-3}{\rm eV}^{2}$ for the normal mass ordering or $Δ{\rm m}^{2}_{32} = -(2.571 \pm 0.060) \times 10^{-3} {\rm eV}^{2}$ for the inverted mass ordering.
Submitted 27 November, 2022;
originally announced November 2022.
-
Wild-Time: A Benchmark of in-the-Wild Distribution Shift over Time
Authors:
Huaxiu Yao,
Caroline Choi,
Bochuan Cao,
Yoonho Lee,
Pang Wei Koh,
Chelsea Finn
Abstract:
Distribution shift occurs when the test distribution differs from the training distribution, and it can considerably degrade performance of machine learning models deployed in the real world. Temporal shifts -- distribution shifts arising from the passage of time -- often occur gradually and have the additional structure of timestamp metadata. By leveraging timestamp metadata, models can potentially learn from trends in past distribution shifts and extrapolate into the future. While recent works have studied distribution shifts, temporal shifts remain underexplored. To address this gap, we curate Wild-Time, a benchmark of 5 datasets that reflect temporal distribution shifts arising in a variety of real-world applications, including patient prognosis and news classification. On these datasets, we systematically benchmark 13 prior approaches, including methods in domain generalization, continual learning, self-supervised learning, and ensemble learning. We use two evaluation strategies: evaluation with a fixed time split (Eval-Fix) and evaluation with a data stream (Eval-Stream). Eval-Fix, our primary evaluation strategy, aims to provide a simple evaluation protocol, while Eval-Stream is more realistic for certain real-world applications. Under both evaluation strategies, we observe an average performance drop of 20% from in-distribution to out-of-distribution data. Existing methods are unable to close this gap. Code is available at https://wild-time.github.io/.
Submitted 15 January, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
Generalizable Industrial Visual Anomaly Detection with Self-Induction Vision Transformer
Authors:
Haiming Yao,
Wenyong Yu
Abstract:
Industrial vision anomaly detection plays a critical role in the advanced intelligent manufacturing process, while some limitations still need to be addressed under such a context. First, existing reconstruction-based methods struggle with the identity mapping of trivial shortcuts where the reconstruction error gap is legible between the normal and abnormal samples, leading to inferior detection capabilities. Then, the previous studies mainly concentrated on the convolutional neural network (CNN) models that capture the local semantics of objects and neglect the global context, also resulting in inferior performance. Moreover, existing studies follow the individual learning fashion where the detection models are only capable of one category of the product while the generalizable detection for multiple categories has not been explored. To tackle the above limitations, we propose a self-induction vision Transformer (SIVT) for unsupervised generalizable multi-category industrial visual anomaly detection and localization. The proposed SIVT first extracts discriminatory features from pre-trained CNN as property descriptors. Then, the self-induction vision Transformer is proposed to reconstruct the extracted features in a self-supervisory fashion, where the auxiliary induction tokens are additionally introduced to induct the semantics of the original signal. Finally, the abnormal properties can be detected using the semantic feature residual difference. We experimented with the SIVT on existing Mvtec AD benchmarks; the results reveal that the proposed method can advance state-of-the-art detection performance with an improvement of 2.8-6.3 in AUROC, and 3.3-7.6 in AP.
Submitted 28 November, 2022; v1 submitted 22 November, 2022;
originally announced November 2022.
-
Normal Reference Attention and Defective Feature Perception Network for Surface Defect Detection
Authors:
Wei Luo,
Haiming Yao,
Wenyong Yu
Abstract:
Visual anomaly detection plays a significant role in the development of industrial automatic product quality inspection. As a result of the utmost imbalance in the amount of normal and abnormal data, growing attention has been given to unsupervised methods for defect detection. Although existing reconstruction-based methods have been widely studied recently, establishing a robust reconstruction model for various textured surface defect detection remains a challenging task due to homogeneous and nonregular surface textures. In this paper, we propose a novel unsupervised reconstruction-based method called the normal reference attention and defective feature perception network (NDP-Net) to accurately inspect a variety of textured defects. Unlike most reconstruction-based methods, our NDP-Net first employs an encoding module that extracts multi scale discriminative features of the surface textures, which is augmented with the defect discriminative ability by the proposed artificial defects and the novel pixel-level defect perception loss. Subsequently, a novel reference-based attention module (RBAM) is proposed to leverage the normal features of the fixed reference image to repair the defective features and restrain the reconstruction of the defects. Next, the repaired features are fed into a decoding module to reconstruct the normal textured background. Finally, the novel multi scale defect segmentation module (MSDSM) is introduced for precise defect detection and segmentation. In addition, a two-stage training strategy is utilized to enhance the inspection performance.
Submitted 13 January, 2023; v1 submitted 18 November, 2022;
originally announced November 2022.
-
Robust d-wave superconductivity from the Su-Schrieffer-Heeger-Hubbard model: possible route to high-temperature superconductivity
Authors:
Hao-Xin Wang,
Yi-Fan Jiang,
Hong Yao
Abstract:
A growing number of numerical studies have shown that the simplest Hubbard model on the square lattice with strong repulsion may not exhibit high-temperature superconductivity (SC). It is desirable to look for other possible microscopic mechanisms of realizing high-temperature SC. Here, we explore the interplay between the Su-Schrieffer-Heeger (SSH) electron-phonon coupling (EPC) and the Hubbard repulsion by density-matrix-renormalization-group (DMRG) simulations. Our state-of-the-art DMRG study showed convincingly that the interplay between strong Hubbard $U$ and moderate Su-Schrieffer-Heeger EPC $λ$ can induce robust $d$-wave SC. The SSH-type EPC can generate effective antiferromagnetic spin-exchange interactions between neighboring sites, which play a crucial role in inducing robust $d$-wave SC. Specifically, for $U=8t$, we find that $d$-wave SC emerges when $λ>λ_c$ with a moderate critical value $λ_c=0.1\sim 0.2$. Our results might shed new light on understanding high-temperature SC in cuprates as well as pave a possible new route in looking for high-temperature SC in other quantum materials with both strong $U$ and moderate $λ$.
Submitted 16 November, 2022;
originally announced November 2022.
-
Holistic Evaluation of Language Models
Authors:
Percy Liang,
Rishi Bommasani,
Tony Lee,
Dimitris Tsipras,
Dilara Soylu,
Michihiro Yasunaga,
Yian Zhang,
Deepak Narayanan,
Yuhuai Wu,
Ananya Kumar,
Benjamin Newman,
Binhang Yuan,
Bobby Yan,
Ce Zhang,
Christian Cosgrove,
Christopher D. Manning,
Christopher Ré,
Diana Acosta-Navas,
Drew A. Hudson,
Eric Zelikman,
Esin Durmus,
Faisal Ladhak,
Frieda Rong,
Hongyu Ren,
Huaxiu Yao
, et al. (25 additional authors not shown)
Abstract:
Language models (LMs) are becoming the foundation for almost all major language technologies, but their capabilities, limitations, and risks are not well understood. We present Holistic Evaluation of Language Models (HELM) to improve the transparency of language models. First, we taxonomize the vast space of potential scenarios (i.e. use cases) and metrics (i.e. desiderata) that are of interest for LMs. Then we select a broad subset based on coverage and feasibility, noting what's missing or underrepresented (e.g. question answering for neglected English dialects, metrics for trustworthiness). Second, we adopt a multi-metric approach: We measure 7 metrics (accuracy, calibration, robustness, fairness, bias, toxicity, and efficiency) for each of 16 core scenarios when possible (87.5% of the time). This ensures metrics beyond accuracy don't fall to the wayside, and that trade-offs are clearly exposed. We also perform 7 targeted evaluations, based on 26 targeted scenarios, to analyze specific aspects (e.g. reasoning, disinformation). Third, we conduct a large-scale evaluation of 30 prominent language models (spanning open, limited-access, and closed models) on all 42 scenarios, 21 of which were not previously used in mainstream LM evaluation. Prior to HELM, models on average were evaluated on just 17.9% of the core HELM scenarios, with some prominent models not sharing a single scenario in common. We improve this to 96.0%: now all 30 models have been densely benchmarked on the same core scenarios and metrics under standardized conditions. Our evaluation surfaces 25 top-level findings. For full transparency, we release all raw model prompts and completions publicly for further analysis, as well as a general modular toolkit. We intend for HELM to be a living benchmark for the community, continuously updated with new scenarios, metrics, and models.
Submitted 1 October, 2023; v1 submitted 16 November, 2022;
originally announced November 2022.
-
Extending structures for left-symmetric bialgebras
Authors:
Tao Zhang,
Hui-Jun Yao
Abstract:
We introduce the concept of braided left-symmetric bialgebras and construct cocycle bicrossproduct left-symmetric bialgebras. As an application, we solve the extending problem for left-symmetric bialgebras by using some non-abelian cohomology theory.
Submitted 23 November, 2022; v1 submitted 30 October, 2022;
originally announced November 2022.
-
Spectral properties of 1D extended Hubbard model from bosonization and time-dependent variational principle: applications to 1D cuprate
Authors:
Hao-Xin Wang,
Yi-Ming Wu,
Yi-Fan Jiang,
Hong Yao
Abstract:
Recent ARPES experiments on doped 1D cuprates revealed the importance of effective near-neighbor (NN) attractions in explaining certain features in spectral functions. Here we investigate spectral properties of the extended Hubbard model with the on-site repulsion $U$ and NN interaction $V$, by employing bosonization analysis and the high-precision time-dependent variational principle (TDVP) calculations of the model on 1D chain with up to 300 sites. From state-of-the-art TDVP calculations, we find that the spectral weights of the holon-folding and $3k_F$ branches evolve oppositely as a function of $V$. This peculiar dichotomy may be explained in bosonization analysis from the opposite dependence of exponent that determines the spectral weights on Luttinger parameter $K_ρ$. Moreover, our TDVP calculations of models with fixed $U=8t$ and different $V$ show that $V\approx -1.7t$ may fit the experimental results best, indicating a moderate effective NN attraction in 1D cuprates that might provide some hints towards understanding superconductivity in 2D cuprates.
Submitted 3 November, 2022;
originally announced November 2022.
-
Class Interference of Deep Neural Networks
Authors:
Dongcui Diao,
Hengshuai Yao,
Bei Jiang
Abstract:
Recognizing and telling similar objects apart is hard even for human beings. In this paper, we show that there is a phenomenon of class interference with all deep neural networks. Class interference represents the learning difficulty in data, and it constitutes the largest percentage of generalization errors by deep networks. To understand class interference, we propose cross-class tests, class ego directions and interference models. We show how to use these definitions to study minima flatness and class interference of a trained model. We also show how to detect class interference during training through label dancing pattern and class dancing notes.
Submitted 31 October, 2022;
originally announced November 2022.
-
Asymptotic sign free in interacting fermion models
Authors:
Zi-Xiang Li,
Zhou-Quan Wan,
Hong Yao
Abstract:
As an intrinsically-unbiased approach, quantum Monte Carlo (QMC) is of vital importance in understanding correlated phases of matter. Unfortunately, it often suffers from the notorious sign problem when simulating interacting fermion models. Here, we show for the first time that there exist interacting fermion models whose sign problem becomes less severe for larger system sizes and eventually disappears in the thermodynamic limit, which we dub "asymptotic sign free". We demonstrate asymptotically-free sign in determinant QMC for various interacting models. Moreover, based on renormalization-group-like ideas we propose a heuristic understanding of the feature of asymptotic sign free. We believe that asymptotic sign free behavior could shed new light on, and deepen our understanding of, the sign problem. More importantly, it can provide a promising way to decipher intriguing physics in correlated models which were conventionally thought not accessible by QMC.
Submitted 1 November, 2022;
originally announced November 2022.
-
Siamese Transition Masked Autoencoders as Uniform Unsupervised Visual Anomaly Detector
Authors:
Haiming Yao,
Xue Wang,
Wenyong Yu
Abstract:
Unsupervised visual anomaly detection conveys practical significance in many scenarios and is a challenging task due to the unbounded definition of anomalies. Moreover, most previous methods are application-specific, and establishing a unified model for anomalies across application scenarios remains unsolved. This paper proposes a novel hybrid framework termed Siamese Transition Masked Autoencoders (ST-MAE) to handle various visual anomaly detection tasks uniformly via deep feature transition. Concretely, the proposed method first extracts hierarchical semantics features from a pre-trained deep convolutional neural network and then develops a feature decoupling strategy to split the deep features into two disjoint feature patch subsets. Leveraging the decoupled features, the ST-MAE is developed with the Siamese encoders that operate on each subset of feature patches and perform the latent representations transition of two subsets, along with a lightweight decoder that reconstructs the original feature from the transitioned latent representation. Finally, the anomalous attributes can be detected using the semantic deep feature residual. Our deep feature transition scheme yields a nontrivial and semantic self-supervisory task to extract prototypical normal patterns, which allows for learning uniform models that generalize well for different visual anomaly detection tasks. The extensive experiments conducted demonstrate that the proposed ST-MAE method can advance state-of-the-art performance on multiple benchmarks across application scenarios with a superior inference efficiency, which exhibits great potential to be the uniform model for unsupervised visual anomaly detection.
Submitted 1 November, 2022;
originally announced November 2022.
-
Rapid Electromagnetic Induction Imaging with an Optically Raster-Scanned Atomic Magnetometer
Authors:
B. Maddox,
C. Deans,
H. Yao,
Y. Cohen,
F. Renzoni
Abstract:
We present an apparatus to overcome the limitations of mechanical raster-scanning in electromagnetic induction imaging (EMI) techniques by instead performing a 2D optical raster-scan within the vapour cell of a radio-frequency atomic magnetometer (RF-AM). A large cuboidal 87Rb vapour cell is employed to act as the medium of an RF-AM with the pump and probe beams translated in the cell via acousto-optics. The technique is shown to give robust and repeatable magnetic measurements over the cell volume and successfully resolves conductive targets with EMI. Optical raster-scanning removes the limitation of slow mechanical actuation and a fast imaging procedure is enacted resolving conductive targets at a rate of 40 ms/pixel.
Submitted 30 October, 2022;
originally announced October 2022.
-
Thermodynamic Phase Diagram of Two-Dimensional Bosons in a Quasicrystal Potential
Authors:
Zhaoxuan Zhu,
Hepeng Yao,
Laurent Sanchez-Palencia
Abstract:
Quantum simulation of quasicrystals in synthetic bosonic matter now paves the way to the exploration of these intriguing systems in wide parameter ranges. Yet thermal fluctuations in such systems compete with quantum coherence, and significantly affect the zero-temperature quantum phases. Here we determine the thermodynamic phase diagram of interacting bosons in a two-dimensional, homogeneous quasicrystal potential. Our results are found using quantum Monte Carlo simulations. Finite-size scaling is carefully considered and the quantum phases are systematically distinguished from thermal phases. In particular, we demonstrate stabilization of a genuine Bose glass phase against the normal fluid in sizable parameter ranges. Our results for strong interactions are interpreted using a fermionization picture and experimental relevance is discussed.
Submitted 11 July, 2023; v1 submitted 27 October, 2022;
originally announced October 2022.
-
Multi-Domain Long-Tailed Learning by Augmenting Disentangled Representations
Authors:
Xinyu Yang,
Huaxiu Yao,
Allan Zhou,
Chelsea Finn
Abstract:
There is an inescapable long-tailed class-imbalance issue in many real-world classification problems. Current methods for addressing this problem only consider scenarios where all examples come from the same distribution. However, in many cases, there are multiple domains with distinct class imbalance. We study this multi-domain long-tailed learning problem and aim to produce a model that generalizes well across all classes and domains. Towards that goal, we introduce TALLY, a method that addresses this multi-domain long-tailed learning problem. Built upon a proposed selective balanced sampling strategy, TALLY achieves this by mixing the semantic representation of one example with the domain-associated nuisances of another, producing a new representation for use as data augmentation. To improve the disentanglement of semantic representations, TALLY further utilizes a domain-invariant class prototype that averages out domain-specific effects. We evaluate TALLY on several benchmarks and real-world datasets and find that it consistently outperforms other state-of-the-art methods in both subpopulation and domain shift. Our code and data have been released at https://github.com/huaxiuyao/TALLY.
Submitted 6 October, 2023; v1 submitted 25 October, 2022;
originally announced October 2022.
-
Surgical Fine-Tuning Improves Adaptation to Distribution Shifts
Authors:
Yoonho Lee,
Annie S. Chen,
Fahim Tajwar,
Ananya Kumar,
Huaxiu Yao,
Percy Liang,
Chelsea Finn
Abstract:
A common approach to transfer learning under distribution shift is to fine-tune the last few layers of a pre-trained model, preserving learned features while also adapting to the new task. This paper shows that in such settings, selectively fine-tuning a subset of layers (which we term surgical fine-tuning) matches or outperforms commonly used fine-tuning approaches. Moreover, the type of distribution shift influences which subset is more effective to tune: for example, for image corruptions, fine-tuning only the first few layers works best. We validate our findings systematically across seven real-world data tasks spanning three types of distribution shifts. Theoretically, we prove that for two-layer neural networks in an idealized setting, first-layer tuning can outperform fine-tuning all layers. Intuitively, fine-tuning more parameters on a small target dataset can cause information learned during pre-training to be forgotten, and the relevant information depends on the type of shift.
Submitted 6 June, 2023; v1 submitted 20 October, 2022;
originally announced October 2022.
-
Model Independent Approach of the JUNO $^8$B Solar Neutrino Program
Authors:
JUNO Collaboration,
Jie Zhao,
Baobiao Yue,
Haoqi Lu,
Yufeng Li,
Jiajie Ling,
Zeyuan Yu,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai
, et al. (579 additional authors not shown)
Abstract:
The physics potential of detecting $^8$B solar neutrinos will be exploited at the Jiangmen Underground Neutrino Observatory (JUNO), in a model independent manner by using three distinct channels of the charged-current (CC), neutral-current (NC) and elastic scattering (ES) interactions. Due to the largest-ever mass of $^{13}$C nuclei in the liquid-scintillator detectors and the expected low background level, $^8$B solar neutrinos would be observable in the CC and NC interactions on $^{13}$C for the first time. By virtue of optimized event selections and muon veto strategies, backgrounds from the accidental coincidence, muon-induced isotopes, and external backgrounds can be greatly suppressed. Excellent signal-to-background ratios can be achieved in the CC, NC and ES channels to guarantee the $^8$B solar neutrino observation. From the sensitivity studies performed in this work, we show that JUNO, with ten years of data, can reach the 1$σ$ precision levels of 5%, 8% and 20% for the $^8$B neutrino flux, $\sin^2θ_{12}$, and $Δm^2_{21}$, respectively. It would be unique and helpful to probe the details of both solar physics and neutrino physics. In addition, when combined with SNO, the world-best precision of 3% is expected for the $^8$B neutrino flux measurement.
Submitted 6 March, 2024; v1 submitted 15 October, 2022;
originally announced October 2022.
-
ControlVAE: Model-Based Learning of Generative Controllers for Physics-Based Characters
Authors:
Heyuan Yao,
Zhenhua Song,
Baoquan Chen,
Libin Liu
Abstract:
In this paper, we introduce ControlVAE, a novel model-based framework for learning generative motion control policies based on variational autoencoders (VAE). Our framework can learn a rich and flexible latent representation of skills and a skill-conditioned generative control policy from a diverse set of unorganized motion sequences, which enables the generation of realistic human behaviors by sampling in the latent space and allows high-level control policies to reuse the learned skills to accomplish a variety of downstream tasks. In the training of ControlVAE, we employ a learnable world model to realize direct supervision of the latent space and the control policy. This world model effectively captures the unknown dynamics of the simulation system, enabling efficient model-based learning of high-level downstream tasks. We also learn a state-conditional prior distribution in the VAE-based generative control policy, which generates a skill embedding that outperforms the non-conditional priors in downstream tasks. We demonstrate the effectiveness of ControlVAE using a diverse set of tasks, which allows realistic and interactive control of the simulated characters.
Submitted 12 October, 2022;
originally announced October 2022.
-
C-Mixup: Improving Generalization in Regression
Authors:
Huaxiu Yao,
Yiping Wang,
Linjun Zhang,
James Zou,
Chelsea Finn
Abstract:
Improving the generalization of deep networks is an important open challenge, particularly in domains without plentiful data. The mixup algorithm improves generalization by linearly interpolating a pair of examples and their corresponding labels. These interpolated examples augment the original training set. Mixup has shown promising results in various classification tasks, but systematic analysis of mixup in regression remains underexplored. Using mixup directly on regression labels can result in arbitrarily incorrect labels. In this paper, we propose a simple yet powerful algorithm, C-Mixup, to improve generalization on regression tasks. In contrast with vanilla mixup, which picks training examples for mixing with uniform probability, C-Mixup adjusts the sampling probability based on the similarity of the labels. Our theoretical analysis confirms that C-Mixup with label similarity obtains a smaller mean square error in supervised regression and meta-regression than vanilla mixup and using feature similarity. Another benefit of C-Mixup is that it can improve out-of-distribution robustness, where the test distribution is different from the training distribution. By selectively interpolating examples with similar labels, it mitigates the effects of domain-associated information and yields domain-invariant representations. We evaluate C-Mixup on eleven datasets, ranging from tabular to video data. Compared to the best prior approach, C-Mixup achieves 6.56%, 4.76%, 5.82% improvements in in-distribution generalization, task generalization, and out-of-distribution robustness, respectively. Code is released at https://github.com/huaxiuyao/C-Mixup.
Submitted 11 October, 2022;
originally announced October 2022.
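The label-similarity sampling described in the C-Mixup abstract above can be illustrated with a minimal sketch. The abstract only says that the sampling probability depends on label similarity; the Gaussian kernel, the `bandwidth` and `alpha` parameters, and the function name `c_mixup_batch` are assumptions made here for illustration, not details taken from the listing or from the released code.

```python
import numpy as np

def c_mixup_batch(x, y, bandwidth=1.0, alpha=2.0, rng=None):
    """Mix each example with a partner drawn by label similarity (hypothetical helper)."""
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float).reshape(len(x), -1)

    # Pairwise squared distances between labels.
    d2 = ((y[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)

    # Assumption: a Gaussian kernel turns label distance into sampling weight.
    probs = np.exp(-d2 / (2.0 * bandwidth ** 2))
    probs /= probs.sum(axis=1, keepdims=True)

    # Draw one mixing partner per example (an example may draw itself).
    partners = np.array([rng.choice(len(x), p=p) for p in probs])

    # Standard mixup interpolation with a Beta-distributed coefficient.
    lam = rng.beta(alpha, alpha, size=len(x))
    lam_x = lam.reshape(-1, *([1] * (x.ndim - 1)))
    x_mix = lam_x * x + (1.0 - lam_x) * x[partners]
    y_mix = lam[:, None] * y + (1.0 - lam[:, None]) * y[partners]
    return x_mix, y_mix, probs
```

With a small `bandwidth`, partners are drawn almost exclusively from examples with nearby labels, so the interpolated labels stay close to the originals; a very large `bandwidth` approaches the uniform sampling of vanilla mixup.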
-
Knowledge-Driven New Drug Recommendation
Authors:
Zhenbang Wu,
Huaxiu Yao,
Zhe Su,
David M Liebovitz,
Lucas M Glass,
James Zou,
Chelsea Finn,
Jimeng Sun
Abstract:
Drug recommendation assists doctors in prescribing personalized medications to patients based on their health conditions. Existing drug recommendation solutions adopt the supervised multi-label classification setup and only work with existing drugs with sufficient prescription data from many patients. However, newly approved drugs do not have much historical prescription data and cannot leverage existing drug recommendation methods. To address this, we formulate the new drug recommendation as a few-shot learning problem. Yet, directly applying existing few-shot learning algorithms faces two challenges: (1) complex relations among diseases and drugs and (2) numerous false-negative patients who were eligible but did not yet use the new drugs. To tackle these challenges, we propose EDGE, which can quickly adapt to the recommendation for a new drug with limited prescription data from a few support patients. EDGE maintains a drug-dependent multi-phenotype few-shot learner to bridge the gap between existing and new drugs. Specifically, EDGE leverages the drug ontology to link new drugs to existing drugs with similar treatment effects and learns ontology-based drug representations. Such drug representations are used to customize the metric space of the phenotype-driven patient representations, which are composed of a set of phenotypes capturing complex patient health status. Lastly, EDGE eliminates the false-negative supervision signal using an external drug-disease knowledge base. We evaluate EDGE on two real-world datasets: the public EHR data (MIMIC-IV) and private industrial claims data. Results show that EDGE achieves 7.3% improvement on the ROC-AUC score over the best baseline.
Submitted 11 October, 2022;
originally announced October 2022.
-
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Authors:
Haosen Yang,
Deng Huang,
Bin Wen,
Jiannan Wu,
Hongxun Yao,
Yi Jiang,
Xiatian Zhu,
Zehuan Yuan
Abstract:
Masked autoencoders (MAEs) have emerged recently as state-of-the-art self-supervised spatiotemporal representation learners. Inheriting from their image counterparts, however, existing video MAEs still focus largely on static appearance learning while being limited in learning dynamic temporal information, and are hence less effective for downstream video tasks. To resolve this drawback, in this work we present a motion-aware variant -- MotionMAE. Apart from learning to reconstruct individual masked patches of video frames, our model is designed to additionally predict the corresponding motion structure information over time. This motion information is available at the temporal difference of nearby frames. As a result, our model can effectively extract both static appearance and dynamic motion simultaneously, leading to superior spatiotemporal representation learning capability. Extensive experiments show that our MotionMAE significantly outperforms both the supervised learning baseline and state-of-the-art MAE alternatives, under both domain-specific and domain-generic pretraining-then-finetuning settings. In particular, when using ViT-B as the backbone, our MotionMAE surpasses the prior art model by a margin of 1.2% on Something-Something V2 and 3.2% on UCF101 in the domain-specific pretraining setting. Encouragingly, it also surpasses the competing MAEs by a large margin of over 3% on the challenging video object segmentation task. The code is available at https://github.com/happy-hsy/MotionMAE.
Submitted 8 October, 2022;
originally announced October 2022.
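The MotionMAE abstract above states that the motion signal comes from the temporal difference of nearby frames and is predicted alongside the masked patches. The sketch below shows one way such paired targets could be built. The patch size, the mask ratio, the use of one shared spatial mask for all frames, and the helper names `temporal_difference`, `patchify`, and `masked_targets` are all assumptions for illustration; the actual MotionMAE design may differ.

```python
import numpy as np

def temporal_difference(video):
    """Frame-to-frame difference for a video shaped (T, H, W, C)."""
    video = np.asarray(video, dtype=np.float32)
    return video[1:] - video[:-1]

def patchify(frames, patch):
    """Split (T, H, W, C) frames into flattened non-overlapping patches."""
    t, h, w, c = frames.shape
    assert h % patch == 0 and w % patch == 0, "frame size must divide by patch"
    x = frames.reshape(t, h // patch, patch, w // patch, patch, c)
    x = x.transpose(0, 1, 3, 2, 4, 5)  # (T, rows, cols, patch, patch, C)
    return x.reshape(t, (h // patch) * (w // patch), patch * patch * c)

def masked_targets(video, patch=4, mask_ratio=0.75, rng=None):
    """Return appearance and motion targets for the masked patches.

    Assumption: one random spatial mask is shared across all frames.
    """
    rng = np.random.default_rng() if rng is None else rng
    video = np.asarray(video, dtype=np.float32)
    appearance = patchify(video[:-1], patch)             # pixels to reconstruct
    motion = patchify(temporal_difference(video), patch)  # frame differences
    n = appearance.shape[1]
    mask = np.zeros(n, dtype=bool)
    mask[rng.permutation(n)[: int(round(mask_ratio * n))]] = True
    return appearance[:, mask], motion[:, mask], mask
```

For a perfectly static clip the motion target is identically zero, so any training signal from this branch comes only from regions that change between frames.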
-
TripleE: Easy Domain Generalization via Episodic Replay
Authors:
Xiaomeng Li,
Hongyu Ren,
Huifeng Yao,
Ziwei Liu
Abstract:
Learning how to generalize the model to unseen domains is an important area of research. In this paper, we propose TripleE, whose main idea is to encourage the network to focus on training on subsets (learning with replay) and to enlarge the data space when learning on subsets. Learning with replay contains two core designs, EReplayB and EReplayD, which conduct the replay schema on batch and dataset, respectively. Through this, the network can focus on learning with subsets instead of visiting the global set at a glance, enlarging the model diversity in ensembling. To enlarge the data space in learning on subsets, we verify that an exhaustive and singular augmentation (ESAug) performs surprisingly well on expanding the data space in subsets during replays. Our model, dubbed TripleE, is frustratingly easy, based on simple augmentation and ensembling. Without bells and whistles, our TripleE method surpasses prior art on six domain generalization benchmarks, showing that this approach could serve as a stepping stone for future research in domain generalization.
Submitted 4 October, 2022;
originally announced October 2022.
-
Semi-Supervised Domain Generalization for Cardiac Magnetic Resonance Image Segmentation with High Quality Pseudo Labels
Authors:
Wanqin Ma,
Huifeng Yao,
Yiqun Lin,
Jiarong Guo,
Xiaomeng Li
Abstract:
Developing a deep learning method for medical segmentation tasks heavily relies on a large amount of labeled data. However, the annotations require professional knowledge and are limited in number. Recently, semi-supervised learning has demonstrated great potential in medical segmentation tasks. Most existing methods related to cardiac magnetic resonance images only focus on regular images with similar domains and high image quality. A semi-supervised domain generalization method was developed in [2], which enhances the quality of pseudo labels on varied datasets. In this paper, we follow the strategy in [2] and present a domain generalization method for semi-supervised medical segmentation. Our main goal is to improve the quality of pseudo labels under extreme MRI Analysis with various domains. We perform Fourier transformation on input images to learn low-level statistics and cross-domain information. Then we feed the augmented images as input to the double cross pseudo supervision networks to calculate the variance among pseudo labels. We evaluate our method on the CMRxMotion dataset [1]. With only partially labeled data and without domain labels, our approach consistently generates accurate segmentation results of cardiac magnetic resonance images with different respiratory motions. Code is available at: https://github.com/MAWanqin2002/STACOM2022Ma
Submitted 1 December, 2023; v1 submitted 30 September, 2022;
originally announced September 2022.
-
Cyclegan Network for Sheet Metal Welding Drawing Translation
Authors:
Zhiwei Song,
Hui Yao,
Dan Tian,
Gaohui Zhan
Abstract:
In intelligent manufacturing, the quality of machine-translated engineering drawings directly affects manufacturing accuracy. Currently, most of the work is translated manually, greatly reducing production efficiency. This paper proposes an automatic translation method for welded structural engineering drawings based on Cyclic Generative Adversarial Networks (CycleGAN). The CycleGAN network model of unpaired transfer learning is used to learn the feature mapping of real welding engineering drawings to realize automatic translation of engineering drawings. U-Net and PatchGAN are the main networks for the generator and discriminator, respectively. Based on removing the identity mapping function, a high-dimensional sparse network is proposed to replace the traditional dense network for the CycleGAN generator to improve noise robustness. The residual block hidden layer is increased to improve the resolution of the generated graph. The improved and fine-tuned network models are experimentally validated by computing the gap between real and generated data. The method meets the welding engineering precision standard and solves the main problem of low drawing recognition efficiency in the welding manufacturing process. The results show that, after training with our model, the PSNR, SSIM and MSE of welding engineering drawings reach about 44.89%, 99.58% and 2.11, respectively, which are superior to traditional networks in both training speed and accuracy.
Submitted 28 September, 2022;
originally announced September 2022.
-
Segmentation method of U-net sheet metal engineering drawing based on CBAM attention mechanism
Authors:
Zhiwei Song,
Hui Yao
Abstract:
In the manufacturing process of heavy industrial equipment, the specific unit in the welding diagram is first manually redrawn and then the corresponding sheet metal parts are cut, which is inefficient. To this end, this paper proposes a U-net-based method for the segmentation and extraction of specific units in welding engineering drawings. This method enables the cutting device to automatically segment specific graphic units according to visual information and automatically cut out sheet metal parts of corresponding shapes according to the segmentation results. This process is more efficient than traditional human-assisted cutting. Two weaknesses in the U-net network will lead to a decrease in segmentation performance: first, the focus on global semantic feature information is weak, and second, there is a large dimensional difference between shallow encoder features and deep decoder features. Based on the CBAM (Convolutional Block Attention Module) attention mechanism, this paper proposes a U-net jump structure model with an attention mechanism to improve the network's global semantic feature extraction ability. In addition, a U-net attention mechanism model with dual pooling convolution fusion is designed: the deep encoder's maximum pooling + convolution features and the shallow encoder's average pooling + convolution features are fused vertically to reduce the dimension difference between the shallow encoder and deep decoder. The dual-pool convolutional attention jump structure replaces the traditional U-net jump structure, which can effectively improve the specific unit segmentation performance of the welding engineering drawing. Using vgg16 as the backbone network, experiments have verified that the IoU, mAP, and Accu of our model in the welding engineering drawing dataset segmentation task are 84.72%, 86.84%, and 99.42%, respectively.
Submitted 27 April, 2023; v1 submitted 28 September, 2022;
originally announced September 2022.
-
FreeSeg: Free Mask from Interpretable Contrastive Language-Image Pretraining for Semantic Segmentation
Authors:
Yi Li,
Huifeng Yao,
Hualiang Wang,
Xiaomeng Li
Abstract:
Fully supervised semantic segmentation learns from dense masks, which requires heavy annotation cost for a closed set. In this paper, we use natural language as supervision without any pixel-level annotation for open world segmentation. We call the proposed framework FreeSeg, where the mask is freely available from the raw feature map of a pretraining model. Compared with zero-shot or openset segmentation, FreeSeg doesn't require any annotated masks, and it widely predicts categories beyond class-agnostic unsupervised segmentation. Specifically, FreeSeg obtains a free mask from the Image-Text Similarity Map (ITSM) of Interpretable Contrastive Language-Image Pretraining (ICLIP). Our core improvements are the smoothed min pooling for dense ICLIP, with the partial label and pixel strategies for segmentation. Furthermore, FreeSeg is very straightforward, without complex designs like grouping, clustering or retrieval. Besides the simplicity, the performance of FreeSeg surpasses the previous state-of-the-art by large margins, e.g. 13.4% higher mIoU on the VOC dataset in the same settings.
Submitted 14 December, 2022; v1 submitted 27 September, 2022;
originally announced September 2022.
-
Helical Luttinger liquid on the edge of a 2-dimensional topological antiferromagnet
Authors:
Yang Feng,
Jinjiang Zhu,
Weiyan Lin,
Zichen Lian,
Yongchao Wang,
Hao Li,
Hongxu Yao,
Qiushi He,
Yinping Pan,
Yang Wu,
Jinsong Zhang,
Yayu Wang,
Xiaodong Zhou,
Jian Shen,
Yihua Wang
Abstract:
Boundary helical Luttinger liquid (HLL) with broken bulk time-reversal symmetry belongs to a unique topological class which may occur in antiferromagnets (AFM). Here, we search for signatures of HLL on the edge of a recently discovered topological AFM, MnBi2Te4 even-layer. Using scanning superconducting quantum interference device, we directly image helical edge current in the AFM ground state appearing at its charge neutral point. Such helical edge state accompanies an insulating bulk which is topologically distinct from the ferromagnetic Chern insulator phase as revealed in a magnetic field driven quantum phase transition. The edge conductance of the AFM order follows a power-law as a function of temperature and source-drain bias which serves as strong evidence for HLL. Such HLL scaling is robust at finite fields below the quantum critical point. The observed HLL in a layered AFM semiconductor represents a highly tunable topological matter compatible with future spintronics and quantum computation.
Submitted 19 September, 2022;
originally announced September 2022.
-
Search for relativistic fractionally charged particles in space
Authors:
DAMPE Collaboration,
F. Alemanno,
C. Altomare,
Q. An,
P. Azzarello,
F. C. T. Barbato,
P. Bernardini,
X. J. Bi,
M. S. Cai,
E. Casilli,
E. Catanzani,
J. Chang,
D. Y. Chen,
J. L. Chen,
Z. F. Chen,
M. Y. Cui,
T. S. Cui,
Y. X. Cui,
H. T. Dai,
A. De-Benedittis,
I. De Mitri,
F. de Palma,
M. Deliyergiyev,
A. Di Giovanni,
M. Di Santo
, et al. (126 additional authors not shown)
Abstract:
More than a century after the performance of the oil drop experiment, the possible existence of fractionally charged particles (FCPs) still remains unsettled. The search for FCPs is crucial for some extensions of the Standard Model in particle physics. Most of the previously conducted searches for FCPs in cosmic rays were based on experiments underground or at high altitudes. However, there have been few searches for FCPs in cosmic rays carried out in orbit other than AMS-01 flown by a space shuttle and BESS by a balloon at the top of the atmosphere. In this study, we conduct an FCP search in space based on on-orbit data obtained using the DArk Matter Particle Explorer (DAMPE) satellite over a period of five years. Unlike underground experiments, which require an FCP energy of the order of hundreds of GeV, our FCP search starts at only a few GeV. An upper limit of $6.2\times 10^{-10}~~\mathrm{cm^{-2}sr^{-1} s^{-1}}$ is obtained for the flux. Our results demonstrate that DAMPE exhibits higher sensitivity than experiments of similar types by three orders of magnitude, which more stringently restricts the conditions for the existence of FCPs in primary cosmic rays.
Submitted 9 September, 2022;
originally announced September 2022.
-
Self-supervised Representation Learning on Electronic Health Records with Graph Kernel Infomax
Authors:
Hao-Ren Yao,
Nairen Cao,
Katina Russell,
Der-Chen Chang,
Ophir Frieder,
Jeremy Fineman
Abstract:
Learning Electronic Health Records (EHRs) representation is a preeminent yet under-discovered research topic. It benefits various clinical decision support applications, e.g., medication outcome prediction or patient similarity search. Current approaches focus on task-specific label supervision on vectorized sequential EHR, which is not applicable to large-scale unsupervised scenarios. Recently, contrastive learning shows great success on self-supervised representation learning problems. However, complex temporality often degrades the performance. We propose Graph Kernel Infomax, a self-supervised graph kernel learning approach on the graphical representation of EHR, to overcome the previous problems. Unlike the state-of-the-art, we do not change the graph structure to construct augmented views. Instead, we use Kernel Subspace Augmentation to embed nodes into two geometrically different manifold views. The entire framework is trained by contrasting nodes and graph representations on those two manifold views through the commonly used contrastive objectives. Empirically, using publicly available benchmark EHR datasets, our approach yields performance on clinical downstream tasks that exceeds the state-of-the-art. Theoretically, the variation on distance metrics naturally creates different views as data augmentation without changing graph structures.
Submitted 20 February, 2024; v1 submitted 1 September, 2022;
originally announced September 2022.
-
Multi-Scale Contrastive Knowledge Co-Distillation for Event Temporal Relation Extraction
Authors:
Hao-Ren Yao,
Luke Breitfeller,
Aakanksha Naik,
Chunxiao Zhou,
Carolyn Rose
Abstract:
Event Temporal Relation Extraction (ETRE) is a crucial yet challenging problem. Event pairs are situated within a discourse at different distances, which we refer to as proximity bands. The temporal ordering communicated about event pairs situated at more remote (i.e., ``long'') or less remote (i.e., ``short'') proximity bands is encoded differently. SOTA ETRE models have tended to perform well on events situated at either short or long proximity bands, but not both. Yet, real-world, natural texts contain all types of temporal event-pairs. In this paper, we present MulCo: Multi-Scale Contrastive Knowledge Co-Distillation, a fusion approach that shares knowledge across multiple event pair proximity bands in order to improve performance on all types of temporal datasets. Our experimental results show that MulCo successfully integrates linguistic cues pertaining to temporal reasoning across both short and long proximity bands and achieves new state-of-the-art results on several ETRE benchmark datasets.
Submitted 20 March, 2024; v1 submitted 1 September, 2022;
originally announced September 2022.
-
In-vehicle alertness monitoring for older adults
Authors:
Heng Yao,
Sanaz Motamedi,
Wayne C. W. Giang,
Alexandra Kondyli,
Eakta Jain
Abstract:
Alertness monitoring in the context of driving improves safety and saves lives. Computer vision based alertness monitoring is an active area of research. However, the algorithms and datasets that exist for alertness monitoring are primarily aimed at younger adults (18-50 years old). We present a system for in-vehicle alertness monitoring for older adults. Through a design study, we ascertained the variables and parameters that are suitable for older adults traveling independently in Level 5 vehicles. We implemented a prototype traveler monitoring system and evaluated the alertness detection algorithm on ten older adults (70 years and older). We report on the system design and implementation at a level of detail that is suitable for the beginning researcher or practitioner. Our study suggests that dataset development is the foremost challenge for developing alertness monitoring systems targeted at older adults. This study is the first of its kind for a hitherto under-studied population and has implications for future work on algorithm development and system design through participatory methods.
Submitted 17 August, 2022;
originally announced August 2022.
-
Discrete time crystal enabled by Stark many-body localization
Authors:
Shuo Liu,
Shi-Xin Zhang,
Chang-Yu Hsieh,
Shengyu Zhang,
Hong Yao
Abstract:
Discrete time crystal (DTC) has recently attracted increasing attention, but most DTC models and their properties are only revealed after disorder average. In this Letter, we propose a simple disorder-free periodically driven model that exhibits nontrivial DTC order stabilized by Stark many-body localization (MBL). We demonstrate the existence of DTC phase by analytical analysis from perturbation theory and convincing numerical evidence from observable dynamics. The new DTC model paves a new promising way for further experiments and deepens our understanding of DTC. Since the DTC order doesn't require special quantum state preparation and the strong disorder average, it can be naturally realized on the noisy intermediate-scale quantum (NISQ) hardware with much fewer resources and repetitions. Moreover, besides the robust subharmonic response, there are other novel robust beating oscillations in Stark-MBL DTC phase which are absent in random or quasi-periodic MBL DTC.
Submitted 27 March, 2023; v1 submitted 4 August, 2022;
originally announced August 2022.
-
Braided anti-flexible bialgebras
Authors:
Tao Zhang,
Hui-Jun Yao
Abstract:
We introduce the concept of braided anti-flexible bialgebra and construct cocycle bicrossproduct anti-flexible bialgebras. As an application, we solve the extending problem for anti-flexible bialgebras by using some non-abelian cohomology theory.
Submitted 6 November, 2022; v1 submitted 27 July, 2022;
originally announced August 2022.
-
Connecting MeerKAT radio continuum properties to GAMA optical emission-line and WISE mid-infrared activity
Authors:
H. F. M. Yao,
M. E. Cluver,
T. H. Jarrett,
Gyula I. G. Jozsa,
M. G. Santos,
L. Marchetti,
M. J. I. Brown,
Y. A. Gordon,
S. Brough,
A. M. Hopkins,
B. W. Holwerda,
S. P. Driver,
E. M. Sadler
Abstract:
The identification of AGN in large surveys has been hampered by seemingly discordant classifications arising from differing diagnostic methods, usually tracing distinct processes specific to a particular wavelength regime. However, as shown in Yao et al. (2020), the combination of optical emission line measurements and mid-infrared photometry can be used to optimise the discrimination capability between AGN and star formation activity. In this paper we test our new classification scheme by combining the existing GAMA-WISE data with high-quality MeerKAT radio continuum data covering 8 deg$^2$ of the GAMA G23 region. Using this sample of 1 841 galaxies (z < 0.25), we investigate the total infrared (derived from 12 $\mu$m) to radio luminosity ratio, q(TIR), and its relationship to optical-infrared AGN and star-forming (SF) classifications. We find that while q(TIR) is efficient at detecting AGN activity in massive galaxies generally appearing quiescent in the infrared, it becomes less reliable for cases where the emission from star formation in the host galaxy is dominant. However, we find that the q(TIR) can identify up to 70 % more AGNs not discernible at optical and/or infrared wavelengths. The median q(TIR) of our SF sample is 2.57 $\pm$ 0.23 consistent with previous local universe estimates.
Submitted 2 August, 2022;
originally announced August 2022.
-
On the role of laminar/turbulent interface on energy transfer between scales in bypass transition
Authors:
Hanxun Yao,
George Papadakis
Abstract:
We investigate the role of laminar/turbulent interface in the interscale energy transfer in a boundary layer undergoing bypass transition, with the aid of the Karman-Howarth-Monin-Hill (KHMH) equation. A local binary indicator function is used to detect the interface and employed subsequently to define two-point intermittencies. These are used to decompose the standard-averaged interscale and interspace energy fluxes into conditionally-averaged components. We find that the inverse cascade in the streamwise direction reported in an earlier work arises due to events across the downstream or upstream interfaces (head or tail respectively) of a turbulent spot. However, the three-dimensional energy flux maps reveal significant differences between these two regions: in the downstream interface, inverse cascade is stronger and dominant over a larger range of streamwise and spanwise separations. We explain this finding by considering a propagating spot of simplified shape as it crosses a fixed streamwise location. We derive also the conditionally-averaged KHMH equation, thus generalising similar equations for single-point statistics to two-point statistics. We compare the three-dimensional maps of the conditionally-averaged production and total energy flux within turbulent spots against the maps of standard-averaged quantities within the fully turbulent region. The results indicate remarkable dynamical similarities between turbulent spots and the fully turbulent region for two-point statistics. This has been known only for single-point quantities, and we show here that the similarity extends to two-point quantities as well.
Submitted 26 July, 2022;
originally announced July 2022.