-
Fast Complementary Dynamics via Skinning Eigenmodes
Authors:
Otman Benchekroun,
Jiayi Eris Zhang,
Siddhartha Chaudhuri,
Eitan Grinspun,
Yi Zhou,
Alec Jacobson
Abstract:
We propose a reduced-space elasto-dynamic solver that is well suited for augmenting rigged character animations with secondary motion. At the core of our method is a novel deformation subspace based on Linear Blend Skinning that overcomes many of the shortcomings prior subspace methods face. Our skinning subspace is parameterized entirely by a set of scalar weights, which we can obtain through a s…
▽ More
We propose a reduced-space elasto-dynamic solver that is well suited for augmenting rigged character animations with secondary motion. At the core of our method is a novel deformation subspace based on Linear Blend Skinning that overcomes many of the shortcomings prior subspace methods face. Our skinning subspace is parameterized entirely by a set of scalar weights, which we can obtain through a small, material-aware and rig-sensitive generalized eigenvalue problem. The resulting subspace can easily capture rotational motion and guarantees that the resulting simulation is rotation equivariant. We further propose a simple local-global solver for linear co-rotational elasticity and propose a clustering method to aggregate per-tetrahedra non-linear energetic quantities. The result is a compact simulation that is fully decoupled from the complexity of the mesh.
△ Less
Submitted 19 June, 2023; v1 submitted 21 March, 2023;
originally announced March 2023.
-
Detecting the open-world objects with the help of the Brain
Authors:
Shuailei Ma,
Yuefeng Wang,
Ying Wei,
Peihao Chen,
Zhixiang Ye,
Jiaqi Fan,
Enming Zhang,
Thomas H. Li
Abstract:
Open World Object Detection (OWOD) is a novel computer vision task with a considerable challenge, bridging the gap between classic object detection (OD) benchmarks and real-world object detection. In addition to detecting and classifying seen/known objects, OWOD algorithms are expected to detect unseen/unknown objects and incrementally learn them. The natural instinct of humans to identify unknown…
▽ More
Open World Object Detection (OWOD) is a novel computer vision task with a considerable challenge, bridging the gap between classic object detection (OD) benchmarks and real-world object detection. In addition to detecting and classifying seen/known objects, OWOD algorithms are expected to detect unseen/unknown objects and incrementally learn them. The natural instinct of humans to identify unknown objects in their environments mainly depends on their brains' knowledge base. It is difficult for a model to do this only by learning from the annotation of several tiny datasets. The large pre-trained grounded language-image models - VL (\ie GLIP) have rich knowledge about the open world but are limited to the text prompt. We propose leveraging the VL as the ``Brain'' of the open-world detector by simply generating unknown labels. Leveraging it is non-trivial because the unknown labels impair the model's learning of known objects. In this paper, we alleviate these problems by proposing the down-weight loss function and decoupled detection structure. Moreover, our detector leverages the ``Brain'' to learn novel objects beyond VL through our pseudo-labeling scheme.
△ Less
Submitted 21 March, 2023;
originally announced March 2023.
-
Recent Advances and Applications of Machine Learning in Experimental Solid Mechanics: A Review
Authors:
Hanxun **,
Enrui Zhang,
Horacio D. Espinosa
Abstract:
For many decades, experimental solid mechanics has played a crucial role in characterizing and understanding the mechanical properties of natural and novel materials. Recent advances in machine learning (ML) provide new opportunities for the field, including experimental design, data analysis, uncertainty quantification, and inverse problems. As the number of papers published in recent years in th…
▽ More
For many decades, experimental solid mechanics has played a crucial role in characterizing and understanding the mechanical properties of natural and novel materials. Recent advances in machine learning (ML) provide new opportunities for the field, including experimental design, data analysis, uncertainty quantification, and inverse problems. As the number of papers published in recent years in this emerging field is exploding, it is timely to conduct a comprehensive and up-to-date review of recent ML applications in experimental solid mechanics. Here, we first provide an overview of common ML algorithms and terminologies that are pertinent to this review, with emphasis placed on physics-informed and physics-based ML methods. Then, we provide thorough coverage of recent ML applications in traditional and emerging areas of experimental mechanics, including fracture mechanics, biomechanics, nano- and micro-mechanics, architected materials, and 2D material. Finally, we highlight some current challenges of applying ML to multi-modality and multi-fidelity experimental datasets and propose several future research directions. This review aims to provide valuable insights into the use of ML methods as well as a variety of examples for researchers in solid mechanics to integrate into their experiments.
△ Less
Submitted 6 September, 2023; v1 submitted 14 March, 2023;
originally announced March 2023.
-
VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building [Technical Report]
Authors:
Maureen Daum,
Enhao Zhang,
Dong He,
Stephen Mussmann,
Brandon Haynes,
Ranjay Krishna,
Magdalena Balazinska
Abstract:
We introduce VOCALExplore, a system designed to support users in building domain-specific models over video datasets. VOCALExplore supports interactive labeling sessions and trains models using user-supplied labels. VOCALExplore maximizes model quality by automatically deciding how to select samples based on observed skew in the collected labels. It also selects the optimal video representations t…
▽ More
We introduce VOCALExplore, a system designed to support users in building domain-specific models over video datasets. VOCALExplore supports interactive labeling sessions and trains models using user-supplied labels. VOCALExplore maximizes model quality by automatically deciding how to select samples based on observed skew in the collected labels. It also selects the optimal video representations to use when training models by casting feature selection as a rising bandit problem. Finally, VOCALExplore implements optimizations to achieve low latency without sacrificing model performance. We demonstrate that VOCALExplore achieves close to the best possible model quality given candidate acquisition functions and feature extractors, and it does so with low visible latency (~1 second per iteration) and no expensive preprocessing.
△ Less
Submitted 29 September, 2023; v1 submitted 7 March, 2023;
originally announced March 2023.
-
First measurement of the nuclear-recoil ionization yield in silicon at 100 eV
Authors:
M. F. Albakry,
I. Alkhatib,
D. Alonso,
D. W. P. Amaral,
P. An,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
P. S. Barbeau,
C. Bathurst,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeño,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott
, et al. (115 additional authors not shown)
Abstract:
We measured the nuclear--recoil ionization yield in silicon with a cryogenic phonon-sensitive gram-scale detector. Neutrons from a mono-energetic beam scatter off of the silicon nuclei at angles corresponding to energy depositions from 4\,keV down to 100\,eV, the lowest energy probed so far. The results show no sign of an ionization production threshold above 100\,eV. These results call for furthe…
▽ More
We measured the nuclear--recoil ionization yield in silicon with a cryogenic phonon-sensitive gram-scale detector. Neutrons from a mono-energetic beam scatter off of the silicon nuclei at angles corresponding to energy depositions from 4\,keV down to 100\,eV, the lowest energy probed so far. The results show no sign of an ionization production threshold above 100\,eV. These results call for further investigation of the ionization yield theory and a comprehensive determination of the detector response function at energies below the keV scale.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
On the Independence Polynomial and Threshold of an Antiregular $k$-Hypergraph
Authors:
Erchuan Zhang
Abstract:
Given an integer $k\geq 3$ and an initial $k-1$ isolated vertices, an {\em antiregular $k$-hypergraph} is constructed by alternatively adding an isolated vertex (connected to no other vertices) or a dominating vertex (connected to every other $k-1$ vertices). Let $a_i$ be the number of independent sets of cardinality $i$ in a hypergraph $H$, then the {\em independence polynomial} of $H$ is defined…
▽ More
Given an integer $k\geq 3$ and an initial $k-1$ isolated vertices, an {\em antiregular $k$-hypergraph} is constructed by alternatively adding an isolated vertex (connected to no other vertices) or a dominating vertex (connected to every other $k-1$ vertices). Let $a_i$ be the number of independent sets of cardinality $i$ in a hypergraph $H$, then the {\em independence polynomial} of $H$ is defined as $I(H;x)=\sum_{i=0}^m a_i x^i$, where $m$ is the size of a maximum independent set. The main purpose of the present paper is to generalise some results of independence polynomials of antiregular graphs to the case of antiregular $k$-hypergraphs. In particular, we derive (semi-)closed formulas for the independence polynomials of antiregular $k$-hypergraphs and prove their log-concavity. Furthermore, we show that antiregular $k$-hypergraphs are {\em $T2$-threshold}, which means there exist a labeling $c$ of the vertex set and a threshold $τ$ such that for any vertex subset $S$ of cardinality $k$, $\sum_{i\in S}c(i)>τ$ if and only if $S$ is a hyperedge.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
Exploring Opinion-unaware Video Quality Assessment with Semantic Affinity Criterion
Authors:
Haoning Wu,
Liang Liao,
**gwen Hou,
Chaofeng Chen,
Erli Zhang,
Annan Wang,
Wenxiu Sun,
Qiong Yan,
Weisi Lin
Abstract:
Recent learning-based video quality assessment (VQA) algorithms are expensive to implement due to the cost of data collection of human quality opinions, and are less robust across various scenarios due to the biases of these opinions. This motivates our exploration on opinion-unaware (a.k.a zero-shot) VQA approaches. Existing approaches only considers low-level naturalness in spatial or temporal d…
▽ More
Recent learning-based video quality assessment (VQA) algorithms are expensive to implement due to the cost of data collection of human quality opinions, and are less robust across various scenarios due to the biases of these opinions. This motivates our exploration on opinion-unaware (a.k.a zero-shot) VQA approaches. Existing approaches only considers low-level naturalness in spatial or temporal domain, without considering impacts from high-level semantics. In this work, we introduce an explicit semantic affinity index for opinion-unaware VQA using text-prompts in the contrastive language-image pre-training (CLIP) model. We also aggregate it with different traditional low-level naturalness indexes through gaussian normalization and sigmoid rescaling strategies. Composed of aggregated semantic and technical metrics, the proposed Blind Unified Opinion-Unaware Video Quality Index via Semantic and Technical Metric Aggregation (BUONA-VISTA) outperforms existing opinion-unaware VQA methods by at least 20% improvements, and is more robust than opinion-aware approaches.
△ Less
Submitted 26 February, 2023;
originally announced February 2023.
-
A Search for Low-mass Dark Matter via Bremsstrahlung Radiation and the Migdal Effect in SuperCDMS
Authors:
M. F. Albakry,
I. Alkhatib,
D. Alonso,
D. W. P. Amaral,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
C. Bathurst,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeño,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott,
J. Cooley,
H. Coombes
, et al. (108 additional authors not shown)
Abstract:
We present a new analysis of previously published of SuperCDMS data using a profile likelihood framework to search for sub-GeV dark matter (DM) particles through two inelastic scattering channels: bremsstrahlung radiation and the Migdal effect. By considering these possible inelastic scattering channels, experimental sensitivity can be extended to DM masses that are undetectable through the DM-nuc…
▽ More
We present a new analysis of previously published of SuperCDMS data using a profile likelihood framework to search for sub-GeV dark matter (DM) particles through two inelastic scattering channels: bremsstrahlung radiation and the Migdal effect. By considering these possible inelastic scattering channels, experimental sensitivity can be extended to DM masses that are undetectable through the DM-nucleon elastic scattering channel, given the energy threshold of current experiments. We exclude DM masses down to $220~\textrm{MeV}/c^2$ at $2.7 \times 10^{-30}~\textrm{cm}^2$ via the bremsstrahlung channel. The Migdal channel search provides overall considerably more stringent limits and excludes DM masses down to $30~\textrm{MeV}/c^2$ at $5.0 \times 10^{-30}~\textrm{cm}^2$.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
The Exploration of Knowledge-Preserving Prompts for Document Summarisation
Authors:
Chen Chen,
Wei Emma Zhang,
Alireza Seyed Shakeri,
Makhmoor Fiza
Abstract:
Despite the great development of document summarisation techniques nowadays, factual inconsistencies between the generated summaries and the original texts still occur from time to time. This study explores the possibility of adopting prompts to incorporate factual knowledge into generated summaries. We specifically study prefix-tuning that uses a set of trainable continuous prefix prompts togethe…
▽ More
Despite the great development of document summarisation techniques nowadays, factual inconsistencies between the generated summaries and the original texts still occur from time to time. This study explores the possibility of adopting prompts to incorporate factual knowledge into generated summaries. We specifically study prefix-tuning that uses a set of trainable continuous prefix prompts together with discrete natural language prompts to aid summary generation. Experimental results demonstrate that the trainable prefixes can help the summarisation model extract information from discrete prompts precisely, thus generating knowledge-preserving summaries that are factually consistent with the discrete prompts. The ROUGE improvements of the generated summaries indicate that explicitly adding factual knowledge into the summarisation process could boost the overall performance, showing great potential for applying it to other natural language processing tasks.
△ Less
Submitted 16 May, 2023; v1 submitted 27 January, 2023;
originally announced January 2023.
-
A Stability Timescale for Non-Hierarchical Three-Body Systems
Authors:
Eric Zhang,
Smadar Naoz,
Clifford M. Will
Abstract:
The gravitational three-body problem is a fundamental problem in physics and has significant applications to astronomy. Three-body configurations are often considered stable as long the system is hierarchical; that is, the two orbital distances are well-separated. However, instability, which is often associated with significant energy exchange between orbits, takes time to develop. Assuming two ma…
▽ More
The gravitational three-body problem is a fundamental problem in physics and has significant applications to astronomy. Three-body configurations are often considered stable as long the system is hierarchical; that is, the two orbital distances are well-separated. However, instability, which is often associated with significant energy exchange between orbits, takes time to develop. Assuming two massive objects in a circular orbit and a test particle in an eccentric orbit, we develop an analytical formula estimating the time it takes for the test particle's orbital energy to change by an order of itself. We show its consistency with results from N-body simulations. For eccentric orbits in particular, the instability is primarily driven not by close encounters of the test particle with one of the other bodies, but by the fundamental susceptibility of eccentric orbits to exchange energy at their periapsis. Motivated by recent suggestions that the galactic center may host an intermediate-mass black hole (IMBH) as a companion to the massive black hole Sgr A*, we use our timescale to explore the parameter space that could harbor an IMBH for the lifetime of the S-cluster of stars surrounding Sgr A*. Furthermore, we show that the orbit of an S-star can be stable for long timescales in the presence of other orbital crossing stars, thus suggesting that the S-cluster may be stable for the lifetimes of its member stars.
△ Less
Submitted 2 May, 2023; v1 submitted 19 January, 2023;
originally announced January 2023.
-
EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions [Technical Report]
Authors:
Enhao Zhang,
Maureen Daum,
Dong He,
Brandon Haynes,
Ranjay Krishna,
Magdalena Balazinska
Abstract:
We introduce EQUI-VOCAL: a new system that automatically synthesizes queries over videos from limited user interactions. The user only provides a handful of positive and negative examples of what they are looking for. EQUI-VOCAL utilizes these initial examples and additional ones collected through active learning to efficiently synthesize complex user queries. Our approach enables users to find ev…
▽ More
We introduce EQUI-VOCAL: a new system that automatically synthesizes queries over videos from limited user interactions. The user only provides a handful of positive and negative examples of what they are looking for. EQUI-VOCAL utilizes these initial examples and additional ones collected through active learning to efficiently synthesize complex user queries. Our approach enables users to find events without database expertise, with limited labeling effort, and without declarative specifications or sketches. Core to EQUI-VOCAL's design is the use of spatio-temporal scene graphs in its data model and query language and a novel query synthesis approach that works on large and noisy video data. Our system outperforms two baseline systems -- in terms of F1 score, synthesis time, and robustness to noise -- and can flexibly synthesize complex queries that the baselines do not support.
△ Less
Submitted 8 August, 2023; v1 submitted 2 January, 2023;
originally announced January 2023.
-
Spin excitation continuum to topological magnon crossover and thermal Hall conductivity in Kitaev magnets
Authors:
Emily Z. Zhang,
Reja H. Wilke,
Yong Baek Kim
Abstract:
There has been great interest in identifying a Kitaev quantum spin liquid state in frustrated magnets with bond-dependent interactions. In particular, the experimental report of a half-quantized thermal Hall conductivity in $α$-RuCl$_3$ in the presence of a magnetic field has generated excitement as it could be strong evidence for a field-induced chiral spin liquid. More recent experiments, howeve…
▽ More
There has been great interest in identifying a Kitaev quantum spin liquid state in frustrated magnets with bond-dependent interactions. In particular, the experimental report of a half-quantized thermal Hall conductivity in $α$-RuCl$_3$ in the presence of a magnetic field has generated excitement as it could be strong evidence for a field-induced chiral spin liquid. More recent experiments, however, provide a conflicting interpretation advocating for topological magnons in the field-polarized state as the origin of the non-quantized thermal Hall conductivity observed in their experiments. An inherent difficulty in distinguishing between the two scenarios is the phase transition between a putative two-dimensional spin liquid and the field-polarized state exists only at zero temperature, while the behaviour at finite temperature is mostly crossover phenomena. In this work, we provide insights into the finite temperature crossover behavior between the spin excitation continuum in a quantum spin liquid and topological magnons in the field-polarized state in three different theoretical models with large Kitaev interactions. These models allow for a field-induced phase transition from a spin liquid (or an intermediate field-induced spin liquid) to the field-polarized state in the quantum model. We obtain the dynamical spin structure factor as a function of magnetic field using molecular dynamics simulations and compute thermal Hall conductivity in the field-polarized regime. We demonstrate the gradual evolution of the dynamical spin structure factor exhibiting crossover behaviour near magnetic fields where zero-temperature phase transitions occur in the quantum model. We also examine nonlinear effects on topological magnons and the validity of thermal Hall conductivity computed using linear spin wave theory. We discuss the implications of our results to existing and future experiments.
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
Authors:
Jiachen Li,
Edwin Zhang,
Ming Yin,
Qinxun Bai,
Yu-Xiang Wang,
William Yang Wang
Abstract:
Behavior constrained policy optimization has been demonstrated to be a successful paradigm for tackling Offline Reinforcement Learning. By exploiting historical transitions, a policy is trained to maximize a learned value function while constrained by the behavior policy to avoid a significant distributional shift. In this paper, we propose our closed-form policy improvement operators. We make a n…
▽ More
Behavior constrained policy optimization has been demonstrated to be a successful paradigm for tackling Offline Reinforcement Learning. By exploiting historical transitions, a policy is trained to maximize a learned value function while constrained by the behavior policy to avoid a significant distributional shift. In this paper, we propose our closed-form policy improvement operators. We make a novel observation that the behavior constraint naturally motivates the use of first-order Taylor approximation, leading to a linear approximation of the policy objective. Additionally, as practical datasets are usually collected by heterogeneous policies, we model the behavior policies as a Gaussian Mixture and overcome the induced optimization difficulties by leveraging the LogSumExp's lower bound and Jensen's Inequality, giving rise to a closed-form policy improvement operator. We instantiate offline RL algorithms with our novel policy improvement operators and empirically demonstrate their effectiveness over state-of-the-art algorithms on the standard D4RL benchmark. Our code is available at https://cfpi-icml23.github.io/.
△ Less
Submitted 22 July, 2023; v1 submitted 29 November, 2022;
originally announced November 2022.
-
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
Authors:
Xingqian Xu,
Zhangyang Wang,
Eric Zhang,
Kai Wang,
Humphrey Shi
Abstract:
Recent advances in diffusion models have set an impressive milestone in many generation tasks, and trending works such as DALL-E2, Imagen, and Stable Diffusion have attracted great interest. Despite the rapid landscape changes, recent new approaches focus on extensions and performance rather than capacity, thus requiring separate models for separate tasks. In this work, we expand the existing sing…
▽ More
Recent advances in diffusion models have set an impressive milestone in many generation tasks, and trending works such as DALL-E2, Imagen, and Stable Diffusion have attracted great interest. Despite the rapid landscape changes, recent new approaches focus on extensions and performance rather than capacity, thus requiring separate models for separate tasks. In this work, we expand the existing single-flow diffusion pipeline into a multi-task multimodal network, dubbed Versatile Diffusion (VD), that handles multiple flows of text-to-image, image-to-text, and variations in one unified model. The pipeline design of VD instantiates a unified multi-flow diffusion framework, consisting of sharable and swappable layer modules that enable the crossmodal generality beyond images and text. Through extensive experiments, we demonstrate that VD successfully achieves the following: a) VD outperforms the baseline approaches and handles all its base tasks with competitive quality; b) VD enables novel extensions such as disentanglement of style and semantics, dual- and multi-context blending, etc.; c) The success of our multi-flow multimodal framework over images and text may inspire further diffusion-based universal AI research. Our code and models are open-sourced at https://github.com/SHI-Labs/Versatile-Diffusion.
△ Less
Submitted 11 January, 2024; v1 submitted 15 November, 2022;
originally announced November 2022.
-
Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives
Authors:
Haoning Wu,
Erli Zhang,
Liang Liao,
Chaofeng Chen,
**gwen Hou,
Annan Wang,
Wenxiu Sun,
Qiong Yan,
Weisi Lin
Abstract:
The rapid increase in user-generated-content (UGC) videos calls for the development of effective video quality assessment (VQA) algorithms. However, the objective of the UGC-VQA problem is still ambiguous and can be viewed from two perspectives: the technical perspective, measuring the perception of distortions; and the aesthetic perspective, which relates to preference and recommendation on conte…
▽ More
The rapid increase in user-generated-content (UGC) videos calls for the development of effective video quality assessment (VQA) algorithms. However, the objective of the UGC-VQA problem is still ambiguous and can be viewed from two perspectives: the technical perspective, measuring the perception of distortions; and the aesthetic perspective, which relates to preference and recommendation on contents. To understand how these two perspectives affect overall subjective opinions in UGC-VQA, we conduct a large-scale subjective study to collect human quality opinions on overall quality of videos as well as perceptions from aesthetic and technical perspectives. The collected Disentangled Video Quality Database (DIVIDE-3k) confirms that human quality opinions on UGC videos are universally and inevitably affected by both aesthetic and technical perspectives. In light of this, we propose the Disentangled Objective Video Quality Evaluator (DOVER) to learn the quality of UGC videos based on the two perspectives. The DOVER proves state-of-the-art performance in UGC-VQA under very high efficiency. With perspective opinions in DIVIDE-3k, we further propose DOVER++, the first approach to provide reliable clear-cut quality evaluations from a single aesthetic or technical perspective. Code at https://github.com/VQAssessment/DOVER.
△ Less
Submitted 7 March, 2023; v1 submitted 9 November, 2022;
originally announced November 2022.
-
Exploiting Qubit Reuse through Mid-circuit Measurement and Reset
Authors:
Fei Hua,
Yuwei **,
Yanhao Chen,
Suhas Vittal,
Kevin Krsulich,
Lev S. Bishop,
John Lapeyre,
Ali Javadi-Abhari,
Eddy Z. Zhang
Abstract:
Quantum measurement is important to quantum computing as it extracts the outcome of the circuit at the end of the computation. Previously, all measurements have to be done at the end of the circuit. Otherwise, it will incur significant errors. But it is not the case now. Recently IBM started supporting dynamic circuits through hardware (instead of software by simulator). With mid-circuit hardware…
▽ More
Quantum measurement is important to quantum computing as it extracts the outcome of the circuit at the end of the computation. Previously, all measurements have to be done at the end of the circuit. Otherwise, it will incur significant errors. But it is not the case now. Recently IBM started supporting dynamic circuits through hardware (instead of software by simulator). With mid-circuit hardware measurement, we can improve circuit efficacy and fidelity from three aspects: (a) reduced qubit usage, (b) reduced swap insertion, and (c) improved fidelity. We demonstrate this using real-world applications Bernstein Verizani on real hardware and show that circuit resource usage can be improved by 60\%, and circuit fidelity can be improved by 15\%. We design a compiler-assisted tool that can find and exploit the tradeoff between qubit reuse, fidelity, gate count, and circuit duration. We also developed a method for identifying whether qubit reuse will be beneficial for a given application. We evaluated our method on a representative set of essential applications. We can reduce resource usage by up to 80\% and circuit fidelity by up to 20\%.
△ Less
Submitted 6 February, 2023; v1 submitted 3 November, 2022;
originally announced November 2022.
-
On the Geometry Transferability of the Hybrid Iterative Numerical Solver for Differential Equations
Authors:
Adar Kahana,
Enrui Zhang,
Somdatta Goswami,
George EM Karniadakis,
Rishikesh Ranade,
Jay Pathak
Abstract:
The discovery of fast numerical solvers prompted a clear and rapid shift towards iterative techniques in many applications, especially in computational mechanics, due to the increased necessity for solving very large linear systems. Most numerical solvers are highly dependent on the problem geometry and discretization, facing issues when any of these properties change. The newly developed Hybrid I…
▽ More
The discovery of fast numerical solvers prompted a clear and rapid shift towards iterative techniques in many applications, especially in computational mechanics, due to the increased necessity for solving very large linear systems. Most numerical solvers are highly dependent on the problem geometry and discretization, facing issues when any of these properties change. The newly developed Hybrid Iterative Numerical Transferable Solver (HINTS) combines a standard solver with a neural operator to achieve better performance, focusing on a single geometry at a time. In this work, we explore the "T" in HINTS, i.e., the geometry transferability properties of HINTS. We first propose to directly employ HINTS built for a specific geometry to a different but related geometry without any adjustments. In addition, we propose the integration of an operator level transfer learning with HINTS to even further improve the convergence of HINTS on new geometries and discretizations. We conduct numerical experiments for a Darcy flow problem and a plane-strain elasticity problem. The results show that both the direct application of HINTS and the transfer learning enhanced HINTS are able to accurately solve these problems on different geometries. In addition, using transfer learning, HINTS is able to converge to machine zero even faster than the direct application of HINTS.
△ Less
Submitted 31 October, 2022;
originally announced October 2022.
-
Unrolled Graph Learning for Multi-Agent Collaboration
Authors:
Enpei Zhang,
Shuo Tang,
Xiaowen Dong,
Siheng Chen,
Yanfeng Wang
Abstract:
Multi-agent learning has gained increasing attention to tackle distributed machine learning scenarios under constrictions of data exchanging. However, existing multi-agent learning models usually consider data fusion under fixed and compulsory collaborative relations among agents, which is not as flexible and autonomous as human collaboration. To fill this gap, we propose a distributed multi-agent…
▽ More
Multi-agent learning has gained increasing attention to tackle distributed machine learning scenarios under constrictions of data exchanging. However, existing multi-agent learning models usually consider data fusion under fixed and compulsory collaborative relations among agents, which is not as flexible and autonomous as human collaboration. To fill this gap, we propose a distributed multi-agent learning model inspired by human collaboration, in which the agents can autonomously detect suitable collaborators and refer to collaborators' model for better performance. To implement such adaptive collaboration, we use a collaboration graph to indicate the pairwise collaborative relation. The collaboration graph can be obtained by graph learning techniques based on model similarity between different agents. Since model similarity can not be formulated by a fixed graphical optimization, we design a graph learning network by unrolling, which can learn underlying similar features among potential collaborators. By testing on both regression and classification tasks, we validate that our proposed collaboration model can figure out accurate collaborative relationship and greatly improve agents' learning performance.
△ Less
Submitted 8 June, 2023; v1 submitted 31 October, 2022;
originally announced October 2022.
-
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
Authors:
Edwin Zhang,
Yujie Lu,
William Wang,
Amy Zhang
Abstract:
Training generalist agents is difficult across several axes, requiring us to deal with high-dimensional inputs (space), long horizons (time), and generalization to novel tasks. Recent advances with architectures have allowed for improved scaling along one or two of these axes, but are still computationally prohibitive to use. In this paper, we propose to address all three axes by leveraging \textb…
▽ More
Training generalist agents is difficult across several axes, requiring us to deal with high-dimensional inputs (space), long horizons (time), and generalization to novel tasks. Recent advances with architectures have allowed for improved scaling along one or two of these axes, but are still computationally prohibitive to use. In this paper, we propose to address all three axes by leveraging \textbf{L}anguage to \textbf{C}ontrol \textbf{D}iffusion models as a hierarchical planner conditioned on language (LCD). We effectively and efficiently scale diffusion models for planning in extended temporal, state, and task dimensions to tackle long horizon control problems conditioned on natural language instructions, as a step towards generalist agents. Comparing LCD with other state-of-the-art models on the CALVIN language robotics benchmark finds that LCD outperforms other SOTA methods in multi-task success rates, whilst improving inference speed over other comparable diffusion models by 3.3x~15x. We show that LCD can successfully leverage the unique strength of diffusion models to produce coherent long range plans while addressing their weakness in generating low-level details and control.
△ Less
Submitted 17 January, 2024; v1 submitted 27 October, 2022;
originally announced October 2022.
-
Superior damage tolerance of fish skins
Authors:
Emily Zhang,
Chi-Huan Tung,
Luyi Feng,
Yu Ren Zhou
Abstract:
Skin is the largest organ of many animals. Its protective function against hostile environments and predatorial attack makes high mechanical strength a vital characteristic. Here, we measured the mechanical properties of bass fish skins and found that fish skins are highly ductile with a rupture strain of up to 30-40% and a rupture strength of 10-15 MPa. The fish skins exhibit a strain-stiffening…
▽ More
Skin is the largest organ of many animals. Its protective function against hostile environments and predatorial attack makes high mechanical strength a vital characteristic. Here, we measured the mechanical properties of bass fish skins and found that fish skins are highly ductile with a rupture strain of up to 30-40% and a rupture strength of 10-15 MPa. The fish skins exhibit a strain-stiffening behavior. Stretching can effectively eliminate the stress concentrations near the pre-existing holes and edge notches, suggesting that the skins are highly damage tolerant. Our measurement determined a flaw-insensitivity length of several millimeters, which exceeds that of most engineering materials. The strain-stiffening and damage tolerance of fish skins are explained by an agent-based model of collagen network in which the load-bearing collagen microfibers assembled from nanofibrils undergo straightening and reorientation upon stretching. Our study inspires development of artificial skins that are thin, flexible, but highly fracture-resistant and widely applicable in soft robots.
△ Less
Submitted 26 October, 2022;
originally announced October 2022.
-
On Representations of Mean-Field Variational Inference
Authors:
Soumyadip Ghosh,
Yingdong Lu,
Tomasz Nowicki,
Edith Zhang
Abstract:
The mean field variational inference (MFVI) formulation restricts the general Bayesian inference problem to the subspace of product measures. We present a framework to analyze MFVI algorithms, which is inspired by a similar development for general variational Bayesian formulations. Our approach enables the MFVI problem to be represented in three different manners: a gradient flow on Wasserstein sp…
▽ More
The mean field variational inference (MFVI) formulation restricts the general Bayesian inference problem to the subspace of product measures. We present a framework to analyze MFVI algorithms, which is inspired by a similar development for general variational Bayesian formulations. Our approach enables the MFVI problem to be represented in three different manners: a gradient flow on Wasserstein space, a system of Fokker-Planck-like equations and a diffusion process. Rigorous guarantees are established to show that a time-discretized implementation of the coordinate ascent variational inference algorithm in the product Wasserstein space of measures yields a gradient flow in the limit. A similar result is obtained for their associated densities, with the limit being given by a quasi-linear partial differential equation. A popular class of practical algorithms falls in this framework, which provides tools to establish convergence. We hope this framework could be used to guarantee convergence of algorithms in a variety of approaches, old and new, to solve variational inference problems.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
Accelerating the Evolutionary Algorithms by Gaussian Process Regression with $ε$-greedy acquisition function
Authors:
Rui Zhong,
Enzhi Zhang,
Masaharu Munetomo
Abstract:
In this paper, we propose a novel method to estimate the elite individual to accelerate the convergence of optimization. Inspired by the Bayesian Optimization Algorithm (BOA), the Gaussian Process Regression (GPR) is applied to approximate the fitness landscape of original problems based on every generation of optimization. And simple but efficient $ε$-greedy acquisition function is employed to fi…
▽ More
In this paper, we propose a novel method to estimate the elite individual to accelerate the convergence of optimization. Inspired by the Bayesian Optimization Algorithm (BOA), the Gaussian Process Regression (GPR) is applied to approximate the fitness landscape of original problems based on every generation of optimization. And simple but efficient $ε$-greedy acquisition function is employed to find a promising solution in the surrogate model. Proximity Optimal Principle (POP) states that well-performed solutions have a similar structure, and there is a high probability of better solutions existing around the elite individual. Based on this hypothesis, in each generation of optimization, we replace the worst individual in Evolutionary Algorithms (EAs) with the elite individual to participate in the evolution process. To illustrate the scalability of our proposal, we combine our proposal with the Genetic Algorithm (GA), Differential Evolution (DE), and CMA-ES. Experimental results in CEC2013 benchmark functions show our proposal has a broad prospect to estimate the elite individual and accelerate the convergence of optimization.
△ Less
Submitted 13 October, 2022;
originally announced October 2022.
-
Accelerating the Genetic Algorithm for Large-scale Traveling Salesman Problems by Cooperative Coevolutionary Pointer Network with Reinforcement Learning
Authors:
Rui Zhong,
Enzhi Zhang,
Masaharu Munetomo
Abstract:
In this paper, we propose a two-stage optimization strategy for solving the Large-scale Traveling Salesman Problems (LSTSPs) named CCPNRL-GA. First, we hypothesize that the participation of a well-performed individual as an elite can accelerate the convergence of optimization. Based on this hypothesis, in the first stage, we cluster the cities and decompose the LSTSPs into multiple subcomponents,…
▽ More
In this paper, we propose a two-stage optimization strategy for solving the Large-scale Traveling Salesman Problems (LSTSPs) named CCPNRL-GA. First, we hypothesize that the participation of a well-performed individual as an elite can accelerate the convergence of optimization. Based on this hypothesis, in the first stage, we cluster the cities and decompose the LSTSPs into multiple subcomponents, and each subcomponent is optimized with a reusable Pointer Network (PtrNet). After subcomponents optimization, we combine all sub-tours to form a valid solution, this solution joins the second stage of optimization with GA. We validate the performance of our proposal on 10 LSTSPs and compare it with traditional EAs. Experimental results show that the participation of an elite individual can greatly accelerate the optimization of LSTSPs, and our proposal has broad prospects for dealing with LSTSPs.
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
Multi-dataset Training of Transformers for Robust Action Recognition
Authors:
Junwei Liang,
Enwei Zhang,
Jun Zhang,
Chunhua Shen
Abstract:
We study the task of robust feature representations, aiming to generalize well on multiple datasets for action recognition. We build our method on Transformers for its efficacy. Although we have witnessed great progress for video action recognition in the past decade, it remains challenging yet valuable how to train a single model that can perform well across multiple datasets. Here, we propose a…
▽ More
We study the task of robust feature representations, aiming to generalize well on multiple datasets for action recognition. We build our method on Transformers for its efficacy. Although we have witnessed great progress for video action recognition in the past decade, it remains challenging yet valuable how to train a single model that can perform well across multiple datasets. Here, we propose a novel multi-dataset training paradigm, MultiTrain, with the design of two new loss terms, namely informative loss and projection loss, aiming to learn robust representations for action recognition. In particular, the informative loss maximizes the expressiveness of the feature embedding while the projection loss for each dataset mines the intrinsic relations between classes across datasets. We verify the effectiveness of our method on five challenging datasets, Kinetics-400, Kinetics-700, Moments-in-Time, Activitynet and Something-something-v2 datasets. Extensive experimental results show that our method can consistently improve state-of-the-art performance. Code and models are released.
△ Less
Submitted 24 November, 2022; v1 submitted 25 September, 2022;
originally announced September 2022.
-
Document-aware Positional Encoding and Linguistic-guided Encoding for Abstractive Multi-document Summarization
Authors:
Congbo Ma,
Wei Emma Zhang,
Pitawelayalage Dasun Dileepa Pitawela,
Yutong Qu,
Haojie Zhuang,
Hu Wang
Abstract:
One key challenge in multi-document summarization is to capture the relations among input documents that distinguish between single document summarization (SDS) and multi-document summarization (MDS). Few existing MDS works address this issue. One effective way is to encode document positional information to assist models in capturing cross-document relations. However, existing MDS models, such as…
▽ More
One key challenge in multi-document summarization is to capture the relations among input documents that distinguish between single document summarization (SDS) and multi-document summarization (MDS). Few existing MDS works address this issue. One effective way is to encode document positional information to assist models in capturing cross-document relations. However, existing MDS models, such as Transformer-based models, only consider token-level positional information. Moreover, these models fail to capture sentences' linguistic structure, which inevitably causes confusions in the generated summaries. Therefore, in this paper, we propose document-aware positional encoding and linguistic-guided encoding that can be fused with Transformer architecture for MDS. For document-aware positional encoding, we introduce a general protocol to guide the selection of document encoding functions. For linguistic-guided encoding, we propose to embed syntactic dependency relations into the dependency relation mask with a simple but effective non-linear encoding learner for feature learning. Extensive experiments show the proposed model can generate summaries with high quality.
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
A Hybrid Iterative Numerical Transferable Solver (HINTS) for PDEs Based on Deep Operator Network and Relaxation Methods
Authors:
Enrui Zhang,
Adar Kahana,
Eli Turkel,
Rishikesh Ranade,
Jay Pathak,
George Em Karniadakis
Abstract:
Iterative solvers of linear systems are a key component for the numerical solutions of partial differential equations (PDEs). While there have been intensive studies through past decades on classical methods such as Jacobi, Gauss-Seidel, conjugate gradient, multigrid methods and their more advanced variants, there is still a pressing need to develop faster, more robust and reliable solvers. Based…
▽ More
Iterative solvers of linear systems are a key component for the numerical solutions of partial differential equations (PDEs). While there have been intensive studies through past decades on classical methods such as Jacobi, Gauss-Seidel, conjugate gradient, multigrid methods and their more advanced variants, there is still a pressing need to develop faster, more robust and reliable solvers. Based on recent advances in scientific deep learning for operator regression, we propose HINTS, a hybrid, iterative, numerical, and transferable solver for differential equations. HINTS combines standard relaxation methods and the Deep Operator Network (DeepONet). Compared to standard numerical solvers, HINTS is capable of providing faster solutions for a wide class of differential equations, while preserving the accuracy close to machine zero. Through an eigenmode analysis, we find that the individual solvers in HINTS target distinct regions in the spectrum of eigenmodes, resulting in a uniform convergence rate and hence exceptional performance of the hybrid solver overall. Moreover, HINTS applies to equations in multidimensions, and is flexible with regards to computational domain and transferable to different discretizations.
△ Less
Submitted 28 August, 2022;
originally announced August 2022.
-
G2Φnet: Relating Genotype and Biomechanical Phenotype of Tissues with Deep Learning
Authors:
Enrui Zhang,
Bart Spronck,
Jay D. Humphrey,
George Em Karniadakis
Abstract:
Many genetic mutations adversely affect the structure and function of load-bearing soft tissues, with clinical sequelae often responsible for disability or death. Parallel advances in genetics and histomechanical characterization provide significant insight into these conditions, but there remains a pressing need to integrate such information. We present a novel genotype-to-biomechanical-phenotype…
▽ More
Many genetic mutations adversely affect the structure and function of load-bearing soft tissues, with clinical sequelae often responsible for disability or death. Parallel advances in genetics and histomechanical characterization provide significant insight into these conditions, but there remains a pressing need to integrate such information. We present a novel genotype-to-biomechanical-phenotype neural network (G2Φnet) for characterizing and classifying biomechanical properties of soft tissues, which serve as important functional readouts of tissue health or disease. We illustrate the utility of our approach by inferring the nonlinear, genotype-dependent constitutive behavior of the aorta for four mouse models involving defects or deficiencies in extracellular constituents. We show that G2Φnet can infer the biomechanical response while simultaneously ascribing the associated genotype correctly by utilizing limited, noisy, and unstructured experimental data. More broadly, G2Φnet provides a powerful method and a paradigm shift for correlating genotype and biomechanical phenotype quantitatively, promising a better understanding of their interplay in biological tissues.
△ Less
Submitted 21 August, 2022;
originally announced August 2022.
-
Brand Celebrity Matching Model Based on Natural Language Processing
Authors:
Heming Yang,
Ke Yang,
Erhan Zhang
Abstract:
Celebrity Endorsement is one of the most significant strategies in brand communication. Nowadays, more and more companies try to build a vivid characteristic for themselves. Therefore, their brand identity communications should accord with some characteristics as humans and regulations. However, the previous works mostly stop by assumptions, instead of proposing a specific way to perform matching…
▽ More
Celebrity Endorsement is one of the most significant strategies in brand communication. Nowadays, more and more companies try to build a vivid characteristic for themselves. Therefore, their brand identity communications should accord with some characteristics as humans and regulations. However, the previous works mostly stop by assumptions, instead of proposing a specific way to perform matching between brands and celebrities. In this paper, we propose a brand celebrity matching model (BCM) based on Natural Language Processing (NLP) techniques. Given a brand and a celebrity, we firstly obtain some descriptive documents of them from the Internet, then summarize these documents, and finally calculate a matching degree between the brand and the celebrity to determine whether they are matched. According to the experimental result, our proposed model outperforms the best baselines with a 0.362 F1 score and 6.3% of accuracy, which indicates the effectiveness and application value of our model in the real-world scene. What's more, to our best knowledge, the proposed BCM model is the first work on using NLP to solve endorsement issues, so it can provide some novel research ideas and methodologies for the following works.
△ Less
Submitted 18 August, 2022;
originally announced August 2022.
-
All-electrical switching of a topological non-collinear antiferromagnet at room temperature
Authors:
Yongcheng Deng,
Xionghua Liu,
Yiyuan Chen,
Zongzheng Du,
Nai Jiang,
Chao Shen,
Enze Zhang,
Houzhi Zheng,
Hai-Zhou Lu,
Kaiyou Wang
Abstract:
Non-collinear antiferromagnetic Weyl semimetals, combining the advantages of a zero stray field and ultrafast spin dynamics as well as a large anomalous Hall effect and the chiral anomaly of Weyl fermions, have attracted extensive interests. However, the all-electrical control of such systems at room temperature, a crucial step toward practical applications, has not been reported. Here using a sma…
▽ More
Non-collinear antiferromagnetic Weyl semimetals, combining the advantages of a zero stray field and ultrafast spin dynamics as well as a large anomalous Hall effect and the chiral anomaly of Weyl fermions, have attracted extensive interests. However, the all-electrical control of such systems at room temperature, a crucial step toward practical applications, has not been reported. Here using a small writing current of around 5*10^{6} A/cm^{2}, we realize the all-electrical current-induced deterministic switching of the non-collinear antiferromagnet Mn3Sn with a strong readout signal at room temperature in the Si/SiO2/Mn3Sn/AlOx structure, without external magnetic field and injected spin current. Our simulations reveal that the switching is originated from the current-induced intrinsic non-collinear spin-orbit torques in Mn3Sn itself. Our findings pave the way for the development of topological antiferromagnetic spintronics.
△ Less
Submitted 26 July, 2022;
originally announced July 2022.
-
Class-Aware Universum Inspired Re-Balance Learning for Long-Tailed Recognition
Authors:
Enhao Zhang,
Chuanxing Geng,
Songcan Chen
Abstract:
Data augmentation for minority classes is an effective strategy for long-tailed recognition, thus develo** a large number of methods. Although these methods all ensure the balance in sample quantity, the quality of the augmented samples is not always satisfactory for recognition, being prone to such problems as over-fitting, lack of diversity, semantic drift, etc. For these issues, we propose th…
▽ More
Data augmentation for minority classes is an effective strategy for long-tailed recognition, thus develo** a large number of methods. Although these methods all ensure the balance in sample quantity, the quality of the augmented samples is not always satisfactory for recognition, being prone to such problems as over-fitting, lack of diversity, semantic drift, etc. For these issues, we propose the Class-aware Universum Inspired Re-balance Learning(CaUIRL) for long-tailed recognition, which endows the Universum with class-aware ability to re-balance individual minority classes from both sample quantity and quality. In particular, we theoretically prove that the classifiers learned by CaUIRL are consistent with those learned under the balanced condition from a Bayesian perspective. In addition, we further develop a higher-order mixup approach, which can automatically generate class-aware Universum(CaU) data without resorting to any external data. Unlike the traditional Universum, such generated Universum additionally takes the domain similarity, class separability, and sample diversity into account. Extensive experiments on benchmark datasets demonstrate the surprising advantages of our method, especially the top1 accuracy in minority classes is improved by 1.9% 6% compared to the state-of-the-art method.
△ Less
Submitted 11 August, 2022; v1 submitted 26 July, 2022;
originally announced July 2022.
-
A Synergistic Compilation Workflow for Tackling Crosstalk in Quantum Machines
Authors:
Fei Hua,
Yuwei **,
Ang Li,
Chenxu Liu,
Meng Wang,
Yanhao Chen,
Chi Zhang,
Ari Hayes,
Samuel Stein,
Minghao Guo,
Yipeng Huang,
Eddy Z. Zhang
Abstract:
Near-term quantum systems tend to be noisy. Crosstalk noise has been recognized as one of several major types of noises in superconducting Noisy Intermediate-Scale Quantum (NISQ) devices. Crosstalk arises from the concurrent execution of two-qubit gates on nearby qubits, such as \texttt{CX}. It might significantly raise the error rate of gates in comparison to running them individually. Crosstalk…
▽ More
Near-term quantum systems tend to be noisy. Crosstalk noise has been recognized as one of several major types of noises in superconducting Noisy Intermediate-Scale Quantum (NISQ) devices. Crosstalk arises from the concurrent execution of two-qubit gates on nearby qubits, such as \texttt{CX}. It might significantly raise the error rate of gates in comparison to running them individually. Crosstalk can be mitigated through scheduling or hardware machine tuning. Prior scientific studies, however, manage crosstalk at a really late phase in the compilation process, usually after hardware map** is done. It may miss great opportunities of optimizing algorithm logic, routing, and crosstalk at the same time. In this paper, we push the envelope by considering all these factors simultaneously at the very early compilation stage. We propose a crosstalk-aware quantum program compilation framework called CQC that can enhance crosstalk mitigation while achieving satisfactory circuit depth. Moreover, we identify opportunities for translation from intermediate representation to the circuit for application-specific crosstalk mitigation, for instance, the \texttt{CX} ladder construction in variational quantum eigensolvers (VQE). Evaluations through simulation and on real IBM-Q devices show that our framework can significantly reduce the error rate by up to 6$\times$, with only $\sim$60\% circuit depth compared to state-of-the-art gate scheduling approaches. In particular, for VQE, we demonstrate 49\% circuit depth reduction with 9.6\% fidelity improvement over prior art on the H4 molecule using IBMQ Guadalupe. Our CQC framework will be released on GitHub.
△ Less
Submitted 8 December, 2023; v1 submitted 12 July, 2022;
originally announced July 2022.
-
Transferability-Guided Cross-Domain Cross-Task Transfer Learning
Authors:
Yang Tan,
Enming Zhang,
Yang Li,
Shao-Lun Huang,
Xiao-** Zhang
Abstract:
We propose two novel transferability metrics F-OTCE (Fast Optimal Transport based Conditional Entropy) and JC-OTCE (Joint Correspondence OTCE) to evaluate how much the source model (task) can benefit the learning of the target task and to learn more transferable representations for cross-domain cross-task transfer learning. Unlike the existing metric that requires evaluating the empirical transfer…
▽ More
We propose two novel transferability metrics F-OTCE (Fast Optimal Transport based Conditional Entropy) and JC-OTCE (Joint Correspondence OTCE) to evaluate how much the source model (task) can benefit the learning of the target task and to learn more transferable representations for cross-domain cross-task transfer learning. Unlike the existing metric that requires evaluating the empirical transferability on auxiliary tasks, our metrics are auxiliary-free such that they can be computed much more efficiently. Specifically, F-OTCE estimates transferability by first solving an Optimal Transport (OT) problem between source and target distributions, and then uses the optimal coupling to compute the Negative Conditional Entropy between source and target labels. It can also serve as a loss function to maximize the transferability of the source model before finetuning on the target task. Meanwhile, JC-OTCE improves the transferability robustness of F-OTCE by including label distances in the OT problem, though it may incur additional computation cost. Extensive experiments demonstrate that F-OTCE and JC-OTCE outperform state-of-the-art auxiliary-free metrics by 18.85% and 28.88%, respectively in correlation coefficient with the ground-truth transfer accuracy. By eliminating the training cost of auxiliary tasks, the two metrics reduces the total computation time of the previous method from 43 minutes to 9.32s and 10.78s, respectively, for a pair of tasks. When used as a loss function, F-OTCE shows consistent improvements on the transfer accuracy of the source model in few-shot classification experiments, with up to 4.41% accuracy gain.
△ Less
Submitted 29 February, 2024; v1 submitted 12 July, 2022;
originally announced July 2022.
-
An optimal transport based characterization of convex order
Authors:
Johannes Wiesel,
Erica Zhang
Abstract:
For probability measures $μ,ν$ and $ρ$ define the cost functionals \begin{align*} C(μ,ρ):=\sup_{π\in Π(μ,ρ)} \int \langle x,y\rangle\, π(dx,dy),\quad C(ν,ρ):=\sup_{π\in Π(ν,ρ)} \int \langle x,y\rangle\, π(dx,dy), \end{align*} where $\langle\cdot, \cdot\rangle$ denotes the scalar product and $Π(\cdot,\cdot)$ is the set of couplings. We show that two probability measures $μ$ and $ν$ on…
▽ More
For probability measures $μ,ν$ and $ρ$ define the cost functionals \begin{align*} C(μ,ρ):=\sup_{π\in Π(μ,ρ)} \int \langle x,y\rangle\, π(dx,dy),\quad C(ν,ρ):=\sup_{π\in Π(ν,ρ)} \int \langle x,y\rangle\, π(dx,dy), \end{align*} where $\langle\cdot, \cdot\rangle$ denotes the scalar product and $Π(\cdot,\cdot)$ is the set of couplings. We show that two probability measures $μ$ and $ν$ on $\mathbb{R}^d$ with finite first moments are in convex order (i.e. $μ\preceq_cν$) iff $C(μ,ρ)\le C(ν,ρ)$ holds for all probability measures $ρ$ on $\mathbb{R}^d$ with bounded support. This generalizes a result by Carlier. Our proof relies on a quantitative bound for the infimum of $\int f\,dν-\int f\,dμ$ over all $1$-Lipschitz functions $f$, which is obtained through optimal transport duality and Brenier's theorem. Building on this result, we derive new proofs of well-known one-dimensional characterizations of convex order. We also describe new computational methods for investigating convex order and applications to model-independent arbitrage strategies in mathematical finance.
△ Less
Submitted 8 March, 2023; v1 submitted 4 July, 2022;
originally announced July 2022.
-
Geometrical frustration versus Kitaev interactions in BaCo$_2$(AsO$_4$)$_2$
Authors:
Thomas Halloran,
Félix Desrochers,
Emily Z. Zhang,
Tong Chen,
Li Ern Chern,
Zhijun Xu,
Barry Winn,
M. K. Graves-Brook,
M. B. Stone,
Alexander I. Kolesnikov,
Yiming Qui,
Ruidan Zhong,
Robert Cava,
Yong Baek Kim,
Collin Broholm
Abstract:
Recently, Co-based honeycomb magnets have been proposed as promising candidate materials to host the Kitaev spin liquid state. One of the front-runners is BaCo$_2$(AsO$_4$)$_2$ (BCAO), where it was suggested that the exchange processes between Co$^{2+}$ ions via the surrounding edge-sharing oxygen octahedra could give rise to bond-dependent Kitaev interactions. In this work, we present and analyze…
▽ More
Recently, Co-based honeycomb magnets have been proposed as promising candidate materials to host the Kitaev spin liquid state. One of the front-runners is BaCo$_2$(AsO$_4$)$_2$ (BCAO), where it was suggested that the exchange processes between Co$^{2+}$ ions via the surrounding edge-sharing oxygen octahedra could give rise to bond-dependent Kitaev interactions. In this work, we present and analyze comprehensive inelastic neutron scattering studies of BCAO with fields in the honeycomb plane. Combining the constraints from the magnon excitations in the high-field polarized state and the inelastic spin structure factor measured in zero magnetic field, we examine two leading theoretical models: the Kitaev-type \JKG model and the \XXZ model. We show that the existing experimental data can be consistently accounted for by the \XXZ model but not by the \JKG model, and we discuss the implications of these results for the realization of a spin liquid phase in BCAO and more generally for the realization of the Kitaev model in cobaltates.
△ Less
Submitted 30 May, 2022;
originally announced May 2022.
-
Effective Field Theory Analysis of CDMSlite Run 2 Data
Authors:
SuperCDMS Collaboration,
M. F. Albakry,
I. Alkhatib,
D. W. P. Amaral,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
C. Bathurst,
D. A. Bauer,
L. V. S. Bezerra,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeño,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott
, et al. (105 additional authors not shown)
Abstract:
CDMSlite Run 2 was a search for weakly interacting massive particles (WIMPs) with a cryogenic 600 g Ge detector operated in a high-voltage mode to optimize sensitivity to WIMPs of relatively low mass from 2 - 20 GeV/$c^2$. In this article, we present an effective field theory (EFT) analysis of the CDMSlite Run 2 data using an extended energy range and a comprehensive treatment of the expected back…
▽ More
CDMSlite Run 2 was a search for weakly interacting massive particles (WIMPs) with a cryogenic 600 g Ge detector operated in a high-voltage mode to optimize sensitivity to WIMPs of relatively low mass from 2 - 20 GeV/$c^2$. In this article, we present an effective field theory (EFT) analysis of the CDMSlite Run 2 data using an extended energy range and a comprehensive treatment of the expected background. A binned likelihood Bayesian analysis was performed on the recoil energy data, taking into account the parameters of the EFT interactions and optimizing the data selection with respect to the dominant background components. Energy regions within 5$σ$ of known activation peaks were removed from the analysis. The Bayesian evidences resulting from the different operator hypotheses show that the CDMSlite Run 2 data are consistent with the background-only models and do not allow for a signal interpretation assuming any additional EFT interaction. Consequently, upper limits on the WIMP mass and coupling-coefficient amplitudes and phases are presented for each EFT operator. These limits improve previous CDMSlite Run 2 bounds for WIMP masses above 5 GeV/$c^2$.
△ Less
Submitted 23 May, 2022;
originally announced May 2022.
-
Time Series Anomaly Detection via Reinforcement Learning-Based Model Selection
Authors:
Jiuqi Elise Zhang,
Di Wu,
Benoit Boulet
Abstract:
Time series anomaly detection has been recognized as of critical importance for the reliable and efficient operation of real-world systems. Many anomaly detection methods have been developed based on various assumptions on anomaly characteristics. However, due to the complex nature of real-world data, different anomalies within a time series usually have diverse profiles supporting different anoma…
▽ More
Time series anomaly detection has been recognized as of critical importance for the reliable and efficient operation of real-world systems. Many anomaly detection methods have been developed based on various assumptions on anomaly characteristics. However, due to the complex nature of real-world data, different anomalies within a time series usually have diverse profiles supporting different anomaly assumptions. This makes it difficult to find a single anomaly detector that can consistently outperform other models. In this work, to harness the benefits of different base models, we propose a reinforcement learning-based model selection framework. Specifically, we first learn a pool of different anomaly detection models, and then utilize reinforcement learning to dynamically select a candidate model from these base models. Experiments on real-world data have demonstrated that the proposed strategy can indeed outplay all baseline models in terms of overall performance.
△ Less
Submitted 27 July, 2022; v1 submitted 19 May, 2022;
originally announced May 2022.
-
A model aggregation approach for high-dimensional large-scale optimization
Authors:
Haowei Wang,
Ercong Zhang,
Szu Hui Ng,
Giulia Pedrielli
Abstract:
Bayesian optimization (BO) has been widely used in machine learning and simulation optimization. With the increase in computational resources and storage capacities in these fields, high-dimensional and large-scale problems are becoming increasingly common. In this study, we propose a model aggregation method in the Bayesian optimization (MamBO) algorithm for efficiently solving high-dimensional l…
▽ More
Bayesian optimization (BO) has been widely used in machine learning and simulation optimization. With the increase in computational resources and storage capacities in these fields, high-dimensional and large-scale problems are becoming increasingly common. In this study, we propose a model aggregation method in the Bayesian optimization (MamBO) algorithm for efficiently solving high-dimensional large-scale optimization problems. MamBO uses a combination of subsampling and subspace embeddings to collectively address high dimensionality and large-scale issues; in addition, a model aggregation method is employed to address the surrogate model uncertainty issue that arises when embedding is applied. This surrogate model uncertainty issue is largely ignored in the embedding literature and practice, and it is exacerbated when the problem is high-dimensional and data are limited. Our proposed model aggregation method reduces these lower-dimensional surrogate model risks and improves the robustness of the BO algorithm. We derive an asymptotic bound for the proposed aggregated surrogate model and prove the convergence of MamBO. Benchmark numerical experiments indicate that our algorithm achieves superior or comparable performance to other commonly used high-dimensional BO algorithms. Moreover, we apply MamBO to a cascade classifier of a machine learning algorithm for face detection, and the results reveal that MamBO finds settings that achieve higher classification accuracy than the benchmark settings and is computationally faster than other high-dimensional BO algorithms.
△ Less
Submitted 1 June, 2022; v1 submitted 16 May, 2022;
originally announced May 2022.
-
Reconnecting the Estranged Relationships: Optimizing the Influence Propagation in Evolving Networks
Authors:
Taotao Cai,
Qi Lei,
Quan Z. Sheng,
Shuiqiao Yang,
Jian Yang,
Wei Emma Zhang
Abstract:
Influence Maximization (IM), which aims to select a set of users from a social network to maximize the expected number of influenced users, has recently received significant attention for mass communication and commercial marketing. Existing research efforts dedicated to the IM problem depend on a strong assumption: the selected seed users are willing to spread the information after receiving bene…
▽ More
Influence Maximization (IM), which aims to select a set of users from a social network to maximize the expected number of influenced users, has recently received significant attention for mass communication and commercial marketing. Existing research efforts dedicated to the IM problem depend on a strong assumption: the selected seed users are willing to spread the information after receiving benefits from a company or organization. In reality, however, some seed users may be reluctant to spread the information, or need to be paid higher to be motivated. Furthermore, the existing IM works pay little attention to capture user's influence propagation in the future period as well. In this paper, we target a new research problem, named Reconnecting Top-l Relationships (RTlR) query, which aims to find l number of previous existing relationships but being stranged later, such that reconnecting these relationships will maximize the expected benefit of influenced users by the given group in a future period. We prove that the RTlR problem is NP-hard. An efficient greedy algorithm is proposed to answer the RTlR queries with the influence estimation technique and the well-chosen link prediction method to predict the near future network structure. We also design a pruning method to reduce unnecessary probing from candidate edges. Further, a carefully designed order-based algorithm is proposed to accelerate the RTlR queries. Finally, we conduct extensive experiments on real-world datasets to demonstrate the effectiveness and efficiency of our proposed methods.
△ Less
Submitted 10 May, 2022;
originally announced May 2022.
-
Mixed-UNet: Refined Class Activation Map** for Weakly-Supervised Semantic Segmentation with Multi-scale Inference
Authors:
Yang Liu,
Ersi Zhang,
Lulu Xu,
Chufan Xiao,
Xiaoyun Zhong,
Li** Lian,
Fang Li,
Bin Jiang,
Yuhan Dong,
Lan Ma,
Qiming Huang,
Ming Xu,
Yongbing Zhang,
Dongmei Yu,
Chenggang Yan,
Peiwu Qin
Abstract:
Deep learning techniques have shown great potential in medical image processing, particularly through accurate and reliable image segmentation on magnetic resonance imaging (MRI) scans or computed tomography (CT) scans, which allow the localization and diagnosis of lesions. However, training these segmentation models requires a large number of manually annotated pixel-level labels, which are time-…
▽ More
Deep learning techniques have shown great potential in medical image processing, particularly through accurate and reliable image segmentation on magnetic resonance imaging (MRI) scans or computed tomography (CT) scans, which allow the localization and diagnosis of lesions. However, training these segmentation models requires a large number of manually annotated pixel-level labels, which are time-consuming and labor-intensive, in contrast to image-level labels that are easier to obtain. It is imperative to resolve this problem through weakly-supervised semantic segmentation models using image-level labels as supervision since it can significantly reduce human annotation efforts. Most of the advanced solutions exploit class activation map** (CAM). However, the original CAMs rarely capture the precise boundaries of lesions. In this study, we propose the strategy of multi-scale inference to refine CAMs by reducing the detail loss in single-scale reasoning. For segmentation, we develop a novel model named Mixed-UNet, which has two parallel branches in the decoding phase. The results can be obtained after fusing the extracted features from two branches. We evaluate the designed Mixed-UNet against several prevalent deep learning-based segmentation approaches on our dataset collected from the local hospital and public datasets. The validation results demonstrate that our model surpasses available methods under the same supervision level in the segmentation of various lesions from brain imaging.
△ Less
Submitted 6 May, 2022;
originally announced May 2022.
-
Trust-SIoT: Towards Trustworthy Object Classification in the Social Internet of Things
Authors:
Subhash Sagar,
Adnan Mahmood,
Kai Wang,
Quan Z. Sheng,
Wei Emma Zhang
Abstract:
The recent emergence of the promising paradigm of the Social Internet of Things (SIoT) is a result of an intelligent amalgamation of the social networking concepts with the Internet of Things (IoT) objects (also referred to as "things") in an attempt to unravel the challenges of network discovery, navigability, and service composition. This is realized by facilitating the IoT objects to socialize…
▽ More
The recent emergence of the promising paradigm of the Social Internet of Things (SIoT) is a result of an intelligent amalgamation of the social networking concepts with the Internet of Things (IoT) objects (also referred to as "things") in an attempt to unravel the challenges of network discovery, navigability, and service composition. This is realized by facilitating the IoT objects to socialize with one another, i.e., similar to the social interactions amongst the human beings. A fundamental issue that mandates careful attention is to thus establish, and over time, maintain trustworthy relationships amongst these IoT objects. Therefore, a trust framework for SIoT must include object-object interactions, the aspects of social relationships, credible recommendations, etc., however, the existing literature has only focused on some aspects of trust by primarily relying on the conventional approaches that govern linear relationships between input and output. In this paper, an artificial neural network-based trust framework, Trust-SIoT, has been envisaged for identifying the complex non-linear relationships between input and output in a bid to classify the trustworthy objects. Moreover, Trust-SIoT has been designed for capturing a number of key trust metrics as input, i.e., direct trust by integrating both current and past interactions, reliability, and benevolence of an object, credible recommendations, and the degree of relationship by employing a knowledge graph embedding. Finally, we have performed extensive experiments to evaluate the performance of Trust-SIoT vis-a-vis state-of-the-art heuristics on two real-world datasets. The results demonstrate that Trust-SIoT achieves a higher F1 and lower MAE and MSE scores.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
Ultrathin, high-speed, all-optical photoacoustic endomicroscopy probe for guiding minimally invasive surgery
Authors:
Tianrui Zhao,
Truc Thuy Pham,
Christian Baker,
Michelle T. Ma,
Sebastien Ourselin,
Tom Vercauteren,
Edward Zhang,
Paul C. Beard,
Wenfeng Xia
Abstract:
Photoacoustic (PA) endoscopy has shown significant potential for clinical diagnosis and surgical guidance. Multimode fibres (MMFs) are becoming increasing attractive for the development of miniature endoscopy probes owing to ultrathin size, low cost and diffraction-limited spatial resolution enabled by wavefront sha**. However, current MMF-based PA endomicroscopy probes are either limited by a b…
▽ More
Photoacoustic (PA) endoscopy has shown significant potential for clinical diagnosis and surgical guidance. Multimode fibres (MMFs) are becoming increasing attractive for the development of miniature endoscopy probes owing to ultrathin size, low cost and diffraction-limited spatial resolution enabled by wavefront sha**. However, current MMF-based PA endomicroscopy probes are either limited by a bulky ultrasound detector or a low imaging speed which hindered their usability. In this work, we report the development of a highly miniaturised and high-speed PA endomicroscopy probe that is integrated within the cannula of a 20 gauge medical needle. This probe comprises a MMF for delivering the PA excitation light and a single-mode optical fibre with a plano-concave microresonator for ultrasound detection. Wavefront sha** with a digital micromirror device enabled rapid raster-scanning of a focused light spot at the distal end of the MMF for tissue interrogation. High-resolution PA imaging of mouse red blood cells covering an area 100 microns in diameter was achieved with the needle probe at ~3 frames per second. Mosaicing imaging was performed after fibre characterisation by translating the needle probe to enlarge the field-of-view in real-time. The developed ultrathin PA endomicroscopy probe is promising for guiding minimally invasive surgery by providing functional, molecular and microstructural information of tissue in real-time.
△ Less
Submitted 6 May, 2022;
originally announced May 2022.
-
Detecting Textual Adversarial Examples Based on Distributional Characteristics of Data Representations
Authors:
Na Liu,
Mark Dras,
Wei Emma Zhang
Abstract:
Although deep neural networks have achieved state-of-the-art performance in various machine learning tasks, adversarial examples, constructed by adding small non-random perturbations to correctly classified inputs, successfully fool highly expressive deep classifiers into incorrect predictions. Approaches to adversarial attacks in natural language tasks have boomed in the last five years using cha…
▽ More
Although deep neural networks have achieved state-of-the-art performance in various machine learning tasks, adversarial examples, constructed by adding small non-random perturbations to correctly classified inputs, successfully fool highly expressive deep classifiers into incorrect predictions. Approaches to adversarial attacks in natural language tasks have boomed in the last five years using character-level, word-level, phrase-level, or sentence-level textual perturbations. While there is some work in NLP on defending against such attacks through proactive methods, like adversarial training, there is to our knowledge no effective general reactive approaches to defence via detection of textual adversarial examples such as is found in the image processing literature. In this paper, we propose two new reactive methods for NLP to fill this gap, which unlike the few limited application baselines from NLP are based entirely on distribution characteristics of learned representations: we adapt one from the image processing literature (Local Intrinsic Dimensionality (LID)), and propose a novel one (MultiDistance Representation Ensemble Method (MDRE)). Adapted LID and MDRE obtain state-of-the-art results on character-level, word-level, and phrase-level attacks on the IMDB dataset as well as on the later two with respect to the MultiNLI dataset. For future research, we publish our code.
△ Less
Submitted 28 April, 2022;
originally announced April 2022.
-
Knowledge-aware Document Summarization: A Survey of Knowledge, Embedding Methods and Architectures
Authors:
Yutong Qu,
Wei Emma Zhang,
Jian Yang,
Lingfei Wu,
Jia Wu
Abstract:
Knowledge-aware methods have boosted a range of natural language processing applications over the last decades. With the gathered momentum, knowledge recently has been pumped into enormous attention in document summarization, one of natural language processing applications. Previous works reported that knowledge-embedded document summarizers excel at generating superior digests, especially in term…
▽ More
Knowledge-aware methods have boosted a range of natural language processing applications over the last decades. With the gathered momentum, knowledge recently has been pumped into enormous attention in document summarization, one of natural language processing applications. Previous works reported that knowledge-embedded document summarizers excel at generating superior digests, especially in terms of informativeness, coherence, and fact consistency. This paper pursues to present the first systematic survey for the state-of-the-art methodologies that embed knowledge into document summarizers. Particularly, we propose novel taxonomies to recapitulate knowledge and knowledge embeddings under the document summarization view. We further explore how embeddings are generated in embedding learning architectures of document summarization models, especially of deep learning models. At last, we discuss the challenges of this topic and future directions.
△ Less
Submitted 9 July, 2022; v1 submitted 24 April, 2022;
originally announced April 2022.
-
Investigating the sources of low-energy events in a SuperCDMS-HVeV detector
Authors:
SuperCDMS Collaboration,
M. F. Albakry,
I. Alkhatib,
D. W. P. Amaral,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
C. Bathurst,
D. A. Bauer,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeño,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott,
J. Cooley
, et al. (104 additional authors not shown)
Abstract:
Recent experiments searching for sub-GeV/$c^2$ dark matter have observed event excesses close to their respective energy thresholds. Although specific to the individual technologies, the measured excess event rates have been consistently reported at or below event energies of a few-hundred eV, or with charges of a few electron-hole pairs. In the present work, we operated a 1-gram silicon SuperCDMS…
▽ More
Recent experiments searching for sub-GeV/$c^2$ dark matter have observed event excesses close to their respective energy thresholds. Although specific to the individual technologies, the measured excess event rates have been consistently reported at or below event energies of a few-hundred eV, or with charges of a few electron-hole pairs. In the present work, we operated a 1-gram silicon SuperCDMS-HVeV detector at three voltages across the crystal (0 V, 60 V and 100 V). The 0 V data show an excess of events in the tens of eV region. Despite this event excess, we demonstrate the ability to set a competitive exclusion limit on the spin-independent dark matter--nucleon elastic scattering cross section for dark matter masses of $\mathcal{O}(100)$ MeV/$c^2$, enabled by operation of the detector at 0 V potential and achievement of a very low $\mathcal{O}(10)$ eV threshold for nuclear recoils. Comparing the data acquired at 0 V, 60 V and 100 V potentials across the crystal, we investigated possible sources of the unexpected events observed at low energy. The data indicate that the dominant contribution to the excess is consistent with a hypothesized luminescence from the printed circuit boards used in the detector holder.
△ Less
Submitted 11 October, 2022; v1 submitted 17 April, 2022;
originally announced April 2022.
-
A Survey on Location-Driven Influence Maximization
Authors:
Taotao Cai,
Quan Z. Sheng,
Xiangyu Song,
Jian Yang,
Shuang Wang,
Wei Emma Zhang,
Jia Wu,
Philip S. Yu
Abstract:
Influence Maximization (IM), which aims to select a set of users from a social network to maximize the expected number of influenced users, is an evergreen hot research topic. Its research outcomes significantly impact real-world applications such as business marketing. The booming location-based network platforms of the last decade appeal to the researchers embedding the location information into…
▽ More
Influence Maximization (IM), which aims to select a set of users from a social network to maximize the expected number of influenced users, is an evergreen hot research topic. Its research outcomes significantly impact real-world applications such as business marketing. The booming location-based network platforms of the last decade appeal to the researchers embedding the location information into traditional IM research. In this survey, we provide a comprehensive review of the existing location-driven IM studies from the perspective of the following key aspects: (1) a review of the application scenarios of these works, (2) the diffusion models to evaluate the influence propagation, and (3) a comprehensive study of the approaches to deal with the location-driven IM problems together with a particular focus on the accelerating techniques. In the end, we draw prospects into the research directions in future IM research.
△ Less
Submitted 14 September, 2022; v1 submitted 17 April, 2022;
originally announced April 2022.
-
Stability of Multi-dimensional Nonlinear Piezoelectric Beam with Viscoelastic Infinite Memory
Authors:
H. E Zhang,
G. Q. Xu,
Z. J. Han
Abstract:
The long time behavior of a kind of fully magnetic effected nonlinear piezoelectric beam with viscoelastic infinite memory is considered. The well-posedness of this nonlinear coupled PDEs system is showed by mean of the semigroup theories and Banach fixed point theorem. Based on frequency domain analysis, it is proved that the corresponding coupled linear system can be indirectly stabilized expone…
▽ More
The long time behavior of a kind of fully magnetic effected nonlinear piezoelectric beam with viscoelastic infinite memory is considered. The well-posedness of this nonlinear coupled PDEs system is showed by mean of the semigroup theories and Banach fixed point theorem. Based on frequency domain analysis, it is proved that the corresponding coupled linear system can be indirectly stabilized exponentially by only one viscoelastic infinite memory term, which is located on one equation of these strongly coupled PDEs. Then the exponential decay of the solution to the nonlinear coupled PDEs' system is established by the energy estimation method under certain condition.
△ Less
Submitted 5 September, 2022; v1 submitted 7 April, 2022;
originally announced April 2022.
-
Study of Electroweak Phase Transition in Exotic Higgs Decays at the CEPC
Authors:
Zhen Wang,
Xuliang Zhu,
Elham E Khoda,
Shih-Chieh Hsu,
Nikolaos Konstantinidis,
Ke Li,
Shu Li,
Michael J. Ramsey-Musolf,
Yanda Wu,
Yuwen E. Zhang
Abstract:
A strong first-order electroweak phase transition (EWPT) can be induced by light new physics weakly coupled to the Higgs. This study focuses on a scenario in which the first-order EWPT is driven by a light scalar $s$ with a mass between 15-60 GeV. A search for exotic decays of the Higgs boson into a pair of spin-zero particles, $h \to ss$, where the $s$-boson decays into $b$-quarks promptly is pre…
▽ More
A strong first-order electroweak phase transition (EWPT) can be induced by light new physics weakly coupled to the Higgs. This study focuses on a scenario in which the first-order EWPT is driven by a light scalar $s$ with a mass between 15-60 GeV. A search for exotic decays of the Higgs boson into a pair of spin-zero particles, $h \to ss$, where the $s$-boson decays into $b$-quarks promptly is presented. The search is performed in events where the Higgs boson is produced in association with a $Z$ boson, giving rise to a signature of two charged leptons (electrons or muons) and multiple jets from $b$-quark decays. The analysis is considering a scenario of analysing 5000 fb$^{-1}$ $e^+ e^-$ collision data at $\sqrt{s} = 240 $ GeV from the Circular Electron Positron Collider (CEPC). This study with $4b$ final state conclusively tests the expected sensitivity of probing the light scalars in the CEPC experiment. The sensitivity reach is significantly larger than that can be achieved at the LHC.
△ Less
Submitted 18 March, 2022;
originally announced March 2022.
-
A Strategy for Low-Mass Dark Matter Searches with Cryogenic Detectors in the SuperCDMS SNOLAB Facility
Authors:
SuperCDMS Collaboration,
M. F. Albakry,
I. Alkhatib,
D. W. P. Amaral,
T. Aralis,
T. Aramaki,
I. J. Arnquist,
I. Ataee Langroudy,
E. Azadbakht,
S. Banik,
C. Bathurst,
D. A. Bauer,
R. Bhattacharyya,
P. L. Brink,
R. Bunker,
B. Cabrera,
R. Calkins,
R. A. Cameron,
C. Cartaro,
D. G. Cerdeno,
Y. -Y. Chang,
M. Chaudhuri,
R. Chen,
N. Chott,
J. Cooley
, et al. (103 additional authors not shown)
Abstract:
The SuperCDMS Collaboration is currently building SuperCDMS SNOLAB, a dark matter search focused on nucleon-coupled dark matter in the 1-5 GeV/c$^2$ mass range. Looking to the future, the Collaboration has developed a set of experience-based upgrade scenarios, as well as novel directions, to extend the search for dark matter using the SuperCDMS technology in the SNOLAB facility. The experienced-ba…
▽ More
The SuperCDMS Collaboration is currently building SuperCDMS SNOLAB, a dark matter search focused on nucleon-coupled dark matter in the 1-5 GeV/c$^2$ mass range. Looking to the future, the Collaboration has developed a set of experience-based upgrade scenarios, as well as novel directions, to extend the search for dark matter using the SuperCDMS technology in the SNOLAB facility. The experienced-based scenarios are forecasted to probe many square decades of unexplored dark matter parameter space below 5 GeV/c$^2$, covering over 6 decades in mass: 1-100 eV/c$^2$ for dark photons and axion-like particles, 1-100 MeV/c$^2$ for dark-photon-coupled light dark matter, and 0.05-5 GeV/c$^2$ for nucleon-coupled dark matter. They will reach the neutrino fog in the 0.5-5 GeV/c$^2$ mass range and test a variety of benchmark models and sharp targets. The novel directions involve greater departures from current SuperCDMS technology but promise even greater reach in the long run, and their development must begin now for them to be available in a timely fashion.
The experienced-based upgrade scenarios rely mainly on dramatic improvements in detector performance based on demonstrated scaling laws and reasonable extrapolations of current performance. Importantly, these improvements in detector performance obviate significant reductions in background levels beyond current expectations for the SuperCDMS SNOLAB experiment. Given that the dominant limiting backgrounds for SuperCDMS SNOLAB are cosmogenically created radioisotopes in the detectors, likely amenable only to isotopic purification and an underground detector life-cycle from before crystal growth to detector testing, the potential cost and time savings are enormous and the necessary improvements much easier to prototype.
△ Less
Submitted 1 April, 2023; v1 submitted 16 March, 2022;
originally announced March 2022.
-
A Search for Low-mass Dark Matter via Bremsstrahlung Radiation and the Migdal Effect in SuperCDMS
Authors:
SuperCDMS Collaboration,
Musaab Al-Bakry,
Imran Alkhatib,
Dorian Praia do Amaral,
Taylor Aralis,
Tsuguo Aramaki,
Isaac Arnquist,
Iman Ataee Langroudy,
Elham Azadbakht,
Samir Banik,
Corey Bathurst,
Dan Bauer,
Lucas Bezerra,
Rik Bhattacharyya,
Paul Brink,
Ray Bunker,
Blas Cabrera,
Robert Calkins,
Robert Cameron,
Concetta Cartaro,
David Cerdeno,
Yen-Yung Chang,
Mouli Chaudhuri,
Ran Chen,
Nicholas Chott
, et al. (106 additional authors not shown)
Abstract:
In this paper, we present a re-analysis of SuperCDMS data using a profile likelihood approach to search for sub-GeV dark matter particles (DM) through two inelastic scattering channels: bremsstrahlung radiation and the Migdal effect. By considering possible inelastic scattering channels, experimental sensitivity can be extended to DM masses that would otherwise be undetectable through the DM-nucle…
▽ More
In this paper, we present a re-analysis of SuperCDMS data using a profile likelihood approach to search for sub-GeV dark matter particles (DM) through two inelastic scattering channels: bremsstrahlung radiation and the Migdal effect. By considering possible inelastic scattering channels, experimental sensitivity can be extended to DM masses that would otherwise be undetectable through the DM-nucleon elastic scattering channel, given the energy threshold of current experiments. We exclude DM masses down to $220~\textrm{MeV}/c^2$ at $2.7 \times 10^{-30}~\textrm{cm}^2$ via the bremsstrahlung channel. The Migdal channel search excludes DM masses down to $30~\textrm{MeV}/c^2$ at $5.0 \times 10^{-30}~\textrm{cm}^2$.
△ Less
Submitted 19 May, 2022; v1 submitted 4 March, 2022;
originally announced March 2022.
-
Interfacing Finite Elements with Deep Neural Operators for Fast Multiscale Modeling of Mechanics Problems
Authors:
Minglang Yin,
Enrui Zhang,
Yue Yu,
George Em Karniadakis
Abstract:
Multiscale modeling is an effective approach for investigating multiphysics systems with largely disparate size features, where models with different resolutions or heterogeneous descriptions are coupled together for predicting the system's response. The solver with lower fidelity (coarse) is responsible for simulating domains with homogeneous features, whereas the expensive high-fidelity (fine) m…
▽ More
Multiscale modeling is an effective approach for investigating multiphysics systems with largely disparate size features, where models with different resolutions or heterogeneous descriptions are coupled together for predicting the system's response. The solver with lower fidelity (coarse) is responsible for simulating domains with homogeneous features, whereas the expensive high-fidelity (fine) model describes microscopic features with refined discretization, often making the overall cost prohibitively high, especially for time-dependent problems. In this work, we explore the idea of multiscale modeling with machine learning and employ DeepONet, a neural operator, as an efficient surrogate of the expensive solver. DeepONet is trained offline using data acquired from the fine solver for learning the underlying and possibly unknown fine-scale dynamics. It is then coupled with standard PDE solvers for predicting the multiscale systems with new boundary/initial conditions in the coupling stage. The proposed framework significantly reduces the computational cost of multiscale simulations since the DeepONet inference cost is negligible, facilitating readily the incorporation of a plurality of interface conditions and coupling schemes. We present various benchmarks to assess accuracy and speedup, and in particular we develop a coupling algorithm for a time-dependent problem, and we also demonstrate coupling of a continuum model (finite element methods, FEM) with a neural operator representation of a particle system (Smoothed Particle Hydrodynamics, SPH) for a uniaxial tension problem with hyperelastic material. What makes this approach unique is that a well-trained over-parametrized DeepONet can generalize well and make predictions at a negligible cost.
△ Less
Submitted 25 February, 2022;
originally announced March 2022.