-
Hibernate Container: A Deflated Container Mode for Fast Startup and High-density Deployment in Serverless Computing
Authors:
Yulin Sun,
Deepak Vij,
Fenge Li,
Wenjian Guo,
Ying Xiong
Abstract:
Serverless computing is a popular cloud computing paradigm, which requires low response latency to handle on-demand user requests. There are two prominent techniques employed for reducing the response latency: keep fully initialized containers alive (Warm Container) or reduce the new container startup (cold start) latency.
This paper presents the 3rd container startup mode: Hibernate Container,…
▽ More
Serverless computing is a popular cloud computing paradigm, which requires low response latency to handle on-demand user requests. There are two prominent techniques employed for reducing the response latency: keep fully initialized containers alive (Warm Container) or reduce the new container startup (cold start) latency.
This paper presents the 3rd container startup mode: Hibernate Container, which starts faster than the cold start container mode and consumes less memory than the Warm Container mode. Hibernate Container is essentially a "deflated" Warm Container. Its application memory is swapped out to disk, the freed memory is reclaimed and file based mmap memory is cleaned-up. The Hibernate Container's deflated memory is inflated in response to user requests. As Hibernate Container's application is fully initialized, its response latency is less than the cold start mode; and as the application memory is deflated, its memory consumption is less than the Warm Container mode. Additionally, when a Hibernate Container is "woken up" to process a request, the Woken-up Container has similar response latency to Warm Container but less memory consumption because not all the deflated memory needs to be inflated. We implemented the Hibernate technique as part of the open source Quark secure container runtime project and our test demonstrated that Hibernate Container consumes about 7\% to 25\% of the Warm Container memory. All of this results in a higher deployment density, lower latency and appreciable improvements in the overall system performance.
△ Less
Submitted 18 May, 2023;
originally announced May 2023.
-
Epitaxial growth and electronic structure of Ruddlesden-Popper nickelates ($ \mathrm{La}_{n+1}\mathrm{Ni}_{n}\mathrm{O}_{3n+1}, n=1-5 $)
Authors:
Zi Li,
Wei Guo,
Tingting Zhang,
Jianhui Song,
Tianyi Gao,
Zhengbin Gu,
Yuefeng Nie
Abstract:
We report the epitaxial growth of Ruddlesden-Popper nickelates, $ \mathrm{La}_{n+1}\mathrm{Ni}_{n}\mathrm{O}_{3n+1} $, with $ n $ up to 5 by reactive molecular beam epitaxy (MBE). X-ray diffractions indicate high crystalline quality of these films and transport measurements show strong dependence on the $ n $ values. Angle-resolved photoemission spectroscopy (ARPES) reveals the electronic structur…
▽ More
We report the epitaxial growth of Ruddlesden-Popper nickelates, $ \mathrm{La}_{n+1}\mathrm{Ni}_{n}\mathrm{O}_{3n+1} $, with $ n $ up to 5 by reactive molecular beam epitaxy (MBE). X-ray diffractions indicate high crystalline quality of these films and transport measurements show strong dependence on the $ n $ values. Angle-resolved photoemission spectroscopy (ARPES) reveals the electronic structure of $ \mathrm{La}_{5}\mathrm{Ni}_{4}\mathrm{O}_{13} $, showing a large hole-like pocket centered around the Brillouin zone corner with a $ (π, π) $ back-folded copy.
△ Less
Submitted 17 May, 2023;
originally announced May 2023.
-
Tensor Network Methods for Extracting CFT Data from Fixed-Point Tensors and Defect Coarse Graining
Authors:
Wenhan Guo,
Tzu-Chieh Wei
Abstract:
We present a comprehensive study on the extraction of CFT data using tensor network methods, specially, from the fixed-point tensor of the linearized tensor renormalization group (lTRG) for the 2D classical Ising model near the critical temperature. Utilizing two different methods, we extract operator scaling dimensions and operator-product-expansion (OPE) coefficients by introducing defects on th…
▽ More
We present a comprehensive study on the extraction of CFT data using tensor network methods, specially, from the fixed-point tensor of the linearized tensor renormalization group (lTRG) for the 2D classical Ising model near the critical temperature. Utilizing two different methods, we extract operator scaling dimensions and operator-product-expansion (OPE) coefficients by introducing defects on the lattice and by employing the fixed-point tensor. We also explore the effects of point-like defects in the lattice on the coarse-graining process. We find that there is a correspondence between coarse-grained defect tensors and conformal states obtained from lTRG fixed-point equation. We also analyze the capabilities and limitations of our proposed coarse-graining scheme for tensor networks with point-like defects, which includes graph independent local truncation (GILT) and higher-order tensor renormalization group (HOTRG). Our results provide a better understanding of the capacity and limitations of the tenor renormalization group scheme in coarse-graining defect tensors, and we show that GILT+HOTRG can be used to give accurate two- and four-point functions under specific conditions. We also find that employing the minimal canonical form further improves the stability of the RG flow.
△ Less
Submitted 4 February, 2024; v1 submitted 16 May, 2023;
originally announced May 2023.
-
Integrating Multiple Sources Knowledge for Class Asymmetry Domain Adaptation Segmentation of Remote Sensing Images
Authors:
Kuiliang Gao,
Anzhu Yu,
Xiong You,
Wenyue Guo,
Ke Li,
Ningbo Huang
Abstract:
In the existing unsupervised domain adaptation (UDA) methods for remote sensing images (RSIs) semantic segmentation, class symmetry is an widely followed ideal assumption, where the source and target RSIs have exactly the same class space. In practice, however, it is often very difficult to find a source RSI with exactly the same classes as the target RSI. More commonly, there are multiple source…
▽ More
In the existing unsupervised domain adaptation (UDA) methods for remote sensing images (RSIs) semantic segmentation, class symmetry is an widely followed ideal assumption, where the source and target RSIs have exactly the same class space. In practice, however, it is often very difficult to find a source RSI with exactly the same classes as the target RSI. More commonly, there are multiple source RSIs available. To this end, a novel class asymmetry RSIs domain adaptation method with multiple sources is proposed in this paper, which consists of four key components. Firstly, a multi-branch segmentation network is built to learn an expert for each source RSI. Secondly, a novel collaborative learning method with the cross-domain mixing strategy is proposed, to supplement the class information for each source while achieving the domain adaptation of each source-target pair. Thirdly, a pseudo-label generation strategy is proposed to effectively combine strengths of different experts, which can be flexibly applied to two cases where the source class union is equal to or includes the target class set. Fourthly, a multiview-enhanced knowledge integration module is developed for the high-level knowledge routing and transfer from multiple domains to target predictions.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
A Nonparametric Mixed-Effects Mixture Model for Patterns of Clinical Measurements Associated with COVID-19
Authors:
Xiaoran Ma,
Wensheng Guo,
Mengyang Gu,
Len Usvyat,
Peter Kotanko,
Yuedong Wang
Abstract:
Some patients with COVID-19 show changes in signs and symptoms such as temperature and oxygen saturation days before being positively tested for SARS-CoV-2, while others remain asymptomatic. It is important to identify these subgroups and to understand what biological and clinical predictors are related to these subgroups. This information will provide insights into how the immune system may respo…
▽ More
Some patients with COVID-19 show changes in signs and symptoms such as temperature and oxygen saturation days before being positively tested for SARS-CoV-2, while others remain asymptomatic. It is important to identify these subgroups and to understand what biological and clinical predictors are related to these subgroups. This information will provide insights into how the immune system may respond differently to infection and can further be used to identify infected individuals. We propose a flexible nonparametric mixed-effects mixture model that identifies risk factors and classifies patients with biological changes. We model the latent probability of biological changes using a logistic regression model and trajectories in the latent groups using smoothing splines. We developed an EM algorithm to maximize the penalized likelihood for estimating all parameters and mean functions. We evaluate our methods by simulations and apply the proposed model to investigate changes in temperature in a cohort of COVID-19-infected hemodialysis patients.
△ Less
Submitted 31 May, 2024; v1 submitted 6 May, 2023;
originally announced May 2023.
-
Pre-training Language Model as a Multi-perspective Course Learner
Authors:
Beiduo Chen,
Shaohan Huang,
Zihan Zhang,
Wu Guo,
Zhenhua Ling,
Haizhen Huang,
Furu Wei,
Weiwei Deng,
Qi Zhang
Abstract:
ELECTRA, the generator-discriminator pre-training framework, has achieved impressive semantic construction capability among various downstream tasks. Despite the convincing performance, ELECTRA still faces the challenges of monotonous training and deficient interaction. Generator with only masked language modeling (MLM) leads to biased learning and label imbalance for discriminator, decreasing lea…
▽ More
ELECTRA, the generator-discriminator pre-training framework, has achieved impressive semantic construction capability among various downstream tasks. Despite the convincing performance, ELECTRA still faces the challenges of monotonous training and deficient interaction. Generator with only masked language modeling (MLM) leads to biased learning and label imbalance for discriminator, decreasing learning efficiency; no explicit feedback loop from discriminator to generator results in the chasm between these two components, underutilizing the course learning. In this study, a multi-perspective course learning (MCL) method is proposed to fetch a many degrees and visual angles for sample-efficient pre-training, and to fully leverage the relationship between generator and discriminator. Concretely, three self-supervision courses are designed to alleviate inherent flaws of MLM and balance the label in a multi-perspective way. Besides, two self-correction courses are proposed to bridge the chasm between the two encoders by creating a "correction notebook" for secondary-supervision. Moreover, a course soups trial is conducted to solve the "tug-of-war" dynamics problem of MCL, evolving a stronger pre-trained model. Experimental results show that our method significantly improves ELECTRA's average performance by 2.8% and 3.2% absolute points respectively on GLUE and SQuAD 2.0 benchmarks, and overshadows recent advanced ELECTRA-style models under the same settings. The pre-trained MCL model is available at https://huggingface.co/McmanusChen/MCL-base.
△ Less
Submitted 6 May, 2023;
originally announced May 2023.
-
From Parse-Execute to Parse-Execute-Refine: Improving Semantic Parser for Complex Question Answering over Knowledge Base
Authors:
Wangzhen Guo,
Linyin Luo,
Hanjiang Lai,
Jian Yin
Abstract:
Parsing questions into executable logical forms has showed impressive results for knowledge-base question answering (KBQA). However, complex KBQA is a more challenging task that requires to perform complex multi-step reasoning. Recently, a new semantic parser called KoPL has been proposed to explicitly model the reasoning processes, which achieved the state-of-the-art on complex KBQA. In this pape…
▽ More
Parsing questions into executable logical forms has showed impressive results for knowledge-base question answering (KBQA). However, complex KBQA is a more challenging task that requires to perform complex multi-step reasoning. Recently, a new semantic parser called KoPL has been proposed to explicitly model the reasoning processes, which achieved the state-of-the-art on complex KBQA. In this paper, we further explore how to unlock the reasoning ability of semantic parsers by a simple proposed parse-execute-refine paradigm. We refine and improve the KoPL parser by demonstrating the executed intermediate reasoning steps to the KBQA model. We show that such simple strategy can significantly improve the ability of complex reasoning. Specifically, we propose three components: a parsing stage, an execution stage and a refinement stage, to enhance the ability of complex reasoning. The parser uses the KoPL to generate the transparent logical forms. Then, the execution stage aligns and executes the logical forms over knowledge base to obtain intermediate reasoning processes. Finally, the intermediate step-by-step reasoning processes are demonstrated to the KBQA model in the refinement stage. With the explicit reasoning processes, it is much easier to answer the complex questions. Experiments on benchmark dataset shows that the proposed PER-KBQA performs significantly better than the stage-of-the-art baselines on the complex KBQA.
△ Less
Submitted 5 May, 2023;
originally announced May 2023.
-
Semi-supervised Road Updating Network (SRUNet): A Deep Learning Method for Road Updating from Remote Sensing Imagery and Historical Vector Maps
Authors:
Xin Chen,
Anzhu Yu,
Qun Sun,
Wenyue Guo,
Qing Xu,
Bowei Wen
Abstract:
A road is the skeleton of a city and is a fundamental and important geographical component. Currently, many countries have built geo-information databases and gathered large amounts of geographic data. However, with the extensive construction of infrastructure and rapid expansion of cities, automatic updating of road data is imperative to maintain the high quality of current basic geographic infor…
▽ More
A road is the skeleton of a city and is a fundamental and important geographical component. Currently, many countries have built geo-information databases and gathered large amounts of geographic data. However, with the extensive construction of infrastructure and rapid expansion of cities, automatic updating of road data is imperative to maintain the high quality of current basic geographic information. However, obtaining bi-phase images for the same area is difficult, and complex post-processing methods are required to update the existing databases.To solve these problems, we proposed a road detection method based on semi-supervised learning (SRUNet) specifically for road-updating applications; in this approach, historical road information was fused with the latest images to directly obtain the latest state of the road.Considering that the texture of a road is complex, a multi-branch network, named the Map Encoding Branch (MEB) was proposed for representation learning, where the Boundary Enhancement Module (BEM) was used to improve the accuracy of boundary prediction, and the Residual Refinement Module (RRM) was used to optimize the prediction results. Further, to fully utilize the limited amount of label information and to enhance the prediction accuracy on unlabeled images, we utilized the mean teacher framework as the basic semi-supervised learning framework and introduced Regional Contrast (ReCo) in our work to improve the model capacity for distinguishing between the characteristics of roads and background elements.We applied our method to two datasets. Our model can effectively improve the performance of a model with fewer labels. Overall, the proposed SRUNet can provide stable, up-to-date, and reliable prediction results for a wide range of road renewal tasks.
△ Less
Submitted 28 April, 2023;
originally announced April 2023.
-
Securing Autonomous Air Traffic Management: Blockchain Networks Driven by Explainable AI
Authors:
Louise Axon,
Dimitrios Panagiotakopoulos,
Samuel Ayo,
Carolina Sanchez-Hernandez,
Yan Zong,
Simon Brown,
Lei Zhang,
Michael Goldsmith,
Sadie Creese,
Weisi Guo
Abstract:
Air Traffic Management data systems today are inefficient and not scalable to enable future unmanned systems. Current data is fragmented, siloed, and not easily accessible. There is data conflict, misuse, and eroding levels of trust in provenance and accuracy. With increased autonomy in aviation, Artificially Intelligent (AI) enabled unmanned traffic management (UTM) will be more reliant on secure…
▽ More
Air Traffic Management data systems today are inefficient and not scalable to enable future unmanned systems. Current data is fragmented, siloed, and not easily accessible. There is data conflict, misuse, and eroding levels of trust in provenance and accuracy. With increased autonomy in aviation, Artificially Intelligent (AI) enabled unmanned traffic management (UTM) will be more reliant on secure data from diverse stakeholders. There is an urgent need to develop a secure network that has trustworthy data chains and works with the requirements generated by UTM. Here, we review existing research in 3 key interconnected areas: (1) blockchain development for secure data transfer between competing aviation stakeholders, (2) self-learning networking architectures that distribute consensus to achieve secure air traffic control, (3) explainable AI to build trust with human stakeholders and backpropagate requirements for blockchain and network optimisation. When connected together, this new digital ecosystem blueprint is tailored for safety critical UTM sectors. We motivate the readers with a case study, where a federated learning UTM uses real air traffic and weather data is secured and explained to human operators. This emerging area still requires significant research and development by the community to ensure it can enable future autonomous air mobility.
△ Less
Submitted 27 April, 2023;
originally announced April 2023.
-
Magnetocaloric effect and its electric-field regulation in CrI$_3$/metal heterostructure
Authors:
Weiwei He,
Ziming Tang,
Qihua Gong,
Min Yi,
Wanlin Guo
Abstract:
The extraordinary properties of a heterostructure by stacking atom-thick van der Waals (vdW) magnets have been extensively studied. However, the magnetocaloric effect (MCE) of heterostructures that are based on monolayer magnets remains to be explored. Herein, we deliberate MCE of vdW heterostructure composed of a monolayer CrI$_3$ and metal atomic layers (Ag, Hf, Au, and Pb). It is revealed that…
▽ More
The extraordinary properties of a heterostructure by stacking atom-thick van der Waals (vdW) magnets have been extensively studied. However, the magnetocaloric effect (MCE) of heterostructures that are based on monolayer magnets remains to be explored. Herein, we deliberate MCE of vdW heterostructure composed of a monolayer CrI$_3$ and metal atomic layers (Ag, Hf, Au, and Pb). It is revealed that heterostructure engineering by introducing metal substrate can improve MCE of CrI$_3$, particularly boosting relative cooling power to 471.72 $μ$Jm$^{-2}$ and adiabatic temperature change to 2.1 K at 5 T for CrI$_3$/Hf. This improved MCE is ascribed to the enhancement of magnetic moment and intralayer exchange coupling in CrI$_3$ due to the CrI$_3$/metal heterointerface induced charge transfer. Electric field is further found to tune MCE of CrI$_3$ in heterostructures and could shift the peak temperature by around 10 K in CrI$_3$/Hf, thus manipulating the working temperature window of MCE. The discovered electric-field and substrate regulated MCE in CrI$_3$/metal heterostructure opens new avenues for low-dimensional magnetic refrigeration.
△ Less
Submitted 25 April, 2023;
originally announced April 2023.
-
TCR: Short Video Title Generation and Cover Selection with Attention Refinement
Authors:
Yakun Yu,
Jiuding Yang,
Weidong Guo,
Hui Liu,
Yu Xu,
Di Niu
Abstract:
With the widespread popularity of user-generated short videos, it becomes increasingly challenging for content creators to promote their content to potential viewers. Automatically generating appealing titles and covers for short videos can help grab viewers' attention. Existing studies on video captioning mostly focus on generating factual descriptions of actions, which do not conform to video ti…
▽ More
With the widespread popularity of user-generated short videos, it becomes increasingly challenging for content creators to promote their content to potential viewers. Automatically generating appealing titles and covers for short videos can help grab viewers' attention. Existing studies on video captioning mostly focus on generating factual descriptions of actions, which do not conform to video titles intended for catching viewer attention. Furthermore, research for cover selection based on multimodal information is sparse. These problems motivate the need for tailored methods to specifically support the joint task of short video title generation and cover selection (TG-CS) as well as the demand for creating corresponding datasets to support the studies. In this paper, we first collect and present a real-world dataset named Short Video Title Generation (SVTG) that contains videos with appealing titles and covers. We then propose a Title generation and Cover selection with attention Refinement (TCR) method for TG-CS. The refinement procedure progressively selects high-quality samples and highly relevant frames and text tokens within each sample to refine model training. Extensive experiments show that our TCR method is superior to various existing video captioning methods in generating titles and is able to select better covers for noisy real-world short videos.
△ Less
Submitted 25 April, 2023;
originally announced April 2023.
-
Machine learning for predicting fatigue properties of additively manufactured materials
Authors:
Min Yi,
Ming Xue,
Peihong Cong,
Yang Song,
Haiyang Zhang,
Lingfeng Wang,
Liucheng Zhou,
Yinghong Li,
Wanlin Guo
Abstract:
Fatigue properties of additively manufactured (AM) materials depend on many factors such as AM processing parameter, microstructure, residual stress, surface roughness, porosities, post-treatments, etc. Their evaluation inevitably requires these factors combined as many as possible, thus resulting in low efficiency and high cost. In recent years, their assessment by leveraging the power of machine…
▽ More
Fatigue properties of additively manufactured (AM) materials depend on many factors such as AM processing parameter, microstructure, residual stress, surface roughness, porosities, post-treatments, etc. Their evaluation inevitably requires these factors combined as many as possible, thus resulting in low efficiency and high cost. In recent years, their assessment by leveraging the power of machine learning (ML) has gained increasing attentions. Here, we present a comprehensive overview on the state-of-the-art progress of applying ML strategies to predict fatigue properties of AM materials, as well as their dependence on AM processing and post-processing parameters such as laser power, scanning speed, layer height, hatch distance, built direction, post-heat temperature, etc. A few attempts in employing feedforward neural network (FNN), convolutional neural network (CNN), adaptive network-based fuzzy system (ANFS), support vector machine (SVM) and random forest (RF) to predict fatigue life and RF to predict fatigue crack growth rate are summarized. The ML models for predicting AM materials' fatigue properties are found intrinsically similar to the commonly used ones, but are modified to involve AM features. Finally, an outlook for challenges (i.e., small dataset, multifarious features, overfitting, low interpretability, unable extension from AM material data to structure life) and potential solutions for the ML prediction of AM materials' fatigue properties is provided.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
Characteristic modes of thick brane model: resonances and quasinormal modes
Authors:
Qin Tan,
Wen-Di Guo,
Yu-Peng Zhang,
Yu-Xiao Liu
Abstract:
In this work, we investigate the gravitational quasinormal modes (QNMs) and the gravitational resonances of a thick brane model. We use the asymptotic iteration and shooting methods to obtain the quasinormal frequencies (QNFs) of the brane. On the other hand, we investigate the resonances and their evolution numerically. The results show that the oscillations of the resonances equal (up to numeric…
▽ More
In this work, we investigate the gravitational quasinormal modes (QNMs) and the gravitational resonances of a thick brane model. We use the asymptotic iteration and shooting methods to obtain the quasinormal frequencies (QNFs) of the brane. On the other hand, we investigate the resonances and their evolution numerically. The results show that the oscillations of the resonances equal (up to numerical error) to the real parts of the QNFs, while the dam** rates of the resonances equal to the imaginary parts of the QNFs. The QNMs and resonances, both of them can be regarded as the characteristic modes of the thick brane, are closely related with each other. In addition, the lifetime of these QNMs could be very long, perhaps they might be detected in future accelerator or gravitational wave detector.
△ Less
Submitted 20 February, 2024; v1 submitted 18 April, 2023;
originally announced April 2023.
-
CodeKGC: Code Language Model for Generative Knowledge Graph Construction
Authors:
Zhen Bi,
**g Chen,
Yinuo Jiang,
Feiyu Xiong,
Wei Guo,
Huajun Chen,
Ningyu Zhang
Abstract:
Current generative knowledge graph construction approaches usually fail to capture structural knowledge by simply flattening natural language into serialized texts or a specification language. However, large generative language model trained on structured data such as code has demonstrated impressive capability in understanding natural language for structural prediction and reasoning tasks. Intuit…
▽ More
Current generative knowledge graph construction approaches usually fail to capture structural knowledge by simply flattening natural language into serialized texts or a specification language. However, large generative language model trained on structured data such as code has demonstrated impressive capability in understanding natural language for structural prediction and reasoning tasks. Intuitively, we address the task of generative knowledge graph construction with code language model: given a code-format natural language input, the target is to generate triples which can be represented as code completion tasks. Specifically, we develop schema-aware prompts that effectively utilize the semantic structure within the knowledge graph. As code inherently possesses structure, such as class and function definitions, it serves as a useful model for prior semantic structural knowledge. Furthermore, we employ a rationale-enhanced generation method to boost the performance. Rationales provide intermediate steps, thereby improving knowledge extraction abilities. Experimental results indicate that the proposed approach can obtain better performance on benchmark datasets compared with baselines. Code and datasets are available in https://github.com/zjunlp/DeepKE/tree/main/example/llm.
△ Less
Submitted 18 January, 2024; v1 submitted 18 April, 2023;
originally announced April 2023.
-
AGNN: Alternating Graph-Regularized Neural Networks to Alleviate Over-Smoothing
Authors:
Zhaoliang Chen,
Zhihao Wu,
Zhenghong Lin,
Shi** Wang,
Claudia Plant,
Wenzhong Guo
Abstract:
Graph Convolutional Network (GCN) with the powerful capacity to explore graph-structural data has gained noticeable success in recent years. Nonetheless, most of the existing GCN-based models suffer from the notorious over-smoothing issue, owing to which shallow networks are extensively adopted. This may be problematic for complex graph datasets because a deeper GCN should be beneficial to propaga…
▽ More
Graph Convolutional Network (GCN) with the powerful capacity to explore graph-structural data has gained noticeable success in recent years. Nonetheless, most of the existing GCN-based models suffer from the notorious over-smoothing issue, owing to which shallow networks are extensively adopted. This may be problematic for complex graph datasets because a deeper GCN should be beneficial to propagating information across remote neighbors. Recent works have devoted effort to addressing over-smoothing problems, including establishing residual connection structure or fusing predictions from multi-layer models. Because of the indistinguishable embeddings from deep layers, it is reasonable to generate more reliable predictions before conducting the combination of outputs from various layers. In light of this, we propose an Alternating Graph-regularized Neural Network (AGNN) composed of Graph Convolutional Layer (GCL) and Graph Embedding Layer (GEL). GEL is derived from the graph-regularized optimization containing Laplacian embedding term, which can alleviate the over-smoothing problem by periodic projection from the low-order feature space onto the high-order space. With more distinguishable features of distinct layers, an improved Adaboost strategy is utilized to aggregate outputs from each layer, which explores integrated embeddings of multi-hop neighbors. The proposed model is evaluated via a large number of experiments including performance comparison with some multi-layer or multi-order graph neural networks, which reveals the superior performance improvement of AGNN compared with state-of-the-art models.
△ Less
Submitted 14 April, 2023;
originally announced April 2023.
-
Axial gravitational quasinormal modes of a self-dual black hole in loop quantum gravity
Authors:
Sen Yang,
Wen-Di Guo,
Qin Tan,
Yu-Xiao Liu
Abstract:
We study the axial gravitational quasinormal modes of a self-dual black hole in loop quantum gravity. Considering the axial perturbation of the background spacetime, we obtain the Schrödinger-like master equation. Then we calculate the quasinormal frequencies with the Wentzel-Kramers-Brillouin approximation and the asymptotic iteration method. We also investigate the numerical evolution of an init…
▽ More
We study the axial gravitational quasinormal modes of a self-dual black hole in loop quantum gravity. Considering the axial perturbation of the background spacetime, we obtain the Schrödinger-like master equation. Then we calculate the quasinormal frequencies with the Wentzel-Kramers-Brillouin approximation and the asymptotic iteration method. We also investigate the numerical evolution of an initial wave packet on the self-dual black hole spacetime. We find the quantum correction parameter $P$ positively affects the absolute values of both the real and imaginary parts of quasinormal frequencies. We derive the relation between the parameters of the circular null geodesics and quasinormal frequencies in the eikonal limit for the self-dual black hole, and numerically verify this relation.
△ Less
Submitted 5 May, 2023; v1 submitted 13 April, 2023;
originally announced April 2023.
-
Attributed Multi-order Graph Convolutional Network for Heterogeneous Graphs
Authors:
Zhaoliang Chen,
Zhihao Wu,
Luying Zhong,
Claudia Plant,
Shi** Wang,
Wenzhong Guo
Abstract:
Heterogeneous graph neural networks aim to discover discriminative node embeddings and relations from multi-relational networks.One challenge of heterogeneous graph learning is the design of learnable meta-paths, which significantly influences the quality of learned embeddings.Thus, in this paper, we propose an Attributed Multi-Order Graph Convolutional Network (AMOGCN), which automatically studie…
▽ More
Heterogeneous graph neural networks aim to discover discriminative node embeddings and relations from multi-relational networks.One challenge of heterogeneous graph learning is the design of learnable meta-paths, which significantly influences the quality of learned embeddings.Thus, in this paper, we propose an Attributed Multi-Order Graph Convolutional Network (AMOGCN), which automatically studies meta-paths containing multi-hop neighbors from an adaptive aggregation of multi-order adjacency matrices. The proposed model first builds different orders of adjacency matrices from manually designed node connections. After that, an intact multi-order adjacency matrix is attached from the automatic fusion of various orders of adjacency matrices. This process is supervised by the node semantic information, which is extracted from the node homophily evaluated by attributes. Eventually, we utilize a one-layer simplifying graph convolutional network with the learned multi-order adjacency matrix, which is equivalent to the cross-hop node information propagation with multi-layer graph neural networks. Substantial experiments reveal that AMOGCN gains superior semi-supervised classification performance compared with state-of-the-art competitors.
△ Less
Submitted 18 April, 2023; v1 submitted 13 April, 2023;
originally announced April 2023.
-
CSST WL preparation I: forecast the impact from non-Gaussian covariances and requirements on systematics-control
Authors:
Ji Yao,
Huanyuan Shan,
Ran Li,
Youhua Xu,
Dongwei Fan,
Dezi Liu,
Pengjie Zhang,
Yu Yu,
Chengliang Wei,
Bin Hu,
Nan Li,
Zuhui Fan,
Haojie Xu,
Wuzheng Guo
Abstract:
The precise estimation of the statistical errors and accurate removal of the systematical errors are the two major challenges for the stage IV cosmic shear surveys. We explore their impact for the China Space-Station Telescope (CSST) with survey area $\sim17,500°^2$ up to redshift $\sim4$. We consider statistical error contributed from Gaussian covariance, connected non-Gaussian covariance and sup…
▽ More
The precise estimation of the statistical errors and accurate removal of the systematical errors are the two major challenges for the stage IV cosmic shear surveys. We explore their impact for the China Space-Station Telescope (CSST) with survey area $\sim17,500°^2$ up to redshift $\sim4$. We consider statistical error contributed from Gaussian covariance, connected non-Gaussian covariance and super-sample covariance. We find the non-Gaussian covariances, which is dominated by the super-sample covariance, can largely reduce the signal-to-noise of the two-point statistics for CSST, leading to a $\sim1/3$ loss in the figure-of-merit for the matter clustering properties ($σ_8-Ω_m$ plane) and $1/6$ in the dark energy equation-of-state ($w_0-w_a$ plane). We further put requirements of systematics-mitigation on: intrinsic alignment of galaxies, baryonic feedback, shear multiplicative bias, and bias in the redshift distribution, for an unbiased cosmology. The $10^{-2}$ to $10^{-3}$ level requirements emphasize strong needs in related studies, to support future model selections and the associated priors for the nuisance parameters.
△ Less
Submitted 16 November, 2023; v1 submitted 10 April, 2023;
originally announced April 2023.
-
Monolayer polar metals with large piezoelectricity derived from MoSi$_2$N$_4$
Authors:
Yan Yin,
Qihua Gong,
Min Yi,
Wanlin Guo
Abstract:
The advancement of two-dimensional polar metals tends to be limited by the incompatibility between electric polarity and metallicity as well as dimension reduction. Here, we report polar and metallic Janus monolayers of MoSi$_2$N$_4$ family by breaking the out-of-plane (OOP) structural symmetry through Z (P/As) substitution of N. Despite the semiconducting nature of MoSi$_2$X$_4$ (X=N/P/As), four…
▽ More
The advancement of two-dimensional polar metals tends to be limited by the incompatibility between electric polarity and metallicity as well as dimension reduction. Here, we report polar and metallic Janus monolayers of MoSi$_2$N$_4$ family by breaking the out-of-plane (OOP) structural symmetry through Z (P/As) substitution of N. Despite the semiconducting nature of MoSi$_2$X$_4$ (X=N/P/As), four Janus MoSi$_2$N$_{x}$Z$_{4-x}$ monolayers are found to be polar metals owing to the weak coupling between the conducting electrons and electric polarity. The metallicity is originated from the Z substitution induced delocalization of occupied electrons in Mo-d orbitals. The OOP electric polarizations around 10$-$203 pC/m are determined by the asymmetric OOP charge distribution due to the non-centrosymmetric Janus structure. The corresponding OOP piezoelectricity is further revealed as high as 39$-$153 pC/m and 0.10$-$0.31 pm/V for piezoelectric strain and stress coefficients, respectively. The results demonstrate polar metallicity and high OOP piezoelectricity in Janus MoSi$_2$N$_{x}$Z$_{4-x}$ monolayers and open new vistas for exploiting unusual coexisting properties in monolayers derived from MoSi$_2$N$_4$ family.
△ Less
Submitted 11 June, 2023; v1 submitted 9 April, 2023;
originally announced April 2023.
-
Role of electrodes in study of hydrovoltaic effects
Authors:
Chunxiao Zheng,
Sunmiao Fang,
Weicun Chu,
** Tan,
Bingkun Tian,
Xiaofeng Jiang,
Wanlin Guo
Abstract:
The last decade has witnessed the emergence of hydrovoltaic technology, which can harvest electricity from different forms of water movement, such as raindrops, waves, flows, moisture, and natural evaporation. In particular, the evaporation-induced hydrovoltaic effect received great attention since its discovery in 2017 due to its negative heat emission property. Nevertheless, the influence of ele…
▽ More
The last decade has witnessed the emergence of hydrovoltaic technology, which can harvest electricity from different forms of water movement, such as raindrops, waves, flows, moisture, and natural evaporation. In particular, the evaporation-induced hydrovoltaic effect received great attention since its discovery in 2017 due to its negative heat emission property. Nevertheless, the influence of electrode reactions in evaporation-induced power generation is not negligible due to the chemical reaction between active metal electrodes and water, which leads to " exceptional " power generation. Herein, we designed a series of experiments based on air-laid paper devices with electrodes of different activities as the top and bottom electrodes. To verify the contribution of electrodes, we compared the output performance of different electrode combinations when the device is partially-wetted and fully-wetted. The device hydrophilicity, salt concentration, and acidity or basicity of solutions are also comprehensively investigated. It is demonstrated that the chemical reaction of active metals (Zn, Cu, Ag, etc.) with different aqueous solutions can generate considerable electrical energy and significantly distort the device performance, especially for Zn electrodes with an output voltage from ~1.26 to ~1.52 V and current from ~1.24 to ~75.69 μA. To promote the long-term development of hydrovoltaic technology, we recommend use of inert electrodes in hydrovoltaic studies, such as Au and Pt, especially in water and moisture environment.
△ Less
Submitted 7 April, 2023;
originally announced April 2023.
-
Effective model for superconductivity in magic-angle graphene
Authors:
Disha Hou,
Yuhai Liu,
Toshihiro Sato,
Fakher F. Assaad,
Wenan Guo,
Zhenjiu Wang
Abstract:
We carry out large-scale quantum Monte Carlo simulations of a candidate field theory for the onset of superconductivity in magic-angle twisted bilayer graphene. The correlated insulating state at charge neutrality spontaneously breaks U(1) Moiré valley symmetry. Owing to the topological nature of the bands, skyrmion defects of the order parameter carry charge $2e$ and condense upon do**. In our…
▽ More
We carry out large-scale quantum Monte Carlo simulations of a candidate field theory for the onset of superconductivity in magic-angle twisted bilayer graphene. The correlated insulating state at charge neutrality spontaneously breaks U(1) Moiré valley symmetry. Owing to the topological nature of the bands, skyrmion defects of the order parameter carry charge $2e$ and condense upon do**. In our calculations we encode the U(1) symmetry by an internal degree of freedom such that it is not broken upon lattice regularization. Furthermore, the skyrmion carries the same charge. The nature of the do**-induced phase transitions depends on the strength of the easy-plane anisotropy that reduces the SU(2) valley symmetry to U(1) $\times \mathbb{Z}_2 $. For large anisotropy, we observe two distinct transitions separated by phase coexistence. While the insulator to superconducting transition is of mean-field character, the U(1) transition is consistent with three-dimensional XY criticality. Hence, the coupling between the gapless charge excitations of the superconducting phase and the XY order parameter is irrelevant. At small anisotropy, we observe a first-order transition characterized by phase separation.
△ Less
Submitted 3 May, 2023; v1 submitted 5 April, 2023;
originally announced April 2023.
-
Properties and Potential Applications of Random Functional-Linked Types of Neural Networks
Authors:
Guang-Yong Chen,
Yong-Hang Yu,
Min Gan,
C. L. Philip Chen,
Wenzhong Guo
Abstract:
Random functional-linked types of neural networks (RFLNNs), e.g., the extreme learning machine (ELM) and broad learning system (BLS), which avoid suffering from a time-consuming training process, offer an alternative way of learning in deep structure. The RFLNNs have achieved excellent performance in various classification and regression tasks, however, the properties and explanations of these net…
▽ More
Random functional-linked types of neural networks (RFLNNs), e.g., the extreme learning machine (ELM) and broad learning system (BLS), which avoid suffering from a time-consuming training process, offer an alternative way of learning in deep structure. The RFLNNs have achieved excellent performance in various classification and regression tasks, however, the properties and explanations of these networks are ignored in previous research. This paper gives some insights into the properties of RFLNNs from the viewpoints of frequency domain, and discovers the presence of frequency principle in these networks, that is, they preferentially capture low-frequencies quickly and then fit the high frequency components during the training process. These findings are valuable for understanding the RFLNNs and expanding their applications. Guided by the frequency principle, we propose a method to generate a BLS network with better performance, and design an efficient algorithm for solving Poison's equation in view of the different frequency principle presenting in the Jacobi iterative method and BLS network.
△ Less
Submitted 3 April, 2023;
originally announced April 2023.
-
Giant magnetocaloric effect in magnets down to the monolayer limit
Authors:
Weiwei He,
Yan Yin,
Qihua Gong,
Richard F. L. Evans,
Oliver Gutfleisch,
Baixiang Xu,
Min Yi,
Wanlin Guo
Abstract:
Two-dimensional magnets could potentially revolutionize information technology, but their potential application to cooling technology and magnetocaloric effect (MCE) in a material down to the monolayer limit remain unexplored. Herein, we reveal through multiscale calculations the existence of giant MCE and its strain tunability in monolayer magnets such as CrX$_3$ (X=F, Cl, Br, I), CrAX (A=O, S, S…
▽ More
Two-dimensional magnets could potentially revolutionize information technology, but their potential application to cooling technology and magnetocaloric effect (MCE) in a material down to the monolayer limit remain unexplored. Herein, we reveal through multiscale calculations the existence of giant MCE and its strain tunability in monolayer magnets such as CrX$_3$ (X=F, Cl, Br, I), CrAX (A=O, S, Se; X=F, Cl, Br, I), and Fe$_3$GeTe$_2$. The maximum adiabatic temperature change ($ΔT_\text{ad}^\text{max}$), maximum isothermal magnetic entropy change, and specific cooling power in monolayer CrF$_3$ are found as high as 11 K, 35 $μ$Jm$^{-2}$K$^{-1}$, and 3.5 nWcm$^{-2}$ under a magnetic field of 5 T, respectively. A 2% biaxial and 5% $a$-axis uniaxial compressive strain can remarkably increase $ΔT_\text{ad}^\text{max}$ of CrCl$_3$ and CrOF by 230% and 37% (up to 15.3 and 6.0 K), respectively. It is found that large net magnetic moment per unit area favors improved MCE. These findings advocate the giant-MCE monolayer magnets, opening new opportunities for magnetic cooling at nanoscale.
△ Less
Submitted 28 March, 2023;
originally announced March 2023.
-
Probing Complex-energy Topology via Non-Hermitian Absorption Spectroscopy in a Trapped Ion Simulator
Authors:
Mingming Cao,
Kai Li,
Wending Zhao,
Weixuan Guo,
Bingxiag Qi,
Xiuying Chang,
Zichao Zhou,
Yong Xu,
Luming Duan
Abstract:
Non-Hermitian systems generically have complex energies, which may host topological structures, such as links or knots. While there has been great progress in experimentally engineering non-Hermitian models in quantum simulators, it remains a significant challenge to experimentally probe complex energies in these systems, thereby making it difficult to directly diagnose complex-energy topology. He…
▽ More
Non-Hermitian systems generically have complex energies, which may host topological structures, such as links or knots. While there has been great progress in experimentally engineering non-Hermitian models in quantum simulators, it remains a significant challenge to experimentally probe complex energies in these systems, thereby making it difficult to directly diagnose complex-energy topology. Here, we experimentally realize a two-band non-Hermitian model with a single trapped ion whose complex eigenenergies exhibit the unlink, unknot or Hopf link topological structures. Based on non-Hermitian absorption spectroscopy, we couple one system level to an auxiliary level through a laser beam and then experimentally measure the population of the ion on the auxiliary level after a long period of time. Complex eigenenergies are then extracted, illustrating the unlink, unknot or Hopf link topological structure. Our work demonstrates that complex energies can be experimentally measured in quantum simulators via non-Hermitian absorption spectroscopy, thereby opening the door for exploring various complex-energy properties in non-Hermitian quantum systems, such as trapped ions, cold atoms, superconducting circuits or solid-state spin systems.
△ Less
Submitted 27 March, 2023;
originally announced March 2023.
-
GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation
Authors:
Ji Qi,
Jifan Yu,
Teng Tu,
Kunyu Gao,
Yifan Xu,
Xinyu Guan,
Xiaozhi Wang,
Yuxiao Dong,
Bin Xu,
Lei Hou,
Juanzi Li,
Jie Tang,
Weidong Guo,
Hui Liu,
Yu Xu
Abstract:
Despite the recent emergence of video captioning models, how to generate vivid, fine-grained video descriptions based on the background knowledge (i.e., long and informative commentary about the domain-specific scenes with appropriate reasoning) is still far from being solved, which however has great applications such as automatic sports narrative. In this paper, we present GOAL, a benchmark of ov…
▽ More
Despite the recent emergence of video captioning models, how to generate vivid, fine-grained video descriptions based on the background knowledge (i.e., long and informative commentary about the domain-specific scenes with appropriate reasoning) is still far from being solved, which however has great applications such as automatic sports narrative. In this paper, we present GOAL, a benchmark of over 8.9k soccer video clips, 22k sentences, and 42k knowledge triples for proposing a challenging new task setting as Knowledge-grounded Video Captioning (KGVC). Moreover, we conduct experimental adaption of existing methods to show the difficulty and potential directions for solving this valuable and applicable task. Our data and code are available at https://github.com/THU-KEG/goal.
△ Less
Submitted 5 October, 2023; v1 submitted 26 March, 2023;
originally announced March 2023.
-
$SL(2,R)\times U(1)$ symmetry and quasinormal modes in the self-dual warped AdS black hole
Authors:
Yuan Chen,
Wei Guo,
Kai Shi,
Hongbao Zhang
Abstract:
The algebraic approach to the spectrum of quasinormal modes has been made as simple as possible for the BTZ black hole by the strategy developed in \cite{Zhang}. By working with the self-dual warped AdS black hole, we demonstrate in an explicit way that such a strategy can be well adapted to those warped AdS balck holes with the $SL(2,R)\times U(1)$ isometry. To this end, we first introduce two as…
▽ More
The algebraic approach to the spectrum of quasinormal modes has been made as simple as possible for the BTZ black hole by the strategy developed in \cite{Zhang}. By working with the self-dual warped AdS black hole, we demonstrate in an explicit way that such a strategy can be well adapted to those warped AdS balck holes with the $SL(2,R)\times U(1)$ isometry. To this end, we first introduce two associated tensor fields with the quadratic Casimir of $SL(2,R)\times U(1)$ Lie algebra in the self-dual warped AdS black hole and show that they correspond essentially to the metric and volume element up to a constant prefactor, respectively. Then without appealing to any concrete coordinate system, we can further show that the solutions to the equations of motion for the scalar, vector, spinor fields all fall into the representations of the $SL(2,R)\times U(1)$ Lie algebra by a purely abstract tensor and spinor analysis. Accordingly, the corresponding spectrum of quasinormal modes for each fixed azimuthal quantum number can be derived algebraically as the infinite tower of descendants of the highest weight mode of the $SL(2,R)$ Lie subalgebra.
△ Less
Submitted 31 May, 2023; v1 submitted 21 March, 2023;
originally announced March 2023.
-
Metric Search for Rank List Compatibility Matching with Applications
Authors:
Wenqi Guo,
Jeffrey Uhlmann
Abstract:
As online dating has become more popular in the past few years, an efficient and effective algorithm to match users is needed. In this project, we proposed a new dating matching algorithm that uses Kendall-Tau distance to measure the similarity between users based on their ranking for items in a list. (e.g., their favourite sports, music, etc.) To increase the performance of the search process, we…
▽ More
As online dating has become more popular in the past few years, an efficient and effective algorithm to match users is needed. In this project, we proposed a new dating matching algorithm that uses Kendall-Tau distance to measure the similarity between users based on their ranking for items in a list. (e.g., their favourite sports, music, etc.) To increase the performance of the search process, we applied a tree-based searching structure, Cascading Metric Tree (CMT), on this metric. The tree is built on ranked lists from all the users; when a query target and a radius are provided, our algorithm can return users within the radius of the target. We tested the scaling of this searching method on a synthetic dataset by varying list length, population size, and query radius. We observed that the algorithm is able to query the best matching people for the user in a practical time, given reasonable parameters. We also provided potential future improvements that can be made to this algorithm based on the limitations. Finally, we offered more use cases of this search structure on Kendall-Tau distance and new insight into real-world applications of distance search structures.
△ Less
Submitted 10 August, 2023; v1 submitted 13 March, 2023;
originally announced March 2023.
-
Extraordinary surface critical behavior induced by symmetry-protected topological state of a two-dimensional quantum magnet
Authors:
Zhe Wang,
Fan Zhang,
Wenan Guo
Abstract:
Using Quantum Monte Carlo simulations, we study spin-1/2 diagonal ladders coupled by ferromagnetic Heisenberg interactions. The model can also be viewed as usual ladders with ferromagnetic rung couplings coupled by antiferromagnetic diagonal couplings. We find that the model hosts a striped magnetic ordered phase and two topological nontrivial Haldane phases, separated by two quantum critical poin…
▽ More
Using Quantum Monte Carlo simulations, we study spin-1/2 diagonal ladders coupled by ferromagnetic Heisenberg interactions. The model can also be viewed as usual ladders with ferromagnetic rung couplings coupled by antiferromagnetic diagonal couplings. We find that the model hosts a striped magnetic ordered phase and two topological nontrivial Haldane phases, separated by two quantum critical points. We show that the two quantum critical points are all in the three-dimensional O(3) universality class irrelevant to the topological properties of the Haldane phases. The properties of the surface formed by ladder ends in the two Haldane phases are studied. We find that the surface states are both gapless due to the symmetry-protected topological bulk states. We further demonstrate that extraordinary surface critical behaviors are realized at both critical points on such gapless surfaces without enhancing the surface coupling. Notably, the surface is not expected to be ordered in the three-dimensional classical O(3) critical point, suggesting that the topological properties of the Haldane phases are responsible for such surface critical behavior.
△ Less
Submitted 19 March, 2023;
originally announced March 2023.
-
The JUNO experiment Top Tracker
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato
, et al. (592 additional authors not shown)
Abstract:
The main task of the Top Tracker detector of the neutrino reactor experiment Jiangmen Underground Neutrino Observatory (JUNO) is to reconstruct and extrapolate atmospheric muon tracks down to the central detector. This muon tracker will help to evaluate the contribution of the cosmogenic background to the signal. The Top Tracker is located above JUNO's water Cherenkov Detector and Central Detector…
▽ More
The main task of the Top Tracker detector of the neutrino reactor experiment Jiangmen Underground Neutrino Observatory (JUNO) is to reconstruct and extrapolate atmospheric muon tracks down to the central detector. This muon tracker will help to evaluate the contribution of the cosmogenic background to the signal. The Top Tracker is located above JUNO's water Cherenkov Detector and Central Detector, covering about 60% of the surface above them. The JUNO Top Tracker is constituted by the decommissioned OPERA experiment Target Tracker modules. The technology used consists in walls of two planes of plastic scintillator strips, one per transverse direction. Wavelength shifting fibres collect the light signal emitted by the scintillator strips and guide it to both ends where it is read by multianode photomultiplier tubes. Compared to the OPERA Target Tracker, the JUNO Top Tracker uses new electronics able to cope with the high rate produced by the high rock radioactivity compared to the one in Gran Sasso underground laboratory. This paper will present the new electronics and mechanical structure developed for the Top Tracker of JUNO along with its expected performance based on the current detector simulation.
△ Less
Submitted 9 March, 2023;
originally announced March 2023.
-
JUNO sensitivity to $^7$Be, $pep$, and CNO solar neutrinos
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta
, et al. (592 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented…
▽ More
The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented levels of precision. In this paper, we provide estimation of the JUNO sensitivity to 7Be, pep, and CNO solar neutrinos that can be obtained via a spectral analysis above the 0.45 MeV threshold. This study is performed assuming different scenarios of the liquid scintillator radiopurity, ranging from the most opti mistic one corresponding to the radiopurity levels obtained by the Borexino experiment, up to the minimum requirements needed to perform the neutrino mass ordering determination with reactor antineutrinos - the main goal of JUNO. Our study shows that in most scenarios, JUNO will be able to improve the current best measurements on 7Be, pep, and CNO solar neutrino fluxes. We also perform a study on the JUNO capability to detect periodical time variations in the solar neutrino flux, such as the day-night modulation induced by neutrino flavor regeneration in Earth, and the modulations induced by temperature changes driven by helioseismic waves.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Compressed Interaction Graph based Framework for Multi-behavior Recommendation
Authors:
Wei Guo,
Chang Meng,
Enming Yuan,
Zhicheng He,
Huifeng Guo,
Yingxue Zhang,
Bo Chen,
Yaochen Hu,
Ruiming Tang,
Xiu Li,
Rui Zhang
Abstract:
Multi-types of user behavior data (e.g., clicking, adding to cart, and purchasing) are recorded in most real-world recommendation scenarios, which can help to learn users' multi-faceted preferences. However, it is challenging to explore multi-behavior data due to the unbalanced data distribution and sparse target behavior, which lead to the inadequate modeling of high-order relations when treating…
▽ More
Multi-types of user behavior data (e.g., clicking, adding to cart, and purchasing) are recorded in most real-world recommendation scenarios, which can help to learn users' multi-faceted preferences. However, it is challenging to explore multi-behavior data due to the unbalanced data distribution and sparse target behavior, which lead to the inadequate modeling of high-order relations when treating multi-behavior data ''as features'' and gradient conflict in multitask learning when treating multi-behavior data ''as labels''. In this paper, we propose CIGF, a Compressed Interaction Graph based Framework, to overcome the above limitations. Specifically, we design a novel Compressed Interaction Graph Convolution Network (CIGCN) to model instance-level high-order relations explicitly. To alleviate the potential gradient conflict when treating multi-behavior data ''as labels'', we propose a Multi-Expert with Separate Input (MESI) network with separate input on the top of CIGCN for multi-task learning. Comprehensive experiments on three large-scale real-world datasets demonstrate the superiority of CIGF. Ablation studies and in-depth analysis further validate the effectiveness of our proposed model in capturing high-order relations and alleviating gradient conflict. The source code and datasets are available at https://github.com/MC-CV/CIGF.
△ Less
Submitted 4 March, 2023;
originally announced March 2023.
-
Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition
Authors:
Zhijie Shen,
Wu Guo,
Bin Gu
Abstract:
In this paper, we propose a language-universal adapter learning framework based on a pre-trained model for end-to-end multilingual automatic speech recognition (ASR). For acoustic modeling, the wav2vec 2.0 pre-trained model is fine-tuned by inserting language-specific and language-universal adapters. An online knowledge distillation is then used to enable the language-universal adapters to learn b…
▽ More
In this paper, we propose a language-universal adapter learning framework based on a pre-trained model for end-to-end multilingual automatic speech recognition (ASR). For acoustic modeling, the wav2vec 2.0 pre-trained model is fine-tuned by inserting language-specific and language-universal adapters. An online knowledge distillation is then used to enable the language-universal adapters to learn both language-specific and universal features. The linguistic information confusion is also reduced by leveraging language identifiers (LIDs). With LIDs we perform a position-wise modification on the multi-head attention outputs. In the inference procedure, the language-specific adapters are removed while the language-universal adapters are kept activated. The proposed method improves the recognition accuracy and addresses the linear increase of the number of adapters' parameters with the number of languages in common multilingual ASR systems. Experiments on the BABEL dataset confirm the effectiveness of the proposed framework. Compared to the conventional multilingual model, a 3.3% absolute error rate reduction is achieved. The code is available at: https://github.com/shen9712/UniversalAdapterLearning.
△ Less
Submitted 28 February, 2023;
originally announced March 2023.
-
Securing IoT Communication using Physical Sensor Data -- Graph Layer Security with Federated Multi-Agent Deep Reinforcement Learning
Authors:
Liang Wang,
Zhuangkun Wei,
Weisi Guo
Abstract:
Internet-of-Things (IoT) devices are often used to transmit physical sensor data over digital wireless channels. Traditional Physical Layer Security (PLS)-based cryptography approaches rely on accurate channel estimation and information exchange for key generation, which irrevocably ties key quality with digital channel estimation quality. Recently, we proposed a new concept called Graph Layer Sec…
▽ More
Internet-of-Things (IoT) devices are often used to transmit physical sensor data over digital wireless channels. Traditional Physical Layer Security (PLS)-based cryptography approaches rely on accurate channel estimation and information exchange for key generation, which irrevocably ties key quality with digital channel estimation quality. Recently, we proposed a new concept called Graph Layer Security (GLS), where digital keys are derived from physical sensor readings. The sensor readings between legitimate users are correlated through a common background infrastructure environment (e.g., a common water distribution network or electric grid). The challenge for GLS has been how to achieve distributed key generation. This paper presents a Federated multi-agent Deep reinforcement learning-assisted Distributed Key generation scheme (FD2K), which fully exploits the common features of physical dynamics to establish secret key between legitimate users. We present for the first time initial experimental results of GLS with federated learning, achieving considerable security performance in terms of key agreement rate (KAR), and key randomness.
△ Less
Submitted 24 February, 2023;
originally announced February 2023.
-
A holistically 3D-printed flexible millimeter-wave Doppler radar: Towards fully printed high-frequency multilayer flexible hybrid electronics systems
Authors:
Hong Tang,
Yingjie Zhang,
Bowen Zheng,
Sensong An,
Mohammad Haerinia,
Yunxi Dong,
Yi Huang,
Wei Guo,
Hualiang Zhang
Abstract:
Flexible hybrid electronics (FHE) is an emerging technology enabled through the integration of advanced semiconductor devices and 3D printing technology. It unlocks tremendous market potential by realizing low-cost flexible circuits and systems that can be conformally integrated into various applications. However, the operating frequencies of most reported FHE systems are relatively low. It is als…
▽ More
Flexible hybrid electronics (FHE) is an emerging technology enabled through the integration of advanced semiconductor devices and 3D printing technology. It unlocks tremendous market potential by realizing low-cost flexible circuits and systems that can be conformally integrated into various applications. However, the operating frequencies of most reported FHE systems are relatively low. It is also worth to note that reported FHE systems have been limited to relatively simple design concept (since complex systems will impose challenges in aspects such as multilayer interconnections, printing materials, and bonding layers). Here, we report a fully 3D-printed flexible four-layer millimeter-wave Doppler radar (i.e., a millimeter-wave FHE system). The sensing performance and flexibility of the 3D-printed radar are characterized and validated by general field tests and bending tests, respectively. Our results demonstrate the feasibility of develo** fully 3D-printed high-frequency multilayer FHE, which can be conformally integrated into irregular surfaces (e.g., vehicle bumpers) for applications such as vehicle radars and wearable electronics.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws
Authors:
Kush Bhatia,
Wenshuo Guo,
Jacob Steinhardt
Abstract:
Specifying reward functions for complex tasks like object manipulation or driving is challenging to do by hand. Reward learning seeks to address this by learning a reward model using human feedback on selected query policies. This shifts the burden of reward specification to the optimal design of the queries. We propose a theoretical framework for studying reward learning and the associated optima…
▽ More
Specifying reward functions for complex tasks like object manipulation or driving is challenging to do by hand. Reward learning seeks to address this by learning a reward model using human feedback on selected query policies. This shifts the burden of reward specification to the optimal design of the queries. We propose a theoretical framework for studying reward learning and the associated optimal experiment design problem. Our framework models rewards and policies as nonparametric functions belonging to subsets of Reproducing Kernel Hilbert Spaces (RKHSs). The learner receives (noisy) oracle access to a true reward and must output a policy that performs well under the true reward. For this setting, we first derive non-asymptotic excess risk bounds for a simple plug-in estimator based on ridge regression. We then solve the query design problem by optimizing these risk bounds with respect to the choice of query set and obtain a finite sample statistical rate, which depends primarily on the eigenvalue spectrum of a certain linear operator on the RKHSs. Despite the generality of these results, our bounds are stronger than previous bounds developed for more specialized problems. We specifically show that the well-studied problem of Gaussian process (GP) bandit optimization is a special case of our framework, and that our bounds either improve or are competitive with known regret guarantees for the Matérn kernel.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
A Survey on User Behavior Modeling in Recommender Systems
Authors:
Zhicheng He,
Weiwen Liu,
Wei Guo,
Jiarui Qin,
Yingxue Zhang,
Yaochen Hu,
Ruiming Tang
Abstract:
User Behavior Modeling (UBM) plays a critical role in user interest learning, which has been extensively used in recommender systems. Crucial interactive patterns between users and items have been exploited, which brings compelling improvements in many recommendation tasks. In this paper, we attempt to provide a thorough survey of this research topic. We start by reviewing the research background…
▽ More
User Behavior Modeling (UBM) plays a critical role in user interest learning, which has been extensively used in recommender systems. Crucial interactive patterns between users and items have been exploited, which brings compelling improvements in many recommendation tasks. In this paper, we attempt to provide a thorough survey of this research topic. We start by reviewing the research background of UBM. Then, we provide a systematic taxonomy of existing UBM research works, which can be categorized into four different directions including Conventional UBM, Long-Sequence UBM, Multi-Type UBM, and UBM with Side Information. Within each direction, representative models and their strengths and weaknesses are comprehensively discussed. Besides, we elaborate on the industrial practices of UBM methods with the hope of providing insights into the application value of existing UBM solutions. Finally, we summarize the survey and discuss the future prospects of this field.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
A Flexible Multi-view Multi-modal Imaging System for Outdoor Scenes
Authors:
Meng Zhang,
Wenxuan Guo,
Bohao Fan,
Yifan Chen,
Jianjiang Feng,
Jie Zhou
Abstract:
Multi-view imaging systems enable uniform coverage of 3D space and reduce the impact of occlusion, which is beneficial for 3D object detection and tracking accuracy. However, existing imaging systems built with multi-view cameras or depth sensors are limited by the small applicable scene and complicated composition. In this paper, we propose a wireless multi-view multi-modal 3D imaging system gene…
▽ More
Multi-view imaging systems enable uniform coverage of 3D space and reduce the impact of occlusion, which is beneficial for 3D object detection and tracking accuracy. However, existing imaging systems built with multi-view cameras or depth sensors are limited by the small applicable scene and complicated composition. In this paper, we propose a wireless multi-view multi-modal 3D imaging system generally applicable to large outdoor scenes, which consists of a master node and several slave nodes. Multiple spatially distributed slave nodes equipped with cameras and LiDARs are connected to form a wireless sensor network. While providing flexibility and scalability, the system applies automatic spatio-temporal calibration techniques to obtain accurate 3D multi-view multi-modal data. This system is the first imaging system that integrates mutli-view RGB cameras and LiDARs in large outdoor scenes among existing 3D imaging systems. We perform point clouds based 3D object detection and long-term tracking using the 3D imaging dataset collected by this system. The experimental results show that multi-view point clouds greatly improve 3D object detection and tracking accuracy regardless of complex and various outdoor environments.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
A learned conservative semi-Lagrangian finite volume scheme for transport simulations
Authors:
Yongsheng Chen,
Wei Guo,
Xinghui Zhong
Abstract:
Semi-Lagrangian (SL) schemes are known as a major numerical tool for solving transport equations with many advantages and have been widely deployed in the fields of computational fluid dynamics, plasma physics modeling, numerical weather prediction, among others. In this work, we develop a novel machine learning-assisted approach to accelerate the conventional SL finite volume (FV) schemes. The pr…
▽ More
Semi-Lagrangian (SL) schemes are known as a major numerical tool for solving transport equations with many advantages and have been widely deployed in the fields of computational fluid dynamics, plasma physics modeling, numerical weather prediction, among others. In this work, we develop a novel machine learning-assisted approach to accelerate the conventional SL finite volume (FV) schemes. The proposed scheme avoids the expensive tracking of upstream cells but attempts to learn the SL discretization from the data by incorporating specific inductive biases in the neural network, significantly simplifying the algorithm implementation and leading to improved efficiency. In addition, the method delivers sharp shock transitions and a level of accuracy that would typically require a much finer grid with traditional transport solvers. Numerical tests demonstrate the effectiveness and efficiency of the proposed method.
△ Less
Submitted 20 February, 2023;
originally announced February 2023.
-
Leveraging Reviews: Learning to Price with Buyer and Seller Uncertainty
Authors:
Wenshuo Guo,
Nika Haghtalab,
Kirthevasan Kandasamy,
Ellen Vitercik
Abstract:
In online marketplaces, customers have access to hundreds of reviews for a single product. Buyers often use reviews from other customers that share their type -- such as height for clothing, skin type for skincare products, and location for outdoor furniture -- to estimate their values, which they may not know a priori. Customers with few relevant reviews may hesitate to make a purchase except at…
▽ More
In online marketplaces, customers have access to hundreds of reviews for a single product. Buyers often use reviews from other customers that share their type -- such as height for clothing, skin type for skincare products, and location for outdoor furniture -- to estimate their values, which they may not know a priori. Customers with few relevant reviews may hesitate to make a purchase except at a low price, so for the seller, there is a tension between setting high prices and ensuring that there are enough reviews so that buyers can confidently estimate their values. Simultaneously, sellers may use reviews to gauge the demand for items they wish to sell.
In this work, we study this pricing problem in an online setting where the seller interacts with a set of buyers of finitely many types, one by one, over a series of $T$ rounds. At each round, the seller first sets a price. Then a buyer arrives and examines the reviews of the previous buyers with the same type, which reveal those buyers' ex-post values. Based on the reviews, the buyer decides to purchase if they have good reason to believe that their ex-ante utility is positive. Crucially, the seller does not know the buyer's type when setting the price, nor even the distribution over types. We provide a no-regret algorithm that the seller can use to obtain high revenue. When there are $d$ types, after $T$ rounds, our algorithm achieves a problem-independent $\tilde O(T^{2/3}d^{1/3})$ regret bound. However, when the smallest probability $q_{\text{min}}$ that any given type appears is large, specifically when $q_{\text{min}} \in Ω(d^{-2/3}T^{-1/3})$, then the same algorithm achieves a $\tilde O(T^{1/2}q_{\text{min}}^{-1/2})$ regret bound. We complement these upper bounds with matching lower bounds in both regimes, showing that our algorithm is minimax optimal up to lower-order terms.
△ Less
Submitted 11 September, 2023; v1 submitted 19 February, 2023;
originally announced February 2023.
-
Testing Super-Eddington Accretion onto a Supermassive Black Hole: Reverberation Map** of PG 1119+120
Authors:
Fergus R. Donnan,
Juan V. Hernández Santisteban,
Keith Horne,
Chen Hu,
Pu Du,
Yan-Rong Li,
Ming Xiao,
Luis C. Ho,
Jesús Aceituno,
Jian-Min Wang,
Wei-Jian Guo,
Sen Yang,
Bo-Wei Jiang,
Zhu-Heng Yao
Abstract:
We measure the black hole mass and investigate the accretion flow around the local ($z=0.0502$) quasar PG 1119+120. Spectroscopic monitoring with Calar Alto provides H$β$ lags and linewidths from which we estimate a black hole mass of $\log \left(M_{\bullet}/\mathrm{M}_{\odot} \right) = 7.0$, uncertain by $\sim0.4$ dex. High cadence photometric monitoring over two years with the Las Cumbres Observ…
▽ More
We measure the black hole mass and investigate the accretion flow around the local ($z=0.0502$) quasar PG 1119+120. Spectroscopic monitoring with Calar Alto provides H$β$ lags and linewidths from which we estimate a black hole mass of $\log \left(M_{\bullet}/\mathrm{M}_{\odot} \right) = 7.0$, uncertain by $\sim0.4$ dex. High cadence photometric monitoring over two years with the Las Cumbres Observatory provides lightcurves in 7 optical bands suitable for intensive continuum reverberation map**. We identify variability on two timescales. Slower variations on a 100-day timescale exhibit excess flux and increased lag in the $u'$ band and are thus attributable to diffuse bound-free continuum emission from the broad line region. Faster variations that we attribute to accretion disc reprocessing lack a $u'$-band excess and have flux and delay spectra consistent with either $τ\propto λ^{4/3}$, as expected for a temperature structure of $T(R) \propto R^{-3/4}$ for a thin accretion disc, or $τ\propto λ^{2}$ expected for a slim disc. Decomposing the flux into variable (disc) and constant (host galaxy) components, we find the disc SED to be flatter than expected with $f_ν \sim \rm{const}$. Modelling the SED predicts an Eddington ratio of $λ_{\rm Edd} > 1$, where the flat spectrum can be reproduced by a slim disc with little dust extinction or a thin disc which requires more dust extinction. While this accretion is super-Eddington, the geometry is still unclear, however a slim disc is expected due to the high radiation pressure at these accretion rates, and is entirely consistent with our observations.
△ Less
Submitted 16 May, 2023; v1 submitted 18 February, 2023;
originally announced February 2023.
-
Unique Identification of 50,000+ Virtual Reality Users from Head & Hand Motion Data
Authors:
Vivek Nair,
Wenbo Guo,
Justus Mattern,
Rui Wang,
James F. O'Brien,
Louis Rosenberg,
Dawn Song
Abstract:
With the recent explosive growth of interest and investment in virtual reality (VR) and the so-called "metaverse," public attention has rightly shifted toward the unique security and privacy threats that these platforms may pose. While it has long been known that people reveal information about themselves via their motion, the extent to which this makes an individual globally identifiable within v…
▽ More
With the recent explosive growth of interest and investment in virtual reality (VR) and the so-called "metaverse," public attention has rightly shifted toward the unique security and privacy threats that these platforms may pose. While it has long been known that people reveal information about themselves via their motion, the extent to which this makes an individual globally identifiable within virtual reality has not yet been widely understood. In this study, we show that a large number of real VR users (N=55,541) can be uniquely and reliably identified across multiple sessions using just their head and hand motion relative to virtual objects. After training a classification model on 5 minutes of data per person, a user can be uniquely identified amongst the entire pool of 50,000+ with 94.33% accuracy from 100 seconds of motion, and with 73.20% accuracy from just 10 seconds of motion. This work is the first to truly demonstrate the extent to which biomechanics may serve as a unique identifier in VR, on par with widely used biometrics such as facial or fingerprint recognition.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
Metal-bonded perovskite lead hydride with phonon-mediated superconductivity up to 46 K under atmospheric pressure
Authors:
Yong He,
Juan Du,
Shi-ming Liu,
Chong Tian,
Wen-hui Guo,
Min Zhang,
Yao-hui Zhu,
Hong-xia Zhong,
Xinqiang Wang,
Jun-jie Shi
Abstract:
In the search for high-temperature superconductivity in hydrides, a plethora of multi-hydrogen superconductors have been theoretically predicted, and some have been synthesized experimentally under ultrahigh pressures of several hundred GPa. However, the impracticality of these high-pressure methods has been a persistent issue. In response, we propose a new approach to achieve high-temperature sup…
▽ More
In the search for high-temperature superconductivity in hydrides, a plethora of multi-hydrogen superconductors have been theoretically predicted, and some have been synthesized experimentally under ultrahigh pressures of several hundred GPa. However, the impracticality of these high-pressure methods has been a persistent issue. In response, we propose a new approach to achieve high-temperature superconductivity under atmospheric pressure by implanting hydrogen into lead to create a stable few-hydrogen metal-bonded perovskite, Pb$_4$H. This approach diverges from the popular design methodology of multi-hydrogen covalent high critical temperature ($T_c$) superconductors under ultrahigh pressure. By solving the anisotropic Migdal-Eliashberg (ME) equations, we demonstrate that perovskite Pb$_4$H is a typical phonon-mediated superconductor with a $T_c$ of 46 K, which is six times higher than that of bulk Pb (7.22 K) and higher than that of MgB$_2$ (39 K). The high $T_c$ can be attributed to the strong electron-phonon coupling (EPC) strength of 2.45, which arises from hydrogen implantation in lead that induces several high-frequency optical phonon modes with a relatively large phonon linewidth resulting from H atom vibration. The metallic-bonding in perovskite Pb$_4$H not only improves the structural stability but also guarantees better ductility than the widely investigated multi-hydrogen, iron-based, and cuprate superconductors. These results suggest that there is potential for the exploration of new high-temperature superconductors under atmospheric pressure and may reignite interest in their experimental synthesis soon.
△ Less
Submitted 17 April, 2023; v1 submitted 10 February, 2023;
originally announced February 2023.
-
A portable and high intensity 24 keV neutron source based on $^{124}$Sb-$^{9}$Be photoneutrons and an iron filter
Authors:
A. Biekert,
C. Chang,
L. Chaplinsky,
C. W. Fink,
W. D. Frey,
M. Garcia-Sciveres,
W. Guo,
S. A. Hertel,
X. Li,
J. Lin,
M. Lisovenko,
R. Mahapatra,
D. N. McKinsey,
S. Mehrotra,
N. Mirabolfathi,
P. K. Patel,
B. Penning,
H. D. Pinckney,
M. Reed,
R. K. Romani,
B. Sadoulet,
R. J. Smith,
P. Sorensen,
B. Suerfu,
A. Suzuki
, et al. (5 additional authors not shown)
Abstract:
A portable monoenergetic 24 keV neutron source based on the $^{124}$Sb-$^9$Be photoneutron reaction and an iron filter has been constructed and characterized. The coincidence of the neutron energy from SbBe and the low interaction cross-section with iron (mean free path up to 29 cm) makes pure iron specially suited to shield against gamma rays from $^{124}$Sb decays while letting through the neutr…
▽ More
A portable monoenergetic 24 keV neutron source based on the $^{124}$Sb-$^9$Be photoneutron reaction and an iron filter has been constructed and characterized. The coincidence of the neutron energy from SbBe and the low interaction cross-section with iron (mean free path up to 29 cm) makes pure iron specially suited to shield against gamma rays from $^{124}$Sb decays while letting through the neutrons. To increase the $^{124}$Sb activity and thus the neutron flux, a $>$1 GBq $^{124}$Sb source was produced by irradiating a natural Sb metal pellet with a high flux of thermal neutrons in a nuclear reactor. The design of the source shielding structure makes for easy transportation and deployment. A hydrogen gas proportional counter is used to characterize the neutrons emitted by the source and a NaI detector is used for gamma background characterization. At the exit opening of the neutron beam, the characterization determined the neutron flux in the energy range 20-25 keV to be 5.36$\pm$0.20 neutrons per cm$^2$ per second and the total gamma flux to be 213$\pm$6 gammas per cm$^2$ per second (numbers scaled to 1 GBq activity of the $^{124}$Sb source). A liquid scintillator detector is demonstrated to be sensitive to neutrons with incident kinetic energies from 8 to 17 keV, so it can be paired with the source as a backing detector for neutron scattering calibration experiments. This photoneutron source provides a good tool for in-situ low energy nuclear recoil calibration for dark matter experiments and coherent elastic neutrino-nucleus scattering experiments.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
Deep Graph-Level Clustering Using Pseudo-Label-Guided Mutual Information Maximization Network
Authors:
**yu Cai,
Yi Han,
Wenzhong Guo,
Jicong Fan
Abstract:
In this work, we study the problem of partitioning a set of graphs into different groups such that the graphs in the same group are similar while the graphs in different groups are dissimilar. This problem was rarely studied previously, although there have been a lot of work on node clustering and graph classification. The problem is challenging because it is difficult to measure the similarity or…
▽ More
In this work, we study the problem of partitioning a set of graphs into different groups such that the graphs in the same group are similar while the graphs in different groups are dissimilar. This problem was rarely studied previously, although there have been a lot of work on node clustering and graph classification. The problem is challenging because it is difficult to measure the similarity or distance between graphs. One feasible approach is using graph kernels to compute a similarity matrix for the graphs and then performing spectral clustering, but the effectiveness of existing graph kernels in measuring the similarity between graphs is very limited. To solve the problem, we propose a novel method called Deep Graph-Level Clustering (DGLC). DGLC utilizes a graph isomorphism network to learn graph-level representations by maximizing the mutual information between the representations of entire graphs and substructures, under the regularization of a clustering module that ensures discriminative representations via pseudo labels. DGLC achieves graph-level representation learning and graph-level clustering in an end-to-end manner. The experimental results on six benchmark datasets of graphs show that our DGLC has state-of-the-art performance in comparison to many baselines.
△ Less
Submitted 5 February, 2023;
originally announced February 2023.
-
HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline
Authors:
Yang Li,
Xinyan Chen,
Wenxuan Guo,
Xijun Li,
Wanqian Luo,
Junhua Huang,
Hui-Ling Zhen,
Mingxuan Yuan,
Junchi Yan
Abstract:
Industrial SAT formula generation is a critical yet challenging task. Existing SAT generation approaches can hardly simultaneously capture the global structural properties and maintain plausible computational hardness. We first present an in-depth analysis for the limitation of previous learning methods in reproducing the computational hardness of original instances, which may stem from the inhere…
▽ More
Industrial SAT formula generation is a critical yet challenging task. Existing SAT generation approaches can hardly simultaneously capture the global structural properties and maintain plausible computational hardness. We first present an in-depth analysis for the limitation of previous learning methods in reproducing the computational hardness of original instances, which may stem from the inherent homogeneity in their adopted split-merge procedure. On top of the observations that industrial formulae exhibit clear community structure and oversplit substructures lead to the difficulty in semantic formation of logical structures, we propose HardSATGEN, which introduces a fine-grained control mechanism to the neural split-merge paradigm for SAT formula generation to better recover the structural and computational properties of the industrial benchmarks. Experiments including evaluations on private and practical corporate testbed show the superiority of HardSATGEN being the only method to successfully augment formulae maintaining similar computational hardness and capturing the global structural properties simultaneously. Compared to the best previous methods, the average performance gains achieve 38.5% in structural statistics, 88.4% in computational metrics, and over 140.7% in the effectiveness of guiding solver tuning by our generated instances. Source code is available at http://github.com/Thinklab-SJTU/HardSATGEN
△ Less
Submitted 8 February, 2024; v1 submitted 4 February, 2023;
originally announced February 2023.
-
Coordinating Distributed Example Orders for Provably Accelerated Training
Authors:
A. Feder Cooper,
Wentao Guo,
Khiem Pham,
Tiancheng Yuan,
Charlie F. Ruan,
Yucheng Lu,
Christopher De Sa
Abstract:
Recent research on online Gradient Balancing (GraB) has revealed that there exist permutation-based example orderings for SGD that are guaranteed to outperform random reshuffling (RR). Whereas RR arbitrarily permutes training examples, GraB leverages stale gradients from prior epochs to order examples -- achieving a provably faster convergence rate than RR. However, GraB is limited by design: whil…
▽ More
Recent research on online Gradient Balancing (GraB) has revealed that there exist permutation-based example orderings for SGD that are guaranteed to outperform random reshuffling (RR). Whereas RR arbitrarily permutes training examples, GraB leverages stale gradients from prior epochs to order examples -- achieving a provably faster convergence rate than RR. However, GraB is limited by design: while it demonstrates an impressive ability to scale-up training on centralized data, it does not naturally extend to modern distributed ML workloads. We therefore propose Coordinated Distributed GraB (CD-GraB), which uses insights from prior work on kernel thinning to translate the benefits of provably faster permutation-based example ordering to distributed settings. With negligible overhead, CD-GraB exhibits a linear speedup in convergence rate over centralized GraB and outperforms distributed RR on a variety of benchmark tasks.
△ Less
Submitted 21 December, 2023; v1 submitted 1 February, 2023;
originally announced February 2023.
-
Universal Detection of Backdoor Attacks via Density-based Clustering and Centroids Analysis
Authors:
Wei Guo,
Benedetta Tondi,
Mauro Barni
Abstract:
We propose a Universal Defence against backdoor attacks based on Clustering and Centroids Analysis (CCA-UD). The goal of the defence is to reveal whether a Deep Neural Network model is subject to a backdoor attack by inspecting the training dataset. CCA-UD first clusters the samples of the training set by means of density-based clustering. Then, it applies a novel strategy to detect the presence o…
▽ More
We propose a Universal Defence against backdoor attacks based on Clustering and Centroids Analysis (CCA-UD). The goal of the defence is to reveal whether a Deep Neural Network model is subject to a backdoor attack by inspecting the training dataset. CCA-UD first clusters the samples of the training set by means of density-based clustering. Then, it applies a novel strategy to detect the presence of poisoned clusters. The proposed strategy is based on a general misclassification behaviour observed when the features of a representative example of the analysed cluster are added to benign samples. The capability of inducing a misclassification error is a general characteristic of poisoned samples, hence the proposed defence is attack-agnostic. This marks a significant difference with respect to existing defences, that, either can defend against only some types of backdoor attacks, or are effective only when some conditions on the poisoning ratio or the kind of triggering signal used by the attacker are satisfied.
Experiments carried out on several classification tasks and network architectures, considering different types of backdoor attacks (with either clean or corrupted labels), and triggering signals, including both global and local triggering signals, as well as sample-specific and source-specific triggers, reveal that the proposed method is very effective to defend against backdoor attacks in all the cases, always outperforming the state of the art techniques.
△ Less
Submitted 5 October, 2023; v1 submitted 11 January, 2023;
originally announced January 2023.
-
The permutability of $σ_i$-sylowizers of some $σ_i$-subgroups in finite groups
Authors:
Zhenya Liu,
Wenbin Guo
Abstract:
Let $σ=\{σ_{i}|i\in I\}$ be a partition of the set of all primes $\mathbb{P}$, $G$ a finite group and $σ(G)=\{σ_{i}|σ_{i}\cap π(|G|)\neq\emptyset\}$. A subgroup $S$ of a group $G$ is called a $σ_i$-sylowizer of a $σ_i$-subgroup $R$ in $G$ if $S$ is maximal in $G$ with respect to having $R$ as its Hall $σ_i$-subgroup. The main aim of this paper is to investigate the influence of $σ_i$-sylowizers on…
▽ More
Let $σ=\{σ_{i}|i\in I\}$ be a partition of the set of all primes $\mathbb{P}$, $G$ a finite group and $σ(G)=\{σ_{i}|σ_{i}\cap π(|G|)\neq\emptyset\}$. A subgroup $S$ of a group $G$ is called a $σ_i$-sylowizer of a $σ_i$-subgroup $R$ in $G$ if $S$ is maximal in $G$ with respect to having $R$ as its Hall $σ_i$-subgroup. The main aim of this paper is to investigate the influence of $σ_i$-sylowizers on the structure of finite groups. We obtained some new characterizations of supersoluble groups by the permutability of the $σ_i$-sylowizers of some $σ_i$-subgroups.
△ Less
Submitted 5 January, 2023;
originally announced January 2023.
-
QPanda: high-performance quantum computing framework for multiple application scenarios
Authors:
Menghan Dou,
Tianrui Zou,
Yuan Fang,
**g Wang,
Dongyi Zhao,
Lei Yu,
Boying Chen,
Wenbo Guo,
Ye Li,
Zhaoyun Chen,
Guo** Guo
Abstract:
With the birth of Noisy Intermediate Scale Quantum (NISQ) devices and the verification of "quantum supremacy" in random number sampling and boson sampling, more and more fields hope to use quantum computers to solve specific problems, such as aerodynamic design, route allocation, financial option prediction, quantum chemical simulation to find new materials, and the challenge of quantum cryptograp…
▽ More
With the birth of Noisy Intermediate Scale Quantum (NISQ) devices and the verification of "quantum supremacy" in random number sampling and boson sampling, more and more fields hope to use quantum computers to solve specific problems, such as aerodynamic design, route allocation, financial option prediction, quantum chemical simulation to find new materials, and the challenge of quantum cryptography to automotive industry security. However, these fields still need to constantly explore quantum algorithms that adapt to the current NISQ machine, so a quantum programming framework that can face multi-scenarios and application needs is required. Therefore, this paper proposes QPanda, an application scenario-oriented quantum programming framework with high-performance simulation. Such as designing quantum chemical simulation algorithms based on it to explore new materials, building a quantum machine learning framework to serve finance, etc. This framework implements high-performance simulation of quantum circuits, a configuration of the fusion processing backend of quantum computers and supercomputers, and compilation and optimization methods of quantum programs for NISQ machines. Finally, the experiment shows that quantum jobs can be executed with high fidelity on the quantum processor using quantum circuit compile and optimized interface and have better simulation performance.
△ Less
Submitted 29 December, 2022;
originally announced December 2022.
-
Simulation of Fermionic and Bosonic Critical Points with Emergent SO(5) Symmetry
Authors:
Toshihiro Sato,
Zhenjiu Wang,
Yuhai Liu,
Disha Hou,
Martin Hohenadler,
Wenan Guo,
Fakher F. Assaad
Abstract:
We introduce a model of Dirac fermions in 2+1 dimensions with a semimetallic, a quantum spin-Hall insulating (QSHI), and an s-wave superconducting (SSC) phase. The phase diagram features a multicritical point at which all three phases meet as well as a QSHI-SSC deconfined critical point. The QSHI and SSC orders correspond to mutually anti-commuting mass terms of the Dirac Hamiltonian. Based on thi…
▽ More
We introduce a model of Dirac fermions in 2+1 dimensions with a semimetallic, a quantum spin-Hall insulating (QSHI), and an s-wave superconducting (SSC) phase. The phase diagram features a multicritical point at which all three phases meet as well as a QSHI-SSC deconfined critical point. The QSHI and SSC orders correspond to mutually anti-commuting mass terms of the Dirac Hamiltonian. Based on this algebraic property, SO(5) symmetric field theories have been put forward to describe both types of critical points. Using quantum Monte Carlo simulations, we directly study the operator that rotates between QSHI and SSC states. The results suggest that it commutes with the low-energy effective Hamiltonian at criticality but has a gap in the ordered phases. This implies an emergent SO(5) symmetry at both the multicritical and the deconfined critical points.
△ Less
Submitted 21 December, 2022;
originally announced December 2022.