-
Chloride Ion Erosion of Pre-Stressed Concrete Bridges in Cold Regions
Authors:
Hongtao Cui,
Yi Zhuo,
Dongyuan Ke,
Zhonglong Li,
Shunlong Li
Abstract:
The erosion of chloride ions in concrete bridges will accelerate the corrosion of reinforcement, which is an important reason for the decline of bridge durability. The erosion process of chloride ion, especially deicing salt solution in cold regions, is complex and has many influencing factors. It is very important to use accurate and effective methods to analyze the chloride ion erosion process i…
▽ More
The erosion of chloride ions in concrete bridges will accelerate the corrosion of reinforcement, which is an important reason for the decline of bridge durability. The erosion process of chloride ion, especially deicing salt solution in cold regions, is complex and has many influencing factors. It is very important to use accurate and effective methods to analyze the chloride ion erosion process in concrete. In this study, the pre-stressed concrete bridge retired in the cold region was taken as the research object, and the specimens from the whole bridge are obtained by the method of core drilling sampling. The concentration of chloride ion was measured at different depths of the specimens. The process of chloride ion erosion was simulated in two-dimensional space through COMSOL multi-physical field simulation, and compared with the measured results. The simulation method proposed in this paper has good reliability and accuracy.
△ Less
Submitted 28 March, 2023;
originally announced March 2023.
-
Cesno: Possibility of Creating a New Programming Language
Authors:
Ozelot Vanilla,
**gxiang Yu,
Hemn Barzan Abdalla,
Haozhe Cui
Abstract:
Programming languages are incredibly versatile, enabling developers to create applications and programs that suit their individual requirements. This article introduces a new language called Cesno, designed from the ground up to offer an advanced, user-friendly, and easy-to-use programming environment. Cesno's syntax is similar to other popular languages, making it simple to learn and work with. I…
▽ More
Programming languages are incredibly versatile, enabling developers to create applications and programs that suit their individual requirements. This article introduces a new language called Cesno, designed from the ground up to offer an advanced, user-friendly, and easy-to-use programming environment. Cesno's syntax is similar to other popular languages, making it simple to learn and work with. It incorporates features from other languages, such as syntactic sugar, a built-in library, support for functional programming, object-oriented programming, dynamic ty**, a type system, and a variety of function parameters and restrictions. This article will explore the design of Cesno's grammar, provide a brief overview of how Cesno processes and compiles code, and provide examples of what Cesno's code looks like and how it can aid in development.
△ Less
Submitted 30 March, 2023; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Observation of Rydberg moiré excitons
Authors:
Qianying Hu,
Zhen Zhan,
Huiying Cui,
Yalei Zhang,
Feng **,
Xuan Zhao,
Mingjie Zhang,
Zhichuan Wang,
Qingming Zhang,
Kenji Watanabe,
Takashi Taniguchi,
Xuewei Cao,
Wu-Ming Liu,
Fengcheng Wu,
Shengjun Yuan,
Yang Xu
Abstract:
Rydberg excitons, the solid-state counterparts of Rydberg atoms, have sparked considerable interest in harnessing their quantum application potentials, whereas a major challenge is realizing their spatial confinement and manipulation. Lately, the rise of two-dimensional moiré superlattices with highly tunable periodic potentials provides a possible pathway. Here, we experimentally demonstrate this…
▽ More
Rydberg excitons, the solid-state counterparts of Rydberg atoms, have sparked considerable interest in harnessing their quantum application potentials, whereas a major challenge is realizing their spatial confinement and manipulation. Lately, the rise of two-dimensional moiré superlattices with highly tunable periodic potentials provides a possible pathway. Here, we experimentally demonstrate this capability through the observation of Rydberg moiré excitons (XRM), which are moiré trapped Rydberg excitons in monolayer semiconductor WSe2 adjacent to twisted bilayer graphene. In the strong coupling regime, the XRM manifest as multiple energy splittings, pronounced redshift, and narrowed linewidth in the reflectance spectra, highlighting their charge-transfer character where electron-hole separation is enforced by the strongly asymmetric interlayer Coulomb interactions. Our findings pave the way for pursuing novel physics and quantum technology exploitation based on the excitonic Rydberg states.
△ Less
Submitted 17 March, 2023;
originally announced March 2023.
-
Effective Hamiltonian approach to the exact dynamics of open system by complex discretization approximation for environment
Authors:
H. T. Cui,
Y. A. Yan,
M. Qin,
X. X. Yi
Abstract:
The discretization approximation method commonly used to simulate the open dynamics of system coupled to the environment in continuum often suffers from the recurrence. To address this issue, this paper proposes a noval generalization of the discretization approximation method in the complex plane using complex Gauss quadratures. The effective Hamiltonian can be constructed by this way, which is n…
▽ More
The discretization approximation method commonly used to simulate the open dynamics of system coupled to the environment in continuum often suffers from the recurrence. To address this issue, this paper proposes a noval generalization of the discretization approximation method in the complex plane using complex Gauss quadratures. The effective Hamiltonian can be constructed by this way, which is non-Hermitian and demonstrates the complex energy modes with negative imaginary part, describing accurately the dissipative dynamics of the system. This method is applied to examine the dynamics in two exactly solvable models: the dephasing model and the single-excitation open dynamics in the Aubry-André-Harper model. This approach not only significantly reduces recurrence and improve the effectiveness of calculation, but also provide the microscopic viewpoint on the dynamics of system through the effective Hamiltonian. In addition, a simple relationship between the parameters in computation and the effectiveness of evaluation is also established.
△ Less
Submitted 27 May, 2024; v1 submitted 12 March, 2023;
originally announced March 2023.
-
Characterization of the response of IHEP-IME LGAD with shallow carbon to Gamma Irradiation
Authors:
Weiyi Sun,
Yunyun Fan,
Mei Zhao,
Han Cui,
Chengjun Yu,
Shuqi Li,
Yuan Feng,
Xinhui Huang,
Zhijun Liang,
Xuewei Jia,
Wei Wang,
Tianya Wu,
Mengzhao Li,
João Guimarães da Costa,
Gaobo Xu
Abstract:
Low Gain Avalanche Detectors (LGAD) for the High-Granularity Timing Detector (HGTD) are crucial in reducing pileups in the High-Luminosity Large Hadron Collider. Numerous studies have been conducted on the bulk irradiation damage of LGADs. However, few studies have been carried out on the surface irradiation damage of LGAD sensors with shallow carbon implantation. In this paper, the IHEP-IME LGADs…
▽ More
Low Gain Avalanche Detectors (LGAD) for the High-Granularity Timing Detector (HGTD) are crucial in reducing pileups in the High-Luminosity Large Hadron Collider. Numerous studies have been conducted on the bulk irradiation damage of LGADs. However, few studies have been carried out on the surface irradiation damage of LGAD sensors with shallow carbon implantation. In this paper, the IHEP-IME LGADs with shallow carbon implantation were irradiated up to 2 MGy using gamma irradiation to investigate surface damage. Important characteristic parameters, including leakage currents, breakdown voltage (BV), inter-pad resistances, and capacitances, were tested before and after irradiation. The results showed that the leakage current and BV increased after irradiation, whereas overall inter-pad resistance exhibited minimal change and remained above $10^9\ Ω$ before and after irradiation. Capacitance was found to be less than 4.5 pF with a slight decrease in the gain layer depletion voltage (V$_{gl}$) after irradiation. No parameter affected by the inter-pad separation was observed before and after irradiation. All characteristic parameters meet the requirements of HGTD, and this design can be used to further optimization.
△ Less
Submitted 8 June, 2023; v1 submitted 10 March, 2023;
originally announced March 2023.
-
Bus Admittance Matrix Revisited: Is It Outdated on Modern Computers?
Authors:
Hantao Cui
Abstract:
Bus admittance matrix is widely used in power engineering for modeling networks. Being highly sparse, it requires fewer CPU operations when used for calculations. Meanwhile, sparse matrix calculations involve numerous indexing and scalar operations, which are unfavorable to modern processors. Without using the admittance matrix, nodal power injections and the corresponding sparse Jacobian can be c…
▽ More
Bus admittance matrix is widely used in power engineering for modeling networks. Being highly sparse, it requires fewer CPU operations when used for calculations. Meanwhile, sparse matrix calculations involve numerous indexing and scalar operations, which are unfavorable to modern processors. Without using the admittance matrix, nodal power injections and the corresponding sparse Jacobian can be computed by an element-wise method, which consists of a highly regular, vectorized evaluation step and a reduction step. This paper revisits the admittance matrix from the computational performance perspective by comparing it with the element-wise method. Case studies show that the admittance matrix method is generally slower than the element-wise method for grid test cases with thousands to hundreds of thousands of buses, especially on CPUs with support for wide vector instructions. This paper also analyzes the impact of the width of vector instructions and memory speed to predict the trend for future computers.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
On the Laplacian matching root integral variation
Authors:
Yi Wang,
Hai-Jian Cui,
Sebastian M. Cioabă
Abstract:
In this paper, we devote to investigating the circumstances under which the addition of an edge to a graph will cause the Laplacian matching roots to change only by integer quantities. We prove that the Laplacian matching root integral variation in one place is impossible and the Laplacian matching root integral variation in two places is also impossible under some constraints.
In this paper, we devote to investigating the circumstances under which the addition of an edge to a graph will cause the Laplacian matching roots to change only by integer quantities. We prove that the Laplacian matching root integral variation in one place is impossible and the Laplacian matching root integral variation in two places is also impossible under some constraints.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
Deterministic equivalent and error universality of deep random features learning
Authors:
Dominik Schröder,
Hugo Cui,
Daniil Dmitriev,
Bruno Loureiro
Abstract:
This manuscript considers the problem of learning a random Gaussian network function using a fully connected network with frozen intermediate layers and trainable readout layer. This problem can be seen as a natural generalization of the widely studied random features model to deeper architectures. First, we prove Gaussian universality of the test error in a ridge regression setting where the lear…
▽ More
This manuscript considers the problem of learning a random Gaussian network function using a fully connected network with frozen intermediate layers and trainable readout layer. This problem can be seen as a natural generalization of the widely studied random features model to deeper architectures. First, we prove Gaussian universality of the test error in a ridge regression setting where the learner and target networks share the same intermediate layers, and provide a sharp asymptotic formula for it. Establishing this result requires proving a deterministic equivalent for traces of the deep random features sample covariance matrices which can be of independent interest. Second, we conjecture the asymptotic Gaussian universality of the test error in the more general setting of arbitrary convex losses and generic learner/target architectures. We provide extensive numerical evidence for this conjecture, which requires the derivation of closed-form expressions for the layer-wise post-activation population covariances. In light of our results, we investigate the interplay between architecture design and implicit regularization.
△ Less
Submitted 1 February, 2023;
originally announced February 2023.
-
Bayes-optimal Learning of Deep Random Networks of Extensive-width
Authors:
Hugo Cui,
Florent Krzakala,
Lenka Zdeborová
Abstract:
We consider the problem of learning a target function corresponding to a deep, extensive-width, non-linear neural network with random Gaussian weights. We consider the asymptotic limit where the number of samples, the input dimension and the network width are proportionally large. We propose a closed-form expression for the Bayes-optimal test error, for regression and classification tasks. We furt…
▽ More
We consider the problem of learning a target function corresponding to a deep, extensive-width, non-linear neural network with random Gaussian weights. We consider the asymptotic limit where the number of samples, the input dimension and the network width are proportionally large. We propose a closed-form expression for the Bayes-optimal test error, for regression and classification tasks. We further compute closed-form expressions for the test errors of ridge regression, kernel and random features regression. We find, in particular, that optimally regularized ridge regression, as well as kernel regression, achieve Bayes-optimal performances, while the logistic loss yields a near-optimal test error for classification. We further show numerically that when the number of samples grows faster than the dimension, ridge and kernel methods become suboptimal, while neural networks achieve test error close to zero from quadratically many samples.
△ Less
Submitted 21 June, 2023; v1 submitted 1 February, 2023;
originally announced February 2023.
-
Millimetre-wave Radar for Low-Cost 3D Imaging: A Performance Study
Authors:
Han Cui,
Jiacheng Wu,
Naim Dahnoun
Abstract:
Millimetre-wave (mmWave) radars can generate 3D point clouds to represent objects in the scene. However, the accuracy and density of the generated point cloud can be lower than a laser sensor. Although researchers have used mmWave radars for various applications, there are few quantitative evaluations on the quality of the point cloud generated by the radar and there is a lack of a standard on how…
▽ More
Millimetre-wave (mmWave) radars can generate 3D point clouds to represent objects in the scene. However, the accuracy and density of the generated point cloud can be lower than a laser sensor. Although researchers have used mmWave radars for various applications, there are few quantitative evaluations on the quality of the point cloud generated by the radar and there is a lack of a standard on how this quality can be assessed. This work aims to fill the gap in the literature. A radar simulator is built to evaluate the most common data processing chains of 3D point cloud construction and to examine the capability of the mmWave radar as a 3D imaging sensor under various factors. It will be shown that the radar detection can be noisy and have an imbalance distribution. To address the problem, a novel super-resolution point cloud construction (SRPC) algorithm is proposed to improve the spatial resolution of the point cloud and is shown to be able to produce a more natural point cloud and reduce outliers.
△ Less
Submitted 31 January, 2023;
originally announced January 2023.
-
Breaking the Boundaries of Knowledge Space: Analyzing the Knowledge Spanning on the Q&A Website through Word Embeddings
Authors:
Haochuan Cui,
Tiewei Li,
Cheng-Jun Wang
Abstract:
The challenge of raising a creative question exists in recombining different categories of knowledge. However, the impact of recombination remains controversial. Drawing on the theories of knowledge recombination and category spanning, we propose that both the distance of knowledge spanning and the hierarchy of knowledge shape the appeal of questions. Using word embedding models and the data colle…
▽ More
The challenge of raising a creative question exists in recombining different categories of knowledge. However, the impact of recombination remains controversial. Drawing on the theories of knowledge recombination and category spanning, we propose that both the distance of knowledge spanning and the hierarchy of knowledge shape the appeal of questions. Using word embedding models and the data collected from a large online knowledge market (N = 463,545), we find that the impact of knowledge spanning on the appeal of questions is parabolic: the appeal of questions increases up to a threshold, after which point the positive effect reverses. However, the nonlinear influence of knowledge spanning is contingent upon the hierarchy of knowledge. The theoretical and practical implications of these findings for future research on knowledge recombination are discussed. We fill the research gap by conceptualizing question asking as knowledge spanning and highlighting the theoretical underpinnings of the knowledge hierarchy.
△ Less
Submitted 29 January, 2023; v1 submitted 23 January, 2023;
originally announced January 2023.
-
Unpacking the Essential Tension of Knowledge Recombination: Analyzing the Impact of Knowledge Spanning on Citation Counts and Disruptive Innovation
Authors:
Cheng-Jun Wang,
Lihan Yan,
Haochuan Cui
Abstract:
Drawing on the theories of knowledge recombination, we aim to unpack the essential tension between tradition and innovation in scientific research. Using the American Physical Society data and computational methods, we analyze the impact of knowledge spanning on both citation counts and disruptive innovation. The findings show that knowledge spanning has a U-shaped impact on disruptive innovation.…
▽ More
Drawing on the theories of knowledge recombination, we aim to unpack the essential tension between tradition and innovation in scientific research. Using the American Physical Society data and computational methods, we analyze the impact of knowledge spanning on both citation counts and disruptive innovation. The findings show that knowledge spanning has a U-shaped impact on disruptive innovation. In contrast, there is an inverted U-shaped relationship between knowledge spanning and citation counts, and the inverted U-shaped effect is moderated by team size. This study contributes to the theories of knowledge recombination by suggesting that both intellectual conformism and knowledge recombination can lead to disruptive innovation. That is, when evaluating the quality of scientific research with disruptive innovation, the essential tension seems to disappear.
△ Less
Submitted 28 January, 2023; v1 submitted 23 January, 2023;
originally announced January 2023.
-
Crowd3D: Towards Hundreds of People Reconstruction from a Single Image
Authors:
Hao Wen,
**g Huang,
Huili Cui,
Haozhe Lin,
YuKun Lai,
Lu Fang,
Kun Li
Abstract:
Image-based multi-person reconstruction in wide-field large scenes is critical for crowd analysis and security alert. However, existing methods cannot deal with large scenes containing hundreds of people, which encounter the challenges of large number of people, large variations in human scale, and complex spatial distribution. In this paper, we propose Crowd3D, the first framework to reconstruct…
▽ More
Image-based multi-person reconstruction in wide-field large scenes is critical for crowd analysis and security alert. However, existing methods cannot deal with large scenes containing hundreds of people, which encounter the challenges of large number of people, large variations in human scale, and complex spatial distribution. In this paper, we propose Crowd3D, the first framework to reconstruct the 3D poses, shapes and locations of hundreds of people with global consistency from a single large-scene image. The core of our approach is to convert the problem of complex crowd localization into pixel localization with the help of our newly defined concept, Human-scene Virtual Interaction Point (HVIP). To reconstruct the crowd with global consistency, we propose a progressive reconstruction network based on HVIP by pre-estimating a scene-level camera and a ground plane. To deal with a large number of persons and various human sizes, we also design an adaptive human-centric crop** scheme. Besides, we contribute a benchmark dataset, LargeCrowd, for crowd reconstruction in a large scene. Experimental results demonstrate the effectiveness of the proposed method. The code and datasets will be made public.
△ Less
Submitted 1 April, 2023; v1 submitted 23 January, 2023;
originally announced January 2023.
-
Neighborhood-Regularized Self-Training for Learning with Few Labels
Authors:
Ran Xu,
Yue Yu,
Hejie Cui,
Xuan Kan,
Yanqiao Zhu,
Joyce Ho,
Chao Zhang,
Carl Yang
Abstract:
Training deep neural networks (DNNs) with limited supervision has been a popular research topic as it can significantly alleviate the annotation burden. Self-training has been successfully applied in semi-supervised learning tasks, but one drawback of self-training is that it is vulnerable to the label noise from incorrect pseudo labels. Inspired by the fact that samples with similar labels tend t…
▽ More
Training deep neural networks (DNNs) with limited supervision has been a popular research topic as it can significantly alleviate the annotation burden. Self-training has been successfully applied in semi-supervised learning tasks, but one drawback of self-training is that it is vulnerable to the label noise from incorrect pseudo labels. Inspired by the fact that samples with similar labels tend to share similar representations, we develop a neighborhood-based sample selection approach to tackle the issue of noisy pseudo labels. We further stabilize self-training via aggregating the predictions from different rounds during sample selection. Experiments on eight tasks show that our proposed method outperforms the strongest self-training baseline with 1.83% and 2.51% performance gain for text and graph datasets on average. Our further analysis demonstrates that our proposed data selection strategy reduces the noise of pseudo labels by 36.8% and saves 57.3% of the time when compared with the best baseline. Our code and appendices will be uploaded to https://github.com/ritaranx/NeST.
△ Less
Submitted 15 February, 2023; v1 submitted 9 January, 2023;
originally announced January 2023.
-
Shape-Aware Fine-Grained Classification of Erythroid Cells
Authors:
Ye Wang,
Rui Ma,
Xiaoqing Ma,
Honghua Cui,
Yubin Xiao,
Xuan Wu,
You Zhou
Abstract:
Fine-grained classification and counting of bone marrow erythroid cells are vital for evaluating the health status and formulating therapeutic schedules for leukemia or hematopathy. Due to the subtle visual differences between different types of erythroid cells, it is challenging to apply existing image-based deep learning models for fine-grained erythroid cell classification. Moreover, there is n…
▽ More
Fine-grained classification and counting of bone marrow erythroid cells are vital for evaluating the health status and formulating therapeutic schedules for leukemia or hematopathy. Due to the subtle visual differences between different types of erythroid cells, it is challenging to apply existing image-based deep learning models for fine-grained erythroid cell classification. Moreover, there is no large open-source datasets on erythroid cells to support the model training. In this paper, we introduce BMEC (Bone Morrow Erythroid Cells), the first large fine-grained image dataset of erythroid cells, to facilitate more deep learning research on erythroid cells. BMEC contains 5,666 images of individual erythroid cells, each of which is extracted from the bone marrow erythroid cell smears and professionally annotated to one of the four types of erythroid cells. To distinguish the erythroid cells, one key indicator is the cell shape which is closely related to the cell growth and maturation. Therefore, we design a novel shape-aware image classification network for fine-grained erythroid cell classification. The shape feature is extracted from the shape mask image and aggregated to the raw image feature with a shape attention module. With the shape-attended image feature, our network achieved superior classification performance (81.12\% top-1 accuracy) on the BMEC dataset comparing to the baseline methods. Ablation studies also demonstrate the effectiveness of incorporating the shape information for the fine-grained cell classification. To further verify the generalizability of our method, we tested our network on two additional public white blood cells (WBC) datasets and the results show our shape-aware method can generally outperform recent state-of-the-art works on classifying the WBC. The code and BMEC dataset can be found on https://github.com/wangye8899/BMEC.
△ Less
Submitted 27 December, 2022;
originally announced December 2022.
-
Sums of Reciprocals of Recurrence Relations
Authors:
Hao Cui,
Xiaoyu Cui,
Sophia C. Davis,
Irfan Durmić,
Qingcheng Hu,
Lisa Liu,
Steven J. Miller,
Feng** Ren,
Alicia Smith Reina,
Eliel Sosis
Abstract:
There is a growing literature on sums of reciprocals of polynomial functions of recurrence relations with constant coefficients and fixed depth, such as Fibonacci and Tribonacci numbers, products of such numbers, and balancing numbers (numbers $n$ such that the sum of the integers less than $n$ equals the sum of the $r$ integers immediately after, for some $r$ which is called the balancer of $n$;…
▽ More
There is a growing literature on sums of reciprocals of polynomial functions of recurrence relations with constant coefficients and fixed depth, such as Fibonacci and Tribonacci numbers, products of such numbers, and balancing numbers (numbers $n$ such that the sum of the integers less than $n$ equals the sum of the $r$ integers immediately after, for some $r$ which is called the balancer of $n$; If $n$ is included in the summation, we have the cobalancing numbers, and $r$ is called the cobalancer of $n$). We generalize previous work to reciprocal sums of depth two recurrence sequences with arbitrary coefficients and the Tribonacci numbers, and show our method provides an alternative proof of some existing results.
We define $(a,b)$ balancing and cobalancing numbers, where $a$ and $b$ are constants that multiply the left-hand side and right-hand side respectively, and derive recurrence relations describing these sequences. We show that for balancing numbers, the coefficients $(3,1)$ is unique such that every integer is a $(3,1)$ balancing number, and proved there does not exist an analogous set of coefficients for cobalancing numbers. We also found patterns for certain coefficients that have no balancing or cobalancing numbers.
△ Less
Submitted 1 February, 2023; v1 submitted 5 December, 2022;
originally announced December 2022.
-
ReAssigner: A Plug-and-Play Virtual Machine Scheduling Intensifier for Heterogeneous Requests
Authors:
Haochuan Cui,
Junjie Sheng,
Bo **,
Yiqiu Hu,
Li Su,
Lei Zhu,
Wenli Zhou,
Xiangfeng Wang
Abstract:
With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences. By analyzing the impact of request heterogeneity on some popular heuristic schedulers, it can be found that existing scheduling algorithms can not handle the request heterog…
▽ More
With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences. By analyzing the impact of request heterogeneity on some popular heuristic schedulers, it can be found that existing scheduling algorithms can not handle the request heterogeneity properly and efficiently. In this paper, a plug-and-play virtual machine scheduling intensifier, called Resource Assigner (ReAssigner), is proposed to enhance the scheduling efficiency of any given scheduler for heterogeneous requests. The key idea of ReAssigner is to pre-assign roles to physical resources and let resources of the same role form a virtual cluster to handle homogeneous requests. ReAssigner can cooperate with arbitrary schedulers by restricting their scheduling space to virtual clusters. With evaluations on the real dataset from Huawei Cloud, the proposed ReAssigner achieves significant scheduling performance improvement compared with some state-of-the-art scheduling methods.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
CodeExp: Explanatory Code Document Generation
Authors:
Haotian Cui,
Chenglong Wang,
Junjie Huang,
Jeevana Priya Inala,
Todd Mytkowicz,
Bo Wang,
Jianfeng Gao,
Nan Duan
Abstract:
Develo** models that can automatically generate detailed code explanation can greatly benefit software maintenance and programming education. However, existing code-to-text generation models often produce only high-level summaries of code that do not capture implementation-level choices essential for these scenarios. To fill in this gap, we propose the code explanation generation task. We first…
▽ More
Develo** models that can automatically generate detailed code explanation can greatly benefit software maintenance and programming education. However, existing code-to-text generation models often produce only high-level summaries of code that do not capture implementation-level choices essential for these scenarios. To fill in this gap, we propose the code explanation generation task. We first conducted a human study to identify the criteria for high-quality explanatory docstring for code. Based on that, we collected and refined a large-scale code docstring corpus and formulated automatic evaluation metrics that best match human assessments. Finally, we present a multi-stage fine-tuning strategy and baseline models for the task. Our experiments show that (1) our refined training dataset lets models achieve better performance in the explanation generation tasks compared to larger unrefined data (15x larger), and (2) fine-tuned models can generate well-structured long docstrings comparable to human-written ones. We envision our training dataset, human-evaluation protocol, recommended metrics, and fine-tuning strategy can boost future code explanation research. The code and annotated data are available at https://github.com/subercui/CodeExp.
△ Less
Submitted 25 November, 2022;
originally announced November 2022.
-
DiME and AGVis: A Distributed Messaging Environment and Geographical Visualizer for Large-scale Power System Simulation
Authors:
Nicholas Parsly,
**ning Wang,
Nick West,
Qiwei Zhang,
Hantao Cui,
Fangxing Li
Abstract:
This paper introduces the messaging environment and the geographical visualization tool of the CURENT Large-scale Testbed (LTB) that can be used for large-scale power system closed-loop simulation. First, Distributed Messaging Environment (DiME) implements an asynchronous shared workspace to enable high-concurrent data exchange. Second, Another Grid Visualizer (AGVis) is presented as a geovisualiz…
▽ More
This paper introduces the messaging environment and the geographical visualization tool of the CURENT Large-scale Testbed (LTB) that can be used for large-scale power system closed-loop simulation. First, Distributed Messaging Environment (DiME) implements an asynchronous shared workspace to enable high-concurrent data exchange. Second, Another Grid Visualizer (AGVis) is presented as a geovisualization tool that facilitates the visualization of real-time power system simulation. Third, case studies show the use of DiME and AGVis. The results demonstrate that, with the modular structure, the LTB is capable of not only federal use for real-time, large-scale power system simulation, but also independent use for customized power system research.
△ Less
Submitted 17 October, 2023; v1 submitted 21 November, 2022;
originally announced November 2022.
-
Execution-based Evaluation for Data Science Code Generation Models
Authors:
Junjie Huang,
Chenglong Wang,
Jipeng Zhang,
Cong Yan,
Haotian Cui,
Jeevana Priya Inala,
Colin Clement,
Nan Duan,
Jianfeng Gao
Abstract:
Code generation models can benefit data scientists' productivity by automatically generating code from context and text descriptions. An important measure of the modeling progress is whether a model can generate code that can correctly execute to solve the task. However, due to the lack of an evaluation dataset that directly supports execution-based model evaluation, existing work relies on code s…
▽ More
Code generation models can benefit data scientists' productivity by automatically generating code from context and text descriptions. An important measure of the modeling progress is whether a model can generate code that can correctly execute to solve the task. However, due to the lack of an evaluation dataset that directly supports execution-based model evaluation, existing work relies on code surface form similarity metrics (e.g., BLEU, CodeBLEU) for model selection, which can be inaccurate.
To remedy this, we introduce ExeDS, an evaluation dataset for execution evaluation for data science code generation tasks. ExeDS contains a set of 534 problems from Jupyter Notebooks, each consisting of code context, task description, reference program, and the desired execution output. With ExeDS, we evaluate the execution performance of five state-of-the-art code generation models that have achieved high surface-form evaluation scores. Our experiments show that models with high surface-form scores do not necessarily perform well on execution metrics, and execution-based metrics can better capture model code generation errors. Source code and data can be found at https://github.com/Jun-jie-Huang/ExeDS
△ Less
Submitted 17 November, 2022;
originally announced November 2022.
-
A Roadmap to Asymptotic Properties with Applications to COVID-19 Data
Authors:
Elvis Han Cui
Abstract:
Asymptotic properties of statistical estimators play a significant role both in practice and in theory. However, many asymptotic results in statistics rely heavily on the independent and identically distributed (iid) assumption, which is not realistic when we have fixed designs. In this article, we build a roadmap of general procedures for deriving asymptotic properties under fixed designs and the…
▽ More
Asymptotic properties of statistical estimators play a significant role both in practice and in theory. However, many asymptotic results in statistics rely heavily on the independent and identically distributed (iid) assumption, which is not realistic when we have fixed designs. In this article, we build a roadmap of general procedures for deriving asymptotic properties under fixed designs and the observations need not to be iid. We further provide their applications in many statistical applications. Finally, we apply our results to Poisson regression using a COVID-19 dataset as an illustration to demonstrate the power of these results in practice.
△ Less
Submitted 6 October, 2022;
originally announced November 2022.
-
Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks
Authors:
Yue Yu,
Xuan Kan,
Hejie Cui,
Ran Xu,
Yujia Zheng,
Xiangchen Song,
Yanqiao Zhu,
Kun Zhang,
Razieh Nabi,
Ying Guo,
Chao Zhang,
Carl Yang
Abstract:
Functional magnetic resonance imaging (fMRI) has become one of the most common imaging modalities for brain function analysis. Recently, graph neural networks (GNN) have been adopted for fMRI analysis with superior performance. Unfortunately, traditional functional brain networks are mainly constructed based on similarities among region of interests (ROI), which are noisy and agnostic to the downs…
▽ More
Functional magnetic resonance imaging (fMRI) has become one of the most common imaging modalities for brain function analysis. Recently, graph neural networks (GNN) have been adopted for fMRI analysis with superior performance. Unfortunately, traditional functional brain networks are mainly constructed based on similarities among region of interests (ROI), which are noisy and agnostic to the downstream prediction tasks and can lead to inferior results for GNN-based models. To better adapt GNNs for fMRI analysis, we propose TBDS, an end-to-end framework based on \underline{T}ask-aware \underline{B}rain connectivity \underline{D}AG (short for Directed Acyclic Graph) \underline{S}tructure generation for fMRI analysis. The key component of TBDS is the brain network generator which adopts a DAG learning approach to transform the raw time-series into task-aware brain connectivities. Besides, we design an additional contrastive regularization to inject task-specific knowledge during the brain network generation process. Comprehensive experiments on two fMRI datasets, namely Adolescent Brain Cognitive Development (ABCD) and Philadelphia Neuroimaging Cohort (PNC) datasets demonstrate the efficacy of TBDS. In addition, the generated brain networks also highlight the prediction-related brain regions and thus provide unique interpretations of the prediction results. Our implementation will be published to https://github.com/yueyu1030/TBDS upon acceptance.
△ Less
Submitted 31 October, 2022;
originally announced November 2022.
-
Improving Motion Forecasting for Autonomous Driving with the Cycle Consistency Loss
Authors:
Titas Chakraborty,
Akshay Bhagat,
Henggang Cui
Abstract:
Robust motion forecasting of the dynamic scene is a critical component of an autonomous vehicle. It is a challenging problem due to the heterogeneity in the scene and the inherent uncertainties in the problem. To improve the accuracy of motion forecasting, in this work, we identify a new consistency constraint in this task, that is an agent's future trajectory should be coherent with its history o…
▽ More
Robust motion forecasting of the dynamic scene is a critical component of an autonomous vehicle. It is a challenging problem due to the heterogeneity in the scene and the inherent uncertainties in the problem. To improve the accuracy of motion forecasting, in this work, we identify a new consistency constraint in this task, that is an agent's future trajectory should be coherent with its history observations and visa versa. To leverage this property, we propose a novel cycle consistency training scheme and define a novel cycle loss to encourage this consistency. In particular, we reverse the predicted future trajectory backward in time and feed it back into the prediction model to predict the history and compute the loss as an additional cycle loss term. Through our experiments on the Argoverse dataset, we demonstrate that cycle loss can improve the performance of competitive motion forecasting models.
△ Less
Submitted 31 October, 2022;
originally announced November 2022.
-
Linear nonsaturating magnetoresistance in kagome superconductor CsV3Sb5 thin flakes
Authors:
Xinjian Wei,
Congkuan Tian,
Hang Cui,
Yongkai Li,
Shaobo Liu,
Ya Feng,
Jian Cui,
Yuanjun Song,
Zhiwei Wang,
Jian-Hao Chen
Abstract:
Linear nonsaturating magnetoresistance (LMR) represents a class of anomalous resistivity response to external magnetic field that has been observed in a variety of materials including but not limited to topological semi-metals, high-Tc superconductors and materials with charge/spin density wave (CDW/SDW) orders. Here we report the observation of LMR in layered kagome superconductor and CDW materia…
▽ More
Linear nonsaturating magnetoresistance (LMR) represents a class of anomalous resistivity response to external magnetic field that has been observed in a variety of materials including but not limited to topological semi-metals, high-Tc superconductors and materials with charge/spin density wave (CDW/SDW) orders. Here we report the observation of LMR in layered kagome superconductor and CDW material CsV3Sb5 thin flakes, as well as the dimensional crossover and temperature (T) crossover of such LMR. Specifically, in ultrathin CsV3Sb5 crystals, the magnetoresistance (MR) exhibits a crossover from LMR at low T to quadratic B dependence above the CDW transition temperature; the MR also exhibits a crossover from LMR to sublinear MR for sample thickness at around ~20 nm at low T. We discuss several possible origins of the LMR and attribute the effect to two-dimensional (2D) CDW fluctuations. Our results may provide a new perspective for understanding the interactions between competing orders in kagome superconductors.
△ Less
Submitted 30 October, 2022;
originally announced October 2022.
-
Laugh Betrays You? Learning Robust Speaker Representation From Speech Containing Non-Verbal Fragments
Authors:
Yuke Lin,
Xiaoyi Qin,
Huahua Cui,
Zhenyi Zhu,
Ming Li
Abstract:
The success of automatic speaker verification shows that discriminative speaker representations can be extracted from neutral speech. However, as a kind of non-verbal voice, laughter should also carry speaker information intuitively. Thus, this paper focuses on exploring speaker verification about utterances containing non-verbal laughter segments. We collect a set of clips with laughter component…
▽ More
The success of automatic speaker verification shows that discriminative speaker representations can be extracted from neutral speech. However, as a kind of non-verbal voice, laughter should also carry speaker information intuitively. Thus, this paper focuses on exploring speaker verification about utterances containing non-verbal laughter segments. We collect a set of clips with laughter components by conducting a laughter detection script on VoxCeleb and part of the CN-Celeb dataset. To further filter untrusted clips, probability scores are calculated by our binary laughter detection classifier, which is pre-trained by pure laughter and neutral speech. After that, based on the clips whose scores are over the threshold, we construct trials under two different evaluation scenarios: Laughter-Laughter (LL) and Speech-Laughter (SL). Then a novel method called Laughter-Splicing based Network (LSN) is proposed, which can significantly boost performance in both scenarios and maintain the performance on the neutral speech, such as the VoxCeleb1 test set. Specifically, our system achieves relative 20% and 22% improvement on Laughter-Laughter and Speech-Laughter trials, respectively. The meta-data and sample clips have been released at https://github.com/nevermoreLin/Laugh_LSN.
△ Less
Submitted 20 November, 2023; v1 submitted 28 October, 2022;
originally announced October 2022.
-
A Tutorial on Statistical Models Based on Counting Processes
Authors:
Elvis Han Cui
Abstract:
Since the famous paper written by Kaplan and Meier in 1958, survival analysis has become one of the most important fields in statistics. Nowadays it is one of the most important statistical tools in analyzing epidemiological and clinical data including COVID-19 pandemic. This article reviews some of the most celebrated and important results and methods, including consistency, asymptotic normality,…
▽ More
Since the famous paper written by Kaplan and Meier in 1958, survival analysis has become one of the most important fields in statistics. Nowadays it is one of the most important statistical tools in analyzing epidemiological and clinical data including COVID-19 pandemic. This article reviews some of the most celebrated and important results and methods, including consistency, asymptotic normality, bias and variance estimation, in survival analysis and the treatment is parallel to the monograph Statistical Models Based on Counting Processes. Other models and results such as semi-Markov models and the Turnbull's estimator that jump out of the classical counting process martingale framework are also discussed.
△ Less
Submitted 23 October, 2022; v1 submitted 30 September, 2022;
originally announced October 2022.
-
PoliGraph: Automated Privacy Policy Analysis using Knowledge Graphs
Authors:
Hao Cui,
Rahmadi Trimananda,
Athina Markopoulou,
Scott Jordan
Abstract:
Privacy policies disclose how an organization collects and handles personal information. Recent work has made progress in leveraging natural language processing (NLP) to automate privacy policy analysis and extract data collection statements from different sentences, considered in isolation from each other. In this paper, we view and analyze, for the first time, the entire text of a privacy policy…
▽ More
Privacy policies disclose how an organization collects and handles personal information. Recent work has made progress in leveraging natural language processing (NLP) to automate privacy policy analysis and extract data collection statements from different sentences, considered in isolation from each other. In this paper, we view and analyze, for the first time, the entire text of a privacy policy in an integrated way. In terms of methodology: (1) we define PoliGraph, a type of knowledge graph that captures statements in a privacy policy as relations between different parts of the text; and (2) we develop an NLP-based tool, PoliGraph-er, to automatically extract PoliGraph from the text. In addition, (3) we revisit the notion of ontologies, previously defined in heuristic ways, to capture subsumption relations between terms. We make a clear distinction between local and global ontologies to capture the context of individual privacy policies, application domains, and privacy laws. Using a public dataset for evaluation, we show that PoliGraph-er identifies 40% more collection statements than prior state-of-the-art, with 97% precision. In terms of applications, PoliGraph enables automated analysis of a corpus of privacy policies and allows us to: (1) reveal common patterns in the texts across different privacy policies, and (2) assess the correctness of the terms as defined within a privacy policy. We also apply PoliGraph to: (3) detect contradictions in a privacy policy, where we show false alarms by prior work, and (4) analyze the consistency of privacy policies and network traffic, where we identify significantly more clear disclosures than prior work.
△ Less
Submitted 20 June, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Brain Network Transformer
Authors:
Xuan Kan,
Wei Dai,
Hejie Cui,
Zilong Zhang,
Ying Guo,
Carl Yang
Abstract:
Human brains are commonly modeled as networks of Regions of Interest (ROIs) and their connections for the understanding of brain functions and mental disorders. Recently, Transformer-based models have been studied over different types of data, including graphs, shown to bring performance gains widely. In this work, we study Transformer-based models for brain network analysis. Driven by the unique…
▽ More
Human brains are commonly modeled as networks of Regions of Interest (ROIs) and their connections for the understanding of brain functions and mental disorders. Recently, Transformer-based models have been studied over different types of data, including graphs, shown to bring performance gains widely. In this work, we study Transformer-based models for brain network analysis. Driven by the unique properties of data, we model brain networks as graphs with nodes of fixed size and order, which allows us to (1) use connection profiles as node features to provide natural and low-cost positional information and (2) learn pair-wise connection strengths among ROIs with efficient attention weights across individuals that are predictive towards downstream analysis tasks. Moreover, we propose an Orthonormal Clustering Readout operation based on self-supervised soft clustering and orthonormal projection. This design accounts for the underlying functional modules that determine similar behaviors among groups of ROIs, leading to distinguishable cluster-aware node embeddings and informative graph embeddings. Finally, we re-standardize the evaluation pipeline on the only one publicly available large-scale brain network dataset of ABIDE, to enable meaningful comparison of different models. Experiment results show clear improvements of our proposed Brain Network Transformer on both the public ABIDE and our restricted ABCD datasets. The implementation is available at https://github.com/Wayfear/BrainNetworkTransformer.
△ Less
Submitted 15 October, 2022; v1 submitted 12 October, 2022;
originally announced October 2022.
-
Super resolution dual-energy cone-beam CT imaging with dual-layer flat-panel detector
Authors:
Ting Su,
Jiongtao Zhu,
Xin Zhang,
Dong Zeng,
Yuhang Tan,
Han Cui,
Hairong Zheng,
Jianhua Ma,
Dong Liang,
Yongshuai Ge
Abstract:
For medical cone-beam computed tomography (CBCT) imaging, the native receptor array of the flat-panel detector (FPD) is usually binned into a reduced matrix size. By doing so, the signal readout speed can be increased by over 4-9 times at the expense of sacrificing the spatial resolution by at least 50%-67%. Clearly, such tradition poses a main bottleneck in generating high spatial resolution and…
▽ More
For medical cone-beam computed tomography (CBCT) imaging, the native receptor array of the flat-panel detector (FPD) is usually binned into a reduced matrix size. By doing so, the signal readout speed can be increased by over 4-9 times at the expense of sacrificing the spatial resolution by at least 50%-67%. Clearly, such tradition poses a main bottleneck in generating high spatial resolution and high temporal resolution CBCT images at the same time. In addition, the conventional FPD is also difficult in generating dual-energy CBCT images. In this paper, we propose an innovative super resolution dual-energy CBCT imaging method, named as suRi, based on dual-layer FPD (DL-FPD) to overcome these aforementioned difficulties at once. With suRi, specifically, an 1D or 2D sub-pixel (half pixel in this study) shifted binning is applied to replace the conventionally aligned binning to double the spatial sampling rate during the dual-energy data acquisition. As a result, the suRi approach provides a new strategy to enable high signal readout speed and high spatial resolution CBCT imaging with FPD. Moreover, a penalized likelihood material decomposition algorithm is developed to directly reconstruct the high resolution bases from the dual-energy CBCT projections containing spatial sub-pixel shifts. Experiments based on the single-layer FPD and DL-FPD are performed with physical phantoms and biological specimen to validate this newly developed suRi method. The synthesized monochromatic CT imaging results demonstrate that suRi can significantly improve the spatial image resolution by 46.15%. We believe the developed suRi method would be capable to greatly enhance the imaging performance of the DL-FPD based dual-energy CBCT systems in future.
△ Less
Submitted 17 October, 2022; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Bicoptor: Two-round Secure Three-party Non-linear Computation without Preprocessing for Privacy-preserving Machine Learning
Authors:
Li**g Zhou,
Ziyu Wang,
Hongrui Cui,
Qingrui Song,
Yu Yu
Abstract:
The overhead of non-linear functions dominates the performance of the secure multiparty computation (MPC) based privacy-preserving machine learning (PPML). This work introduces a family of novel secure three-party computation (3PC) protocols, Bicoptor, which improve the efficiency of evaluating non-linear functions. The basis of Bicoptor is a new sign determination protocol, which relies on a clev…
▽ More
The overhead of non-linear functions dominates the performance of the secure multiparty computation (MPC) based privacy-preserving machine learning (PPML). This work introduces a family of novel secure three-party computation (3PC) protocols, Bicoptor, which improve the efficiency of evaluating non-linear functions. The basis of Bicoptor is a new sign determination protocol, which relies on a clever use of the truncation protocol proposed in SecureML (S\&P 2017). Our 3PC sign determination protocol only requires two communication rounds, and does not involve any preprocessing. Such sign determination protocol is well-suited for computing non-linear functions in PPML, e.g. the activation function ReLU, Maxpool, and their variants. We develop suitable protocols for these non-linear functions, which form a family of GPU-friendly protocols, Bicoptor. All Bicoptor protocols only require two communication rounds without preprocessing. We evaluate Bicoptor under a 3-party LAN network over a public cloud, and achieve more than 370,000 DReLU/ReLU or 41,000 Maxpool (find the maximum value of nine inputs) operations per second. Under the same settings and environment, our ReLU protocol has a one or even two orders of magnitude improvement to the state-of-the-art works, Falcon (PETS 2021) or Edabits (CRYPTO 2020), respectively without batch processing.
△ Less
Submitted 19 April, 2024; v1 submitted 4 October, 2022;
originally announced October 2022.
-
Requirement analysis for dE/dx measurement and PID performance at the CEPC baseline detector
Authors:
Yongfeng Zhu,
Shanzhen Chen,
Hanhua Cui,
Manqi Ruan
Abstract:
The Circular Electron-Positron Collider (CEPC) can be operated not only as a Higgs factory but also as a Z-boson factory, offering great opportunities for flavor physics studies where Particle Identification (PID) is critical. The baseline detector of the CEPC could record TOF and dE/dx information that can be used to distinguish particles of different species. We quantify the physics requirements…
▽ More
The Circular Electron-Positron Collider (CEPC) can be operated not only as a Higgs factory but also as a Z-boson factory, offering great opportunities for flavor physics studies where Particle Identification (PID) is critical. The baseline detector of the CEPC could record TOF and dE/dx information that can be used to distinguish particles of different species. We quantify the physics requirements and detector performance using physics benchmark analyzes with full simulation. We conclude that at the benchmark TOF performance of $50\,$ps, the dE/dx resolution should be better than 3% for incident particles in the barrel region with a relevant energy larger than $2\, $GeV/c. This performance leads to an efficiency/purity of $K^{\pm}$ identification 97%/96%, $D^0\to π^+K^-$ reconstruction 68.19%/89.05%, and $φ\to K^+K^-$ reconstruction 82.26%/77.70%, providing solid support for relevant CEPC flavor physics measurements.
△ Less
Submitted 14 November, 2022; v1 submitted 28 September, 2022;
originally announced September 2022.
-
D-optimal Approximate Design for Binary Regression and Quantal Response in Toxicology Studies
Authors:
Elvis Han Cui
Abstract:
We provide a systematic treatment of $D$-optimal design for binary regression and quantal response models in toxicology studies. For the two-parameter case, we provide an analytical equation (WC equation) for computing the $D$-optimal design quickly and when analytical solution is not available, we apply particle swarm optimization to solve for the $D$-optimal design. Examples with various link fu…
▽ More
We provide a systematic treatment of $D$-optimal design for binary regression and quantal response models in toxicology studies. For the two-parameter case, we provide an analytical equation (WC equation) for computing the $D$-optimal design quickly and when analytical solution is not available, we apply particle swarm optimization to solve for the $D$-optimal design. Examples with various link functions are given as well as the sensitivity functions. We extend the two-parameter case to three-parameter case by providing a neat formula for the determinant of the information matrix. We also suggest practitioners to work with the neat formula to derive optimal designs for three-parameter binary regression models.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
Virtual Inertia Scheduling for Power Systems with High Penetration of Inverter-based Resources
Authors:
Buxin She,
Fangxing Li,
Hantao Cui,
**nng Wang,
Qiwei Zhang,
Rui Bo
Abstract:
This paper proposes a new concept called virtual inertia scheduling (VIS) to efficiently handle the high penetration of inverter-based resources (IBRs). VIS is an inertia management framework that targets security-constrained and economy-oriented inertia scheduling and generation dispatch of power systems with a large scale of renewable generations. Specifically, it schedules the proper power sett…
▽ More
This paper proposes a new concept called virtual inertia scheduling (VIS) to efficiently handle the high penetration of inverter-based resources (IBRs). VIS is an inertia management framework that targets security-constrained and economy-oriented inertia scheduling and generation dispatch of power systems with a large scale of renewable generations. Specifically, it schedules the proper power setting points and reserved capacities of both synchronous generators and IBRs, as well as the control modes and control parameters of IBRs to provide secure and cost-effective inertia support. First, a uniform system model is employed to quantify the frequency dynamics of the IBRs-penetrated power system after disturbances. Based on the model, the s-domain and time-domain analytical responses of IBRs with inertia support capability are derived. Then, VIS-based real-time economic dispatch (VIS-RTED) is formulated to minimize generation and reserve costs, with a full consideration of dynamic frequency constraints and derived inertia support reserve constraints. The virtual inertia and dam** of IBRs are formulated as decision variables. To address the non-linearity of dynamic constraints, deep learning-assisted linearization is employed to solve the optimization problem. Finally, the proposed VIS-RTED is demonstrated on a modified IEEE 39-bus system. A full-order time-domain simulation is performed to verify the scheduling results.
△ Less
Submitted 14 September, 2022;
originally announced September 2022.
-
On the self-consistency of DFT-1/2
Authors:
Hanli Cui,
Shengxin Yang,
Kan-Hao Xue,
**hai Huang,
Xiangshui Miao
Abstract:
DFT-1/2 is an efficient band gap rectification method for density functional theory (DFT) under local density approximation (LDA) or generalized gradient approximation. It was suggested that non-self-consistent DFT-1/2 should be used for highly ionic insulators like LiF, while self-consistent DFT-1/2 should still be used for other compounds. Nevertheless, there is no quantitative criterion prescri…
▽ More
DFT-1/2 is an efficient band gap rectification method for density functional theory (DFT) under local density approximation (LDA) or generalized gradient approximation. It was suggested that non-self-consistent DFT-1/2 should be used for highly ionic insulators like LiF, while self-consistent DFT-1/2 should still be used for other compounds. Nevertheless, there is no quantitative criterion prescribed for which implementation should work for an arbitrary insulator, which leads to severe ambiguity in this method. In this work we analyze the impact of self-consistency in DFT-1/2 and shell DFT-1/2 calculations in insulators or semiconductors with ionic bonds, covalent bonds and intermediate cases, and show that self-consistency is required even for highly ionic insulators for globally better electronic structure details. The self-energy correction renders electrons more localized around the anions in self-consistent LDA-1/2. The well-known delocalization error of LDA is rectified, but with strong overcorrection due to the presence of additional self-energy potential. However, in non-self-consistent LDA-1/2 calculations, the electron wavefunctions indicate that such localization is much more severe and beyond a reasonable range, because the strong Coulomb repulsion is not counted in the Hamiltonian. Another common drawback of non-self-consistent LDA-1/2 lies in that the ionicity of the bonding gets substantially enhanced, and the band gap can be enormously high in mixed ionic-covalent compounds like $\mathrm{TiO_2}$. The impact of LDA-1/2-induced stress is also discussed comprehensively.
△ Less
Submitted 4 September, 2022;
originally announced September 2022.
-
Competition for popularity and interventions on a Chinese microblogging site
Authors:
Hao Cui,
János Kertész
Abstract:
Microblogging sites are important vehicles for the users to obtain information and shape public opinion thus they are arenas of continuous competition for popularity. Most popular topics are usually indicated on ranking lists. In this study, we investigate the public attention dynamics through the Hot Search List (HSL) of the Chinese microblog Sina Weibo, where trending hashtags are ranked based o…
▽ More
Microblogging sites are important vehicles for the users to obtain information and shape public opinion thus they are arenas of continuous competition for popularity. Most popular topics are usually indicated on ranking lists. In this study, we investigate the public attention dynamics through the Hot Search List (HSL) of the Chinese microblog Sina Weibo, where trending hashtags are ranked based on a multi-dimensional search volume index. We characterize the rank dynamics by the time spent by hashtags on the list, the time of the day they appear there, the rank diversity, and by the ranking trajectories. We show how the circadian rhythm affects the popularity of hashtags, and observe categories of their rank trajectories by a machine learning clustering algorithm. By analyzing patterns of ranking dynamics using various measures, we identify anomalies that are likely to result from the platform provider's intervention into the ranking, including the anchoring of hashtags to certain ranks on the HSL. We propose a simple model of ranking that explains the mechanism of this anchoring effect. We found an over-representation of hashtags related to international politics at 3 out of 4 anchoring ranks on the HSL, indicating possible manipulations of public opinion.
△ Less
Submitted 30 November, 2022; v1 submitted 22 August, 2022;
originally announced August 2022.
-
Two Heads are Better than One: Robust Learning Meets Multi-branch Models
Authors:
Dong Huang,
Qingwen Bu,
Yuhao Qing,
Haowen Pi,
Sen Wang,
Heming Cui
Abstract:
Deep neural networks (DNNs) are vulnerable to adversarial examples, in which DNNs are misled to false outputs due to inputs containing imperceptible perturbations. Adversarial training, a reliable and effective method of defense, may significantly reduce the vulnerability of neural networks and becomes the de facto standard for robust learning. While many recent works practice the data-centric phi…
▽ More
Deep neural networks (DNNs) are vulnerable to adversarial examples, in which DNNs are misled to false outputs due to inputs containing imperceptible perturbations. Adversarial training, a reliable and effective method of defense, may significantly reduce the vulnerability of neural networks and becomes the de facto standard for robust learning. While many recent works practice the data-centric philosophy, such as how to generate better adversarial examples or use generative models to produce additional training data, we look back to the models themselves and revisit the adversarial robustness from the perspective of deep feature distribution as an insightful complementarity. In this paper, we propose Branch Orthogonality adveRsarial Training (BORT) to obtain state-of-the-art performance with solely the original dataset for adversarial training. To practice our design idea of integrating multiple orthogonal solution spaces, we leverage a simple and straightforward multi-branch neural network that eclipses adversarial attacks with no increase in inference time. We heuristically propose a corresponding loss function, branch-orthogonal loss, to make each solution space of the multi-branch model orthogonal. We evaluate our approach on CIFAR-10, CIFAR-100, and SVHN against \ell_{\infty} norm-bounded perturbations of size ε= 8/255, respectively. Exhaustive experiments are conducted to show that our method goes beyond all state-of-the-art methods without any tricks. Compared to all methods that do not use additional data for training, our models achieve 67.3% and 41.5% robust accuracy on CIFAR-10 and CIFAR-100 (improving upon the state-of-the-art by +7.23% and +9.07%). We also outperform methods using a training set with a far larger scale than ours. All our models and codes are available online at https://github.com/huangd1999/BORT.
△ Less
Submitted 17 August, 2022;
originally announced August 2022.
-
Flux Variations of Cosmic Ray Air Showers Detected by LHAASO-KM2A During a Thunderstorm on 10 June 2021
Authors:
LHAASO Collaboration,
F. Aharonian,
Q. An,
Axikegu,
L. X. Bai,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Zhe Cao,
Zhen Cao,
J. Chang,
J. F. Chang,
E. S. Chen,
Liang Chen,
Liang Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
S. H. Chen,
S. Z. Chen,
T. L. Chen,
X. J. Chen
, et al. (248 additional authors not shown)
Abstract:
The Large High Altitude Air Shower Observatory (LHAASO) has three sub-arrays, KM2A, WCDA and WFCTA. The flux variations of cosmic ray air showers were studied by analyzing the KM2A data during the thunderstorm on 10 June 2021. The number of shower events that meet the trigger conditions increases significantly in atmospheric electric fields, with maximum fractional increase of 20%. The variations…
▽ More
The Large High Altitude Air Shower Observatory (LHAASO) has three sub-arrays, KM2A, WCDA and WFCTA. The flux variations of cosmic ray air showers were studied by analyzing the KM2A data during the thunderstorm on 10 June 2021. The number of shower events that meet the trigger conditions increases significantly in atmospheric electric fields, with maximum fractional increase of 20%. The variations of trigger rates (increases or decreases) are found to be strongly dependent on the primary zenith angle. The flux of secondary particles increases significantly, following a similar trend with that of the shower events. To better understand the observed behavior, Monte Carlo simulations are performed with CORSIKA and G4KM2A (a code based on GEANT4). We find that the experimental data (in saturated negative fields) are in good agreement with simulations, assuming the presence of a uniform upward electric field of 700 V/cm with a thickness of 1500 m in the atmosphere above the observation level. Due to the acceleration/deceleration and deflection by the atmospheric electric field, the number of secondary particles with energy above the detector threshold is modified, resulting in the changes in shower detection rate.
△ Less
Submitted 6 December, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
Impact of Internal Algebraic Variable Treatment on Transient Stability Simulation Performance
Authors:
Hantao Cui
Abstract:
It is a general notion that, in transient stability simulations, reducing the number of algebraic variables for the differential-algebraic equations (DAE) can improve the simulation performance. Many simulation programs split algebraic variables internal to a dynamic model from the full DAE and evaluate them outside each iterative step, using results from the previous iteration. The updated intern…
▽ More
It is a general notion that, in transient stability simulations, reducing the number of algebraic variables for the differential-algebraic equations (DAE) can improve the simulation performance. Many simulation programs split algebraic variables internal to a dynamic model from the full DAE and evaluate them outside each iterative step, using results from the previous iteration. The updated internal variables are then treated as constants when solving for the current iteration. This letter discusses how such a split formulation can impact simulation performance. Case studies using various systems with synchronous generator and converter models demonstrate the impact of the split on the convergence pattern and simulation performance.
△ Less
Submitted 6 July, 2022;
originally announced July 2022.
-
Interpretable Graph Neural Networks for Connectome-Based Brain Disorder Analysis
Authors:
Hejie Cui,
Wei Dai,
Yanqiao Zhu,
Xiaoxiao Li,
Lifang He,
Carl Yang
Abstract:
Human brains lie at the core of complex neurobiological systems, where the neurons, circuits, and subsystems interact in enigmatic ways. Understanding the structural and functional mechanisms of the brain has long been an intriguing pursuit for neuroscience research and clinical disorder therapy. Map** the connections of the human brain as a network is one of the most pervasive paradigms in neur…
▽ More
Human brains lie at the core of complex neurobiological systems, where the neurons, circuits, and subsystems interact in enigmatic ways. Understanding the structural and functional mechanisms of the brain has long been an intriguing pursuit for neuroscience research and clinical disorder therapy. Map** the connections of the human brain as a network is one of the most pervasive paradigms in neuroscience. Graph Neural Networks (GNNs) have recently emerged as a potential method for modeling complex network data. Deep models, on the other hand, have low interpretability, which prevents their usage in decision-critical contexts like healthcare. To bridge this gap, we propose an interpretable framework to analyze disorder-specific Regions of Interest (ROIs) and prominent connections. The proposed framework consists of two modules: a brain-network-oriented backbone model for disease prediction and a globally shared explanation generator that highlights disorder-specific biomarkers including salient ROIs and important connections. We conduct experiments on three real-world datasets of brain disorders. The results verify that our framework can obtain outstanding performance and also identify meaningful biomarkers. All code for this work is available at https://github.com/HennyJie/IBGNN.git.
△ Less
Submitted 23 July, 2022; v1 submitted 30 June, 2022;
originally announced July 2022.
-
Efficient Adaptive Federated Optimization of Federated Learning for IoT
Authors:
Zunming Chen,
Hongyan Cui,
Ensen Wu,
Yu Xi
Abstract:
The proliferation of the Internet of Things (IoT) and widespread use of devices with sensing, computing, and communication capabilities have motivated intelligent applications empowered by artificial intelligence. The classical artificial intelligence algorithms require centralized data collection and processing which are challenging in realistic intelligent IoT applications due to growing data pr…
▽ More
The proliferation of the Internet of Things (IoT) and widespread use of devices with sensing, computing, and communication capabilities have motivated intelligent applications empowered by artificial intelligence. The classical artificial intelligence algorithms require centralized data collection and processing which are challenging in realistic intelligent IoT applications due to growing data privacy concerns and distributed datasets. Federated Learning (FL) has emerged as a distributed privacy-preserving learning framework that enables IoT devices to train global model through sharing model parameters. However, inefficiency due to frequent parameters transmissions significantly reduce FL performance. Existing acceleration algorithms consist of two main type including local update considering trade-offs between communication and computation and parameter compression considering trade-offs between communication and precision. Jointly considering these two trade-offs and adaptively balancing their impacts on convergence have remained unresolved. To solve the problem, this paper proposes a novel efficient adaptive federated optimization (EAFO) algorithm to improve efficiency of FL, which minimizes the learning error via jointly considering two variables including local update and parameter compression and enables FL to adaptively adjust the two variables and balance trade-offs among computation, communication and precision. The experiment results illustrate that comparing with state-of-the-art algorithms, the proposed EAFO can achieve higher accuracies faster.
△ Less
Submitted 22 June, 2022;
originally announced June 2022.
-
Decentralized and Coordinated Vf Control for Islanded Microgrids Considering DER Inadequacy and Demand Control
Authors:
Buxin She,
Fangxing Li,
Hantao Cui,
**ning Wang,
Liang Min,
Oroghene Oboreh Snapps,
Rui Bo
Abstract:
This paper proposes a decentralized and coordinated voltage and frequency (Vf) control framework for islanded microgrids, with full consideration of the limited capacity of distributed energy resources (DERs) and Vf dependent load. First, the concept of DER inadequacy is illustrated with the challenges it poses. Then, a decentralized and coordinated control framework is proposed to regulate the ou…
▽ More
This paper proposes a decentralized and coordinated voltage and frequency (Vf) control framework for islanded microgrids, with full consideration of the limited capacity of distributed energy resources (DERs) and Vf dependent load. First, the concept of DER inadequacy is illustrated with the challenges it poses. Then, a decentralized and coordinated control framework is proposed to regulate the output of inverter based generations and reallocate limited DER capacity for Vf control. The control framework is composed of a power regulator and a Vf regulator, which generates the supplementary signals for the primary controller. The power regulator regulates the output of grid forming inverters according to the real time capacity constraints of DERs, while the Vf regulator improves the Vf deviation by leveraging the load sensitivity to Vf. Next, the static feasibility and small signal stability of the proposed method are rigorously proven through mathematical formulation and eigenvalue analysis. Finally, a MATLAB Simulink simulation demonstrates the functionalities of the control framework. A few goals are fulfilled within the decentralized and coordinated framework, such as making the best use of limited DERs capacity, enhancing the DC side stability of inverter based generations, and reducing involuntary load shedding.
△ Less
Submitted 8 April, 2023; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Fusion of Model-free Reinforcement Learning with Microgrid Control: Review and Vision
Authors:
Buxin She,
Fangxing Li,
Hantao Cui,
**gqiu Zhang,
Rui Bo
Abstract:
Challenges and opportunities coexist in microgrids as a result of emerging large-scale distributed energy resources (DERs) and advanced control techniques. In this paper, a comprehensive review of microgrid control is presented with its fusion of model-free reinforcement learning (MFRL). A high-level research map of microgrid control is developed from six distinct perspectives, followed by bottom-…
▽ More
Challenges and opportunities coexist in microgrids as a result of emerging large-scale distributed energy resources (DERs) and advanced control techniques. In this paper, a comprehensive review of microgrid control is presented with its fusion of model-free reinforcement learning (MFRL). A high-level research map of microgrid control is developed from six distinct perspectives, followed by bottom-level modularized control blocks illustrating the configurations of grid-following (GFL) and grid-forming (GFM) inverters. Then, mainstream MFRL algorithms are introduced with an explanation of how MFRL can be integrated into the existing control framework. Next, the application guideline of MFRL is summarized with a discussion of three fusing approaches, i.e., model identification and parameter tuning, supplementary signal generation, and controller substitution, with the existing control framework. Finally, the fundamental challenges associated with adopting MFRL in microgrid control and corresponding insights for addressing these concerns are fully discussed.
△ Less
Submitted 6 February, 2023; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Data-Efficient Brain Connectome Analysis via Multi-Task Meta-Learning
Authors:
Yi Yang,
Yanqiao Zhu,
Hejie Cui,
Xuan Kan,
Lifang He,
Ying Guo,
Carl Yang
Abstract:
Brain networks characterize complex connectivities among brain regions as graph structures, which provide a powerful means to study brain connectomes. In recent years, graph neural networks have emerged as a prevalent paradigm of learning with structured data. However, most brain network datasets are limited in sample sizes due to the relatively high cost of data acquisition, which hinders the dee…
▽ More
Brain networks characterize complex connectivities among brain regions as graph structures, which provide a powerful means to study brain connectomes. In recent years, graph neural networks have emerged as a prevalent paradigm of learning with structured data. However, most brain network datasets are limited in sample sizes due to the relatively high cost of data acquisition, which hinders the deep learning models from sufficient training. Inspired by meta-learning that learns new concepts fast with limited training examples, this paper studies data-efficient training strategies for analyzing brain connectomes in a cross-dataset setting. Specifically, we propose to meta-train the model on datasets of large sample sizes and transfer the knowledge to small datasets. In addition, we also explore two brain-network-oriented designs, including atlas transformation and adaptive task reweighing. Compared to other pre-training strategies, our meta-learning-based approach achieves higher and stabler performance, which demonstrates the effectiveness of our proposed solutions. The framework is also able to derive new insights regarding the similarities among datasets and diseases in a data-driven fashion.
△ Less
Submitted 9 June, 2022;
originally announced June 2022.
-
FBNETGEN: Task-aware GNN-based fMRI Analysis via Functional Brain Network Generation
Authors:
Xuan Kan,
Hejie Cui,
Joshua Lukemire,
Ying Guo,
Carl Yang
Abstract:
Functional magnetic resonance imaging (fMRI) is one of the most common imaging modalities to investigate brain functions. Recent studies in neuroscience stress the great potential of functional brain networks constructed from fMRI data for clinical predictions. Traditional functional brain networks, however, are noisy and unaware of downstream prediction tasks, while also incompatible with the dee…
▽ More
Functional magnetic resonance imaging (fMRI) is one of the most common imaging modalities to investigate brain functions. Recent studies in neuroscience stress the great potential of functional brain networks constructed from fMRI data for clinical predictions. Traditional functional brain networks, however, are noisy and unaware of downstream prediction tasks, while also incompatible with the deep graph neural network (GNN) models. In order to fully unleash the power of GNNs in network-based fMRI analysis, we develop FBNETGEN, a task-aware and interpretable fMRI analysis framework via deep brain network generation. In particular, we formulate (1) prominent region of interest (ROI) features extraction, (2) brain networks generation, and (3) clinical predictions with GNNs, in an end-to-end trainable model under the guidance of particular prediction tasks. Along with the process, the key novel component is the graph generator which learns to transform raw time-series features into task-oriented brain networks. Our learnable graphs also provide unique interpretations by highlighting prediction-related brain regions. Comprehensive experiments on two datasets, i.e., the recently released and currently largest publicly available fMRI dataset Adolescent Brain Cognitive Development (ABCD), and the widely-used fMRI dataset PNC, prove the superior effectiveness and interpretability of FBNETGEN. The implementation is available at https://github.com/Wayfear/FBNETGEN.
△ Less
Submitted 29 May, 2022; v1 submitted 24 May, 2022;
originally announced May 2022.
-
Design and testing of LGAD sensor with shallow carbon implantation
Authors:
Kewei Wu,
Xuewei Jia,
Tao Yang,
Mengzhao Li,
Wei Wang,
Mei Zhao,
Zhijun Liang,
Joao Guimaraes da Costa,
Yunyun Fan,
Han Cui,
Alissa Howard,
Gregor Kramberger,
Xin Shi,
Yuekun Heng,
Yuhang Tan,
Bo Liu,
Yuan Feng,
Shuqi Li,
Mengran Li,
Chengjun Yu,
Xuan Yang,
Mingjie Zhai,
Gaobo Xu,
Gang** Yan,
Qionghua Zhai
, et al. (4 additional authors not shown)
Abstract:
The low gain avalanche detectors (LGADs) are thin sensors with fast charge collection which in combination with internal gain deliver an outstanding time resolution of about 30 ps. High collision rates and consequent large particle rates crossing the detectors at the upgraded Large Hadron Collider (LHC) in 2028 will lead to radiation damage and deteriorated performance of the LGADs. The main conse…
▽ More
The low gain avalanche detectors (LGADs) are thin sensors with fast charge collection which in combination with internal gain deliver an outstanding time resolution of about 30 ps. High collision rates and consequent large particle rates crossing the detectors at the upgraded Large Hadron Collider (LHC) in 2028 will lead to radiation damage and deteriorated performance of the LGADs. The main consequence of radiation damage is loss of gain layer do** (acceptor removal) which requires an increase of bias voltage to compensate for the loss of charge collection efficiency and consequently time resolution. The Institute of High Energy Physics (IHEP), Chinese Academy of Sciences (CAS) has developed a process based on the Institute of Microelectronics (IME), CAS capability to enrich the gain layer with carbon to reduce the acceptor removal effect by radiation. After 1 MeV neutron equivalent fluence of 2.5$\times$10$^{15}$ n$_{eq}$/cm$^{2}$, which is the maximum fluence to which sensors will be exposed at ATLAS High Granularity Timing Detector (HGTD), the IHEP-IME second version (IHEP-IMEv2) 50 $μ$m LGAD sensors already deliver adequate charge collection > 4 fC and time resolution < 50 ps at voltages < 400 V. The operation voltages of these 50 $μ$m devices are well below those at which single event burnout may occur.
△ Less
Submitted 31 May, 2022; v1 submitted 10 May, 2022;
originally announced May 2022.
-
Temporally and Spatially variant-resolution illumination patterns in computational ghost imaging
Authors:
Dong Zhou,
Jie Cao,
Huan Cui,
Li-Xing Lin,
Haoyu Zhang,
Yingqiang Zhang,
Qun Hao
Abstract:
Conventional computational ghost imaging (CGI) uses light carrying a sequence of patterns with uniform-resolution to illuminate the object, then performs correlation calculation based on the light intensity value reflected by the target and the preset patterns to obtain object image. It requires a large number of measurements to obtain high-quality images, especially if high-resolution images are…
▽ More
Conventional computational ghost imaging (CGI) uses light carrying a sequence of patterns with uniform-resolution to illuminate the object, then performs correlation calculation based on the light intensity value reflected by the target and the preset patterns to obtain object image. It requires a large number of measurements to obtain high-quality images, especially if high-resolution images are to be obtained. To solve this problem, we developed temporally variable-resolution illumination patterns, replacing the conventional uniform-resolution illumination patterns with a sequence of patterns of different imaging resolutions. In addition, we propose to combine temporally variable-resolution illumination patterns and spatially variable-resolution structure to develop temporally and spatially variable-resolution (TSV) illumination patterns, which not only improve the imaging quality of the region of interest (ROI) but also improve the robustness to noise. The methods using proposed illumination patterns are verified by simulations and experiments compared with CGI. For the same number of measurements, the method using temporally variable-resolution illumination patterns has better imaging quality than CGI, but it is less robust to noise. The method using TSV illumination patterns has better imaging quality in ROI than the method using temporally variable-resolution illumination patterns and CGI under the same number of measurements. We also experimentally verify that the method using TSV patterns have better imaging performance when applied to higher resolution imaging. The proposed methods are expected to solve the current computational ghost imaging that is difficult to achieve high-resolution and high-quality imaging.
△ Less
Submitted 14 May, 2022; v1 submitted 5 May, 2022;
originally announced May 2022.
-
Tracking, Profiling, and Ad Targeting in the Alexa Echo Smart Speaker Ecosystem
Authors:
Umar Iqbal,
Pouneh Nikkhah Bahrami,
Rahmadi Trimananda,
Hao Cui,
Alexander Gamero-Garrido,
Daniel Dubois,
David Choffnes,
Athina Markopoulou,
Franziska Roesner,
Zubair Shafiq
Abstract:
Smart speakers collect voice commands, which can be used to infer sensitive information about users. Given the potential for privacy harms, there is a need for greater transparency and control over the data collected, used, and shared by smart speaker platforms as well as third party skills supported on them. To bridge this gap, we build a framework to measure data collection, usage, and sharing b…
▽ More
Smart speakers collect voice commands, which can be used to infer sensitive information about users. Given the potential for privacy harms, there is a need for greater transparency and control over the data collected, used, and shared by smart speaker platforms as well as third party skills supported on them. To bridge this gap, we build a framework to measure data collection, usage, and sharing by the smart speaker platforms. We apply our framework to the Amazon smart speaker ecosystem. Our results show that Amazon and third parties, including advertising and tracking services that are unique to the smart speaker ecosystem, collect smart speaker interaction data. We also find that Amazon processes smart speaker interaction data to infer user interests and uses those inferences to serve targeted ads to users. Smart speaker interaction also leads to ad targeting and as much as 30X higher bids in ad auctions, from third party advertisers. Finally, we find that Amazon's and third party skills' data practices are often not clearly disclosed in their policy documents.
△ Less
Submitted 13 October, 2023; v1 submitted 22 April, 2022;
originally announced April 2022.
-
Exchange bias in van der Waals MnBi$_2$Te$_4$/Cr$_2$Ge$_2$Te$_6$ heterostructure
Authors:
**g-Zhi Fang,
Hao-Nan Cui,
Shuo Wang,
**g-Di Lu,
Xin-Jie Liu,
Guang-Yu Zhu,
Mao-Sen Qin,
Jian-Kun Wang,
Ze-Nan Wu,
Yan-Fei Wu,
Shou-Guo Wang,
Zhongming Wei,
**xing Zhang,
Ben-Chuan Lin,
Zhi-Min Liao,
Dapeng Yu
Abstract:
The layered van der Waals (vdW) material MnBi$_2$Te$_4$ is an intrinsic magnetic topological insulator with various topological phases such as quantum anomalous Hall effect (QAHE) and axion states. However, both the zero-field and high-temperature QAHE are not easy to realize. It is theoretically proposed that the exchange bias can be introduced in the MnBi2Te4/ferromagnetic (FM) insulator heteros…
▽ More
The layered van der Waals (vdW) material MnBi$_2$Te$_4$ is an intrinsic magnetic topological insulator with various topological phases such as quantum anomalous Hall effect (QAHE) and axion states. However, both the zero-field and high-temperature QAHE are not easy to realize. It is theoretically proposed that the exchange bias can be introduced in the MnBi2Te4/ferromagnetic (FM) insulator heterostructures and thus opens the surface states gap, making it easier to realize the zero-field or high-temperature QAHE. Here we report the electrically tunable exchange bias in the van der Waals MnBi$_2$Te$_4$/Cr$_2$Ge$_2$Te$_6$ heterostructure. The exchange bias emerges over a critical magnetic field and reaches the maximum value near the magnetic band gap. Moreover, the exchange bias was experienced by the antiferromagnetic (AFM) MnBi$_2$Te$_4$ layer rather than the FM layer. Such van der Waals heterostructure provides a promising platform to study the novel exchange bias effect and explore the possible high-temperature QAHE.
△ Less
Submitted 21 April, 2022;
originally announced April 2022.
-
Importance is in your attention: agent importance prediction for autonomous driving
Authors:
Christopher Hazard,
Akshay Bhagat,
Balarama Raju Buddharaju,
Zhongtao Liu,
Yunming Shao,
Lu Lu,
Sammy Omari,
Henggang Cui
Abstract:
Trajectory prediction is an important task in autonomous driving. State-of-the-art trajectory prediction models often use attention mechanisms to model the interaction between agents. In this paper, we show that the attention information from such models can also be used to measure the importance of each agent with respect to the ego vehicle's future planned trajectory. Our experiment results on t…
▽ More
Trajectory prediction is an important task in autonomous driving. State-of-the-art trajectory prediction models often use attention mechanisms to model the interaction between agents. In this paper, we show that the attention information from such models can also be used to measure the importance of each agent with respect to the ego vehicle's future planned trajectory. Our experiment results on the nuPlans dataset show that our method can effectively find and rank surrounding agents by their impact on the ego's plan.
△ Less
Submitted 19 April, 2022;
originally announced April 2022.
-
BrainGB: A Benchmark for Brain Network Analysis with Graph Neural Networks
Authors:
Hejie Cui,
Wei Dai,
Yanqiao Zhu,
Xuan Kan,
Antonio Aodong Chen Gu,
Joshua Lukemire,
Liang Zhan,
Lifang He,
Ying Guo,
Carl Yang
Abstract:
Map** the connectome of the human brain using structural or functional connectivity has become one of the most pervasive paradigms for neuroimaging analysis. Recently, Graph Neural Networks (GNNs) motivated from geometric deep learning have attracted broad interest due to their established power for modeling complex networked data. Despite their superior performance in many fields, there has not…
▽ More
Map** the connectome of the human brain using structural or functional connectivity has become one of the most pervasive paradigms for neuroimaging analysis. Recently, Graph Neural Networks (GNNs) motivated from geometric deep learning have attracted broad interest due to their established power for modeling complex networked data. Despite their superior performance in many fields, there has not yet been a systematic study of how to design effective GNNs for brain network analysis. To bridge this gap, we present BrainGB, a benchmark for brain network analysis with GNNs. BrainGB standardizes the process by (1) summarizing brain network construction pipelines for both functional and structural neuroimaging modalities and (2) modularizing the implementation of GNN designs. We conduct extensive experiments on datasets across cohorts and modalities and recommend a set of general recipes for effective GNN designs on brain networks. To support open and reproducible research on GNN-based brain network analysis, we host the BrainGB website at https://braingb.us with models, tutorials, examples, as well as an out-of-box Python package. We hope that this work will provide useful empirical evidence and offer insights for future research in this novel and promising direction.
△ Less
Submitted 28 November, 2022; v1 submitted 17 March, 2022;
originally announced April 2022.