-
Boosting the Performance of Degraded Reads in RS-coded Distributed Storage Systems
Authors:
Tian Xie,
Juntao Fang,
Shenggang Wan,
Changsheng Xie,
Xubin He
Abstract:
Reed-Solomon (RS) codes have been increasingly adopted by distributed storage systems in place of replication, because they provide the same level of availability with much lower storage overhead. However, a key drawback of those RS-coded distributed storage systems is the poor latency of degraded reads, which can be incurred by data failures or hot spots and are not rare in production environments. To address this issue, we propose a novel parallel reconstruction solution called APLS. APLS leverages all surviving source nodes to send the data needed by degraded reads and chooses light-loaded starter nodes to receive the reconstructed data of those degraded reads. Hence, the latency of the degraded reads can be improved. Prototype-based experiments are conducted to compare APLS with ECPipe, the state-of-the-art solution for improving the latency of degraded reads. The experimental results demonstrate that APLS effectively reduces the latency, particularly under heavy or medium workloads.
Submitted 18 June, 2023;
originally announced June 2023.
-
SelFLoc: Selective Feature Fusion for Large-scale Point Cloud-based Place Recognition
Authors:
Qibo Qiu,
Haiming Gao,
Wenxiao Wang,
Zhiyi Su,
Tian Xie,
Wei Hua,
Xiaofei He
Abstract:
Point cloud-based place recognition is crucial for mobile robots and autonomous vehicles, especially when the global positioning sensor is not accessible. LiDAR points are scattered on the surface of objects and buildings, which have strong shape priors along different axes. To enhance message passing along particular axes, a Stacked Asymmetric Convolution Block (SACB) is designed, which is one of the main contributions of this paper. Comprehensive experiments demonstrate that asymmetric convolution and the corresponding strategies employed by SACB contribute to a more effective representation of point cloud features. On this basis, a Selective Feature Fusion Block (SFFB), which is formed by stacking point- and channel-wise gating layers in a predefined sequence, is proposed to selectively boost salient local features in certain key regions, as well as to align the features before the fusion phase. SACBs and SFFBs are combined to construct a robust and accurate architecture for point cloud-based place recognition, which is termed SelFLoc. Comparative experimental results show that SelFLoc achieves state-of-the-art (SOTA) performance on the Oxford benchmark and three in-house benchmarks, with an improvement of 1.6 absolute percentage points on mean average recall@1.
Submitted 5 June, 2023; v1 submitted 1 June, 2023;
originally announced June 2023.
-
OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities
Authors:
Yuanzhen Xie,
Tao Xie,
Mingxiong Lin,
WenTao Wei,
Chenglin Li,
Beibei Kong,
Lei Chen,
Chengxiang Zhuo,
Bo Hu,
Zang Li
Abstract:
In most current research, large language models (LLMs) are able to perform reasoning tasks by generating chains of thought through the guidance of specific prompts. However, there still exists a significant discrepancy between their capability in solving complex reasoning problems and that of humans. At present, most approaches focus on chains of thought (COT) and tool use, without considering the adoption and application of human cognitive frameworks. It is well known that when confronting complex reasoning challenges, humans typically employ various cognitive abilities and need to interact with tools, knowledge, and external environment information to accomplish intricate tasks. This paper introduces a novel intelligent framework, referred to as OlaGPT. OlaGPT carefully studies a cognitive architecture framework and proposes to simulate certain aspects of human cognition. The framework involves approximating different cognitive modules, including attention, memory, reasoning, learning, and corresponding scheduling and decision-making mechanisms. Inspired by the active learning mechanism of human beings, it proposes a learning unit to record previous mistakes and expert opinions, and dynamically refers to them to strengthen its ability to solve similar problems. The paper also outlines common effective reasoning frameworks for human problem-solving and designs Chain-of-Thought (COT) templates accordingly. A comprehensive decision-making mechanism is also proposed to maximize model accuracy. The efficacy of OlaGPT has been stringently evaluated on multiple reasoning datasets, and the experimental outcomes reveal that OlaGPT surpasses state-of-the-art benchmarks, demonstrating its superior performance. Our implementation of OlaGPT is available on GitHub: https://github.com/oladata-team/OlaGPT.
Submitted 23 May, 2023;
originally announced May 2023.
-
Sensing orbital hybridization of graphene-diamond interface with a single spin
Authors:
Yucheng Hao,
Zhi** Yang,
Zeyu Li,
Xi Kong,
Wenna Tang,
Tianyu Xie,
Shaoyi Xu,
Xiangyu Ye,
Pei Yu,
Pengfei Wang,
Ya Wang,
Zhenhua Qiao,
Libo Gao,
Jian-Hua Jiang,
Fazhan Shi,
Jiangfeng Du
Abstract:
Interfacial interactions are crucial in a variety of fields and can greatly affect the electric, magnetic, and chemical properties of materials. Among them, interface orbital hybridization plays a fundamental role in the properties of surface electrons such as dispersion, interaction, and ground states. Conventional measurements of electronic states at interfaces, such as scanning tunneling microscopy, are all based on electric interactions, which, however, strongly perturb these electrons. Here we unveil a new experimental detection of interface electrons based on the weak magnetic interactions between them and the nitrogen-vacancy (NV) center in diamond. With negligible perturbation on the interface electrons, their physical properties can be revealed by the NV spin coherence time. In our system, the interface interaction leads to significant decreases in both the density and coherence time of the electron spins at the diamond-graphene interface. Furthermore, together with electron spin resonance spectra and first-principles calculations, we can retrieve the effect of interface electron orbital hybridization. Our study opens a new pathway toward the microscopic probing of interfacial electronic states with weak magnetic interactions and provides a new avenue for future research on material interfaces.
Submitted 16 May, 2023;
originally announced May 2023.
-
Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction
Authors:
Danyang Zhang,
Zhennan Shen,
Rui Xie,
Situo Zhang,
Tianbao Xie,
Zihan Zhao,
Siyuan Chen,
Lu Chen,
Hongshen Xu,
Ruisheng Cao,
Kai Yu
Abstract:
The Graphical User Interface (GUI) is pivotal for human interaction with the digital world, enabling efficient device control and the completion of complex tasks. Recent progress in Large Language Models (LLMs) and Vision Language Models (VLMs) offers the chance to create advanced GUI agents. To ensure their effectiveness, there is a pressing need for qualified benchmarks that provide trustworthy and reproducible evaluations, a challenge current benchmarks often fail to address. To tackle this issue, we introduce Mobile-Env, a comprehensive toolkit tailored for creating GUI benchmarks in the Android mobile environment. Mobile-Env offers an isolated and controllable setting for reliable evaluations, and accommodates intermediate instructions and rewards to reflect real-world usage more naturally. Utilizing Mobile-Env, we collect an open-world task set across various real-world apps and a fixed-world set, WikiHow, which captures a significant amount of dynamic online content for fully controllable and reproducible evaluation. We conduct comprehensive evaluations of LLM agents using these benchmarks. Our findings reveal that even advanced models (e.g., GPT-4V and LLaMA-3) struggle with tasks that are relatively simple for humans. This highlights a crucial gap in current models and underscores the importance of developing more capable foundation models and more effective GUI agent frameworks.
Submitted 13 June, 2024; v1 submitted 14 May, 2023;
originally announced May 2023.
-
Synthesizing PET images from High-field and Ultra-high-field MR images Using Joint Diffusion Attention Model
Authors:
Taofeng Xie,
Chentao Cao,
Zhuoxu Cui,
Yu Guo,
Caiying Wu,
Xuemei Wang,
Qingneng Li,
Zhanli Hu,
Tao Sun,
Ziru Sang,
Yihang Zhou,
Yanjie Zhu,
Dong Liang,
Qiyu **,
Hongwu Zeng,
Guoqing Chen,
Haifeng Wang
Abstract:
MRI and PET are crucial diagnostic tools for brain diseases, as they provide complementary information on brain structure and function. However, PET scanning is costly and involves radioactive exposure, resulting in a lack of PET data. Moreover, simultaneous PET and MRI at ultra-high field is currently hardly feasible. Ultra-high-field imaging has unquestionably proven valuable in both clinical and academic settings, especially in the field of cognitive neuroimaging. These observations motivate us to propose a method for synthesizing PET from high-field MRI and ultra-high-field MRI. From a statistical perspective, the joint probability distribution (JPD) is the most direct and fundamental means of portraying the correlation between PET and MRI. This paper proposes a novel joint diffusion attention model, named JDAM, which combines the joint probability distribution with an attention strategy. JDAM has a diffusion process and a sampling process. The diffusion process gradually diffuses PET to Gaussian noise by adding Gaussian noise, while MRI remains fixed. The JPD of MRI and noise-added PET is learned in the diffusion process. The sampling process is a predictor-corrector scheme: the predictor is a reverse diffusion process and the corrector is Langevin dynamics. PET images are generated from MRI using the JPD of MRI and noise-added PET. Experimental results on the public Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset demonstrate that the proposed method outperforms the state-of-the-art CycleGAN for high-field MRI (3T MRI). Finally, synthesis of PET images from ultra-high-field MRI (5T MRI and 7T MRI) is attempted, providing a possibility for ultra-high-field PET-MRI imaging.
Submitted 19 June, 2024; v1 submitted 5 May, 2023;
originally announced May 2023.
-
A Spatial Calibration Method for Robust Cooperative Perception
Authors:
Zhiying Song,
Tenghui Xie,
Hailiang Zhang,
Jiaxin Liu,
Fuxi Wen,
Jun Li
Abstract:
Cooperative perception is a promising technique for intelligent and connected vehicles through vehicle-to-everything (V2X) cooperation, provided that accurate pose information and relative pose transforms are available. Nevertheless, obtaining precise positioning information often entails high costs associated with navigation systems. Hence, it is required to calibrate relative pose information for multi-agent cooperative perception. This paper proposes a simple but effective object association approach named context-based matching (CBM), which identifies inter-agent object correspondences using intra-agent geometrical context. In detail, this method constructs contexts using the relative position of the detected bounding boxes, followed by local context matching and global consensus maximization. The optimal relative pose transform is estimated based on the matched correspondences, followed by cooperative perception fusion. Extensive experiments are conducted on both the simulated and real-world datasets. Even with larger inter-agent localization errors, high object association precision and decimeter-level relative pose calibration accuracy are achieved among the cooperating agents.
Submitted 22 February, 2024; v1 submitted 24 April, 2023;
originally announced April 2023.
-
Inverse Design of Next-generation Superconductors Using Data-driven Deep Generative Models
Authors:
Daniel Wines,
Tian Xie,
Kamal Choudhary
Abstract:
Finding new superconductors with a high critical temperature ($T_c$) has been a challenging task due to computational and experimental costs. We present a diffusion model inspired by the computer vision community to generate new superconductors with unique structures and chemical compositions. Specifically, we used a crystal diffusion variational autoencoder (CDVAE) along with atomistic line graph neural network (ALIGNN) pretrained models and the Joint Automated Repository for Various Integrated Simulations (JARVIS) superconducting database of density functional theory (DFT) calculations to generate new superconductors with a high success rate. We started with a DFT dataset of $\approx$1000 superconducting materials to train the diffusion model. We used the model to generate 3000 new structures, which, after screening with pretrained ALIGNN models, yielded 61 candidates. For the top candidates, we performed DFT calculations for validation. Such approaches go beyond the funnel-like materials design approaches and allow for the inverse design of next-generation materials.
Submitted 27 July, 2023; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Large Language Models as Master Key: Unlocking the Secrets of Materials Science with GPT
Authors:
Tong Xie,
Yuwei Wan,
Wei Huang,
Yufei Zhou,
Yixuan Liu,
Qingyuan Linghu,
Shaozhou Wang,
Chunyu Kit,
Clara Grazian,
Wenjie Zhang,
Bram Hoex
Abstract:
The amount of data has growing significance in exploring cutting-edge materials, and a number of datasets have been generated either by hand or by automated approaches. However, the materials science field struggles to effectively utilize the abundance of data, especially in applied disciplines where materials are evaluated based on device performance rather than their properties. This article presents a new natural language processing (NLP) task called structured information inference (SII) to address the complexities of information extraction at the device level in materials science. We accomplished this task by tuning GPT-3 on an existing perovskite solar cell FAIR (Findable, Accessible, Interoperable, Reusable) dataset with a 91.8% F1-score and extended the dataset with data published since its release. The produced data is formatted and normalized, enabling its direct use as input in subsequent data analysis. This feature empowers materials scientists to develop models by selecting high-quality review articles within their domain. Additionally, we designed experiments to predict the electrical performance of solar cells and to design materials or devices with targeted parameters using large language models (LLMs). Our results demonstrate comparable performance to traditional machine learning methods without feature selection, highlighting the potential of LLMs to acquire scientific knowledge and design new materials akin to materials scientists.
Submitted 12 April, 2023; v1 submitted 5 April, 2023;
originally announced April 2023.
-
Continuous spin excitations in the three-dimensional frustrated magnet K2Ni2(SO4)3
Authors:
Weiliang Yao,
Qing Huang,
Tao Xie,
Andrey Podlesnyak,
Alexander Brassington,
Chengkun Xing,
Ranuri S. Dissanayaka Mudiyanselage,
Weiwei Xie,
Shengzhi Zhang,
Minseong Lee,
Vivien S. Zapf,
Xiaojian Bai,
D. Alan Tennant,
Jian Liu,
Haidong Zhou
Abstract:
Continuous spin excitations are widely recognized as one of the hallmarks of novel spin states in quantum magnets, such as quantum spin liquids (QSLs). Here, we report the observation of such excitations in K2Ni2(SO4)3, which consists of two sets of intersected spin-1 Ni2+ trillium lattices. Our inelastic neutron scattering measurement on single crystals clearly shows a dominant excitation continuum, which exhibits a temperature-dependent behavior distinct from that of spin waves and is rooted in strong quantum spin fluctuations. Further, using the self-consistent Gaussian approximation method, we determine that the fourth- and fifth-nearest-neighbor exchange interactions are dominant. These two bonds together form a unique three-dimensional network of corner-sharing tetrahedra, which we name the "hyper-trillium" lattice. Our results provide direct evidence for the existence of QSL features in K2Ni2(SO4)3 and highlight the potential for the hyper-trillium lattice to host frustrated quantum magnetism.
Submitted 28 March, 2023;
originally announced March 2023.
-
Adversarial Model for Offline Reinforcement Learning
Authors:
Mohak Bhardwaj,
Tengyang Xie,
Byron Boots,
Nan Jiang,
Ching-An Cheng
Abstract:
We propose a novel model-based offline Reinforcement Learning (RL) framework, called Adversarial Model for Offline Reinforcement Learning (ARMOR), which can robustly learn policies to improve upon an arbitrary reference policy regardless of data coverage. ARMOR is designed to optimize policies for the worst-case performance relative to the reference policy through adversarially training a Markov decision process model. In theory, we prove that ARMOR, with a well-tuned hyperparameter, can compete with the best policy within data coverage when the reference policy is supported by the data. At the same time, ARMOR is robust to hyperparameter choices: the policy learned by ARMOR, with "any" admissible hyperparameter, would never degrade the performance of the reference policy, even when the reference policy is not covered by the dataset. To validate these properties in practice, we design a scalable implementation of ARMOR, which, by adversarial training, can optimize policies without using model ensembles, in contrast to typical model-based methods. We show that ARMOR achieves performance competitive with both state-of-the-art offline model-free and model-based RL algorithms and can robustly improve the reference policy over various hyperparameter choices.
Submitted 24 December, 2023; v1 submitted 21 February, 2023;
originally announced February 2023.
-
Reliability Assurance for Deep Neural Network Architectures Against Numerical Defects
Authors:
Linyi Li,
Yuhao Zhang,
Luyao Ren,
Yingfei Xiong,
Tao Xie
Abstract:
With the widespread deployment of deep neural networks (DNNs), ensuring the reliability of DNN-based systems is of great importance. Serious reliability issues such as system failures can be caused by numerical defects, one of the most frequent defects in DNNs. To assure high reliability against numerical defects, in this paper, we propose the RANUM approach including novel techniques for three reliability assurance tasks: detection of potential numerical defects, confirmation of potential-defect feasibility, and suggestion of defect fixes. To the best of our knowledge, RANUM is the first approach that confirms potential-defect feasibility with failure-exhibiting tests and suggests fixes automatically. Extensive experiments on the benchmarks of 63 real-world DNN architectures show that RANUM outperforms state-of-the-art approaches across the three reliability assurance tasks. In addition, when the RANUM-generated fixes are compared with developers' fixes on open-source projects, in 37 out of 40 cases, RANUM-generated fixes are equivalent to or even better than human fixes.
Submitted 23 April, 2023; v1 submitted 12 February, 2023;
originally announced February 2023.
-
OAMatcher: An Overlapping Areas-based Network for Accurate Local Feature Matching
Authors:
Kun Dai,
Tao Xie,
Ke Wang,
Zhiqiang Jiang,
Ruifeng Li,
Lijun Zhao
Abstract:
Local feature matching is an essential component in many visual applications. In this work, we propose OAMatcher, a Transformer-based detector-free method that imitates human behavior to generate dense and accurate matches. Firstly, OAMatcher predicts overlapping areas to promote effective and clean global context aggregation, with the key insight that humans focus on the overlapping areas instead of the entire images after multiple observations when matching keypoints in image pairs. Technically, we first perform global information integration across all keypoints to imitate the human behavior of observing the entire images at the beginning of feature matching. Then, we propose an Overlapping Areas Prediction Module (OAPM) to capture the keypoints in co-visible regions and conduct feature enhancement among them, simulating how humans shift their focus from the entire images to the overlapping regions, hence realizing effective information exchange without interference from the keypoints in non-overlapping areas. Besides, since humans tend to leverage probability to determine whether the match labels are correct or not, we propose a Match Labels Weight Strategy (MLWS) to generate the coefficients used to appraise the reliability of the ground-truth match labels, while alleviating the influence of measurement noise coming from the data. Moreover, we integrate depth-wise convolution into the Transformer encoder layers to ensure that OAMatcher extracts local and global feature representations concurrently. Comprehensive experiments demonstrate that OAMatcher outperforms the state-of-the-art methods on several benchmarks, while exhibiting excellent robustness to extreme appearance variants. The source code is available at https://github.com/DK-HU/OAMatcher.
Submitted 11 February, 2023;
originally announced February 2023.
-
Effective Random Test Generation for Deep Learning Compilers
Authors:
Luyao Ren,
ZiHeng Wang,
Yingfei Xiong,
Li Zhang,
Guoyue Jiang,
Tao Xie
Abstract:
Deep learning compilers help address the difficulties of deploying deep learning models on diverse types of hardware. Testing deep learning compilers is crucial, because they affect countless AI applications that use them for model optimization and deployment. To test deep learning compilers, random testing, which is popular in compiler testing practice, faces the challenge of generating semantically valid test inputs, i.e., deep learning models that satisfy the semantic model specifications (semantic specifications for short). To tackle this challenge, in this paper, we propose a novel approach named Isra, including a domain-specific constraint solver that resolves the constraints from the semantic specifications without backtracking. We implement our approach and apply it to three popular real-world deep learning compilers: TVM, Glow, and a commercial compiler. The evaluation results show that Isra is more effective than the state-of-the-art approaches and the baseline approaches at constructing valid test inputs for compiler-bug detection, and Isra successfully finds 24 previously unknown bugs in released versions of the three compilers. These results indicate the effectiveness and practical value of Isra.
Submitted 4 February, 2023; v1 submitted 1 February, 2023;
originally announced February 2023.
-
Ferromagnetism of sputtered Fe3GeTe2 ultrathin films in the absence of two-dimensional crystalline order
Authors:
Qianwen Zhao,
ChaoChao Xia,
Hanying Zhang,
Baiqing Jiang,
Tunan Xie,
Kaihua Lou,
Chong Bi
Abstract:
The discovery of ferromagnetism in two-dimensional (2D) monolayers has stimulated growing research interest in both spintronics and material science. However, these 2D ferromagnetic layers are mainly prepared through an approach that is incompatible with large-scale fabrication and integration, and moreover, the fundamental question of whether the observed ferromagnetism actually correlates with the 2D crystalline order has not been explored. Here, we choose a typical 2D ferromagnetic material, Fe3GeTe2, to address these two issues by investigating its ferromagnetism in an amorphous state. We have fabricated nanometer-thick amorphous Fe3GeTe2 films approaching the monolayer thickness limit of crystallized Fe3GeTe2 (0.8 nm) through magnetron sputtering. Compared to crystallized Fe3GeTe2, we found that the basic ferromagnetic attributes, such as the Curie temperature that directly reflects magnetic exchange interactions and local anisotropic energy, do not change significantly in the amorphous states. This is attributed to the fact that the short-range atomic order, as confirmed by valence state analysis, is almost the same for both phases. The persistence of ferromagnetism in the ultrathin amorphous counterpart has also been confirmed through magnetoresistance measurements, where two unconventional switching dips arising from electrical transport within domain walls are clearly observed in the amorphous Fe3GeTe2 single layer. These results indicate that the long-range ferromagnetic order of crystallized Fe3GeTe2 may not correlate with the 2D crystalline order, and that the corresponding ferromagnetic attributes can be utilized in an amorphous state, which suits large-scale fabrication in a semiconductor technology-compatible manner for spintronics applications.
Submitted 1 February, 2023;
originally announced February 2023.
-
CoderEval: A Benchmark of Pragmatic Code Generation with Generative Pre-trained Models
Authors:
Hao Yu,
Bo Shen,
Dezhi Ran,
Jiaxin Zhang,
Qi Zhang,
Yuchi Ma,
Guangtai Liang,
Ying Li,
Qianxiang Wang,
Tao Xie
Abstract:
Code generation models based on the pre-training and fine-tuning paradigm have been increasingly attempted by both academia and industry, resulting in well-known industrial models such as Codex, CodeGen, and PanGu-Coder. To evaluate the effectiveness of these models, multiple benchmarks have been proposed, but they include only cases of generating a standalone function, i.e., a function that may invoke or access only built-in functions and standard libraries. However, non-standalone functions, which typically are not included in the existing benchmarks, constitute more than 70% of the functions in popular open-source projects, and evaluating models' effectiveness on standalone functions cannot reflect these models' effectiveness in pragmatic code generation scenarios.
To help bridge the preceding gap, in this paper, we propose a benchmark named CoderEval, consisting of 230 Python and 230 Java code generation tasks carefully curated from popular real-world open-source projects and a self-contained execution platform to automatically assess the functional correctness of generated code. CoderEval supports code generation tasks from six levels of context dependency, where context refers to code elements such as types, APIs, variables, and consts defined outside the function under generation but within the dependent third-party libraries, current class, file, or project. CoderEval can be used to evaluate the effectiveness of models in generating code beyond only standalone functions. By evaluating three code generation models on CoderEval, we find that the effectiveness of these models in generating standalone functions is substantially higher than that in generating non-standalone functions. Our analysis highlights the current progress and pinpoints future directions to further improve a model's effectiveness by leveraging contextual information for pragmatic code generation.
Submitted 23 February, 2024; v1 submitted 1 February, 2023;
originally announced February 2023.
-
Pressure-induced coevolution of transport properties and lattice stability in CaK(Fe1-xNix)4As4 (x= 0.04 and 0) superconductors with and without spin-vortex crystal state
Authors:
Pengyu Wang,
Chang Liu,
Run Yang,
Shu Cai,
Tao Xie,
Jing Guo,
Jinyu Zhao,
Jinyu Han,
Sijin Long,
Yazhou Zhou,
Yanchun Li,
Xiaodong Li,
Huiqian Luo,
Shiliang Li,
Qi Wu,
Xianggang Qiu,
Tao Xiang,
Liling Sun
Abstract:
Here we report the first investigation on correlation between the transport properties and the corresponding stability of the lattice structure for CaK(Fe1-xNix)4As4 (x=0.04 and 0), a new type of putative topological superconductors, with and without a spin-vortex crystal (SVC) state in a wide pressure range involving superconducting to non-superconducting transition and the half- to full-collapse of tetragonal (h-cT and f-cT) phases, by the complementary measurements of high-pressure resistance, Hall coefficient and synchrotron X-ray diffraction. We identify the three critical pressures, P1 that is the turn-on critical pressure of the h-cT phase transition and it coincides with the critical pressure for the sign change of Hall coefficient from positive to negative, a manifestation of the Fermi surface reconstruction, P2 that is the turn-off pressures of the h-cT phase transition, and P3 that is the critical pressure of the f-cT phase transition. By comparing the high-pressure results measured from the two kinds of samples, we find a distinct left-shift of the P1 for the doped sample, at the pressure of which its SVC state is fully suppressed, however the P2 and the P3 remain the same as that of the undoped one. Our results not only provide a consistent understanding on the results reported before, but also demonstrate the importance of the Fe-As bonding in stabilizing the superconductivity of the iron pnictide superconductors through the pressure window.
Submitted 12 January, 2023;
originally announced January 2023.
-
Practitioners' Expectations on Code Completion
Authors:
Chaozheng Wang,
Junhao Hu,
Cuiyun Gao,
Yu Jin,
Tao Xie,
Hailiang Huang,
Zhenyu Lei,
Yuetang Deng
Abstract:
Code completion has become a common practice for programmers during their daily programming activities. It aims at automatically predicting the next tokens or lines that the programmers tend to use. A good code completion tool can substantially save keystrokes and improve the programming efficiency for programmers. Recently, various techniques for code completion have been proposed for usage in practice. However, it is still unclear what practitioners' expectations on code completion are and whether existing research has met their demands. To fill the gap, we perform an empirical study by first interviewing 15 practitioners and then surveying 599 practitioners from 18 IT companies about their expectations on code completion. We then compare the practitioners' demands with current research via conducting a literature review of papers on code completion published in premier publication venues from 2012 to 2022. Based on the comparison, we highlight the directions desirable for researchers to invest efforts towards developing code completion techniques for meeting practitioners' expectations.
Submitted 10 January, 2023;
originally announced January 2023.
-
DeepMatcher: A Deep Transformer-based Network for Robust and Accurate Local Feature Matching
Authors:
Tao Xie,
Kun Dai,
Ke Wang,
Ruifeng Li,
Lijun Zhao
Abstract:
Local feature matching between images remains a challenging task, especially in the presence of significant appearance variations, e.g., extreme viewpoint changes. In this work, we propose DeepMatcher, a deep Transformer-based network built upon our investigation of local feature matching in detector-free methods. The key insight is that local feature matcher with deep layers can capture more human-intuitive and simpler-to-match features. Based on this, we propose a Slimming Transformer (SlimFormer) dedicated for DeepMatcher, which leverages vector-based attention to model relevance among all keypoints and achieves long-range context aggregation in an efficient and effective manner. A relative position encoding is applied to each SlimFormer so as to explicitly disclose relative distance information, further improving the representation of keypoints. A layer-scale strategy is also employed in each SlimFormer to enable the network to assimilate message exchange from the residual block adaptively, thus allowing it to simulate the human behaviour that humans can acquire different matching cues each time they scan an image pair. To facilitate a better adaption of the SlimFormer, we introduce a Feature Transition Module (FTM) to ensure a smooth transition in feature scopes with different receptive fields. By interleaving the self- and cross-SlimFormer multiple times, DeepMatcher can easily establish pixel-wise dense matches at coarse level. Finally, we perceive the match refinement as a combination of classification and regression problems and design Fine Matches Module to predict confidence and offset concurrently, thereby generating robust and accurate matches. Experimentally, we show that DeepMatcher significantly outperforms the state-of-the-art methods on several benchmarks, demonstrating the superior matching capability of DeepMatcher.
Submitted 8 January, 2023;
originally announced January 2023.
-
A Contact Proxy Splitting Method for Lagrangian Solid-Fluid Coupling
Authors:
Tianyi Xie,
Minchen Li,
Yin Yang,
Chenfanfu Jiang
Abstract:
We present a robust and efficient method for simulating Lagrangian solid-fluid coupling based on a new operator splitting strategy. We use variational formulations to approximate fluid properties and solid-fluid interactions, and introduce a unified two-way coupling formulation for SPH fluids and FEM solids using interior point barrier-based frictional contact. We split the resulting optimization problem into a fluid phase and a solid-coupling phase using a novel time-splitting approach with augmented contact proxies, and propose efficient custom linear solvers. Our technique accounts for fluid interactions with nonlinear hyperelastic objects of different geometries and codimensions, while maintaining an algorithmically guaranteed non-penetrating criterion. Comprehensive benchmarks and experiments demonstrate the efficacy of our method.
Submitted 5 January, 2023;
originally announced January 2023.
-
Spin excitations in the kagome-lattice metallic antiferromagnet Fe$_{0.89}$Co$_{0.11}$Sn
Authors:
Tao Xie,
Qiangwei Yin,
Qi Wang,
A. I. Kolesnikov,
G. E. Granroth,
D. L. Abernathy,
Dongliang Gong,
Zhiping Yin,
Hechang Lei,
A. Podlesnyak
Abstract:
Kagome-lattice materials have attracted tremendous interest due to the broad prospect for seeking superconductivity, quantum spin liquid states, and topological electronic structures. Among them, the transition-metal kagome lattices are high-profile objects for the combination of topological properties, rich magnetism, and multiple-orbital physics. Here we report an inelastic neutron scattering study on the spin dynamics of a kagome-lattice antiferromagnetic metal Fe$_{0.89}$Co$_{0.11}$Sn. Although the magnetic excitations can be observed up to $\sim$250 meV, well-defined spin waves are only identified below $\sim$90 meV and can be modeled using Heisenberg exchange with ferromagnetic in-plane nearest-neighbor coupling $J_1$, in-plane next-nearest-neighbor coupling $J_2$, and antiferromagnetic (AFM) interlayer coupling $J_c$ under linear spin-wave theory. Above $\sim$90 meV, the spin waves enter the itinerant Stoner continuum and become highly damped particle-hole excitations. At the K point of the Brillouin zone, we reveal a possible band crossing of the spin wave, which indicates a potential Dirac magnon. Our results uncover the evolution of the spin excitations from the planar AFM state to the axial AFM state in Fe$_{0.89}$Co$_{0.11}$Sn, solve the magnetic Hamiltonian for both states, and confirm the significant influence of the itinerant magnetism on the spin excitations.
Submitted 29 December, 2022; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Confinement of many-body Bethe strings
Authors:
Jiahao Yang,
Tao Xie,
S. E. Nikitin,
Jianda Wu,
A. Podlesnyak
Abstract:
Based on Bethe-ansatz approach and inelastic neutron scattering experiments, we reveal evolution of confinement of many-body Bethe strings in ordered regions of quasi-one-dimensional antiferromagnet $\rm YbAlO_3$. In the antiferromagnetic phase, the spin dynamics is dominated by the confined length-1 Bethe strings, whose dominancy in the high-energy branch of the excitation spectrum yields to the confined length-2 Bethe strings when the material is tuned to the spin-density-wave phase. In the thermal-induced disordered region, the confinement effect disappears, and the system restores the conventional quantum integrable physics of the one-dimensional Heisenberg model. Our results establish a unified picture based on Bethe string for the spin dynamics in different magnetic phases of $\rm YbAlO_3$, and thus provide profound insight into the many-body quantum magnetism.
Submitted 5 June, 2023; v1 submitted 6 December, 2022;
originally announced December 2022.
-
Temperature-dependent behaviors of single spin defects in solids determined with Hz-level precision
Authors:
Shaoyi Xu,
Mingzhe Liu,
Tianyu Xie,
Zhiyuan Zhao,
Qian Shi,
Pei Yu,
Chang-Kui Duan,
Fazhan Shi,
Jiangfeng Du
Abstract:
Revealing the properties of single spin defects in solids is essential for quantum applications based on solid-state systems. However, it is intractable to investigate the temperature-dependent properties of single defects, due to the low precision for single-defect measurements in contrast to defect ensembles. Here we report that the temperature dependence of the Hamiltonian parameters for single negatively charged nitrogen-vacancy (NV$^{-}$) centers in diamond is precisely measured, and the results find a reasonable agreement with first-principles calculations. Particularly, the hyperfine interactions with randomly distributed $^{13}$C nuclear spins are clearly observed to vary with temperature, and the relevant coefficients are measured with Hz-level precision. The temperature-dependent behaviors are attributed to both thermal expansion and lattice vibrations by first-principles calculations. Our results pave the way for taking nuclear spins as more stable thermometers at nanoscale. The methods developed here for high-precision measurements and first-principles calculations can be further extended to other solid-state spin defects.
Submitted 6 December, 2022;
originally announced December 2022.
-
99.92%-Fidelity CNOT Gates in Solids by Filtering Time-dependent and Quantum Noises
Authors:
Tianyu Xie,
Zhiyuan Zhao,
Shaoyi Xu,
Xi Kong,
Zhiping Yang,
Mengqi Wang,
Ya Wang,
Fazhan Shi,
Jiangfeng Du
Abstract:
Inevitable interactions with the reservoir largely degrade the performance of non-local gates, which hinders practical quantum computation from coming into existence. Here we experimentally demonstrate a 99.920(7)\%-fidelity controlled-NOT gate by suppressing the complicated noise in a solid-state spin system at room temperature. We found that the 99\% fidelity limit in previous works results from considering only static noise, and thus, in this work, time-dependent noise and quantum noise are also included. All noises are dynamically corrected by an exquisitely designed shaped pulse, giving a resulting error below $10^{-4}$. The residual gate error originates mainly from the longitudinal relaxation and the waveform distortion, both of which can be further reduced technically. Our noise-resistant method is universal, and will benefit other solid-state spin systems.
Submitted 6 December, 2022;
originally announced December 2022.
-
Interdisciplinary Discovery of Nanomaterials Based on Convolutional Neural Networks
Authors:
Tong Xie,
Yuwei Wan,
Weijian Li,
Qingyuan Linghu,
Shaozhou Wang,
Yalun Cai,
Han Liu,
Chunyu Kit,
Clara Grazian,
Bram Hoex
Abstract:
The material science literature contains up-to-date and comprehensive scientific knowledge of materials. However, its content is unstructured and diverse, resulting in a significant gap in providing sufficient information for material design and synthesis. To this end, we used natural language processing (NLP) and computer vision (CV) techniques based on convolutional neural networks (CNN) to discover valuable experimental-based information about nanomaterials and synthesis methods in energy-material-related publications. Our first system, TextMaster, extracts opinions from texts and classifies them into challenges and opportunities, achieving 94% and 92% accuracy, respectively. Our second system, GraphMaster, realizes data extraction of tables and figures from publications with 98.3% classification accuracy and 4.3% data extraction mean square error. Our results show that these systems could assess the suitability of materials for a certain application by evaluation of synthesis insights and case analysis with detailed references. This work offers a fresh perspective on mining knowledge from scientific literature, providing a wide swatch to accelerate nanomaterial research through CNN.
Submitted 6 December, 2022;
originally announced December 2022.
-
Brain PET Synthesis from MRI Using Joint Probability Distribution of Diffusion Model at Ultrahigh Fields
Authors:
Taofeng Xie,
Chentao Cao,
Zhuoxu Cui,
Fanshi Li,
Zidong Wei,
Yanjie Zhu,
Ye Li,
Dong Liang,
Qiyu Jin,
Guoqing Chen,
Haifeng Wang
Abstract:
MRI and PET are important modalities that provide complementary information for the diagnosis of brain diseases: MRI provides structural information about the brain, and PET provides functional information. However, PET is usually missing. In particular, simultaneous PET and MRI imaging at ultrahigh field is not currently achievable. Thus, synthesizing PET from MRI at ultrahigh field is essential. In this paper, we synthesize PET using MRI as a guide by means of the joint probability distribution of a diffusion model (JPDDM). We also apply our model at ultrahigh fields.
Submitted 15 April, 2023; v1 submitted 16 November, 2022;
originally announced November 2022.
-
Adaptive Joint Estimation of Temporal Vertex and Edge Signals
Authors:
Yi Yan,
Tian Xie,
Ercan E. Kuruoglu
Abstract:
The adaptive estimation of coexisting temporal vertex (node) and edge signals on graphs is a critical task when a change in edge signals influences the temporal dynamics of the vertex signals. However, the current Graph Signal Processing algorithms mostly consider only the signals existing on the graph vertices and have neglected the fact that signals can reside on the edges. We propose an Adaptive Joint Vertex-Edge Estimation (AJVEE) algorithm for jointly estimating time-varying vertex and edge signals through a time-varying regression, incorporating both vertex signal filtering and edge signal filtering. Accompanying AJVEE is a newly proposed Adaptive Least Mean Square procedure based on the Hodge Laplacian (ALMS-Hodge), which is inspired by classical adaptive filters combining simplicial filtering and simplicial regression. AJVEE is able to operate jointly on the vertices and edges by merging two ALMS-Hodge algorithms specified on the vertices and edges into a unified formulation. A more generalized case extending AJVEE beyond the vertices and edges is being discussed. Experimenting on real-world traffic networks and population mobility networks, we have confirmed that our proposed AJVEE algorithm could accurately and jointly track time-varying vertex and edge signals on graphs.
Submitted 7 May, 2024; v1 submitted 11 November, 2022;
originally announced November 2022.
-
ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data
Authors:
Tengyang Xie,
Mohak Bhardwaj,
Nan Jiang,
Ching-An Cheng
Abstract:
We propose a new model-based offline RL framework, called Adversarial Models for Offline Reinforcement Learning (ARMOR), which can robustly learn policies to improve upon an arbitrary baseline policy regardless of data coverage. Based on the concept of relative pessimism, ARMOR is designed to optimize for the worst-case relative performance when facing uncertainty. In theory, we prove that the learned policy of ARMOR never degrades the performance of the baseline policy with any admissible hyperparameter, and can learn to compete with the best policy within data coverage when the hyperparameter is well tuned, and the baseline policy is supported by the data. Such a robust policy improvement property makes ARMOR especially suitable for building real-world learning systems, because in practice ensuring no performance degradation is imperative before considering any benefit learning can bring.
Submitted 8 November, 2022;
originally announced November 2022.
-
Forces are not Enough: Benchmark and Critical Evaluation for Machine Learning Force Fields with Molecular Simulations
Authors:
Xiang Fu,
Zhenghao Wu,
Wujie Wang,
Tian Xie,
Sinan Keten,
Rafael Gomez-Bombarelli,
Tommi Jaakkola
Abstract:
Molecular dynamics (MD) simulation techniques are widely used for various natural science applications. Increasingly, machine learning (ML) force field (FF) models begin to replace ab-initio simulations by predicting forces directly from atomic structures. Despite significant progress in this area, such techniques are primarily benchmarked by their force/energy prediction errors, even though the practical use case would be to produce realistic MD trajectories. We aim to fill this gap by introducing a novel benchmark suite for learned MD simulation. We curate representative MD systems, including water, organic molecules, a peptide, and materials, and design evaluation metrics corresponding to the scientific objectives of respective systems. We benchmark a collection of state-of-the-art (SOTA) ML FF models and illustrate, in particular, how the commonly benchmarked force accuracy is not well aligned with relevant simulation metrics. We demonstrate when and how selected SOTA methods fail, along with offering directions for further improvement. Specifically, we identify stability as a key metric for ML models to improve. Our benchmark suite comes with a comprehensive open-source codebase for training and simulation with ML FFs to facilitate future work.
Submitted 26 August, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Complete field-induced spectral response of the spin-1/2 triangular-lattice antiferromagnet CsYbSe$_2$
Authors:
Tao Xie,
A. A. Eberharter,
Jie Xing,
S. Nishimoto,
M. Brando,
P. Khanenko,
J. Sichelschmidt,
A. A. Turrini,
D. G. Mazzone,
P. G. Naumov,
L. D. Sanjeewa,
N. Harrison,
A. S. Sefat,
B. Normand,
A. M. Lauchli,
A. Podlesnyak,
S. E. Nikitin
Abstract:
Fifty years after Anderson's resonating valence-bond proposal, the spin-1/2 triangular-lattice Heisenberg antiferromagnet (TLHAF) remains the ultimate platform to explore highly entangled quantum spin states in proximity to magnetic order. Yb-based delafossites are ideal candidate TLHAF materials, which allow experimental access to the full range of applied in-plane magnetic fields. We perform a systematic neutron scattering study of CsYbSe$_2$, first proving the Heisenberg character of the interactions and quantifying the second-neighbour coupling. We then measure the complex evolution of the excitation spectrum, finding extensive continuum features near the 120$^{\circ}$-ordered state, throughout the 1/3-magnetization plateau and beyond this up to saturation. We perform cylinder matrix-product-state (MPS) calculations to obtain an unbiased numerical benchmark for the TLHAF and spectacular agreement with the experimental spectra. The measured and calculated longitudinal spectral functions reflect the role of multi-magnon bound and scattering states. These results provide valuable insight into unconventional field-induced spin excitations in frustrated quantum materials.
Submitted 6 October, 2023; v1 submitted 10 October, 2022;
originally announced October 2022.
-
The Role of Coverage in Online Reinforcement Learning
Authors:
Tengyang Xie,
Dylan J. Foster,
Yu Bai,
Nan Jiang,
Sham M. Kakade
Abstract:
Coverage conditions -- which assert that the data logging distribution adequately covers the state space -- play a fundamental role in determining the sample complexity of offline reinforcement learning. While such conditions might seem irrelevant to online reinforcement learning at first glance, we establish a new connection by showing -- somewhat surprisingly -- that the mere existence of a data distribution with good coverage can enable sample-efficient online RL. Concretely, we show that coverability -- that is, existence of a data distribution that satisfies a ubiquitous coverage condition called concentrability -- can be viewed as a structural property of the underlying MDP, and can be exploited by standard algorithms for sample-efficient exploration, even when the agent does not know said distribution. We complement this result by proving that several weaker notions of coverage, despite being sufficient for offline RL, are insufficient for online RL. We also show that existing complexity measures for online RL, including Bellman rank and Bellman-Eluder dimension, fail to optimally capture coverability, and propose a new complexity measure, the sequential extrapolation coefficient, to provide a unification.
Submitted 8 October, 2022;
originally announced October 2022.
-
On two cycles of consecutive even lengths
Authors:
Jun Gao,
Binlong Li,
Jie Ma,
Tianying Xie
Abstract:
Bondy and Vince showed that every graph with minimum degree at least three contains two cycles of lengths differing by one or two. We prove the following average degree counterpart: every $n$-vertex graph $G$ with at least $\frac52(n-1)$ edges, unless $4|(n-1)$ and every block of $G$ is a clique $K_5$, contains two cycles of consecutive even lengths. Our proof is mainly based on structural analysis, and a crucial step which may be of independent interest shows that the same conclusion holds for every 3-connected graph with at least 6 vertices. This solves a special case of a conjecture of Verstraëte. The quantitative bound is tight and also provides the optimal extremal number for cycles of length two modulo four.
Submitted 8 October, 2022;
originally announced October 2022.
-
Binding Language Models in Symbolic Languages
Authors:
Zhoujun Cheng,
Tianbao Xie,
Peng Shi,
Chengzu Li,
Rahul Nadkarni,
Yushi Hu,
Caiming Xiong,
Dragomir Radev,
Mari Ostendorf,
Luke Zettlemoyer,
Noah A. Smith,
Tao Yu
Abstract:
Though end-to-end neural approaches have recently been dominating NLP tasks in both performance and ease-of-use, they lack interpretability and robustness. We propose Binder, a training-free neural-symbolic framework that maps the task input to a program, which (1) allows binding a unified API of language model (LM) functionalities to a programming language (e.g., SQL, Python) to extend its grammar coverage and thus tackle more diverse questions, (2) adopts an LM as both the program parser and the underlying model called by the API during execution, and (3) requires only a few in-context exemplar annotations. Specifically, we employ GPT-3 Codex as the LM. In the parsing stage, with only a few in-context exemplars, Codex is able to identify the part of the task input that cannot be answerable by the original programming language, correctly generate API calls to prompt Codex to solve the unanswerable part, and identify where to place the API calls while being compatible with the original grammar. In the execution stage, Codex can perform versatile functionalities (e.g., commonsense QA, information extraction) given proper prompts in the API calls. Binder achieves state-of-the-art results on WikiTableQuestions and TabFact datasets, with explicit output programs that benefit human debugging. Note that previous best systems are all finetuned on tens of thousands of task-specific samples, while Binder only uses dozens of annotations as in-context exemplars without any training. Our code is available at https://github.com/HKUNLP/Binder .
Submitted 28 February, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
zkBridge: Trustless Cross-chain Bridges Made Practical
Authors:
Tiancheng Xie,
Jiaheng Zhang,
Zerui Cheng,
Fan Zhang,
Yupeng Zhang,
Yongzheng Jia,
Dan Boneh,
Dawn Song
Abstract:
Blockchains have seen growing traction with cryptocurrencies reaching a market cap of over 1 trillion dollars, major institutional investors taking interest, and global impacts on governments, businesses, and individuals. Also growing significantly is the heterogeneity of the ecosystem where a variety of blockchains co-exist. A cross-chain bridge is a necessary building block in this multi-chain ecosystem. Existing solutions, however, either suffer from performance issues or rely on trust assumptions of committees that significantly lower the security. Recurring attacks against bridges have cost users more than 1.5 billion USD. In this paper, we introduce zkBridge, an efficient cross-chain bridge that guarantees strong security without external trust assumptions. With succinct proofs, zkBridge not only guarantees correctness, but also significantly reduces on-chain verification cost. We propose novel succinct proof protocols that are orders of magnitude faster than existing solutions for the workloads in zkBridge. With a modular design, zkBridge enables a broad spectrum of use cases and capabilities, including message passing, token transferring, and other computational logic operating on state changes from different chains. To demonstrate the practicality of zkBridge, we implemented a prototype bridge from Cosmos to Ethereum, a particularly challenging direction that involves large proof circuits that existing systems cannot efficiently handle. Our evaluation shows that zkBridge achieves practical performance: proof generation takes less than 20 seconds, while verifying proofs on-chain costs less than 230K gas. For completeness, we also implemented and evaluated the direction from Ethereum to other EVM-compatible chains (such as BSC), which involves smaller circuits and incurs much less overhead.
Submitted 1 October, 2022;
originally announced October 2022.
-
NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries
Authors:
Yiru Chen,
Ryan Li,
Austin Mac,
Tianbao Xie,
Tao Yu,
Eugene Wu
Abstract:
We develop NL2INTERFACE to explore the potential of generating usable interactive multi-visualization interfaces from natural language queries. With NL2INTERFACE, users can directly write natural language queries to automatically generate a fully interactive multi-visualization interface without any extra effort of learning a tool or programming language. Further, users can interact with the interfaces to easily transform the data and quickly see the results in the visualizations.
Submitted 24 September, 2022; v1 submitted 19 September, 2022;
originally announced September 2022.
-
Novel Constructions of Mutually Unbiased Tripartite Absolutely Maximally Entangled Bases
Authors:
Tian Xie,
Yajuan Zang,
Hui-Juan Zuo,
Shao-Ming Fei
Abstract:
We develop a new technique to construct mutually unbiased tripartite absolutely maximally entangled bases. We first explore the tripartite absolutely maximally entangled bases and mutually unbiased bases in $\mathbb{C}^{d} \otimes \mathbb{C}^{d} \otimes \mathbb{C}^{d}$ based on mutually orthogonal Latin squares. Then we generalize the approach to the case of $\mathbb{C}^{d_{1}} \otimes \mathbb{C}^{d_{2}} \otimes \mathbb{C}^{d_{1}d_{2}}$ by mutually weak orthogonal Latin squares. We present concise, direct, and general constructions of mutually unbiased tripartite absolutely maximally entangled bases. Detailed examples in $\mathbb{C}^{3} \otimes \mathbb{C}^{3} \otimes \mathbb{C}^{3}$, $\mathbb{C}^{2} \otimes \mathbb{C}^{2} \otimes \mathbb{C}^{4}$ and $\mathbb{C}^{2} \otimes \mathbb{C}^{5} \otimes \mathbb{C}^{10}$ are provided to illustrate the advantages of our approach.
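As background for the combinatorial ingredient named in this abstract, the following is a minimal sketch of mutually orthogonal Latin squares only; it does not reproduce the paper's basis construction. It assumes a prime order $d$ and uses the standard squares $L_k(i,j) = (k\,i + j) \bmod d$ for $k = 1, \dots, d-1$. The function names are hypothetical and chosen for illustration.

```python
def latin_square(k, d):
    """Return the d x d square with entries (k*i + j) mod d (d assumed prime)."""
    return [[(k * i + j) % d for j in range(d)] for i in range(d)]


def is_latin(square):
    """Check that every row and every column contains each symbol exactly once."""
    d = len(square)
    symbols = set(range(d))
    rows_ok = all(set(row) == symbols for row in square)
    cols_ok = all({square[i][j] for i in range(d)} == symbols for j in range(d))
    return rows_ok and cols_ok


def are_orthogonal(a, b):
    """Two squares are orthogonal if superimposing them yields all d*d ordered pairs."""
    d = len(a)
    pairs = {(a[i][j], b[i][j]) for i in range(d) for j in range(d)}
    return len(pairs) == d * d


d = 3  # matches the smallest example dimension mentioned in the abstract
squares = [latin_square(k, d) for k in range(1, d)]
```

For prime $d$ this yields $d-1$ pairwise orthogonal squares; non-prime orders need a different construction, which this sketch does not cover.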
Submitted 17 September, 2022;
originally announced September 2022.
-
Spin fluctuations in the 112-type iron-based superconductor Ca$_{0.82}$La$_{0.18}$Fe$_{0.96}$Ni$_{0.04}$As$_{2}$
Authors:
Tao Xie,
Chang Liu,
Ryoichi Kajimoto,
Kazuhiko Ikeuchi,
Shiliang Li,
Huiqian Luo
Abstract:
We report time-of-flight inelastic neutron scattering (INS) investigations on the spin fluctuation spectrum in the 112-type iron-based superconductor (FeSC) Ca$_{0.82}$La$_{0.18}$Fe$_{0.96}$Ni$_{0.04}$As$_{2}$ (CaLa-112). In comparison to the 122-type FeSCs with a centrosymmetric tetragonal lattice structure (space group $I4/mmm$) at room temperature and an in-plane stripe-type antiferromagnetic (AF) order at low temperature, the 112 system has a noncentrosymmetric structure (space group $P2_{1}$) with additional zigzag arsenic chains between Ca/La layers and a magnetic ground state with similar wavevector $\mathbf{Q}_{\mathrm{AF}}$ but different orientations of ordered moments in the parent compounds. Our INS study clearly reveals that the in-plane dispersions and the bandwidth of spin excitations in the superconducting CaLa-112 closely resemble those in 122 systems. While the total fluctuating moments $\langle m^2 \rangle\approx 4.6\pm0.2\ \mu_B^2$/Fe are larger than in 122 systems, the dynamic correlation lengths are similar ($\xi\approx 10$ Å). These results suggest that superconductivity in iron arsenides may have a common magnetic origin under similar magnetic exchange couplings with a dual nature from local moments and itinerant electrons, despite their different magnetic patterns and lattice symmetries.
Submitted 16 September, 2022;
originally announced September 2022.
-
Observation of Floquet topological phases with large Chern numbers
Authors:
Kai Yang,
Shaoyi Xu,
Longwen Zhou,
Zhiyuan Zhao,
Tianyu Xie,
Zhe Ding,
Wenchao Ma,
Jiangbin Gong,
Fazhan Shi,
Jiangfeng Du
Abstract:
One of the most intriguing advantages of Floquet engineering is the ability to generate new phases with large topological invariants. In this work, we experimentally simulate a periodically quenched generalized Haldane model on an NV center in diamond, and observe its Floquet Chern insulator phases with Chern numbers $C=1,2,4$ by imaging the static and dynamic spin textures in momentum space. Our work reveals the power of Floquet driving in transforming a system's topology and generating large Chern number phases. It further establishes a unique experimental scheme to detect Floquet topological phases in two and higher spatial dimensions.
Submitted 12 September, 2022;
originally announced September 2022.
-
Perpendicular magnetic anisotropy in as-deposited CoFeB/MgO thin films
Authors:
Kaihua Lou,
Tunan Xie,
Qianwen Zhao,
Baiqing Jiang,
ChaoChao Xia,
Hanying Zhang,
Zhihong Yao,
Chong Bi
Abstract:
Fabrication of perpendicularly magnetized ferromagnetic films on various buffer layers, especially on numerous newly discovered spin-orbit torque (SOT) materials to construct energy-efficient spin-orbitronic devices, is a long-standing challenge. Even for the widely used CoFeB/MgO structures, perpendicular magnetic anisotropy (PMA) can only be established on limited buffer layers through post-annealing above 300 °C. Here, we report that the PMA of CoFeB/MgO films can be established reliably on various buffer layers in the absence of post-annealing. Further results show that precise control of MgO thickness, which determines oxygen diffusion in the underlying CoFeB layer, is the key to obtaining the as-deposited PMA. Interestingly, contrary to previous understanding, post-annealing does not influence the well-established as-deposited PMA significantly but indeed enhances unsaturated PMA with a thick MgO layer by modulating oxygen distributions, rather than crystallinity or Co- and Fe-O bonding. Moreover, our results indicate that oxygen diffusion also plays a critical role in the PMA degradation at high temperature. These results provide a practical approach to building spin-orbitronic devices based on various highly efficient SOT materials.
Submitted 31 August, 2022;
originally announced August 2022.
-
Soft mechanical metamaterials with transformable topology protected by stress caching
Authors:
Jason Christopher Jolly,
Binjie Jin,
Lishuai Jin,
YoungJoo Lee,
Tao Xie,
Stefano Gonella,
Kai Sun,
Xiaoming Mao,
Shu Yang
Abstract:
Maxwell lattice metamaterials possess a rich phase space with distinct topological states featuring mechanically polarized edge behaviors and strongly asymmetric acoustic responses. Until now, demonstrations of non-trivial topological behaviors from Maxwell lattices have been limited to either monoliths with locked configurations or reconfigurable mechanical linkages. This work introduces a transformable topological mechanical metamaterial (TTMM) made from a shape memory polymer and based on a generalized kagome lattice. It is capable of reversibly exploring topologically distinct phases of the non-trivial phase space via a kinematic strategy that converts sparse mechanical inputs at free edge pairs into a biaxial, global transformation that switches its topological state. Thanks to the shape memory effect, all configurations are stable even in the absence of confinement or a continuous mechanical input. Topologically-protected mechanical behaviors, while robust against structural (with broken hinges) or conformational defects (up to ~55% mis-rotations), are shown to be vulnerable to the adverse effects of stored elastic energy from prior transformations (up to a ~70% reduction in edge stiffness ratios, depending on hinge width). Interestingly, we show that shape memory polymer's intrinsic phase transitions that modulate chain mobility can effectively shield a dynamic metamaterial's topological response (with a 100% recovery) from its own kinematic stress history, an effect we refer to as "stress caching".
Submitted 16 August, 2022;
originally announced August 2022.
-
Examining graph neural networks for crystal structures: limitations and opportunities for capturing periodicity
Authors:
Sheng Gong,
Tian Xie,
Yang Shao-Horn,
Rafael Gomez-Bombarelli,
Jeffrey C. Grossman
Abstract:
Historically, materials informatics has relied on human-designed descriptors of materials structures. In recent years, graph neural networks (GNNs) have been proposed for learning representations of crystal structures from data end-to-end producing vectorial embeddings that are optimized for downstream prediction tasks. However, a systematic scheme is lacking to analyze and understand the limits of GNNs for capturing crystal structures. In this work, we propose to use human-designed descriptors as a bank of human knowledge to test whether black-box GNNs can capture the knowledge of crystal structures. We find that current state-of-the-art GNNs cannot capture the periodicity of crystal structures well, and we analyze the limitations of the GNN models that result in this failure from three aspects: local expressive power, long-range information, and readout function. We propose an initial solution, hybridizing descriptors with GNNs, to improve the prediction of GNNs for materials properties, especially phonon internal energy and heat capacity with 90% lower errors, and we analyze the mechanisms for the improved prediction. All the analysis can be extended easily to other deep representation learning models, human-designed descriptors, and systems such as molecules and amorphous materials.
Submitted 27 March, 2023; v1 submitted 9 August, 2022;
originally announced August 2022.
-
A cloud platform for automating and sharing analysis of raw simulation data from high throughput polymer molecular dynamics simulations
Authors:
Tian Xie,
Ha-Kyung Kwon,
Daniel Schweigert,
Sheng Gong,
Arthur France-Lanord,
Arash Khajeh,
Emily Crabb,
Michael Puzon,
Chris Fajardo,
Will Powelson,
Yang Shao-Horn,
Jeffrey C. Grossman
Abstract:
Open material databases storing hundreds of thousands of material structures and their corresponding properties have become the cornerstone of modern computational materials science. Yet, the raw outputs of the simulations, such as the trajectories from molecular dynamics simulations and charge densities from density functional theory calculations, are generally not shared due to their huge size. In this work, we describe a cloud-based platform to facilitate the sharing of raw data and enable the fast post-processing in the cloud to extract new properties defined by the user. As an initial demonstration, our database currently includes 6286 molecular dynamics trajectories for amorphous polymer electrolytes and 5.7 terabytes of data. We create a public analysis library at https://github.com/TRI-AMDD/htp_md to extract multiple properties from the raw data, using both expert designed functions and machine learning models. The analysis is run automatically with computation in the cloud, and results then populate a database that can be accessed publicly. Our platform encourages users to contribute both new trajectory data and analysis functions via public interfaces. Newly analyzed properties will be incorporated into the database. Finally, we create a front-end user interface at https://www.htpmd.matr.io for browsing and visualization of our data. We envision the platform to be a new way of sharing raw data and new insights for the computational materials science community.
Submitted 2 August, 2022;
originally announced August 2022.
-
Differentiable Subdivision Surface Fitting
Authors:
Tianhao Xie
Abstract:
In this paper, we present a powerful differentiable surface fitting technique to derive a compact surface representation for a given dense point cloud or mesh, with application in the domains of graphics and CAD/CAM. We have chosen the Loop subdivision surface, which in the limit yields the smooth surface underlying the point cloud, and can handle complex surface topology better than other popular compact representations, such as NURBS. The principal idea is to fit the Loop subdivision surface not directly to the point cloud, but to the IMLS (implicit moving least squares) surface defined over the point cloud. As both Loop subdivision and IMLS have analytical expressions, we are able to formulate the problem as an unconstrained minimization problem of a completely differentiable function that can be solved with standard numerical solvers. Differentiability enables us to integrate the subdivision surface into any deep learning method for point clouds or meshes. We demonstrate the versatility and potential of this approach by using it in conjunction with a differentiable renderer to robustly reconstruct compact surface representations of spatial-temporal sequences of dense meshes.
Submitted 19 October, 2022; v1 submitted 2 August, 2022;
originally announced August 2022.
-
Observation of SQUID-like behavior in fiber laser with intra-cavity epsilon-near-zero effect
Authors:
Jiaye Wu,
Xuanyi Liu,
Boris A. Malomed,
Kuan-Chang Chang,
Minghe Zhao,
Kang Qi,
Yanhua Sha,
Ze Tao Xie,
Marco Clementi,
Camille-Sophie Brès,
Shengdong Zhang,
H. Y. Fu,
Qian Li
Abstract:
Establishing relations between fundamental effects in far-flung areas of physics is a subject of great interest in current research. Here we report the realization of a novel photonic system akin to the radio-frequency superconducting quantum interference device (RF-SQUID), in a fiber laser cavity with epsilon-near-zero (ENZ) nanolayers as intra-cavity components. Emulating the RF-SQUID scheme, the photonic counterpart of the supercurrent, represented by the optical wave, circulates in the cavity, passing through effective optical potential barriers. Different ENZ wavelengths translate into distinct spectral outputs through the variation of cavity resonances, emulating the situation with a frequency-varying tank circuit in the RF-SQUID. Due to the presence of the ENZ element, the optical potential barrier is far lower for selected frequency components, granting them an advantage in the gain-resource competition. The findings reported in this work provide deeper insight into ultrafast ENZ photonics, revealing a new path towards the design of nanophotonic on-chip devices with various operational functions, and offer a new approach to studying superconducting and quantum-mechanical systems.
Submitted 2 August, 2022;
originally announced August 2022.
-
GDsmith: Detecting Bugs in Graph Database Engines
Authors:
Wei Lin,
Ziyue Hua,
Luyao Ren,
Zongyang Li,
Lu Zhang,
Tao Xie
Abstract:
Graph database engines stand out in the era of big data for their efficiency in modeling and processing linked data. There is a strong need for testing graph database engines. However, random testing, the most practical way of automated test generation, faces the challenges of semantic validity, non-empty results, and behavior diversity when detecting bugs in graph database engines. To address these challenges, in this paper, we propose GDsmith, the first black-box approach for testing graph database engines. It ensures that each randomly generated Cypher query satisfies the semantic requirements via skeleton generation and completion. GDsmith includes our technique to increase the probability of producing Cypher queries that return non-empty results by leveraging three types of structural mutation strategies. GDsmith also includes our technique to improve the behavior diversity of the generated Cypher queries by selecting property keys according to their previous frequencies when generating new queries. Our evaluation results demonstrate that GDsmith is effective and efficient for automated query generation and substantially outperforms the baseline. GDsmith successfully detects 27 previously unknown bugs on the released versions of three popular open-source graph database engines and has received positive feedback from their developers.
Submitted 16 June, 2022;
originally announced June 2022.
-
Interaction-Grounded Learning with Action-inclusive Feedback
Authors:
Tengyang Xie,
Akanksha Saran,
Dylan J. Foster,
Lekan Molu,
Ida Momennejad,
Nan Jiang,
Paul Mineiro,
John Langford
Abstract:
Consider the problem setting of Interaction-Grounded Learning (IGL), in which a learner's goal is to optimally interact with the environment with no explicit reward to ground its policies. The agent observes a context vector, takes an action, and receives a feedback vector, using this information to effectively optimize a policy with respect to a latent reward function. Prior analyzed approaches fail when the feedback vector contains the action, which significantly limits IGL's success in many potential scenarios such as Brain-computer interface (BCI) or Human-computer interface (HCI) applications. We address this by creating an algorithm and analysis which allows IGL to work even when the feedback vector contains the action, encoded in any fashion. We provide theoretical guarantees and large-scale experiments based on supervised datasets to demonstrate the effectiveness of the new approach.
Submitted 12 October, 2022; v1 submitted 16 June, 2022;
originally announced June 2022.
-
Double Sampling Randomized Smoothing
Authors:
Linyi Li,
Jiawei Zhang,
Tao Xie,
Bo Li
Abstract:
Neural networks (NNs) are known to be vulnerable to adversarial perturbations, and thus there is a line of work aiming to provide robustness certification for NNs, such as randomized smoothing, which samples smoothing noise from a certain distribution to certify the robustness of a smoothed classifier. However, as shown by previous work, the certified robust radius in randomized smoothing suffers when scaling to large datasets ("curse of dimensionality"). To overcome this hurdle, we propose a Double Sampling Randomized Smoothing (DSRS) framework, which exploits the sampled probability from an additional smoothing distribution to tighten the robustness certification of the previous smoothed classifier. Theoretically, under mild assumptions, we prove that DSRS can certify $\Theta(\sqrt{d})$ robust radius under $\ell_2$ norm where $d$ is the input dimension, implying that DSRS may be able to break the curse of dimensionality of randomized smoothing. We instantiate DSRS for a generalized family of Gaussian smoothing and propose an efficient and sound computing method based on customized dual optimization considering sampling error. Extensive experiments on MNIST, CIFAR-10, and ImageNet verify our theory and show that DSRS certifies larger robust radii than existing baselines consistently under different settings. Code is available at https://github.com/llylly/DSRS.
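To make the baseline idea concrete, the following is a minimal sketch of the prediction step of plain randomized smoothing: a majority vote of a base classifier over Gaussian-perturbed copies of the input. It is not DSRS and performs no certification. The base classifier, the function names, and the parameter values are all hypothetical and chosen only for illustration.

```python
import random
from collections import Counter


def smoothed_predict(base_classifier, x, sigma, n_samples, rng):
    """Return the label the base classifier outputs most often under Gaussian noise."""
    votes = Counter()
    for _ in range(n_samples):
        # Add independent N(0, sigma^2) noise to every input coordinate.
        noisy = [xi + rng.gauss(0.0, sigma) for xi in x]
        votes[base_classifier(noisy)] += 1
    return votes.most_common(1)[0][0]


def toy_classifier(x):
    """Hypothetical base classifier: label 1 if the coordinate sum is positive, else 0."""
    return 1 if sum(x) > 0 else 0


rng = random.Random(0)  # fixed seed so the run is repeatable
label = smoothed_predict(toy_classifier, [2.0, 2.0], sigma=0.5, n_samples=200, rng=rng)
```

A certification procedure would additionally turn the vote counts into a confidence bound on the top-class probability; that step is omitted here.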
Submitted 31 January, 2023; v1 submitted 16 June, 2022;
originally announced June 2022.
-
A Class of Mean-Field Games with Optimal Stopping and its Inverse Problem
Authors:
Jianhui Huang,
Tinghan Xie
Abstract:
This paper revisits the well-studied \emph{optimal stopping} problem but within the \emph{large-population} framework. In particular, two classes of optimal stopping problems are formulated by taking into account the \emph{relative performance criteria}. Notably, relative performance criteria, also known as the \emph{Joneses preference}, \emph{habit formation utility}, or \emph{relative wealth concern} in economics and finance, play an important role in explaining various decision behaviors such as price bubbles. By introducing such criteria in the large-population setting, a given agent can compare their individual stopping rule with the average behavior of their cohort. The associated mean-field games are formulated in order to derive the decentralized stopping rules. The related consistency conditions are characterized via a coupled equation system, and the asymptotic Nash equilibrium properties are also verified. In addition, an \emph{inverse} mean-field optimal stopping problem is also introduced and discussed.
Submitted 7 June, 2022;
originally announced June 2022.
-
Towards A Proactive ML Approach for Detecting Backdoor Poison Samples
Authors:
Xiangyu Qi,
Tinghao Xie,
Jiachen T. Wang,
Tong Wu,
Saeed Mahloujifar,
Prateek Mittal
Abstract:
Adversaries can embed backdoors in deep learning models by introducing backdoor poison samples into training datasets. In this work, we investigate how to detect such poison samples to mitigate the threat of backdoor attacks. First, we uncover a post-hoc workflow underlying most prior work, where defenders passively allow the attack to proceed and then leverage the characteristics of the post-attacked model to uncover poison samples. We reveal that this workflow does not fully exploit defenders' capabilities, and defense pipelines built on it are prone to failure or performance degradation in many scenarios. Second, we suggest a paradigm shift by promoting a proactive mindset in which defenders engage proactively with the entire model training and poison detection pipeline, directly enforcing and magnifying distinctive characteristics of the post-attacked model to facilitate poison detection. Based on this, we formulate a unified framework and provide practical insights on designing detection pipelines that are more robust and generalizable. Third, we introduce the technique of Confusion Training (CT) as a concrete instantiation of our framework. CT applies an additional poisoning attack to the already poisoned dataset, actively decoupling benign correlation while exposing backdoor patterns to detection. Empirical evaluations on 4 datasets and 14 types of attacks validate the superiority of CT over 14 baseline defenses.
Submitted 17 June, 2023; v1 submitted 26 May, 2022;
originally announced May 2022.
-
Circumventing Backdoor Defenses That Are Based on Latent Separability
Authors:
Xiangyu Qi,
Tinghao Xie,
Yiming Li,
Saeed Mahloujifar,
Prateek Mittal
Abstract:
Recent studies revealed that deep learning is susceptible to backdoor poisoning attacks. An adversary can embed a hidden backdoor into a model to manipulate its predictions by only modifying a small amount of training data, without controlling the training process. Currently, a tangible signature has been widely observed across a diverse set of backdoor poisoning attacks -- models trained on a poisoned dataset tend to learn separable latent representations for poison and clean samples. This latent separation is so pervasive that a family of backdoor defenses directly take it as a default assumption (dubbed latent separability assumption), based on which to identify poison samples via cluster analysis in the latent space. An intriguing question consequently follows: is the latent separation unavoidable for backdoor poisoning attacks? This question is central to understanding whether the assumption of latent separability provides a reliable foundation for defending against backdoor poisoning attacks. In this paper, we design adaptive backdoor poisoning attacks to present counter-examples against this assumption. Our methods include two key components: (1) a set of trigger-planted samples correctly labeled to their semantic classes (other than the target class) that can regularize backdoor learning; (2) asymmetric trigger planting strategies that help to boost attack success rate (ASR) as well as to diversify latent representations of poison samples. Extensive experiments on benchmark datasets verify the effectiveness of our adaptive attacks in bypassing existing latent-separation-based backdoor defenses. Moreover, our attacks still maintain a high attack success rate with negligible clean accuracy drop. Our studies call for defense designers to take caution when leveraging latent separation as an assumption in their defenses.
Submitted 3 March, 2023; v1 submitted 26 May, 2022;
originally announced May 2022.