-
Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding
Authors:
Xun Long Ng,
Kian Eng Ong,
Qichen Zheng,
Yun Ni,
Si Yong Yeo,
Jun Liu
Abstract:
Understanding animals' behaviors is significant for a wide range of applications. However, existing animal behavior datasets have limitations in multiple aspects, including limited numbers of animal classes, data samples and provided tasks, and also limited variations in environmental conditions and viewpoints. To address these limitations, we create a large and diverse dataset, Animal Kingdom, that provides multiple annotated tasks to enable a more thorough understanding of natural animal behaviors. The wild animal footage used in our dataset was recorded at different times of the day in an extensive range of environments containing variations in backgrounds, viewpoints, illumination and weather conditions. More specifically, our dataset contains 50 hours of annotated videos to localize relevant animal behavior segments in long videos for the video grounding task, 30K video sequences for the fine-grained multi-label action recognition task, and 33K frames for the pose estimation task, which correspond to a diverse range of animals with 850 species across 6 major animal classes. Such a challenging and comprehensive dataset should help the community develop, adapt, and evaluate various types of advanced methods for animal behavior analysis. Moreover, we propose a Collaborative Action Recognition (CARe) model that learns general and specific features for action recognition with unseen new animals. This method achieves promising performance in our experiments. Our dataset can be found at https://sutdcv.github.io/Animal-Kingdom.
Submitted 3 June, 2022; v1 submitted 17 April, 2022;
originally announced April 2022.
-
Facing the Illusion and Reality of Safety in Social VR
Authors:
Qingxiao Zheng,
Tue Ngoc Do,
Lingqing Wang,
Yun Huang
Abstract:
The ethical design of social Virtual Reality (VR) is not a new topic, but "safety" concerns about using social VR have escalated to a different level given the growing attention to the Metaverse. For example, it was reported that nearly half of the female-identifying VR participants have had at least one instance of virtual sexual harassment. Feeling safe is a basic human right in any place, whether real or virtual. In this paper, we seek to understand the discrepancy between user concerns and designs in protecting user safety in social VR applications. We study safety concerns about the social VR experience first by analyzing Twitter posts, and then synthesize the safety protection practices adopted by four mainstream social VR platforms. We argue that future research and platforms should explore the design of social VR with boundary-awareness.
Submitted 16 April, 2022; v1 submitted 14 April, 2022;
originally announced April 2022.
-
Automatic Facial Skin Feature Detection for Everyone
Authors:
Qian Zheng,
Ankur Purwar,
Heng Zhao,
Guang Liang Lim,
Ling Li,
Debasish Behera,
Qian Wang,
Min Tan,
Rizhao Cai,
Jennifer Werner,
Dennis Sng,
Maurice van Steensel,
Weisi Lin,
Alex C Kot
Abstract:
Automatic assessment and understanding of facial skin condition have several applications, including the early detection of underlying health problems, lifestyle and dietary treatment, skin-care product recommendation, etc. Selfies in the wild serve as an excellent data resource to democratize skin quality assessment, but suffer from several data collection challenges. The key to guaranteeing an accurate assessment is accurate detection of different skin features. We present an automatic facial skin feature detection method that works across a variety of skin tones and age groups for selfies in the wild. To be specific, we annotate the locations of acne, pigmentation, and wrinkles for selfie images with different skin tone colors, severity levels, and lighting conditions. The annotation is conducted in a two-phase scheme with the help of a dermatologist to train volunteers for annotation. We employ Unet++ as the network architecture for feature detection. This work shows that the two-phase annotation scheme can robustly detect the accurate locations of acne, pigmentation, and wrinkles for selfie images with different ethnicities, skin tone colors, severity levels, age groups, and lighting conditions.
Submitted 30 March, 2022;
originally announced March 2022.
-
Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
Authors:
Qingping Zheng,
Jiankang Deng,
Zheng Zhu,
Ying Li,
Stefanos Zafeiriou
Abstract:
This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) for face parsing. Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection. These tasks only share low-level encoder weights without high-level interactions between each other, which allows the auxiliary modules to be decoupled from the whole network at the inference stage. To address spatial inconsistency, we develop a dynamic dual graph convolutional network to capture global contextual information without using any extra pooling operation. To handle boundary confusion in both single and multiple face scenarios, we exploit binary and category edge detection to jointly obtain generic geometric structure and fine-grained semantic clues of human faces. Besides, to prevent noisy labels from degrading model generalization during training, cyclical self-regulation is proposed to self-ensemble several model instances to get a new model, and the resulting model is then used to self-distill subsequent models, through alternating iterations. Experiments show that our method achieves the new state-of-the-art performance on the Helen, CelebAMask-HQ, and LaPa datasets. The source code is available at https://github.com/deepinsight/insightface/tree/master/parsing/dml_csr.
Submitted 27 March, 2022;
originally announced March 2022.
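The alternating "self-ensemble, then self-distill" cycle described in the abstract above can be pictured with a small sketch. The abstract does not say how the ensemble is formed or how the distillation targets are built, so the code below assumes parameter averaging and a simple blend of hard labels with the ensemble's predictions; the function names and the `teacher_weight` parameter are hypothetical and are not taken from the DML-CSR code.

```python
def average_weights(checkpoints):
    """Element-wise mean of several parameter dicts (name -> list of floats).

    Assumption: the "self-ensemble" step is modelled as plain weight averaging.
    """
    if not checkpoints:
        raise ValueError("need at least one checkpoint")
    names = checkpoints[0].keys()
    return {
        name: [sum(values) / len(checkpoints)
               for values in zip(*(ckpt[name] for ckpt in checkpoints))]
        for name in names
    }


def blend_targets(hard_label, teacher_probs, teacher_weight=0.5):
    """Mix a one-hot label with the ensemble's prediction.

    Assumption: the "self-distill" step is modelled as training the next
    model on this softened target, which dampens a possibly noisy hard label.
    """
    return [(1 - teacher_weight) * h + teacher_weight * p
            for h, p in zip(hard_label, teacher_probs)]


if __name__ == "__main__":
    ensemble = average_weights([{"w": [1.0, 3.0]}, {"w": [3.0, 5.0]}])
    print(ensemble)
    print(blend_targets([0.0, 1.0], [0.5, 0.5], teacher_weight=0.5))
```

In a full training loop the two steps would alternate: several model instances are averaged into a teacher, the teacher's predictions soften the labels for the next round, and the cycle repeats.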
-
CGUA: Context-Guided and Unpaired-Assisted Weakly Supervised Person Search
Authors:
Chengyou Jia,
Minnan Luo,
Caixia Yan,
Xiaojun Chang,
Qinghua Zheng
Abstract:
Weakly supervised person search was recently proposed to discard human-annotated identities and train the model with only bounding box annotations. A natural way to solve this problem is to separate it into detection and unsupervised re-identification (Re-ID) steps. However, in this way, two important clues in unconstrained scene images are ignored. On the one hand, existing unsupervised Re-ID models only leverage cropped images from scene images but ignore the rich context information of those scenes. On the other hand, there are numerous unpaired persons in real-world scene images. Directly dealing with them as independent identities leads to the long-tail effect, while completely discarding them can result in serious information loss. In light of these challenges, we introduce a Context-Guided and Unpaired-Assisted (CGUA) weakly supervised person search framework. Specifically, we propose a novel Context-Guided Cluster (CGC) algorithm to leverage context information in the clustering process and an Unpaired-Assisted Memory (UAM) unit to distinguish unpaired and paired persons by pushing them away. Extensive experiments demonstrate that the proposed approach can surpass the state-of-the-art weakly supervised methods by a large margin (more than 5% mAP on CUHK-SYSU). Moreover, our method achieves comparable or better performance to the state-of-the-art supervised methods by leveraging more diverse unlabeled data. Code and models will be released soon.
Submitted 27 March, 2022;
originally announced March 2022.
-
Enhanced and controllable reflected group delay based on Tamm surface plasmons with Dirac semimetals
Authors:
Qiwen Zheng,
Wenguang Lu,
Shenping Wang,
Xinmin Zhao,
Leyong Jiang
Abstract:
In this paper, the reflected group delay from a multilayer structure in which a Dirac semimetal is coated on a one-dimensional photonic crystal (1D PC), separated by a spacer layer, is investigated theoretically. It is shown that the group delay of the reflected beam in this structure can be significantly enhanced negatively and can be switched from negative to positive. The enhanced group delay originates from the steep phase change caused by the excitation of Tamm plasmons at the interface between the Dirac semimetal and spacer layer. It is clear that the positive and negative group delay can be actively tuned through the Fermi energy and the relaxation time of the Dirac semimetal. We believe this enhanced and tunable delay scheme is promising for fabricating optical delay devices and other applications in the mid-infrared band.
Submitted 11 March, 2022;
originally announced March 2022.
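The link the abstract above draws between a steep reflection-phase change and a large group delay of either sign can be illustrated numerically. The sketch below is not the paper's multilayer calculation: it assumes the convention that group delay is the derivative of the reflection phase with respect to angular frequency, and it uses a toy single-resonance reflection coefficient in arbitrary units. The function names and the parameters `gamma_abs` (absorption loss rate) and `gamma_rad` (radiative coupling rate) are hypothetical.

```python
import cmath


def reflection(omega, omega0, gamma_abs, gamma_rad):
    """Toy single-resonance reflection coefficient (illustrative model only)."""
    detuning = omega - omega0
    return (1j * detuning + gamma_abs - gamma_rad) / (1j * detuning + gamma_abs + gamma_rad)


def group_delay(omega, omega0, gamma_abs, gamma_rad, step=1e-4):
    """Central-difference estimate of d(phase)/d(omega).

    The phase difference is taken from r(omega + step) * conj(r(omega - step)),
    which avoids 2*pi wrapping as long as the step is small.
    """
    r_plus = reflection(omega + step, omega0, gamma_abs, gamma_rad)
    r_minus = reflection(omega - step, omega0, gamma_abs, gamma_rad)
    return cmath.phase(r_plus * r_minus.conjugate()) / (2 * step)


if __name__ == "__main__":
    # Radiative coupling stronger than loss: negative delay at resonance in this toy model.
    print(group_delay(1.0, 1.0, gamma_abs=0.5, gamma_rad=1.0))
    # Loss stronger than radiative coupling: the sign flips to positive.
    print(group_delay(1.0, 1.0, gamma_abs=1.0, gamma_rad=0.5))
```

In this toy model the delay at resonance equals 1/(gamma_abs - gamma_rad) - 1/(gamma_abs + gamma_rad), so its sign flips as the loss rate crosses the coupling rate. The abstract attributes the analogous tuning in the real structure to the Fermi energy and relaxation time of the Dirac semimetal.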
-
UX Research on Conversational Human-AI Interaction: A Literature Review of the ACM Digital Library
Authors:
Qingxiao Zheng,
Yiliu Tang,
Yiren Liu,
Weizi Liu,
Yun Huang
Abstract:
Early conversational agents (CAs) focused on dyadic human-AI interaction between humans and the CAs, followed by the increasing popularity of polyadic human-AI interaction, in which CAs are designed to mediate human-human interactions. CAs for polyadic interactions are unique because they encompass hybrid social interactions, i.e., human-CA, human-to-human, and human-to-group behaviors. However, research on polyadic CAs is scattered across different fields, making it challenging to identify, compare, and accumulate existing knowledge. To promote the future design of CA systems, we conducted a literature review of ACM publications and identified a set of works that conducted UX (user experience) research. We qualitatively synthesized the effects of polyadic CAs into four aspects of human-human interactions, i.e., communication, engagement, connection, and relationship maintenance. Through a mixed-method analysis of the selected polyadic and dyadic CA studies, we developed a suite of evaluation measurements on the effects. Our findings show that designing with social boundaries, such as privacy, disclosure, and identification, is crucial for ethical polyadic CAs. Future research should also advance usability testing methods and trust-building guidelines for conversational AI.
Submitted 20 February, 2022;
originally announced February 2022.
-
Online Decision Transformer
Authors:
Qinqing Zheng,
Amy Zhang,
Aditya Grover
Abstract:
Recent work has shown that offline reinforcement learning (RL) can be formulated as a sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via approaches similar to large-scale language modeling. However, any practical instantiation of RL also involves an online component, where policies pretrained on passive offline datasets are finetuned via task-specific interactions with the environment. We propose Online Decision Transformers (ODT), an RL algorithm based on sequence modeling that blends offline pretraining with online finetuning in a unified framework. Our framework uses sequence-level entropy regularizers in conjunction with autoregressive modeling objectives for sample-efficient exploration and finetuning. Empirically, we show that ODT is competitive with the state-of-the-art in absolute performance on the D4RL benchmark but shows much more significant gains during the finetuning procedure.
Submitted 13 July, 2022; v1 submitted 11 February, 2022;
originally announced February 2022.
-
Condensation Droplet Sieve
Authors:
Chen Ma,
Zhiping Yuan,
Li Chen,
Lin Wang,
Wei Tong,
Cunjing Lv,
Quanshui Zheng
Abstract:
Large droplets emerging during dropwise condensation impair surface properties such as anti-fogging/frosting ability and heat transfer efficiency. How to spontaneously detach massive randomly distributed droplets with controlled sizes has remained a great challenge. Herein, we present a general solution called condensation droplet sieve, through fabricating microscale thin-walled lattice (TWL) structures coated with a superhydrophobic layer. Growing droplets were observed to jump off this TWL surface with 100% probability once becoming slightly larger than the lattices. The maximum radius and residual volume of droplets were strictly confined to 16 μm and 3.2 nl/mm², respectively, greatly surpassing the current state of the art. We reveal that this extremely efficient jumping is attributed to the large tolerance of coalescence mismatch and effective isolation of droplets between neighbouring lattices. Our work provides a new perspective for the design and fabrication of high-performance anti-dew materials.
Submitted 6 February, 2022;
originally announced February 2022.
-
Toward Enhanced Robustness in Unsupervised Graph Representation Learning: A Graph Information Bottleneck Perspective
Authors:
Jihong Wang,
Minnan Luo,
Jundong Li,
Ziqi Liu,
Jun Zhou,
Qinghua Zheng
Abstract:
Recent studies have revealed that GNNs are vulnerable to adversarial attacks. Most existing robust graph learning methods measure model robustness based on label information, rendering them infeasible when label information is not available. A straightforward direction is to employ the widely used Infomax technique from typical Unsupervised Graph Representation Learning (UGRL) to learn robust unsupervised representations. Nonetheless, directly transplanting the Infomax technique from typical UGRL to robust UGRL may involve a biased assumption. In light of the limitation of Infomax, we propose a novel unbiased robust UGRL method called Robust Graph Information Bottleneck (RGIB), which is grounded in the Information Bottleneck (IB) principle. Our RGIB attempts to learn robust node representations against adversarial perturbations by preserving the original information in the benign graph while eliminating the adversarial information in the adversarial graph. There are mainly two challenges to optimize RGIB: 1) high complexity of adversarial attack to perturb node features and graph structure jointly in the training procedure; 2) mutual information estimation upon adversarially attacked graphs. To tackle these problems, we further propose an efficient adversarial training strategy with only feature perturbations and an effective mutual information estimator with subgraph-level summary. Moreover, we theoretically establish a connection between our proposed RGIB and the robustness of downstream classifiers, revealing that RGIB can provide a lower bound on the adversarial risk of downstream classifiers. Extensive experiments over several benchmarks and downstream tasks demonstrate the effectiveness and superiority of our proposed method.
Submitted 8 June, 2023; v1 submitted 21 January, 2022;
originally announced January 2022.
-
Variational design for a structural family of CAD models
Authors:
Qiang Zou,
Qiqiang Zheng,
Zhihong Tang,
Shuming Gao
Abstract:
Variational design is a well-recognized CAD technique due to the increased design efficiency. It often presents as a parametric family of CAD models. Although effective, this way of working cannot handle design requirements that go beyond parametric changes. Such design requirements are not uncommon today due to the increasing popularity of product customization. In particular, there is often a need for designing a new model out of an existing structural family of models, which share a structural pattern but have individually varied detail features. To facilitate such design requirements, a new method is presented in this paper. The idea is to express the underlying structural pattern in terms of a submodel composed of the maximum common design features of the family, and then to build a single master model by attaching to the submodel all detail design features in the family. This master model is a representative model for the family and contains all the features. By removing unwanted detail features and adding new features, the master model can be easily adapted into a new design while keeping it structurally aligned with the family. Effectiveness of this method has been validated by a series of case studies and comparisons of increasing complexity.
Submitted 8 January, 2022;
originally announced January 2022.
-
FAVER: Blind Quality Prediction of Variable Frame Rate Videos
Authors:
Qi Zheng,
Zhengzhong Tu,
Pavan C. Madhusudana,
Xiaoyang Zeng,
Alan C. Bovik,
Yibo Fan
Abstract:
Video quality assessment (VQA) remains an important and challenging problem that affects many applications at the widest scales. Recent advances in mobile devices and cloud computing techniques have made it possible to capture, process, and share high resolution, high frame rate (HFR) videos across the Internet nearly instantaneously. Being able to monitor and control the quality of these streamed videos can enable the delivery of more enjoyable content and perceptually optimized rate control. Accordingly, there is a pressing need to develop VQA models that can be deployed at enormous scales. While some recent efforts have been applied to full-reference (FR) analysis of variable frame rate and HFR video quality, the development of no-reference (NR) VQA algorithms targeting frame rate variations has been little studied. Here, we propose a first-of-a-kind blind VQA model for evaluating HFR videos, which we dub the Framerate-Aware Video Evaluator w/o Reference (FAVER). FAVER uses extended models of spatial natural scene statistics that encompass space-time wavelet-decomposed video signals, to conduct efficient frame rate sensitive quality prediction. Our extensive experiments on several HFR video quality datasets show that FAVER outperforms other blind VQA algorithms at a reasonable computational cost. To facilitate reproducible research and public evaluation, an implementation of FAVER is being made freely available online: https://github.com/uniqzheng/HFR-BVQA.
Submitted 5 January, 2022;
originally announced January 2022.
-
Imaging field-tuned quantum Hall broken-symmetry orders and quantum Hall conducting channel in charge-neutral graphene WSe2 heterostructure
Authors:
Qi Zheng,
Mo-Han Zhang,
Ya-Ning Ren,
Lin He
Abstract:
The zeroth Landau level (0LL) in graphene has emerged as a flat-band platform in which distinct many-body phases can be explored with unprecedented control by simply tuning the strength and/or direction of magnetic fields [1-22]. A rich set of quantum Hall ferromagnetic (QHFM) phases with different lattice-scale symmetry-breaking orders are predicted to be realized in high magnetic fields when the 0LL in graphene is half filled [1-8,13-16]. Here we report a field-tuned continuous quantum phase transition of different valley orderings in QHFM of charge-neutral graphene on insulating tungsten diselenide (WSe2). The phase transition is clearly revealed by an anomalous field-dependent energy gap in the half-filled 0LL. Via atomic resolution imaging of electronic wavefunctions during the phase transition, we observe microscopic signatures of field-tuned, continuously varying valley polarization and valley inversion, which are unexpected and beyond current theory predictions. Moreover, the topological quantum Hall conducting channel of the graphene is directly imaged when the substrate (WSe2) introduces band bending of the 0LL.
Submitted 21 December, 2021;
originally announced December 2021.
-
A Search for the Cosmic Ray Boosted Sub-GeV Dark Matter at the PandaX-II Experiment
Authors:
Xiangyi Cui,
Abdusalam Abdukerim,
Zihao Bo,
Wei Chen,
Xun Chen,
Yunhua Chen,
Chen Cheng,
Yunshan Cheng,
Yingjie Fan,
Deqing Fang,
Changbo Fu,
Mengting Fu,
Lisheng Geng,
Karl Giboni,
Linhui Gu,
Xuyuan Guo,
Ke Han,
Changda He,
Jinrong He,
Di Huang,
Yanlin Huang,
Zhou Huang,
Ruquan Hou,
Xiangdong Ji,
Yonglin Ju
, et al. (54 additional authors not shown)
Abstract:
We report a novel search for the cosmic ray boosted dark matter using the 100~tonne$\cdot$day full data set of the PandaX-II detector located at the China Jinping Underground Laboratory. With the extra energy gained from the cosmic rays, sub-GeV dark matter particles can produce visible recoil signals in the detector. The diurnal modulations in rate and energy spectrum are utilized to further enhance the signal sensitivity. Our result excludes the dark matter-nucleon elastic scattering cross section between 10$^{-31}$cm$^{2}$ and 10$^{-28}$cm$^{2}$ for dark matter masses from 0.1 MeV/$c^2$ to 0.1 GeV/$c^2$, with a large parameter space previously unexplored by experimental collaborations.
Submitted 11 April, 2022; v1 submitted 16 December, 2021;
originally announced December 2021.
-
Low Radioactive Material Screening and Background Control for the PandaX-4T Experiment
Authors:
Zhicheng Qian,
Lin Si,
Abdusalam Abdukerim,
Zihao Bo,
Wei Chen,
Xun Chen,
Yunhua Chen,
Chen Cheng,
Yunshan Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Changbo Fu,
Mengting Fu,
Lisheng Geng,
Karl Giboni,
Linhui Gu,
Xuyuan Guo,
Ke Han,
Changda He,
Jinrong He,
Di Huang,
Yanlin Huang,
Zhou Huang,
Ruquan Hou
, et al. (54 additional authors not shown)
Abstract:
PandaX-4T is a ton-scale dark matter direct detection experiment using a dual-phase TPC technique at the China Jinping Underground Laboratory. Various ultra-low background technologies have been developed and applied to material screening for PandaX-4T, including HPGe gamma spectroscopy, ICP-MS, NAA, radon emanation measurement system, krypton assay station, and alpha detection system. Low background materials were selected to assemble the detector. Surface treatment procedures were investigated to further suppress radioactive background. Combining measured results and Monte Carlo simulation, the total material background rates of PandaX-4T in the energy region of 1-25 keV$\rm{}_{ee}$ are estimated to be (9.9 $\pm$ 1.9) $\times \ 10^{-3}$ mDRU for electron recoil and (2.8 $\pm$ 0.6) $\times \ 10^{-4}$ mDRU for nuclear recoil. In addition, $^{nat}$Kr in the detector is estimated to be <8 ppt.
Submitted 23 April, 2022; v1 submitted 6 December, 2021;
originally announced December 2021.
-
Decoupling Visual-Semantic Feature Learning for Robust Scene Text Recognition
Authors:
Changxu Cheng,
Bohan Li,
Qi Zheng,
Yongpan Wang,
Wenyu Liu
Abstract:
Semantic information has been proved effective in scene text recognition. Most existing methods tend to couple both visual and semantic information in an attention-based decoder. As a result, the learning of semantic features is prone to have a bias on the limited vocabulary of the training set, which is called vocabulary reliance. In this paper, we propose a novel Visual-Semantic Decoupling Network (VSDN) to address the problem. Our VSDN contains a Visual Decoder (VD) and a Semantic Decoder (SD) to learn purer visual and semantic feature representation respectively. Besides, a Semantic Encoder (SE) is designed to match SD, which can be pre-trained together by additional inexpensive large vocabulary via a simple word correction task. Thus the semantic feature is more unbiased and precise to guide the visual feature alignment and enrich the final character representation. Experiments show that our method achieves state-of-the-art or competitive results on the standard benchmarks, and outperforms the popular baseline by a large margin under circumstances where the training set has a small size of vocabulary.
Submitted 24 November, 2021;
originally announced November 2021.
-
Structural Origin of Boson Peak in Glasses
Authors:
Yuan Tian,
Xiaozhe Shen,
Qingyang Gao,
Zhen Lu,
Jie Yang,
Qiang Zheng,
Christopher Florencio Aleman,
Duan Luo,
Alexander Hume Reid,
Bin Xu,
Michael Falk,
Howard Sheng,
Jianming Cao,
Xijie Wang,
Mingwei Chen
Abstract:
Boson peak, the excess low energy excitations in the terahertz regime, is one of the most unique features of disordered systems and has been linked to many anomalous properties of glass materials. The nature and structural origin of the boson peak remain elusive and have been debated for more than a half century mainly due to the lack of real-time and real-space experimental insights of the dynamic phenomenon. In this work we employed femtosecond MeV ultrafast electron diffraction to characterize the atomic dynamics of metallic glasses in real time. The experiment reveals collective atomic oscillations, presented in elastic electron scattering and atomic pair distribution functions, within the boson peak frequency range of 1.0-1.8 THz in both reciprocal and real space. It was found that the oscillation frequency has reciprocal dependence on interatomic pair distances and the corresponding wave velocity experimentally affirms the transverse acoustic wave nature of the boson peak. The observed strong correlation between THz acoustic vibrations and coherent electron scattering provides compelling evidence that the boson peak originates from the collective transverse vibrational modes of structurally ordered atoms in the disordered system.
Submitted 19 November, 2021;
originally announced November 2021.
-
Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning
Authors:
Qinkai Zheng,
Xu Zou,
Yuxiao Dong,
Yukuo Cen,
Da Yin,
Jiarong Xu,
Yang Yang,
Jie Tang
Abstract:
Adversarial attacks on graphs have posed a major threat to the robustness of graph machine learning (GML) models. Naturally, there is an ever-escalating arms race between attackers and defenders. However, the strategies behind both sides are often not fairly compared under the same and realistic conditions. To bridge this gap, we present the Graph Robustness Benchmark (GRB) with the goal of providing a scalable, unified, modular, and reproducible evaluation for the adversarial robustness of GML models. GRB standardizes the process of attacks and defenses by 1) developing scalable and diverse datasets, 2) modularizing the attack and defense implementations, and 3) unifying the evaluation protocol in refined scenarios. By leveraging the GRB pipeline, the end-users can focus on the development of robust GML models with automated data processing and experimental evaluations. To support open and reproducible research on graph adversarial learning, GRB also hosts public leaderboards across different scenarios. As a starting point, we conduct extensive experiments to benchmark baseline techniques. GRB is open-source and welcomes contributions from the community. Datasets, code, and leaderboards are available at https://cogdl.ai/grb/home.
Submitted 8 November, 2021;
originally announced November 2021.
-
Light yield and field dependence measurement in PandaX-II dual-phase xenon detector
Authors:
Zhou Huang,
Abdusalam Abdukerim,
Zihao Bo,
Wei Chen,
Xun Chen,
Yunhua Chen,
Chen Cheng,
Yunshan Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Changbo Fu,
Mengting Fu,
Lisheng Geng,
Karl Giboni,
Linhui Gu,
Xuyuan Guo,
Ke Han,
Changda He,
Jinrong He,
Di Huang,
Yanlin Huang,
Ruquan Hou,
Xiangdong Ji,
Yonglin Ju
, et al. (54 additional authors not shown)
Abstract:
The dual-phase xenon time projection chamber (TPC) is one of the most sensitive detector technologies for dark matter direct search, where the energy deposition of an incoming particle can be converted into photons and electrons through xenon excitation and ionization. The detector response to signal energy deposition varies significantly with the electric field in liquid xenon. We study the detector's light yield and its dependence on the electric field in the PandaX-II dual-phase detector containing 580 kg of liquid xenon in the sensitive volume. From our measurements, the light yield at electric fields from 0 V/cm to 317 V/cm is obtained for energy depositions up to 236 keV.
Submitted 3 December, 2021; v1 submitted 2 November, 2021;
originally announced November 2021.
-
Neural Relightable Participating Media Rendering
Authors:
Quan Zheng,
Gurprit Singh,
Hans-Peter Seidel
Abstract:
Learning neural radiance fields of a scene has recently allowed realistic novel view synthesis of the scene, but such fields are limited to synthesizing images under the original fixed lighting condition. Therefore, they are not flexible enough for eagerly desired tasks like relighting, scene editing and scene composition. To tackle this problem, several recent methods propose to disentangle reflectance and illumination from the radiance field. These methods can cope with solid objects with opaque surfaces, but participating media are neglected. Also, they take into account only direct illumination or at most one-bounce indirect illumination, and thus suffer from energy loss due to ignoring the high-order indirect illumination. We propose to learn neural representations for participating media with a complete simulation of global illumination. We estimate direct illumination via ray tracing and compute indirect illumination with spherical harmonics. Our approach avoids computing the lengthy indirect bounces and does not suffer from energy loss. Our experiments on multiple scenes show that our approach achieves superior visual quality and numerical performance compared to state-of-the-art methods, and it can generalize to deal with solid objects with opaque surfaces as well.
Submitted 25 October, 2021;
originally announced October 2021.
-
PPSGCN: A Privacy-Preserving Subgraph Sampling Based Distributed GCN Training Method
Authors:
Binchi Zhang,
Minnan Luo,
Shangbin Feng,
Ziqi Liu,
Jun Zhou,
Qinghua Zheng
Abstract:
Graph convolutional networks (GCNs) have been widely adopted for graph representation learning and achieved impressive performance. For larger graphs stored separately on different clients, distributed GCN training algorithms were proposed to improve efficiency and scalability. However, existing methods directly exchange node features between different clients, which results in data privacy leakage. Federated learning has been incorporated into graph learning to address data privacy, but such methods suffer a severe performance drop due to non-IID data distributions. Besides, these approaches generally involve heavy communication and memory overhead during the training process. In light of these problems, we propose a Privacy-Preserving Subgraph sampling based distributed GCN training method (PPSGCN), which preserves data privacy and significantly cuts back on communication and memory overhead. Specifically, PPSGCN employs a star-topology client-server system. We first sample a local node subset in each client to form a global subgraph, which greatly reduces communication and memory costs. We then conduct local computation on each client with features or gradients of the sampled nodes. Finally, all clients securely communicate with the central server with homomorphic encryption to combine local results while preserving data privacy. Compared with federated graph learning methods, our PPSGCN model is trained on a global graph to avoid the negative impact of local data distribution. We prove that our PPSGCN algorithm would converge to a local optimum with probability 1. Experiment results on three prevalent benchmarks demonstrate that our algorithm significantly reduces communication and memory overhead while maintaining desirable performance. Further studies not only demonstrate the fast convergence of PPSGCN, but also discuss the trade-off between communication and local computation cost.
Submitted 22 October, 2021;
originally announced October 2021.
-
A Soft-Rigid Hybrid Gripper with Lateral Compliance and Dexterous In-hand Manipulation
Authors:
Wenpei Zhu,
Chenghua Lu,
Qule Zheng,
Zhonggui Fang,
Haichuan Che,
Kailuan Tang,
Mingchao Zhu,
Sicong Liu,
Zheng Wang
Abstract:
Soft grippers are receiving growing attention due to their compliance-based interactive safety and dexterity. Hybrid grippers (soft actuators enhanced by rigid constraints) are a new trend in soft gripper design. With the right structural components actuated by soft actuators, they could achieve excellent grasping adaptability and payload, while also being easy to model and control with conventional kinematics. However, existing works were mostly focused on achieving superior payload and perception with simple planar workspaces, resulting in far less dexterity compared with conventional grippers. In this work, we took inspiration from the human Metacarpophalangeal (MCP) joint and proposed a new hybrid gripper design with 8 independent muscles. It was shown that adding the MCP complexity was critical in enabling a range of novel features in the hybrid gripper, including in-hand manipulation, lateral passive compliance, as well as new control modes. A prototype gripper was fabricated and tested on our proprietary dual-arm robot platform with vision-guided grasping. With very lightweight pneumatic bellows soft actuators, the gripper could grasp objects over 25 times its own weight with lateral compliance. Using the dual-arm platform, highly anthropomorphic dexterous manipulations were demonstrated using two hybrid grippers, from tug-of-war on a rigid rod to passing a soft towel between two grippers using in-hand manipulation. Matching the novel features and performance specifications of the proposed hybrid gripper, the underlying modeling, actuation, control, and experimental validation details were also presented, offering a promising approach to achieving enhanced dexterity, strength, and compliance in robotic grippers.
Submitted 19 October, 2021;
originally announced October 2021.
-
Interweaving Polar Charge Orders in a Layered Metallic Super-atomic Crystal
Authors:
Shuya Xing,
Linlu Wu,
Zilu Wang,
Xu Chen,
Haining Liu,
Shuo Han,
Le Lei,
Linwei Zhou,
Qi Zheng,
Li Huang,
Xiao Lin,
Liming Xie,
Xiaolong Chen,
Hong-Jun Gao,
Zhihai Cheng,
Jiangang Guo,
Shancai Wang,
Wei Ji
Abstract:
Electronic properties of super-atomic crystals have not been sufficiently explored due to the versatility of their building units; moreover, their inter-unit couplings are even more poorly understood. Here, we present a joint experiment-theory investigation of a rationally designed layered super-atomic crystal of Au6Te12Se8 cubes, stacked by non-covalent inter-cube quasi-bonds. We found a sequentially emerging anisotropic triple-cube charge-density-wave (tc-CDW) and polarized metallic states below 120 K, as revealed via scanning tunneling microscopy/spectroscopy, angle-resolved photoemission spectroscopy, transport measurement, Raman spectra, and density functional theory. The polarized states are locked in an anti-parallel configuration, which is required for maintaining the inversion symmetry of the center-cube in the tc-CDW. The anti-polar metallic states are thus interweaved by the charge-density-wave and the polarized metallic states, and primarily ascribed to electronic effects via theoretical calculations. This work not only demonstrates a microscopic picture of the interweaved CDW and polarized charge orders in the super-atomic crystal of ATS, but also sheds light on expanding the existing category of quantum materials to non-covalent solids.
Submitted 12 October, 2022; v1 submitted 18 October, 2021;
originally announced October 2021.
-
Coexistence of electron whispering-gallery modes and atomic collapse states in graphene/WSe2 heterostructure quantum dots
Authors:
Qi Zheng,
Yu-Chen Zhuang,
Qing-Feng Sun,
Lin He
Abstract:
The relativistic massless charge carriers with a Fermi velocity of about c/300 in graphene (c being the speed of light in vacuum) enable us to realize two distinct types of resonances. One is the electron whispering-gallery mode in graphene quantum dots, arising from the Klein tunneling of the massless Dirac fermions. The other is the atomic collapse state, which has never been observed in experiment with real atoms due to the difficulty of producing heavy nuclei with charge Z > 170, but can be realized near a Coulomb impurity in graphene with a charge Z ≥ 1 because of the small velocity of the Dirac excitations. Here, unexpectedly, we demonstrate that both the electron whispering-gallery modes and atomic collapse states coexist in graphene/WSe2 heterostructure quantum dots due to the Coulomb-like potential near their edges. By applying a perpendicular magnetic field, the evolution from the atomic collapse states to unusual Landau levels in the collapse regime is explored for the first time.
Submitted 13 October, 2021;
originally announced October 2021.
-
Epoch of Reionization Power Spectrum Limits from Murchison Widefield Array Data Targeted at EoR1 Field
Authors:
M. Rahimi,
B. Pindor,
J. L. B. Line,
N. Barry,
C. M. Trott,
R. L. Webster,
C. H. Jordan,
M. Wilensky,
S. Yoshiura,
A. Beardsley,
J. Bowman,
R. Byrne,
A. Chokshi,
B. J. Hazelton,
K. Hasegawa,
E. Howard,
B. Greig,
D. Jacobs,
R. Joseph,
M. Kolopanis,
C. Lynch,
B. McKinley,
D. A. Mitchell,
S. Murray,
M. F. Morales
, et al. (6 additional authors not shown)
Abstract:
Current attempts to measure the 21 cm power spectrum of neutral hydrogen during the Epoch of Reionization are limited by systematics which produce measured upper limits above both the thermal noise and the expected cosmological signal. These systematics arise from a combination of observational, instrumental, and analysis effects. In order to further understand and mitigate these effects, it is instructive to explore different aspects of existing datasets. One such aspect is the choice of observing field. To date, MWA EoR observations have largely focused on the EoR0 field. In this work, we present a new detailed analysis of the EoR1 field. The EoR1 field is one of the coldest regions of the Southern radio sky, but contains the very bright radio galaxy Fornax-A. The presence of this bright extended source in the primary beam of the interferometer makes the calibration and analysis of EoR1 particularly challenging. We demonstrate the effectiveness of a recently developed shapelet model of Fornax-A in improving the results from this field. We also describe and apply a series of data quality metrics which identify and remove systematically contaminated data. With substantially improved source models, upgraded analysis algorithms and enhanced data quality metrics, we determine EoR power spectrum upper limits based on analysis of the best $\sim$14 hours of data observed during 2015 and 2014 at redshifts 6.5, 6.8 and 7.1, with the lowest $2\sigma$ upper limit at $z=6.5$ of $\Delta^2 \leq (73.78~\mathrm{mK})^2$ at $k=0.13~h~\mathrm{Mpc}^{-1}$, improving on previous EoR1 measurement results.
Submitted 7 October, 2021;
originally announced October 2021.
-
SentiPrompt: Sentiment Knowledge Enhanced Prompt-Tuning for Aspect-Based Sentiment Analysis
Authors:
Chengxi Li,
Feiyu Gao,
Jiajun Bu,
Lu Xu,
Xiang Chen,
Yu Gu,
Zirui Shao,
Qi Zheng,
Ningyu Zhang,
Yongpan Wang,
Zhi Yu
Abstract:
Aspect-based sentiment analysis (ABSA) is an emerging fine-grained sentiment analysis task that aims to extract aspects, classify corresponding sentiment polarities and find opinions as the causes of sentiment. The latest research tends to solve the ABSA task in a unified way with end-to-end frameworks. Yet, these frameworks get fine-tuned from downstream tasks without any task-adaptive modification. Specifically, they do not use task-related knowledge well or explicitly model relations between aspect and opinion terms, hindering them from better performance. In this paper, we propose SentiPrompt to use sentiment knowledge enhanced prompts to tune the language model in the unified framework. We inject sentiment knowledge regarding aspects, opinions, and polarities into prompt and explicitly model term relations via constructing consistency and polarity judgment templates from the ground truth triplets. Experimental results demonstrate that our approach can outperform strong baselines on Triplet Extraction, Pair Extraction, and Aspect Term Extraction with Sentiment Classification by a notable margin.
Submitted 16 September, 2021;
originally announced September 2021.
-
Semantics-Guided Contrastive Network for Zero-Shot Object Detection
Authors:
Caixia Yan,
Xiaojun Chang,
Minnan Luo,
Huan Liu,
Xiaoqin Zhang,
Qinghua Zheng
Abstract:
Zero-shot object detection (ZSD), the task that extends conventional detection models to detecting objects from unseen categories, has emerged as a new challenge in computer vision. Most existing approaches tackle the ZSD task with a strict mapping-transfer strategy, which may lead to suboptimal ZSD results: 1) the learning process of those models ignores the available unseen class information, and thus can be easily biased towards the seen categories; 2) the original visual feature space is not well-structured and lacks discriminative information. To address these issues, we develop a novel Semantics-Guided Contrastive Network for ZSD, named ContrastZSD, a detection framework that is the first to bring a contrastive learning mechanism into the realm of zero-shot detection. In particular, ContrastZSD incorporates two semantics-guided contrastive learning subnets that contrast between region-category and region-region pairs respectively. The pairwise contrastive tasks take advantage of additional supervision signals derived from both the ground truth labels and a pre-defined class similarity distribution. Under the guidance of this explicit semantic supervision, the model can learn more knowledge about unseen categories to avoid the bias towards seen concepts, while optimizing the data structure of visual features to be more discriminative for better visual-semantic alignment. Extensive experiments are conducted on two popular benchmarks for ZSD, i.e., PASCAL VOC and MS COCO. Results show that our method outperforms the previous state-of-the-art on both ZSD and generalized ZSD tasks.
Submitted 31 December, 2021; v1 submitted 3 September, 2021;
originally announced September 2021.
-
Probing quantum many-body correlations by universal ramping dynamics
Authors:
Libo Liang,
Wei Zheng,
Ruixiao Yao,
Qinpei Zheng,
Zhiyuan Yao,
Tian-Gang Zhou,
Qi Huang,
Zhongchi Zhang,
Jilai Ye,
Xiaoji Zhou,
Xuzong Chen,
Wenlan Chen,
Hui Zhai,
Jiazhong Hu
Abstract:
Ramping a physical parameter is one of the most common experimental protocols in studying a quantum system, and ramping dynamics has been widely used in preparing a quantum state and probing physical properties. Here, we present a novel method of probing quantum many-body correlation by ramping dynamics. We ramp a Hamiltonian parameter to the same target value from different initial values and with different velocities, and we show that the first-order correction on the finite ramping velocity is universal and path-independent, revealing a novel quantum many-body correlation function of the equilibrium phases at the target values. We term this method the non-adiabatic linear response since this is the leading order correction beyond the adiabatic limit. We demonstrate this method experimentally by studying the Bose-Hubbard model with ultracold atoms in three-dimensional optical lattices. Unlike the conventional linear response that reveals whether the quasi-particle dispersion of a quantum phase is gapped or gapless, this probe is more sensitive to whether the quasi-particle lifetime is long enough such that the quantum phase possesses a well-defined quasi-particle description. In the Bose-Hubbard model, this non-adiabatic linear response is significant in the quantum critical regime where well-defined quasi-particles are absent. In contrast, this response is vanishingly small in both the superfluid and the Mott insulator, which possess well-defined quasi-particles. Because our proposal uses the most common experimental protocol, we envision that our method can find broad applications in probing various quantum systems.
Submitted 4 January, 2023; v1 submitted 1 September, 2021;
originally announced September 2021.
-
A 500 MS/s waveform digitizer for PandaX dark matter experiments
Authors:
Changda He,
Jianglai Liu,
Xiangxiang Ren,
Xiaofeng Shang,
Xikai Wei,
Mingxin Wang,
Jijun Yang,
Jinqun Yang,
Yong Yang,
Guangping Zhang,
Qibin Zheng
Abstract:
Waveform digitizers are key readout instruments in particle physics experiments. In this paper, we present a waveform digitizer for the PandaX dark matter experiments. It supports both external-trigger readout and triggerless readout, accommodating the needs of low-rate full-waveform readout and channel-independent low-threshold acquisition, respectively. This digitizer is an 8-channel VME board with a sampling rate of 500 MS/s and 14-bit resolution for each channel. A digitizer system consisting of 72 channels has been tested in situ in the PandaX-4T experiment. We report the system performance with real data.
Submitted 22 December, 2021; v1 submitted 26 August, 2021;
originally announced August 2021.
-
The microstructural dependence of ionic transport in bi-continuous nanoporous metal
Authors:
Congcheng Wang,
Anson Tsang,
Diwen Xiao,
Yuan Xu,
Shida Yang,
Ling-Zhi Liu,
Qiang Zheng,
Pan Liu,
Hai-Jun Jin,
Qing Chen
Abstract:
Ionic transport in nanopores holds the key to unlocking the full potential of bi-continuous nanoporous (NP) metals as advanced electrodes in electrochemical devices. The precise control of the uniform NP metal structures also provides us a unique opportunity to understand how complex structures determine transport at the nanoscale. For NP Au from the dealloying of a Ag-Au alloy, we can tune the pore size in the range of 13 nm to 2.4 microns and the porosity between 38% and 69% via isothermal coarsening. For NP Ag from the reduction-induced decomposition of AgCl, we can additionally control its structural hierarchy and pore orientation. We measure the effective ionic conductivities of 1 M NaClO4 through these NP metals as membranes, which range from 7% to 44% of that of a free solution, corresponding to calculated pore tortuosities between 2.7 and 1.3. The tortuosity of NP Au displays weak dependences on both the pore size and the porosity, consistent with the observed self-similarity in the coarsening, except for those of pores < 25 nm, which we consider to deviate from the well-coarsened pore geometry. For NP Ag, the low tortuosity of the hierarchical structure can be explained with the Maxwell-Garnett equation, and that of the oriented structure underlines the random orientation as the cause of slow transport in other NP metals. Finally, we achieve high current densities of CO2 reduction with these two low-tortuosity NP Ags, demonstrating the significance of the structure-transport relationships for designing functional NP metals.
Submitted 25 August, 2021;
originally announced August 2021.
-
A Nested Cross Decomposition Algorithm for Power System Capacity Expansion with Multiscale Uncertainties
Authors:
Zhouchun Huang,
Qipeng P. Zheng,
Andrew L. Liu
Abstract:
Modern electric power systems have witnessed rapidly increasing penetration of renewable energy, storage, electric vehicles and various demand response resources. Electric infrastructure planning is thus facing more challenges due to the variability and uncertainties arising from the diverse new resources. This study aims to develop a multistage and multiscale stochastic mixed integer programming (MM-SMIP) model to capture both the coarse-temporal-scale uncertainties, such as investment cost and long-run demand stochasticity, and fine-temporal-scale uncertainties, such as hourly renewable energy output and electricity demand uncertainties, for the power system capacity expansion problem. When applied to a real power system, the resulting model leads to extremely large-scale mixed integer programming problems, which suffer not only from the well-known curse of dimensionality, but also from computational difficulties with a vast number of integer variables at each stage. In addressing such challenges associated with the MM-SMIP model, we propose a nested cross decomposition algorithm that consists of two layers of decomposition, that is, the Dantzig-Wolfe decomposition and L-shaped decomposition. The algorithm exhibits promising computational performance in our numerical study, and is especially amenable to parallel computing, which is also demonstrated through the computational results.
Submitted 19 August, 2021;
originally announced August 2021.
-
Legislator Representation Learning with Social Context and Expert Knowledge
Authors:
Shangbin Feng,
Zhaoxuan Tan,
Zilong Chen,
Peisheng Yu,
Qinghua Zheng,
Xiaojun Chang,
Minnan Luo
Abstract:
Modeling the ideological perspectives of political actors is an essential task in computational political science with applications in many downstream tasks. Existing approaches are generally limited to textual data and voting records, while they neglect the rich social context and valuable expert knowledge for holistic evaluation. In this paper, we propose a representation learning framework of political actors that jointly leverages social context and expert knowledge. Specifically, we retrieve and extract factual statements about legislators to leverage social context information. We then construct a heterogeneous information network to incorporate social context and use relational graph neural networks to learn legislator representations. Finally, we train our model with three objectives to align representation learning with expert knowledge, model ideological stance consistency, and simulate the echo chamber phenomenon. Extensive experiments demonstrate that our learned representations successfully advance the state-of-the-art in three downstream tasks. Further analysis proves the correlation between learned legislator representations and various socio-political factors, and bears out the necessity of social context and expert knowledge in modeling political actors.
Submitted 3 January, 2022; v1 submitted 9 August, 2021;
originally announced August 2021.
-
KGAP: Knowledge Graph Augmented Political Perspective Detection in News Media
Authors:
Shangbin Feng,
Zilong Chen,
Wenqian Zhang,
Qingyao Li,
Qinghua Zheng,
Xiaojun Chang,
Minnan Luo
Abstract:
Identifying political perspectives in news media has become an important task due to the rapid growth of political commentary and the increasingly polarized political ideologies. Previous approaches focus on textual content and leave out the rich social and political context that is essential in the perspective detection process. To address this limitation, we propose KGAP, a political perspective detection method that incorporates external domain knowledge. Specifically, we construct a political knowledge graph to serve as domain-specific external knowledge. We then construct heterogeneous information networks to represent news documents, which jointly model news text and external knowledge. Finally, we adopt relational graph neural networks and conduct political perspective detection as graph-level classification. Extensive experiments demonstrate that our method consistently achieves the best performance on two real-world perspective detection benchmarks. Ablation studies further bear out the necessity of external knowledge and the effectiveness of our graph-based approach.
Submitted 17 May, 2022; v1 submitted 9 August, 2021;
originally announced August 2021.
-
Readout electronics and data acquisition system of PandaX-4T experiment
Authors:
Jijun Yang,
Xun Chen,
Changda He,
Di Huang,
Yanlin Huang,
Jianglai Liu,
Xiangxiang Ren,
Anqing Wang,
Meng Wang,
Binbin Yan,
Kai Yin,
Jinqun Yang,
Yong Yang,
Qibin Zheng
Abstract:
PandaX-4T is a dark matter direct detection experiment located in the China Jinping Underground Laboratory. The central apparatus is a dual-phase xenon detector containing 4 tons of liquid xenon in the sensitive volume, with about 500 photomultipliers instrumented in the top and the bottom of the detector. In this paper we present a completely new system of readout electronics and data acquisition in the PandaX-4T experiment. Compared to the one used in the previous PandaX dark matter experiments, the new system features triggerless readout and higher bandwidth. With triggerless readout, dark matter searches are not affected by the efficiency loss of external triggers. The system records single photoelectron signals of the dominant PMTs with an average efficiency of 96%, and achieves a bandwidth of more than 450 MB/s. The system has been used to successfully acquire data during the commissioning runs of PandaX-4T.
Submitted 16 February, 2022; v1 submitted 7 August, 2021;
originally announced August 2021.
-
RockGPT: Reconstructing three-dimensional digital rocks from single two-dimensional slice from the perspective of video generation
Authors:
Qiang Zheng,
Dongxiao Zhang
Abstract:
Random reconstruction of three-dimensional (3D) digital rocks from two-dimensional (2D) slices is crucial for elucidating the microstructure of rocks and its effects on pore-scale flow in terms of numerical modeling, since massive samples are usually required to handle intrinsic uncertainties. Despite remarkable advances achieved by traditional process-based methods, statistical approaches and recently famous deep learning-based models, few works have focused on producing several kinds of rocks with one trained model and allowing the reconstructed samples to satisfy certain given properties, such as porosity. To fill this gap, we propose a new framework, named RockGPT, which is composed of VQ-VAE and conditional GPT, to synthesize 3D samples based on a single 2D slice from the perspective of video generation. The VQ-VAE is utilized to compress high-dimensional input video, i.e., the sequence of continuous rock slices, to discrete latent codes and reconstruct them. In order to obtain diverse reconstructions, the discrete latent codes are modeled using conditional GPT in an autoregressive manner, while incorporating conditional information from a given slice, rock type, and porosity. We conduct two experiments on five kinds of rocks, and the results demonstrate that RockGPT can produce different kinds of rocks with the same model, and the reconstructed samples can successfully meet certain specified porosities. In a broader sense, through leveraging the proposed conditioning scheme, RockGPT constitutes an effective way to build a general model to produce multiple kinds of rocks simultaneously that also satisfy user-defined properties.
Submitted 4 August, 2021;
originally announced August 2021.
-
Constraining the 21cm brightness temperature of the IGM at $z$=6.6 around LAEs with the Murchison Widefield Array
Authors:
Cathryn M. Trott,
C. H. Jordan,
J. L. B. Line,
C. R. Lynch,
S. Yoshiura,
B. McKinley,
P. Dayal,
B. Pindor,
A. Hutter,
K. Takahashi,
R. B. Wayth,
N. Barry,
A. Beardsley,
J. Bowman,
R. Byrne,
A. Chokshi,
B. Greig,
K. Hasegawa,
B. J. Hazelton,
E. Howard,
D. Jacobs,
M. Kolopanis,
D. A. Mitchell,
M. F. Morales,
S. Murray
, et al. (7 additional authors not shown)
Abstract:
The locations of Ly-$α$ emitting galaxies (LAEs) at the end of the Epoch of Reionisation (EoR) are expected to correlate with regions of ionised hydrogen, traced by the redshifted 21~cm hyperfine line. Mapping the neutral hydrogen around regions with detected and localised LAEs offers an avenue to constrain the brightness temperature of the Universe within the EoR by providing an expectation for the spatial distribution of the gas, thereby providing prior information unavailable to power spectrum measurements. We use a test set of 12 hours of observations from the Murchison Widefield Array (MWA) in extended array configuration, to constrain the neutral hydrogen signature of 58 LAEs, detected with the Subaru Hypersuprime Cam in the \textit{Silverrush} survey, centred on $z$=6.58. We assume that detectable emitters reside in the centre of ionised HII bubbles during the end of reionization, and predict the redshifted neutral hydrogen signal corresponding to the remaining neutral regions using a set of different ionised bubble radii. A prewhitening matched filter detector is introduced to assess detectability. We demonstrate the ability to detect, or place limits upon, the amplitude of brightness temperature fluctuations, and the characteristic HII bubble size. With our limited data, we constrain the brightness temperature of neutral hydrogen to $Δ{\rm T}_B<$30 mK ($<$200 mK) at 95% (99%) confidence for lognormally-distributed bubbles of radii, $R_B =$ 15$\pm$2$h^{-1}$cMpc.
Submitted 30 July, 2021;
originally announced July 2021.
-
Dark Matter Search Results from the PandaX-4T Commissioning Run
Authors:
Yue Meng,
Zhou Wang,
Yi Tao,
Abdusalam Abdukerim,
Zihao Bo,
Wei Chen,
Xun Chen,
Yunhua Chen,
Chen Cheng,
Yunshan Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Changbo Fu,
Mengting Fu,
Lisheng Geng,
Karl Giboni,
Linhui Gu,
Xuyuan Guo,
Ke Han,
Changda He,
Jinrong He,
Di Huang,
Yanlin Huang,
Zhou Huang
, et al. (54 additional authors not shown)
Abstract:
We report the first dark matter search results using the commissioning data from PandaX-4T. Using a time projection chamber with 3.7-tonne of liquid xenon target and an exposure of 0.63 tonne$\cdot$year, 1058 candidate events are identified within an approximate nuclear recoil energy window between 5 and 100 keV. No significant excess over background is observed. Our data set a stringent limit to the dark matter-nucleon spin-independent interactions, with a lowest excluded cross section (90% C.L.) of $3.8\times10^{-47} $cm$^2$ at a dark matter mass of 30 GeV/$c^2$.
Submitted 17 December, 2021; v1 submitted 28 July, 2021;
originally announced July 2021.
-
Inference for High Dimensional Censored Quantile Regression
Authors:
Zhe Fei,
Qi Zheng,
Hyokyoung G. Hong,
Yi Li
Abstract:
With the availability of high dimensional genetic biomarkers, it is of interest to identify heterogeneous effects of these predictors on patients' survival, along with proper statistical inference. Censored quantile regression has emerged as a powerful tool for detecting heterogeneous effects of covariates on survival outcomes. To our knowledge, there is little work available to draw inference on the effects of high dimensional predictors for censored quantile regression. This paper proposes a novel procedure to draw inference on all predictors within the framework of global censored quantile regression, which investigates covariate-response associations over an interval of quantile levels, instead of a few discrete values. The proposed estimator combines a sequence of low dimensional model estimates that are based on multi-sample splittings and variable selection. We show that, under some regularity conditions, the estimator is consistent and asymptotically follows a Gaussian process indexed by the quantile level. Simulation studies indicate that our procedure can properly quantify the uncertainty of the estimates in high dimensional settings. We apply our method to analyze the heterogeneous effects of SNPs residing in lung cancer pathways on patients' survival, using the Boston Lung Cancer Survival Cohort, a cancer epidemiology study on the molecular mechanism of lung cancer.
Submitted 22 July, 2021;
originally announced July 2021.
-
The GLEAM 200 MHz Local Radio Luminosity Function for AGN and Star-forming Galaxies
Authors:
T. M. O. Franzen,
N. Seymour,
E. M. Sadler,
T. Mauch,
S. V. White,
C. A. Jackson,
R. Chhetri,
B. Quici,
M. E. Bell,
J. R. Callingham,
K. S. Dwarakanath,
B. For,
B. M. Gaensler,
P. J. Hancock,
L. Hindson,
N. Hurley-Walker,
M. Johnston-Hollitt,
A. D. Kapinska,
E. Lenc,
B. McKinley,
J. Morgan,
A. R. Offringa,
P. Procopio,
L. Staveley-Smith,
R. B. Wayth
, et al. (2 additional authors not shown)
Abstract:
The GaLactic and Extragalactic All-sky Murchison Widefield Array (GLEAM) is a radio continuum survey at 76-227 MHz of the entire southern sky (Declination $<+30°$) with an angular resolution of $\approx 2$ arcmin. In this paper, we combine GLEAM data with optical spectroscopy from the 6dF Galaxy Survey to construct a sample of 1,590 local (median $z \approx 0.064$) radio sources with $S_{200\,\mathrm{MHz}} > 55$ mJy across an area of $\approx 16,700~\mathrm{deg}^{2}$. From the optical spectra, we identify the dominant physical process responsible for the radio emission from each galaxy: 73 per cent are fuelled by an active galactic nucleus (AGN) and 27 per cent by star formation. We present the local radio luminosity function for AGN and star-forming galaxies at 200 MHz and characterise the typical radio spectra of these two populations between 76 MHz and $\sim 1$ GHz. For the AGN, the median spectral index between 200 MHz and $\sim 1$ GHz, $α_{\mathrm{high}}$, is $-0.600 \pm 0.010$ (where $S \propto ν^α$) and the median spectral index within the GLEAM band, $α_{\mathrm{low}}$, is $-0.704 \pm 0.011$. For the star-forming galaxies, the median value of $α_{\mathrm{high}}$ is $-0.650 \pm 0.010$ and the median value of $α_{\mathrm{low}}$ is $-0.596 \pm 0.015$. Among the AGN population, flat-spectrum sources are more common at lower radio luminosity, suggesting the existence of a significant population of weak radio AGN that remain core-dominated even at low frequencies. However, around 4 per cent of local radio AGN have ultra-steep radio spectra at low frequencies ($α_{\mathrm{low}} < -1.2$). These ultra-steep-spectrum sources span a wide range in radio luminosity, and further work is needed to clarify their nature.
Submitted 19 July, 2021;
originally announced July 2021.
-
Learning Aesthetic Layouts via Visual Guidance
Authors:
Qingyuan Zheng,
Zhuoru Li,
Adam Bargteil
Abstract:
We explore computational approaches for visual guidance to aid in creating aesthetically pleasing art and graphic design. Our work complements and builds on previous work that developed models for how humans look at images. Our approach comprises three steps. First, we collected a dataset of art masterpieces and labeled the visual fixations with state-of-art vision models. Second, we clustered the visual guidance templates of the art masterpieces with unsupervised learning. Third, we developed a pipeline using generative adversarial networks to learn the principles of visual guidance and that can produce aesthetically pleasing layouts. We show that the aesthetic visual guidance principles can be learned and integrated into a high-dimensional model and can be queried by the features of graphic elements. We evaluate our approach by generating layouts on various drawings and graphic designs. Moreover, our model considers the color and structure of graphic elements when generating layouts. Consequently, we believe our tool, which generates multiple aesthetic layout options in seconds, can help artists create beautiful art and graphic designs.
Submitted 13 July, 2021;
originally announced July 2021.
-
Listen As You Wish: Audio based Event Detection via Text-to-Audio Grounding in Smart Cities
Authors:
Haoyu Tang,
Yunxiao Wang,
Jihua Zhu,
Shuaike Zhang,
Mingzhu Xu,
Qinghai Zheng,
Yupeng Hu
Abstract:
With the development of internet of things technologies, tremendous sensor audio data has been produced, which poses great challenges to audio-based event detection in smart cities. In this paper, we target a challenging audio-based event detection task, namely, text-to-audio grounding. In addition to precisely localizing all of the desired on- and off-sets in the untrimmed audio, this challenging new task requires extensive acoustic and linguistic comprehension as well as the reasoning for the crossmodal matching relations between the audio and query. The current approaches often treat the query as an entire one through a global query representation in order to address those issues. We contend that this strategy has several drawbacks. Firstly, the interactions between the query and the audio are not fully utilized. Secondly, it has not distinguished the importance of different keywords in a query. In addition, since the audio clips are of arbitrary lengths, there exist many segments which are irrelevant to the query but have not been filtered out in the approach. This further hinders the effective grounding of desired segments. Motivated by the above concerns, a novel Cross-modal Graph Interaction (CGI) model is proposed to comprehensively model the relations between the words in a query through a novel language graph. To capture the fine-grained relevances between the audio and query, a cross-modal attention module is introduced to generate snippet-specific query representations and automatically assign higher weights to keywords with more important semantics. Furthermore, we develop a cross-gating module for the audio and query to weaken irrelevant parts and emphasize the important ones.
Submitted 23 December, 2023; v1 submitted 26 June, 2021;
originally announced June 2021.
-
Horizontal Position Reconstruction in PandaX-II
Authors:
Dan Zhang,
Andi Tan,
Abdusalam Abdukerim,
Wei Chen,
Xun Chen,
Yunhua Chen,
Chen Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Changbo Fu,
Mengting Fu,
Lisheng Geng,
Karl Giboni,
Linhui Gu,
Xuyuan Guo,
Ke Han,
Changda He,
Shengming He,
Di Huang,
Yan Huang,
Yanlin Huang,
Zhou Huang,
Xiangdong Ji,
Yonglin Ju
, et al. (47 additional authors not shown)
Abstract:
Dual-phase noble-gas time projection chambers (TPCs) have improved the sensitivities for dark matter direct search in past decades. The capability of TPCs to reconstruct 3-D vertexes of keV scale recoils is one of the most advantageous features. In this work, we develop two horizontal position reconstruction algorithms for the PandaX-II dark matter search experiment using the dual-phase liquid xenon TPC. Both algorithms are optimized by the $^{83m}$Kr calibration events and use photon distribution of ionization signals among photomultiplier tubes to infer the positions. According to the events coming from the gate electrode, the uncertainties in the horizontal positions are 3.4 mm (3.9 mm) in the analytical (simulation-based) algorithm for an ionization signal with several thousand photoelectrons in the center of the TPC.
Submitted 7 October, 2021; v1 submitted 15 June, 2021;
originally announced June 2021.
-
TDGIA:Effective Injection Attacks on Graph Neural Networks
Authors:
Xu Zou,
Qinkai Zheng,
Yuxiao Dong,
Xinyu Guan,
Evgeny Kharlamov,
Jialiang Lu,
Jie Tang
Abstract:
Graph Neural Networks (GNNs) have achieved promising performance in various real-world applications. However, recent studies have shown that GNNs are vulnerable to adversarial attacks. In this paper, we study a recently-introduced realistic attack scenario on graphs -- graph injection attack (GIA). In the GIA scenario, the adversary is not able to modify the existing link structure and node attributes of the input graph, instead the attack is performed by injecting adversarial nodes into it. We present an analysis on the topological vulnerability of GNNs under GIA setting, based on which we propose the Topological Defective Graph Injection Attack (TDGIA) for effective injection attacks. TDGIA first introduces the topological defective edge selection strategy to choose the original nodes for connecting with the injected ones. It then designs the smooth feature optimization objective to generate the features for the injected nodes. Extensive experiments on large-scale datasets show that TDGIA can consistently and significantly outperform various attack baselines in attacking dozens of defense GNN models. Notably, the performance drop on target GNNs resultant from TDGIA is more than double the damage brought by the best attack solution among hundreds of submissions on KDD-CUP 2020.
Submitted 9 November, 2021; v1 submitted 11 June, 2021;
originally announced June 2021.
-
High Entropy Oxide Relaxor Ferroelectrics
Authors:
Yogesh Sharma,
Min-Cheol Lee,
Krishna C. Pitike,
Karuna K. Mishra,
Qiang Zheng,
Xiang Gao,
Brianna L. Musico,
Alessandro R. Mazza,
Ram S. Katiyar,
Veerle Keppens,
Matthew Brahlek,
Dmitry A. Yarotski,
Rohit P. Prasankumar,
Aiping Chen,
Valentino R. Cooper,
T. Zac Ward
Abstract:
Relaxor ferroelectrics are important in technological applications due to a strong electromechanical response, energy storage capacity, electrocaloric effect, and pyroelectric energy conversion properties. Current efforts to discover and design new materials in this class generally rely on substitutional doping of known ferroelectrics, as slight changes to local compositional order can significantly affect the Curie temperature, morphotropic phase boundary, and electromechanical responses. In this work, we demonstrate that moving to the strong limit of compositional complexity in an ABO3 perovskite allows stabilization of novel relaxor responses that do not rely on a single narrow phase transition region. Entropy-assisted synthesis approaches are used to create single crystal Ba(Ti0.2Sn0.2Zr0.2Hf0.2Nb0.2)O3 [Ba(5B)O] films. The high levels of configurational disorder present in this system are found to influence dielectric relaxation, phase transitions, nano-polar domain formation, and Curie temperature. Temperature-dependent dielectric, Raman spectroscopy and second-harmonic generation measurements reveal multiple phase transitions, a high Curie temperature of 570 K, and the relaxor ferroelectric nature of Ba(5B)O films. First-principles calculations are used to predict possible combinations of cations and to quantify the relative feasibility of formation of highly disordered single-phase perovskite systems. The ability to stabilize single-phase perovskites with such a large number of different cations on the B-sites offers new possibilities for designing high-performance materials for piezoelectric, pyroelectric and tunable dielectric applications.
Submitted 1 June, 2021;
originally announced June 2021.
-
Classifying States of Cooking Objects Using Convolutional Neural Network
Authors:
Qi Zheng
Abstract:
An automated cooking machine is a goal for the future. The main aim is to make the cooking process easier and safer, and to improve human welfare. To allow robots to accurately perform cooking activities, it is important for them to understand the cooking environment and recognize the objects, especially by correctly identifying the state of the cooking objects. This will significantly improve the correctness with which cooking recipes are followed. In this project, several experiments were conducted to design, from scratch, a robust deep convolutional neural network for classifying the state of cooking objects. The model is evaluated using various techniques, such as adjusting architecture layers, tuning key hyperparameters, and using different optimization techniques to maximize the accuracy of state classification.
Submitted 30 April, 2021;
originally announced May 2021.
-
A new MWA limit on the 21 cm Power Spectrum at Redshifts $\sim$ 13 $-$ 17
Authors:
S. Yoshiura,
B. Pindor,
J. L. B. Line,
N. Barry,
C. M. Trott,
A. Beardsley,
J. Bowman,
R. Byrne,
A. Chokshi,
B. J. Hazelton,
K. Hasegawa,
E. Howard,
B. Greig,
D. Jacobs,
C. H. Jordan,
R. Joseph,
M. Kolopanis,
C. Lynch,
B. McKinley,
D. A. Mitchell,
M. F. Morales,
S. G. Murray,
J. C. Pober,
M. Rahimi,
K. Takahashi
, et al. (7 additional authors not shown)
Abstract:
Observations in the lowest MWA band between $75-100$ MHz have the potential to constrain the distribution of neutral hydrogen in the intergalactic medium at redshift $\sim 13-17$. Using 15 hours of MWA data, we analyse systematics in this band such as radio-frequency interference (RFI), ionospheric and wide field effects. By updating the position of point sources, we mitigate the direction independent calibration error due to ionospheric offsets. Our calibration strategy is optimized for the lowest frequency bands by reducing the number of direction dependent calibrators and taking into account radio sources within a wider field of view. We remove data polluted by systematics based on the RFI occupancy and ionospheric conditions, finally selecting 5.5 hours of the cleanest data. Using these data, we obtain two sigma upper limits on the 21 cm power spectrum in the range of $0.1\lessapprox k \lessapprox 1 ~\rm ~h~Mpc^{-1}$ and at $z$=14.2, 15.2 and 16.5, with the lowest limit being $6.3\times 10^6 ~\rm mK^2$ at $\rm k=0.14 \rm ~h~Mpc^{-1}$ and at $z=15.2$ with a possibility of a few % of signal loss due to direction independent calibration.
Submitted 26 May, 2021;
originally announced May 2021.
-
Pouring Dynamics Estimation Using Gated Recurrent Units
Authors:
Qi Zheng
Abstract:
One of the most commonly performed manipulations in a human's daily life is pouring. Many factors have an effect on target accuracy, including pouring velocity, rotation angle, and the geometry of the source and receiving containers. This paper presents an approach to increase the repeatability and accuracy of a robotic manipulator by estimating the change in the amount of water in the pouring cup over a sequence of pouring actions using multiple layers of a deep recurrent neural network, specifically gated recurrent units (GRU). The proposed GRU model achieved a validation mean squared error as low as 1e-4 (lbf) for the predicted value of weight f(t). This paper contains a comprehensive evaluation and analysis of numerous experiments with various designs of recurrent neural networks and hyperparameter fine-tuning.
Submitted 7 May, 2021;
originally announced May 2021.
-
External Calibrator in Global Signal Experiment for Detection of the Epoch of Reionization
Authors:
Yan Huang,
Xiang-Ping Wu,
Quan Guo,
Qian Zheng,
Biying Li,
Huanyuan Shan,
Kejia Lee,
Haiguang Xu
Abstract:
We present a conceptual design study of external calibrators in the 21 cm experiment towards detecting the globally averaged radiation of the epoch of reionization (EoR). Employing an external calibrator instead of the internal calibrator commonly used in current EoR experiments allows instrumental effects such as beam pattern, receiver gain and instability of the system to be removed if the conventional three-position switch measurements are implemented in a short time interval. Furthermore, in the new design the antenna system is placed in an underground anechoic chamber with an opening/closing ceiling to maximally reduce environmental effects such as RFI and ground radiation/reflection. It appears that three of the four external calibrators proposed in this paper, including two indoor artificial transmitters and one outdoor celestial radiation (the Galactic polarization), fail to meet our purpose. Diurnal motion of the Galactic diffuse emission turns out to be the most viable source as an external calibrator, for which we have discussed the observational strategy and the algorithm for extracting the EoR signal.
Submitted 20 May, 2021;
originally announced May 2021.
-
GIPA: General Information Propagation Algorithm for Graph Learning
Authors:
Qinkai Zheng,
Houyi Li,
Peng Zhang,
Zhixiong Yang,
Guowei Zhang,
Xintan Zeng,
Yongchao Liu
Abstract:
Graph neural networks (GNNs) have been popularly used in analyzing graph-structured data, showing promising results in various applications such as node classification, link prediction and network recommendation. In this paper, we present a new graph attention neural network, namely GIPA, for attributed graph data learning. GIPA consists of three key components: attention, feature propagation and aggregation. Specifically, the attention component introduces a new multi-layer perceptron based multi-head to generate better non-linear feature mapping and representation than conventional implementations such as dot-product. The propagation component considers not only node features but also edge features, which differs from existing GNNs that merely consider node features. The aggregation component uses a residual connection to generate the final embedding. We evaluate the performance of GIPA using the Open Graph Benchmark proteins (ogbn-proteins for short) dataset. The experimental results reveal that GIPA can beat the state-of-the-art models in terms of prediction accuracy, e.g., GIPA achieves an average test ROC-AUC of $0.8700\pm 0.0010$ and outperforms all the previous methods listed in the ogbn-proteins leaderboard.
Submitted 10 August, 2021; v1 submitted 12 May, 2021;
originally announced May 2021.
-
Constraining self-interacting dark matter with the full dataset of PandaX-II
Authors:
Jijun Yang,
Abdusalam Abdukerim,
Wei Chen,
Xun Chen,
Yunhua Chen,
Chen Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Changbo Fu,
Mengting Fu,
Lisheng Geng,
Karl Giboni,
Linhui Gu,
Xuyuan Guo,
Ke Han,
Changda He,
Shengming He,
Di Huang,
Yan Huang,
Ran Huo,
Yanlin Huang,
Zhou Huang,
Xiangdong Ji,
Yonglin Ju
, et al. (47 additional authors not shown)
Abstract:
Self-interacting Dark Matter (SIDM) is a leading candidate proposed to solve discrepancies between predictions of the prevailing cold dark matter theory and observations of galaxies. Many SIDM models predict the existence of a light force carrier that mediates strong dark matter self-interactions. If the mediator couples to the standard model particles, it could produce characteristic signals in dark matter direct detection experiments. We report searches for SIDM models with a light mediator using the full dataset of the PandaX-II experiment, based on a total exposure of 132 tonne-days. No significant excess over background is found, and our likelihood analysis leads to a strong upper limit on the dark matter-nucleon coupling strength. We further combine the PandaX-II constraints and those from observations of the light element abundances in the early universe, and show that direct detection and cosmological probes can provide complementary constraints on dark matter models with a light mediator.
Submitted 24 October, 2021; v1 submitted 29 April, 2021;
originally announced April 2021.