-
Unsupervised Representation Learning for Time Series: A Review
Authors:
Qianwen Meng,
Hangwei Qian,
Yong Liu,
Yonghui Xu,
Zhiqi Shen,
Lizhen Cui
Abstract:
Unsupervised representation learning approaches aim to learn discriminative feature representations from unlabeled data, without the requirement of annotating every sample. Enabling unsupervised representation learning is extremely crucial for time series data, due to its unique annotation bottleneck caused by its complex characteristics and lack of visual cues compared with other data modalities.…
▽ More
Unsupervised representation learning approaches aim to learn discriminative feature representations from unlabeled data, without the requirement of annotating every sample. Enabling unsupervised representation learning is extremely crucial for time series data, due to its unique annotation bottleneck caused by its complex characteristics and lack of visual cues compared with other data modalities. In recent years, unsupervised representation learning techniques have advanced rapidly in various domains. However, there is a lack of systematic analysis of unsupervised representation learning approaches for time series. To fill the gap, we conduct a comprehensive literature review of existing rapidly evolving unsupervised representation learning approaches for time series. Moreover, we also develop a unified and standardized library, named ULTS (i.e., Unsupervised Learning for Time Series), to facilitate fast implementations and unified evaluations on various models. With ULTS, we empirically evaluate state-of-the-art approaches, especially the rapidly evolving contrastive learning methods, on 9 diverse real-world datasets. We further discuss practical considerations as well as open research challenges on unsupervised representation learning for time series to facilitate future research in this field.
△ Less
Submitted 3 August, 2023;
originally announced August 2023.
-
Multi-robot Path Planning with Rapidly-exploring Random Disjointed-Trees
Authors:
Biru Zhang,
Jiankun Wang,
Max Q. -H. Meng
Abstract:
Multi-robot path planning is a computational process involving finding paths for each robot from its start to the goal while ensuring collision-free operation. It is widely used in robots and autonomous driving. However, the computational time of multi-robot path planning algorithms is enormous, resulting in low efficiency in practical applications. To address this problem, this article proposes a…
▽ More
Multi-robot path planning is a computational process involving finding paths for each robot from its start to the goal while ensuring collision-free operation. It is widely used in robots and autonomous driving. However, the computational time of multi-robot path planning algorithms is enormous, resulting in low efficiency in practical applications. To address this problem, this article proposes a novel multi-robot path planning algorithm (Multi-Agent Rapidly-exploring Random Disjointed-Trees*, MA-RRdT*) based on multi-tree random sampling. The proposed algorithm is based on a single-robot path planning algorithm (Rapidly-exploring Random disjointed-Trees*, RRdT*). The novel MA-RRdT* algorithm has the advantages of fast speed, high space exploration efficiency, and suitability for complex maps. Comparative experiments are completed to evaluate the effectiveness of MA-RRdT*. The final experimental results validate the superior performance of the MA-RRdT* algorithm in terms of time cost and space exploration efficiency.
△ Less
Submitted 3 August, 2023;
originally announced August 2023.
-
Virtual Reality Based Robot Teleoperation via Human-Scene Interaction
Authors:
Lingxiao Meng,
Jiangshan Liu,
Wei Chai,
Jiankun Wang,
Max Q. -H. Meng
Abstract:
Robot teleoperation gains great success in various situations, including chemical pollution rescue, disaster relief, and long-distance manipulation. In this article, we propose a virtual reality (VR) based robot teleoperation system to achieve more efficient and natural interaction with humans in different scenes. A user-friendly VR interface is designed to help users interact with a desktop scene…
▽ More
Robot teleoperation gains great success in various situations, including chemical pollution rescue, disaster relief, and long-distance manipulation. In this article, we propose a virtual reality (VR) based robot teleoperation system to achieve more efficient and natural interaction with humans in different scenes. A user-friendly VR interface is designed to help users interact with a desktop scene using their hands efficiently and intuitively. To improve user experience and reduce workload, we simulate the process in the physics engine to help build a preview of the scene after manipulation in the virtual scene before execution. We conduct experiments with different users and compare our system with a direct control method across several teleoperation tasks. The user study demonstrates that the proposed system enables users to perform operations more instinctively with a lighter mental workload. Users can perform pick-and-place and object-stacking tasks in a considerably short time, even for beginners. Our code is available at https://github.com/lingxiaomeng/VR_Teleoperation_Gen3.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
Short-range correlations and momentum distributions in mirror nuclei 3H and 3He
Authors:
Qi Meng,
Ziyang Lu,
Chang Xu
Abstract:
Motivated by recent high-energy electron and $\rm ^3H$ and $\rm ^3He$ nuclei scattering experiment in Jefferson Lab (Nature 609, 41 (2022)), the short-range correlations (SRCs) between nucleon pairs for 3-nucleon systems are microscopically studied using realistic $NN$ 2-body interaction and two-Gaussian type $NNN$ 3-body interaction. The wave functions of both $\rm ^3H$ and $\rm ^3He$ are obtaine…
▽ More
Motivated by recent high-energy electron and $\rm ^3H$ and $\rm ^3He$ nuclei scattering experiment in Jefferson Lab (Nature 609, 41 (2022)), the short-range correlations (SRCs) between nucleon pairs for 3-nucleon systems are microscopically studied using realistic $NN$ 2-body interaction and two-Gaussian type $NNN$ 3-body interaction. The wave functions of both $\rm ^3H$ and $\rm ^3He$ are obtained by solving 3-body Schrödinger equations using Gaussian expansion method (GEM). The differences of one-nucleon and nucleon-nucleon momentum distributions between $\rm ^3H$ and $\rm ^3He$ are analyzed in detail. The results show that the percentages of $pn$-SRC pairs are significantly enhanced as compared with those of $nn(pp)$-SRC ones in $\rm ^3H$ and $\rm ^3He$ nuclei, which is consistent with the experimental findings.
△ Less
Submitted 26 July, 2023;
originally announced July 2023.
-
Ni-O-Ag catalyst enables 103-m$^2$ artificial photosynthesis with >16% solar-to-chemical energy conversion efficiency
Authors:
Yaguang Li,
Fanqi Meng,
Qixuan Wu,
Dachao Yuan,
Haixiao Wang,
Bang Liu,
Junwei Wang,
Xingyuan San,
Lin Gu,
Shufang Wang,
Qingbo Meng
Abstract:
Herein, NiO nanosheets supported with Ag single atoms are synthesized for photothermal CO2 hydrogenation to achieve 1065 mmol g$^{-1}$ h$^{-1}$ of CO production rate under 1 sun irradiation, revealing the unparalleled weak sunlight driven reverse water-gas shift reaction (RWGS) activity. This performance is attributed to the coupling effect of Ag-O-Ni sites to enhance the hydrogenation of CO$_2$ a…
▽ More
Herein, NiO nanosheets supported with Ag single atoms are synthesized for photothermal CO2 hydrogenation to achieve 1065 mmol g$^{-1}$ h$^{-1}$ of CO production rate under 1 sun irradiation, revealing the unparalleled weak sunlight driven reverse water-gas shift reaction (RWGS) activity. This performance is attributed to the coupling effect of Ag-O-Ni sites to enhance the hydrogenation of CO$_2$ and weaken the CO adsorption, resulting in 1434 mmol g$^{-1}$ h$^{-1}$ of CO yield at 300$^\circ$ C, surpassing any low-temperature RWGS performances ever reported. Building on this, we integrated the 2D Ni$_1$Ag$_{0.02}$O$_1$ supported photothermal RWGS with commercial photovoltaic electrolytic water splitting, leading to the realization of 103 m$^2$ scale artificial photosynthesis system (CO$_2$+H$_2$$\to$CO+H$_2$O) with a daily CO yield of 18.70 m$^3$, a photochemical energy conversion efficiency of >16%, over 90% H$_2$ ultilization efficiency, outperforming other types of artificial photosynthesis. The results of this research chart a promising course for designing practical, natural sunlight-driven artificial photosynthesis systems and highly efficient platinum-free CO$_2$ hydrogenation catalysts. This work is a significant step towards harnessing solar energy more efficiently and sustainably, opening exciting possibilities for future research and development in this area.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
Multinary Alloying Suppresses Defect Formation in Emerging Inorganic Solar Cells
Authors:
Jiangjian Shi,
**lin Wang,
Fanqi Meng,
Jiazheng Zhou,
Xiao Xu,
Kang Yin,
Licheng Lou,
Menghan Jiao,
Bowen Zhang,
Huijue Wu,
Yanhong Luo,
Dongmei Li,
Qingbo Meng
Abstract:
The Cu2ZnSn(S, Se)4 (CZTSSe) emerging inorganic solar cell is highly promising for accelerating the large-scale and low-cost applications of thin-film photovoltaics. It possesses distinct advantages such as abundant and non-toxic constituent elements, high material stability, and excellent compatibility with industrial processes. However, CZTSSe solar cells still face challenges related to complex…
▽ More
The Cu2ZnSn(S, Se)4 (CZTSSe) emerging inorganic solar cell is highly promising for accelerating the large-scale and low-cost applications of thin-film photovoltaics. It possesses distinct advantages such as abundant and non-toxic constituent elements, high material stability, and excellent compatibility with industrial processes. However, CZTSSe solar cells still face challenges related to complex defects and charge losses. To overcome these limitations and improve the efficiency of CZTSSe solar cells, it is crucial to experimentally identify and mitigate deep defects. In this study, we reveal that the dominant deep defect in CZTSSe materials exhibits donor characteristics. We propose that incomplete cation exchange during the multi-step crystallization reactions of CZTSSe is the kinetics mechanism responsible for the defect formation. To address this issue, we introduce an elemental synergistic alloying approach aimed at weakening the metal-chalcogen bond strength and the stability of intermediate phases. This alloying strategy has facilitated the kinetics of cation exchange, leading to a significant reduction in charge losses within the CZTSSe absorber. As a result, we have achieved a cell efficiency of over 14.5%. These results represent a significant advancement for emerging inorganic solar cells and additionally bring more opportunities for the precise identification and regulation of defects in a wider range of multinary inorganic compounds.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Power-law Dynamic arising from machine learning
Authors:
Wei Chen,
Weitao Du,
Zhi-Ming Ma,
Qi Meng
Abstract:
We study a kind of new SDE that was arisen from the research on optimization in machine learning, we call it power-law dynamic because its stationary distribution cannot have sub-Gaussian tail and obeys power-law. We prove that the power-law dynamic is ergodic with unique stationary distribution, provided the learning rate is small enough. We investigate its first exist time. In particular, we com…
▽ More
We study a kind of new SDE that was arisen from the research on optimization in machine learning, we call it power-law dynamic because its stationary distribution cannot have sub-Gaussian tail and obeys power-law. We prove that the power-law dynamic is ergodic with unique stationary distribution, provided the learning rate is small enough. We investigate its first exist time. In particular, we compare the exit times of the (continuous) power-law dynamic and its discretization. The comparison can help guide machine learning algorithm.
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition
Authors:
Ziao Li,
Junyi Wang,
Bangli Liu,
Haibin Cai,
Mohamad Saada,
Qinggang Meng
Abstract:
Recently, the significant achievements have been made in skeleton-based human action recognition with the emergence of graph convolutional networks (GCNs). However, the state-of-the-art (SOTA) models used for this task focus on constructing more complex higher-order connections between joint nodes to describe skeleton information, which leads to complex inference processes and high computational c…
▽ More
Recently, the significant achievements have been made in skeleton-based human action recognition with the emergence of graph convolutional networks (GCNs). However, the state-of-the-art (SOTA) models used for this task focus on constructing more complex higher-order connections between joint nodes to describe skeleton information, which leads to complex inference processes and high computational costs. To address the slow inference speed caused by overly complex model structures, we introduce re-parameterization and over-parameterization techniques to GCNs and propose two novel high-performance inference GCNs, namely HPI-GCN-RP and HPI-GCN-OP. After the completion of model training, model parameters are fixed. HPI-GCN-RP adopts re-parameterization technique to transform high-performance training model into fast inference model through linear transformations, which achieves a higher inference speed with competitive model performance. HPI-GCN-OP further utilizes over-parameterization technique to achieve higher performance improvement by introducing additional inference parameters, albeit with slightly decreased inference speed. The experimental results on the two skeleton-based action recognition datasets demonstrate the effectiveness of our approach. Our HPI-GCN-OP achieves performance comparable to the current SOTA models, with inference speeds five times faster. Specifically, our HPI-GCN-OP achieves an accuracy of 93\% on the cross-subject split of the NTU-RGB+D 60 dataset, and 90.1\% on the cross-subject benchmark of the NTU-RGB+D 120 dataset. Code is available at github.com/lizaowo/HPI-GCN.
△ Less
Submitted 18 June, 2024; v1 submitted 29 May, 2023;
originally announced May 2023.
-
Complex-valued neural operator assisted soliton identification
Authors:
Ming Zhang,
Qi Meng,
Deng Zhang,
Yue Wang,
Guanghui Wang,
Zhiming Ma,
Li Chen,
Tie-Yan Liu
Abstract:
The numerical determination of solitary states is an important topic for such research areas as Bose-Einstein condensates, nonlinear optics, plasma physics, etc. In this paper, we propose a data-driven approach for identifying solitons based on dynamical solutions of real-time differential equations. Our approach combines a machine-learning architecture called the complex-valued neural operator (C…
▽ More
The numerical determination of solitary states is an important topic for such research areas as Bose-Einstein condensates, nonlinear optics, plasma physics, etc. In this paper, we propose a data-driven approach for identifying solitons based on dynamical solutions of real-time differential equations. Our approach combines a machine-learning architecture called the complex-valued neural operator (CNO) with an energy-restricted gradient optimization. The former serves as a generalization of the traditional neural operator to the complex domain, and constructs a smooth map** between the initial and final states; the latter facilitates the search for solitons by constraining the energy space. We concretely demonstrate this approach on the quasi-one-dimensional Bose-Einstein condensate with homogeneous and inhomogeneous nonlinearities. Our work offers a new idea for data-driven effective modeling and studies of solitary waves in nonlinear physical systems.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
Deep Reinforcement Learning-Based Control for Stomach Coverage Scanning of Wireless Capsule Endoscopy
Authors:
Yameng Zhang,
Long Bai,
Li Liu,
Hongliang Ren,
Max Q. -H. Meng
Abstract:
Due to its non-invasive and painless characteristics, wireless capsule endoscopy has become the new gold standard for assessing gastrointestinal disorders. Omissions, however, could occur throughout the examination since controlling capsule endoscope can be challenging. In this work, we control the magnetic capsule endoscope for the coverage scanning task in the stomach based on reinforcement lear…
▽ More
Due to its non-invasive and painless characteristics, wireless capsule endoscopy has become the new gold standard for assessing gastrointestinal disorders. Omissions, however, could occur throughout the examination since controlling capsule endoscope can be challenging. In this work, we control the magnetic capsule endoscope for the coverage scanning task in the stomach based on reinforcement learning so that the capsule can comprehensively scan every corner of the stomach. We apply a well-made virtual platform named VR-Caps to simulate the process of stomach coverage scanning with a capsule endoscope model. We utilize and compare two deep reinforcement learning algorithms, the Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC) algorithms, to train the permanent magnetic agent, which actuates the capsule endoscope directly via magnetic fields and then optimizes the scanning efficiency of stomach coverage. We analyze the pros and cons of the two algorithms with different hyperparameters and achieve a coverage rate of 98.04% of the stomach area within 150.37 seconds.
△ Less
Submitted 18 May, 2023;
originally announced May 2023.
-
Style Transfer Enabled Sim2Real Framework for Efficient Learning of Robotic Ultrasound Image Analysis Using Simulated Data
Authors:
Keyu Li,
Xinyu Mao,
Chengwei Ye,
Ang Li,
Yangxin Xu,
Max Q. -H. Meng
Abstract:
Robotic ultrasound (US) systems have shown great potential to make US examinations easier and more accurate. Recently, various machine learning techniques have been proposed to realize automatic US image interpretation for robotic US acquisition tasks. However, obtaining large amounts of real US imaging data for training is usually expensive or even unfeasible in some clinical applications. An alt…
▽ More
Robotic ultrasound (US) systems have shown great potential to make US examinations easier and more accurate. Recently, various machine learning techniques have been proposed to realize automatic US image interpretation for robotic US acquisition tasks. However, obtaining large amounts of real US imaging data for training is usually expensive or even unfeasible in some clinical applications. An alternative is to build a simulator to generate synthetic US data for training, but the differences between simulated and real US images may result in poor model performance. This work presents a Sim2Real framework to efficiently learn robotic US image analysis tasks based only on simulated data for real-world deployment. A style transfer module is proposed based on unsupervised contrastive learning and used as a preprocessing step to convert the real US images into the simulation style. Thereafter, a task-relevant model is designed to combine CNNs with vision transformers to generate the task-dependent prediction with improved generalization ability. We demonstrate the effectiveness of our method in an image regression task to predict the probe position based on US images in robotic transesophageal echocardiography (TEE). Our results show that using only simulated US data and a small amount of unlabelled real data for training, our method can achieve comparable performance to semi-supervised and fully supervised learning methods. Moreover, the effectiveness of our previously proposed CT-based US image simulation method is also indirectly confirmed.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Heterojunction interface regulation to realize high-performance flexible Kesterite solar cells
Authors:
Xiao Xu,
Jiazheng Zhou,
Kang Yin,
**lin Wang,
Licheng Lou,
Dongmei Li,
Jiangjian Shi,
Huijue Wu,
Yanhong Luo,
Qingbo Meng
Abstract:
Flexible Cu2ZnSn(S, Se)4 (CZTSSe) solar cells take the advantages of environmental friendliness, low cost, and multi-scenario applications, and have drawn extensive attention in recent years. Compared with rigid devices, the lack of alkali metal elements in the flexible substrate is the main factor limiting the performance of flexible CZTSSe solar cells. This work proposes a Rb ion additive strate…
▽ More
Flexible Cu2ZnSn(S, Se)4 (CZTSSe) solar cells take the advantages of environmental friendliness, low cost, and multi-scenario applications, and have drawn extensive attention in recent years. Compared with rigid devices, the lack of alkali metal elements in the flexible substrate is the main factor limiting the performance of flexible CZTSSe solar cells. This work proposes a Rb ion additive strategy to simultaneously regulate the CZTSSe film surface properties and the CdS chemical bath deposition (CBD) processes. Material and chemical characterization reveals that Rb ions can passivate the detrimental Se0 cluster defect and additionally provide a more active surface for the CdS epitaxial growth. Furthermore, Rb can also coordinate with thiourea (TU) in the CBD solution and improve the ion-by-ion deposition of the CdS layer. Finally, the flexible CZTSSe cell fabricated by this strategy has reached a high total-area efficiency of 12.63% (active-area efficiency of 13.2%), with its VOC and FF reaching 538 mV and 0.70, respectively. This work enriches the alkali metal passivation strategies and provides new ideas for further improving flexible CZTSSe solar cells in the future.
△ Less
Submitted 10 May, 2023;
originally announced May 2023.
-
SFC: Near-Source Congestion Signaling and Flow Control
Authors:
Yanfang Le,
Jeongkeun Lee,
Jeremias Blendin,
Jiayi Chen,
Georgios Nikolaidis,
Rong Pan,
Robert Soule,
Aditya Akella,
Pedro Yebenes Segura,
Arjun singhvi,
Yuliang Li,
Qingkai Meng,
Changhoon Kim,
Serhat Arslan
Abstract:
State-of-the-art congestion control algorithms for data centers alone do not cope well with transient congestion and high traffic bursts. To help with these, we revisit the concept of direct \emph{backward} feedback from switches and propose Back-to-Sender (BTS) signaling to many concurrent incast senders. Combining it with our novel approach to in-network caching, we achieve near-source sub-RTT c…
▽ More
State-of-the-art congestion control algorithms for data centers alone do not cope well with transient congestion and high traffic bursts. To help with these, we revisit the concept of direct \emph{backward} feedback from switches and propose Back-to-Sender (BTS) signaling to many concurrent incast senders. Combining it with our novel approach to in-network caching, we achieve near-source sub-RTT congestion signaling. Source Flow Control (SFC) combines these two simple signaling mechanisms to instantly pause traffic sources, hence avoiding the head-of-line blocking problem of conventional hop-by-hop flow control. Our prototype system and scale simulations demonstrate that near-source signaling can significantly reduce the message completion time of various workloads in the presence of incast, complementing existing congestion control algorithms. Our results show that SFC can reduce the $99^{th}$-percentile flow completion times by $1.2-6\times$ and the peak switch buffer usage by $2-3\times$ compared to the recent incast solutions.
△ Less
Submitted 30 April, 2023;
originally announced May 2023.
-
Direct Visual Servoing Based on Discrete Orthogonal Moments
Authors:
Yuhan Chen,
Max Q. -H. Meng,
Li Liu
Abstract:
This paper proposes a new approach to achieve direct visual servoing (DVS) based on discrete orthogonal moments (DOMs). DVS is performed in such a way that the extraction of geometric primitives, matching, and tracking steps in the conventional feature-based visual servoing pipeline can be bypassed. Although DVS enables highly precise positioning, it suffers from a limited convergence domain and p…
▽ More
This paper proposes a new approach to achieve direct visual servoing (DVS) based on discrete orthogonal moments (DOMs). DVS is performed in such a way that the extraction of geometric primitives, matching, and tracking steps in the conventional feature-based visual servoing pipeline can be bypassed. Although DVS enables highly precise positioning, it suffers from a limited convergence domain and poor robustness due to the extreme nonlinearity of the cost function to be minimized and the presence of redundant data between visual features. To tackle these issues, we propose a generic and augmented framework that considers DOMs as visual features. By using the Tchebichef, Krawtchouk, and Hahn moments as examples, we not only present the strategies for adaptively tuning the parameters and order of the visual features but also exhibit an analytical formulation of the associated interaction matrix. Simulations demonstrate the robustness and accuracy of our approach, as well as its advantages over the state-of-the-art. Real-world experiments have also been performed to validate the effectiveness of our approach.
△ Less
Submitted 10 November, 2023; v1 submitted 27 April, 2023;
originally announced April 2023.
-
Quasi-two-body decays $B_c\to D^*h\to Dπh$ in the perturbative QCD
Authors:
Yan-Chao Zhao,
Zhi-Qing Zhang,
Zi-Yu Zhang,
Zhi-Jie Sun,
Qiu-Bo Meng
Abstract:
In this work, we investigate the quasi-two-body decays $B_c\to D^*h\to Dπh$ with $h = (K^0,π^0,η,η^{\prime})$ using the perturbative QCD(PQCD) approach. The description of final state interactions between the $Dπ$ pair is achieved through the two-meson distribution amplitudes(DAs), which are normalized to the time-like form factor. The PQCD predictions on the branching ratios of the quasi-two-body…
▽ More
In this work, we investigate the quasi-two-body decays $B_c\to D^*h\to Dπh$ with $h = (K^0,π^0,η,η^{\prime})$ using the perturbative QCD(PQCD) approach. The description of final state interactions between the $Dπ$ pair is achieved through the two-meson distribution amplitudes(DAs), which are normalized to the time-like form factor. The PQCD predictions on the branching ratios of the quasi-two-body decays $B_c\to D^*h\to Dπh$ show an obvious hierarchy: $Br(B_{c}^+ \to D^{*+} K^{0}\to D^0π^+K^{0})=({5.22}_{-0.74}^{+0.86})\times{10}^{-6}, Br(B_{c}^+ \to D^{*+} π^{0}\to D^0π^+π^{0})=(0.93\pm0.26)\times{10}^{-7}, Br(B_{c}^+ \to D^{*+} η\to D^0π^+η) =({2.83}_{-0.52}^{+0.59})\times{10}^{-8}$ and $Br(B_{c}^+ \to D^{*+} η^\prime\to D^0π^+η^\prime)=({1.89}_{-0.36}^{+0.40})\times{10}^{-8}$.
From the invariant mass $m_{Dπ}$-dependence of the decay spectrum for each channel, one can find that the branching fraction is concentrated in a narrow region around the $D^{*}$ pole mass. So one can obtain the branching ratios for the corresponding two-body decays $B_c\to D^{*+}h$ under the narrow width approximation. We find that the branching ratios of the decays $B_c\to D^{*+}h$ are consistent well with the previous PQCD calculations within errors. These predictions will be tested by the future experiments.
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
Curricular Object Manipulation in LiDAR-based Object Detection
Authors:
Ziyue Zhu,
Qiang Meng,
Xiao Wang,
Ke Wang,
Liujiang Yan,
Jian Yang
Abstract:
This paper explores the potential of curriculum learning in LiDAR-based 3D object detection by proposing a curricular object manipulation (COM) framework. The framework embeds the curricular training strategy into both the loss design and the augmentation process. For the loss design, we propose the COMLoss to dynamically predict object-level difficulties and emphasize objects of different difficu…
▽ More
This paper explores the potential of curriculum learning in LiDAR-based 3D object detection by proposing a curricular object manipulation (COM) framework. The framework embeds the curricular training strategy into both the loss design and the augmentation process. For the loss design, we propose the COMLoss to dynamically predict object-level difficulties and emphasize objects of different difficulties based on training stages. On top of the widely-used augmentation technique called GT-Aug in LiDAR detection tasks, we propose a novel COMAug strategy which first clusters objects in ground-truth database based on well-designed heuristics. Group-level difficulties rather than individual ones are then predicted and updated during training for stable results. Model performance and generalization capabilities can be improved by sampling and augmenting progressively more difficult objects into the training samples. Extensive experiments and ablation studies reveal the superior and generality of the proposed framework. The code is available at https://github.com/ZZY816/COM.
△ Less
Submitted 9 April, 2023;
originally announced April 2023.
-
Uncertainty-inspired Open Set Learning for Retinal Anomaly Identification
Authors:
Meng Wang,
Tian Lin,
Lianyu Wang,
Aidi Lin,
Ke Zou,
Xinxing Xu,
Yi Zhou,
Yuanyuan Peng,
Qingquan Meng,
Yiming Qian,
Guoyao Deng,
Zhiqun Wu,
Junhong Chen,
Jianhong Lin,
Mingzhi Zhang,
Weifang Zhu,
Changqing Zhang,
Daoqiang Zhang,
Rick Siow Mong Goh,
Yong Liu,
Chi Pui Pang,
Xinjian Chen,
Haoyu Chen,
Huazhu Fu
Abstract:
Failure to recognize samples from the classes unseen during training is a major limitation of artificial intelligence in the real-world implementation for recognition and classification of retinal anomalies. We established an uncertainty-inspired open-set (UIOS) model, which was trained with fundus images of 9 retinal conditions. Besides assessing the probability of each category, UIOS also calcul…
▽ More
Failure to recognize samples from the classes unseen during training is a major limitation of artificial intelligence in the real-world implementation for recognition and classification of retinal anomalies. We established an uncertainty-inspired open-set (UIOS) model, which was trained with fundus images of 9 retinal conditions. Besides assessing the probability of each category, UIOS also calculated an uncertainty score to express its confidence. Our UIOS model with thresholding strategy achieved an F1 score of 99.55%, 97.01% and 91.91% for the internal testing set, external target categories (TC)-JSIEC dataset and TC-unseen testing set, respectively, compared to the F1 score of 92.20%, 80.69% and 64.74% by the standard AI model. Furthermore, UIOS correctly predicted high uncertainty scores, which would prompt the need for a manual check in the datasets of non-target categories retinal diseases, low-quality fundus images, and non-fundus images. UIOS provides a robust method for real-world screening of retinal anomalies.
△ Less
Submitted 29 August, 2023; v1 submitted 8 April, 2023;
originally announced April 2023.
-
Revisiting Context Aggregation for Image Matting
Authors:
Qinglin Liu,
Xiaoqian Lv,
Quanling Meng,
Zonglin Li,
Xiangyuan Lan,
Shuo Yang,
Sheng** Zhang,
Liqiang Nie
Abstract:
Traditional studies emphasize the significance of context information in improving matting performance. Consequently, deep learning-based matting methods delve into designing pooling or affinity-based context aggregation modules to achieve superior results. However, these modules cannot well handle the context scale shift caused by the difference in image size during training and inference, result…
▽ More
Traditional studies emphasize the significance of context information in improving matting performance. Consequently, deep learning-based matting methods delve into designing pooling or affinity-based context aggregation modules to achieve superior results. However, these modules cannot well handle the context scale shift caused by the difference in image size during training and inference, resulting in matting performance degradation. In this paper, we revisit the context aggregation mechanisms of matting networks and find that a basic encoder-decoder network without any context aggregation modules can actually learn more universal context aggregation, thereby achieving higher matting performance compared to existing methods. Building on this insight, we present AEMatter, a matting network that is straightforward yet very effective. AEMatter adopts a Hybrid-Transformer backbone with appearance-enhanced axis-wise learning (AEAL) blocks to build a basic network with strong context aggregation learning capability. Furthermore, AEMatter leverages a large image training strategy to assist the network in learning context aggregation from data. Extensive experiments on five popular matting datasets demonstrate that the proposed AEMatter outperforms state-of-the-art matting methods by a large margin.
△ Less
Submitted 14 May, 2024; v1 submitted 3 April, 2023;
originally announced April 2023.
-
Collaborative Trolley Transportation System with Autonomous Nonholonomic Robots
Authors:
Bingyi Xia,
Hao Luan,
Ziqi Zhao,
Xuheng Gao,
Peijia Xie,
Anxing Xiao,
Jiankun Wang,
Max Q. -H. Meng
Abstract:
Cooperative object transportation using multiple robots has been intensively studied in the control and robotics literature, but most approaches are either only applicable to omnidirectional robots or lack a complete navigation and decision-making framework that operates in real time. This paper presents an autonomous nonholonomic multi-robot system and an end-to-end hierarchical autonomy framewor…
▽ More
Cooperative object transportation using multiple robots has been intensively studied in the control and robotics literature, but most approaches are either only applicable to omnidirectional robots or lack a complete navigation and decision-making framework that operates in real time. This paper presents an autonomous nonholonomic multi-robot system and an end-to-end hierarchical autonomy framework for collaborative luggage trolley transportation. This framework finds kinematic-feasible paths, computes online motion plans, and provides feedback that enables the multi-robot system to handle long lines of luggage trolleys and navigate obstacles and pedestrians while dealing with multiple inherently complex and coupled constraints. We demonstrate the designed collaborative trolley transportation system through practical transportation tasks, and the experiment results reveal their effectiveness and reliability in complex and dynamic environments.
△ Less
Submitted 21 July, 2023; v1 submitted 12 March, 2023;
originally announced March 2023.
-
FabricFolding: Learning Efficient Fabric Folding without Expert Demonstrations
Authors:
Can He,
Lingxiao Meng,
Zhirui Sun,
Jiankun Wang,
Max Q. -H. Meng
Abstract:
Autonomous fabric manipulation is a challenging task due to complex dynamics and potential self-occlusion during fabric handling. An intuitive method of fabric folding manipulation first involves obtaining a smooth and unfolded fabric configuration before the folding process begins. However, the combination of quasi-static actions such as pick & place and dynamic action like fling proves inadequat…
▽ More
Autonomous fabric manipulation is a challenging task due to complex dynamics and potential self-occlusion during fabric handling. An intuitive method of fabric folding manipulation first involves obtaining a smooth and unfolded fabric configuration before the folding process begins. However, the combination of quasi-static actions such as pick & place and dynamic action like fling proves inadequate in effectively unfolding long-sleeved T-shirts with sleeves mostly tucked inside the garment. To address this limitation, this paper introduces an improved quasi-static action called pick & drag, specifically designed to handle this type of fabric configuration. Additionally, an efficient dual-arm manipulation system is designed in this paper, which combines quasi-static (including pick & place and pick & drag) and dynamic fling actions to flexibly manipulate fabrics into unfolded and smooth configurations. Subsequently, keypoints of the fabric are detected, enabling autonomous folding. To address the scarcity of publicly available keypoint detection datasets for real fabric, we gathered images of various fabric configurations and types in real scenes to create a comprehensive keypoint dataset for fabric folding. This dataset aims to enhance the success rate of keypoint detection. Moreover, we evaluate the effectiveness of our proposed system in real-world settings, where it consistently and reliably unfolds and folds various types of fabrics, including challenging situations such as long-sleeved T-shirts with most parts of sleeves tucked inside the garment. Specifically, our method achieves a coverage rate of 0.822 and a success rate of 0.88 for long-sleeved T-shirts folding.
△ Less
Submitted 11 September, 2023; v1 submitted 12 March, 2023;
originally announced March 2023.
-
A Systematic Evaluation of Different Indoor Localization Methods in Robotic Autonomous Luggage Trolley Collection at Airports
Authors:
Zhirui Sun,
Weinan Chen,
Jiankun Wang,
Max Q. -H. Meng
Abstract:
This article addresses the localization problem in robotic autonomous luggage trolley collection at airports and provides a systematic evaluation of different methods to solve it. The robotic autonomous luggage trolley collection is a complex system that involves object detection, localization, motion planning and control, manipulation, etc. Among these components, effective localization is essent…
▽ More
This article addresses the localization problem in robotic autonomous luggage trolley collection at airports and provides a systematic evaluation of different methods to solve it. The robotic autonomous luggage trolley collection is a complex system that involves object detection, localization, motion planning and control, manipulation, etc. Among these components, effective localization is essential for the robot to employ subsequent motion planning and end-effector manipulation because it can provide a correct goal position. In this article, we survey four popular and representative localization methods to achieve object localization in the luggage collection process, including radio frequency identification (RFID), Keypoints, ultrawideband (UWB), and Reflectors. To test their performance, we construct a qualitative evaluation framework with Localization Accuracy, Mobile Power Supplies, Coverage Area, Cost, and Scalability. Besides, we conduct a series of quantitative experiments regarding Localization Accuracy and Success Rate on a real-world robotic autonomous luggage trolley collection system. We further analyze the performance of different localization methods based on experiment results, revealing that the Keypoints method is most suitable for indoor environments to achieve the luggage trolley collection.
△ Less
Submitted 11 March, 2023;
originally announced March 2023.
-
Controlling selenization equilibrium enables high-quality Cu2ZnSn(S, Se)4 absorbers for efficient solar cells
Authors:
Xiao Xu,
Jiazheng Zhou,
Kang Yin,
**lin Wang,
Licheng Lou,
Menghan Jiao,
Bowen Zhang,
Dongmei Li,
Jiangjian Shi,
Huijue Wu,
Yanhong Luo,
Qingbo Meng
Abstract:
Cu2ZnSn(S, Se)4 (CZTSSe) is one of most competitive photovoltaic materials for its earth-abundant reserves, environmental friendliness, and high stability.The quality of CZTSSe absorber determines the power-conversion efficiency (PCE) of CZTSSe solar cells. The absorber's quality lies on post-selenization process, which is the reaction of Cu-Zn-Sn precursor and selenium vapor. And the post-seleniz…
▽ More
Cu2ZnSn(S, Se)4 (CZTSSe) is one of most competitive photovoltaic materials for its earth-abundant reserves, environmental friendliness, and high stability.The quality of CZTSSe absorber determines the power-conversion efficiency (PCE) of CZTSSe solar cells. The absorber's quality lies on post-selenization process, which is the reaction of Cu-Zn-Sn precursor and selenium vapor. And the post-selenization is dependent on various factors (e.g. temperature, precursor composition, reaction atmosphere, etc).However, synergistic regulation of these factors cannot be realized under a widely-used single-temperature zone selenization condition.Here, in our dual-temperature zone selenization scheme, a solid-liquid and solid-gas (solid precursor and liquid/gas phase Se) synergistic reaction strategy has been developed to precisely regulate the selenization. Pre-deposited excess liquid Se provides high Se chemical potential to drive a direct and fast formation of the CZTSSe phase, significantly reducing the amount of binary and ternary compounds within phase evolution. And organics removal can be accomplished via a synergistic optimization of Se condensation and subsequent volatilization. We achieve a high-performance CZTSSe solar cell with a remarkable PCE of 13.6%, and the highest large-area PCE of 12.0% (over 1cm2). Our strategy will provide a new idea for further improving efficiency of CZTSSe solar cells via phase evolution regulation, and also for other complicated multi-compound synthesis.
△ Less
Submitted 4 March, 2023;
originally announced March 2023.
-
Towards Memory- and Time-Efficient Backpropagation for Training Spiking Neural Networks
Authors:
Qingyan Meng,
Mingqing Xiao,
Shen Yan,
Yisen Wang,
Zhouchen Lin,
Zhi-Quan Luo
Abstract:
Spiking Neural Networks (SNNs) are promising energy-efficient models for neuromorphic computing. For training the non-differentiable SNN models, the backpropagation through time (BPTT) with surrogate gradients (SG) method has achieved high performance. However, this method suffers from considerable memory cost and training time during training. In this paper, we propose the Spatial Learning Throug…
▽ More
Spiking Neural Networks (SNNs) are promising energy-efficient models for neuromorphic computing. For training the non-differentiable SNN models, the backpropagation through time (BPTT) with surrogate gradients (SG) method has achieved high performance. However, this method suffers from considerable memory cost and training time during training. In this paper, we propose the Spatial Learning Through Time (SLTT) method that can achieve high performance while greatly improving training efficiency compared with BPTT. First, we show that the backpropagation of SNNs through the temporal domain contributes just a little to the final calculated gradients. Thus, we propose to ignore the unimportant routes in the computational graph during backpropagation. The proposed method reduces the number of scalar multiplications and achieves a small memory occupation that is independent of the total time steps. Furthermore, we propose a variant of SLTT, called SLTT-K, that allows backpropagation only at K time steps, then the required number of scalar multiplications is further reduced and is independent of the total time steps. Experiments on both static and neuromorphic datasets demonstrate superior training efficiency and performance of our SLTT. In particular, our method achieves state-of-the-art accuracy on ImageNet, while the memory cost and training time are reduced by more than 70% and 50%, respectively, compared with BPTT.
△ Less
Submitted 7 August, 2023; v1 submitted 28 February, 2023;
originally announced February 2023.
-
NeuralStagger: Accelerating Physics-constrained Neural PDE Solver with Spatial-temporal Decomposition
Authors:
Xinquan Huang,
Wenlei Shi,
Qi Meng,
Yue Wang,
Xiaotian Gao,
Jia Zhang,
Tie-Yan Liu
Abstract:
Neural networks have shown great potential in accelerating the solution of partial differential equations (PDEs). Recently, there has been a growing interest in introducing physics constraints into training neural PDE solvers to reduce the use of costly data and improve the generalization ability. However, these physics constraints, based on certain finite dimensional approximations over the funct…
▽ More
Neural networks have shown great potential in accelerating the solution of partial differential equations (PDEs). Recently, there has been a growing interest in introducing physics constraints into training neural PDE solvers to reduce the use of costly data and improve the generalization ability. However, these physics constraints, based on certain finite dimensional approximations over the function space, must resolve the smallest scaled physics to ensure the accuracy and stability of the simulation, resulting in high computational costs from large input, output, and neural networks. This paper proposes a general acceleration methodology called NeuralStagger by spatially and temporally decomposing the original learning tasks into several coarser-resolution subtasks. We define a coarse-resolution neural solver for each subtask, which requires fewer computational resources, and jointly train them with the vanilla physics-constrained loss by simply arranging their outputs to reconstruct the original solution. Due to the perfect parallelism between them, the solution is achieved as fast as a coarse-resolution neural solver. In addition, the trained solvers bring the flexibility of simulating with multiple levels of resolution. We demonstrate the successful application of NeuralStagger on 2D and 3D fluid dynamics simulations, which leads to an additional $10\sim100\times$ speed-up. Moreover, the experiment also shows that the learned model could be well used for optimal control.
△ Less
Submitted 27 May, 2023; v1 submitted 20 February, 2023;
originally announced February 2023.
-
Learning Neural Operators on Riemannian Manifolds
Authors:
Gengxiang Chen,
Xu Liu,
Qinglu Meng,
Lu Chen,
Changqing Liu,
Yingguang Li
Abstract:
In Artificial Intelligence (AI) and computational science, learning the map**s between functions (called operators) defined on complex computational domains is a common theoretical challenge. Recently, Neural Operator emerged as a promising framework with a discretisation-independent model structure to break the fixed-dimension limitation of classical deep learning models. However, existing oper…
▽ More
In Artificial Intelligence (AI) and computational science, learning the map**s between functions (called operators) defined on complex computational domains is a common theoretical challenge. Recently, Neural Operator emerged as a promising framework with a discretisation-independent model structure to break the fixed-dimension limitation of classical deep learning models. However, existing operator learning methods mainly focus on regular computational domains, and many components of these methods rely on Euclidean structural data. In real-life applications, many operator learning problems are related to complex computational domains such as complex surfaces and solids, which are non-Euclidean and widely referred to as Riemannian manifolds. Here, we report a new concept, Neural Operator on Riemannian Manifolds (NORM), which generalises Neural Operator from being limited to Euclidean spaces to being applicable to Riemannian manifolds, and can learn the map** between functions defined on any real-life complex geometries, while preserving the discretisation-independent model structure. NORM shifts the function-to-function map** to finite-dimensional map** in the Laplacian eigenfunctions' subspace of geometry, and holds universal approximation property in learning operators on Riemannian manifolds even with only one fundamental block. The theoretical and experimental analysis prove that NORM is a significant step forward in operator learning and has the potential to solve complex problems in many fields of applications sharing the same nature and theoretical principle.
△ Less
Submitted 12 December, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Monte Carlo Neural PDE Solver for Learning PDEs via Probabilistic Representation
Authors:
Rui Zhang,
Qi Meng,
Rongchan Zhu,
Yue Wang,
Wenlei Shi,
Shihua Zhang,
Zhi-Ming Ma,
Tie-Yan Liu
Abstract:
In scenarios with limited available data, training the function-to-function neural PDE solver in an unsupervised manner is essential. However, the efficiency and accuracy of existing methods are constrained by the properties of numerical algorithms, such as finite difference and pseudo-spectral methods, integrated during the training stage. These methods necessitate careful spatiotemporal discreti…
▽ More
In scenarios with limited available data, training the function-to-function neural PDE solver in an unsupervised manner is essential. However, the efficiency and accuracy of existing methods are constrained by the properties of numerical algorithms, such as finite difference and pseudo-spectral methods, integrated during the training stage. These methods necessitate careful spatiotemporal discretization to achieve reasonable accuracy, leading to significant computational challenges and inaccurate simulations, particularly in cases with substantial spatiotemporal variations. To address these limitations, we propose the Monte Carlo Neural PDE Solver (MCNP Solver) for training unsupervised neural solvers via the PDEs' probabilistic representation, which regards macroscopic phenomena as ensembles of random particles. Compared to other unsupervised methods, MCNP Solver naturally inherits the advantages of the Monte Carlo method, which is robust against spatiotemporal variations and can tolerate coarse step size. In simulating the trajectories of particles, we employ Heun's method for the convection process and calculate the expectation via the probability density function of neighbouring grid points during the diffusion process. These techniques enhance accuracy and circumvent the computational issues associated with Monte Carlo sampling. Our numerical experiments on convection-diffusion, Allen-Cahn, and Navier-Stokes equations demonstrate significant improvements in accuracy and efficiency compared to other unsupervised baselines. The source code will be publicly available at: https://github.com/optray/MCNP.
△ Less
Submitted 20 May, 2024; v1 submitted 10 February, 2023;
originally announced February 2023.
-
SPIDE: A Purely Spike-based Method for Training Feedback Spiking Neural Networks
Authors:
Mingqing Xiao,
Qingyan Meng,
Zongpeng Zhang,
Yisen Wang,
Zhouchen Lin
Abstract:
Spiking neural networks (SNNs) with event-based computation are promising brain-inspired models for energy-efficient applications on neuromorphic hardware. However, most supervised SNN training methods, such as conversion from artificial neural networks or direct training with surrogate gradients, require complex computation rather than spike-based operations of spiking neurons during training. In…
▽ More
Spiking neural networks (SNNs) with event-based computation are promising brain-inspired models for energy-efficient applications on neuromorphic hardware. However, most supervised SNN training methods, such as conversion from artificial neural networks or direct training with surrogate gradients, require complex computation rather than spike-based operations of spiking neurons during training. In this paper, we study spike-based implicit differentiation on the equilibrium state (SPIDE) that extends the recently proposed training method, implicit differentiation on the equilibrium state (IDE), for supervised learning with purely spike-based computation, which demonstrates the potential for energy-efficient training of SNNs. Specifically, we introduce ternary spiking neuron couples and prove that implicit differentiation can be solved by spikes based on this design, so the whole training procedure, including both forward and backward passes, is made as event-driven spike computation, and weights are updated locally with two-stage average firing rates. Then we propose to modify the reset membrane potential to reduce the approximation error of spikes. With these key components, we can train SNNs with flexible structures in a small number of time steps and with firing sparsity during training, and the theoretical estimation of energy costs demonstrates the potential for high efficiency. Meanwhile, experiments show that even with these constraints, our trained models can still achieve competitive results on MNIST, CIFAR-10, CIFAR-100, and CIFAR10-DVS. Our code is available at https://github.com/pkuxmq/SPIDE-FSNN.
△ Less
Submitted 31 January, 2023;
originally announced February 2023.
-
Magnetostriction, piezomagnetism and domain nucleation in a kagome antiferromagnet
Authors:
Qingkai Meng,
Jianting Dong,
Pan Nie,
Liangcai Xu,
**hua Wang,
Shan Jiang,
Huakun Zuo,
Jia Zhang,
Xiaokang Li,
Zengwei Zhu,
Leon Balents,
Kamran Behnia
Abstract:
Whenever the elastic energy of a solid depends on magnetic field, there is a magnetostrictive response. Field-linear magnetostriction implies piezomagnetism and vice versa. Here, we show that Mn$_3$Sn, a non-collinear antiferromanget with Weyl nodes, hosts a large and almost perfectly linear magnetostriction even at room temperature. The longitudinal and transverse magnetostriction, with opposite…
▽ More
Whenever the elastic energy of a solid depends on magnetic field, there is a magnetostrictive response. Field-linear magnetostriction implies piezomagnetism and vice versa. Here, we show that Mn$_3$Sn, a non-collinear antiferromanget with Weyl nodes, hosts a large and almost perfectly linear magnetostriction even at room temperature. The longitudinal and transverse magnetostriction, with opposite signs and similar amplitude are restricted to the kagome planes and the out-of-plane response is negligibly small. By studying four different samples with different Mn:Sn ratios, we find a clear correlation between the linear magnetostriction, the spontaneous magnetization and the concentration of Sn vacancies. The recently reported piezomagnetic data fits in our picture. We show that linear magnetostriction and piezomagnetism are both driven by the field-induced in-plane twist of spins. A quantitative account of the experimental data requires the distortion of the spin texture by Sn vacancies. We find that the field-induced domain nucleation within the hysteresis loop corresponds to a phase transition. Within the hysteresis loop, a concomitant mesoscopic modulation of local strain and spin twist angles, leading to twisto-magnetic stripes, arises as a result of the competition between elastic and magnetic energies.
△ Less
Submitted 28 March, 2024; v1 submitted 16 January, 2023;
originally announced January 2023.
-
Closed-Loop Magnetic Manipulation for Robotic Transesophageal Echocardiography
Authors:
Keyu Li,
Yangxin Xu,
Ziqi Zhao,
Ang Li,
Max Q. -H. Meng
Abstract:
This paper presents a closed-loop magnetic manipulation framework for robotic transesophageal echocardiography (TEE) acquisitions. Different from previous work on intracorporeal robotic ultrasound acquisitions that focus on continuum robot control, we first investigate the use of magnetic control methods for more direct, intuitive, and accurate manipulation of the distal tip of the probe. We modif…
▽ More
This paper presents a closed-loop magnetic manipulation framework for robotic transesophageal echocardiography (TEE) acquisitions. Different from previous work on intracorporeal robotic ultrasound acquisitions that focus on continuum robot control, we first investigate the use of magnetic control methods for more direct, intuitive, and accurate manipulation of the distal tip of the probe. We modify a standard TEE probe by attaching a permanent magnet and an inertial measurement unit sensor to the probe tip and replacing the flexible gastroscope with a soft tether containing only wires for transmitting ultrasound signals, and show that 6-DOF localization and 5-DOF closed-loop control of the probe can be achieved with an external permanent magnet based on the fusion of internal inertial measurement and external magnetic field sensing data. The proposed method does not require complex structures or motions of the actuator and the probe compared with existing magnetic manipulation methods. We have conducted extensive experiments to validate the effectiveness of the framework in terms of localization accuracy, update rate, workspace size, and tracking accuracy. In addition, our results obtained on a realistic cardiac tissue-mimicking phantom show that the proposed framework is applicable in real conditions and can generally meet the requirements for tele-operated TEE acquisitions.
△ Less
Submitted 28 May, 2023; v1 submitted 16 January, 2023;
originally announced January 2023.
-
Open Case Studies: Statistics and Data Science Education through Real-World Applications
Authors:
Carrie Wright,
Qier Meng,
Michael R. Breshock,
Lyla Atta,
Margaret A. Taub,
Leah R Jager,
John Muschelli,
Stephanie C. Hicks
Abstract:
With unprecedented and growing interest in data science education, there are limited educator materials that provide meaningful opportunities for learners to practice statistical thinking, as defined by Wild and Pfannkuch (1999), with messy data addressing real-world challenges. As a solution, Nolan and Speed (1999) advocated for bringing applications to the forefront in undergraduate statistics c…
▽ More
With unprecedented and growing interest in data science education, there are limited educator materials that provide meaningful opportunities for learners to practice statistical thinking, as defined by Wild and Pfannkuch (1999), with messy data addressing real-world challenges. As a solution, Nolan and Speed (1999) advocated for bringing applications to the forefront in undergraduate statistics curriculum with the use of in-depth case studies to encourage and develop statistical thinking in the classroom. Limitations to this approach include the significant time investment required to develop a case study -- namely, to select a motivating question and to create an illustrative data analysis -- and the domain expertise needed. As a result, case studies based on realistic challenges, not toy examples, are scarce. To address this, we developed the Open Case Studies (https://www.opencasestudies.org) project, which offers a new statistical and data science education case study model. This educational resource provides self-contained, multimodal, peer-reviewed, and open-source guides (or case studies) from real-world examples for active experiences of complete data analyses. We developed an educator's guide describing how to most effectively use the case studies, how to modify and adapt components of the case studies in the classroom, and how to contribute new case studies. (https://www.opencasestudies.org/OCS_Guide).
△ Less
Submitted 12 January, 2023;
originally announced January 2023.
-
Modelling particle collisions in moderately dense curtain impacted by an incident shock wave
Authors:
Pikai Zhang,
Huangwei Zhang,
Yun Feng Zhang,
Shangpeng Li,
Qingyang Meng
Abstract:
The interactions between an incident shock and moderately dense particle curtain are simulated with the Eulerian-Lagrangian method. A customized solver based on OpenFOAM is extended with an improved drag model and collision model, and then validated against two benchmark experiments. In this work, parametric studies are performed considering different particle sizes, volume fractions, and curtain…
▽ More
The interactions between an incident shock and moderately dense particle curtain are simulated with the Eulerian-Lagrangian method. A customized solver based on OpenFOAM is extended with an improved drag model and collision model, and then validated against two benchmark experiments. In this work, parametric studies are performed considering different particle sizes, volume fractions, and curtain thicknesses. It is found that smaller particle size and larger volume fractions lead to stronger reflected shock and weaker transmitted shock. Different expansion stages of the curtain fronts are also studied in detail. Attention is paid to the particle collision effects on the curtain evolution behaviours. According to our results, for the mono-dispersed particle curtain, the collision effects on curtain front behaviors are small, even when the initial particle volume fraction is as high as 20%. This is due to the positive velocity gradient across the curtain after the shock wave passage, leading to faster motion of downstream particles than the upstream ones and hence no collision occurs. For the bi-dispersed particle curtain, the collision effects become important in the mixing region of different-size particles. Collisions decelerate small particles while accelerate large ones and cause velocity scattering. Moreover, increasing the bi-dispersed curtain thickness leads to multiple collision force peaks due to the local particle accumulations, which is the result of the delayed separation of different particle groups. Our results indicate that the collision model may be unnecessary to predict curtain fronts in mono-dispersed particles, but in bi-dispersed particles, the collision effects are important and therefore must be modelled.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
MHCCL: Masked Hierarchical Cluster-Wise Contrastive Learning for Multivariate Time Series
Authors:
Qianwen Meng,
Hangwei Qian,
Yong Liu,
Lizhen Cui,
Yonghui Xu,
Zhiqi Shen
Abstract:
Learning semantic-rich representations from raw unlabeled time series data is critical for downstream tasks such as classification and forecasting. Contrastive learning has recently shown its promising representation learning capability in the absence of expert annotations. However, existing contrastive approaches generally treat each instance independently, which leads to false negative pairs tha…
▽ More
Learning semantic-rich representations from raw unlabeled time series data is critical for downstream tasks such as classification and forecasting. Contrastive learning has recently shown its promising representation learning capability in the absence of expert annotations. However, existing contrastive approaches generally treat each instance independently, which leads to false negative pairs that share the same semantics. To tackle this problem, we propose MHCCL, a Masked Hierarchical Cluster-wise Contrastive Learning model, which exploits semantic information obtained from the hierarchical structure consisting of multiple latent partitions for multivariate time series. Motivated by the observation that fine-grained clustering preserves higher purity while coarse-grained one reflects higher-level semantics, we propose a novel downward masking strategy to filter out fake negatives and supplement positives by incorporating the multi-granularity information from the clustering hierarchy. In addition, a novel upward masking strategy is designed in MHCCL to remove outliers of clusters at each partition to refine prototypes, which helps speed up the hierarchical clustering process and improves the clustering quality. We conduct experimental evaluations on seven widely-used multivariate time series datasets. The results demonstrate the superiority of MHCCL over the state-of-the-art approaches for unsupervised time series representation learning.
△ Less
Submitted 30 March, 2023; v1 submitted 2 December, 2022;
originally announced December 2022.
-
EBHI-Seg: A Novel Enteroscope Biopsy Histopathological Haematoxylin and Eosin Image Dataset for Image Segmentation Tasks
Authors:
Liyu Shi,
Xiaoyan Li,
Weiming Hu,
Haoyuan Chen,
**g Chen,
Zizhen Fan,
Minghe Gao,
Yujie **g,
Guotao Lu,
Deguo Ma,
Zhiyu Ma,
Qingtao Meng,
Dechao Tang,
Hongzan Sun,
Marcin Grzegorzek,
Shouliang Qi,
Yueyang Teng,
Chen Li
Abstract:
Background and Purpose: Colorectal cancer is a common fatal malignancy, the fourth most common cancer in men, and the third most common cancer in women worldwide. Timely detection of cancer in its early stages is essential for treating the disease. Currently, there is a lack of datasets for histopathological image segmentation of rectal cancer, which often hampers the assessment accuracy when comp…
▽ More
Background and Purpose: Colorectal cancer is a common fatal malignancy, the fourth most common cancer in men, and the third most common cancer in women worldwide. Timely detection of cancer in its early stages is essential for treating the disease. Currently, there is a lack of datasets for histopathological image segmentation of rectal cancer, which often hampers the assessment accuracy when computer technology is used to aid in diagnosis. Methods: This present study provided a new publicly available Enteroscope Biopsy Histopathological Hematoxylin and Eosin Image Dataset for Image Segmentation Tasks (EBHI-Seg). To demonstrate the validity and extensiveness of EBHI-Seg, the experimental results for EBHI-Seg are evaluated using classical machine learning methods and deep learning methods. Results: The experimental results showed that deep learning methods had a better image segmentation performance when utilizing EBHI-Seg. The maximum accuracy of the Dice evaluation metric for the classical machine learning method is 0.948, while the Dice evaluation metric for the deep learning method is 0.965. Conclusion: This publicly available dataset contained 5,170 images of six types of tumor differentiation stages and the corresponding ground truth images. The dataset can provide researchers with new segmentation algorithms for medical diagnosis of colorectal cancer, which can be used in the clinical setting to help doctors and patients.
△ Less
Submitted 6 December, 2022; v1 submitted 1 December, 2022;
originally announced December 2022.
-
Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images
Authors:
Meng Wang,
Kai Yu,
Chun-Mei Feng,
Ke Zou,
Yanyu Xu,
Qingquan Meng,
Rick Siow Mong Goh,
Yong Liu,
Huazhu Fu
Abstract:
Focusing on the complicated pathological features, such as blurred boundaries, severe scale differences between symptoms, background noise interference, etc., in the task of retinal edema lesions joint segmentation from OCT images and enabling the segmentation results more reliable. In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network, which can provide accur…
▽ More
Focusing on the complicated pathological features, such as blurred boundaries, severe scale differences between symptoms, background noise interference, etc., in the task of retinal edema lesions joint segmentation from OCT images and enabling the segmentation results more reliable. In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network, which can provide accurate segmentation results with reliability assessment. Specifically, aiming at improving the model's ability to learn the complex pathological features of retinal edema lesions in OCT images, we develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module of our newly designed. Meanwhile, to make the segmentation results more reliable, a novel uncertainty segmentation head based on the subjective logical evidential theory is introduced to generate the final segmentation results with a corresponding overall uncertainty evaluation score map. We conduct comprehensive experiments on the public database of AI-Challenge 2018 for retinal edema lesions segmentation, and the results show that our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches. The code will be released on: https://github.com/LooKing9218/ReliableRESeg.
△ Less
Submitted 1 January, 2024; v1 submitted 1 December, 2022;
originally announced December 2022.
-
Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search
Authors:
Zihan Wang,
Qi Meng,
HaiFeng Lan,
XinRui Zhang,
KeHao Guo,
Akshat Gupta
Abstract:
Speech emotion recognition (SER) classifies audio into emotion categories such as Happy, Angry, Fear, Disgust and Neutral. While Speech Emotion Recognition (SER) is a common application for popular languages, it continues to be a problem for low-resourced languages, i.e., languages with no pretrained speech-to-text recognition models. This paper firstly proposes a language-specific model that extr…
▽ More
Speech emotion recognition (SER) classifies audio into emotion categories such as Happy, Angry, Fear, Disgust and Neutral. While Speech Emotion Recognition (SER) is a common application for popular languages, it continues to be a problem for low-resourced languages, i.e., languages with no pretrained speech-to-text recognition models. This paper firstly proposes a language-specific model that extract emotional information from multiple pre-trained speech models, and then designs a multi-domain model that simultaneously performs SER for various languages. Our multidomain model employs a multi-gating mechanism to generate unique weighted feature combination for each language, and also searches for specific neural network structure for each language through a neural architecture search module. In addition, we introduce a contrastive auxiliary loss to build more separable representations for audio data. Our experiments show that our model raises the state-of-the-art accuracy by 3% for German and 14.3% for French.
△ Less
Submitted 15 November, 2022; v1 submitted 31 October, 2022;
originally announced November 2022.
-
Ultrafast formation of topological defects in a 2D charge density wave
Authors:
Yun Cheng,
Alfred Zong,
Lijun Wu,
Qing** Meng,
Wei Xia,
Fengfeng Qi,
Pengfei Zhu,
Xiao Zou,
Tao Jiang,
Yanfeng Guo,
Jasper van Wezel,
Anshul Kogar,
Michael W. Zuerch,
Jie Zhang,
Yimei Zhu,
Dao Xiang
Abstract:
Topological defects play a key role in nonequilibrium phase transitions, ranging from birth of the early universe to quantum critical behavior of ultracold atoms. In solids, transient defects are known to generate a variety of hidden orders not accessible in equilibrium, but how defects are formed at the nanometer lengthscale and femtosecond timescale remains unknown. Here, we employ an intense la…
▽ More
Topological defects play a key role in nonequilibrium phase transitions, ranging from birth of the early universe to quantum critical behavior of ultracold atoms. In solids, transient defects are known to generate a variety of hidden orders not accessible in equilibrium, but how defects are formed at the nanometer lengthscale and femtosecond timescale remains unknown. Here, we employ an intense laser pulse to create topological defects in a 2D charge density wave, and track their morphology and dynamics with ultrafast electron diffraction. Leveraging its high temporal resolution and sensitivity in detecting weak diffuse signals, we discover a dual-stage growth of 1D domain walls within 1 ps, a process not dictated by the order parameter amplitude but instead mediated by a nonthermal population of longitudinal optical phonons. Our work provides a framework for ultrafast engineering of topological defects based on selective excitation of collective modes, opening new avenues for dynamical control of nonequilibrium phases in correlated materials.
△ Less
Submitted 10 November, 2022;
originally announced November 2022.
-
A precisely regulating phase evolution strategy for highly efficient kesterite solar cells
Authors:
Jiazheng Zhou,
Xiao Xu,
Huijue Wu,
**lin Wang,
Licheng Lou,
Kang Yin,
Yuancai Gong,
Jiangjian Shi,
Yanhong Luo,
Dongmei Li,
Hao Xin,
Qingbo Meng
Abstract:
Phase evolution during the selenization is crucial for high-quality kesterite Cu2ZnSn(S, Se)4 (CZTSSe) absorbers and efficient solar cells. Herein, we regulate kinetic process of phase evolution from Cu+-Sn4+-MOE (MOE: 2-methoxyethanol) system by precisely controlling positive chamber pressure. We found that, at the heating-up stage, Se vapor concentration is intentionally suppressed in low-temper…
▽ More
Phase evolution during the selenization is crucial for high-quality kesterite Cu2ZnSn(S, Se)4 (CZTSSe) absorbers and efficient solar cells. Herein, we regulate kinetic process of phase evolution from Cu+-Sn4+-MOE (MOE: 2-methoxyethanol) system by precisely controlling positive chamber pressure. We found that, at the heating-up stage, Se vapor concentration is intentionally suppressed in low-temperature region, which effectively reduces collision probability between the CZTS and Se atoms, thus remarkably inhibiting formation of secondary phases on the surface and multiple-step phase evolution processes. This strategy enables the phase evolution to start at relatively higher temperature and thereby leading to high crystalline quality CZTSSe absorber with fewer defects, and corresponding CZTSSe solar cell can present 14.1% efficiency (total area), which is the highest result so far. This work provides important insights into selenization mechanism of CZTSSe absorbers and explores a new way of kinetic regulation strategy to simplify the phase evolution path to efficient CZTSSe solar cells.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Extrinsic Manipulation on a Support Plane by Learning Regras**
Authors:
Peng Xu,
Zhiyuan Chen,
Jiankun Wang,
Max Q. -H. Meng
Abstract:
Extrinsic manipulation, a technique that enables robots to leverage extrinsic resources for object manipulation, presents practical yet challenging scenarios. Particularly in the context of extrinsic manipulation on a supporting plane, regras** becomes essential for achieving the desired final object poses. This process involves sequential operation steps and stable placements of objects, which…
▽ More
Extrinsic manipulation, a technique that enables robots to leverage extrinsic resources for object manipulation, presents practical yet challenging scenarios. Particularly in the context of extrinsic manipulation on a supporting plane, regras** becomes essential for achieving the desired final object poses. This process involves sequential operation steps and stable placements of objects, which provide grasp space for the robot. To address this challenge, we focus on predicting diverse placements of objects on the plane using deep neural networks. A framework that comprises orientation generation, placement refinement, and placement discrimination stages is proposed, leveraging point clouds to obtain precise and diverse stable placements. To facilitate training, a large-scale dataset is constructed, encompassing stable object placements and contact information between objects. Through extensive experiments, our approach is demonstrated to outperform the start-of-the-art, achieving an accuracy rate of 90.4\% and a diversity rate of 81.3\% in predicted placements. Furthermore, we validate the effectiveness of our approach through real-robot experiments, demonstrating its capability to compute sequential pick-and-place steps based on the predicted placements for regras** objects to goal poses that are not readily attainable within a single step. Videos and dataset are available at https://sites.google.com/view/pmvlr2022/.
△ Less
Submitted 11 July, 2023; v1 submitted 11 October, 2022;
originally announced October 2022.
-
Online Training Through Time for Spiking Neural Networks
Authors:
Mingqing Xiao,
Qingyan Meng,
Zongpeng Zhang,
Di He,
Zhouchen Lin
Abstract:
Spiking neural networks (SNNs) are promising brain-inspired energy-efficient models. Recent progress in training methods has enabled successful deep SNNs on large-scale tasks with low latency. Particularly, backpropagation through time (BPTT) with surrogate gradients (SG) is popularly used to achieve high performance in a very small number of time steps. However, it is at the cost of large memory…
▽ More
Spiking neural networks (SNNs) are promising brain-inspired energy-efficient models. Recent progress in training methods has enabled successful deep SNNs on large-scale tasks with low latency. Particularly, backpropagation through time (BPTT) with surrogate gradients (SG) is popularly used to achieve high performance in a very small number of time steps. However, it is at the cost of large memory consumption for training, lack of theoretical clarity for optimization, and inconsistency with the online property of biological learning and rules on neuromorphic hardware. Other works connect spike representations of SNNs with equivalent artificial neural network formulation and train SNNs by gradients from equivalent map**s to ensure descent directions. But they fail to achieve low latency and are also not online. In this work, we propose online training through time (OTTT) for SNNs, which is derived from BPTT to enable forward-in-time learning by tracking presynaptic activities and leveraging instantaneous loss and gradients. Meanwhile, we theoretically analyze and prove that gradients of OTTT can provide a similar descent direction for optimization as gradients based on spike representations under both feedforward and recurrent conditions. OTTT only requires constant training memory costs agnostic to time steps, avoiding the significant memory costs of BPTT for GPU training. Furthermore, the update rule of OTTT is in the form of three-factor Hebbian learning, which could pave a path for online on-chip learning. With OTTT, it is the first time that two mainstream supervised SNN training methods, BPTT with SG and spike representation-based training, are connected, and meanwhile in a biologically plausible form. Experiments on CIFAR-10, CIFAR-100, ImageNet, and CIFAR10-DVS demonstrate the superior performance of our method on large-scale static and neuromorphic datasets in small time steps.
△ Less
Submitted 31 December, 2022; v1 submitted 9 October, 2022;
originally announced October 2022.
-
Transmission of hydrogen detonation across a curtain of dilute inert particles
Authors:
Yong Xu,
Pikai Zhang,
Qingyang Meng,
Shangpeng Li,
Huangwei Zhang
Abstract:
Transmission of hydrogen detonation wave (DW) in an inert particle curtain is simulated using the Eulerian-Lagrangian approach with gas-particle two-way coupling. A detailed chemical mechanism is used for hydrogen detonative combustion and parametric studies are conducted based on a two-dimensional computational domain. A detonation map of propagation and extinction corresponding to various partic…
▽ More
Transmission of hydrogen detonation wave (DW) in an inert particle curtain is simulated using the Eulerian-Lagrangian approach with gas-particle two-way coupling. A detailed chemical mechanism is used for hydrogen detonative combustion and parametric studies are conducted based on a two-dimensional computational domain. A detonation map of propagation and extinction corresponding to various particle sizes, concentrations, and curtain thicknesses is plotted. It is shown that the critical curtain thickness decreases considerably when the particle concentration is less than the critical value. The effects of curtain thickness on the trajectories of peak pressure, shock front speed, and heat release rate are examined. Three propagation modes of the DW in particle curtain are found: detonation transmission, partial extinction and detonation reinitiation, and detonation extinction. The chemical explosive mode analysis confirms that a detonation re-initiation event is caused by a re-initiation point with high pressure and explosive propensity, resulting from transverse shock focusing. The influence of particle dimeter and concentration, and curtain thickness on the DW are also examined with peak pressure trajectories, shock speed, and interphase exchange rates of energy and momentum. Furthermore, the evolutions of curtain morphologies are analyzed by the particle velocity, volume fraction, Stokes drag and Archimedes force. This analysis confirms the importance of the drag force in influencing the change of curtain morphologies. Different curtain evolution regimes are found: quasi-stationary regime, shrinkage regime, constant-thickness regime, and expansion regime. Finally, the influences of the curtain thickness on the characteristic time of curtain evolutions are studied.
△ Less
Submitted 8 October, 2022;
originally announced October 2022.
-
Optomechanical Effects in Nanocavity-enhanced Resonant Raman Scattering of a Single Molecule
Authors:
Xuan-Ming Shen,
Yuan Zhang,
Shun** Zhang,
Yao Zhang,
Qiu-Shi Meng,
Guangchao Zheng,
Siyuan Lv,
Luxia Wang,
Roberto A. Boto,
Chongxin Shan,
Javier Aizpurua
Abstract:
In this article, we address the optomechanical effects in surface-enhanced resonant Raman scattering (SERRS) from a single molecule in a nano-particle on mirror (NPoM) nanocavity by develo** a quantum master equation theory, which combines macroscopic quantum electrodynamics and electron-vibration interaction within the framework of open quantum system theory. We supplement the theory with elect…
▽ More
In this article, we address the optomechanical effects in surface-enhanced resonant Raman scattering (SERRS) from a single molecule in a nano-particle on mirror (NPoM) nanocavity by develo** a quantum master equation theory, which combines macroscopic quantum electrodynamics and electron-vibration interaction within the framework of open quantum system theory. We supplement the theory with electromagnetic simulations and time-dependent density functional theory calculations in order to study the SERRS of a methylene blue molecule in a realistic NPoM nanocavity. The simulations allow us not only to identify the conditions to achieve conventional optomechanical effects, such as vibrational pum**, non-linear scaling of Stokes and anti-Stokes scattering, but also to discovery distinct behaviors, such as the saturation of exciton population, the emergence of Mollow triplet side-bands, and higher-order Raman scattering. All in all, our study might guide further investigations of optomechanical effects in resonant Raman scattering.
△ Less
Submitted 5 October, 2022;
originally announced October 2022.
-
Structure and dynamics of spray detonation in n-heptane droplet-vapor-air mixtures
Authors:
Qingyang Meng,
Majie Zhao,
Yong Xu,
Liangqi Zhang,
Huangwei Zhang
Abstract:
Spray detonation in n-heptane two-phase mixtures is simulated using Eulerian Lagrangian method. Two-dimensional configuration is considered, and the effects of droplet diameter and liquid equivalence ratio on detonation propagation, structure, and dynamics are investigated. The results show that the average detonation propagation speed first increases and then decreases as liquid equivalence ratio…
▽ More
Spray detonation in n-heptane two-phase mixtures is simulated using Eulerian Lagrangian method. Two-dimensional configuration is considered, and the effects of droplet diameter and liquid equivalence ratio on detonation propagation, structure, and dynamics are investigated. The results show that the average detonation propagation speed first increases and then decreases as liquid equivalence ratio changes, and the speed peaks at higher liquid equivalence ratio for larger droplets. The triple points and transverse detonations vaporize or aerodynamically expel the droplets from their trajectories, resulting in non-uniform distributions of fuel vapor and reaction zones behind the detonation. In addition, droplet dispersion distance in the post-detonation area increases for larger droplets due to lower evaporation. Moreover, small droplets generally lead to higher detonated n-heptane fraction, and fuel detonative combustion directly affects the variations of detonated fuel fraction. For larger droplets, V shaped dependence on liquid equivalence ratio is seen for large droplets, dominated by variations of post-detonation deflagration. It is found that spray detonation structure is significantly influenced by liquid fuel equivalence ratio and droplet diameter. The dependence of key locations in spray detonation structure on liquid fuel properties is also evaluated, e.g., reaction front and sonic plane. Furthermore, the leading shock Mach number slightly decreases with droplet size. When the liquid equivalence ratio is high, spray detonation exhibits pronounced unsteadiness, such as instantaneous or complete extinction. Either extinction is caused by strong heat absorption of evaporating droplets behind the shock. Moreover, localized detonative spot is observed due to the compression of multiple transverse shocks.
△ Less
Submitted 23 September, 2022;
originally announced September 2022.
-
Optimized Design Method for Satellite Constellation Configuration Based on Real-time Coverage Area Evaluation
Authors:
Jiahao Zhou,
Boheng Li,
Qingxiang Meng
Abstract:
When using constellation synergy to image large areas for reconnaissance, it is required to achieve the coverage capability requirements with minimal consumption of observation resources to obtain the most optimal constellation observation scheme. With the minimum number of satellites and meeting the real-time ground coverage requirements as the optimization objectives, this paper proposes an opti…
▽ More
When using constellation synergy to image large areas for reconnaissance, it is required to achieve the coverage capability requirements with minimal consumption of observation resources to obtain the most optimal constellation observation scheme. With the minimum number of satellites and meeting the real-time ground coverage requirements as the optimization objectives, this paper proposes an optimized design of satellite constellation configuration for full coverage of large-scale regional imaging by using an improved simulated annealing algorithm combined with the real-time coverage evaluation method of hexagonal discretization. The algorithm can adapt to experimental conditions, has good efficiency, and can meet industrial accuracy requirements. The effectiveness and adaptability of the algorithm are tested in simulation applications.
△ Less
Submitted 5 December, 2022; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Comprehensive Evaluation of Emergency Shelters in Wuhan City Based on GIS
Authors:
Tingyu Luo,
Boheng Li,
Qingxiang Meng
Abstract:
Emergency shelters, which reflect the city's ability to respond to and deal with major public emergencies to a certain extent, are essential to a modern urban emergency management system. This paper is based on spatial analysis methods, using Analytic Hierarchy Process to analyze the suitability of the 28 emergency shelters in Wuhan City. The Technique for Order Preference by Similarity to an Idea…
▽ More
Emergency shelters, which reflect the city's ability to respond to and deal with major public emergencies to a certain extent, are essential to a modern urban emergency management system. This paper is based on spatial analysis methods, using Analytic Hierarchy Process to analyze the suitability of the 28 emergency shelters in Wuhan City. The Technique for Order Preference by Similarity to an Ideal Solution is further used to evaluate the accommodation capacity of emergency shelters in central urban areas, which provides a reference for the optimization of existing shelters and the site selection of new shelters, and provides a basis for improving the service capacity of shelters. The results show that the overall situation of emergency shelters in Wuhan is good, with 96\% of the places reaching the medium level or above, but the suitability level needs to be further improved, especially the effectiveness and accessibility. Among the seven central urban areas in Wuhan, Hongshan District has the strongest accommodation capacity while Jianghan District has the weakest, with noticeable differences.
△ Less
Submitted 5 December, 2022; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Efficient Extraction of Hot Carriers in Perovskite Quantum Dot through Building State Coupled Complex
Authors:
Yusheng Li,
Junke Jiang,
Dandan Wang,
Dong Liu,
Shota Yajima,
Hua Li,
Akihito Fuchimoto,
Hongshi Li,
Guozheng Shi,
Shuzi Hayase,
Shuxia Tao,
Jiangjian Shi,
Qingbo Meng,
Chao Ding,
Qing Shen
Abstract:
Utilizing hot carriers is the crucial approach for solar cell to exceed the thermodynamic detailed balance limit, yet effective extraction of hot carriers in absorber materials via most commonly used semiconductor acceptors has been a challenge in both materials and photophysics research for many years. Herein, we build series of CsPbI3 quantum dot and fullerene derivative systems to explore the d…
▽ More
Utilizing hot carriers is the crucial approach for solar cell to exceed the thermodynamic detailed balance limit, yet effective extraction of hot carriers in absorber materials via most commonly used semiconductor acceptors has been a challenge in both materials and photophysics research for many years. Herein, we build series of CsPbI3 quantum dot and fullerene derivative systems to explore the decisive factors of this process and have for the first time realized efficient hot carrier extraction in these systems (maximum extraction efficiency ~ 84%). We find building the systems as state-coupled complexes creates new carrier transport channels at about 0.22 eV above CsPbI3 quantum dot bandgap, which facilitates highly efficient HC extraction. Our research directly visualizes the inner connection of molecule interaction and ultrafast hot carrier extraction. The knowledge and strategy gained here are of universal meaning, taking an important step forward true hot carrier photovoltaics.
△ Less
Submitted 5 September, 2022;
originally announced September 2022.
-
Mesh-based 3D Motion Tracking in Cardiac MRI using Deep Learning
Authors:
Qingjie Meng,
Wenjia Bai,
Tianrui Liu,
Declan P O'Regan,
Daniel Rueckert
Abstract:
3D motion estimation from cine cardiac magnetic resonance (CMR) images is important for the assessment of cardiac function and diagnosis of cardiovascular diseases. Most of the previous methods focus on estimating pixel-/voxel-wise motion fields in the full image space, which ignore the fact that motion estimation is mainly relevant and useful within the object of interest, e.g., the heart. In thi…
▽ More
3D motion estimation from cine cardiac magnetic resonance (CMR) images is important for the assessment of cardiac function and diagnosis of cardiovascular diseases. Most of the previous methods focus on estimating pixel-/voxel-wise motion fields in the full image space, which ignore the fact that motion estimation is mainly relevant and useful within the object of interest, e.g., the heart. In this work, we model the heart as a 3D geometric mesh and propose a novel deep learning-based method that can estimate 3D motion of the heart mesh from 2D short- and long-axis CMR images. By develo** a differentiable mesh-to-image rasterizer, the method is able to leverage the anatomical shape information from 2D multi-view CMR images for 3D motion estimation. The differentiability of the rasterizer enables us to train the method end-to-end. One advantage of the proposed method is that by tracking the motion of each vertex, it is able to keep the vertex correspondence of 3D meshes between time frames, which is important for quantitative assessment of the cardiac function on the mesh. We evaluate the proposed method on CMR images acquired from the UK Biobank study. Experimental results show that the proposed method quantitatively and qualitatively outperforms both conventional and learning-based cardiac motion tracking methods.
△ Less
Submitted 5 September, 2022;
originally announced September 2022.
-
Kinova Gemini: Interactive Robot Gras** with Visual Reasoning and Conversational AI
Authors:
Hanxiao Chen,
Jiankun Wang,
Max Q. -H. Meng
Abstract:
To facilitate recent advances in robotics and AI for delicate collaboration between humans and machines, we propose the Kinova Gemini, an original robotic system that integrates conversational AI dialogue and visual reasoning to make the Kinova Gen3 lite robot help people retrieve objects or complete perception-based pick-and-place tasks. When a person walks up to Kinova Gen3 lite, our Kinova Gemi…
▽ More
To facilitate recent advances in robotics and AI for delicate collaboration between humans and machines, we propose the Kinova Gemini, an original robotic system that integrates conversational AI dialogue and visual reasoning to make the Kinova Gen3 lite robot help people retrieve objects or complete perception-based pick-and-place tasks. When a person walks up to Kinova Gen3 lite, our Kinova Gemini is able to fulfill the user's requests in three different applications: (1) It can start a natural dialogue with people to interact and assist humans to retrieve objects and hand them to the user one by one. (2) It detects diverse objects with YOLO v3 and recognize color attributes of the item to ask people if they want to grasp it via the dialogue or enable the user to choose which specific one is required. (3) It applies YOLO v3 to recognize multiple objects and let you choose two items for perception-based pick-and-place tasks such as "Put the banana into the bowl" with visual reasoning and conversational interaction.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
Multiple topological nodal structure in LaSb2 with large linear magnetoresistance
Authors:
Y. X. Qiao,
Z. C. Tao,
F. Y. Wang,
Huaiqiang Wang,
Z. C. Jiang,
Z. T. Liu,
Soohyun Cho,
F. Y. Zhang,
Q. K. Meng,
W. Xia,
Y. C. Yang,
Z. Huang,
J. S. Liu,
Z. H. Liu,
Z. W. Zhu,
S. Qiao,
Y. F. Guo,
Haijun Zhang,
Dawei Shen
Abstract:
Unconventional fermions in the immensely studied topological semimetals are the source for rich exotic topological properties. Here, using symmetry analysis and first-principles calculations, we propose the coexistence of multiple topological nodal structure in LaSb2, including topological nodal surfaces, nodal lines and in particular eightfold degenerate nodal points, which have been scarcely obs…
▽ More
Unconventional fermions in the immensely studied topological semimetals are the source for rich exotic topological properties. Here, using symmetry analysis and first-principles calculations, we propose the coexistence of multiple topological nodal structure in LaSb2, including topological nodal surfaces, nodal lines and in particular eightfold degenerate nodal points, which have been scarcely observed in a single material. Further, utilizing high resolution angle-resolved photoemission spectroscopy in combination with Shubnikov-de Haas quantum oscillations measurements, we confirm the existence of nodal surfaces and eightfold degenerate nodal points in LaSb2, and extract the π Berry phase proving the non-trivial electronic band structure topology therein. The intriguing multiple topological nodal structure might play a crucial role in giving rise to the large linear magnetoresistance. Our work renews the insights into the exotic topological phenomena in LaSb2 and its analogous.
△ Less
Submitted 22 August, 2022;
originally announced August 2022.
-
Provable Adaptivity of Adam under Non-uniform Smoothness
Authors:
Bohan Wang,
Yushun Zhang,
Huishuai Zhang,
Qi Meng,
Ruoyu Sun,
Zhi-Ming Ma,
Tie-Yan Liu,
Zhi-Quan Luo,
Wei Chen
Abstract:
Adam is widely adopted in practical applications due to its fast convergence. However, its theoretical analysis is still far from satisfactory. Existing convergence analyses for Adam rely on the bounded smoothness assumption, referred to as the \emph{L-smooth condition}. Unfortunately, this assumption does not hold for many deep learning tasks. Moreover, we believe that this assumption obscures th…
▽ More
Adam is widely adopted in practical applications due to its fast convergence. However, its theoretical analysis is still far from satisfactory. Existing convergence analyses for Adam rely on the bounded smoothness assumption, referred to as the \emph{L-smooth condition}. Unfortunately, this assumption does not hold for many deep learning tasks. Moreover, we believe that this assumption obscures the true benefit of Adam, as the algorithm can adapt its update magnitude according to local smoothness. This important feature of Adam becomes irrelevant when assuming globally bounded smoothness. This paper studies the convergence of randomly reshuffled Adam (RR Adam) with diminishing learning rate, which is the major version of Adam adopted in deep learning tasks. We present the first convergence analysis of RR Adam without the bounded smoothness assumption. We demonstrate that RR Adam can maintain its convergence properties when smoothness is linearly bounded by the gradient norm, referred to as the \emph{$(L_0, L_1)$-smooth condition. We further compare Adam to SGD when both methods use diminishing learning rate. We refine the existing lower bound of SGD and show that SGD can be slower than Adam. To our knowledge, this is the first time that Adam and SGD are rigorously compared in the same setting and the advantage of Adam is revealed.
△ Less
Submitted 24 June, 2024; v1 submitted 21 August, 2022;
originally announced August 2022.
-
Does Lorentz-symmetric design boost network performance in jet physics?
Authors:
Congqiao Li,
Huilin Qu,
Sitian Qian,
Qi Meng,
Shiqi Gong,
Jue Zhang,
Tie-Yan Liu,
Qiang Li
Abstract:
In the deep learning era, improving the neural network performance in jet physics is a rewarding task as it directly contributes to more accurate physics measurements at the LHC. Recent research has proposed various network designs in consideration of the full Lorentz symmetry, but its benefit is still not systematically asserted, given that there remain many successful networks without taking it…
▽ More
In the deep learning era, improving the neural network performance in jet physics is a rewarding task as it directly contributes to more accurate physics measurements at the LHC. Recent research has proposed various network designs in consideration of the full Lorentz symmetry, but its benefit is still not systematically asserted, given that there remain many successful networks without taking it into account. We conduct a detailed study on the Lorentz-symmetric design. We propose two generalized approaches for modifying a network - these methods are experimented on Particle Flow Network, ParticleNet, and LorentzNet, and exhibit a general performance gain. We also reveal that the notable improvement attributed to the "pairwise mass" feature in the network is due to its introduction of a structure that fully complies with Lorentz symmetry. We confirm that Lorentz-symmetry preservation serves as a strong inductive bias of jet physics, hence calling for attention to such general recipes in future network designs.
△ Less
Submitted 7 March, 2024; v1 submitted 16 August, 2022;
originally announced August 2022.