-
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets
Authors:
Chengchun Shi,
Runzhe Wan,
Ge Song,
Shikai Luo,
Rui Song,
Hongtu Zhu
Abstract:
The two-sided markets such as ride-sharing companies often involve a group of subjects who are making sequential decisions across time and/or location. With the rapid development of smart phones and internet of things, they have substantially transformed the transportation landscape of human beings. In this paper we consider large-scale fleet management in ride-sharing companies that involve multi…
▽ More
The two-sided markets such as ride-sharing companies often involve a group of subjects who are making sequential decisions across time and/or location. With the rapid development of smart phones and internet of things, they have substantially transformed the transportation landscape of human beings. In this paper we consider large-scale fleet management in ride-sharing companies that involve multiple units in different areas receiving sequences of products (or treatments) over time. Major technical challenges, such as policy evaluation, arise in those studies because (i) spatial and temporal proximities induce interference between locations and times; and (ii) the large number of locations results in the curse of dimensionality. To address both challenges simultaneously, we introduce a multi-agent reinforcement learning (MARL) framework for carrying policy evaluation in these studies. We propose novel estimators for mean outcomes under different products that are consistent despite the high-dimensionality of state-action space. The proposed estimator works favorably in simulation experiments. We further illustrate our method using a real dataset obtained from a two-sided marketplace company to evaluate the effects of applying different subsidizing policies. A Python implementation of our proposed method is available at https://github.com/RunzheStat/CausalMARL.
△ Less
Submitted 26 March, 2023; v1 submitted 21 February, 2022;
originally announced February 2022.
-
Sparse Cross-scale Attention Network for Efficient LiDAR Panoptic Segmentation
Authors:
Shuangjie Xu,
Rui Wan,
Maosheng Ye,
Xiaoyi Zou,
Tongyi Cao
Abstract:
Two major challenges of 3D LiDAR Panoptic Segmentation (PS) are that point clouds of an object are surface-aggregated and thus hard to model the long-range dependency especially for large instances, and that objects are too close to separate each other. Recent literature addresses these problems by time-consuming grou** processes such as dual-clustering, mean-shift offsets, etc., or by bird-eye-…
▽ More
Two major challenges of 3D LiDAR Panoptic Segmentation (PS) are that point clouds of an object are surface-aggregated and thus hard to model the long-range dependency especially for large instances, and that objects are too close to separate each other. Recent literature addresses these problems by time-consuming grou** processes such as dual-clustering, mean-shift offsets, etc., or by bird-eye-view (BEV) dense centroid representation that downplays geometry. However, the long-range geometry relationship has not been sufficiently modeled by local feature learning from the above methods. To this end, we present SCAN, a novel sparse cross-scale attention network to first align multi-scale sparse features with global voxel-encoded attention to capture the long-range relationship of instance context, which can boost the regression accuracy of the over-segmented large objects. For the surface-aggregated points, SCAN adopts a novel sparse class-agnostic representation of instance centroids, which can not only maintain the sparsity of aligned features to solve the under-segmentation on small objects, but also reduce the computation amount of the network through sparse convolution. Our method outperforms previous methods by a large margin in the SemanticKITTI dataset for the challenging 3D PS task, achieving 1st place with a real-time inference speed.
△ Less
Submitted 16 January, 2022;
originally announced January 2022.
-
Enhancing Low-Light Images in Real World via Cross-Image Disentanglement
Authors:
Lanqing Guo,
Renjie Wan,
Wenhan Yang,
Alex Kot,
Bihan Wen
Abstract:
Images captured in the low-light condition suffer from low visibility and various imaging artifacts, e.g., real noise. Existing supervised enlightening algorithms require a large set of pixel-aligned training image pairs, which are hard to prepare in practice. Though weakly-supervised or unsupervised methods can alleviate such challenges without using paired training images, some real-world artifa…
▽ More
Images captured in the low-light condition suffer from low visibility and various imaging artifacts, e.g., real noise. Existing supervised enlightening algorithms require a large set of pixel-aligned training image pairs, which are hard to prepare in practice. Though weakly-supervised or unsupervised methods can alleviate such challenges without using paired training images, some real-world artifacts inevitably get falsely amplified because of the lack of corresponded supervision. In this paper, instead of using perfectly aligned images for training, we creatively employ the misaligned real-world images as the guidance, which are considerably easier to collect. Specifically, we propose a Cross-Image Disentanglement Network (CIDN) to separately extract cross-image brightness and image-specific content features from low/normal-light images. Based on that, CIDN can simultaneously correct the brightness and suppress image artifacts in the feature domain, which largely increases the robustness to the pixel shifts. Furthermore, we collect a new low-light image enhancement dataset consisting of misaligned training images with real-world corruptions. Experimental results show that our model achieves state-of-the-art performances on both the newly proposed dataset and other popular low-light datasets.
△ Less
Submitted 7 July, 2022; v1 submitted 9 January, 2022;
originally announced January 2022.
-
DRINet++: Efficient Voxel-as-point Point Cloud Segmentation
Authors:
Maosheng Ye,
Rui Wan,
Shuangjie Xu,
Tongyi Cao,
Qifeng Chen
Abstract:
Recently, many approaches have been proposed through single or multiple representations to improve the performance of point cloud semantic segmentation. However, these works do not maintain a good balance among performance, efficiency, and memory consumption. To address these issues, we propose DRINet++ that extends DRINet by enhancing the sparsity and geometric properties of a point cloud with a…
▽ More
Recently, many approaches have been proposed through single or multiple representations to improve the performance of point cloud semantic segmentation. However, these works do not maintain a good balance among performance, efficiency, and memory consumption. To address these issues, we propose DRINet++ that extends DRINet by enhancing the sparsity and geometric properties of a point cloud with a voxel-as-point principle. To improve efficiency and performance, DRINet++ mainly consists of two modules: Sparse Feature Encoder and Sparse Geometry Feature Enhancement. The Sparse Feature Encoder extracts the local context information for each point, and the Sparse Geometry Feature Enhancement enhances the geometric properties of a sparse point cloud via multi-scale sparse projection and attentive multi-scale fusion. In addition, we propose deep sparse supervision in the training phase to help convergence and alleviate the memory consumption problem. Our DRINet++ achieves state-of-the-art outdoor point cloud segmentation on both SemanticKITTI and Nuscenes datasets while running significantly faster and consuming less memory.
△ Less
Submitted 16 November, 2021;
originally announced November 2021.
-
Learning Meta Pattern for Face Anti-Spoofing
Authors:
Rizhao Cai,
Zhi Li,
Renjie Wan,
Haoliang Li,
Yongjian Hu,
Alex Chichung Kot
Abstract:
Face Anti-Spoofing (FAS) is essential to secure face recognition systems and has been extensively studied in recent years. Although deep neural networks (DNNs) for the FAS task have achieved promising results in intra-dataset experiments with similar distributions of training and testing data, the DNNs' generalization ability is limited under the cross-domain scenarios with different distributions…
▽ More
Face Anti-Spoofing (FAS) is essential to secure face recognition systems and has been extensively studied in recent years. Although deep neural networks (DNNs) for the FAS task have achieved promising results in intra-dataset experiments with similar distributions of training and testing data, the DNNs' generalization ability is limited under the cross-domain scenarios with different distributions of training and testing data. To improve the generalization ability, recent hybrid methods have been explored to extract task-aware handcrafted features (e.g., Local Binary Pattern) as discriminative information for the input of DNNs. However, the handcrafted feature extraction relies on experts' domain knowledge, and how to choose appropriate handcrafted features is underexplored. To this end, we propose a learnable network to extract Meta Pattern (MP) in our learning-to-learn framework. By replacing handcrafted features with the MP, the discriminative information from MP is capable of learning a more generalized model. Moreover, we devise a two-stream network to hierarchically fuse the input RGB image and the extracted MP by using our proposed Hierarchical Fusion Module (HFM). We conduct comprehensive experiments and show that our MP outperforms the compared handcrafted features. Also, our proposed method with HFM and the MP can achieve state-of-the-art performance on two different domain generalization evaluation benchmarks.
△ Less
Submitted 17 May, 2022; v1 submitted 13 October, 2021;
originally announced October 2021.
-
Low-Light Image Enhancement with Normalizing Flow
Authors:
Yufei Wang,
Renjie Wan,
Wenhan Yang,
Haoliang Li,
Lap-Pui Chau,
Alex C. Kot
Abstract:
To enhance low-light images to normally-exposed ones is highly ill-posed, namely that the map** relationship between them is one-to-many. Previous works based on the pixel-wise reconstruction losses and deterministic processes fail to capture the complex conditional distribution of normally exposed images, which results in improper brightness, residual noise, and artifacts. In this paper, we inv…
▽ More
To enhance low-light images to normally-exposed ones is highly ill-posed, namely that the map** relationship between them is one-to-many. Previous works based on the pixel-wise reconstruction losses and deterministic processes fail to capture the complex conditional distribution of normally exposed images, which results in improper brightness, residual noise, and artifacts. In this paper, we investigate to model this one-to-many relationship via a proposed normalizing flow model. An invertible network that takes the low-light images/features as the condition and learns to map the distribution of normally exposed images into a Gaussian distribution. In this way, the conditional distribution of the normally exposed images can be well modeled, and the enhancement process, i.e., the other inference direction of the invertible network, is equivalent to being constrained by a loss function that better describes the manifold structure of natural images during the training. The experimental results on the existing benchmark datasets show our method achieves better quantitative and qualitative results, obtaining better-exposed illumination, less noise and artifact, and richer colors.
△ Less
Submitted 13 September, 2021;
originally announced September 2021.
-
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Authors:
Runzhe Wan,
Lin Ge,
Rui Song
Abstract:
How to explore efficiently is a central problem in multi-armed bandits. In this paper, we introduce the metadata-based multi-task bandit problem, where the agent needs to solve a large number of related multi-armed bandit tasks and can leverage some task-specific features (i.e., metadata) to share knowledge across tasks. As a general framework, we propose to capture task relations through the lens…
▽ More
How to explore efficiently is a central problem in multi-armed bandits. In this paper, we introduce the metadata-based multi-task bandit problem, where the agent needs to solve a large number of related multi-armed bandit tasks and can leverage some task-specific features (i.e., metadata) to share knowledge across tasks. As a general framework, we propose to capture task relations through the lens of Bayesian hierarchical models, upon which a Thompson sampling algorithm is designed to efficiently learn task relations, share information, and minimize the cumulative regrets. Two concrete examples for Gaussian bandits and Bernoulli bandits are carefully analyzed. The Bayes regret for Gaussian bandits clearly demonstrates the benefits of information sharing with our algorithm. The proposed method is further supported by extensive experiments.
△ Less
Submitted 13 August, 2021;
originally announced August 2021.
-
Pattern Transfer Learning for Reinforcement Learning in Order Dispatching
Authors:
Runzhe Wan,
Sheng Zhang,
Chengchun Shi,
Shikai Luo,
Rui Song
Abstract:
Order dispatch is one of the central problems to ride-sharing platforms. Recently, value-based reinforcement learning algorithms have shown promising performance on this problem. However, in real-world applications, the non-stationarity of the demand-supply system poses challenges to re-utilizing data generated in different time periods to learn the value function. In this work, motivated by the f…
▽ More
Order dispatch is one of the central problems to ride-sharing platforms. Recently, value-based reinforcement learning algorithms have shown promising performance on this problem. However, in real-world applications, the non-stationarity of the demand-supply system poses challenges to re-utilizing data generated in different time periods to learn the value function. In this work, motivated by the fact that the relative relationship between the values of some states is largely stable across various environments, we propose a pattern transfer learning framework for value-based reinforcement learning in the order dispatch problem. Our method efficiently captures the value patterns by incorporating a concordance penalty. The superior performance of the proposed method is supported by experiments.
△ Less
Submitted 18 June, 2021; v1 submitted 27 May, 2021;
originally announced May 2021.
-
Quasistatic kinetic avalanches and self-organized criticality in deviatorically loaded granular media
Authors:
Jordi Baró,
Mehdi Pouragha,
Richard Wan,
Jörn Davidsen
Abstract:
The behavior of granular media under quasi-static loading has recently been shown to attain a stable evolution state corresponding to a manifold in the space of micromechanical variables. This state is characterized by sudden transitions between metastable jammed states, involving the partial micromechanical rearrangement of the granular medium. Using numerical simulations of two-dimensional granu…
▽ More
The behavior of granular media under quasi-static loading has recently been shown to attain a stable evolution state corresponding to a manifold in the space of micromechanical variables. This state is characterized by sudden transitions between metastable jammed states, involving the partial micromechanical rearrangement of the granular medium. Using numerical simulations of two-dimensional granular media under quasistatic biaxial compression, we show that the dynamics in the stable evolution state is characterized by scale-free avalanches well before the macromechanical stationary flow regime traditionally linked to a self-organized critical state. This, together with the non-uniqueness and the non-monotony of macroscopic deformation curves, suggests that the statistical avalanche properties and the susceptibilities of the system cannot be reduced to a function of the macromechanical state. The associated scaling exponents are non-universal and depend on the interactions between particles. For stiffer particles (or samples at low confining pressure) we find distributions of avalanche properties compatible with the predictions of mean-field theory. The scaling exponents decrease below the mean-field values for softer interactions between particles. These lower exponents are consistent with observations for amorphous solids at their critical point. We specifically discuss the relationship between microscopic and macroscopic variables, including the relation between the external stress drop and the internal potential energy released during kinetic avalanches.
△ Less
Submitted 13 May, 2021;
originally announced May 2021.
-
Deeply-Debiased Off-Policy Interval Estimation
Authors:
Chengchun Shi,
Runzhe Wan,
Victor Chernozhukov,
Rui Song
Abstract:
Off-policy evaluation learns a target policy's value with a historical dataset generated by a different behavior policy. In addition to a point estimate, many applications would benefit significantly from having a confidence interval (CI) that quantifies the uncertainty of the point estimate. In this paper, we propose a novel deeply-debiasing procedure to construct an efficient, robust, and flexib…
▽ More
Off-policy evaluation learns a target policy's value with a historical dataset generated by a different behavior policy. In addition to a point estimate, many applications would benefit significantly from having a confidence interval (CI) that quantifies the uncertainty of the point estimate. In this paper, we propose a novel deeply-debiasing procedure to construct an efficient, robust, and flexible CI on a target policy's value. Our method is justified by theoretical results and numerical experiments. A Python implementation of the proposed procedure is available at https://github.com/RunzheStat/D2OPE.
△ Less
Submitted 7 June, 2021; v1 submitted 10 May, 2021;
originally announced May 2021.
-
A Conversational Agent System for Dietary Supplements Use
Authors:
Esha Singh,
Anu Bompelli,
Ruyuan Wan,
Jiang Bian,
Serguei Pakhomov,
Rui Zhang
Abstract:
Dietary supplements (DS) have been widely used by consumers, but the information around the efficacy and safety of DS is disparate or incomplete, thus creating barriers for consumers to find information effectively. Conversational agent (CA) systems have been applied to the healthcare domain, but there is no such a system to answer consumers regarding DS use, although widespread use of DS. In this…
▽ More
Dietary supplements (DS) have been widely used by consumers, but the information around the efficacy and safety of DS is disparate or incomplete, thus creating barriers for consumers to find information effectively. Conversational agent (CA) systems have been applied to the healthcare domain, but there is no such a system to answer consumers regarding DS use, although widespread use of DS. In this study, we develop the first CA system for DS use
△ Less
Submitted 10 May, 2021; v1 submitted 4 April, 2021;
originally announced April 2021.
-
Social and behavioral determinants of health in the era of artificial intelligence with electronic health records: A sco** review
Authors:
Anusha Bompelli,
Yanshan Wang,
Ruyuan Wan,
Esha Singh,
Yuqi Zhou,
Lin Xu,
David Oniani,
Bhavani Singh Agnikula Kshatriya,
Joyce,
E. Balls-Berry,
Rui Zhang
Abstract:
Background: There is growing evidence that social and behavioral determinants of health (SBDH) play a substantial effect in a wide range of health outcomes. Electronic health records (EHRs) have been widely employed to conduct observational studies in the age of artificial intelligence (AI). However, there has been little research into how to make the most of SBDH information from EHRs. Methods: A…
▽ More
Background: There is growing evidence that social and behavioral determinants of health (SBDH) play a substantial effect in a wide range of health outcomes. Electronic health records (EHRs) have been widely employed to conduct observational studies in the age of artificial intelligence (AI). However, there has been little research into how to make the most of SBDH information from EHRs. Methods: A systematic search was conducted in six databases to find relevant peer-reviewed publications that had recently been published. Relevance was determined by screening and evaluating the articles. Based on selected relevant studies, a methodological analysis of AI algorithms leveraging SBDH information in EHR data was provided. Results: Our synthesis was driven by an analysis of SBDH categories, the relationship between SBDH and healthcare-related statuses, and several NLP approaches for extracting SDOH from clinical literature. Discussion: The associations between SBDH and health outcomes are complicated and diverse; several pathways may be involved. Using Natural Language Processing (NLP) technology to support the extraction of SBDH and other clinical ideas simplifies the identification and extraction of essential concepts from clinical data, efficiently unlocks unstructured data, and aids in the resolution of unstructured data-related issues. Conclusion: Despite known associations between SBDH and disease, SBDH factors are rarely investigated as interventions to improve patient outcomes. Gaining knowledge about SBDH and how SBDH data can be collected from EHRs using NLP approaches and predictive models improves the chances of influencing health policy change for patient wellness, and ultimately promoting health and health equity.
Keywords: Social and Behavioral Determinants of Health, Artificial Intelligence, Electronic Health Records, Natural Language Processing, Predictive Model
△ Less
Submitted 13 June, 2021; v1 submitted 22 January, 2021;
originally announced February 2021.
-
Relativistic Langevin dynamics: charm versus beauty
Authors:
Shuang Li,
Wei Xiong,
Renzhuo Wan
Abstract:
The production of heavy quarks (charm and beauty) provides unique insights into the transport properties of the Quark-Gluon Plasma (QGP) in heavy-ion collisions. Experimentally, the nuclear modification factor ${{R}_{\rm AA}}$ and the azimuthal anisotropy coefficient ${v}_{\rm 2}$ of heavy-flavor mesons are powerful observables to study the medium-related effects, such as energy loss and collectiv…
▽ More
The production of heavy quarks (charm and beauty) provides unique insights into the transport properties of the Quark-Gluon Plasma (QGP) in heavy-ion collisions. Experimentally, the nuclear modification factor ${{R}_{\rm AA}}$ and the azimuthal anisotropy coefficient ${v}_{\rm 2}$ of heavy-flavor mesons are powerful observables to study the medium-related effects, such as energy loss and collectivity, on the heavy quark propagation through the QGP evolution. The latest measurements of the prompt and non-prompt open heavy-flavor hadrons allow a systematic comparison of the transport behaviors probed by charm and beauty quarks. In this work we make such an attempt utilizing our recently developed framework. By performing a quantitative investigation of ${{R}_{\rm AA}}$ and ${v}_{\rm 2}$, it is found that both charm and beauty quarks are efficient probes to capture the dynamical features of QGP, in particular the resulting mass hierarchy for the energy loss and azimuthal anisotropy, which are well inherited by the various $D/B$-meson species. Moreover, our calculations can describe simultaneously ${{R}_{\rm AA}}$ and ${v}_{\rm 2}$ data for the prompt and non-prompt $D^{0}$ mesons in central ($0-10\%$) and semi-central ($30-50\%$) Pb--Pb collisions at $\sqrt{s_{\rm NN}}=5.02~{\rm TeV}$. The predictions for $B$-meson observables for upcoming experimental tests are also made down to the low momentum region.
△ Less
Submitted 4 December, 2020;
originally announced December 2020.
-
Multiscale Modeling of Elasto-Plasticity in Heterogeneous Geomaterials Based on Continuum Micromechanics
Authors:
Mahdad Eghbalian,
Mehdi Pouragha,
Richard Wan
Abstract:
In this paper, we investigate some micromechanical aspects of elasto-plasticity in heterogeneous geomaterials. The aim is to upscale the elasto-plastic behavior for a representative volume of the material which is indeed a very challenging task due to the irreversible deformations involved. Considering the plastic strains as eigen-strains allows us to employ the powerful tools offered by Continuum…
▽ More
In this paper, we investigate some micromechanical aspects of elasto-plasticity in heterogeneous geomaterials. The aim is to upscale the elasto-plastic behavior for a representative volume of the material which is indeed a very challenging task due to the irreversible deformations involved. Considering the plastic strains as eigen-strains allows us to employ the powerful tools offered by Continuum Micromechanics which are mainly developed for upscaling of eigen-stressed elastic media. The validity of such eigen-strain based formulation of multiscale elasto-plasticity is herein examined by comparing its predictions against Finite Element (FE) simulations.
△ Less
Submitted 24 November, 2020;
originally announced November 2020.
-
$L^p$ bound for the Hilbert transform along variable non-flat curves
Authors:
Renhui Wan
Abstract:
We prove the $L^p$ bound for the Hilbert transform along variable non-flat curves $(t,u(x)[t]^α+v(x)[t]^β)$, where $α$ and $β$ satisfy
$α\neq β,\ α\neq 1,\ β\neq 1.$
Comparing with the associated theorem in \cite{GHLJ} investigating the case $α=β\neq 1$, our result is more general while the proof is more involved. To achieve our goal, we divide the frequency of the objective function into thre…
▽ More
We prove the $L^p$ bound for the Hilbert transform along variable non-flat curves $(t,u(x)[t]^α+v(x)[t]^β)$, where $α$ and $β$ satisfy
$α\neq β,\ α\neq 1,\ β\neq 1.$
Comparing with the associated theorem in \cite{GHLJ} investigating the case $α=β\neq 1$, our result is more general while the proof is more involved. To achieve our goal, we divide the frequency of the objective function into three cases and take different strategies to control these cases. Furthermore, we need to introduce a "short" shift maximal function $\mathfrak{M}^{[n]}$ to establish some pointwise estimate.
△ Less
Submitted 14 October, 2020;
originally announced October 2020.
-
CS-MCNet:A Video Compressive Sensing Reconstruction Network with Interpretable Motion Compensation
Authors:
Bowen Huang,
**jia Zhou,
Xiao Yan,
Ming'e **g,
Rentao Wan,
Yibo Fan
Abstract:
In this paper, a deep neural network with interpretable motion compensation called CS-MCNet is proposed to realize high-quality and real-time decoding of video compressive sensing. Firstly, explicit multi-hypothesis motion compensation is applied in our network to extract correlation information of adjacent frames(as shown in Fig. 1), which improves the recover performance. And then, a residual mo…
▽ More
In this paper, a deep neural network with interpretable motion compensation called CS-MCNet is proposed to realize high-quality and real-time decoding of video compressive sensing. Firstly, explicit multi-hypothesis motion compensation is applied in our network to extract correlation information of adjacent frames(as shown in Fig. 1), which improves the recover performance. And then, a residual module further narrows down the gap between reconstruction result and original signal. The overall architecture is interpretable by using algorithm unrolling, which brings the benefits of being able to transfer prior knowledge about the conventional algorithms. As a result, a PSNR of 22dB can be achieved at 64x compression ratio, which is about 4% to 9% better than state-of-the-art methods. In addition, due to the feed-forward architecture, the reconstruction can be processed by our network in real time and up to three orders of magnitude faster than traditional iterative methods.
△ Less
Submitted 8 October, 2020;
originally announced October 2020.
-
Domain Generalization for Medical Imaging Classification with Linear-Dependency Regularization
Authors:
Haoliang Li,
YuFei Wang,
Renjie Wan,
Shiqi Wang,
Tie-Qiang Li,
Alex C. Kot
Abstract:
Recently, we have witnessed great progress in the field of medical imaging classification by adopting deep neural networks. However, the recent advanced models still require accessing sufficiently large and representative datasets for training, which is often unfeasible in clinically realistic environments. When trained on limited datasets, the deep neural network is lack of generalization capabil…
▽ More
Recently, we have witnessed great progress in the field of medical imaging classification by adopting deep neural networks. However, the recent advanced models still require accessing sufficiently large and representative datasets for training, which is often unfeasible in clinically realistic environments. When trained on limited datasets, the deep neural network is lack of generalization capability, as the trained deep neural network on data within a certain distribution (e.g. the data captured by a certain device vendor or patient population) may not be able to generalize to the data with another distribution.
In this paper, we introduce a simple but effective approach to improve the generalization capability of deep neural networks in the field of medical imaging classification. Motivated by the observation that the domain variability of the medical images is to some extent compact, we propose to learn a representative feature space through variational encoding with a novel linear-dependency regularization term to capture the shareable information among medical data collected from different domains. As a result, the trained neural network is expected to equip with better generalization capability to the "unseen" medical data. Experimental results on two challenging medical imaging classification tasks indicate that our method can achieve better cross-domain generalization capability compared with state-of-the-art baselines.
△ Less
Submitted 29 October, 2020; v1 submitted 27 September, 2020;
originally announced September 2020.
-
Light Can Hack Your Face! Black-box Backdoor Attack on Face Recognition Systems
Authors:
Haoliang Li,
Yufei Wang,
Xiaofei Xie,
Yang Liu,
Shiqi Wang,
Renjie Wan,
Lap-Pui Chau,
Alex C. Kot
Abstract:
Deep neural networks (DNN) have shown great success in many computer vision applications. However, they are also known to be susceptible to backdoor attacks. When conducting backdoor attacks, most of the existing approaches assume that the targeted DNN is always available, and an attacker can always inject a specific pattern to the training data to further fine-tune the DNN model. However, in prac…
▽ More
Deep neural networks (DNN) have shown great success in many computer vision applications. However, they are also known to be susceptible to backdoor attacks. When conducting backdoor attacks, most of the existing approaches assume that the targeted DNN is always available, and an attacker can always inject a specific pattern to the training data to further fine-tune the DNN model. However, in practice, such attack may not be feasible as the DNN model is encrypted and only available to the secure enclave.
In this paper, we propose a novel black-box backdoor attack technique on face recognition systems, which can be conducted without the knowledge of the targeted DNN model. To be specific, we propose a backdoor attack with a novel color stripe pattern trigger, which can be generated by modulating LED in a specialized waveform. We also use an evolutionary computing strategy to optimize the waveform for backdoor attack. Our backdoor attack can be conducted in a very mild condition: 1) the adversary cannot manipulate the input in an unnatural way (e.g., injecting adversarial noise); 2) the adversary cannot access the training database; 3) the adversary has no knowledge of the training model as well as the training set used by the victim party.
We show that the backdoor trigger can be quite effective, where the attack success rate can be up to $88\%$ based on our simulation study and up to $40\%$ based on our physical-domain study by considering the task of face recognition and verification based on at most three-time attempts during authentication. Finally, we evaluate several state-of-the-art potential defenses towards backdoor attacks, and find that our attack can still be effective. We highlight that our study revealed a new physical backdoor attack, which calls for the attention of the security issue of the existing face recognition/verification techniques.
△ Less
Submitted 15 September, 2020;
originally announced September 2020.
-
Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control
Authors:
Runzhe Wan,
Xinyu Zhang,
Rui Song
Abstract:
Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health. Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses. In the face of an emerging infectious disease, a crucial question for policymakers is how to make the trade-off and implement the appropriate intervent…
▽ More
Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health. Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses. In the face of an emerging infectious disease, a crucial question for policymakers is how to make the trade-off and implement the appropriate interventions timely given the huge uncertainty. In this work, we propose a Multi-Objective Model-based Reinforcement Learning framework to facilitate data-driven decision-making and minimize the overall long-term cost. Specifically, at each decision point, a Bayesian epidemiological model is first learned as the environment model, and then the proposed model-based multi-objective planning algorithm is applied to find a set of Pareto-optimal policies. This framework, combined with the prediction bands for each policy, provides a real-time decision support tool for policymakers. The application is demonstrated with the spread of COVID-19 in China.
△ Less
Submitted 26 February, 2022; v1 submitted 9 September, 2020;
originally announced September 2020.
-
Batch Policy Learning in Average Reward Markov Decision Processes
Authors:
Peng Liao,
Zhengling Qi,
Runzhe Wan,
Predrag Klasnja,
Susan Murphy
Abstract:
We consider the batch (off-line) policy learning problem in the infinite horizon Markov Decision Process. Motivated by mobile health applications, we focus on learning a policy that maximizes the long-term average reward. We propose a doubly robust estimator for the average reward and show that it achieves semiparametric efficiency. Further we develop an optimization algorithm to compute the optim…
▽ More
We consider the batch (off-line) policy learning problem in the infinite horizon Markov Decision Process. Motivated by mobile health applications, we focus on learning a policy that maximizes the long-term average reward. We propose a doubly robust estimator for the average reward and show that it achieves semiparametric efficiency. Further we develop an optimization algorithm to compute the optimal policy in a parameterized stochastic policy class. The performance of the estimated policy is measured by the difference between the optimal average reward in the policy class and the average reward of the estimated policy and we establish a finite-sample regret guarantee. The performance of the method is illustrated by simulation studies and an analysis of a mobile health study promoting physical activity.
△ Less
Submitted 17 September, 2022; v1 submitted 22 July, 2020;
originally announced July 2020.
-
Spherical Motion Dynamics: Learning Dynamics of Neural Network with Normalization, Weight Decay, and SGD
Authors:
Ruosi Wan,
Zhanxing Zhu,
Xiangyu Zhang,
Jian Sun
Abstract:
In this work, we comprehensively reveal the learning dynamics of neural network with normalization, weight decay (WD), and SGD (with momentum), named as Spherical Motion Dynamics (SMD). Most related works study SMD by focusing on "effective learning rate" in "equilibrium" condition, where weight norm remains unchanged. However, their discussions on why equilibrium condition can be reached in SMD i…
▽ More
In this work, we comprehensively reveal the learning dynamics of neural network with normalization, weight decay (WD), and SGD (with momentum), named as Spherical Motion Dynamics (SMD). Most related works study SMD by focusing on "effective learning rate" in "equilibrium" condition, where weight norm remains unchanged. However, their discussions on why equilibrium condition can be reached in SMD is either absent or less convincing. Our work investigates SMD by directly exploring the cause of equilibrium condition. Specifically, 1) we introduce the assumptions that can lead to equilibrium condition in SMD, and prove that weight norm can converge at linear rate with given assumptions; 2) we propose "angular update" as a substitute for effective learning rate to measure the evolving of neural network in SMD, and prove angular update can also converge to its theoretical value at linear rate; 3) we verify our assumptions and theoretical results on various computer vision tasks including ImageNet and MSCOCO with standard settings. Experiment results show our theoretical findings agree well with empirical observations.
△ Less
Submitted 27 November, 2020; v1 submitted 15 June, 2020;
originally announced June 2020.
-
Angle-based Search Space Shrinking for Neural Architecture Search
Authors:
Yiming Hu,
Yuding Liang,
Zichao Guo,
Ruosi Wan,
Xiangyu Zhang,
Yichen Wei,
Qingyi Gu,
Jian Sun
Abstract:
In this work, we present a simple and general search space shrinking method, called Angle-Based search space Shrinking (ABS), for Neural Architecture Search (NAS). Our approach progressively simplifies the original search space by drop** unpromising candidates, thus can reduce difficulties for existing NAS methods to find superior architectures. In particular, we propose an angle-based metric to…
▽ More
In this work, we present a simple and general search space shrinking method, called Angle-Based search space Shrinking (ABS), for Neural Architecture Search (NAS). Our approach progressively simplifies the original search space by drop** unpromising candidates, thus can reduce difficulties for existing NAS methods to find superior architectures. In particular, we propose an angle-based metric to guide the shrinking process. We provide comprehensive evidences showing that, in weight-sharing supernet, the proposed metric is more stable and accurate than accuracy-based and magnitude-based metrics to predict the capability of child models. We also show that the angle-based metric can converge fast while training supernet, enabling us to get promising shrunk search spaces efficiently. ABS can easily apply to most of NAS approaches (e.g. SPOS, FairNAS, ProxylessNAS, DARTS and PDARTS). Comprehensive experiments show that ABS can dramatically enhance existing NAS approaches by providing a promising shrunk search space.
△ Less
Submitted 16 July, 2020; v1 submitted 28 April, 2020;
originally announced April 2020.
-
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making
Authors:
Chengchun Shi,
Runzhe Wan,
Rui Song,
Wenbin Lu,
Ling Leng
Abstract:
The Markov assumption (MA) is fundamental to the empirical validity of reinforcement learning. In this paper, we propose a novel Forward-Backward Learning procedure to test MA in sequential decision making. The proposed test does not assume any parametric form on the joint distribution of the observed data and plays an important role for identifying the optimal policy in high-order Markov decision…
▽ More
The Markov assumption (MA) is fundamental to the empirical validity of reinforcement learning. In this paper, we propose a novel Forward-Backward Learning procedure to test MA in sequential decision making. The proposed test does not assume any parametric form on the joint distribution of the observed data and plays an important role for identifying the optimal policy in high-order Markov decision processes and partially observable MDPs. We apply our test to both synthetic datasets and a real data example from mobile health studies to illustrate its usefulness.
△ Less
Submitted 5 February, 2020;
originally announced February 2020.
-
Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization
Authors:
Junjie Yan,
Ruosi Wan,
Xiangyu Zhang,
Wei Zhang,
Yichen Wei,
Jian Sun
Abstract:
Batch Normalization (BN) is one of the most widely used techniques in Deep Learning field. But its performance can awfully degrade with insufficient batch size. This weakness limits the usage of BN on many computer vision tasks like detection or segmentation, where batch size is usually small due to the constraint of memory consumption. Therefore many modified normalization techniques have been pr…
▽ More
Batch Normalization (BN) is one of the most widely used techniques in Deep Learning field. But its performance can awfully degrade with insufficient batch size. This weakness limits the usage of BN on many computer vision tasks like detection or segmentation, where batch size is usually small due to the constraint of memory consumption. Therefore many modified normalization techniques have been proposed, which either fail to restore the performance of BN completely, or have to introduce additional nonlinear operations in inference procedure and increase huge consumption. In this paper, we reveal that there are two extra batch statistics involved in backward propagation of BN, on which has never been well discussed before. The extra batch statistics associated with gradients also can severely affect the training of deep neural network. Based on our analysis, we propose a novel normalization method, named Moving Average Batch Normalization (MABN). MABN can completely restore the performance of vanilla BN in small batch cases, without introducing any additional nonlinear operations in inference procedure. We prove the benefits of MABN by both theoretical analysis and experiments. Our experiments demonstrate the effectiveness of MABN in multiple computer vision tasks including ImageNet and COCO. The code has been released in https://github.com/megvii-model/MABN.
△ Less
Submitted 8 April, 2020; v1 submitted 19 January, 2020;
originally announced January 2020.
-
Towards Making Deep Transfer Learning Never Hurt
Authors:
Ruosi Wan,
Haoyi Xiong,
Xingjian Li,
Zhanxing Zhu,
Jun Huan
Abstract:
Transfer learning have been frequently used to improve deep neural network training through incorporating weights of pre-trained networks as the starting-point of optimization for regularization. While deep transfer learning can usually boost the performance with better accuracy and faster convergence, transferring weights from inappropriate networks hurts training procedure and may lead to even l…
▽ More
Transfer learning have been frequently used to improve deep neural network training through incorporating weights of pre-trained networks as the starting-point of optimization for regularization. While deep transfer learning can usually boost the performance with better accuracy and faster convergence, transferring weights from inappropriate networks hurts training procedure and may lead to even lower accuracy. In this paper, we consider deep transfer learning as minimizing a linear combination of empirical loss and regularizer based on pre-trained weights, where the regularizer would restrict the training procedure from lowering the empirical loss, with conflicted descent directions (e.g., derivatives). Following the view, we propose a novel strategy making regularization-based Deep Transfer learning Never Hurt (DTNH) that, for each iteration of training procedure, computes the derivatives of the two terms separately, then re-estimates a new descent direction that does not hurt the empirical loss minimization while preserving the regularization affects from the pre-trained weights. Extensive experiments have been done using common transfer learning regularizers, such as L2-SP and knowledge distillation, on top of a wide range of deep transfer learning benchmarks including Caltech, MIT indoor 67, CIFAR-10 and ImageNet. The empirical results show that the proposed descent direction estimation strategy DTNH can always improve the performance of deep transfer learning tasks based on all above regularizers, even when transferring pre-trained weights from inappropriate networks. All in all, DTNH strategy can improve state-of-the-art regularizers in all cases with 0.1%--7% higher accuracy in all experiments.
△ Less
Submitted 18 November, 2019;
originally announced November 2019.
-
Face Image Reflection Removal
Authors:
Renjie Wan,
Boxin Shi,
Haoliang Li,
Ling-Yu Duan,
Alex C. Kot
Abstract:
Face images captured through the glass are usually contaminated by reflections. The non-transmitted reflections make the reflection removal more challenging than for general scenes, because important facial features are completely occluded. In this paper, we propose and solve the face image reflection removal problem. We remove non-transmitted reflections by incorporating inpainting ideas into a g…
▽ More
Face images captured through the glass are usually contaminated by reflections. The non-transmitted reflections make the reflection removal more challenging than for general scenes, because important facial features are completely occluded. In this paper, we propose and solve the face image reflection removal problem. We remove non-transmitted reflections by incorporating inpainting ideas into a guided reflection removal framework and recover facial features by considering various face-specific priors. We use a newly collected face reflection image dataset to train our model and compare with state-of-the-art methods. The proposed method shows advantages in estimating reflection-free face images for improving face recognition.
△ Less
Submitted 3 March, 2019;
originally announced March 2019.
-
Probing the transport properties of Quark-Gluon Plasma via heavy-flavor Boltzmann and Langevin dynamics
Authors:
Shuang Li,
Chaowen Wang,
Renzhuo Wan,
**feng Liao
Abstract:
The heavy quark propagation behavior inside the quark-gluon plasma (QGP), is usually described in terms of the Boltzmann dynamics, which can be reduced to the Langevin approach by assuming a small momentum transfer for the scattering processes between heavy quarks and the QGP constituents. In this work, the temperature and energy dependence of the transport coefficients are calculated in the frame…
▽ More
The heavy quark propagation behavior inside the quark-gluon plasma (QGP), is usually described in terms of the Boltzmann dynamics, which can be reduced to the Langevin approach by assuming a small momentum transfer for the scattering processes between heavy quarks and the QGP constituents. In this work, the temperature and energy dependence of the transport coefficients are calculated in the framework of both Boltzmann and Langevin dynamics. The derived transport coefficients are found to be systematically larger in the Boltzmann approach as compared with the Langevin, in particular in the high temperature and high energy region. Within each of the two theoretical frameworks, we simulate the charm quark production and the subsequent evolution processes in relativistic heavy-ion collisions. We find that the total in-medium energy loss is larger from the Langevin dynamics, resulting in a smaller (larger) $R_{\rm AA}$ at high (low) $p_{\rm T}$, for both the charm quark and heavy-flavor mesons. Meanwhile, the Boltzmann model is found to induce larger $v_{\rm 2}$, in particular at moderate $p_{\rm T}$, as well as stronger broadening behavior for the azimuthal distributions. By comparing the model calculations with available experimental measurements for D-mesons, we find that the Langevin approach is more favored by the $R_{\rm AA}$ data while the Boltzmann approach is more favored favor by the $v_{\rm 2}$ data. A simultaneous description of both observables appear challenging for both models.
△ Less
Submitted 20 May, 2019; v1 submitted 14 January, 2019;
originally announced January 2019.
-
Jet shape modifications at the LHC energies by JEWEL
Authors:
Renzhuo Wan,
Lei Ding,
Xi Gui,
Fan Yang,
Shuang Li,
Daicui Zhou
Abstract:
Jet shape measurements are strongly suggested to explore the microscopic evolution mechanism of parton-medium interaction in ultra-relativistic heavy-ion collisions. In this paper, jet shape modifications are quantified by fragmentation function $F(z)$, relative momentum $p_{T}^{rel}$, density of charged particles $ρ(r)$, jet angularity $girth$, jet momentum dispersion $p_{T}^{Disp}$ and $LeSub$ f…
▽ More
Jet shape measurements are strongly suggested to explore the microscopic evolution mechanism of parton-medium interaction in ultra-relativistic heavy-ion collisions. In this paper, jet shape modifications are quantified by fragmentation function $F(z)$, relative momentum $p_{T}^{rel}$, density of charged particles $ρ(r)$, jet angularity $girth$, jet momentum dispersion $p_{T}^{Disp}$ and $LeSub$ for proton-proton collisions at $900GeV$, $5.02TeV$, $7TeV$ and $13TeV$, as well as for lead-lead collisions at $2.76TeV$ and $5.02TeV$. A differential jet shape parameters $D_{girth}$ is proposed and studied at smaller-radius jet $r<0.3$. The results indicate that medium effect is dominant for jet shape modifications, which is less dependent on center of mass energy. The jet fragmentation is enhanced significantly at very low $z<0.02$ and fragmented jet constituents are spread to larger jet-radius linearly for $p_{T}^{rel}<1$. The waveform attenuate phenomena is observed in $p_{T}^{rel}$, $girth$ and $D_{girth}$ distributions. The comparison results on $D_{girth}$ from $pp$ to $Pb+Pb$ where the wave-like distribution in $pp$ collision is ahead of $Pb+Pb$ collisions in small jet-radius intervals, is interesting to hint the medium effect.
△ Less
Submitted 4 March, 2019; v1 submitted 25 December, 2018;
originally announced December 2018.
-
Learning and Anticipating Future Actions During Exploratory Data Analysis
Authors:
Ran Wan,
Roman Garnett,
Alvitta Ottley
Abstract:
The goal of visual analytics is to create a symbiosis between human and computer by leveraging their unique strengths. While this model has demonstrated immense success, we are yet to realize the full potential of such a human-computer partnership. In a perfect collaborative mixed-initiative system, the computer must possess skills for learning and anticipating the users' needs. Addressing this ga…
▽ More
The goal of visual analytics is to create a symbiosis between human and computer by leveraging their unique strengths. While this model has demonstrated immense success, we are yet to realize the full potential of such a human-computer partnership. In a perfect collaborative mixed-initiative system, the computer must possess skills for learning and anticipating the users' needs. Addressing this gap, we propose a framework for inferring focus areas from passive observations of the user's actions, thereby allowing accurate predictions of future events. We evaluate this technique with a crime map and demonstrate that users' clicks appear in our prediction set 95% - 97% of the time. Further analysis shows that we can achieve high prediction accuracy typically after three clicks. Altogether, we show that passive observations of interaction data can reveal valuable information that will allow the system to learn and anticipate future events, laying the foundation for next-generation tools.
△ Less
Submitted 25 September, 2018;
originally announced September 2018.
-
On the temporal decay for the 2D non-resistive incompressible MHD equations
Authors:
Renhui Wan
Abstract:
Califano-Chiuderi \cite{CC} gave the numerical observation that the energy of the MHD equations is dissipated at a rate independent of the ohmic resistivity, which was first proved by \cite{RWXZ}[Ren et al., J. Funct. Anal., 2014] (the initial data near $(0,\vec{e}_1)$, $\vec{e}_1=(1,0)$). Precisely, they showed some explicit decay rates of solutions in $L^2$ norm. So a nature question is whether…
▽ More
Califano-Chiuderi \cite{CC} gave the numerical observation that the energy of the MHD equations is dissipated at a rate independent of the ohmic resistivity, which was first proved by \cite{RWXZ}[Ren et al., J. Funct. Anal., 2014] (the initial data near $(0,\vec{e}_1)$, $\vec{e}_1=(1,0)$). Precisely, they showed some explicit decay rates of solutions in $L^2$ norm. So a nature question is whether the obtained decay rates in \cite{RWXZ} is optimal. In this paper, we aim at giving the explicit decay rates of solutions in both $L^2$ norm and $L^\infty$ norm. In particular, our decay rate in terms of $L^2$ norm improves the previous work \cite{RWXZ}.
△ Less
Submitted 4 November, 2018; v1 submitted 29 June, 2018;
originally announced June 2018.
-
Neural Control Variates for Variance Reduction
Authors:
Ruosi Wan,
Mingjun Zhong,
Haoyi Xiong,
Zhanxing Zhu
Abstract:
In statistics and machine learning, approximation of an intractable integration is often achieved by using the unbiased Monte Carlo estimator, but the variances of the estimation are generally high in many applications. Control variates approaches are well-known to reduce the variance of the estimation. These control variates are typically constructed by employing predefined parametric functions o…
▽ More
In statistics and machine learning, approximation of an intractable integration is often achieved by using the unbiased Monte Carlo estimator, but the variances of the estimation are generally high in many applications. Control variates approaches are well-known to reduce the variance of the estimation. These control variates are typically constructed by employing predefined parametric functions or polynomials, determined by using those samples drawn from the relevant distributions. Instead, we propose to construct those control variates by learning neural networks to handle the cases when test functions are complex. In many applications, obtaining a large number of samples for Monte Carlo estimation is expensive, which may result in overfitting when training a neural network. We thus further propose to employ auxiliary random variables induced by the original ones to extend data samples for training the neural networks. We apply the proposed control variates with augmented variables to thermodynamic integration and reinforcement learning. Experimental results demonstrate that our method can achieve significant variance reduction compared with other alternatives.
△ Less
Submitted 15 October, 2019; v1 submitted 31 May, 2018;
originally announced June 2018.
-
CRRN: Multi-Scale Guided Concurrent Reflection Removal Network
Authors:
Renjie Wan,
Boxin Shi,
Ling-Yu Duan,
Ah-Hwee Tan,
Alex C. Kot
Abstract:
Removing the undesired reflections from images taken through the glass is of broad application to various computer vision tasks. Non-learning based methods utilize different handcrafted priors such as the separable sparse gradients caused by different levels of blurs, which often fail due to their limited description capability to the properties of real-world reflections. In this paper, we propose…
▽ More
Removing the undesired reflections from images taken through the glass is of broad application to various computer vision tasks. Non-learning based methods utilize different handcrafted priors such as the separable sparse gradients caused by different levels of blurs, which often fail due to their limited description capability to the properties of real-world reflections. In this paper, we propose the Concurrent Reflection Removal Network (CRRN) to tackle this problem in a unified framework. Our proposed network integrates image appearance information and multi-scale gradient information with human perception inspired loss function, and is trained on a new dataset with 3250 reflection images taken under diverse real-world scenes. Extensive experiments on a public benchmark dataset show that the proposed method performs favorably against state-of-the-art methods.
△ Less
Submitted 30 May, 2018;
originally announced May 2018.
-
Triple Exponential Relaxation Dynamics in a Metallacrown-Based {$Dy^{III}Cu^{II}_5$} 3d-4f Single-Molecule Magnet
Authors:
Quan-Wen Li,
Rui-Chen Wan,
** Wang,
Yan-Cong Chen,
Jun-Liang Liu,
Daniel Reta,
Nicholas F. Chilton,
Zhen-Xing Wang,
Ming-Liang Tong
Abstract:
The interplay of strong single-ion anisotropy and magnetic interactions often give rise to novel magnetic behavior and can provide additional routes for controlling magnetization dynamics. However, novel effects arising from interactions between lanthanide and transition-metal ions are nowadays rarely observed. Herein, a {$Dy^{III}Cu^{II}_5$} 3d-4f single-molecule magnet (SMM) is constructed as a…
▽ More
The interplay of strong single-ion anisotropy and magnetic interactions often give rise to novel magnetic behavior and can provide additional routes for controlling magnetization dynamics. However, novel effects arising from interactions between lanthanide and transition-metal ions are nowadays rarely observed. Herein, a {$Dy^{III}Cu^{II}_5$} 3d-4f single-molecule magnet (SMM) is constructed as a rigid and planar [15-MC-5] metallacrown (MC), where the $Dy^{III}$ ion is trapped in the central pseudo-$D_{5h}$ pocket. A strong axial crystal field (CF) imbues the $Dy^{III}$ ion with large Ising-type magnetic anisotropy, and we are able to observe and model the magnetic interactions between the $Cu^{II}-Cu^{II}$ and $Dy^{III}-Cu^{II}$ pairs. Butterfly-shaped magnetic hysteresis shows clear steps at $\pm$0.4 T, coincident with level crossings in our model exchange Hamiltonian between the {$Cu^{II}_5$} and $Dy^{III}$ spin systems. Most intriguingly, this air-stable SMM exhibits three distinct regimes in its magnetic relaxation dynamics, all clearly displaying an exponential dependence on temperature.
△ Less
Submitted 16 April, 2018;
originally announced April 2018.
-
Context-Aware Mixed Reality: A Framework for Ubiquitous Interaction
Authors:
Long Chen,
Wen Tang,
Nigel John,
Tao Ruan Wan,
Jian Jun Zhang
Abstract:
Mixed Reality (MR) is a powerful interactive technology that yields new types of user experience. We present a semantic based interactive MR framework that exceeds the current geometry level approaches, a step change in generating high-level context-aware interactions. Our key insight is to build semantic understanding in MR that not only can greatly enhance user experience through object-specific…
▽ More
Mixed Reality (MR) is a powerful interactive technology that yields new types of user experience. We present a semantic based interactive MR framework that exceeds the current geometry level approaches, a step change in generating high-level context-aware interactions. Our key insight is to build semantic understanding in MR that not only can greatly enhance user experience through object-specific behaviours, but also pave the way for solving complex interaction design challenges. The framework generates semantic properties of the real world environment through dense scene reconstruction and deep image understanding. We demonstrate our approach with a material-aware prototype system for generating context-aware physical interactions between the real and the virtual objects. Quantitative and qualitative evaluations are carried out and the results show that the framework delivers accurate and fast semantic information in interactive MR environment, providing effective semantic level interactions.
△ Less
Submitted 14 March, 2018;
originally announced March 2018.
-
Global well-posedness for the 2D Boussinesq equations with a velocity dam** term
Authors:
Renhui Wan
Abstract:
In this paper, we prove global well-posedness of smooth solutions to the two-dimensional incompressible Boussinesq equations with only a velocity dam** term when the initial data is close to an nontrivial equilibrium state $(0,x_2)$. As a by-product, under this equilibrium state, our result gives a positive answer to the question proposed by [ACWX] (see P.3597).
In this paper, we prove global well-posedness of smooth solutions to the two-dimensional incompressible Boussinesq equations with only a velocity dam** term when the initial data is close to an nontrivial equilibrium state $(0,x_2)$. As a by-product, under this equilibrium state, our result gives a positive answer to the question proposed by [ACWX] (see P.3597).
△ Less
Submitted 23 December, 2018; v1 submitted 8 August, 2017;
originally announced August 2017.
-
Augmented Reality for Depth Cues in Monocular Minimally Invasive Surgery
Authors:
Long Chen,
Wen Tang,
Nigel W. John,
Tao Ruan Wan,
Jian Jun Zhang
Abstract:
One of the major challenges in Minimally Invasive Surgery (MIS) such as laparoscopy is the lack of depth perception. In recent years, laparoscopic scene tracking and surface reconstruction has been a focus of investigation to provide rich additional information to aid the surgical process and compensate for the depth perception issue. However, robust 3D surface reconstruction and augmented reality…
▽ More
One of the major challenges in Minimally Invasive Surgery (MIS) such as laparoscopy is the lack of depth perception. In recent years, laparoscopic scene tracking and surface reconstruction has been a focus of investigation to provide rich additional information to aid the surgical process and compensate for the depth perception issue. However, robust 3D surface reconstruction and augmented reality with depth perception on the reconstructed scene are yet to be reported. This paper presents our work in this area. First, we adopt a state-of-the-art visual simultaneous localization and map** (SLAM) framework - ORB-SLAM - and extend the algorithm for use in MIS scenes for reliable endoscopic camera tracking and salient point map**. We then develop a robust global 3D surface reconstruction frame- work based on the sparse point clouds extracted from the SLAM framework. Our approach is to combine an outlier removal filter within a Moving Least Squares smoothing algorithm and then employ Poisson surface reconstruction to obtain smooth surfaces from the unstructured sparse point cloud. Our proposed method has been quantitatively evaluated compared with ground-truth camera trajectories and the organ model surface we used to render the synthetic simulation videos. In vivo laparoscopic videos used in the tests have demonstrated the robustness and accuracy of our proposed framework on both camera tracking and surface reconstruction, illustrating the potential of our algorithm for depth augmentation and depth-corrected augmented reality in MIS with monocular endoscopes.
△ Less
Submitted 1 March, 2017;
originally announced March 2017.
-
Ill-posedness for the 3D inhomogeneous Navier-Stokes equations in the critical Besov space near $L^6$ framework
Authors:
Renhui Wan
Abstract:
We prove the ill-posedness for the 3D incompressible inhomogeneous Navier-stokes equations in critical Besov space. In particular, a norm inflation happens in finite time with the initial data satisfying $$\|a_0\|_{\dot{B}_{p,1}^\frac{3}{p}}+\|u_0\|_{\dot{B}_{6,1}^{-\frac{1}{2}}}\le δ,\ p>6$$ or $$\|a_0\|_{\dot{B}_{6,1}^\frac{1}{2}}+\|u_0\|_{\dot{B}_{p,1}^{\frac{3}{p}-1}}\le δ,\ p>6.$$ To obtain t…
▽ More
We prove the ill-posedness for the 3D incompressible inhomogeneous Navier-stokes equations in critical Besov space. In particular, a norm inflation happens in finite time with the initial data satisfying $$\|a_0\|_{\dot{B}_{p,1}^\frac{3}{p}}+\|u_0\|_{\dot{B}_{6,1}^{-\frac{1}{2}}}\le δ,\ p>6$$ or $$\|a_0\|_{\dot{B}_{6,1}^\frac{1}{2}}+\|u_0\|_{\dot{B}_{p,1}^{\frac{3}{p}-1}}\le δ,\ p>6.$$ To obtain the norm inflation, we construct a special class of initial data and introduce a modified pressure. Comparing with the classical Navier-Stokes equations in $L^\infty$ framework, we can obtain the ill-posedness for the inhomogeneous case in near $L^6$ framework.
△ Less
Submitted 12 October, 2017; v1 submitted 15 September, 2016;
originally announced September 2016.
-
Two-photon assisted clock comparison to picosecond precision
Authors:
Shi-Wei Zhang,
Jia-Zheng Song,
Yin-** Yao,
Ren-Gang Wan,
Tong-Yi Zhang
Abstract:
We have experimentally demonstrated a clock comparison scheme utilizing time-correlated photon pairs generated from the spontaneous parametric down conversion process of a laser pumped beta-barium borate crystal. The coincidence of two-photon events are analyzed by the cross correlation of the two time stamp sequences. Combining the coarse and fine part of the time differences at different resolut…
▽ More
We have experimentally demonstrated a clock comparison scheme utilizing time-correlated photon pairs generated from the spontaneous parametric down conversion process of a laser pumped beta-barium borate crystal. The coincidence of two-photon events are analyzed by the cross correlation of the two time stamp sequences. Combining the coarse and fine part of the time differences at different resolutions, a 64 ps precision for clock synchronization has been realized. We also investigate the effects of hardware devices used in the system on the precision of clock comparison. The results indicate that the detector's time jitter and the background noise will degrade the system performance. With this method, comparison and synchronization of two remote clocks could be implemented with a precision at the level of a few tens of picoseconds.
△ Less
Submitted 28 September, 2015;
originally announced September 2015.
-
Global well-posedness to the subcritical Oldroyd-B type models in 2D
Authors:
Renhui Wan
Abstract:
We prove the global well-posedness to the 2D Oldroyd-B type models with $νΛ^{2α}u$ and $ηΛ^{2β}τ$ satisfying $(i)\ α>1, η=0$ or $(ii)\ α=1,\ β>0$. By establishing the gradient estimate of $u$, $τ$ and $L^\infty$ bound of ${\rm curl u+Λ^{-2}curldiv τ}$, Elgidi-Rousset (Commun. Pure Appl. Math. online, 2015) obtained the global well-posedness for the case $ν=0$, $β=1$. However, for the cases $(i)$ a…
▽ More
We prove the global well-posedness to the 2D Oldroyd-B type models with $νΛ^{2α}u$ and $ηΛ^{2β}τ$ satisfying $(i)\ α>1, η=0$ or $(ii)\ α=1,\ β>0$. By establishing the gradient estimate of $u$, $τ$ and $L^\infty$ bound of ${\rm curl u+Λ^{-2}curldiv τ}$, Elgidi-Rousset (Commun. Pure Appl. Math. online, 2015) obtained the global well-posedness for the case $ν=0$, $β=1$. However, for the cases $(i)$ and $(ii)$, it is difficult to improve the regularity of $u$ and $τ$ directly, especially when $α\rightarrow 1^{+}$ in case $(i)$ and $β\rightarrow 0^{+}$ in case $(ii)$. To overcome this difficulty, we exploit a new structure of the equations coming from the dissipation and coupled term. Then we prove the global well-posedness to these cases by energy method which brings us closer to the more interesting case $α=1$, $η=0$.
△ Less
Submitted 28 September, 2015; v1 submitted 25 September, 2015;
originally announced September 2015.
-
Global well-posedness to the 3D incompressible MHD equations with a new class of large initial data
Authors:
Renhui Wan
Abstract:
We obtain the global well-posedness to the 3D incompressible magnetohydrodynamics (MHD) equations in Besov space with negative index of regularity. Particularly, we can get the global solutions for a new class of large initial data. As a byproduct, this result improves the corresponding result in \cite{HHW}. In addition, we also get the global result for this system in $\mathcalχ^{-1}(\R^3)$ origi…
▽ More
We obtain the global well-posedness to the 3D incompressible magnetohydrodynamics (MHD) equations in Besov space with negative index of regularity. Particularly, we can get the global solutions for a new class of large initial data. As a byproduct, this result improves the corresponding result in \cite{HHW}. In addition, we also get the global result for this system in $\mathcalχ^{-1}(\R^3)$ originally developed in \cite{LL}. More precisely, we only assume that the norm of initial data is exactly smaller than the sum of viscosity and diffusivity parameters.
△ Less
Submitted 25 September, 2015;
originally announced September 2015.
-
Asymmetric Nanoparticle May Go Active at Room Temperature
Authors:
Nan Sheng,
YuSong Tu,
Pan Guo,
RongZheng Wan,
ZuoWei Wang,
Hai** Fang
Abstract:
Using molecular dynamics simulations, we show that an asymmetrically shaped nanoparticle in dilute solution possesses a spontaneously curved trajectory within finite time interval, instead of the generally expected random walk. This unexpected dynamic behavior has a similarity to that of active matters, such as swimming bacteria, cells or even fishes, but is of a different physical origin. The key…
▽ More
Using molecular dynamics simulations, we show that an asymmetrically shaped nanoparticle in dilute solution possesses a spontaneously curved trajectory within finite time interval, instead of the generally expected random walk. This unexpected dynamic behavior has a similarity to that of active matters, such as swimming bacteria, cells or even fishes, but is of a different physical origin. The key to the curved trajectory lies in the non-zero resultant force originated from the imbalance of the collision forces acted by surrounding solvent molecules on the shaped nanoparticle during its orientation regulation. Theoretical formulae based on the microscopic observation have been derived to describe this non-zero force and the resulted motion of the nanoparticle.
△ Less
Submitted 11 October, 2016; v1 submitted 30 August, 2015;
originally announced August 2015.
-
Global small solutions to a tropical climate model without thermal diffusion
Authors:
Renhui Wan
Abstract:
We obtain the global well-posedness of classical solutions to a tropical climate model derived by Feireisl-Majda-Pauluis in \cite{FMP} with only the dissipation of the first baroclinic model of the velocity ($-ηΔv$) under small initial data. The main difficulty is the absence of thermal diffusion as the work by Li-Titi in \cite{LT}. To overcome it, we exploit the structure of the equations coming…
▽ More
We obtain the global well-posedness of classical solutions to a tropical climate model derived by Feireisl-Majda-Pauluis in \cite{FMP} with only the dissipation of the first baroclinic model of the velocity ($-ηΔv$) under small initial data. The main difficulty is the absence of thermal diffusion as the work by Li-Titi in \cite{LT}. To overcome it, we exploit the structure of the equations coming from the coupled terms, dissipation term and damp term. Then we find the hidden thermal diffusion. In addition, based on the Littlewood-Palay theory, we establish a generalized commutator estimate, which may be applied to other partial differential equations.
△ Less
Submitted 6 July, 2015; v1 submitted 23 June, 2015;
originally announced June 2015.
-
Spontaneous Directional Motion of Shaped Nanoparticle
Authors:
Nan Sheng,
YuSong Tu,
Pan Guo,
RongZheng Wan,
ZuoWei Wang,
Hai** Fang
Abstract:
In nanoscale space and pico- to nanoseconds enormous physical, chemical and biological processes take place, while the motions of involved particles/molecules under thermal fluctuations are usually analyzed using the conventional theory of diffusive Brownian motion based on both sufficiently long time averaging and assumptions of spherical particle shapes. Here, using molecular dynamics simulation…
▽ More
In nanoscale space and pico- to nanoseconds enormous physical, chemical and biological processes take place, while the motions of involved particles/molecules under thermal fluctuations are usually analyzed using the conventional theory of diffusive Brownian motion based on both sufficiently long time averaging and assumptions of spherical particle shapes. Here, using molecular dynamics simulations, we show that asymmetrically shaped nanoparticles in dilute solutions possess spontaneous directional motion of the center of mass within a finite time interval. The driving force for this unexpected directional motion lies in the imbalance of the interactions experienced by their constituent atoms during the orientation regulation at timescales before the onset of diffusive Brownian motion. Theoretical formulae have been derived to describe the mean displacement and the variance of this directional motion. Our study potentially takes an important step towards establishing a complete theoretical framework for describing the motions of variously-shaped particles in solutions over all timescales from ballistic to diffusive regime.
△ Less
Submitted 28 January, 2016; v1 submitted 9 March, 2015;
originally announced March 2015.
-
On the uniqueness for the 2D MHD equations without magnetic diffusion
Authors:
Renhui Wan
Abstract:
In this paper, we obtain the uniqueness of the 2D MHD equations, which fills the gap of recent work \cite{1} by Chemin et al.
In this paper, we obtain the uniqueness of the 2D MHD equations, which fills the gap of recent work \cite{1} by Chemin et al.
△ Less
Submitted 12 March, 2015;
originally announced March 2015.
-
Simulation of Schottky-Barrier Phosphorene Transistors
Authors:
Runlai Wan,
Xi Cao,
**g Guo
Abstract:
Schottky barrier field-effect transistors (SBFETs) based on few and mono layer phosphorene are simulated by the non-equilibrium Green's function formalism. It is shown that scaling down the gate oxide thickness results in pronounced ambipolar I-V characteristics and significant increase of the minimal leakage current. The problem of leakage is especially severe when the gate insulator is thin and…
▽ More
Schottky barrier field-effect transistors (SBFETs) based on few and mono layer phosphorene are simulated by the non-equilibrium Green's function formalism. It is shown that scaling down the gate oxide thickness results in pronounced ambipolar I-V characteristics and significant increase of the minimal leakage current. The problem of leakage is especially severe when the gate insulator is thin and the number of layer is large, but can be effectively suppressed by reducing phosphorene to mono or bilayer. Different from two-dimensional graphene and layered dichalcogenide materials, both the ON-current of the phosphorene SBFETs and the metal-semiconductor contact resistance between metal and phosphorene strongly depend on the transport crystalline direction.
△ Less
Submitted 21 August, 2014;
originally announced August 2014.
-
Local well-posedness for the Hall-MHD equations with fractional magnetic diffusion
Authors:
Dongho Chae,
Renhui Wan,
Jiahong Wu
Abstract:
The Hall-magnetohydrodynamics (Hall-MHD) equations, rigorously derived from kinetic models, are useful in describing many physical phenomena in geophysics and astrophysics. This paper studies the local well-posedness of classical solutions to the Hall-MHD equations with the magnetic diffusion given by a fractional Laplacian operator, $(-Δ)^α$. Due to the presence of the Hall term in the Hall-MHD e…
▽ More
The Hall-magnetohydrodynamics (Hall-MHD) equations, rigorously derived from kinetic models, are useful in describing many physical phenomena in geophysics and astrophysics. This paper studies the local well-posedness of classical solutions to the Hall-MHD equations with the magnetic diffusion given by a fractional Laplacian operator, $(-Δ)^α$. Due to the presence of the Hall term in the Hall-MHD equations, standard energy estimates appear to indicate that we need $α\ge 1$ in order to obtain the local well-posedness. This paper breaks the barrier and shows that the fractional Hall-MHD equations are locally well-posed for any $α>\frac12$. The approach here fully exploits the smoothing effects of the dissipation and establishes the local bounds for the Sobolev norms through the Besov space techniques. The method presented here may be applicable to similar situations involving other partial differential equations.
△ Less
Submitted 29 January, 2015; v1 submitted 2 April, 2014;
originally announced April 2014.
-
Tunneling induced dark states and controllable fluorescence spectrum in quantum-dot molecules
Authors:
Si-Cong Tian,
Ren-Gang Wan,
Cun-Zhu Tong,
Yong-Qiang Ning,
Li-Jun Wang
Abstract:
We theoretically investigate the spectrum of the fluorescence from triple quantum-dot molecules and demonstrate that it is possible to use tunneling to induce dark states. Unlike the atomic system, in quantum-dot molecules we can use tunneling to create the dark states and control fluorescence emission, requiring no coupling lasers. And interesting features such as quenching and narrowing of the f…
▽ More
We theoretically investigate the spectrum of the fluorescence from triple quantum-dot molecules and demonstrate that it is possible to use tunneling to induce dark states. Unlike the atomic system, in quantum-dot molecules we can use tunneling to create the dark states and control fluorescence emission, requiring no coupling lasers. And interesting features such as quenching and narrowing of the fluorescence can be obtained. We also explain the spectrum with the transition properties of the dressed states generated by the coupling of the laser and the two tunneling. The quenching of the fluorescence is due to the tunneling induced dark states, while the narrowing of the central peak is due to the slow decay rate of the dressed levels.
△ Less
Submitted 12 November, 2013;
originally announced November 2013.
-
Tunneling induced transparency and controllable group velocity in triple and multiple quantum-dot molecules
Authors:
Si-Cong Tian,
Cun-Zhu Tong,
Ren-Gang Wan,
Yong-Qiang Ning,
Li-Jun Wang
Abstract:
We analyze the interaction of a triple quantum dot molecules controlled by the tunneling coupling instead of coupling laser. A general analytic expression for the steady-state linear susceptibility for a probe-laser field is obtained and we show that the system can exhibit two transparency windows. The group velocity of the probe-laser pulse is also analyzed. By changing the tunneling couplings, t…
▽ More
We analyze the interaction of a triple quantum dot molecules controlled by the tunneling coupling instead of coupling laser. A general analytic expression for the steady-state linear susceptibility for a probe-laser field is obtained and we show that the system can exhibit two transparency windows. The group velocity of the probe-laser pulse is also analyzed. By changing the tunneling couplings, two laser pulses with different central frequency can propagate with the same group velocity. And the group velocity can be as low as 300 m/s in our system. We extend our analysis to the case of multiple quantum dot molecules (the number of the quantum dots is N) and show that the system can exhibit at most N-1 transparency windows. And at most N-1 laser pulses with different central frequencies can be slowed down.
△ Less
Submitted 17 October, 2013;
originally announced October 2013.
-
Dynamical bag in a chiral quark model
Authors:
Duojie Jia,
LianChun Yu,
Rui-Bin Wan
Abstract:
A type of bag function is proposed to make the MIT bag surface of baryon dynamical. It is illustrated through renormalization of the quark field that the softening of chiral bag gives rise to a model of chiral quark with effectively-generated mass of quark, in which confined quark moves in the background of nonlinear pion. A prediction of bag constant $B$ $\simeq 2f_π^{2}m_π^{2}\allowbreak $ is ma…
▽ More
A type of bag function is proposed to make the MIT bag surface of baryon dynamical. It is illustrated through renormalization of the quark field that the softening of chiral bag gives rise to a model of chiral quark with effectively-generated mass of quark, in which confined quark moves in the background of nonlinear pion. A prediction of bag constant $B$ $\simeq 2f_π^{2}m_π^{2}\allowbreak $ is made. With two free parameters, the self-coupling $e$ of pion and the confining scale $a$, the computed mass, the charge root-mean-square radius and magnetic moment of the proton are in good agreement with the experimental values.
△ Less
Submitted 3 August, 2013;
originally announced August 2013.
-
Asymmetrical free diffusion with orientation-dependence of molecules in finite timescales
Authors:
Nan Sheng,
Yusong Tu,
Pan Guo,
Rongzheng Wan,
Hai** Fang
Abstract:
Using molecular dynamics simulations, we show that free diffusion of a nanoscale particle (molecule) with asymmetric structure critically depends on the orientation in a finite timescale of picoseconds to nanoseconds. In a timescale of ~100 ps, there are ~10% more possibilities for the particle moving along the initial orientation than moving opposite to the orientation; and the diffusion distance…
▽ More
Using molecular dynamics simulations, we show that free diffusion of a nanoscale particle (molecule) with asymmetric structure critically depends on the orientation in a finite timescale of picoseconds to nanoseconds. In a timescale of ~100 ps, there are ~10% more possibilities for the particle moving along the initial orientation than moving opposite to the orientation; and the diffusion distances of the particle reach ~1 nm. We find that the key to this observation is the orientation-dependence of the dam** force to the moving of the nanoscale particle and a finite time is required to regulate the particle orientation. This finding extends the work of Einstein to nano-world beyond random Brownian motion, thus will have a critical role in the understanding of the nanoscale world.
△ Less
Submitted 26 July, 2013;
originally announced July 2013.