-
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Authors:
Chunyu Xuan,
Yazhe Niu,
Yuan Pu,
Shuai Hu,
Yu Liu,
**g Yang
Abstract:
Monte Carlo Tree Search (MCTS)-based algorithms, such as MuZero and its derivatives, have achieved widespread success in various decision-making domains. These algorithms employ the reanalyze process to enhance sample efficiency from stale data, albeit at the expense of significant wall-clock time consumption. To address this issue, we propose a general approach named ReZero to boost tree search o…
▽ More
Monte Carlo Tree Search (MCTS)-based algorithms, such as MuZero and its derivatives, have achieved widespread success in various decision-making domains. These algorithms employ the reanalyze process to enhance sample efficiency from stale data, albeit at the expense of significant wall-clock time consumption. To address this issue, we propose a general approach named ReZero to boost tree search operations for MCTS-based algorithms. Specifically, drawing inspiration from the one-armed bandit model, we reanalyze training samples through a backward-view reuse technique which obtains the value estimation of a certain child node in advance. To further adapt to this design, we periodically reanalyze the entire buffer instead of frequently reanalyzing the mini-batch. The synergy of these two designs can significantly reduce the search cost and meanwhile guarantee or even improve performance, simplifying both data collecting and reanalyzing. Experiments conducted on Atari environments and board games demonstrate that ReZero substantially improves training speed while maintaining high sample efficiency. The code is available as part of the LightZero benchmark at https://github.com/opendilab/LightZero.
△ Less
Submitted 28 May, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
Addressing the Scalability Bottleneck of Semantic Technologies at Bosch
Authors:
Diego Rincon-Yanez,
Mohamed H. Gad-Elrab,
Daria Stepanova,
Kien Trung Tran,
Cuong Chu Xuan,
Baifan Zhou,
Evgeny Karlamov
Abstract:
At the heart of smart manufacturing is real-time semi-automatic decision-making. Such decisions are vital for optimizing production lines, e.g., reducing resource consumption, improving the quality of discrete manufacturing operations, and optimizing the actual products, e.g., optimizing the sampling rate for measuring product dimensions during production. Such decision-making relies on massive in…
▽ More
At the heart of smart manufacturing is real-time semi-automatic decision-making. Such decisions are vital for optimizing production lines, e.g., reducing resource consumption, improving the quality of discrete manufacturing operations, and optimizing the actual products, e.g., optimizing the sampling rate for measuring product dimensions during production. Such decision-making relies on massive industrial data thus posing a real-time processing bottleneck.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
Constrained Proximal Policy Optimization
Authors:
Chengbin Xuan,
Feng Zhang,
Faliang Yin,
Hak-Keung Lam
Abstract:
The problem of constrained reinforcement learning (CRL) holds significant importance as it provides a framework for addressing critical safety satisfaction concerns in the field of reinforcement learning (RL). However, with the introduction of constraint satisfaction, the current CRL methods necessitate the utilization of second-order optimization or primal-dual frameworks with additional Lagrangi…
▽ More
The problem of constrained reinforcement learning (CRL) holds significant importance as it provides a framework for addressing critical safety satisfaction concerns in the field of reinforcement learning (RL). However, with the introduction of constraint satisfaction, the current CRL methods necessitate the utilization of second-order optimization or primal-dual frameworks with additional Lagrangian multipliers, resulting in increased complexity and inefficiency during implementation. To address these issues, we propose a novel first-order feasible method named Constrained Proximal Policy Optimization (CPPO). By treating the CRL problem as a probabilistic inference problem, our approach integrates the Expectation-Maximization framework to solve it through two steps: 1) calculating the optimal policy distribution within the feasible region (E-step), and 2) conducting a first-order update to adjust the current policy towards the optimal policy obtained in the E-step (M-step). We establish the relationship between the probability ratios and KL divergence to convert the E-step into a convex optimization problem. Furthermore, we develop an iterative heuristic algorithm from a geometric perspective to solve this problem. Additionally, we introduce a conservative update mechanism to overcome the constraint violation issue that occurs in the existing feasible region method. Empirical evaluations conducted in complex and uncertain environments validate the effectiveness of our proposed method, as it performs at least as well as other baselines.
△ Less
Submitted 23 May, 2023;
originally announced May 2023.
-
Active control of particle position by boundary slip in inertial microfluidics
Authors:
Chengliang Xuan,
Weiyin Liang,
Bing He,
Binghai Wen
Abstract:
Inertial microfluidic is able to focus and separate particles in microchannels based on the characteristic geometry and intrinsic hydrodynamic effect. Yet, the vertical position of suspended particles in the microchannel cannot be manipulated in real time. In this study, we utilize the boundary slip effect to regulate the parabolic velocity distribution of fluid in the microchannel and present a s…
▽ More
Inertial microfluidic is able to focus and separate particles in microchannels based on the characteristic geometry and intrinsic hydrodynamic effect. Yet, the vertical position of suspended particles in the microchannel cannot be manipulated in real time. In this study, we utilize the boundary slip effect to regulate the parabolic velocity distribution of fluid in the microchannel and present a scheme to active control the vertical position of particles in inertial microfluidics. The flow field of a microchannel with a unilateral slip boundary is equivalent to that of the microchannel widened by the relevant slip length, and the particle equilibrium positions in the two microchannels are consistent consequently. Then, we simulate the lateral migrations of three kinds of typical particles, namely circle, ellipse, and rectangle in the microchannel. Unlike the smooth trajectories of circular particles, the motions of the elliptical and rectangular particles are accompanied by regular fluctuations and non-uniform rotations due to their non-circular geometries. The results demonstrate that the unilateral slip boundary can effectively control the vertical equilibrium position of particles. Thus, the present scheme enables to active manipulate the particles positions in vertical direction and can promote more accurate focusing, separating, and transport in inertial microfluidics.
△ Less
Submitted 14 March, 2022;
originally announced March 2022.
-
Optimal Ternary Linear Complementary Dual Codes
Authors:
Liangdong Lu,
Ruihu Li,
Qiang Fu,
Chen Xuan,
Wen** Ma
Abstract:
Linear complementary dual (LCD) codes introduced by Massey are the codes whose intersections with their dual codes are trivial. It can help to improve the security of the information processed by sensitive devices, especially against side-channel attacks (SCA) and fault invasive attacks. In this paper, By construction of puncturing, extending, shortening and combination codes, many good ternary LC…
▽ More
Linear complementary dual (LCD) codes introduced by Massey are the codes whose intersections with their dual codes are trivial. It can help to improve the security of the information processed by sensitive devices, especially against side-channel attacks (SCA) and fault invasive attacks. In this paper, By construction of puncturing, extending, shortening and combination codes, many good ternary LCD codes are presented. We give a Table 1 with the values of $d_{LCD}(n,k)$ for length $ n \leq 20$. In addition, Many of these ternary LCD codes given in this paper are optimal which are saturating the lower or upper bound of Grassl's codetable in \cite{Grassl} and some of them are nearly optimal.
△ Less
Submitted 25 December, 2020; v1 submitted 3 December, 2020;
originally announced December 2020.
-
An End-to-End Encryption Solution for Enterprise Content Applications
Authors:
Chaoting Xuan
Abstract:
The content host services (like Dropbox, OneDrive, and Google Drive) used by enterprise customers are deployed either on premise or in cloud. Because users may store business-sensitive data (contents) in these hosting services, they may want to protect their data from disclosure to anyone else, even IT administrators. Unfortunately, even contents (files) are encrypted in the hosting services, they…
▽ More
The content host services (like Dropbox, OneDrive, and Google Drive) used by enterprise customers are deployed either on premise or in cloud. Because users may store business-sensitive data (contents) in these hosting services, they may want to protect their data from disclosure to anyone else, even IT administrators. Unfortunately, even contents (files) are encrypted in the hosting services, they sometimes are still accessible to IT administrators today. The sensitive data could be exposed to public if the IT administrator turns malicious (like disgruntled employee) or his account is compromised by hackers.
We propose an end-to-end encryption (E2EE) solution to address this challenge. The user data is encrypted at client side (mobile device) and remains encrypted in transit and at rest on server. Specifically, we design a new method to allow master secret recover and escrow, while protecting them from being accessed by malicious administrators. In addition, we present a content (file) encryption scheme that achieves privacy, and granular access control. And it can be seamlessly integrated with major content host services used by business users today.
△ Less
Submitted 1 June, 2020;
originally announced June 2020.
-
From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories
Authors:
Songwei Ge,
Curtis Xuan,
Ruihua Song,
Chao Zou,
Wei Liu,
** Zhou
Abstract:
Sound effects play an essential role in producing high-quality radio stories but require enormous labor cost to add. In this paper, we address the problem of automatically adding sound effects to radio stories with a retrieval-based model. However, directly implementing a tag-based retrieval model leads to high false positives due to the ambiguity of story contents. To solve this problem, we intro…
▽ More
Sound effects play an essential role in producing high-quality radio stories but require enormous labor cost to add. In this paper, we address the problem of automatically adding sound effects to radio stories with a retrieval-based model. However, directly implementing a tag-based retrieval model leads to high false positives due to the ambiguity of story contents. To solve this problem, we introduce a retrieval-based framework hybridized with a semantic inference model which helps to achieve robust retrieval results. Our model relies on fine-designed features extracted from the context of candidate triggers. We collect two story dubbing datasets through crowdsourcing to analyze the setting of adding sound effects and to train and test our proposed methods. We further discuss the importance of each feature and introduce several heuristic rules for the trade-off between precision and recall. Together with the text-to-speech technology, our results reveal a promising automatic pipeline on producing high-quality radio stories.
△ Less
Submitted 20 August, 2019;
originally announced August 2019.
-
Vision-based Robotic Arm Imitation by Human Gesture
Authors:
Cheng Xuan,
Zhiqiang Tang,
**xin Xu
Abstract:
One of the most efficient ways for a learning-based robotic arm to learn to process complex tasks as human, is to directly learn from observing how human complete those tasks, and then imitate. Our idea is based on success of Deep Q-Learning (DQN) algorithm according to reinforcement learning, and then extend to Deep Deterministic Policy Gradient (DDPG) algorithm. We developed a learning-based met…
▽ More
One of the most efficient ways for a learning-based robotic arm to learn to process complex tasks as human, is to directly learn from observing how human complete those tasks, and then imitate. Our idea is based on success of Deep Q-Learning (DQN) algorithm according to reinforcement learning, and then extend to Deep Deterministic Policy Gradient (DDPG) algorithm. We developed a learning-based method, combining modified DDPG and visual imitation network. Our approach acquires frames only from a monocular camera, and no need to either construct a 3D environment or generate actual points. The result we expected during training, was that robot would be able to move as almost the same as how human hands did.
△ Less
Submitted 4 October, 2018; v1 submitted 14 March, 2017;
originally announced March 2017.
-
The Plateau-Rayleigh instability in solids is a simple phase separation
Authors:
Chen Xuan,
John S. Biggins
Abstract:
A long elastic cylinder, radius $a$ and shear-modulus $μ$, becomes unstable given sufficient surface tension $γ$. We show this instability can be simply understood by considering the energy, $E(λ)$, of such a cylinder subject to a homogenous longitudinal stretch $λ$. Although $E(λ)$ has a unique minimum, if surface tension is sufficient ($Γ\equivγ/(aμ)>\sqrt{32}$) it looses convexity in a finite r…
▽ More
A long elastic cylinder, radius $a$ and shear-modulus $μ$, becomes unstable given sufficient surface tension $γ$. We show this instability can be simply understood by considering the energy, $E(λ)$, of such a cylinder subject to a homogenous longitudinal stretch $λ$. Although $E(λ)$ has a unique minimum, if surface tension is sufficient ($Γ\equivγ/(aμ)>\sqrt{32}$) it looses convexity in a finite region. We use a Maxwell construction to show that, if stretched into this region, the cylinder will phase separate into two segments with different stretches $λ_1$ and $λ_2$. Our model thus explains why the instability has infinite wavelength, and allows us to calculate the instability's sub-critical hysteresis loop (as a function of imposed stretch), showing that instability proceeds with constant amplitude and at constant (positive) tension as the cylinder is stretched between $λ_1$ and $λ_2$. We use full nonlinear finite-element calculations to verify these predictions, and to characterize the interface between the two phases. Near $Γ=\sqrt{32}$ the length of such an interface diverges introducing a new length-scale and allowing us to construct a 1-D effective theory. This treatment yields an analytic expression for the interface itself, revealing its characteristic length grows as $l_{wall}\sim a/\sqrt{Γ-\sqrt{32}}$.
△ Less
Submitted 13 January, 2017;
originally announced January 2017.
-
Exploring the cylindrical photo-bending shape in polydomain nematic glass
Authors:
Chen Xuan,
Changwei Xu,
Yongzhong Huo
Abstract:
This paper explores different photo-bending shapes in polydomain nematic glass. The motivation is to explain the phenomenon in experiment [1] under polarized light in which a nematic film curls into an circular arc, like part of a cylindrical surface. Polarized light triggers photo-isomerization and therefore makes liquid crystals (LCs) contract along their directors. We apply the Sachs limit to h…
▽ More
This paper explores different photo-bending shapes in polydomain nematic glass. The motivation is to explain the phenomenon in experiment [1] under polarized light in which a nematic film curls into an circular arc, like part of a cylindrical surface. Polarized light triggers photo-isomerization and therefore makes liquid crystals (LCs) contract along their directors. We apply the Sachs limit to homogenize the deformation of polydomain LC glass. Photo-strain can be either contraction or expansion through the material. Bending shapes can be anticlastic, bowl-shaped and cylindrical affected by Poisson ratio and illumination intensity. An explanation for the cylindrical bend and ways to observe other shapes are given in a parameter plane.
△ Less
Submitted 17 May, 2016;
originally announced May 2016.
-
Polarization dependence of photo-mechanical behavior of monodomain liquid crystal polymeric materials
Authors:
Chen Xuan,
Changwei Xu,
Yongzhong Huo
Abstract:
Polarization dependence of opto-mechanical behavior of monodomain photochromic glassy liquid crystal (LC) polymers under polarized ultraviolet light (PUV) is studied. Trans-cis photo-isomerization is generally known to be most intense at 'parallel illumination' (polarization parallel to LC director), as light-medium interactions are active when polarization aligns with trainsition dipole moment. W…
▽ More
Polarization dependence of opto-mechanical behavior of monodomain photochromic glassy liquid crystal (LC) polymers under polarized ultraviolet light (PUV) is studied. Trans-cis photo-isomerization is generally known to be most intense at 'parallel illumination' (polarization parallel to LC director), as light-medium interactions are active when polarization aligns with trainsition dipole moment. We show that at parallel illumination though cis isomers are converted from trans the most near surface, they can be the least below certain light propagation depth. Membrane force, an average effect of trans-cis conversion over propagation depths, shows a monotonic polarization dependence, i.e. maximum at parallel illumination, which agrees well with experiment [1]. However, under strong illumination, cis fraction/photo-contraction distribution through depths shows deep penetration, switching over the polarization dependence in photo-moment, which is related to photo-contraction gradient ---- photo-moment can be maximum at 'perpendicular illumination' (polarization perpendicular to director) under strong light. We give both intuitive explanation and analytical demonstration in thin strip limit for the switchover.
△ Less
Submitted 22 April, 2016;
originally announced May 2016.
-
Finite wavelength surface-tension driven instabilities in soft solids, including instability in a cylindrical channel through an elastic solid
Authors:
Chen Xuan,
John Biggins
Abstract:
We deploy linear stability analysis to find the threshold wavelength ($λ$) and surface tension ($γ$) of Rayleigh-Plateau type "peristaltic" instabilities in incompressible neo-Hookean solids in a range of cylindrical geometries with radius $R_0$. First we consider a solid cylinder, and recover the well-known, infinite wavelength instability for $γ\ge6 μR_0$, where $μ$ is the solid's shear modulus.…
▽ More
We deploy linear stability analysis to find the threshold wavelength ($λ$) and surface tension ($γ$) of Rayleigh-Plateau type "peristaltic" instabilities in incompressible neo-Hookean solids in a range of cylindrical geometries with radius $R_0$. First we consider a solid cylinder, and recover the well-known, infinite wavelength instability for $γ\ge6 μR_0$, where $μ$ is the solid's shear modulus. Second, we consider a volume-conserving (e.g.\ fluid filled and sealed) cylindrical cavity through an infinite solid, and demonstrate infinite wavelength instability for $γ\ge 2 μR_0$. Third, we consider a solid cylinder embedded in a different infinite solid, and find a finite wavelength instability with $λ\propto R_0$, at surface tension $γ\propto μR_0$, where the constants depend on the two solids' modulus ratio. Finally, we consider an empty cylindrical channel (or filled with expellable fluid) through an infinite solid, and find an instability with finite wavelength, $λ\approx2 R_0$, for $γ\ge 2.543... μR_0$. Using finite-strain numerics, we show such a channel jumps at instability to a highly peristaltic state, likely precipitating it's blockage or failure. We argue that finite wavelengths are generic for elasto-capillary instabilities, with the simple cylinder's infinite wavelength being the exception rather than the rule.
△ Less
Submitted 19 July, 2016; v1 submitted 16 December, 2015;
originally announced December 2015.
-
Deep optical penetration dynamics in photo-bending
Authors:
Daniel Corbett,
Chen Xuan,
Mark Warner
Abstract:
We model both the photo-stationary state and dynamics of an illuminated, photo-sensitive, glassy liquid crystalline sheet. To illustrate the interplay between local tilt $θ$ of the sheet, effective incident intensity, curvature and dynamics, we adopt the simplest variation of local incident light intensity with angle, that is $\cosθ$. The tilt in the stationary state never overshoots the vertical,…
▽ More
We model both the photo-stationary state and dynamics of an illuminated, photo-sensitive, glassy liquid crystalline sheet. To illustrate the interplay between local tilt $θ$ of the sheet, effective incident intensity, curvature and dynamics, we adopt the simplest variation of local incident light intensity with angle, that is $\cosθ$. The tilt in the stationary state never overshoots the vertical, but maximum curvature could be seen in the middle of the sheet for intense light. In dynamics, overshoot and self-eclipsing arise, revealing how important moving fronts of light penetration are. Eclipsing is qualitatively as in the experiments of Ikeda and Yu (2003).
△ Less
Submitted 6 July, 2015; v1 submitted 6 July, 2015;
originally announced July 2015.
-
Coherent Resonances Observed in the Dissociative Electron Attachments to Carbon Monoxide
Authors:
Xu-Dong Wang,
Chuan-** Xuan,
Yi Luo,
Shan Xi Tian
Abstract:
Succeeding our previous finding about coherent interference of the resonant states of CO^- formed by the low-energy electron attachment [Phys. Rev. A 88, 012708 (2013)], here we provide more evidences of the coherent interference, in particular, we find the state configuration change in the interference with the increase of electron attachment energy by measuring the completely backward distributi…
▽ More
Succeeding our previous finding about coherent interference of the resonant states of CO^- formed by the low-energy electron attachment [Phys. Rev. A 88, 012708 (2013)], here we provide more evidences of the coherent interference, in particular, we find the state configuration change in the interference with the increase of electron attachment energy by measuring the completely backward distributions of the O^- fragment ion of the temporary CO^- in an energy range 11.3-12.6 eV. Therefore, different pure states, namely, coherent resonances, can be formed when the close-lying resonant states are coherently superposed by a broad-band electron pulse.
△ Less
Submitted 2 March, 2015;
originally announced March 2015.
-
Strong microwave absorption observed in dielectric La1.5Sr0.5NiO4 nanoparticles
Authors:
P. T. Tho,
C. T. A. Xuan,
D. M. Quang,
T. N. Bach,
N. T. H. Le,
T. D. Thanh,
N. X. Phuc,
D. N. H. Nam
Abstract:
La$_{1.5}$Sr$_{0.5}$NiO$_4$ is well known to have a colossal dielectric constant ($\varepsilon_R>10^7$). The La$_{1.5}$Sr$_{0.5}$NiO$_4$ nanoparticle powder was prepared by a combinational method of solid state reaction and high-energy ball milling. Magnetic measurements show that the material has a very small magnetic moment and paramagnetic characteristic at room temperature. The mixture of the…
▽ More
La$_{1.5}$Sr$_{0.5}$NiO$_4$ is well known to have a colossal dielectric constant ($\varepsilon_R>10^7$). The La$_{1.5}$Sr$_{0.5}$NiO$_4$ nanoparticle powder was prepared by a combinational method of solid state reaction and high-energy ball milling. Magnetic measurements show that the material has a very small magnetic moment and paramagnetic characteristic at room temperature. The mixture of the nanoparticle powder (40% vol.) and paraffin (60% vol.) coated in the form of flat layers of different thicknesses ($t$) exhibits strong microwave absorption resonances in the 4-18 GHz range. The reflection loss ($RL$) decreases with $t$ and reaches down to -36.7 dB for $t=3.0$ mm. The impedance matching ($|Z|=Z_0=377$ $Ω$), rather than the phase matching mechanism, is found responsible for the resonance observed in the samples with $1<t\leq3.0$ mm. Further increase of the thickness leads to $|Z|>Z_0$ at all frequencies and a reduced absorption. The influence of non-metal backing is also discussed. Our observation suggests that La$_{1.5}$Sr$_{0.5}$NiO$_4$ nanoparticles could be used as good fillers for high performance radar absorbing material.
△ Less
Submitted 12 August, 2013;
originally announced August 2013.