-
Origin of extended Main Sequence Turn Off in open cluster NGC 2355
Authors:
Jayanand Maurya,
M. R. Samal,
Louis Amard,
Yu Zhang,
Hubiao Niu,
Sang Chul Kim,
Y. C. Joshi,
B. Kumar
Abstract:
The presence of extended Main Sequence Turn-Off (eMSTO) in the open clusters has been attributed to various factors, such as spread in rotation rates, binary stars, and dust-like extinction from stellar excretion discs. We present a comprehensive analysis of the eMSTO in the open cluster NGC 2355. Using spectra from the Gaia-ESO archives, we find that the stars in the red part of the eMSTO have a…
▽ More
The presence of extended Main Sequence Turn-Off (eMSTO) in the open clusters has been attributed to various factors, such as spread in rotation rates, binary stars, and dust-like extinction from stellar excretion discs. We present a comprehensive analysis of the eMSTO in the open cluster NGC 2355. Using spectra from the Gaia-ESO archives, we find that the stars in the red part of the eMSTO have a higher mean v sin i value of 135.3$\pm$4.6 km s$^{-1}$ compared to the stars in the blue part that have an average v sin i equal to 81.3$\pm$5.6 km s$^{-1}$. This suggests that the eMSTO in NGC 2355 is possibly caused by the spread in rotation rates of stars. We do not find any substantial evidence of the dust-like extinction from the eMSTO stars using ultraviolet data from the Swift survey. The estimated synchronization time for low mass ratio close binaries in the blue part of the eMSTO suggests that they would be mostly slow-rotating if present. However, the stars in the blue part of the eMSTO are preferentially located in the outer region of the cluster indicating that they may lack low mass ratio close binaries. The spread in rotation rates of eMSTO stars in NGC 2355 is most likely caused by the star-disc interaction mechanism. The stars in the lower main sequence beyond the eMSTO region of NGC 2355 are slow-rotating (mean v sin i = 26.5$\pm$1.3 km s$^{-1}$) possibly due to the magnetic braking of their rotations.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
From Macro to Micro: Boosting micro-expression recognition via pre-training on macro-expression videos
Authors:
Hanting Li,
Hong**g Niu,
Feng Zhao
Abstract:
Micro-expression recognition (MER) has drawn increasing attention in recent years due to its potential applications in intelligent medical and lie detection. However, the shortage of annotated data has been the major obstacle to further improve deep-learning based MER methods. Intuitively, utilizing sufficient macro-expression data to promote MER performance seems to be a feasible solution. Howeve…
▽ More
Micro-expression recognition (MER) has drawn increasing attention in recent years due to its potential applications in intelligent medical and lie detection. However, the shortage of annotated data has been the major obstacle to further improve deep-learning based MER methods. Intuitively, utilizing sufficient macro-expression data to promote MER performance seems to be a feasible solution. However, the facial patterns of macro-expressions and micro-expressions are significantly different, which makes naive transfer learning methods difficult to deploy directly. To tacle this issue, we propose a generalized transfer learning paradigm, called \textbf{MA}cro-expression \textbf{TO} \textbf{MI}cro-expression (MA2MI). Under our paradigm, networks can learns the ability to represent subtle facial movement by reconstructing future frames. In addition, we also propose a two-branch micro-action network (MIACNet) to decouple facial position features and facial action features, which can help the network more accurately locate facial action locations. Extensive experiments on three popular MER benchmarks demonstrate the superiority of our method.
△ Less
Submitted 4 June, 2024; v1 submitted 26 May, 2024;
originally announced May 2024.
-
ASGrasp: Generalizable Transparent Object Reconstruction and Gras** from RGB-D Active Stereo Camera
Authors:
Jun Shi,
Yong A,
Yixiang **,
Dingzhe Li,
Haoyu Niu,
Zhezhu **,
He Wang
Abstract:
In this paper, we tackle the problem of gras** transparent and specular objects. This issue holds importance, yet it remains unsolved within the field of robotics due to failure of recover their accurate geometry by depth cameras. For the first time, we propose ASGrasp, a 6-DoF grasp detection network that uses an RGB-D active stereo camera. ASGrasp utilizes a two-layer learning-based stereo net…
▽ More
In this paper, we tackle the problem of gras** transparent and specular objects. This issue holds importance, yet it remains unsolved within the field of robotics due to failure of recover their accurate geometry by depth cameras. For the first time, we propose ASGrasp, a 6-DoF grasp detection network that uses an RGB-D active stereo camera. ASGrasp utilizes a two-layer learning-based stereo network for the purpose of transparent object reconstruction, enabling material-agnostic object gras** in cluttered environments. In contrast to existing RGB-D based grasp detection methods, which heavily depend on depth restoration networks and the quality of depth maps generated by depth cameras, our system distinguishes itself by its ability to directly utilize raw IR and RGB images for transparent object geometry reconstruction. We create an extensive synthetic dataset through domain randomization, which is based on GraspNet-1Billion. Our experiments demonstrate that ASGrasp can achieve over 90% success rate for generalizable transparent object gras** in both simulation and the real via seamless sim-to-real transfer. Our method significantly outperforms SOTA networks and even surpasses the performance upper bound set by perfect visible point cloud inputs.Project page: https://pku-epic.github.io/ASGrasp
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
xMTrans: Temporal Attentive Cross-Modality Fusion Transformer for Long-Term Traffic Prediction
Authors:
Huy Quang Ung,
Hao Niu,
Minh-Son Dao,
Shinya Wada,
Atsunori Minamikawa
Abstract:
Traffic predictions play a crucial role in intelligent transportation systems. The rapid development of IoT devices allows us to collect different kinds of data with high correlations to traffic predictions, fostering the development of efficient multi-modal traffic prediction models. Until now, there are few studies focusing on utilizing advantages of multi-modal data for traffic predictions. In…
▽ More
Traffic predictions play a crucial role in intelligent transportation systems. The rapid development of IoT devices allows us to collect different kinds of data with high correlations to traffic predictions, fostering the development of efficient multi-modal traffic prediction models. Until now, there are few studies focusing on utilizing advantages of multi-modal data for traffic predictions. In this paper, we introduce a novel temporal attentive cross-modality transformer model for long-term traffic predictions, namely xMTrans, with capability of exploring the temporal correlations between the data of two modalities: one target modality (for prediction, e.g., traffic congestion) and one support modality (e.g., people flow). We conducted extensive experiments to evaluate our proposed model on traffic congestion and taxi demand predictions using real-world datasets. The results showed the superiority of xMTrans against recent state-of-the-art methods on long-term traffic predictions. In addition, we also conducted a comprehensive ablation study to further analyze the effectiveness of each module in xMTrans.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
AutoInspect: Towards Long-Term Autonomous Industrial Inspection
Authors:
Michal Staniaszek,
Tobit Flatscher,
Joseph Rowell,
Hanlin Niu,
Wenxing Liu,
Yang You,
Robert Skilton,
Maurice Fallon,
Nick Hawes
Abstract:
We give an overview of AutoInspect, a ROS-based software system for robust and extensible mission-level autonomy. Over the past three years AutoInspect has been deployed in a variety of environments, including at a mine, a chemical plant, a mock oil rig, decommissioned nuclear power plants, and a fusion reactor for durations ranging from hours to weeks. The system combines robust map** and local…
▽ More
We give an overview of AutoInspect, a ROS-based software system for robust and extensible mission-level autonomy. Over the past three years AutoInspect has been deployed in a variety of environments, including at a mine, a chemical plant, a mock oil rig, decommissioned nuclear power plants, and a fusion reactor for durations ranging from hours to weeks. The system combines robust map** and localisation with graph-based autonomous navigation, mission execution, and scheduling to achieve a complete autonomous inspection system. The time from arrival at a new site to autonomous mission execution can be under an hour. It is deployed on a Boston Dynamics Spot robot using a custom sensing and compute payload called Frontier. In this work we go into detail of the system's performance in two long-term deployments of 49 days at a robotics test facility, and 35 days at the Joint European Torus (JET) fusion reactor in Oxfordshire, UK.
△ Less
Submitted 23 April, 2024; v1 submitted 19 April, 2024;
originally announced April 2024.
-
Statistical analysis of pulsar flux density distribution
Authors:
H. W. Xu,
R. S. Zhao,
Erbil Gugercinoglu,
H. Liu,
D. Li,
P. Wang,
C. H. Niu,
C. Miao,
X. Zhu,
R. W. Tian,
W. L. Li,
S. D. Wang,
Z. F. Tu,
Q. J. Zhi,
S. J. Dang,
L. H. Shang,
S. Xiao
Abstract:
This study presents a comprehensive analysis of the spectral properties of 886 pulsars across a wide frequency range from 20MHz to 343.5GHz, including a total of 86 millisecond pulsars. The majority of the pulsars exhibit power-law behavior in their spectra, although some exceptions are observed. Five different spectral models, namely simple power-law, broken power-law, low-frequency turn-over, hi…
▽ More
This study presents a comprehensive analysis of the spectral properties of 886 pulsars across a wide frequency range from 20MHz to 343.5GHz, including a total of 86 millisecond pulsars. The majority of the pulsars exhibit power-law behavior in their spectra, although some exceptions are observed. Five different spectral models, namely simple power-law, broken power-law, low-frequency turn-over, high-frequency cut-off, and double turn-over, were employed to explore the spectral behaviors. The average spectral index for pulsars modeled with a simple power-law is found to be -1.64 +/-0.80, consistent with previous studies. Additionally, significant correlations between the spectral index and characteristic parameters are observed particularly in millisecond pulsars, while no strong correlation is observed in normal pulsars. Different models show variations in the most influential characteristic parameters associated with the spectral index, indicating diverse dominant radiation mechanisms in millisecond pulsars.Finally, this study identifies 22 pulsars of the Gigahertz-peaked Spectra (GPS) type for the first time based on the Akaike information criterion.
△ Less
Submitted 16 April, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
Multi-Objective Trajectory Planning with Dual-Encoder
Authors:
Beibei Zhang,
Tian Xiang,
Chentao Mao,
Yuhua Zheng,
Shuai Li,
Haoyi Niu,
Xiangming Xi,
Wenyuan Bai,
Feng Gao
Abstract:
Time-jerk optimal trajectory planning is crucial in advancing robotic arms' performance in dynamic tasks. Traditional methods rely on solving complex nonlinear programming problems, bringing significant delays in generating optimized trajectories. In this paper, we propose a two-stage approach to accelerate time-jerk optimal trajectory planning. Firstly, we introduce a dual-encoder based transform…
▽ More
Time-jerk optimal trajectory planning is crucial in advancing robotic arms' performance in dynamic tasks. Traditional methods rely on solving complex nonlinear programming problems, bringing significant delays in generating optimized trajectories. In this paper, we propose a two-stage approach to accelerate time-jerk optimal trajectory planning. Firstly, we introduce a dual-encoder based transformer model to establish a good preliminary trajectory. This trajectory is subsequently refined through sequential quadratic programming to improve its optimality and robustness. Our approach outperforms the state-of-the-art by up to 79.72\% in reducing trajectory planning time. Compared with existing methods, our method shrinks the optimality gap with the objective function value decreasing by up to 29.9\%.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Image enhancement algorithm for absorption imaging
Authors:
Pengcheng Zheng,
Songqian Zhang,
Zhu Ma,
Haipo Niu,
Jiatao Wu,
Zerui Huang,
Chengyin Han,
Bo Lu,
Peiliang Liu,
Chaohong Lee
Abstract:
The noise in absorption imaging of cold atoms significantly impacts measurement accuracy across a range of applications with ultracold atoms. It is crucial to adopt an approach that offers effective denoising capabilities without compromising the unique structure of the atoms. Here we introduce a novel image enhancement algorithm for cold atomic absorption imaging. The algorithm successfully suppr…
▽ More
The noise in absorption imaging of cold atoms significantly impacts measurement accuracy across a range of applications with ultracold atoms. It is crucial to adopt an approach that offers effective denoising capabilities without compromising the unique structure of the atoms. Here we introduce a novel image enhancement algorithm for cold atomic absorption imaging. The algorithm successfully suppresses background noise, enhancing image contrast significantly. Experimental results showcase that this approach can enhance the accuracy of cold atom particle number measurements by approximately tenfold, all while preserving essential information. Moreover, the method exhibits exceptional performance and robustness when confronted with fringe noise and multi-component imaging scenarios, offering high stability. Importantly, the optimization process is entirely automated, eliminating the need for manual parameter selection. The method is both compatible and practical, making it applicable across various absorption imaging fields.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
Authors:
Jianxiong Li,
**liang Zheng,
Yinan Zheng,
Liyuan Mao,
Xiao Hu,
Sijie Cheng,
Haoyi Niu,
Jihao Liu,
Yu Liu,
**g**g Liu,
Ya-Qin Zhang,
Xianyuan Zhan
Abstract:
Multimodal pretraining is an effective strategy for the trinity of goals of representation learning in autonomous robots: 1) extracting both local and global task progressions; 2) enforcing temporal consistency of visual representation; 3) capturing trajectory-level language grounding. Most existing methods approach these via separate objectives, which often reach sub-optimal solutions. In this pa…
▽ More
Multimodal pretraining is an effective strategy for the trinity of goals of representation learning in autonomous robots: 1) extracting both local and global task progressions; 2) enforcing temporal consistency of visual representation; 3) capturing trajectory-level language grounding. Most existing methods approach these via separate objectives, which often reach sub-optimal solutions. In this paper, we propose a universal unified objective that can simultaneously extract meaningful task progression information from image sequences and seamlessly align them with language instructions. We discover that via implicit preferences, where a visual trajectory inherently aligns better with its corresponding language instruction than mismatched pairs, the popular Bradley-Terry model can transform into representation learning through proper reward reparameterizations. The resulted framework, DecisionNCE, mirrors an InfoNCE-style objective but is distinctively tailored for decision-making tasks, providing an embodied representation learning framework that elegantly extracts both local and global task progression features, with temporal consistency enforced through implicit time contrastive learning, while ensuring trajectory-level instruction grounding via multimodal joint encoding. Evaluation on both simulated and real robots demonstrates that DecisionNCE effectively facilitates diverse downstream policy learning tasks, offering a versatile solution for unified representation and reward learning. Project Page: https://2toinf.github.io/DecisionNCE/
△ Less
Submitted 23 May, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
A Comprehensive Survey of Cross-Domain Policy Transfer for Embodied Agents
Authors:
Haoyi Niu,
Jianming Hu,
Guyue Zhou,
Xianyuan Zhan
Abstract:
The burgeoning fields of robot learning and embodied AI have triggered an increasing demand for large quantities of data. However, collecting sufficient unbiased data from the target domain remains a challenge due to costly data collection processes and stringent safety requirements. Consequently, researchers often resort to data from easily accessible source domains, such as simulation and labora…
▽ More
The burgeoning fields of robot learning and embodied AI have triggered an increasing demand for large quantities of data. However, collecting sufficient unbiased data from the target domain remains a challenge due to costly data collection processes and stringent safety requirements. Consequently, researchers often resort to data from easily accessible source domains, such as simulation and laboratory environments, for cost-effective data acquisition and rapid model iteration. Nevertheless, the environments and embodiments of these source domains can be quite different from their target domain counterparts, underscoring the need for effective cross-domain policy transfer approaches. In this paper, we conduct a systematic review of existing cross-domain policy transfer methods. Through a nuanced categorization of domain gaps, we encapsulate the overarching insights and design considerations of each problem setting. We also provide a high-level discussion about the key methodologies used in cross-domain policy transfer problems. Lastly, we summarize the open challenges that lie beyond the capabilities of current paradigms and discuss potential future directions in this field.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Joint Learning of Local and Global Features for Aspect-based Sentiment Classification
Authors:
Hao Niu,
Yun Xiong,
Xiaosu Wang,
Philip S. Yu
Abstract:
Aspect-based sentiment classification (ASC) aims to judge the sentiment polarity conveyed by the given aspect term in a sentence. The sentiment polarity is not only determined by the local context but also related to the words far away from the given aspect term. Most recent efforts related to the attention-based models can not sufficiently distinguish which words they should pay more attention to…
▽ More
Aspect-based sentiment classification (ASC) aims to judge the sentiment polarity conveyed by the given aspect term in a sentence. The sentiment polarity is not only determined by the local context but also related to the words far away from the given aspect term. Most recent efforts related to the attention-based models can not sufficiently distinguish which words they should pay more attention to in some cases. Meanwhile, graph-based models are coming into ASC to encode syntactic dependency tree information. But these models do not fully leverage syntactic dependency trees as they neglect to incorporate dependency relation tag information into representation learning effectively. In this paper, we address these problems by effectively modeling the local and global features. Firstly, we design a local encoder containing: a Gaussian mask layer and a covariance self-attention layer. The Gaussian mask layer tends to adjust the receptive field around aspect terms adaptively to deemphasize the effects of unrelated words and pay more attention to local information. The covariance self-attention layer can distinguish the attention weights of different words more obviously. Furthermore, we propose a dual-level graph attention network as a global encoder by fully employing dependency tag information to capture long-distance information effectively. Our model achieves state-of-the-art performance on both SemEval 2014 and Twitter datasets.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Training models using forces computed by stochastic electronic structure methods
Authors:
David M. Ceperley,
Scott Jensen,
Yubo Yang,
Hongwei Niu,
Carlo Pierleoni,
Markus Holzmann
Abstract:
Quantum Monte Carlo (QMC) can play a very important role in generating accurate data needed for constructing potential energy surfaces. We argue that QMC has advantages in terms of a smaller systematic bias and an ability to cover phase space more completely. The stochastic noise can ease the training of the machine learning model. We discuss how stochastic errors affect the generation of effectiv…
▽ More
Quantum Monte Carlo (QMC) can play a very important role in generating accurate data needed for constructing potential energy surfaces. We argue that QMC has advantages in terms of a smaller systematic bias and an ability to cover phase space more completely. The stochastic noise can ease the training of the machine learning model. We discuss how stochastic errors affect the generation of effective models by analyzing the errors within a linear least squares procedure, finding that there is an advantage to having many relatively imprecise data points for constructing models. We then analyze the effect of noise on a model of many-body silicon finding that noise in some situations improves the resulting model. We then study the effect of QMC noise on two machine learning models of dense hydrogen used in a recent study of its phase diagram. The noise enable us to estimate the errors in the model. We conclude with a discussion of future research problems.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Layer-by-layer phase transformation in Ti$_3$O$_5$ revealed by machine learning molecular dynamics simulations
Authors:
Mingfeng Liu,
Jiantao Wang,
Junwei Hu,
Peitao Liu,
Haiyang Niu,
Xuexi Yan,
Jiangxu Li,
Haile Yan,
Bo Yang,
Yan Sun,
Chunlin Chen,
Georg Kresse,
Liang Zuo,
Xing-Qiu Chen
Abstract:
Reconstructive phase transitions involving breaking and reconstruction of primary chemical bonds are ubiquitous and important for many technological applications. In contrast to displacive phase transitions, the dynamics of reconstructive phase transitions are usually slow due to the large energy barrier. Nevertheless, the reconstructive phase transformation from $β$- to $λ$-Ti$_3$O$_5$ exhibits a…
▽ More
Reconstructive phase transitions involving breaking and reconstruction of primary chemical bonds are ubiquitous and important for many technological applications. In contrast to displacive phase transitions, the dynamics of reconstructive phase transitions are usually slow due to the large energy barrier. Nevertheless, the reconstructive phase transformation from $β$- to $λ$-Ti$_3$O$_5$ exhibits an ultrafast and reversible behavior. Despite extensive studies, the underlying microscopic mechanism remains unclear. Here, we discover a kinetically favorable in-plane nucleated layer-by-layer transformation mechanism through metadynamics and large-scale molecular dynamics simulations. This is enabled by develo** an efficient machine learning potential with near first-principles accuracy through an on-the-fly active learning method and an advanced sampling technique. Our results reveal that the $β$-$λ$ phase transformation initiates with the formation of two-dimensional nuclei in the $ab$-plane and then proceeds layer-by-layer through a multistep barrier-lowering kinetic process via intermediate metastable phases. Our work not only provides important insight into the ultrafast and reversible nature of the $β$-$λ$ transition, but also presents useful strategies and methods for tackling other complex structural phase transitions.
△ Less
Submitted 2 April, 2024; v1 submitted 9 October, 2023;
originally announced October 2023.
-
Stackelberg Driver Model for Continual Policy Improvement in Scenario-Based Closed-Loop Autonomous Driving
Authors:
Haoyi Niu,
Qimao Chen,
Yingyue Li,
Yi Zhang,
Jianming Hu
Abstract:
The deployment of autonomous vehicles (AVs) has faced hurdles due to the dominance of rare but critical corner cases within the long-tail distribution of driving scenarios, which negatively affects their overall performance. To address this challenge, adversarial generation methods have emerged as a class of efficient approaches to synthesize safety-critical scenarios for AV testing. However, thes…
▽ More
The deployment of autonomous vehicles (AVs) has faced hurdles due to the dominance of rare but critical corner cases within the long-tail distribution of driving scenarios, which negatively affects their overall performance. To address this challenge, adversarial generation methods have emerged as a class of efficient approaches to synthesize safety-critical scenarios for AV testing. However, these generated scenarios are often underutilized for AV training, resulting in the potential for continual AV policy improvement remaining untapped, along with a deficiency in the closed-loop design needed to achieve it. Therefore, we tailor the Stackelberg Driver Model (SDM) to accurately characterize the hierarchical nature of vehicle interaction dynamics, facilitating iterative improvement by engaging background vehicles (BVs) and AV in a sequential game-like interaction paradigm. With AV acting as the leader and BVs as followers, this leader-follower modeling ensures that AV would consistently refine its policy, always taking into account the additional information that BVs play the best response to challenge AV. Extensive experiments have shown that our algorithm exhibits superior performance compared to several baselines especially in higher dimensional scenarios, leading to substantial advancements in AV capabilities while continually generating progressively challenging scenarios. Code is available at https://github.com/BlueCat-de/SDM.
△ Less
Submitted 5 December, 2023; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Continual Driving Policy Optimization with Closed-Loop Individualized Curricula
Authors:
Haoyi Niu,
Yizhou Xu,
Xingjian Jiang,
Jianming Hu
Abstract:
The safety of autonomous vehicles (AV) has been a long-standing top concern, stemming from the absence of rare and safety-critical scenarios in the long-tail naturalistic driving distribution. To tackle this challenge, a surge of research in scenario-based autonomous driving has emerged, with a focus on generating high-risk driving scenarios and applying them to conduct safety-critical testing of…
▽ More
The safety of autonomous vehicles (AV) has been a long-standing top concern, stemming from the absence of rare and safety-critical scenarios in the long-tail naturalistic driving distribution. To tackle this challenge, a surge of research in scenario-based autonomous driving has emerged, with a focus on generating high-risk driving scenarios and applying them to conduct safety-critical testing of AV models. However, limited work has been explored on the reuse of these extensive scenarios to iteratively improve AV models. Moreover, it remains intractable and challenging to filter through gigantic scenario libraries collected from other AV models with distinct behaviors, attempting to extract transferable information for current AV improvement. Therefore, we develop a continual driving policy optimization framework featuring Closed-Loop Individualized Curricula (CLIC), which we factorize into a set of standardized sub-modules for flexible implementation choices: AV Evaluation, Scenario Selection, and AV Training. CLIC frames AV Evaluation as a collision prediction task, where it estimates the chance of AV failures in these scenarios at each iteration. Subsequently, by re-sampling from historical scenarios based on these failure probabilities, CLIC tailors individualized curricula for downstream training, aligning them with the evaluated capability of AV. Accordingly, CLIC not only maximizes the utilization of the vast pre-collected scenario library for closed-loop driving policy optimization but also facilitates AV improvement by individualizing its training with more challenging cases out of those poorly organized scenarios. Experimental results clearly indicate that CLIC surpasses other curriculum-based training strategies, showing substantial improvement in managing risky scenarios, while still maintaining proficiency in handling simpler cases.
△ Less
Submitted 6 March, 2024; v1 submitted 25 September, 2023;
originally announced September 2023.
-
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
Authors:
Haoyi Niu,
Tianying Ji,
Bingqi Liu,
Haocheng Zhao,
Xiangyu Zhu,
Jianying Zheng,
Pengfei Huang,
Guyue Zhou,
Jianming Hu,
Xianyuan Zhan
Abstract:
Solving real-world complex tasks using reinforcement learning (RL) without high-fidelity simulation environments or large amounts of offline data can be quite challenging. Online RL agents trained in imperfect simulation environments can suffer from severe sim-to-real issues. Offline RL approaches although bypass the need for simulators, often pose demanding requirements on the size and quality of…
▽ More
Solving real-world complex tasks using reinforcement learning (RL) without high-fidelity simulation environments or large amounts of offline data can be quite challenging. Online RL agents trained in imperfect simulation environments can suffer from severe sim-to-real issues. Offline RL approaches although bypass the need for simulators, often pose demanding requirements on the size and quality of the offline datasets. The recently emerged hybrid offline-and-online RL provides an attractive framework that enables joint use of limited offline data and imperfect simulator for transferable policy learning. In this paper, we develop a new algorithm, called H2O+, which offers great flexibility to bridge various choices of offline and online learning methods, while also accounting for dynamics gaps between the real and simulation environment. Through extensive simulation and real-world robotics experiments, we demonstrate superior performance and flexibility over advanced cross-domain online and offline RL algorithms.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
Sim-to-Real Deep Reinforcement Learning with Manipulators for Pick-and-place
Authors:
Wenxing Liu,
Hanlin Niu,
Robert Skilton,
Joaquin Carrasco
Abstract:
When transferring a Deep Reinforcement Learning model from simulation to the real world, the performance could be unsatisfactory since the simulation cannot imitate the real world well in many circumstances. This results in a long period of fine-tuning in the real world. This paper proposes a self-supervised vision-based DRL method that allows robots to pick and place objects effectively and effic…
▽ More
When transferring a Deep Reinforcement Learning model from simulation to the real world, the performance could be unsatisfactory since the simulation cannot imitate the real world well in many circumstances. This results in a long period of fine-tuning in the real world. This paper proposes a self-supervised vision-based DRL method that allows robots to pick and place objects effectively and efficiently when directly transferring a training model from simulation to the real world. A height-sensitive action policy is specially designed for the proposed method to deal with crowded and stacked objects in challenging environments. The training model with the proposed approach can be applied directly to a real suction task without any fine-tuning from the real world while maintaining a high suction success rate. It is also validated that our model can be deployed to suction novel objects in a real experiment with a suction success rate of 90\% without any real-world fine-tuning. The experimental video is available at: https://youtu.be/jSTC-EGsoFA.
△ Less
Submitted 17 September, 2023;
originally announced September 2023.
-
Superconvergence of a nonconforming brick element for the quad-curl problem
Authors:
Xinchen Zhou,
Zhaoliang Meng,
Hexin Niu
Abstract:
This short note shows the superconvergence of an $H(\mathrm{grad}\,\mathrm{curl})$-nonconforming brick element very recently introduced in [17] for the quad-curl problem. The supercloseness is based on proper modifications for both the interpolation and the discrete formulation, leading to an $O(h^2)$ superclose order in the discrete $H(\mathrm{grad}\,\mathrm{curl})$ norm. Moreover, we propose a s…
▽ More
This short note shows the superconvergence of an $H(\mathrm{grad}\,\mathrm{curl})$-nonconforming brick element very recently introduced in [17] for the quad-curl problem. The supercloseness is based on proper modifications for both the interpolation and the discrete formulation, leading to an $O(h^2)$ superclose order in the discrete $H(\mathrm{grad}\,\mathrm{curl})$ norm. Moreover, we propose a suitable postprocessing method to ensure the global superconvergence. Numerical results verify our theory.
△ Less
Submitted 4 September, 2023;
originally announced September 2023.
-
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making
Authors:
Hui Niu,
Siyuan Li,
Jiahao Zheng,
Zhouchi Lin,
Jian Li,
Jian Guo,
Bo An
Abstract:
Market making (MM) has attracted significant attention in financial trading owing to its essential function in ensuring market liquidity. With strong capabilities in sequential decision-making, Reinforcement Learning (RL) technology has achieved remarkable success in quantitative trading. Nonetheless, most existing RL-based MM methods focus on optimizing single-price level strategies which fail at…
▽ More
Market making (MM) has attracted significant attention in financial trading owing to its essential function in ensuring market liquidity. With strong capabilities in sequential decision-making, Reinforcement Learning (RL) technology has achieved remarkable success in quantitative trading. Nonetheless, most existing RL-based MM methods focus on optimizing single-price level strategies which fail at frequent order cancellations and loss of queue priority. Strategies involving multiple price levels align better with actual trading scenarios. However, given the complexity that multi-price level strategies involves a comprehensive trading action space, the challenge of effectively training profitable RL agents for MM persists. Inspired by the efficient workflow of professional human market makers, we propose Imitative Market Maker (IMM), a novel RL framework leveraging both knowledge from suboptimal signal-based experts and direct policy interactions to develop multi-price level MM strategies efficiently. The framework start with introducing effective state and action representations adept at encoding information about multi-price level orders. Furthermore, IMM integrates a representation learning unit capable of capturing both short- and long-term market trends to mitigate adverse selection risk. Subsequently, IMM formulates an expert strategy based on signals and trains the agent through the integration of RL and imitation learning techniques, leading to efficient learning. Extensive experimental results on four real-world market datasets demonstrate that IMM outperforms current RL-based market making strategies in terms of several financial criteria. The findings of the ablation study substantiate the effectiveness of the model components.
△ Less
Submitted 17 August, 2023;
originally announced August 2023.
-
Photometric calibration of the Stellar Abundance and Galactic Evolution Survey (SAGES): Nanshan One-meter Wide-field Telescope g, r, and i band imaging data
Authors:
Kai Xiao,
Haibo Yuan,
Bowen Huang,
Shuai Xu,
Jie Zheng,
Chun Li,
Zhou Fan,
Wei Wang,
Gang Zhao,
Guojie Feng,
Xuan Zhang,
**zhong Liu,
Ruoyi Zhang,
Lin Yang,
Yu Zhang,
Chunhai Bai,
Hubiao Niu,
Esamdin Ali,
Lu Ma
Abstract:
In this paper, a total of approximately 2.6 million dwarfs were constructed as standard stars, with an accuracy of about 0.01-0.02 mag for each band, by combining spectroscopic data from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope Data Release 7, photometric data from the corrected Gaia Early Data Release 3, and photometric metallicities. Using the spectroscopy based stellar colo…
▽ More
In this paper, a total of approximately 2.6 million dwarfs were constructed as standard stars, with an accuracy of about 0.01-0.02 mag for each band, by combining spectroscopic data from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope Data Release 7, photometric data from the corrected Gaia Early Data Release 3, and photometric metallicities. Using the spectroscopy based stellar color regression method (SCR method) and the photometric-based SCR method (SCR' method), we performed the relative calibration of the Nanshan One-meter Wide-field Telescope imaging data. Based on the corrected Pan-STARRS DR1 photometry, the absolute calibration was also performed. In the photometric calibration process, we analyzed the dependence of the calibration zero points on different images (observation time), different gates of the CCD detector, and different CCD positions. We found that the stellar flat and the relative gain between different gates depend on time. The amplitude of gain variation in three channels is approximately 0.5%-0.7% relative to the other channel, with a maximum value of 4%. In addition, significant spatial variations of the stellar flat fitting residual are found and corrected. Using repeated sources in the adjacent images, we checked and discovered internal consistency of about 1-2 mmag in all the filters. Using the PS1 magnitudes synthesized by Gaia DR3 BP/RP spectra by the synthetic photometry method, we found that the photometric calibration uniformity is about 1-2 mmag for all the bands, at a spatial resolution of 1.3 degree. A detailed comparison between the spectroscopy-based SCR and photometric-based SCR method magnitude offsets was performed, and we achieved an internal consistency precision of about 2 mmag or better with resolutions of 1.3 degree for all the filters. Which is mainly from the position-dependent errors of the E(B-V) used in SCR' method.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Change of rotation measure during eclipse of a black widow PSR J2051$-$0827
Authors:
S. Q. Wang,
J. B. Wang,
D. Z. Li,
J. M. Yao,
R. N. Manchester,
G. Hobbs,
N. Wang,
S. Dai,
H. Xu,
R. Luo,
Y. Feng,
W. Y. Wang,
D. Li,
Y. W. Yu,
Z. X. Du,
C. H. Niu,
S. B. Zhang,
C. M. Zhang
Abstract:
Black widows are millisecond pulsars ablating their companions. The material blown from the companion blocks the radio emission, resulting in radio eclipses. The properties of the eclipse medium are poorly understood. Here, we present direct evidence of the existence of magnetic fields in the eclipse medium of the black widow PSR J2051$-$0827 using observations made with the Five-hundred-meter Ape…
▽ More
Black widows are millisecond pulsars ablating their companions. The material blown from the companion blocks the radio emission, resulting in radio eclipses. The properties of the eclipse medium are poorly understood. Here, we present direct evidence of the existence of magnetic fields in the eclipse medium of the black widow PSR J2051$-$0827 using observations made with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). We detect a regular decrease in rotation measure (RM) in the egress of eclipse, changing from $60\,\rm rad\,m^{-2}$ to $-28.7\,\rm rad\,m^{-2}$. The RM gradually changes back to normal when the line-of-sight moves away from the eclipse. The estimated line-of-sight magnetic field strength in the eclipse medium is $\sim 0.1$ G. The RM reversal could be caused by a change of the magnetic field strength along the line of sight due to binary orbital motion. The RM reversal phenomenon has also been observed in some repeating fast radio bursts (FRBs), and the study of spider pulsars may provide additional information about the origin of FRBs.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
The Stellar Abundances and Galactic Evolution Survey (SAGES) -- -- I. General Description and the First Data Release (DR1)
Authors:
Zhou Fan,
Gang Zhao,
Wei Wang,
Jie Zheng,
**gkun Zhao,
Chun Li,
Yuqin Chen,
Haibo Yuan,
Haining Li,
Kefeng Tan,
Yihan Song,
Fang Zuo,
Yang Huang,
Ali Luo,
Ali Esamdin,
Lu Ma,
Bin Li,
Nan Song,
Frank Grupp,
Haibin Zhao,
Shuhrat A. Ehgamberdiev,
Otabek A. Burkhonov,
Guojie Feng,
Chunhai Bai,
Xuan Zhang
, et al. (13 additional authors not shown)
Abstract:
The Stellar Abundances and Galactic Evolution Survey (SAGES) of the northern sky is a specifically-designed multi-band photometric survey aiming to provide reliable stellar parameters with accuracy comparable to those from low-resolution optical spectra. It was carried out with the 2.3-m Bok telescope of Steward Observatory and three other telescopes. The observations in the $u_s$ and $v_s$ passba…
▽ More
The Stellar Abundances and Galactic Evolution Survey (SAGES) of the northern sky is a specifically-designed multi-band photometric survey aiming to provide reliable stellar parameters with accuracy comparable to those from low-resolution optical spectra. It was carried out with the 2.3-m Bok telescope of Steward Observatory and three other telescopes. The observations in the $u_s$ and $v_s$ passband produced over 36,092 frames of images in total, covering a sky area of $\sim9960$ degree$^2$. The median survey completeness of all observing fields for the two bands are of $u_{\rm s}=20.4$ mag and $v_s=20.3$ mag, respectively, while the limiting magnitudes with signal-to-noise ratio (S/N) of 100 are $u_s\sim17$ mag and $v_s\sim18$ mag, correspondingly. We combined our catalog with the data release 1 (DR1) of the first of Panoramic Survey Telescope And Rapid Response System (Pan-STARRS1, PS1) catalog, and obtained a total of 48,553,987 sources which have at least one photometric measurement in each of the SAGES $u_s$ and $v_s$ and PS1 $grizy$ passbands, which is the DR1 of SAGES and it will be released in our paper. We compare our $gri$ point-source photometry with those of PS1 and found an RMS scatter of $\sim2$% in difference of PS1 and SAGES for the same band. We estimated an internal photometric precision of SAGES to be on the order of $\sim1$%. Astrometric precision is better than $0^{\prime\prime}.2$ based on comparison with the DR1 of Gaia mission. In this paper, we also describe the final end-user database, and provide some science applications.
△ Less
Submitted 28 June, 2023; v1 submitted 27 June, 2023;
originally announced June 2023.
-
Unlocking the Structural Mystery of Vaterite CaCO3
Authors:
Xingyuan San,
Junwei Hu,
Mingyi Chen,
Haiyang Niu,
Paul J. M. Smeets,
Jie Deng,
Kunmo Koo,
Roberto dos Reis,
Vinayak P. Dravid,
Xiaobing Hu
Abstract:
Calcium carbonate (CaCO3), the most abundant biogenic mineral on earth, plays a crucial role in various fields. Of the four polymorphs, calcite, aragonite, vaterite, and amorphous CaCO3, vaterite is the most enigmatic one due to an ongoing debate regarding its structure that has persisted for nearly a century. In this work, based on systematic transmission electron microscopy characterizations, el…
▽ More
Calcium carbonate (CaCO3), the most abundant biogenic mineral on earth, plays a crucial role in various fields. Of the four polymorphs, calcite, aragonite, vaterite, and amorphous CaCO3, vaterite is the most enigmatic one due to an ongoing debate regarding its structure that has persisted for nearly a century. In this work, based on systematic transmission electron microscopy characterizations, elaborate crystallographic analysis and machine learning aided molecular dynamics simulations with ab initio accuracy, we reveal that vaterite can be regarded as a polytypic structure. The basic phase is a monoclinic lattice possessing pseudohexagonal symmetry. Direct imaging and atomic-scale simulations provide evidence that a single grain of vaterite can have three orientation variants. Additionally, we find that vaterite undergoes a second-order phase transition. These atomic scale insights provide a comprehensive understanding of the structure of vaterite and offer new perspectives on the biomineralization process of calcium carbonate.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
Hardness and fracture toughness models by symbolic regression
Authors:
**bin Zhao,
Peitao Liu,
Jiantao Wang,
Jiangxu Li,
Haiyang Niu,
Yan Sun,
Junlin Li,
Xing-Qiu Chen
Abstract:
Superhard materials with good fracture toughness have found wide industrial applications, which necessitates the development of accurate hardness and fracture toughness models for efficient materials design. Although several macroscopic models have been proposed, they are mostly semiempirical based on prior knowledge or assumptions, and obtained by fitting limited experimental data. Here, through…
▽ More
Superhard materials with good fracture toughness have found wide industrial applications, which necessitates the development of accurate hardness and fracture toughness models for efficient materials design. Although several macroscopic models have been proposed, they are mostly semiempirical based on prior knowledge or assumptions, and obtained by fitting limited experimental data. Here, through an unbiased and explanatory symbolic regression technique, we built a macroscopic hardness model and fracture toughness model, which only require shear and bulk moduli as inputs. The developed hardness model was trained on an extended dataset, which not only includes cubic systems, but also contains non-cubic systems with anisotropic elastic properties. The obtained models turned out to be simple, accurate, and transferable. Moreover, we assessed the performance of three popular deep learning models for predicting bulk and shear moduli, and found that the crystal graph convolutional neural network and crystal explainable property predictor perform almost equally well, both better than the atomistic line graph neural network. By combining the machine-learned bulk and shear moduli with the hardness and fracture toughness prediction models, potential superhard materials with good fracture toughness can be efficiently screened out through high-throughput calculations.
△ Less
Submitted 4 August, 2023; v1 submitted 31 May, 2023;
originally announced May 2023.
-
Ab initio investigation of the crystallization mechanism of cadmium selenide
Authors:
Linshuang Zhang,
Manyi Yang,
Shiwei Zhang,
Haiyang Niu
Abstract:
Cadmium selenide (CdSe) is an inorganic semiconductor with unique optical and electronic properties that made it useful in various applications, including solar cells, light-emitting diodes, and biofluorescent tagging. In order to synthesize high-quality crystals and subsequently integrate them into devices, it is crucial to understand the atomic scale crystallization mechanism of CdSe. Unfortunat…
▽ More
Cadmium selenide (CdSe) is an inorganic semiconductor with unique optical and electronic properties that made it useful in various applications, including solar cells, light-emitting diodes, and biofluorescent tagging. In order to synthesize high-quality crystals and subsequently integrate them into devices, it is crucial to understand the atomic scale crystallization mechanism of CdSe. Unfortunately, such studies are still absent in the literature.To overcome this limitation, we employed an enhanced sampling-accelerated active learning approach to construct a deep neural potential with ab initio accuracy for studying the crystallization of CdSe.Our brute-force molecular dynamics simulations revealed that a spherical-like nucleus formed spontaneously and stochastically, resulting in a stacking disordered structure where the competition between hexagonal wurtzite and cubic zinc blende polymorphs is temperature-dependent. We found that pure hexagonal crystal can only be obtained approximately above 1430 K, which is 35 K below its melting temperature. We observed that the solidification dynamics of Cd and Se atoms were distinct due to their different diffusion coefficients. The solidification process was initiated by lower mobile Se atoms forming tetrahedral frameworks, followed by Cd atoms occupying these tetrahedral centers and settling down until the third-shell neighbor of Se atoms sited on their lattice positions. Therefore, the medium-range ordering of Se atoms governs the crystallization process of CdSe. Our findings indicate that understanding the complex dynamical process is the key to comprehending the crystallization mechanism of compounds like CdSe, and can shed lights in the synthesis of high-quality crystals.
△ Less
Submitted 28 May, 2023;
originally announced May 2023.
-
Sum Secrecy Rate Maximization for IRS-aided Multi-Cluster MIMO-NOMA Terahertz Systems
Authors:
**lei Xu,
Zhengyu Zhu,
Zheng Chu,
Hehao Niu,
Pei Xiao,
Inkyu Lee
Abstract:
Intelligent reflecting surface (IRS) is a promising technique to extend the network coverage and improve spectral efficiency. This paper investigates an IRS-assisted terahertz (THz) multiple-input multiple-output (MIMO)-nonorthogonal multiple access (NOMA) system based on hybrid precoding with the presence of eavesdropper. Two types of sparse RF chain antenna structures are adopted, i.e., sub-conn…
▽ More
Intelligent reflecting surface (IRS) is a promising technique to extend the network coverage and improve spectral efficiency. This paper investigates an IRS-assisted terahertz (THz) multiple-input multiple-output (MIMO)-nonorthogonal multiple access (NOMA) system based on hybrid precoding with the presence of eavesdropper. Two types of sparse RF chain antenna structures are adopted, i.e., sub-connected structure and fully connected structure. First, cluster heads are selected for each beam, and analog precoding based on discrete phase is designed. Then, users are clustered based on channel correlation, and NOMA technology is employed to serve the users. In addition, a low-complexity forced-zero method is utilized to design digital precoding in order to eliminate inter-cluster interference. On this basis, we propose a secure transmission scheme to maximize the sum secrecy rate by jointly optimizing the power allocation and phase shifts of IRS subject to the total transmit power budget, minimal achievable rate requirement of each user, and IRS reflection coefficients. Due to multiple coupled variables, the formulated problem leads to a non-convex issue. We apply the Taylor series expansion and semidefinite programming to convert the original non-convex problem into a convex one. Then, an alternating optimization algorithm is developed to obtain a feasible solution of the original problem. Simulation results verify the convergence of the proposed algorithm, and deploying IRS can bring significant beamforming gains to suppress the eavesdrop**.
△ Less
Submitted 11 June, 2023; v1 submitted 15 May, 2023;
originally announced May 2023.
-
Variable stars detection in the field of open cluster NGC 188
Authors:
Fang-Fang Song,
Hu-Biao Niu,
Ali Esamdin,
Yu Zhang,
Xiang-Yun Zeng
Abstract:
This work presents the charge-coupled device (CCD) photometric survey of the old open cluster NGC 188. Time-series V-band photometric observations were conducted for ten nights in January 2017 using the Nanshan One-meter Wide-field Telescope (NOWT) to search for variable stars in the field of the cluster field. A total of 25 variable stars, including one new variable star, were detected in the tar…
▽ More
This work presents the charge-coupled device (CCD) photometric survey of the old open cluster NGC 188. Time-series V-band photometric observations were conducted for ten nights in January 2017 using the Nanshan One-meter Wide-field Telescope (NOWT) to search for variable stars in the field of the cluster field. A total of 25 variable stars, including one new variable star, were detected in the target field. Among the detected variables, 16 are cluster member stars, and the others are identified as field stars. The periods, radial velocities, effective temperatures, and classifications of the detected variables are discussed in this work. Most of the stars' effective temperatures are between 4200 K and 6600 K, indicating their spectral types are G or K. The newly discovered variable is probably a W UMa system. In this study, a known cluster variable star (V21 = V0769 Cep) is classified as an EA-type variable star based on the presence of an 0.5 magnitude eclipse in its light curve.
△ Less
Submitted 25 April, 2023;
originally announced April 2023.
-
Imperfectly coordinated water molecules pave the way for homogeneous ice nucleation
Authors:
Mingyi Chen,
Lin Tan,
Han Wang,
Linfeng Zhang,
Haiyang Niu
Abstract:
Water freezing is ubiquitous on Earth, affecting many areas from biology to climate science and aviation technology. Probing the atomic structure in the homogeneous ice nucleation process from scratch is of great value but still experimentally unachievable. Theoretical simulations have found that ice originates from the low-mobile region with increasing abundance and persistence of tetrahedrally c…
▽ More
Water freezing is ubiquitous on Earth, affecting many areas from biology to climate science and aviation technology. Probing the atomic structure in the homogeneous ice nucleation process from scratch is of great value but still experimentally unachievable. Theoretical simulations have found that ice originates from the low-mobile region with increasing abundance and persistence of tetrahedrally coordinated water molecules. However, a detailed microscopic picture of how the disordered hydrogen-bond network rearranges itself into an ordered network is still unclear. In this work, we use a deep neural network (DNN) model to "learn" the interatomic potential energy from quantum mechanical data, thereby allowing for large-scale and long molecular dynamics (MD) simulations with ab initio accuracy. The nucleation mechanism and dynamics at atomic resolution, represented by a total of 36 $μ$s-long MD trajectories, are deeply affected by the structural and dynamical heterogeneity in supercooled water. We find that imperfectly coordinated (IC) water molecules with high mobility pave the way for hydrogen-bond network rearrangement, leading to the growth or shrinkage of the ice nucleus. The hydrogen-bond network formed by perfectly coordinated (PC) molecules stabilizes the nucleus, thus preventing it from vanishing and growing. Consequently, ice is born through competition and cooperation between IC and PC molecules. We anticipate that our picture of the microscopic mechanism of ice nucleation will provide new insights into many properties of water and other relevant materials.
△ Less
Submitted 25 April, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Frequency Decomposition to Tap the Potential of Single Domain for Generalization
Authors:
Qingyue Yang,
Hong**g Niu,
Pengfei Xia,
Wei Zhang,
Bin Li
Abstract:
Domain generalization (DG), aiming at models able to work on multiple unseen domains, is a must-have characteristic of general artificial intelligence. DG based on single source domain training data is more challenging due to the lack of comparable information to help identify domain invariant features. In this paper, it is determined that the domain invariant features could be contained in the si…
▽ More
Domain generalization (DG), aiming at models able to work on multiple unseen domains, is a must-have characteristic of general artificial intelligence. DG based on single source domain training data is more challenging due to the lack of comparable information to help identify domain invariant features. In this paper, it is determined that the domain invariant features could be contained in the single source domain training samples, then the task is to find proper ways to extract such domain invariant features from the single source domain samples. An assumption is made that the domain invariant features are closely related to the frequency. Then, a new method that learns through multiple frequency domains is proposed. The key idea is, dividing the frequency domain of each original image into multiple subdomains, and learning features in the subdomain by a designed two branches network. In this way, the model is enforced to learn features from more samples of the specifically limited spectrum, which increases the possibility of obtaining the domain invariant features that might have previously been defiladed by easily learned features. Extensive experimental investigation reveals that 1) frequency decomposition can help the model learn features that are difficult to learn. 2) the proposed method outperforms the state-of-the-art methods of single-source domain generalization.
△ Less
Submitted 14 April, 2023;
originally announced April 2023.
-
Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency
Authors:
Xingwu Ji,
Peilin Liu,
Haochen Niu,
Xiang Chen,
Rendong Ying,
Fei Wen
Abstract:
Visual simultaneous localization and map** (SLAM) systems face challenges in detecting loop closure under the circumstance of large viewpoint changes. In this paper, we present an object-based loop closure detection method based on the spatial layout and semanic consistency of the 3D scene graph. Firstly, we propose an object-level data association approach based on the semantic information from…
▽ More
Visual simultaneous localization and map** (SLAM) systems face challenges in detecting loop closure under the circumstance of large viewpoint changes. In this paper, we present an object-based loop closure detection method based on the spatial layout and semanic consistency of the 3D scene graph. Firstly, we propose an object-level data association approach based on the semantic information from semantic labels, intersection over union (IoU), object color, and object embedding. Subsequently, multi-view bundle adjustment with the associated objects is utilized to jointly optimize the poses of objects and cameras. We represent the refined objects as a 3D spatial graph with semantics and topology. Then, we propose a graph matching approach to select correspondence objects based on the structure layout and semantic property similarity of vertices' neighbors. Finally, we jointly optimize camera trajectories and object poses in an object-level pose graph optimization, which results in a globally consistent map. Experimental results demonstrate that our proposed data association approach can construct more accurate 3D semantic maps, and our loop closure method is more robust than point-based and object-based methods in circumstances with large viewpoint changes.
△ Less
Submitted 14 April, 2023; v1 submitted 11 April, 2023;
originally announced April 2023.
-
Estimating Treatment Effects from Irregular Time Series Observations with Hidden Confounders
Authors:
Defu Cao,
James Enouen,
Yu**g Wang,
Xiangchen Song,
Chuizheng Meng,
Hao Niu,
Yan Liu
Abstract:
Causal analysis for time series data, in particular estimating individualized treatment effect (ITE), is a key task in many real-world applications, such as finance, retail, healthcare, etc. Real-world time series can include large-scale, irregular, and intermittent time series observations, raising significant challenges to existing work attempting to estimate treatment effects. Specifically, the…
▽ More
Causal analysis for time series data, in particular estimating individualized treatment effect (ITE), is a key task in many real-world applications, such as finance, retail, healthcare, etc. Real-world time series can include large-scale, irregular, and intermittent time series observations, raising significant challenges to existing work attempting to estimate treatment effects. Specifically, the existence of hidden confounders can lead to biased treatment estimates and complicate the causal inference process. In particular, anomaly hidden confounders which exceed the typical range can lead to high variance estimates. Moreover, in continuous time settings with irregular samples, it is challenging to directly handle the dynamics of causality. In this paper, we leverage recent advances in Lipschitz regularization and neural controlled differential equations (CDE) to develop an effective and scalable solution, namely LipCDE, to address the above challenges. LipCDE can directly model the dynamic causal relationships between historical data and outcomes with irregular samples by considering the boundary of hidden confounders given by Lipschitz-constrained neural networks. Furthermore, we conduct extensive experiments on both synthetic and real-world datasets to demonstrate the effectiveness and scalability of LipCDE.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
CLIPER: A Unified Vision-Language Framework for In-the-Wild Facial Expression Recognition
Authors:
Hanting Li,
Hong**g Niu,
Zhaoqing Zhu,
Feng Zhao
Abstract:
Facial expression recognition (FER) is an essential task for understanding human behaviors. As one of the most informative behaviors of humans, facial expressions are often compound and variable, which is manifested by the fact that different people may express the same expression in very different ways. However, most FER methods still use one-hot or soft labels as the supervision, which lack suff…
▽ More
Facial expression recognition (FER) is an essential task for understanding human behaviors. As one of the most informative behaviors of humans, facial expressions are often compound and variable, which is manifested by the fact that different people may express the same expression in very different ways. However, most FER methods still use one-hot or soft labels as the supervision, which lack sufficient semantic descriptions of facial expressions and are less interpretable. Recently, contrastive vision-language pre-training (VLP) models (e.g., CLIP) use text as supervision and have injected new vitality into various computer vision tasks, benefiting from the rich semantics in text. Therefore, in this work, we propose CLIPER, a unified framework for both static and dynamic facial Expression Recognition based on CLIP. Besides, we introduce multiple expression text descriptors (METD) to learn fine-grained expression representations that make CLIPER more interpretable. We conduct extensive experiments on several popular FER benchmarks and achieve state-of-the-art performance, which demonstrates the effectiveness of CLIPER.
△ Less
Submitted 28 February, 2023;
originally announced March 2023.
-
(Re)$^2$H2O: Autonomous Driving Scenario Generation via Reversely Regularized Hybrid Offline-and-Online Reinforcement Learning
Authors:
Haoyi Niu,
Kun Ren,
Yizhou Xu,
Ziyuan Yang,
Yichen Lin,
Yi Zhang,
Jianming Hu
Abstract:
Autonomous driving and its widespread adoption have long held tremendous promise. Nevertheless, without a trustworthy and thorough testing procedure, not only does the industry struggle to mass-produce autonomous vehicles (AV), but neither the general public nor policymakers are convinced to accept the innovations. Generating safety-critical scenarios that present significant challenges to AV is a…
▽ More
Autonomous driving and its widespread adoption have long held tremendous promise. Nevertheless, without a trustworthy and thorough testing procedure, not only does the industry struggle to mass-produce autonomous vehicles (AV), but neither the general public nor policymakers are convinced to accept the innovations. Generating safety-critical scenarios that present significant challenges to AV is an essential first step in testing. Real-world datasets include naturalistic but overly safe driving behaviors, whereas simulation would allow for unrestricted exploration of diverse and aggressive traffic scenarios. Conversely, higher-dimensional searching space in simulation disables efficient scenario generation without real-world data distribution as implicit constraints. In order to marry the benefits of both, it seems appealing to learn to generate scenarios from both offline real-world and online simulation data simultaneously. Therefore, we tailor a Reversely Regularized Hybrid Offline-and-Online ((Re)$^2$H2O) Reinforcement Learning recipe to additionally penalize Q-values on real-world data and reward Q-values on simulated data, which ensures the generated scenarios are both varied and adversarial. Through extensive experiments, our solution proves to produce more risky scenarios than competitive baselines and it can generalize to work with various autonomous driving models. In addition, these generated scenarios are also corroborated to be capable of fine-tuning AV performance.
△ Less
Submitted 10 June, 2023; v1 submitted 27 February, 2023;
originally announced February 2023.
-
Sim-and-Real Reinforcement Learning for Manipulation: A Consensus-based Approach
Authors:
Wenxing Liu,
Hanlin Niu,
Wei Pan,
Guido Herrmann,
Joaquin Carrasco
Abstract:
Sim-and-real training is a promising alternative to sim-to-real training for robot manipulations. However, the current sim-and-real training is neither efficient, i.e., slow convergence to the optimal policy, nor effective, i.e., sizeable real-world robot data. Given limited time and hardware budgets, the performance of sim-and-real training is not satisfactory. In this paper, we propose a Consens…
▽ More
Sim-and-real training is a promising alternative to sim-to-real training for robot manipulations. However, the current sim-and-real training is neither efficient, i.e., slow convergence to the optimal policy, nor effective, i.e., sizeable real-world robot data. Given limited time and hardware budgets, the performance of sim-and-real training is not satisfactory. In this paper, we propose a Consensus-based Sim-And-Real deep reinforcement learning algorithm (CSAR) for manipulator pick-and-place tasks, which shows comparable performance in both sim-and-real worlds. In this algorithm, we train the agents in simulators and the real world to get the optimal policies for both sim-and-real worlds. We found two interesting phenomenons: (1) Best policy in simulation is not the best for sim-and-real training. (2) The more simulation agents, the better sim-and-real training. The experimental video is available at: https://youtu.be/mcHJtNIsTEQ.
△ Less
Submitted 17 September, 2023; v1 submitted 26 February, 2023;
originally announced February 2023.
-
Proton FLASH irradiation platform for small animal setup at Chang Gung Memorial Hospital
Authors:
Tung-Yuan Hsiao,
Lu-Kai Wang,
Tzung-Yuang Chen,
Ching-Fang Yu,
Pan,
Cheng-Ya,
Chun-Chieh Wang,
Chien-Yu Lin,
I-Chun Cho,
Huan Niu,
Chien-Hsu Chen
Abstract:
Background : Proton flash therapy is an emergency research topic in radiation therapy since the Varian announced the promising results from the first in human clinical trial of Flash therapy recently. However, it still needs a lot of researches on this topic, not only to understand the mechanism of the radiobiological effects but also to develop an appropriate dose monitoring system. Purpose : In…
▽ More
Background : Proton flash therapy is an emergency research topic in radiation therapy since the Varian announced the promising results from the first in human clinical trial of Flash therapy recently. However, it still needs a lot of researches on this topic, not only to understand the mechanism of the radiobiological effects but also to develop an appropriate dose monitoring system. Purpose : In this study we setup an experimental station for small animal proton Flash irradiation in a clinical machine. The dose monitoring system is able to provide real-time irradiation dose and irradiation time structure.
Methods : The dose monitoring system includes homebrewed transmission ionization chamber (TIC), plastic scintillator based beam position monitor, and Poor Man Faraday Cup (FC). Both TIC and FC are equipped with a homebrewed fast reading current integral electronics device. The imaging guidance system comprises a moveable CT, laser, as well as attaching a bead on the body surface of the mouse can accurately guide the testing small animal in position.
Results : The dose monitoring system can provide the time structure of delivered dose rate within 1 ms time resolution. Experimental testing results show that the highest dose in one pulse of 230 MeV proton that can be delivered to the target is about 20 Gy during 199 ms pulse period at 100 Gy/s dose rate.
Conclusion : A proton research irradiation platform dedicated for studying small animal Flash biological effects has been established at Chang Gung Memorial Hospital. The final setup data represent a reference for the beam users to plan the experiments as well as for the improvement of the facility.
△ Less
Submitted 4 January, 2023;
originally announced January 2023.
-
An X-Ray-dim "Isolated'' Neutron Star in a Binary?
Authors:
Jie Lin,
Chunqian Li,
Weiyang Wang,
Heng Xu,
**chen Jiang,
Daoye Yang,
Shahidin Yaqup,
Abdusamatjan Iskandar,
Shuguo Ma,
Hubiao Niu,
Ali Esamdin,
Shuai Liu,
Gavin Ramsay,
Jose I. Vines,
Jianrong Shi,
Renxin Xu
Abstract:
We report the discovery of a dark companion to 2MASS J15274848+3536572 with an orbital period of 6.14 hr. Combining the radial velocity from LAMOST observations and modelling of the multiband light curve, one obtains a mass function of $\simeq 0.131~\rm M_{\odot}$, an inclination of $45.20^\circ{}^{+0.13^{\circ}}_{-0.20^{\circ}}$, and a mass ratio of $0.631^{+0.014}_{-0.003}$, which demonstrate th…
▽ More
We report the discovery of a dark companion to 2MASS J15274848+3536572 with an orbital period of 6.14 hr. Combining the radial velocity from LAMOST observations and modelling of the multiband light curve, one obtains a mass function of $\simeq 0.131~\rm M_{\odot}$, an inclination of $45.20^\circ{}^{+0.13^{\circ}}_{-0.20^{\circ}}$, and a mass ratio of $0.631^{+0.014}_{-0.003}$, which demonstrate the binary nature of the dark companion with mass of $0.98 \pm 0.03\rm M_{\odot}$ and a main-sequence K9-M0 star of $0.62 \pm 0.01~\rm M_{\odot}$. LAMOST optical spectra at a range of orbital phases reveal extra-peaked Halpha emission that suggests the presence of an accretion disk. The dark companion does not seem to be a white dwarf because of the lack of any observed dwarf nova outbursts in the long-term data archive, although a magnetic white dwarf cannot be excluded. Alternatively, we propose a scenario wherein the dark companion is a neutron star, but we have not detected radio pulsations or a single pulse from the system with the FAST (Five-hundred-meter Aperture Spherical radio Telescope), which hints at a radio-quiet compact object. If the dark companion is identified as a neutron star, it will be the nearest ( ~ 118 pc) and lightest neutron star. Furthermore, a kinematic analysis of the system's orbit in the galaxy may suggest its supernova event is associated with the radionuclide $^{60} \rm Fe$ signal observed from the deep-sea crusts. This radio-quiet and X-ray-dim nearby neutron star may resemble an XDINS (X-ray-dim isolated neutron star), but in a binary.
△ Less
Submitted 10 February, 2023; v1 submitted 20 October, 2022;
originally announced October 2022.
-
Scattering variability detected from the circumsource medium of FRB 20190520B
Authors:
S. K. Ocker,
J. M. Cordes,
S. Chatterjee,
D. Li,
C. H. Niu,
J. W. McKee,
C. J. Law,
R. Anna-Thomas
Abstract:
Fast radio bursts (FRBs) are millisecond-timescale radio transients, the origins of which are predominantly extragalactic and likely involve highly magnetized compact objects. FRBs undergo multipath propagation, or scattering, from electron density fluctuations on sub-parsec scales in ionized gas along the line-of-sight. Scattering observations have located plasma structures within FRB host galaxi…
▽ More
Fast radio bursts (FRBs) are millisecond-timescale radio transients, the origins of which are predominantly extragalactic and likely involve highly magnetized compact objects. FRBs undergo multipath propagation, or scattering, from electron density fluctuations on sub-parsec scales in ionized gas along the line-of-sight. Scattering observations have located plasma structures within FRB host galaxies, probed Galactic and extragalactic turbulence, and constrained FRB redshifts. Scattering also inhibits FRB detection and biases the observed FRB population. We report the detection of scattering times from the repeating FRB 20190520B that vary by up to a factor of two or more on minutes to days-long timescales. In one notable case, the scattering time varied from $7.9\pm0.4$ ms to less than 3.1 ms ($95\%$ confidence) over 2.9 minutes at 1.45 GHz. The scattering times appear to be uncorrelated between bursts or with dispersion and rotation measure variations. Scattering variations are attributable to dynamic, inhomogeneous plasma in the circumsource medium, and analogous variations have been observed from the Crab pulsar. Under such circumstances, the frequency dependence of scattering can deviate from the typical power-law used to measure scattering. Similar variations may therefore be detectable from other FRBs, even those with inconspicuous scattering, providing a unique probe of small-scale processes within FRB environments.
△ Less
Submitted 29 November, 2022; v1 submitted 4 October, 2022;
originally announced October 2022.
-
MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization
Authors:
Hui Niu,
Siyuan Li,
Jian Li
Abstract:
Portfolio management is a fundamental problem in finance. It involves periodic reallocations of assets to maximize the expected returns within an appropriate level of risk exposure. Deep reinforcement learning (RL) has been considered a promising approach to solving this problem owing to its strong capability in sequential decision making. However, due to the non-stationary nature of financial mar…
▽ More
Portfolio management is a fundamental problem in finance. It involves periodic reallocations of assets to maximize the expected returns within an appropriate level of risk exposure. Deep reinforcement learning (RL) has been considered a promising approach to solving this problem owing to its strong capability in sequential decision making. However, due to the non-stationary nature of financial markets, applying RL techniques to portfolio optimization remains a challenging problem. Extracting trading knowledge from various expert strategies could be helpful for agents to accommodate the changing markets. In this paper, we propose MetaTrader, a novel two-stage RL-based approach for portfolio management, which learns to integrate diverse trading policies to adapt to various market conditions. In the first stage, MetaTrader incorporates an imitation learning objective into the reinforcement learning framework. Through imitating different expert demonstrations, MetaTrader acquires a set of trading policies with great diversity. In the second stage, MetaTrader learns a meta-policy to recognize the market conditions and decide on the most proper learned policy to follow. We evaluate the proposed approach on three real-world index datasets and compare it to state-of-the-art baselines. The empirical results demonstrate that MetaTrader significantly outperforms those baselines in balancing profits and risks. Furthermore, thorough ablation studies validate the effectiveness of the components in the proposed approach.
△ Less
Submitted 1 September, 2022;
originally announced October 2022.
-
Domain-Unified Prompt Representations for Source-Free Domain Generalization
Authors:
Hong**g Niu,
Hanting Li,
Feng Zhao,
Bin Li
Abstract:
Domain generalization (DG), aiming to make models work on unseen domains, is a surefire way toward general artificial intelligence. Limited by the scale and diversity of current DG datasets, it is difficult for existing methods to scale to diverse domains in open-world scenarios (e.g., science fiction and pixelate style). Therefore, the source-free domain generalization (SFDG) task is necessary an…
▽ More
Domain generalization (DG), aiming to make models work on unseen domains, is a surefire way toward general artificial intelligence. Limited by the scale and diversity of current DG datasets, it is difficult for existing methods to scale to diverse domains in open-world scenarios (e.g., science fiction and pixelate style). Therefore, the source-free domain generalization (SFDG) task is necessary and challenging. To address this issue, we propose an approach based on large-scale vision-language pretraining models (e.g., CLIP), which exploits the extensive domain information embedded in it. The proposed scheme generates diverse prompts from a domain bank that contains many more diverse domains than existing DG datasets. Furthermore, our method yields domain-unified representations from these prompts, thus being able to cope with samples from open-world domains. Extensive experiments on mainstream DG datasets, namely PACS, VLCS, OfficeHome, and DomainNet, show that the proposed method achieves competitive performance compared to state-of-the-art (SOTA) DG methods that require source domain data for training. Besides, we collect a small datasets consists of two domains to evaluate the open-world domain generalization ability of the proposed method. The source code and the dataset will be made publicly available at https://github.com/muse1998/Source-Free-Domain-Generalization
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Stable solid molecular hydrogen above 900K from a machine-learned potential trained with diffusion Quantum Monte Carlo
Authors:
Hongwei Niu,
Yubo Yang,
Scott Jensen,
Markus Holzmann,
Carlo Pierleoni,
David M. Ceperley
Abstract:
We survey the phase diagram of high-pressure molecular hydrogen with path integral molecular dynamics using a machine-learned interatomic potential trained with Quantum Monte Carlo forces and energies. Besides the HCP and C2/c-24 phases, we find two new stable phases both with molecular centers in the Fmmm-4 structure, separated by a molecular orientation transition with temperature. The high temp…
▽ More
We survey the phase diagram of high-pressure molecular hydrogen with path integral molecular dynamics using a machine-learned interatomic potential trained with Quantum Monte Carlo forces and energies. Besides the HCP and C2/c-24 phases, we find two new stable phases both with molecular centers in the Fmmm-4 structure, separated by a molecular orientation transition with temperature. The high temperature isotropic Fmmm-4 phase has a reentrant melting line with a maximum at higher temperature (1450K at 150GPa) than previously estimated and crosses the liquid-liquid transition line around 1200K and 200GPa.
△ Less
Submitted 14 February, 2023; v1 submitted 1 September, 2022;
originally announced September 2022.
-
Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild
Authors:
Hanting Li,
Hong**g Niu,
Zhaoqing Zhu,
Feng Zhao
Abstract:
Compared with the image-based static facial expression recognition (SFER) task, the dynamic facial expression recognition (DFER) task based on video sequences is closer to the natural expression recognition scene. However, DFER is often more challenging. One of the main reasons is that video sequences often contain frames with different expression intensities, especially for the facial expressions…
▽ More
Compared with the image-based static facial expression recognition (SFER) task, the dynamic facial expression recognition (DFER) task based on video sequences is closer to the natural expression recognition scene. However, DFER is often more challenging. One of the main reasons is that video sequences often contain frames with different expression intensities, especially for the facial expressions in the real-world scenarios, while the images in SFER frequently present uniform and high expression intensities. However, if the expressions with different intensities are treated equally, the features learned by the networks will have large intra-class and small inter-class differences, which is harmful to DFER. To tackle this problem, we propose the global convolution-attention block (GCA) to rescale the channels of the feature maps. In addition, we introduce the intensity-aware loss (IAL) in the training process to help the network distinguish the samples with relatively low expression intensities. Experiments on two in-the-wild dynamic facial expression datasets (i.e., DFEW and FERV39k) indicate that our method outperforms the state-of-the-art DFER approaches. The source code will be made publicly available.
△ Less
Submitted 19 August, 2022;
originally announced August 2022.
-
KIC 10417986: Spectroscopic confirmation of the nature of the binary system with a δ Scuti component
Authors:
Guo-Jie Feng,
Ali Esamdin,
Jian-Ning Fu,
Hu-Biao Niu,
Peng Zong,
Tao-Zhi Yang,
Shu-Guo Ma,
**g Xu,
Chun-Hai Bai,
Yong Wang,
Wei-Chao Sun,
Xin-Liang Wang
Abstract:
KIC 10417986 is a short orbital period (0.0737 d) ellipsoidal variable star with a δ Scuti and γ Doradus hybrid pulsations component discovered by Kepler. The ground-based spectroscopic observations were carried out in the winters of 2020 and 2021 to investigate the binary nature of this star. We derive the orbital parameters using the rvfit code with a result of K1 = 29.7 $\pm$ 1.5 km/s, γ = -18.…
▽ More
KIC 10417986 is a short orbital period (0.0737 d) ellipsoidal variable star with a δ Scuti and γ Doradus hybrid pulsations component discovered by Kepler. The ground-based spectroscopic observations were carried out in the winters of 2020 and 2021 to investigate the binary nature of this star. We derive the orbital parameters using the rvfit code with a result of K1 = 29.7 $\pm$ 1.5 km/s, γ = -18.7 $\pm$ 1.7 km/s, and confirm an orbital period of 0.84495 d instead of the result given by Kepler. The atmospheric parameters of the primary are determined by the synthetic spectra fitting technique with the estimated values of Teff = 7411 $\pm$ 187 K, log g = 4.2 $\pm$ 0.3 dex, [M/H] = 0.08 $\pm$ 0.09 dex and vsini = 52 $\pm$ 11 km/s. KIC 10417986 is a circular orbit binary system. From the single-lined nature and mass function of the star, the derived orbital inclination is 26 $\pm$ 6°, and the mass of the secondary is from 0.43 to 0.7 M_sun, which should be a late-K to early-M type star. Fourteen frequencies are extracted from Kepler light curves, of which six independent frequencies in the high-frequency region are identified as the p-mode pulsations of δ Scuti star, and one independent frequency in the low-frequency region (f2 = 1.3033 c/d) is probably the rotational frequency due to the starspots rather than the ellipsoidal effect or g-mode of γ Doradus.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
Discriminator-Guided Model-Based Offline Imitation Learning
Authors:
Wenjia Zhang,
Haoran Xu,
Haoyi Niu,
Peng Cheng,
Ming Li,
Heming Zhang,
Guyue Zhou,
Xianyuan Zhan
Abstract:
Offline imitation learning (IL) is a powerful method to solve decision-making problems from expert demonstrations without reward labels. Existing offline IL methods suffer from severe performance degeneration under limited expert data. Including a learned dynamics model can potentially improve the state-action space coverage of expert data, however, it also faces challenging issues like model appr…
▽ More
Offline imitation learning (IL) is a powerful method to solve decision-making problems from expert demonstrations without reward labels. Existing offline IL methods suffer from severe performance degeneration under limited expert data. Including a learned dynamics model can potentially improve the state-action space coverage of expert data, however, it also faces challenging issues like model approximation/generalization errors and suboptimality of rollout data. In this paper, we propose the Discriminator-guided Model-based offline Imitation Learning (DMIL) framework, which introduces a discriminator to simultaneously distinguish the dynamics correctness and suboptimality of model rollout data against real expert demonstrations. DMIL adopts a novel cooperative-yet-adversarial learning strategy, which uses the discriminator to guide and couple the learning process of the policy and dynamics model, resulting in improved model performance and robustness. Our framework can also be extended to the case when demonstrations contain a large proportion of suboptimal data. Experimental results show that DMIL and its extension achieve superior performance and robustness compared to state-of-the-art offline IL methods under small datasets.
△ Less
Submitted 10 January, 2023; v1 submitted 1 July, 2022;
originally announced July 2022.
-
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
Authors:
Haoyi Niu,
Shubham Sharma,
Yiwen Qiu,
Ming Li,
Guyue Zhou,
Jianming Hu,
Xianyuan Zhan
Abstract:
Learning effective reinforcement learning (RL) policies to solve real-world complex tasks can be quite challenging without a high-fidelity simulation environment. In most cases, we are only given imperfect simulators with simplified dynamics, which inevitably lead to severe sim-to-real gaps in RL policy learning. The recently emerged field of offline RL provides another possibility to learn polici…
▽ More
Learning effective reinforcement learning (RL) policies to solve real-world complex tasks can be quite challenging without a high-fidelity simulation environment. In most cases, we are only given imperfect simulators with simplified dynamics, which inevitably lead to severe sim-to-real gaps in RL policy learning. The recently emerged field of offline RL provides another possibility to learn policies directly from pre-collected historical data. However, to achieve reasonable performance, existing offline RL algorithms need impractically large offline data with sufficient state-action space coverage for training. This brings up a new question: is it possible to combine learning from limited real data in offline RL and unrestricted exploration through imperfect simulators in online RL to address the drawbacks of both approaches? In this study, we propose the Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning (H2O) framework to provide an affirmative answer to this question. H2O introduces a dynamics-aware policy evaluation scheme, which adaptively penalizes the Q function learning on simulated state-action pairs with large dynamics gaps, while also simultaneously allowing learning from a fixed real-world dataset. Through extensive simulation and real-world tasks, as well as theoretical analysis, we demonstrate the superior performance of H2O against other cross-domain online and offline RL algorithms. H2O provides a brand new hybrid offline-and-online RL paradigm, which can potentially shed light on future RL algorithm design for solving practical real-world tasks.
△ Less
Submitted 11 January, 2023; v1 submitted 27 June, 2022;
originally announced June 2022.
-
Arecibo and FAST Timing Follow-up of twelve Millisecond Pulsars Discovered in Commensal Radio Astronomy FAST Survey
Authors:
C. C. Miao,
W. W. Zhu,
D. Li,
P. C. C. Freire,
J. R. Niu,
P. Wang,
J. P. Yuan,
M. Y. Xue,
A. D. Cameron,
D. J. Champion,
M. Cruces,
Y. T. Chen,
M. M. Chi,
X. F. Cheng,
S. J. Dang,
M. F. Ding,
Y. Feng,
Z. Y. Gan,
G. Hobbs,
M. Kramer,
Z. J. Liu,
Y. X. Li,
Z. K. Luo,
X. L. Miao,
L. Q. Meng
, et al. (24 additional authors not shown)
Abstract:
We report the phase-connected timing ephemeris, polarization pulse profiles, Faraday rotation measurements, and Rotating-Vector-Model (RVM) fitting results of twelve millisecond pulsars (MSPs) discovered with the Five-hundred-meter Aperture Spherical radio Telescope (FAST) in the Commensal radio Astronomy FAST survey (CRAFTS). The timing campaigns were carried out with FAST and Arecibo over three…
▽ More
We report the phase-connected timing ephemeris, polarization pulse profiles, Faraday rotation measurements, and Rotating-Vector-Model (RVM) fitting results of twelve millisecond pulsars (MSPs) discovered with the Five-hundred-meter Aperture Spherical radio Telescope (FAST) in the Commensal radio Astronomy FAST survey (CRAFTS). The timing campaigns were carried out with FAST and Arecibo over three years. Eleven of the twelve pulsars are in neutron star - white dwarf binary systems, with orbital periods between 2.4 and 100 d. Ten of them have spin periods, companion masses, and orbital eccentricities that are consistent with the theoretical expectations for MSP - Helium white dwarf (He WD) systems. The last binary pulsar (PSR J1912$-$0952) has a significantly smaller spin frequency and a smaller companion mass, the latter could be caused by a low orbital inclination for the system. Its orbital period of 29 days is well within the range of orbital periods where some MSP - He WD systems have shown anomalous eccentricities, however, the eccentricity of PSR J1912$-$0952 is typical of what one finds for the remaining MSP - He WD systems.
△ Less
Submitted 9 May, 2022;
originally announced May 2022.
-
Group Theory Analysis of Phonons in Monolayer Chromium Trihalides and Their Janus Structures
Authors:
Y. C. Liu,
H. B. Niu,
J. B. Lin,
V. Wang
Abstract:
A contrastive investigation of the symmetry aspects of phonons in monolayer chromium trihalides and their Janus structures Y$_3$-Cr$_2$-X$_3$ (X, Y = F, Cl, Br, I) by group theory is presented. We first classify all phonons at the Brillouin-zone center ($Γ$) into the irreducible representation. Then the infrared and Raman activity of optic phonons, Raman tensors, and the possible polarization assi…
▽ More
A contrastive investigation of the symmetry aspects of phonons in monolayer chromium trihalides and their Janus structures Y$_3$-Cr$_2$-X$_3$ (X, Y = F, Cl, Br, I) by group theory is presented. We first classify all phonons at the Brillouin-zone center ($Γ$) into the irreducible representation. Then the infrared and Raman activity of optic phonons, Raman tensors, and the possible polarization assignments of R active phonons are predicted. Base on these results, we clarify the the discrepancy about the Raman activity o optic modes in monolayer CrI$_3$. Besides, we find that the Raman and infrared spectra for X$_3$-Cr$_2$-X$_3$ are exclusive, whereas that for Janus Y$_3$-Cr$_2$-X$_3$ are coincident. This distinction is vital for optic spectra identification of Janus Y$_3$-Cr$_2$-X$_3$ monolayer from X$_3$-Cr$_2$-X$_3$ monolayer. In addition, we derive the symmetry-matched phonon eigenfunctions and corresponding schematic representations of the eigenvectors for both F$_3$-Cr$_2$-I$_3$ and I$_3$-Cr$_2$-I$_3$ monolayer, which demonstrate intuitively the origin of phonon chirality and magnetism. At last, our analysis indicates that the spin-phonon coupling, the magneto-optical effect of infrared and Raman active phonons, and phonon chirality should be observed in Janus Y$_3$-Cr$_2$-X$_3$ monolayer as that and even easier than that in X$_3$-Cr$_2$-X$_3$ monolayer. Our work provides a detailed guiding map for experimental characterization of Y$_3$-Cr$_2$-X$_3$ monolayer, and also reveals important effects of optic phonons in Janus Y$_3$-Cr$_2$-X$_3$ monolayer.
△ Less
Submitted 28 July, 2022; v1 submitted 12 April, 2022;
originally announced April 2022.
-
Magnetic Field Reversal around an Active Fast Radio Burst
Authors:
S. Dai,
Y. Feng,
Y. P. Yang,
Y. K. Zhang,
D. Li,
C. H. Niu,
P. Wang,
M. Y. Xue,
B. Zhang,
S. Burke-Spolaor,
C. J. Law,
R. S. Lynch,
L. Connor,
R. Anna-Thomas,
L. Zhang,
R. Duan,
J. M. Yao,
C. W. Tsai,
W. W. Zhu,
M. Cruces,
G. Hobbs,
C. C. Miao,
J. R. Niu,
M. D. Filipovic,
S. Q. Zhu
Abstract:
The environment of actively repeating fast radio bursts (FRBs) has been shown to be complex and varying. The recently localized FRB 20190520B is extremely active, has the largest confirmed host dispersion measure, and is only the second FRB source associated with a compact, persistent radio source (PRS). The main tracer of the magneto-ionic environments is the rotation measure (RM), a path-integra…
▽ More
The environment of actively repeating fast radio bursts (FRBs) has been shown to be complex and varying. The recently localized FRB 20190520B is extremely active, has the largest confirmed host dispersion measure, and is only the second FRB source associated with a compact, persistent radio source (PRS). The main tracer of the magneto-ionic environments is the rotation measure (RM), a path-integral of the line-of-sight component of magnetic field strength (B) and electron density, which does not allow a direct probe of the B-field configuration. Here we report direct evidence for a B-field reversal based on the observed sign change and extreme variation of FRB 20190520B's RM, which changed from $\sim10000$ rad m$^{-2}$ to $\sim-16000$ rad m$^{-2}$ between June 2021 and January 2022. Such extreme RM reversal has never been observed before in any FRB nor in any astronomical object. The implied short-term change of the B-field configuration in or around the FRB could be due to the vicinity of massive black holes, or a magnetized companion star in binary systems, or a young supernova remnant along the line of sight.
△ Less
Submitted 11 May, 2023; v1 submitted 15 March, 2022;
originally announced March 2022.
-
Network resilience in the aging brain
Authors:
Tao Liu,
Shu Guo,
Hao Liu,
Rui Kang,
Mingyang Bai,
Jiyang Jiang,
Wei Wen,
Xing Pan,
Jun Tai,
Jianxin Li,
Jian Cheng,
**g **g,
Zhenzhou Wu,
Haijun Niu,
Haogang Zhu,
Zixiao Li,
Yongjun Wang,
Henry Brodaty,
Perminder Sachdev,
Daqing Li
Abstract:
Degeneration and adaptation are two competing sides of the same coin called resilience in the progressive processes of brain aging or diseases. Degeneration accumulates during brain aging and other cerebral activities, causing structural atrophy and dysfunction. At the same time, adaptation allows brain network reorganize to compensate for structural loss to maintain cognition function. Although h…
▽ More
Degeneration and adaptation are two competing sides of the same coin called resilience in the progressive processes of brain aging or diseases. Degeneration accumulates during brain aging and other cerebral activities, causing structural atrophy and dysfunction. At the same time, adaptation allows brain network reorganize to compensate for structural loss to maintain cognition function. Although hidden resilience mechanism is critical and fundamental to uncover the brain aging law, due to the lack of datasets and appropriate methodology, it remains essentially unknown how these two processes interact dynamically across brain networks. To quantitatively investigate this complex process, we analyze aging brains based on 6-year follow-up multimodal neuroimaging database from 63 persons. We reveal the critical mechanism of network resilience that various perturbation may cause fast brain structural atrophy, and then brain can reorganize its functional layout to lower its operational efficiency, which helps to slow down the structural atrophy and finally recover its functional efficiency equilibrium. This empirical finding could be explained by our theoretical model, suggesting one universal resilience dynamical function. This resilience is achieved in the brain functional network with evolving percolation and rich-club features. Our findings can help to understand the brain aging process and design possible mitigation methods to adjust interaction between degeneration and adaptation from resilience viewpoint.
△ Less
Submitted 3 February, 2022;
originally announced February 2022.
-
The Properties and Evolutions of Starspots on Three Detached Eclipsing Binaries in the LAMOST-Kepler survey
Authors:
Jiaxin Wang,
Jianning Fu,
Weikai Zong,
Yang Pan,
Hubiao Niu,
Bo Zhang,
Yong Zhang
Abstract:
The spotted detached eclipsing binary (DEB) offers insights into starspots on the binary. Three spotted DEBs, KIC 8097825, KIC 6859813, and KIC 5527172, which were observed by the Kepler photometry and LAMOST spectroscopy, are studied in this work. The physical parameters of binaries are determined by binary modeling. The sizes, lifetimes, and single/double-dip ratio (SDR) of starspots are derived…
▽ More
The spotted detached eclipsing binary (DEB) offers insights into starspots on the binary. Three spotted DEBs, KIC 8097825, KIC 6859813, and KIC 5527172, which were observed by the Kepler photometry and LAMOST spectroscopy, are studied in this work. The physical parameters of binaries are determined by binary modeling. The sizes, lifetimes, and single/double-dip ratio (SDR) of starspots are derived by starspot analysis. KIC 8097825 has large starspots. KIC 6859813 has a spot rotation period shorter than its orbital period but the system should be synchronized inferred from timescale estimation. The difference may be the result of the surface differential rotation. The KIC 5527172 has a long spot lifetime and an M dwarf component with an inflation radius. The primaries of these binaries and the secondary of KIC 8097825 have spots. Adding spotted DEBs of literature, we compare the starspots on binaries with those on the single stars. The spot sizes of starspots on 65% binaries are smaller than the median of those on single stars. The lifetimes of starspots on binaries are consistent with those on single stars when the rotation periods are larger than 3 days. SDRs for half of the binaries are consistent with those of single star systems, while another half are smaller. The relative lifetime positively correlates with the RMS and SDR but negatively correlates with the rotation period. These relations are similar to those of spots on the single star systems. Binaries with luminosity ratios close to the unit tend to have more double dips.
△ Less
Submitted 21 January, 2022;
originally announced January 2022.
-
Double Intelligent Reflecting Surface-assisted Multi-User MIMO mmWave Systems with Hybrid Precoding
Authors:
Hehao Niu,
Zheng Chu,
Fuhui Zhou,
Cunhua Pan,
Derrick Wing Kwan Ng,
Huan X. Nguyen
Abstract:
This work investigates the effect of double intelligent reflecting surface (IRS) in improving the spectrum efficient of multi-user multiple-input multiple-output (MIMO) network operating in the millimeter wave (mmWave) band. Specifically, we aim to solve a weighted sum rate maximization problem by jointly optimizing the digital precoding at the transmitter and the analog phase shifters at the IRS,…
▽ More
This work investigates the effect of double intelligent reflecting surface (IRS) in improving the spectrum efficient of multi-user multiple-input multiple-output (MIMO) network operating in the millimeter wave (mmWave) band. Specifically, we aim to solve a weighted sum rate maximization problem by jointly optimizing the digital precoding at the transmitter and the analog phase shifters at the IRS, subject to the minimum achievable rate constraint. To facilitate the design of an efficient solution, we first reformulate the original problem into a tractable one by exploiting the majorization-minimization (MM) method. Then, a block coordinate descent (BCD) method is proposed to obtain a suboptimal solution, where the precoding matrices and the phase shifters are alternately optimized. Specifically, the digital precoding matrix design problem is solved by the quadratically constrained quadratic programming (QCQP), while the analog phase shift optimization is solved by the Riemannian manifold optimization (RMO). The convergence and computational complexity are analyzed. Finally, simulation results are provided to verify the performance of the proposed design, as well as the effectiveness of double-IRS in improving the spectral efficiency.
△ Less
Submitted 26 November, 2021;
originally announced November 2021.