-
Magnetic Excitations in Strained Infinite-layer Nickelate PrNiO2
Authors:
Qiang Gao,
Shiyu Fan,
Qisi Wang,
Jiarui Li,
Xiaolin Ren,
Izabela Biało,
Annabella Drewanowski,
Pascal Rothenbühler,
Jaewon Choi,
Yao Wang,
Tao Xiang,
Jiangping Hu,
Ke-Jin Zhou,
Valentina Bisogni,
Riccardo Comin,
J. Chang,
Jonathan Pelliciari,
X. J. Zhou,
Zhihai Zhu
Abstract:
Strongly correlated materials often respond sensitively to external perturbations. In the recently discovered superconducting infinite-layer nickelates, the superconducting transition temperature can be dramatically enhanced by only ~1% compressive strain-tuning enabled by substrate design. However, the root of this enhancement remains elusive. While the superconducting pairing mechanism is still not settled, magnetic Cooper pairing, similar to that in the cuprates, has been proposed. Using resonant inelastic x-ray scattering, we investigate the magnetic excitations in infinite-layer PrNiO2 thin films under different strain conditions. The magnon bandwidth of PrNiO2 shows only a marginal response to strain-tuning, in sharp contrast to the striking enhancement of the superconducting transition temperature Tc in the doped superconducting samples. These results suggest that the enhancement of Tc is not mediated by spin excitations and thus provide important empirical input for the understanding of superconductivity in infinite-layer nickelates.
Submitted 10 August, 2022;
originally announced August 2022.
-
Strain-modulated anisotropic electronic structure in superconducting RuO$_2$ films
Authors:
Connor A. Occhialini,
Luiz G. P. Martins,
Shiyu Fan,
Valentina Bisogni,
Takahiro Yasunami,
Maki Musashi,
Masashi Kawasaki,
Masaki Uchida,
Riccardo Comin,
Jonathan Pelliciari
Abstract:
The binary ruthenate, RuO$_2$, has been the subject of intense interest due to its itinerant antiferromagnetism and strain-induced superconductivity. The strain mechanism and its effect on the microscopic electronic states leading to the normal and superconducting state, however, remain undisclosed. Here, we investigate highly-strained epitaxial (110) RuO$_2$ films using polarization-dependent oxygen K-edge X-ray absorption spectroscopy (XAS). Through the detection of pre-edge peaks, arising from O:$2p$ - Ru:$4d$ hybridization, we uncover the effects of epitaxial strain on the orbital/electronic structure near the Fermi level. Our data show robust strain-induced shifts of orbital levels and a reduction of hybridization strength. Furthermore, we reveal a pronounced in-plane anisotropy of the electronic structure along the $[110]/[1\bar{1}0]$ directions naturally stemming from the symmetry-breaking epitaxial strain of the substrate. The $B_{2g}$ symmetry component of the epitaxially-enforced strain breaks a sublattice degeneracy, resulting in an increase of the density of states at the Fermi level ($E_F$), possibly paving the way to superconductivity. These results underscore the importance of the effective reduction from tetragonal to orthorhombic lattice symmetry in (110) RuO$_2$ films and its relevance towards the superconducting and magnetic properties.
Submitted 1 August, 2022;
originally announced August 2022.
-
Salient Object Detection for Point Clouds
Authors:
Songlin Fan,
Wei Gao,
Ge Li
Abstract:
This paper studies an unexplored task: point cloud salient object detection (SOD). Differing from SOD for images, we find that the attention shift of point clouds may provoke saliency conflict, i.e., an object paradoxically belongs to both salient and non-salient categories. To avoid this issue, we present a novel view-dependent perspective of salient objects, reasonably reflecting the most eye-catching objects in point cloud scenarios. Following this formulation, we introduce PCSOD, the first dataset proposed for point cloud SOD, consisting of 2,872 in-/out-door 3D views. The samples in our dataset are labeled with hierarchical annotations, e.g., super-/sub-class, bounding box, and segmentation map, which give the dataset strong generalizability and broad applicability for verifying various conjectures. To demonstrate the feasibility of our solution, we further contribute a baseline model and benchmark five representative models for a comprehensive comparison. The proposed model can effectively analyze irregular and unordered points for detecting salient objects. Thanks to its task-tailored designs, our method shows visible superiority over other baselines, producing more satisfactory results. Extensive experiments and discussions reveal the promising potential of this research field, paving the way for further study.
Submitted 24 July, 2022;
originally announced July 2022.
-
SPR: Supervised Personalized Ranking Based on Prior Knowledge for Recommendation
Authors:
Chun Yang,
Shicai Fan
Abstract:
The goal of a recommendation system is to model the relevance between each user and each item through the user-item interaction history, so as to maximize the scores of positive samples and minimize those of negative samples. Currently, two popular loss functions are widely used to optimize recommender systems: the pointwise and the pairwise. Although widely used, these loss functions have two problems. (1) These traditional loss functions neither fit the goals of recommendation systems adequately nor utilize prior knowledge sufficiently. (2) The slow convergence speed of these traditional loss functions makes the practical application of various recommendation models difficult.
To address these issues, we propose a novel loss function named Supervised Personalized Ranking (SPR) Based on Prior Knowledge. The proposed method improves the BPR loss by exploiting the prior knowledge on the interaction history of each user or item in the raw data. Unlike BPR, instead of constructing <user, positive item, negative item> triples, the proposed SPR constructs <user, similar user, positive item, negative item> quadruples. Although SPR is very simple, it is very effective. Extensive experiments show that our proposed SPR not only achieves better recommendation performance, but also significantly accelerates convergence, resulting in a significant reduction in the required training time.
Submitted 7 July, 2022;
originally announced July 2022.
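As a rough illustration of the triple-based BPR loss that the SPR abstract above takes as its starting point, a minimal Python sketch follows. The function name `bpr_loss` and the scalar-score interface are assumptions made for illustration only. The abstract does not say how SPR's quadruples enter the loss, so only the standard BPR term, -log(sigmoid(s_pos - s_neg)), is shown.

```python
import math


def bpr_loss(pos_score: float, neg_score: float) -> float:
    """Standard BPR loss for one <user, positive item, negative item> triple.

    Computes -log(sigmoid(pos_score - neg_score)), i.e. softplus(-(pos - neg)),
    using a numerically stable branch so large score gaps do not overflow.
    The scalar-score interface is a hypothetical simplification.
    """
    diff = pos_score - neg_score
    if diff >= 0:
        # exp(-diff) <= 1 here, so this branch cannot overflow
        return math.log1p(math.exp(-diff))
    # For negative diff, rewrite softplus(-diff) as -diff + log(1 + exp(diff))
    return -diff + math.log1p(math.exp(diff))


if __name__ == "__main__":
    # The loss shrinks as the positive item is scored further above the negative one.
    for gap in (-2.0, 0.0, 2.0):
        print(gap, bpr_loss(gap, 0.0))
```

The loss depends only on the score gap between the positive and negative item, which is why it is called a pairwise loss.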
-
Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery
Authors:
Yuehao Wang,
Yonghao Long,
Siu Hin Fan,
Qi Dou
Abstract:
Reconstruction of the soft tissues in robotic surgery from endoscopic stereo videos is important for many applications such as intra-operative navigation and image-guided robotic surgery automation. Previous works on this task mainly rely on SLAM-based approaches, which struggle to handle complex surgical scenes. Inspired by recent progress in neural rendering, we present a novel framework for deformable tissue reconstruction from binocular captures in robotic surgery under the single-viewpoint setting. Our framework adopts dynamic neural radiance fields to represent deformable surgical scenes in MLPs and optimizes shapes and deformations in a learning-based manner. In addition to non-rigid deformations, tool occlusion and poor 3D clues from a single viewpoint are also particular challenges in soft tissue reconstruction. To overcome these difficulties, we present a series of strategies: tool mask-guided ray casting, stereo depth-cueing ray marching, and stereo depth-supervised optimization. With experiments on DaVinci robotic surgery videos, our method significantly outperforms the current state-of-the-art reconstruction method in handling various complex non-rigid deformations. To the best of our knowledge, this is the first work leveraging neural rendering for surgical scene 3D reconstruction, and it demonstrates remarkable potential. Code is available at: https://github.com/med-air/EndoNeRF.
Submitted 30 June, 2022;
originally announced June 2022.
-
Evolutionary rationality of risk preference
Authors:
Songjia Fan,
Yi Tao,
Cong Li
Abstract:
Selection shapes all kinds of behaviors, including how we make decisions under uncertainty. The resulting risk attitude should be simple, flexible, yet consistent. In this paper we use evolutionary dynamics to find the decision-making rule concerning risk that is evolutionarily superior, and develop the theory of evolutionary rationality. We highlight the importance of selection intensity and fitness, as well as their equivalents in the human mind, termed attention degree and meta-fitness, in the decision-making process. Evolutionary rationality targets the maximization of the geometric mean of meta-fitness (or fitness), and attention degree (or selection intensity) holds the key to the change in attitude of the same individual towards different events and under varied situations. Then, as an example, the Allais paradox is presented to show the application of evolutionary rationality, in which the anomalous choices made by the majority of people can be well justified by a simple change in the attention degree.
Submitted 20 June, 2022;
originally announced June 2022.
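To make the geometric-mean criterion mentioned in the abstract above concrete, here is a small Python sketch comparing arithmetic and geometric means of two gambles. The payoff numbers and function names are hypothetical, and the paper's own definitions of meta-fitness and attention degree are not given in the abstract, so they are not modeled here. The sketch only shows that a gamble can have the higher arithmetic mean and still the lower geometric mean.

```python
import math


def arithmetic_mean(outcomes, probs):
    """Expected value of a gamble with the given outcomes and probabilities."""
    return sum(p * x for x, p in zip(outcomes, probs))


def geometric_mean(outcomes, probs):
    """Probability-weighted geometric mean; outcomes must be strictly positive."""
    return math.exp(sum(p * math.log(x) for x, p in zip(outcomes, probs)))


# Hypothetical multiplicative growth factors, chosen purely for illustration.
safe = ([1.1], [1.0])
risky = ([2.0, 0.4], [0.5, 0.5])

if __name__ == "__main__":
    for name, (outcomes, probs) in (("safe", safe), ("risky", risky)):
        print(name, arithmetic_mean(outcomes, probs), geometric_mean(outcomes, probs))
```

Under these assumed numbers, an agent maximizing the arithmetic mean prefers the risky gamble, while one maximizing the geometric mean prefers the safe one.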
-
Laser cooling assisted thermal management of lightsails
Authors:
Weiliang Jin,
Wei Li,
Chinmay Khandekar,
Meir Orenstein,
Shanhui Fan
Abstract:
A lightsail can be accelerated to ultra-high speed by the radiation pressure of a laser having an intensity of the order of GW/m$^2$, which, however, presents a critical challenge in the thermal management of lightsails. In this letter, we explore the applicable regimes of solid-state laser cooling in dissipating heat in addition to the previously explored radiative cooling approach. We begin by examining the cooling capacity of laser cooling, and show that the cooling rate from a micron-thick layer doped with ytterbium ions can exceed that of blackbody thermal emission. This allows more intense laser illumination upon material damage, and consequently shortened acceleration distance. Next, we explore the impact of the limited operating bandwidth of laser cooling to account for the Doppler shift of the pumping laser, and conclude that laser cooling is helpful for target velocities $\lesssim5\%$ for room-temperature operations.
Submitted 10 June, 2022;
originally announced June 2022.
-
Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map
Authors:
Haodong Yuan,
Yudong Zhang,
Shengyin Fan,
Xue Li,
Jian Wang
Abstract:
The integration of a SLAM algorithm with place recognition technology empowers it with the ability to mitigate accumulated errors and to relocalize itself. However, existing methods for point cloud-based place recognition predominantly rely on the matching of descriptors, which are mostly lidar-centric. These methods suffer from two major drawbacks: first, they cannot perform place recognition when the distance between two point clouds is significant, and second, they can only calculate the rotation angle without considering the offset in the X and Y directions. To overcome these limitations, we propose a novel local descriptor that is constructed around the Main Object. By using a geometric method, we can accurately calculate the relative pose. We have provided a theoretical analysis to demonstrate that this method can overcome the aforementioned limitations. Furthermore, we conducted extensive experiments on KITTI Odometry and KITTI360, which indicate that our proposed method has significant advantages over state-of-the-art methods.
Submitted 13 November, 2023; v1 submitted 7 June, 2022;
originally announced June 2022.
-
Experimental evaluation of digitally-verifiable photonic computing for blockchain and cryptocurrency
Authors:
Sunil Pai,
Taewon Park,
Marshall Ball,
Bogdan Penkovsky,
Maziyar Milanizadeh,
Michael Dubrovsky,
Nathnael Abebe,
Francesco Morichetti,
Andrea Melloni,
Shanhui Fan,
Olav Solgaard,
David A. B. Miller
Abstract:
As blockchain technology and cryptocurrency become increasingly mainstream, ever-increasing energy costs required to maintain the computational power running these decentralized platforms create a market for more energy-efficient hardware. Photonic cryptographic hash functions, which use photonic integrated circuits to accelerate computation, promise energy efficiency for verifying transactions and mining in a cryptonetwork. Like many analog computing approaches, however, current proposals for photonic cryptographic hash functions that promise similar security guarantees as Bitcoin are susceptible to systematic error, so multiple devices may not reach a consensus on computation despite high numerical precision (associated with low photodetector noise). In this paper, we theoretically and experimentally demonstrate that a more general family of robust discrete analog cryptographic hash functions, which we introduce as LightHash, leverages integer matrix-vector operations on photonic mesh networks of interferometers. The difficulty of LightHash can be adjusted to be sufficiently tolerant to systematic error (calibration error, loss error, coupling error, and phase error) and preserve inherent security guarantees present in the Bitcoin protocol. Finally, going beyond our proof-of-concept, we define a ``photonic advantage'' criterion and justify how recent developments in CMOS optoelectronics (including analog-digital conversion) provably achieve such advantage for robust and digitally-verifiable photonic computing and ultimately generate a new market for decentralized photonic technology.
Submitted 17 May, 2022;
originally announced May 2022.
-
Experimentally realized in situ backpropagation for deep learning in nanophotonic neural networks
Authors:
Sunil Pai,
Zhanghao Sun,
Tyler W. Hughes,
Taewon Park,
Ben Bartlett,
Ian A. D. Williamson,
Momchil Minkov,
Maziyar Milanizadeh,
Nathnael Abebe,
Francesco Morichetti,
Andrea Melloni,
Shanhui Fan,
Olav Solgaard,
David A. B. Miller
Abstract:
Neural networks are widely deployed models across many scientific disciplines and commercial endeavors ranging from edge computing and sensing to large-scale signal processing in data centers. The most efficient and well-entrenched method to train such networks is backpropagation, or reverse-mode automatic differentiation. To counter an exponentially increasing energy budget in the artificial intelligence sector, there has been recent interest in analog implementations of neural networks, specifically nanophotonic neural networks for which no analog backpropagation demonstration exists. We design mass-manufacturable silicon photonic neural networks that alternately cascade our custom designed "photonic mesh" accelerator with digitally implemented nonlinearities. These reconfigurable photonic meshes program computationally intensive arbitrary matrix multiplication by setting physical voltages that tune the interference of optically encoded input data propagating through integrated Mach-Zehnder interferometer networks. Here, using our packaged photonic chip, we demonstrate in situ backpropagation for the first time to solve classification tasks and evaluate a new protocol to keep the entire gradient measurement and update of physical device voltages in the analog domain, improving on past theoretical proposals. Our method is made possible by introducing three changes to typical photonic meshes: (1) measurements at optical "grating tap" monitors, (2) bidirectional optical signal propagation automated by fiber switch, and (3) universal generation and readout of optical amplitude and phase. After training, our classification achieves accuracies similar to digital equivalents even in presence of systematic error. Our findings suggest a new training paradigm for photonics-accelerated artificial intelligence based entirely on a physical analog of the popular backpropagation technique.
Submitted 17 May, 2022;
originally announced May 2022.
-
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs
Authors:
Xu Wang,
Simin Fan,
Jessica Houghton,
Lu Wang
Abstract:
NLP-powered automatic question generation (QG) techniques carry great pedagogical potential for saving educators' time and benefiting student learning. Yet, QG systems have not been widely adopted in classrooms to date. In this work, we aim to pinpoint key impediments and investigate how to improve the usability of automatic QG techniques for educational purposes by understanding how instructors construct questions and identifying touch points to enhance the underlying NLP models. We perform an in-depth need-finding study with 11 instructors across 7 different universities, and summarize their thought processes and needs when creating questions. While instructors show great interest in using NLP systems to support question design, none of them has used such tools in practice. They resort to multiple sources of information, ranging from domain knowledge to students' misconceptions, all of which are missing from today's QG systems. We argue that building effective human-NLP collaborative QG systems that emphasize instructor control and explainability is imperative for real-world adoption. We call for QG systems to provide process-oriented support, use modular design, and handle diverse sources of input.
Submitted 30 April, 2022;
originally announced May 2022.
-
Simulating Fluids in Real-World Still Images
Authors:
Siming Fan,
Jingtan Piao,
Chen Qian,
Kwan-Yee Lin,
Hongsheng Li
Abstract:
In this work, we tackle the problem of real-world fluid animation from a still image. The key to our system is a surface-based layered representation deriving from video decomposition, where the scene is decoupled into a surface fluid layer and an impervious background layer with corresponding transparencies to characterize the composition of the two layers. The animated video can be produced by warping only the surface fluid layer according to the estimation of fluid motions and recombining it with the background. In addition, we introduce surface-only fluid simulation, a $2.5D$ fluid calculation version, as a replacement for motion estimation. Specifically, we leverage the triangular mesh based on a monocular depth estimator to represent the fluid surface layer and simulate the motion in the physics-based framework with the inspiration of the classic theory of the hybrid Lagrangian-Eulerian method, along with a learnable network so as to adapt to complex real-world image textures. We demonstrate the effectiveness of the proposed system through comparison with existing methods in both standard objective metrics and subjective ranking scores. Extensive experiments not only indicate our method's competitive performance for common fluid scenes but also better robustness and reasonability under complex transparent fluid scenarios. Moreover, as the proposed surface-based layer representation and surface-only fluid simulation naturally disentangle the scene, interactive editing such as adding objects to the river and texture replacing could be easily achieved with realistic results.
Submitted 24 April, 2022;
originally announced April 2022.
-
Efficient Pipeline Planning for Expedited Distributed DNN Training
Authors:
Ziyue Luo,
Xiaodong Yi,
Guoping Long,
Shiqing Fan,
Chuan Wu,
Jun Yang,
Wei Lin
Abstract:
To train modern large DNN models, pipeline parallelism has recently emerged, which distributes the model across GPUs and enables different devices to process different microbatches in a pipeline. Earlier pipeline designs allow multiple versions of model parameters to co-exist (similar to asynchronous training), and cannot ensure the same model convergence and accuracy performance as without pipelining. Synchronous pipelining has recently been proposed, which ensures model performance by enforcing a synchronization barrier between training iterations. Nonetheless, the synchronization barrier requires waiting for gradient aggregation from all microbatches and thus delays the training progress. Optimized pipeline planning is needed to minimize such wait and hence the training time, which has not been well studied in the literature. This paper designs efficient, near-optimal algorithms for expediting synchronous pipeline-parallel training of modern large DNNs over arbitrary inter-GPU connectivity. Our algorithm framework comprises two components: a pipeline partition and device mapping algorithm, and a pipeline scheduler that decides the processing order of microbatches over the partitions, which together minimize the per-iteration training time. We conduct thorough theoretical analysis, extensive testbed experiments and trace-driven simulation, and demonstrate our scheme can accelerate training up to 157% compared with state-of-the-art designs.
Submitted 22 April, 2022;
originally announced April 2022.
-
Transformer-Guided Convolutional Neural Network for Cross-View Geolocalization
Authors:
Teng Wang,
Shujuan Fan,
Daikun Liu,
Changyin Sun
Abstract:
Ground-to-aerial geolocalization refers to localizing a ground-level query image by matching it to a reference database of geo-tagged aerial imagery. This is very challenging due to the huge perspective differences in visual appearances and geometric configurations between these two views. In this work, we propose a novel Transformer-guided convolutional neural network (TransGCNN) architecture, which couples CNN-based local features with Transformer-based global representations for enhanced representation learning. Specifically, our TransGCNN consists of a CNN backbone extracting feature map from an input image and a Transformer head modeling global context from the CNN map. In particular, our Transformer head acts as a spatial-aware importance generator to select salient CNN features as the final feature representation. Such a coupling procedure allows us to leverage a lightweight Transformer network to greatly enhance the discriminative capability of the embedded features. Furthermore, we design a dual-branch Transformer head network to combine image features from multi-scale windows in order to improve details of the global feature representation. Extensive experiments on popular benchmark datasets demonstrate that our model achieves top-1 accuracy of 94.12\% and 84.92\% on CVUSA and CVACT_val, respectively, which outperforms the second-performing baseline with less than 50% parameters and almost 2x higher frame rate, therefore achieving a preferable accuracy-efficiency tradeoff.
Submitted 21 April, 2022;
originally announced April 2022.
-
Deformed in-medium similarity renormalization group
Authors:
Q. Yuan,
S. Q. Fan,
B. S. Hu,
J. G. Li,
S. Zhang,
S. M. Wang,
Z. H. Sun,
Y. Z. Ma,
F. R. Xu
Abstract:
We have developed an {\it ab initio} deformed in-medium similarity renormalization group (IMSRG) for open-shell nuclei. This is a single-reference IMSRG in a deformed Hartree-Fock (HF) basis. Deformed wave functions are more efficient in describing deformed nuclei. The broken spherical symmetry needs to be restored by angular momentum projection, which is computationally expensive. The angular momentum projection mainly captures the static correlations and can be estimated by the projection of the HF state. In this work, we perform deformed IMSRG calculations and add the correlation energy from projected HF as a leading-order approximation. As a test ground, we have calculated the deformed $^{8,10}\rm Be$ isotopes with the optimized chiral interaction NNLO$_{\rm opt}$. The results are benchmarked with the no-core shell model and valence-space IMSRG calculations. Then we systematically investigated the ground-state energies and charge radii of even-even isotopes from light beryllium to medium-mass magnesium. The calculated energies are extrapolated to infinite basis space by an exponential form, and compared with the extrapolated valence-space IMSRG results and available experimental data.
Submitted 14 April, 2022;
originally announced April 2022.
-
Seasonal variations of chemical species and haze in Titan's upper atmosphere
Authors:
Siteng Fan,
Daniel Zhao,
Cheng Li,
Donald E. Shemansky,
Mao-Chang Liang,
Yuk L. Yung
Abstract:
Seasonal variation is significant in Titan's atmosphere due to the large change of solar insolation resulting from Titan's 26.7° axial tilt relative to the plane of Saturn's orbit. Here we present an investigation of hydrocarbon and nitrile species in Titan's upper atmosphere at 400-1200 km, which includes the mesosphere and the lower thermosphere, over more than one fourth of Titan's year (2006-2014, LS=318°-60°), using eighteen stellar occultation observations obtained by Cassini/UVIS. Vertical profiles of eight chemical species (CH4, C2H2, C2H4, C2H6, C4H2, C6H6, HCN, HC3N) and haze particles are retrieved from these observations using an instrument forward model, which considers the technical issue of pointing motion. The Markov-chain Monte Carlo (MCMC) algorithm is used to obtain the posterior probability distributions of parameters in the retrieval, which inherently tests the extent to which species profiles can be constrained. The results show that no change of the species profiles is noticeable before the equinox, while the decrease of atmospheric temperature and significant upwelling in the summer hemisphere are found five terrestrial years afterwards. Altitude of the detached haze layer decreases towards the vernal equinox then it disappears, and no reappearance is identified within the time range of our data, which is consistent with observations from Cassini/ISS. This study provides observational constraints on the seasonal change of Titan's upper atmosphere, and suggests further investigations of the atmospheric chemistry and dynamics therein.
Submitted 14 April, 2022;
originally announced April 2022.
-
E^2TAD: An Energy-Efficient Tracking-based Action Detector
Authors:
Xin Hu,
Zhenyu Wu,
Hao-Yu Miao,
Siqi Fan,
Taiyu Long,
Zhenyu Hu,
Pengcheng Pi,
Yi Wu,
Zhou Ren,
Zhangyang Wang,
Gang Hua
Abstract:
Video action detection (spatio-temporal action localization) is usually the starting point for human-centric intelligent analysis of videos nowadays. It has high practical impacts for many applications across robotics, security, healthcare, etc. The two-stage paradigm of Faster R-CNN inspires a standard paradigm of video action detection in object detection, i.e., firstly generating person proposals and then classifying their actions. However, none of the existing solutions could provide fine-grained action detection to the "who-when-where-what" level. This paper presents a tracking-based solution to accurately and efficiently localize predefined key actions spatially (by predicting the associated target IDs and locations) and temporally (by predicting the time in exact frame indices). This solution won first place in the UAV-Video Track of 2021 Low-Power Computer Vision Challenge (LPCVC).
Submitted 29 October, 2022; v1 submitted 9 April, 2022;
originally announced April 2022.
-
Who is next: rising star prediction via diffusion of user interest in social networks
Authors:
Xuan Yang,
Yang Yang,
Jintao Su,
Yifei Sun,
Shen Fan,
Zhongyao Wang,
Jun Zhang,
Jingmin Chen
Abstract:
Finding items with the potential to increase sales is of great importance in online markets. In this paper, we propose to study this novel and practical problem: rising star prediction. We call these potential items Rising Stars, which implies their ability to rise from low-turnover items to best-sellers in the future. Rising stars can be used to help with unfair recommendation in e-commerce platforms, balance supply and demand to benefit the retailers, and allocate marketing resources rationally. Although the study of rising stars can bring great benefits, it also poses challenges. The sales trend of a rising star fluctuates sharply in the short term and exhibits more contingency caused by external events (e.g., COVID-19 caused increased purchases of face masks) than other items, which cannot be handled by existing sales prediction methods. To address these challenges, we observe that the presence of rising stars is closely correlated with the early diffusion of user interest in social networks, which is validated in the case of Taocode (an intermediary that diffuses user interest in Taobao). Thus, we propose a novel framework, RiseNet, to incorporate the user interest diffusion process with the item dynamic features to effectively predict rising stars. Specifically, we adopt a coupled mechanism to capture the dynamic interplay between items and user interest, and a specially designed GNN-based framework to quantify user interest. Our experimental results on large-scale real-world datasets provided by Taobao demonstrate the effectiveness of our proposed framework.
Submitted 29 March, 2022; v1 submitted 28 March, 2022;
originally announced March 2022.
-
Channel Self-Supervision for Online Knowledge Distillation
Authors:
Shixiao Fan,
Xuan Cheng,
Xiaomin Wang,
Chun Yang,
Pan Deng,
Minghui Liu,
Jiali Deng,
Ming Liu
Abstract:
Recently, researchers have shown increased interest in online knowledge distillation. Adopting a one-stage, end-to-end training fashion, online knowledge distillation uses aggregated intermediate predictions of multiple peer models for training. However, the absence of a powerful teacher model may result in the homogeneity problem between group peers, adversely affecting the effectiveness of group distillation. In this paper, we propose a novel online knowledge distillation method, Channel Self-Supervision for Online Knowledge Distillation (CSS), which structures diversity in terms of input, target, and network to alleviate the homogenization problem. Specifically, we construct a dual-network multi-branch structure and enhance inter-branch diversity through self-supervised learning, adopting the feature-level transformation and augmenting the corresponding labels. Meanwhile, the dual network structure has a larger space of independent parameters to resist the homogenization problem during distillation. Extensive quantitative experiments on CIFAR-100 illustrate that our method provides greater diversity than OKDDip and also improves performance, even over state-of-the-art methods such as PCL. The results on three fine-grained datasets (StanfordDogs, StanfordCars, CUB-200-2011) also show the significant generalization capability of our approach.
Submitted 23 March, 2022; v1 submitted 22 March, 2022;
originally announced March 2022.
-
Creating boundaries along a synthetic frequency dimension
Authors:
Avik Dutt,
Luqi Yuan,
Ki Youl Yang,
Kai Wang,
Siddharth Buddhiraju,
Jelena Vučković,
Shanhui Fan
Abstract:
Synthetic dimensions have garnered widespread interest for implementing high dimensional classical and quantum dynamics on lower dimensional geometries. Synthetic frequency dimensions, in particular, have been used to experimentally realize a plethora of bulk physics effects, such as effective gauge potentials, nontrivial Hermitian as well as non-Hermitian topology, spin-momentum locking, complex long-range coupling, unidirectional frequency conversion, and four-dimensional lattices. However, in synthetic frequency dimensions there has not been any demonstration of boundary effects which are of paramount importance in topological physics due to the bulk edge correspondence, since systems exhibiting synthetic frequency dimensions do not support well-defined sharp boundaries. Here we theoretically elucidate a method to construct boundaries in the synthetic frequency dimension of dynamically modulated ring resonators by strongly coupling it to an auxiliary ring, and provide an experimental demonstration of this method. We experimentally explore various physics effects associated with the creation of such boundaries in the synthetic frequency dimension, including confinement of the spectrum of light, the discretization of the band structure, and the interaction of such boundaries with the topologically protected one-way chiral modes in a quantum Hall ladder. The incorporation of boundaries allows us to observe topologically robust transport of light along the frequency axis, which shows that the frequency of light can be controlled through topological concepts. Our demonstration of such sharp boundaries fundamentally expands the capability of exploring topological physics, and is also of importance for other applications such as classical and quantum information processing in synthetic frequency dimensions.
Submitted 21 March, 2022;
originally announced March 2022.
-
Thermal Tides in the Martian Atmosphere near Northern Summer Solstice Observed by ACS/TIRVIM onboard TGO
Authors:
Siteng Fan,
Sandrine Guerlet,
François Forget,
Antoine Bierjon,
Ehouarn Millour,
Nikolay Ignatiev,
Alexey Shakun,
Alexey Grigoriev,
Alexander Trokhimovskiy,
Franck Montmessin,
Oleg Korablev
Abstract:
Thermal tides in the Martian atmosphere are analyzed using temperature profiles retrieved from nadir observations obtained by the TIRVIM Fourier-spectrometer, part of the Atmospheric Chemistry Suite (ACS) onboard the ExoMars Trace Gas Orbiter (TGO). The data is selected near the northern summer solstice at solar longitude (LS) 75°-105° of Martian Year (MY) 35. The observations have a full local time coverage, which enables analyses of daily temperature anomalies. The observed zonal mean temperature is lower by 4-6K at ~100Pa, but higher towards the summer pole, compared to the LMD Mars General Circulation Model (GCM). Wave mode decomposition shows dominant diurnal tide and important semi-diurnal tide and diurnal Kelvin wave, with maximal amplitudes of 5K, 3K, and 2.5K, respectively, from tens to hundreds of Pa. The results generally agree well with the LMD Mars GCM, but with noticeable earlier phases of diurnal (~1h) and semi-diurnal (~3h) tides.
Submitted 20 March, 2022;
originally announced March 2022.
-
Unsupervised Learning of 3D Semantic Keypoints with Mutual Reconstruction
Authors:
Haocheng Yuan,
Chen Zhao,
Shichao Fan,
Jiaxi Jiang,
Jiaqi Yang
Abstract:
Semantic 3D keypoints are category-level semantic consistent points on 3D objects. Detecting 3D semantic keypoints is a foundation for a number of 3D vision tasks but remains challenging, due to the ambiguity of semantic information, especially when the objects are represented by unordered 3D point clouds. Existing unsupervised methods tend to generate category-level keypoints in implicit manners, making it difficult to extract high-level information, such as semantic labels and topology. From a novel mutual reconstruction perspective, we present an unsupervised method to generate consistent semantic keypoints from point clouds explicitly. To achieve this, the proposed model predicts keypoints that not only reconstruct the object itself but also reconstruct other instances in the same category. To the best of our knowledge, the proposed method is the first to mine 3D semantic consistent keypoints from a mutual reconstruction view. Experiments under various evaluation metrics as well as comparisons with the state-of-the-arts demonstrate the efficacy of our new solution to mining semantic consistent keypoints with mutual reconstruction.
Submitted 18 March, 2022;
originally announced March 2022.
-
Nonreciprocal Infrared Absorption via Resonant Magneto-optical Coupling to InAs
Authors:
Komron Shayegan,
Bo Zhao,
Yonghwi Kim,
Shanhui Fan,
Harry Atwater
Abstract:
Nonreciprocal elements are a vital building block of electrical and optical systems. In the infrared regime, there is a particular interest in structures that break reciprocity because their thermal absorptive (and emissive) properties should not obey the Kirchhoff thermal radiation law. In this work, we break time-reversal symmetry and reciprocity in n-type doped magneto-optic InAs with a static magnetic field where light coupling is mediated by a guided-mode-resonator (GMR) structure whose resonant frequency coincides with the epsilon-near-zero (ENZ) resonance of the doped InAs. Using this structure, we observe the nonreciprocal absorptive behavior as a function of magnetic field and scattering angle in the infrared. Accounting for resonant and nonresonant optical scattering, we reliably model experimental results that break reciprocal absorption relations in the infrared. The ability to design such nonreciprocal absorbers opens an avenue to explore devices with unequal absorptivity and emissivity in specific channels.
Submitted 11 March, 2022;
originally announced March 2022.
-
Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs
Authors:
Jingfei Xia,
Mingchen Zhuge,
Tiantian Geng,
Shun Fan,
Yuantai Wei,
Zhenyu He,
Feng Zheng
Abstract:
Figure skating scoring is challenging because it requires judging the technical moves of the players as well as their coordination with the background music. Most learning-based methods cannot solve it well for two reasons: 1) each move in figure skating changes quickly, hence simply applying traditional frame sampling will lose a lot of valuable information, especially in 3-to-5-minute-long videos; 2) prior methods rarely considered the critical audio-visual relationship in their models. For these reasons, we introduce a novel architecture, named Skating-Mixer. It extends the MLP framework in a multimodal fashion and effectively learns long-term representations through our designed memory recurrent unit (MRU). Aside from the model, we collected a high-quality audio-visual FS1000 dataset, which contains over 1000 videos on 8 types of programs with 7 different rating metrics, overtaking other datasets in both quantity and diversity. Experiments show the proposed method achieves state-of-the-art results on all major metrics on the public Fis-V and our FS1000 datasets. In addition, we include an analysis applying our method to recent competitions at the Beijing 2022 Winter Olympic Games, demonstrating that our method has strong applicability.
Submitted 17 December, 2022; v1 submitted 8 March, 2022;
originally announced March 2022.
-
Effect of Choices of Boundary Conditions on the Numerical Efficiency of Direct Solutions of Finite Difference Frequency Domain Systems with Perfectly Matched Layers
Authors:
Nathan Zhao,
Shanhui Fan
Abstract:
Direct solvers are a common method for solving finite difference frequency domain (FDFD) systems that arise in numerical solutions of Maxwell's equations. In a direct solver, one factorizes the system matrix. Since the system matrix is typically very sparse, the fill-in of these factors is the single most important computational consideration in terms of time complexity and memory requirements. As a result, it is of great interest to determine ways in which this fill-in can be systematically reduced. In this paper, we show that in the context of commonly used perfectly matched boundary layer methods, the choice of boundary condition behind the perfectly matched boundary layer can be exploited to reduce fill-in incurred during the factorization, leading to significant gains of up to 40 percent in the efficiency of the factorization procedure. We illustrate our findings by solving linear systems and eigenvalue problems associated with the FDFD method.
△ Less
Submitted 28 February, 2022;
originally announced February 2022.
-
Parametric Mie resonances and directional amplification in time-modulated scatterers
Authors:
V. Asadchy,
A. G. Lamprianidis,
G. Ptitcyn,
M. Albooyeh,
Rituraj,
T. Karamanos,
R. Alaee,
S. A. Tretyakov,
C. Rockstuhl,
S. Fan
Abstract:
We provide a theoretical description of light scattering by a spherical particle whose permittivity is modulated in time at twice the frequency of the incident light. Such a particle acts as a finite-sized photonic time crystal and, despite its sub-wavelength spatial extent, can host optical parametric amplification. Conditions of parametric Mie resonances in the sphere are derived. We show that time-modulated materials provide a route to tailor directional light amplification, qualitatively different from that in scatterers made from gain media. We design two characteristic time-modulated spheres that simultaneously exhibit light amplification and desired radiation patterns, including those with zero backward and/or vanishing forward scattering. The latter sphere provides an opportunity for creating shadow-free detectors of incident light.
Submitted 22 November, 2022; v1 submitted 22 February, 2022;
originally announced February 2022.
-
Parameter Identification of a PN-Guided Incoming Missile Using an Improved Multiple-Model Mechanism
Authors:
Yinhan Wang,
Jiang Wang,
Shipeng Fan
Abstract:
An active defense against an incoming missile requires information about it, including a guidance law parameter and a first-order lateral time constant. To this end, assuming that a missile with a proportional navigation (PN) guidance law attempts to attack an aerial target with bang-bang evasive maneuvers, a parameter identification model based on the gated recurrent unit (GRU) neural network is built in this paper. The analytic identification solutions for the guidance law parameter and the first-order lateral time constant are derived. The inputs of the identification model are the available kinematic information between the aircraft and the missile, while the outputs contain the regression results of missile parameters. To increase the training speed and the identification accuracy of the model, an output processing method called the improved multiple-model mechanism (IMMM) is proposed in this paper. The effectiveness of IMMM and the performance of the established model are demonstrated through numerical simulations under various engagement scenarios.
Submitted 25 January, 2022;
originally announced February 2022.
-
Assessing Planetary Complexity and Potential Agnostic Biosignatures using Epsilon Machines
Authors:
Stuart Bartlett,
Jiazheng Li,
Lixiang Gu,
Lana Sinapayen,
Siteng Fan,
Vijay Natraj,
Jonathan Jiang,
David Crisp,
Yuk Yung
Abstract:
We present a new approach to exoplanet characterisation using techniques from complexity science, with potential applications to biosignature detection. This agnostic method makes use of the temporal variability of light reflected or emitted from a planet. We use a technique known as epsilon machine reconstruction to compute the statistical complexity, a measure of the minimal model size for time series data. We demonstrate that statistical complexity is an effective measure of the complexity of planetary features. Increasing levels of qualitative planetary complexity correlate with increases in statistical complexity and Shannon entropy, demonstrating that our approach can identify planets with the richest dynamics. We also compare Earth time series with Jupiter data, and find that for the three wavelengths considered, Earth's average complexity and entropy rate are approximately 50% and 43% higher than Jupiter's, respectively. The majority of schemes for the detection of extraterrestrial life rely upon biochemical signatures and planetary context. However, it is increasingly recognised that extraterrestrial life could be very different to life on Earth. Under the hypothesis that there is a correlation between the presence of a biosphere and observable planetary complexity, our technique offers an agnostic and quantitative method for the measurement thereof.
Submitted 8 February, 2022;
originally announced February 2022.
-
Vibrational fingerprints of ferroelectric hafnia
Authors:
Shiyu Fan,
Sobhit Singh,
Xianghan Xu,
Kiman Park,
Yubo Qi,
S. W. Cheong,
David Vanderbilt,
Karin M. Rabe,
J. L. Musfeldt
Abstract:
Hafnia (HfO2) is a promising material for emerging chip applications due to its high-k dielectric behaviour, suitability for negative capacitance heterostructures, scalable ferroelectricity, and silicon compatibility. The lattice dynamics along with phononic properties such as thermal conductivity, contraction, and heat capacity are under-explored, primarily due to the absence of high quality single crystals. Herein, we report the vibrational properties of a series of HfO2 crystals stabilized with yttrium (chemical formula HfO2:xY, where x = 20, 12, 11, 8, and 0%) and compare our findings with a symmetry analysis and lattice dynamics calculations. We untangle the effects of Y by testing our calculations against the measured Raman and infrared spectra of the cubic, antipolar orthorhombic, and monoclinic phases and then proceed to reveal the signature modes of polar orthorhombic hafnia. This work provides a spectroscopic fingerprint for several different phases of HfO2 and paves the way for an analysis of mode contributions to high-k dielectric and ferroelectric properties for chip technologies.
Submitted 29 January, 2022;
originally announced January 2022.
-
Thermal structure and aerosols in Mars' atmosphere from TIRVIM/ACS onboard the ExoMars Trace Gas Orbiter: validation of the retrieval algorithm
Authors:
Sandrine Guerlet,
N. Ignatiev,
F. Forget,
T. Fouchet,
P. Vlasov,
G. Bergeron,
R. M. B. Young,
E. Millour,
S. Fan,
H. Tran,
A. Shakun,
A. Grigoriev,
A. Trokhimovskiy,
F. Montmessin,
O. Korablev
Abstract:
The Atmospheric Chemistry Suite (ACS) onboard the ExoMars Trace Gas Orbiter (TGO) monitors the Martian atmosphere through different spectral intervals in the infrared. We present a retrieval algorithm tailored to the analysis of spectra acquired in nadir geometry by TIRVIM, the thermal infrared channel of ACS. Our algorithm simultaneously retrieves vertical profiles of atmospheric temperature up to 50 km, surface temperature, and integrated optical depth of dust and water ice clouds. The specificity of the TIRVIM dataset lies in its capacity to resolve the diurnal cycle over a 54 sol period. However, it is uncertain to what extent the desired atmospheric quantities can be accurately estimated at different times of day. Here we first present an Observing System Simulation Experiment (OSSE). We produce synthetic observations at various latitudes, seasons and local times and run our retrieval algorithm on these synthetic data to evaluate its robustness. Different sources of bias are documented, in particular regarding aerosol retrievals. Atmospheric temperature retrievals are found to be robust even when dust and/or water ice cloud opacities are not well estimated in our OSSE. We then apply our algorithm to TIRVIM observations in April-May 2018 and perform a cross-validation of retrieved atmospheric temperature and dust integrated opacity by comparison with thousands of co-located Mars Climate Sounder (MCS) retrievals. Most differences between TIRVIM and MCS atmospheric temperatures can be attributed to differences in vertical sensitivity. Daytime dust opacities agree well with each other, while biases are found in nighttime dust opacity retrieved from TIRVIM at this season.
Submitted 27 January, 2022;
originally announced January 2022.
-
Debiased Graph Neural Networks with Agnostic Label Selection Bias
Authors:
Shaohua Fan,
Xiao Wang,
Chuan Shi,
Kun Kuang,
Nian Liu,
Bai Wang
Abstract:
Most existing Graph Neural Networks (GNNs) are proposed without considering the selection bias in data, i.e., the inconsistent distribution between the training set and the test set. In reality, the test data is not even available during the training process, making selection bias agnostic. Training GNNs with biased selected nodes leads to significant parameter estimation bias and greatly impacts the generalization ability on test nodes. In this paper, we first present an experimental investigation, which clearly shows that the selection bias drastically hinders the generalization ability of GNNs, and theoretically prove that the selection bias will cause biased estimation of GNN parameters. Then, to remove the bias in GNN estimation, we propose a novel Debiased Graph Neural Network (DGNN) with a differentiated decorrelation regularizer. The differentiated decorrelation regularizer estimates a sample weight for each labeled node such that the spurious correlation of learned embeddings could be eliminated. We analyze the regularizer from a causal view, which motivates us to differentiate the weights of the variables based on their contribution to the confounding bias. Then, these sample weights are used for reweighting GNNs to eliminate the estimation bias, thus helping to improve the stability of prediction on unknown test nodes. Comprehensive experiments are conducted on several challenging graph datasets with two kinds of label selection biases. The results verify that our proposed model outperforms the state-of-the-art methods and that DGNN is a flexible framework to enhance existing GNNs.
Submitted 25 January, 2022; v1 submitted 19 January, 2022;
originally announced January 2022.
-
Roadmap on Topological Photonics
Authors:
Hannah Price,
Yidong Chong,
Alexander Khanikaev,
Henning Schomerus,
Lukas J. Maczewsky,
Mark Kremer,
Matthias Heinrich,
Alexander Szameit,
Oded Zilberberg,
Yihao Yang,
Baile Zhang,
Andrea Alù,
Ronny Thomale,
Iacopo Carusotto,
Philippe St-Jean,
Alberto Amo,
Avik Dutt,
Luqi Yuan,
Shanhui Fan,
Xuefan Yin,
Chao Peng,
Tomoki Ozawa,
Andrea Blanco-Redondo
Abstract:
Topological photonics seeks to control the behaviour of the light through the design of protected topological modes in photonic structures. While this approach originated from studying the behaviour of electrons in solid-state materials, it has since blossomed into a field that is at the very forefront of the search for new topological types of matter. This can have real implications for future technologies by harnessing the robustness of topological photonics for applications in photonics devices. This Roadmap surveys some of the main emerging areas of research within topological photonics, with a special attention to questions in fundamental science, which photonics is in an ideal position to address. Each section provides an overview of the current and future challenges within a part of the field, highlighting the most exciting opportunities for future research and developments.
Submitted 17 January, 2022;
originally announced January 2022.
-
A bimodal distribution of haze in Pluto's atmosphere
Authors:
Siteng Fan,
Peter Gao,
Xi Zhang,
Danica J. Adams,
Nicholas W. Kutsop,
Carver J. Bierson,
Chao Liu,
Jiani Yang,
Leslie A. Young,
Andrew F. Cheng,
Yuk L. Yung
Abstract:
Pluto, Titan, and Triton make up a unique class of solar system bodies, with icy surfaces and chemically reducing atmospheres rich in organic photochemistry and haze formation. Hazes play important roles in these atmospheres, with physical and chemical processes highly dependent on particle sizes, but the haze size distribution in reducing atmospheres is currently poorly understood. Here we report observational evidence that Pluto's haze particles are bimodally distributed, which successfully reproduces the full phase scattering observations from New Horizons. Combined with previous simulations of Titan's haze, this result suggests that haze particles in reducing atmospheres undergo rapid shape change near pressure levels ~0.5Pa and favors a photochemical rather than a dynamical origin for the formation of Titan's detached haze. It also demonstrates that both oxidizing and reducing atmospheres can produce multi-modal hazes, and encourages reanalysis of observations of hazes on Titan and Triton.
Submitted 12 January, 2022;
originally announced January 2022.
-
Knots and their effect on the tensile strength of lumber: a case study
Authors:
Shuxian Fan,
Samuel W. K. Wong,
James V. Zidek
Abstract:
When assessing the strength of sawn lumber for use in engineering applications, the sizes and locations of knots are an important consideration. Knots, which result from the growth of tree branches, are the most common visual characteristic of lumber. Large individual knots, as well as clusters of distinct knots, are known to have strength-reducing effects. However, industry grading rules that govern knots are informed by subjective judgment to some extent, particularly the spatial interaction of knots and their relationship with lumber strength. This case study reports the results of an experiment that investigated and modelled the strength-reducing effects of knots on a sample of Douglas Fir lumber. Experimental data were obtained by taking scans of lumber surfaces and applying tensile strength testing. The modelling approach presented incorporates all relevant knot information in a Bayesian framework, thereby contributing a more refined way of managing the quality of manufactured lumber.
Submitted 14 February, 2023; v1 submitted 10 January, 2022;
originally announced January 2022.
-
Doping-driven topological polaritons in graphene/α-MoO3 heterostructures
Authors:
Hai Hu,
Na Chen,
Hanchao Teng,
Renwen Yu,
Yunpeng Qu,
Jianzhe Sun,
Mengfei Xue,
Debo Hu,
Bin Wu,
Chi Li,
Jianing Chen,
Mengkun Liu,
Zhipei Sun,
Yunqi Liu,
Peining Li,
Shanhui Fan,
F. Javier García de Abajo,
Qing Dai
Abstract:
Controlling the charge carrier density provides an efficient way to trigger phase transitions and modulate the optoelectronic properties in natural materials. This approach could be used to induce topological transitions in the optical response of photonic systems. Here, we predict a topological transition in the isofrequency dispersion contours of hybrid polaritons supported by a two-dimensional heterostructure consisting of graphene and $α$-phase molybdenum trioxide ($α$-MoO3). By chemically changing the doping level of graphene, we experimentally demonstrate that the contour topology of polariton isofrequency surfaces transforms from open to closed shapes as a result of doping-dependent polariton hybridization. Moreover, by changing the substrate medium for the heterostructure, the dispersion contour can be further engineered into a rather flattened shape at the topological transition, thus supporting tunable polariton canalization and providing the means to locally control the topology. We demonstrate this idea to achieve extremely subwavelength focusing by using a 1.2-$μ$m-wide silica substrate as a negative refraction lens. Our findings open a disruptive approach toward promising on-chip applications in nanoimaging, optical sensing, and manipulation of nanoscale energy transfer.
Submitted 3 January, 2022;
originally announced January 2022.
-
How Powerful are Interest Diffusion on Purchasing Prediction: A Case Study of Taocode
Authors:
Xuanwen Huang,
Yang Yang,
Ziqiang Cheng,
Shen Fan,
Zhongyao Wang,
Juren Li,
Jun Zhang,
Jingmin Chen
Abstract:
A taocode is a kind of specially coded text-link on Taobao (the world's biggest online shopping website), through which users can share messages about products with each other. Analyzing taocodes can potentially facilitate understanding of the social relationships between users and, more excitingly, their online purchasing behaviors under the influence of taocode diffusion. This paper innovatively investigates the problem of online purchasing predictions from an information diffusion perspective, with taocode as a case study. Specifically, we conduct profound observational studies on a large-scale real-world dataset from Taobao, containing over 100M Taocode sharing records. Inspired by our observations, we propose InfNet, a dynamic GNN-based framework that models the information diffusion across Taocode. We then apply InfNet to item purchasing predictions. Extensive experiments on real-world datasets validate the effectiveness of InfNet compared with 8 state-of-the-art baselines.
Submitted 30 December, 2021; v1 submitted 29 December, 2021;
originally announced December 2021.
-
Efficient method for accelerating line searches using a combined Schur complement domain decomposition and Born series expansions in photonic-based adjoint optimization
Authors:
Nathan Zhao,
Salim Boutami,
Shanhui Fan
Abstract:
A line search in a gradient-based optimization algorithm solves the problem of determining the optimal learning rate for a given gradient or search direction in a single iteration. For most problems, this is determined by evaluating different candidate learning rates to find the optimum, which can be expensive. Recent work has provided an efficient way to perform a line search with the use of the Shanks transformation of a Born series derived from the Lippmann-Schwinger formalism. In this paper we show that the cost of performing such a line search can be further reduced with the use of the Schur complement domain decomposition method, which can lead to a 10-fold total speed-up resulting from the reduced number of iterations to convergence and reduced wall-clock time per iteration.
Submitted 20 December, 2021;
originally announced December 2021.
-
Polarization-independent isotropic nonlocal metasurfaces with wavelength-controlled functionality
Authors:
Olivia Y. Long,
Cheng Guo,
Weiliang Jin,
Shanhui Fan
Abstract:
Flat optics has demonstrated great advances in miniaturizing conventional, bulky optical elements due to the recent developments in metasurface design. Specific applications of such designs include spatial differentiation and the compression of free space. However, metasurfaces designed for such applications are often polarization-dependent and are designed for a single functionality. In this work, we introduce a polarization-independent metasurface structure by designing guided resonances with degenerate band curvatures in a photonic crystal slab. Our device can perform both free-space compression and spatial differentiation when operated at different frequencies at normal incidence. This work demonstrates the promise of dispersion engineering in metasurface design to create ultrathin devices with polarization-independent functionality.
Submitted 13 December, 2021;
originally announced December 2021.
-
Constrained Adaptive Projection with Pretrained Features for Anomaly Detection
Authors:
Xingtai Gui,
Di Wu,
Yang Chang,
Shicai Fan
Abstract:
Anomaly detection aims to separate anomalies from normal samples, and the pretrained network is promising for anomaly detection. However, adapting the pretrained features would be confronted with the risk of pattern collapse when finetuning on one-class training data. In this paper, we propose an anomaly detection framework called constrained adaptive projection with pretrained features (CAP). Combined with pretrained features, a simple linear projection head applied on a specific input and its k most similar pretrained normal representations is designed for feature adaptation, and a reformed self-attention is leveraged to mine the inner-relationship among one-class semantic features. A loss function is proposed to avoid potential pattern collapse. Concretely, it considers the similarity between a specific data sample and its corresponding adaptive normal representation, and incorporates a constraint term slightly aligning pretrained and adaptive spaces. Our method achieves state-of-the-art anomaly detection performance on semantic anomaly detection and sensory anomaly detection benchmarks including 96.5% AUROC on CIFAR-100 dataset, 97.0% AUROC on CIFAR-10 dataset and 89.9% AUROC on MVTec dataset.
Submitted 25 April, 2022; v1 submitted 5 December, 2021;
originally announced December 2021.
-
Electron Pulse Compression with Optical Beat Note
Authors:
Zhexin Zhao,
Kenneth J. Leedle,
Dylan S. Black,
Olav Solgaard,
Robert L. Byer,
Shanhui Fan
Abstract:
Compressing electron pulses is important in many applications of electron beam systems. In this study, we propose to use optical beat notes to compress electron pulses. The beat frequency is chosen to match the initial electron pulse duration, which enables the compression of electron pulses with a wide range of durations. This functionality extends the optical control of electron beams, which is important in compact electron beam systems such as dielectric laser accelerators. We also find that the dominant frequency of the electron charge density changes continuously along its drift trajectory, which may open up new opportunities in coherent interaction between free electrons and quantum or classical systems.
Submitted 20 November, 2021;
originally announced November 2021.
-
Generalizing Graph Neural Networks on Out-Of-Distribution Graphs
Authors:
Shaohua Fan,
Xiao Wang,
Chuan Shi,
Peng Cui,
Bai Wang
Abstract:
Graph Neural Networks (GNNs) are proposed without considering the agnostic distribution shifts between training and testing graphs, inducing the degeneration of the generalization ability of GNNs in Out-Of-Distribution (OOD) settings. The fundamental reason for such degeneration is that most GNNs are developed based on the I.I.D. hypothesis. In such a setting, GNNs tend to exploit subtle statistical correlations existing in the training set for predictions, even when those correlations are spurious. However, such spurious correlations may change in testing environments, leading to the failure of GNNs. Therefore, eliminating the impact of spurious correlations is crucial for stable GNNs. To this end, we propose a general causal representation framework, called StableGNN. The main idea is to extract high-level representations from graph data first and resort to the distinguishing ability of causal inference to help the model get rid of spurious correlations. In particular, we exploit a graph pooling layer to extract subgraph-based representations as high-level representations. Furthermore, we propose a causal variable distinguishing regularizer to correct the biased training distribution. Hence, GNNs would concentrate more on the stable correlations. Extensive experiments on both synthetic and real-world OOD graph datasets verify the effectiveness, flexibility and interpretability of the proposed framework.
Submitted 10 March, 2024; v1 submitted 20 November, 2021;
originally announced November 2021.
-
Eigenvalue topology of non-Hermitian band structures in two and three dimensions
Authors:
Charles C. Wojcik,
Kai Wang,
Avik Dutt,
Janet Zhong,
Shanhui Fan
Abstract:
In the band theory for non-Hermitian systems, the energy eigenvalues, which are complex, can exhibit non-trivial topology which is not present in Hermitian systems. In one dimension, it was recently noted theoretically and demonstrated experimentally that the eigenvalue topology is classified by the braid group. The classification of eigenvalue topology in higher dimensions, however, remained an open question. Here, we give a complete description of eigenvalue topology in two and three dimensional systems, including the gapped and gapless cases. We reduce the topological classification problem to a purely computational problem in algebraic topology. In two dimensions, the Brillouin zone torus is punctured by exceptional points, and each nontrivial loop in the punctured torus acquires a braid group invariant. These braids satisfy the constraint that the composite of the braids around the exceptional points is equal to the commutator of the braids on the fundamental cycles of the torus. In three dimensions, there are exceptional knots and links, and the classification depends on how they are embedded in the Brillouin zone three-torus. When the exceptional link is contained in a contractible ball, the classification can be expressed in terms of the knot group of the link. Our results provide a comprehensive understanding of non-Hermitian eigenvalue topology in higher dimensional systems, and should be important for the further explorations of topologically robust open quantum and classical systems.
Submitted 18 November, 2021;
originally announced November 2021.
-
DFC: Deep Feature Consistency for Robust Point Cloud Registration
Authors:
Zhu Xu,
Zhengyao Bai,
Huijie Liu,
Qianjie Lu,
Shenglan Fan
Abstract:
How to extract significant point cloud features and estimate the pose between them remains a challenging question, due to the inherent lack of structure and ambiguous order permutation of point clouds. Despite significant improvements in applying deep learning-based methods for most 3D computer vision tasks, such as object classification, object segmentation and point cloud registration, the consistency between features is still not attractive in existing learning-based pipelines. In this paper, we present a novel learning-based alignment network for complex alignment scenes, titled deep feature consistency and consisting of three main modules: a multiscale graph feature merging network for converting the geometric correspondence set into high-dimensional features, a correspondence weighting module for constructing multiple candidate inlier subsets, and a Procrustes approach named deep feature matching for giving a closed-form solution to estimate the relative pose. As the most important step of the deep feature matching module, the feature consistency matrix for each inlier subset is constructed to obtain its principal vectors as the inlier likelihoods of the corresponding subset. We comprehensively validate the robustness and effectiveness of our approach on both the 3DMatch dataset and the KITTI odometry dataset. For large indoor scenes, registration results on the 3DMatch dataset demonstrate that our method outperforms both the state-of-the-art traditional and learning-based methods. For KITTI outdoor scenes, our approach remains quite capable of lowering the transformation errors. We also explore its strong generalization capability over cross-datasets.
Submitted 13 December, 2021; v1 submitted 15 November, 2021;
originally announced November 2021.
-
Invariant representation for generators of general time interval quadratic BSDEs under stochastic growth conditions
Authors:
Guangshuo Zhou,
Fengjiao Du,
Shengjun Fan
Abstract:
This paper is devoted to proving a general invariant representation theorem for generators of general time interval backward stochastic differential equations, where the generator $g$ has a quadratic growth in the unknown variable $z$ and satisfies some stochastic growth conditions in the unknown variable $y$. This unifies and strengthens some known results. And, a natural and innovative idea is used to prove the representation theorem.
Submitted 11 November, 2021;
originally announced November 2021.
-
Julia sets and geometrically finite maps over finite extensions of the $p$-adic field
Authors:
Shilei Fan,
Lingmin Liao,
Hongmin Nie,
Yuefei Wang
Abstract:
Let $K$ be a finite extension of the field $\mathbb{Q}_p$ of $p$-adic numbers, and $φ\in K(z)$ be a rational map of degree at least $2$. We prove that the $K$-Julia set of $φ$ is the natural restriction of the $\mathbb{C}_p$-Julia set, provided that the critical orbits are well-behaved. Moreover, under the further assumption that $φ$ is geometrically finite, we prove that the dynamics on the $K$-Julia set of $φ$ is a countable state Markov shift.
Submitted 11 January, 2024; v1 submitted 2 November, 2021;
originally announced November 2021.
-
A Fast Location Algorithm for Very Sparse Point Clouds Based on Object Detection
Authors:
Shiyu Fan
Abstract:
Because of performance limits, it is arduous to recognize a target object and locate it in Augmented Reality (AR) scenes on low-end mobile devices, especially those using monocular cameras. In this paper, we propose an algorithm which can quickly locate the target object through image object detection when only very sparse feature points are available. We introduce YOLOv3-Tiny into our algorithm as the object detection module to filter the possible points and use Principal Component Analysis (PCA) to determine the location. We conduct the experiment in a manually designed scene by holding a smartphone, and the results show the high positioning speed and accuracy of our method.
Submitted 21 October, 2021;
originally announced October 2021.
-
Scattering from spheres made of time-varying and dispersive materials
Authors:
G. Ptitcyn,
A. G. Lamprianidis,
T. Karamanos,
V. S. Asadchy,
R. Alaee,
M. Müller,
M. Albooyeh,
M. S. Mirmoosa,
S. Fan,
S. A. Tretyakov,
C. Rockstuhl
Abstract:
Exploring the interaction of light with time-varying media is an intellectual challenge that, in addition to fundamental aspects, provides a pathway to multiple promising applications. Time modulation constitutes here a fundamental handle to control light on entirely different grounds. That holds particularly for complex systems simultaneously structured in space and time. However, a realistic description of time-varying materials requires considering their material dispersion. The combination thereof has barely been considered but is crucial since dispersion accompanies materials suitable for dynamic modulation. As a canonical scattering problem from which many general insights can be obtained, we develop and apply a self-consistent analytical theory of light scattering by a sphere made from a time-varying material exemplarily assumed to have a Lorentzian dispersion. We discuss the eigensolutions of Maxwell's equations in the bulk and present a dedicated Mie theory. The proposed theory is verified with full-wave simulations. We disclose effects such as energy transfer from the time-modulation subsystem to the electromagnetic field, amplifying carefully structured incident fields. Since many phenomena can be studied on analytical grounds with our formalism, it will be indispensable when exploring electromagnetic phenomena in time-varying and spatially structured finite objects of other geometries.
Submitted 14 October, 2021;
originally announced October 2021.
-
Mirror symmetric on-chip frequency circulation of light
Authors:
Jason F. Herrmann,
Vahid Ansari,
Jiahui Wang,
Jeremy D. Witmer,
Shanhui Fan,
Amir H. Safavi-Naeini
Abstract:
Integrated circulators and isolators are important for developing on-chip optical technologies, such as laser cavities, communication systems, and quantum information processors. These devices appear to inherently require mirror symmetry breaking to separate backwards from forwards propagation, so existing implementations rely upon magnetic materials, or interactions driven by propagating waves. In contrast to previous work, we demonstrate a mirror symmetric nonreciprocal device. Our device comprises three coupled photonic resonators implemented in thin-film lithium niobate. Applying radio frequency modulation, we drive conversion between the frequency eigenmodes of this system. We measure nearly 40 dB of isolation for approximately 75 mW of RF power near 1550 nm. We simultaneously generate nonreciprocal conversion between all of the eigenmodes in order to demonstrate circulation. Mirror symmetric circulation significantly simplifies the fabrication and operation of nonreciprocal integrated devices. Finally, we consider applications of such on-chip isolators and circulators, such as full-duplex isolation within a single waveguide.
Submitted 28 September, 2021;
originally announced September 2021.
-
POAR: Efficient Policy Optimization via Online Abstract State Representation Learning
Authors:
Zhaorun Chen,
Siqi Fan,
Yuan Tan,
Liang Gong,
Binhao Chen,
Te Sun,
David Filliat,
Natalia Díaz-Rodríguez,
Chengliang Liu
Abstract:
While the rapid progress of deep learning fuels end-to-end reinforcement learning (RL), direct application, especially in high-dimensional spaces like robotic scenarios, still suffers from low sample efficiency. Therefore, State Representation Learning (SRL) is proposed to specifically learn to encode task-relevant features from complex sensory data into low-dimensional states. However, the pervasive implementation of SRL is usually conducted by a decoupling strategy in which the observation-state mapping is learned separately, which is prone to over-fitting. To handle this problem, we summarize the state-of-the-art (SOTA) SRL sub-tasks in previous works and present a new algorithm called Policy Optimization via Abstract Representation which integrates SRL into the policy optimization phase. Firstly, we engage RL loss to assist in updating the SRL model so that the states can evolve to meet the demand of RL and maintain a good physical interpretation. Secondly, we introduce a dynamic loss weighting mechanism so that both models can efficiently adapt to each other. Thirdly, we introduce a new SRL prior called domain resemblance to leverage expert demonstration to improve SRL interpretations. Finally, we provide real-time access to the state graph to monitor the course of learning. Experiments indicate that POAR significantly outperforms SOTA RL algorithms and decoupling SRL strategies in terms of sample efficiency and final rewards. We empirically verify that POAR efficiently handles tasks in high dimensions and facilitates training real-life robots directly from scratch.
Submitted 9 December, 2023; v1 submitted 17 September, 2021;
originally announced September 2021.
-
A Medical Pre-Diagnosis System for Histopathological Image of Breast Cancer
Authors:
Shiyu Fan,
Runhai Xu,
Zhaohang Yan
Abstract:
This paper constructs a novel intelligent medical diagnosis system, which can realize automatic communication and breast cancer pathological image recognition. This system contains two main parts: a pre-trained chatbot called M-Chatbot and an improved neural network model of EfficientNetV2-S named EfficientNetV2-SA, in which the activation function in the top layers is replaced by ACON-C. Using an information retrieval mechanism, M-Chatbot instructs patients to send a breast pathological image to the EfficientNetV2-SA network, and then the classifier trained by transfer learning returns the diagnosis results. We verify the performance of our chatbot and classifier on extrinsic metrics and the BreaKHis dataset, respectively. The task completion rate of M-Chatbot reached 63.33%. For the BreaKHis dataset, the highest accuracy of the EfficientNetV2-SA network reached 84.71%. All these experimental results illustrate that the proposed model can improve the accuracy of image recognition and that our new intelligent medical diagnosis system is successful and efficient in providing automatic diagnosis of breast cancer.
Submitted 16 September, 2021;
originally announced September 2021.