Search | arXiv e-print repository

Netmarble AI Center's WMT21 Automatic Post-Editing Shared Task Submission

Authors: Shinhyeok Oh, Sion Jang, Hu Xu, Shounan An, Insoo Oh

Abstract: This paper describes Netmarble's submission to WMT21 Automatic Post-Editing (APE) Shared Task for the English-German language pair. First, we propose a Curriculum Training Strategy in training stages. Facebook Fair's WMT19 news translation model was chosen to engage the large and powerful pre-trained neural networks. Then, we post-train the translation model with different levels of data at each t… ▽ More This paper describes Netmarble's submission to WMT21 Automatic Post-Editing (APE) Shared Task for the English-German language pair. First, we propose a Curriculum Training Strategy in training stages. Facebook Fair's WMT19 news translation model was chosen to engage the large and powerful pre-trained neural networks. Then, we post-train the translation model with different levels of data at each training stages. As the training stages go on, we make the system learn to solve multiple tasks by adding extra information at different training stages gradually. We also show a way to utilize the additional data in large volume for APE tasks. For further improvement, we apply Multi-Task Learning Strategy with the Dynamic Weight Average during the fine-tuning stage. To fine-tune the APE corpus with limited data, we add some related subtasks to learn a unified representation. Finally, for better performance, we leverage external translations as augmented machine translation (MT) during the post-training and fine-tuning. As experimental results show, our APE system significantly improves the translations of provided MT results by -2.848 and +3.74 on the development dataset in terms of TER and BLEU, respectively. It also demonstrates its effectiveness on the test dataset with higher quality than the development dataset. △ Less

Submitted 16 November, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

Comments: WMT21 Automatic Post-Editing Shared Task System Paper (at EMNLP2021 Workshop)

ACM Class: I.2.7

arXiv:2108.10515 [pdf, other]

doi 10.1145/3474085.3481537

ARShoe: Real-Time Augmented Reality Shoe Try-on System on Smartphones

Authors: Shan An, Guangfu Che, **ghao Guo, Haogang Zhu, Junjie Ye, Fangru Zhou, Zhaoqi Zhu, Dong Wei, Aishan Liu, Wei Zhang

Abstract: Virtual try-on technology enables users to try various fashion items using augmented reality and provides a convenient online shop** experience. However, most previous works focus on the virtual try-on for clothes while neglecting that for shoes, which is also a promising task. To this concern, this work proposes a real-time augmented reality virtual shoe try-on system for smartphones, namely AR… ▽ More Virtual try-on technology enables users to try various fashion items using augmented reality and provides a convenient online shop** experience. However, most previous works focus on the virtual try-on for clothes while neglecting that for shoes, which is also a promising task. To this concern, this work proposes a real-time augmented reality virtual shoe try-on system for smartphones, namely ARShoe. Specifically, ARShoe adopts a novel multi-branch network to realize pose estimation and segmentation simultaneously. A solution to generate realistic 3D shoe model occlusion during the try-on process is presented. To achieve a smooth and stable try-on effect, this work further develop a novel stabilization method. Moreover, for training and evaluation, we construct the very first large-scale foot benchmark with multiple virtual shoe try-on task-related labels annotated. Exhaustive experiments on our newly constructed benchmark demonstrate the satisfying performance of ARShoe. Practical tests on common smartphones validate the real-time performance and stabilization of the proposed approach. △ Less

Submitted 23 August, 2021; originally announced August 2021.

Comments: Accepted by ACM Multimedia 2021

arXiv:2108.10506 [pdf, other]

Real-Time Monocular Human Depth Estimation and Segmentation on Embedded Systems

Authors: Shan An, Fangru Zhou, Mei Yang, Haogang Zhu, Changhong Fu, Konstantinos A. Tsintotas

Abstract: Estimating a scene's depth to achieve collision avoidance against moving pedestrians is a crucial and fundamental problem in the robotic field. This paper proposes a novel, low complexity network architecture for fast and accurate human depth estimation and segmentation in indoor environments, aiming to applications for resource-constrained platforms (including battery-powered aerial, micro-aerial… ▽ More Estimating a scene's depth to achieve collision avoidance against moving pedestrians is a crucial and fundamental problem in the robotic field. This paper proposes a novel, low complexity network architecture for fast and accurate human depth estimation and segmentation in indoor environments, aiming to applications for resource-constrained platforms (including battery-powered aerial, micro-aerial, and ground vehicles) with a monocular camera being the primary perception module. Following the encoder-decoder structure, the proposed framework consists of two branches, one for depth prediction and another for semantic segmentation. Moreover, network structure optimization is employed to improve its forward inference speed. Exhaustive experiments on three self-generated datasets prove our pipeline's capability to execute in real-time, achieving higher frame rates than contemporary state-of-the-art frameworks (114.6 frames per second on an NVIDIA Jetson Nano GPU with TensorRT) while maintaining comparable accuracy. △ Less

Submitted 23 August, 2021; originally announced August 2021.

Comments: Accepted by IROS 2021

arXiv:2108.09295 [pdf, other]

Wide field-of-view flat lens: an analytical formalism

Authors: Fan Yang, Sensong An, Mikhail Y. Shalaginov, Hualiang Zhang, Juejun Hu, Tian Gu

Abstract: Wide field-of-view (FOV) optics are widely used in various imaging, display, and sensing applications. While conventional wide FOV optics rely on cascading multiple elements to suppress coma and other aberrations, it has recently been demonstrated that diffraction-limited, near-180 degree FOV operation can be achieved with a single-piece flat fisheye lens designed via iterative numerical optimizat… ▽ More Wide field-of-view (FOV) optics are widely used in various imaging, display, and sensing applications. While conventional wide FOV optics rely on cascading multiple elements to suppress coma and other aberrations, it has recently been demonstrated that diffraction-limited, near-180 degree FOV operation can be achieved with a single-piece flat fisheye lens designed via iterative numerical optimization [Nano Lett. 20, 7429(2020)]. Here we derive an analytical solution to enable computationally efficient design of flat wide FOV lenses based on metasurfaces or diffractive optical elements (DOEs). Leveraging this analytical approach, we further quantified trade-offs between optical performance and design parameters in wide FOV metalenses. △ Less

Submitted 23 August, 2021; v1 submitted 20 August, 2021; originally announced August 2021.

Comments: 14 pages, format change

arXiv:2107.08417 [pdf, other]

doi 10.1038/s41467-021-27900-6

Shortcuts to Adiabaticity for Open Systems in Circuit Quantum Electrodynamics

Authors: Zelong Yin, Chunzhen Li, Jonathan Allcock, Yicong Zheng, Xiu Gu, Maochun Dai, Shengyu Zhang, Shuoming An

Abstract: Shortcuts to adiabaticity (STA) are powerful quantum control methods, allowing quick evolution into target states of otherwise slow adiabatic dynamics. Such methods have widespread applications in quantum technologies, and various STA protocols have been demonstrated in closed systems. However, realizing STA for open quantum systems has presented a greater challenge, due to complex controls requir… ▽ More Shortcuts to adiabaticity (STA) are powerful quantum control methods, allowing quick evolution into target states of otherwise slow adiabatic dynamics. Such methods have widespread applications in quantum technologies, and various STA protocols have been demonstrated in closed systems. However, realizing STA for open quantum systems has presented a greater challenge, due to complex controls required in existing proposals. Here we present the first experimental demonstration of STA for open quantum systems, using a superconducting circuit QED system consisting of two coupled bosonic oscillators and a transmon qubit. By applying a counterdiabatic driving pulse, we reduce the adiabatic evolution time of a single lossy mode from 800 ns to 100 ns. In addition, we propose and implement an optimal control protocol to achieve fast and qubit-unconditional equilibrium of multiple lossy modes. Our results pave the way for accelerating dynamics of open quantum systems and have potential applications in designing fast open-system protocols of physical and interdisciplinary interest, such as accelerating bioengineering and chemical reaction dynamics. △ Less

Submitted 18 October, 2021; v1 submitted 18 July, 2021; originally announced July 2021.

arXiv:2107.06516 [pdf, other]

Learning Algebraic Recombination for Compositional Generalization

Authors: Chenyao Liu, Shengnan An, Zeqi Lin, Qian Liu, Bei Chen, Jian-Guang Lou, Lijie Wen, Nanning Zheng, Dongmei Zhang

Abstract: Neural sequence models exhibit limited compositional generalization ability in semantic parsing tasks. Compositional generalization requires algebraic recombination, i.e., dynamically recombining structured expressions in a recursive manner. However, most previous studies mainly concentrate on recombining lexical units, which is an important but not sufficient part of algebraic recombination. In t… ▽ More Neural sequence models exhibit limited compositional generalization ability in semantic parsing tasks. Compositional generalization requires algebraic recombination, i.e., dynamically recombining structured expressions in a recursive manner. However, most previous studies mainly concentrate on recombining lexical units, which is an important but not sufficient part of algebraic recombination. In this paper, we propose LeAR, an end-to-end neural model to learn algebraic recombination for compositional generalization. The key insight is to model the semantic parsing task as a homomorphism between a latent syntactic algebra and a semantic algebra, thus encouraging algebraic recombination. Specifically, we learn two modules jointly: a Composer for producing latent syntax, and an Interpreter for assigning semantic operations. Experiments on two realistic and comprehensive compositional generalization benchmarks demonstrate the effectiveness of our model. The source code is publicly available at https://github.com/microsoft/ContextualSP. △ Less

Submitted 14 July, 2021; originally announced July 2021.

Comments: ACL Findings 2021

arXiv:2107.05023 [pdf, other]

NeoUNet: Towards accurate colon polyp segmentation and neoplasm detection

Authors: Phan Ngoc Lan, Nguyen Sy An, Dao Viet Hang, Dao Van Long, Tran Quang Trung, Nguyen Thi Thuy, Dinh Viet Sang

Abstract: Automatic polyp segmentation has proven to be immensely helpful for endoscopy procedures, reducing the missing rate of adenoma detection for endoscopists while increasing efficiency. However, classifying a polyp as being neoplasm or not and segmenting it at the pixel level is still a challenging task for doctors to perform in a limited time. In this work, we propose a fine-grained formulation for… ▽ More Automatic polyp segmentation has proven to be immensely helpful for endoscopy procedures, reducing the missing rate of adenoma detection for endoscopists while increasing efficiency. However, classifying a polyp as being neoplasm or not and segmenting it at the pixel level is still a challenging task for doctors to perform in a limited time. In this work, we propose a fine-grained formulation for the polyp segmentation problem. Our formulation aims to not only segment polyp regions, but also identify those at high risk of malignancy with high accuracy. In addition, we present a UNet-based neural network architecture called NeoUNet, along with a hybrid loss function to solve this problem. Experiments show highly competitive results for NeoUNet on our benchmark dataset compared to existing polyp segmentation models. △ Less

Submitted 11 July, 2021; originally announced July 2021.

arXiv:2107.03861 [pdf, other]

Feedback Vertex Set on Geometric Intersection Graphs

Authors: Shinwoo An, Eun** Oh

Abstract: In this paper, we present an algorithm for computing a feedback vertex set of a unit disk graph of size $k$, if it exists, which runs in time $2^{O(\sqrt{k})}(n+m)$, where $n$ and $m$ denote the numbers of vertices and edges, respectively. This improves the $2^{O(\sqrt{k}\log k)}n^{O(1)}$-time algorithm for this problem on unit disk graphs by Fomin et al. [ICALP 2017]. Moreover, our algorithm is o… ▽ More In this paper, we present an algorithm for computing a feedback vertex set of a unit disk graph of size $k$, if it exists, which runs in time $2^{O(\sqrt{k})}(n+m)$, where $n$ and $m$ denote the numbers of vertices and edges, respectively. This improves the $2^{O(\sqrt{k}\log k)}n^{O(1)}$-time algorithm for this problem on unit disk graphs by Fomin et al. [ICALP 2017]. Moreover, our algorithm is optimal assuming the exponential-time hypothesis. Also, our algorithm can be extended to handle geometric intersection graphs of similarly sized fat objects without increasing the running time. △ Less

Submitted 8 July, 2021; originally announced July 2021.

arXiv:2106.04973 [pdf, other]

Reachability Problems for Transmission Graphs

Authors: Shinwoo An, Eun** Oh

Abstract: Let $P$ be a set of $n$ points in the plane where each point $p$ of $P$ is associated with a radius $r_p>0$.The transmission graph $G=(P,E)$ of $P$ is defined as the directed graph such that $E$ contains an edge from $p$ to $q$ if and only if $|pq|\leq r_p$ for any two points $p$ and $q$ in $P$, where $|pq|$ denotes the Euclidean distance between $p$ and $q$. In this paper, we present a data struc… ▽ More Let $P$ be a set of $n$ points in the plane where each point $p$ of $P$ is associated with a radius $r_p>0$.The transmission graph $G=(P,E)$ of $P$ is defined as the directed graph such that $E$ contains an edge from $p$ to $q$ if and only if $|pq|\leq r_p$ for any two points $p$ and $q$ in $P$, where $|pq|$ denotes the Euclidean distance between $p$ and $q$. In this paper, we present a data structure of size $O(n^{5/3})$ such that for any two points in $P$, we can check in $O(n^{2/3})$ time if there is a path in $G$ between the two points. This is the first data structure for answering reachability queries whose performance depends only on $n$ but not on the number of edges. △ Less

Submitted 9 June, 2021; originally announced June 2021.

Comments: To appear in WADS2021

arXiv:2105.12741 [pdf, other]

doi 10.3847/1538-4357/abfa95

Living with Neighbors. IV. Dissecting the Spin$-$Orbit Alignment of Dark Matter Halos: Interacting Neighbors and the Local Large-scale Structure

Authors: Sung-Ho An, Juhan Kim, Jun-Sung Moon, Suk-** Yoon

Abstract: Spin$-$orbit alignment (SOA; i.e., the vector alignment between the halo spin and the orbital angular momentum of neighboring halos) provides an important clue to how galactic angular momenta develop. For this study, we extract virial-radius-wise contact halo pairs with mass ratios between 1/10 and 10 from a set of cosmological $N$-body simulations. In the spin--orbit angle distribution, we find a… ▽ More Spin$-$orbit alignment (SOA; i.e., the vector alignment between the halo spin and the orbital angular momentum of neighboring halos) provides an important clue to how galactic angular momenta develop. For this study, we extract virial-radius-wise contact halo pairs with mass ratios between 1/10 and 10 from a set of cosmological $N$-body simulations. In the spin--orbit angle distribution, we find a significant SOA in that 52.7%$\pm$0.2% of neighbors are on the prograde orbit. The SOA of our sample is mainly driven by low-mass target halos ($<10^{11.5}h^{-1}M_{\odot}$) with close merging neighbors, corroborating the notion that the tidal interaction is one of the physical origins of SOA. We also examine the correlation of SOA with the adjacent filament and find that halos closer to the filament show stronger SOA. Most interestingly, we discover for the first time that halos with the spin parallel to the filament experience most frequently the prograde-polar interaction (i.e., fairly perpendicular but still prograde interaction; spin--orbit angle $\sim$ 70$^{\circ}$). This instantly invokes the spin-flip event and the prograde-polar interaction will soon flip the spin of the halo to align it with the neighbor's orbital angular momentum. We propose that the SOA originates from the local cosmic flow along the anisotropic large-scale structure, especially that along the filament, and grows further by interactions with neighbors. △ Less

Submitted 26 May, 2021; originally announced May 2021.

Comments: 18 pages, 12 figures, accepted for publication in ApJ. arXiv admin note: text overlap with arXiv:2005.06479

arXiv:2105.10867 [pdf, other]

EXoN: EXplainable encoder Network

Authors: SeungHwan An, Hosik Choi, Jong-June Jeon

Abstract: We propose a new semi-supervised learning method of Variational AutoEncoder (VAE) which yields a customized and explainable latent space by EXplainable encoder Network (EXoN). Customization means a manual design of latent space layout for specific labeled data. To improve the performance of our VAE in a classification task without the loss of performance as a generative model, we employ a new semi… ▽ More We propose a new semi-supervised learning method of Variational AutoEncoder (VAE) which yields a customized and explainable latent space by EXplainable encoder Network (EXoN). Customization means a manual design of latent space layout for specific labeled data. To improve the performance of our VAE in a classification task without the loss of performance as a generative model, we employ a new semi-supervised classification method called SCI (Soft-label Consistency Interpolation). The classification loss and the Kullback-Leibler divergence play a crucial role in constructing explainable latent space. The variability of generated samples from our proposed model depends on a specific subspace, called activated latent subspace. Our numerical results with MNIST and CIFAR-10 datasets show that EXoN produces an explainable latent space and reduces the cost of investigating representation patterns on the latent space. △ Less

Submitted 17 October, 2022; v1 submitted 23 May, 2021; originally announced May 2021.

arXiv:2104.14115 [pdf, other]

LIQA: Lifelong Blind Image Quality Assessment

Authors: Jianzhao Liu, Wei Zhou, Jiahua Xu, Xin Li, Shukun An, Zhibo Chen

Abstract: Existing blind image quality assessment (BIQA) methods are mostly designed in a disposable way and cannot evolve with unseen distortions adaptively, which greatly limits the deployment and application of BIQA models in real-world scenarios. To address this problem, we propose a novel Lifelong blind Image Quality Assessment (LIQA) approach, targeting to achieve the lifelong learning of BIQA. Withou… ▽ More Existing blind image quality assessment (BIQA) methods are mostly designed in a disposable way and cannot evolve with unseen distortions adaptively, which greatly limits the deployment and application of BIQA models in real-world scenarios. To address this problem, we propose a novel Lifelong blind Image Quality Assessment (LIQA) approach, targeting to achieve the lifelong learning of BIQA. Without accessing to previous training data, our proposed LIQA can not only learn new distortions, but also mitigate the catastrophic forgetting of seen distortions. Specifically, we adopt the Split-and-Merge distillation strategy to train a single-head network that makes task-agnostic predictions. In the split stage, we first employ a distortion-specific generator to obtain the pseudo features of each seen distortion. Then, we use an auxiliary multi-head regression network to generate the predicted quality of each seen distortion. In the merge stage, we replay the pseudo features paired with pseudo labels to distill the knowledge of multiple heads, which can build the final regressed single head. Experimental results demonstrate that the proposed LIQA method can handle the continuous shifts of different distortion types and even datasets. More importantly, our LIQA model can achieve stable performance even if the task sequence is long. △ Less

Submitted 29 April, 2021; originally announced April 2021.

arXiv:2104.02326 [pdf, other]

Self-Supervised Learning based CT Denoising using Pseudo-CT Image Pairs

Authors: Dongkyu Won, Eui** Jung, Sion An, Philip Chikontwe, Sang Hyun Park

Abstract: Recently, Self-supervised learning methods able to perform image denoising without ground truth labels have been proposed. These methods create low-quality images by adding random or Gaussian noise to images and then train a model for denoising. Ideally, it would be beneficial if one can generate high-quality CT images with only a few training samples via self-supervision. However, the performance… ▽ More Recently, Self-supervised learning methods able to perform image denoising without ground truth labels have been proposed. These methods create low-quality images by adding random or Gaussian noise to images and then train a model for denoising. Ideally, it would be beneficial if one can generate high-quality CT images with only a few training samples via self-supervision. However, the performance of CT denoising is generally limited due to the complexity of CT noise. To address this problem, we propose a novel self-supervised learning-based CT denoising method. In particular, we train pre-train CT denoising and noise models that can predict CT noise from Low-dose CT (LDCT) using available LDCT and Normal-dose CT (NDCT) pairs. For a given test LDCT, we generate Pseudo-LDCT and NDCT pairs using the pre-trained denoising and noise models and then update the parameters of the denoising model using these pairs to remove noise in the test LDCT. To make realistic Pseudo LDCT, we train multiple noise models from individual images and generate the noise using the ensemble of noise models. We evaluate our method on the 2016 AAPM Low-Dose CT Grand Challenge dataset. The proposed ensemble noise model can generate realistic CT noise, and thus our method significantly improves the denoising performance existing denoising models trained by supervised- and self-supervised learning. △ Less

Submitted 6 April, 2021; originally announced April 2021.

arXiv:2103.12393 [pdf, other]

RISC-NN: Use RISC, NOT CISC as Neural Network Hardware Infrastructure

Authors: Taoran Xiang, Lunkai Zhang, Shuqian An, Xiaochun Ye, Mingzhe Zhang, Yanhuan Liu, Mingyu Yan, Da Wang, Hao Zhang, Wenming Li, Ninghui Sun, Dongrui Fan

Abstract: Neural Networks (NN) have been proven to be powerful tools to analyze Big Data. However, traditional CPUs cannot achieve the desired performance and/or energy efficiency for NN applications. Therefore, numerous NN accelerators have been used or designed to meet these goals. These accelerators all fall into three categories: GPGPUs, ASIC NN Accelerators and CISC NN Accelerators. Though CISC NN Acce… ▽ More Neural Networks (NN) have been proven to be powerful tools to analyze Big Data. However, traditional CPUs cannot achieve the desired performance and/or energy efficiency for NN applications. Therefore, numerous NN accelerators have been used or designed to meet these goals. These accelerators all fall into three categories: GPGPUs, ASIC NN Accelerators and CISC NN Accelerators. Though CISC NN Accelerators can achieve considerable smaller memory footprint than GPGPU thus improve energy efficiency; they still fail to provide same level of data reuse optimization achieved by ASIC NN Accelerators because of the inherited poor pragrammability of their CISC architecture. We argue that, for NN Accelerators, RISC is a better design choice than CISC, as is the case with general purpose processors. We propose RISC-NN, a novel many-core RISC-based NN accelerator that achieves high expressiveness and high parallelism and features strong programmability and low control-hardware costs. We show that, RISC-NN can implement all the necessary instructions of state-of-the-art CISC NN Accelerators; in the meantime, RISC-NN manages to achieve advanced optimization such as multiple-level data reuse and support for Sparse NN applications which previously only existed in ASIC NN Accelerators. Experiment results show that, RISC-NN achieves on average 11.88X performance efficiency compared with state-of-the-art Nvidia TITAN Xp GPGPU for various NN applications. RISC-NN also achieves on average 1.29X, 8.37X and 21.71X performance efficiency over CISC-based TPU in CNN, MLP and LSTM applications, respectively. Finally, RISC-NN can achieve additional 26.05% performance improvement and 33.13% energy reduction after applying pruning for Sparse NN applications. △ Less

Submitted 23 March, 2021; originally announced March 2021.

arXiv:2103.11315 [pdf, other]

doi 10.1038/s41467-021-26205-y

Rapid and Unconditional Parametric Reset Protocol for Tunable Superconducting Qubits

Authors: Yu Zhou, Zhenxing Zhang, Zelong Yin, Sainan Huai, Xiu Gu, Xiong Xu, Jonathan Allcock, Fuming Liu, Guanglei Xi, Qiaonian Yu, Hualiang Zhang, Mengyu Zhang, Hekang Li, Xiaohui Song, Zhan Wang, Dongning Zheng, Shuoming An, Yarui Zheng, Shengyu Zhang

Abstract: Qubit initialization is a critical task in quantum computation and communication. Extensive efforts have been made to achieve this with high speed, efficiency and scalability. However, previous approaches have either been measurement-based and required fast feedback, suffered from crosstalk or required sophisticated calibration. Here, we report a fast and high-fidelity reset scheme, avoiding the i… ▽ More Qubit initialization is a critical task in quantum computation and communication. Extensive efforts have been made to achieve this with high speed, efficiency and scalability. However, previous approaches have either been measurement-based and required fast feedback, suffered from crosstalk or required sophisticated calibration. Here, we report a fast and high-fidelity reset scheme, avoiding the issues above without any additional chip architecture. By modulating the flux through a transmon qubit, we realize a swap between the qubit and its readout resonator that suppresses the excited state population to 0.08% $\pm$ 0.08% within 34 ns (284 ns if photon depletion of the resonator is required). Furthermore, our approach (i) can achieve effective second excited state depletion, (ii) has negligible effects on neighbouring qubits, and (iii) offers a way to entangle the qubit with an itinerant single photon, useful in quantum communication applications. △ Less

Submitted 22 November, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

Comments: 38 pages, 15 figures

arXiv:2102.11895 [pdf, other]

doi 10.1145/3485434

MGait: Model-Based Gait Analysis Using Wearable Bend and Inertial Sensors

Authors: Sizhe An, Yigit Tuncel, Toygun Basaklar, Gokul Krishna Krishnakumar, Ganapati Bhat, Umit Ogras

Abstract: Movement disorders, such as Parkinson's disease, affect more than 10 million people worldwide. Gait analysis is a critical step in the diagnosis and rehabilitation of these disorders. Specifically, step length provides valuable insights into the gait quality and rehabilitation process. However, traditional approaches for estimating step length are not suitable for continuous daily monitoring since… ▽ More Movement disorders, such as Parkinson's disease, affect more than 10 million people worldwide. Gait analysis is a critical step in the diagnosis and rehabilitation of these disorders. Specifically, step length provides valuable insights into the gait quality and rehabilitation process. However, traditional approaches for estimating step length are not suitable for continuous daily monitoring since they rely on special mats and clinical environments. To address this limitation, we present a novel and practical step-length estimation technique using low-power wearable bend and inertial sensors. Experimental results show that the proposed model estimates step length with 5.49% mean absolute percentage error and provides accurate real-time feedback to the user. △ Less

Submitted 7 September, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

Journal ref: ACM Transactions on Internet of Things 3.1 (2021): 1-24. (Presented in ACM SenSys 2021)

arXiv:2102.07498 [pdf, other]

JEST: N+1-version Differential Testing of Both JavaScript Engines and Specification

Authors: Jihyeok Park, Seungmin An, Dongjun Youn, Gyeongwon Kim, Sukyoung Ryu

Abstract: Modern programming follows the continuous integration (CI) and continuous deployment (CD) approach rather than the traditional waterfall model. Even the development of modern programming languages uses the CI/CD approach to swiftly provide new language features and to adapt to new development environments. Unlike in the conventional approach, in the modern CI/CD approach, a language specification… ▽ More Modern programming follows the continuous integration (CI) and continuous deployment (CD) approach rather than the traditional waterfall model. Even the development of modern programming languages uses the CI/CD approach to swiftly provide new language features and to adapt to new development environments. Unlike in the conventional approach, in the modern CI/CD approach, a language specification is no more the oracle of the language semantics because both the specification and its implementations can co-evolve. In this setting, both the specification and implementations may have bugs, and guaranteeing their correctness is non-trivial. In this paper, we propose a novel N+1-version differential testing to resolve the problem. Unlike the traditional differential testing, our approach consists of three steps: 1) to automatically synthesize programs guided by the syntax and semantics from a given language specification, 2) to generate conformance tests by injecting assertions to the synthesized programs to check their final program states, 3) to detect bugs in the specification and implementations via executing the conformance tests on multiple implementations, and 4) to localize bugs on the specification using statistical information. We actualize our approach for the JavaScript programming language via JEST, which performs N+1-version differential testing for modern JavaScript engines and ECMAScript, the language specification describing the syntax and semantics of JavaScript in a natural language. We evaluated JEST with four JavaScript engines that support all modern JavaScript language features and the latest version of ECMAScript (ES11, 2020). JEST automatically synthesized 1,700 programs that covered 97.78% of syntax and 87.70% of semantics from ES11. Using the assertion-injection, it detected 44 engine bugs in four engines and 27 specification bugs in ES11. △ Less

Submitted 15 February, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

Comments: 12 pages, 5 figures, 3 tables, In Proceedings of the ACM/IEEE 43rd International Conference on Software Engineering (ICSE 2021)

arXiv:2102.07067 [pdf, other]

Fast Monocular Hand Pose Estimation on Embedded Systems

Authors: Shan An, Xiajie Zhang, Dong Wei, Haogang Zhu, Jianyu Yang, Konstantinos A. Tsintotas

Abstract: Hand pose estimation is a fundamental task in many human-robot interaction-related applications. However, previous approaches suffer from unsatisfying hand landmark predictions in real-world scenes and high computation burden. This paper proposes a fast and accurate framework for hand pose estimation, dubbed as "FastHand". Using a lightweight encoder-decoder network architecture, FastHand fulfills… ▽ More Hand pose estimation is a fundamental task in many human-robot interaction-related applications. However, previous approaches suffer from unsatisfying hand landmark predictions in real-world scenes and high computation burden. This paper proposes a fast and accurate framework for hand pose estimation, dubbed as "FastHand". Using a lightweight encoder-decoder network architecture, FastHand fulfills the requirements of practical applications running on embedded devices. The encoder consists of deep layers with a small number of parameters, while the decoder makes use of spatial location information to obtain more accurate results. The evaluation took place on two publicly available datasets demonstrating the improved performance of the proposed pipeline compared to other state-of-the-art approaches. FastHand offers high accuracy scores while reaching a speed of 25 frames per second on an NVIDIA Jetson TX2 graphics processing unit. △ Less

Submitted 11 October, 2021; v1 submitted 13 February, 2021; originally announced February 2021.

arXiv:2102.05123 [pdf, other]

Backdoor Scanning for Deep Neural Networks through K-Arm Optimization

Authors: Guangyu Shen, Yingqi Liu, Guanhong Tao, Shengwei An, Qiuling Xu, Siyuan Cheng, Shiqing Ma, Xiangyu Zhang

Abstract: Back-door attack poses a severe threat to deep learning systems. It injects hidden malicious behaviors to a model such that any input stamped with a special pattern can trigger such behaviors. Detecting back-door is hence of pressing need. Many existing defense techniques use optimization to generate the smallest input pattern that forces the model to misclassify a set of benign inputs injected wi… ▽ More Back-door attack poses a severe threat to deep learning systems. It injects hidden malicious behaviors to a model such that any input stamped with a special pattern can trigger such behaviors. Detecting back-door is hence of pressing need. Many existing defense techniques use optimization to generate the smallest input pattern that forces the model to misclassify a set of benign inputs injected with the pattern to a target label. However, the complexity is quadratic to the number of class labels such that they can hardly handle models with many classes. Inspired by Multi-Arm Bandit in Reinforcement Learning, we propose a K-Arm optimization method for backdoor detection. By iteratively and stochastically selecting the most promising labels for optimization with the guidance of an objective function, we substantially reduce the complexity, allowing to handle models with many classes. Moreover, by iteratively refining the selection of labels to optimize, it substantially mitigates the uncertainty in choosing the right labels, improving detection accuracy. At the time of submission, the evaluation of our method on over 4000 models in the IARPA TrojAI competition from round 1 to the latest round 4 achieves top performance on the leaderboard. Our technique also supersedes three state-of-the-art techniques in terms of accuracy and the scanning time needed. △ Less

Submitted 2 August, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

arXiv:2102.04979 [pdf, ps, other]

The Stembridge Equality for Skew Stable Grothendieck Polynomials and Skew Dual Stable Grothendieck Polynomials

Authors: Fiona Abney-McPeek, Serena An, Jakin Ng

Abstract: The Schur polynomials $s_λ$ are essential in understanding the representation theory of the general linear group. They also describe the cohomology ring of the Grassmannians. For $ρ= (n, n-1, \dots, 1)$ a staircase shape and $μ\subseteq ρ$ a subpartition, the Stembridge equality states that $s_{ρ/μ} = s_{ρ/μ^T}$. This equality provides information about the symmetry of the cohomology ring. The sta… ▽ More The Schur polynomials $s_λ$ are essential in understanding the representation theory of the general linear group. They also describe the cohomology ring of the Grassmannians. For $ρ= (n, n-1, \dots, 1)$ a staircase shape and $μ\subseteq ρ$ a subpartition, the Stembridge equality states that $s_{ρ/μ} = s_{ρ/μ^T}$. This equality provides information about the symmetry of the cohomology ring. The stable Grothendieck polynomials $G_λ$, and the dual stable Grothendieck polynomials $g_λ$, developed by Buch, Lam, and Pylyavskyy, are variants of the Schur polynomials and describe the $K$-theory of the Grassmannians. Using the Hopf algebra structure of the ring of symmetric functions and a generalized Littlewood-Richardson rule, we prove that $G_{ρ/μ} = G_{ρ/μ^T}$ and $g_{ρ/μ} = g_{ρ/μ^T}$, the analogues of the Stembridge equality for the skew stable and skew dual stable Grothendieck polynomials. △ Less

Submitted 2 October, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

Comments: 23 pages, 0 figures

MSC Class: 05E05

arXiv:2102.01761 [pdf]

Deep Convolutional Neural Networks to Predict Mutual Coupling Effects in Metasurfaces

Authors: Sensong An, Bowen Zheng, Mikhail Y. Shalaginov, Hong Tang, Hang Li, Li Zhou, Yunxi Dong, Mohammad Haerinia, Anuradha Murthy Agarwal, Clara Rivero-Baleine, Myungkoo Kang, Kathleen A. Richardson, Tian Gu, Juejun Hu, Clayton Fowler, Hualiang Zhang

Abstract: Metasurfaces have provided a novel and promising platform for the realization of compact and large-scale optical devices. The conventional metasurface design approach assumes periodic boundary conditions for each element, which is inaccurate in most cases since the near-field coupling effects between elements will change when surrounded by non-identical structures. In this paper, we propose a deep… ▽ More Metasurfaces have provided a novel and promising platform for the realization of compact and large-scale optical devices. The conventional metasurface design approach assumes periodic boundary conditions for each element, which is inaccurate in most cases since the near-field coupling effects between elements will change when surrounded by non-identical structures. In this paper, we propose a deep learning approach to predict the actual electromagnetic (EM) responses of each target meta-atom placed in a large array with near-field coupling effects taken into account. The predicting neural network takes the physical specifications of the target meta-atom and its neighbors as input, and calculates its phase and amplitude in milliseconds. This approach can be applied to explain metasurfaces' performance deterioration caused by mutual coupling and further used to optimize their efficiencies once combined with optimization algorithms. To demonstrate the efficacy of this methodology, we obtain large improvements in efficiency for a beam deflector and a metalens over the conventional design approach. Moreover, we show the correlations between a metasurface's performance and its design errors caused by mutual coupling are not bound to certain specifications (materials, shapes, etc.). As such, we envision that this approach can be readily applied to explore the mutual coupling effects and improve the performance of various metasurface designs. △ Less

Submitted 2 February, 2021; originally announced February 2021.

Comments: 16 pages, 10 figures

arXiv:2102.01701 [pdf, ps, other]

doi 10.3847/1538-4357/abda3b

Living with Neighbors. III. The Origin of the Spin$-$Orbit Alignment of Galaxy Pairs: A Neighbor versus the Large-scale Structure

Authors: Jun-Sung Moon, Sung-Ho An, Suk-** Yoon

Abstract: Recent observations revealed a coherence between the spin vector of a galaxy and the orbital motion of its neighbors. We refer to the phenomenon as "the spin$-$orbit alignment (SOA)" and explore its physical origin via the IllustrisTNG simulation. This is the first study to utilize a cosmological hydrodynamic simulation to investigate the SOA of galaxy pairs. In particular, we identify paired gala… ▽ More Recent observations revealed a coherence between the spin vector of a galaxy and the orbital motion of its neighbors. We refer to the phenomenon as "the spin$-$orbit alignment (SOA)" and explore its physical origin via the IllustrisTNG simulation. This is the first study to utilize a cosmological hydrodynamic simulation to investigate the SOA of galaxy pairs. In particular, we identify paired galaxies at $z = 0$ having the nearest neighbor with mass ratios from 1/10 to 10 and calculate the spin$-$orbit angle for each pair. Our results are as follows. (a) There exists a clear preference for prograde orientations (i.e., SOA) for galaxy pairs, qualitatively consistent with observations. (b) The SOA is significant for both baryonic and dark matter spins, being the strongest for gas and the weakest for dark matter. (c) The SOA is stronger for less massive targets and for targets having closer neighbors. (d) The SOA strengthens for galaxies in low-density regions, and the signal is dominated by central$-$satellite pairs in low-mass halos. (e) There is an explicit dependence of the SOA on the duration of interaction with its current neighbor. Taken together, we propose that the SOA witnessed at $z = 0$ has been developed mainly by interactions with a neighbor for an extended period of time, rather than tidal torque from the ambient large-scale structure. △ Less

Submitted 2 February, 2021; originally announced February 2021.

Comments: 16 pages, 12 figures, accepted for publication in ApJ

arXiv:2101.02358 [pdf, other]

OAAE: Adversarial Autoencoders for Novelty Detection in Multi-modal Normality Case via Orthogonalized Latent Space

Authors: Sungkwon An, Jeonghoon Kim, Myungjoo Kang, Shahbaz Razaei, Xin Liu

Abstract: Novelty detection using deep generative models such as autoencoder, generative adversarial networks mostly takes image reconstruction error as novelty score function. However, image data, high dimensional as it is, contains a lot of different features other than class information which makes models hard to detect novelty data. The problem gets harder in multi-modal normality case. To address this… ▽ More Novelty detection using deep generative models such as autoencoder, generative adversarial networks mostly takes image reconstruction error as novelty score function. However, image data, high dimensional as it is, contains a lot of different features other than class information which makes models hard to detect novelty data. The problem gets harder in multi-modal normality case. To address this challenge, we propose a new way of measuring novelty score in multi-modal normality cases using orthogonalized latent space. Specifically, we employ orthogonal low-rank embedding in the latent space to disentangle the features in the latent space using mutual class information. With the orthogonalized latent space, novelty score is defined by the change of each latent vector. Proposed algorithm was compared to state-of-the-art novelty detection algorithms using GAN such as RaPP and OCGAN, and experimental results show that ours outperforms those algorithms. △ Less

Submitted 6 January, 2021; originally announced January 2021.

Comments: Accepted to AAAI 2021 Workshop: Towards Robust, Secure and Efficient Machine Learning

arXiv:2012.06336 [pdf, other]

Construction and commissioning of CMS CE prototype silicon modules

Authors: B. Acar, G. Adamov, C. Adloff, S. Afanasiev, N. Akchurin, B. Akgün, M. Alhusseini, J. Alison, G. Altopp, M. Alyari, S. An, S. Anagul, I. Andreev, M. Andrews, P. Aspell, I. A. Atakisi, O. Bach, A. Baden, G. Bakas, A. Bakshi, P. Bargassa, D. Barney, E. Becheva, P. Behera, A. Belloni , et al. (307 additional authors not shown)

Abstract: As part of its HL-LHC upgrade program, the CMS Collaboration is develo** a High Granularity Calorimeter (CE) to replace the existing endcap calorimeters. The CE is a sampling calorimeter with unprecedented transverse and longitudinal readout for both electromagnetic (CE-E) and hadronic (CE-H) compartments. The calorimeter will be built with $\sim$30,000 hexagonal silicon modules. Prototype modul… ▽ More As part of its HL-LHC upgrade program, the CMS Collaboration is develo** a High Granularity Calorimeter (CE) to replace the existing endcap calorimeters. The CE is a sampling calorimeter with unprecedented transverse and longitudinal readout for both electromagnetic (CE-E) and hadronic (CE-H) compartments. The calorimeter will be built with $\sim$30,000 hexagonal silicon modules. Prototype modules have been constructed with 6-inch hexagonal silicon sensors with cell areas of 1.1~$cm^2$, and the SKIROC2-CMS readout ASIC. Beam tests of different sampling configurations were conducted with the prototype modules at DESY and CERN in 2017 and 2018. This paper describes the construction and commissioning of the CE calorimeter prototype, the silicon modules used in the construction, their basic performance, and the methods used for their calibration. △ Less

Submitted 10 December, 2020; originally announced December 2020.

Comments: 35 pages, submitted to JINST

arXiv:2012.04479 [pdf, other]

Transfer Learning for Human Activity Recognition using Representational Analysis of Neural Networks

Authors: Sizhe An, Ganapati Bhat, Suat Gumussoy, Umit Ogras

Abstract: Human activity recognition (HAR) research has increased in recent years due to its applications in mobile health monitoring, activity recognition, and patient rehabilitation. The typical approach is training a HAR classifier offline with known users and then using the same classifier for new users. However, the accuracy for new users can be low with this approach if their activity patterns are dif… ▽ More Human activity recognition (HAR) research has increased in recent years due to its applications in mobile health monitoring, activity recognition, and patient rehabilitation. The typical approach is training a HAR classifier offline with known users and then using the same classifier for new users. However, the accuracy for new users can be low with this approach if their activity patterns are different than those in the training data. At the same time, training from scratch for new users is not feasible for mobile applications due to the high computational cost and training time. To address this issue, we propose a HAR transfer learning framework with two components. First, a representational analysis reveals common features that can transfer across users and user-specific features that need to be customized. Using this insight, we transfer the reusable portion of the offline classifier to new users and fine-tune only the rest. Our experiments with five datasets show up to 43% accuracy improvement and 66% training time reduction when compared to the baseline without using transfer learning. Furthermore, measurements on the Nvidia Jetson Xavier-NX hardware platform reveal that the power and energy consumption decrease by 43% and 68%, respectively, while achieving the same or higher accuracy as training from scratch. △ Less

Submitted 23 February, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

arXiv:2012.03876 [pdf, other]

doi 10.1088/1748-0221/16/04/T04001

The DAQ system of the 12,000 Channel CMS High Granularity Calorimeter Prototype

Authors: B. Acar, G. Adamov, C. Adloff, S. Afanasiev, N. Akchurin, B. Akgün, M. Alhusseini, J. Alison, G. Altopp, M. Alyari, S. An, S. Anagul, I. Andreev, M. Andrews, P. Aspell, I. A. Atakisi, O. Bach, A. Baden, G. Bakas, A. Bakshi, P. Bargassa, D. Barney, E. Becheva, P. Behera, A. Belloni , et al. (307 additional authors not shown)

Abstract: The CMS experiment at the CERN LHC will be upgraded to accommodate the 5-fold increase in the instantaneous luminosity expected at the High-Luminosity LHC (HL-LHC). Concomitant with this increase will be an increase in the number of interactions in each bunch crossing and a significant increase in the total ionising dose and fluence. One part of this upgrade is the replacement of the current endca… ▽ More The CMS experiment at the CERN LHC will be upgraded to accommodate the 5-fold increase in the instantaneous luminosity expected at the High-Luminosity LHC (HL-LHC). Concomitant with this increase will be an increase in the number of interactions in each bunch crossing and a significant increase in the total ionising dose and fluence. One part of this upgrade is the replacement of the current endcap calorimeters with a high granularity sampling calorimeter equipped with silicon sensors, designed to manage the high collision rates. As part of the development of this calorimeter, a series of beam tests have been conducted with different sampling configurations using prototype segmented silicon detectors. In the most recent of these tests, conducted in late 2018 at the CERN SPS, the performance of a prototype calorimeter equipped with ${\approx}12,000\rm{~channels}$ of silicon sensors was studied with beams of high-energy electrons, pions and muons. This paper describes the custom-built scalable data acquisition system that was built with readily available FPGA mezzanines and low-cost Raspberry PI computers. △ Less

Submitted 8 December, 2020; v1 submitted 7 December, 2020; originally announced December 2020.

arXiv:2011.09608 [pdf, other]

Bidirectional RNN-based Few Shot Learning for 3D Medical Image Segmentation

Authors: Soopil Kim, Sion An, Philip Chikontwe, Sang Hyun Park

Abstract: Segmentation of organs of interest in 3D medical images is necessary for accurate diagnosis and longitudinal studies. Though recent advances using deep learning have shown success for many segmentation tasks, large datasets are required for high performance and the annotation process is both time consuming and labor intensive. In this paper, we propose a 3D few shot segmentation framework for accu… ▽ More Segmentation of organs of interest in 3D medical images is necessary for accurate diagnosis and longitudinal studies. Though recent advances using deep learning have shown success for many segmentation tasks, large datasets are required for high performance and the annotation process is both time consuming and labor intensive. In this paper, we propose a 3D few shot segmentation framework for accurate organ segmentation using limited training samples of the target organ annotation. To achieve this, a U-Net like network is designed to predict segmentation by learning the relationship between 2D slices of support data and a query image, including a bidirectional gated recurrent unit (GRU) that learns consistency of encoded features between adjacent slices. Also, we introduce a transfer learning method to adapt the characteristics of the target image and organ by updating the model before testing with arbitrary support and query data sampled from the support data. We evaluate our proposed model using three 3D CT datasets with annotations of different organs. Our model yielded significantly improved performance over state-of-the-art few shot segmentation models and was comparable to a fully supervised model trained with more target training data. △ Less

Submitted 18 November, 2020; originally announced November 2020.

Comments: Submitted to AAAI21

arXiv:2010.13287 [pdf, ps, other]

doi 10.1007/JHEP10(2021)085

Notes on the post-bounce background dynamics in bouncing cosmologies

Authors: Ok Song An, ** U Kang, Thae Hyok Kim, Ui Ri Mun

Abstract: We investigate the post-bounce background dynamics in a certain class of single bounce scenarios studied in the literature, in which the cosmic bounce is driven by a scalar field with negative exponential potential such as the ekpyrotic potential. We show that those models can actually lead to cyclic evolutions with repeated bounces. These cyclic evolutions, however, do not account for the current… ▽ More We investigate the post-bounce background dynamics in a certain class of single bounce scenarios studied in the literature, in which the cosmic bounce is driven by a scalar field with negative exponential potential such as the ekpyrotic potential. We show that those models can actually lead to cyclic evolutions with repeated bounces. These cyclic evolutions, however, do not account for the currently observed late-time accelerated expansion and hence are not cosmologically viable. In this respect we consider a new kind of cyclic model proposed recently and derive some cosmological constraints on this model. △ Less

Submitted 16 September, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

Comments: 26 pages, 15 figures. Significantly revised and accepted version for JHEP

arXiv:2010.11703 [pdf, other]

Fast and Incremental Loop Closure Detection with Deep Features and Proximity Graphs

Authors: Shan An, Haogang Zhu, Dong Wei, Konstantinos A. Tsintotas, Antonios Gasteratos

Abstract: In recent years, the robotics community has extensively examined methods concerning the place recognition task within the scope of simultaneous localization and map** applications.This article proposes an appearance-based loop closure detection pipeline named ``FILD++" (Fast and Incremental Loop closure Detection).First, the system is fed by consecutive images and, via passing them twice through… ▽ More In recent years, the robotics community has extensively examined methods concerning the place recognition task within the scope of simultaneous localization and map** applications.This article proposes an appearance-based loop closure detection pipeline named ``FILD++" (Fast and Incremental Loop closure Detection).First, the system is fed by consecutive images and, via passing them twice through a single convolutional neural network, global and local deep features are extracted.Subsequently, a hierarchical navigable small-world graph incrementally constructs a visual database representing the robot's traversed path based on the computed global features.Finally, a query image, grabbed each time step, is set to retrieve similar locations on the traversed route.An image-to-image pairing follows, which exploits local features to evaluate the spatial information. Thus, in the proposed article, we propose a single network for global and local feature extraction in contrast to our previous work (FILD), while an exhaustive search for the verification process is adopted over the generated deep local features avoiding the utilization of hash codes. Exhaustive experiments on eleven publicly available datasets exhibit the system's high performance (achieving the highest recall score on eight of them) and low execution times (22.05 ms on average in New College, which is the largest one containing 52480 images) compared to other state-of-the-art approaches. △ Less

Submitted 2 January, 2022; v1 submitted 28 September, 2020; originally announced October 2020.

Comments: submitted to Journal of Field Robotics

arXiv:2010.04869 [pdf, other]

Direct Measurement of Curvature-Dependent Surface Tension in a Capillary-Condensed Alcohol Nanomeniscus

Authors: Dohyun Kim, Jongwoo Kim, Jonggeun Hwang, Dongha Shin, Sangmin An, Wonho Jhe

Abstract: Surface tension is a key parameter for understanding nucleation from the very initial stage of phase transformation. Although surface tension has been predicted to vary with the curvature of the liquid-vapor interface, particularly at the large curvature of, e.g., the subnanometric critical nucleus, experimental study still remains challenging due to inaccessibility to such a small cluster. Here,… ▽ More Surface tension is a key parameter for understanding nucleation from the very initial stage of phase transformation. Although surface tension has been predicted to vary with the curvature of the liquid-vapor interface, particularly at the large curvature of, e.g., the subnanometric critical nucleus, experimental study still remains challenging due to inaccessibility to such a small cluster. Here, by directly measuring the critical size of a single capillary-condensed nanomeniscus using atomic force microscopy, we address the curvature dependence of surface tension of alcohols and observe the surface tension is doubled for ethanol and n-propanol with the radius-of-curvature of ~ -0.46 nm. We also find that the interface of larger negative (positive) curvature exhibits the larger (smaller) surface tension, which evidently governs nucleation at ~ 1 nm scale, indicating more facilitated nucleation than normally expected. Such well characterized curvature effects contribute to better understanding and accurate analysis of nucleation occurring in various fields including material science and atmospheric science. △ Less

Submitted 9 October, 2020; originally announced October 2020.

Comments: 5 pages, 4 figures and Supplementary Material(pdf)

arXiv:2009.13826 [pdf, other]

EEMC: Embedding Enhanced Multi-tag Classification

Authors: Yanlin Li, Shi An, Ruisheng Zhang

Abstract: The recently occurred representation learning make an attractive performance in NLP and complex network, it is becoming a fundamental technology in machine learning and data mining. How to use representation learning to improve the performance of classifiers is a very significance research direction. We using representation learning technology to map raw data(node of graph) to a low-dimensional fe… ▽ More The recently occurred representation learning make an attractive performance in NLP and complex network, it is becoming a fundamental technology in machine learning and data mining. How to use representation learning to improve the performance of classifiers is a very significance research direction. We using representation learning technology to map raw data(node of graph) to a low-dimensional feature space. In this space, each raw data obtained a lower dimensional vector representation, we do some simple linear operations for those vectors to produce some virtual data, using those vectors and virtual data to training multi-tag classifier. After that we measured the performance of classifier by F1 score(Macro% F1 and Micro% F1). Our method make Macro F1 rise from 28 % - 450% and make average F1 score rise from 12 % - 224%. By contrast, we trained the classifier directly with the lower dimensional vector, and measured the performance of classifiers. We validate our algorithm on three public data sets, we found that the virtual data helped the classifier greatly improve the F1 score. Therefore, our algorithm is a effective way to improve the performance of classifier. These result suggest that the virtual data generated by simple linear operation, in representation space, still retains the information of the raw data. It's also have great significance to the learning of small sample data sets. △ Less

Submitted 29 September, 2020; originally announced September 2020.

arXiv:2008.10400 [pdf, other]

An Ensemble of Simple Convolutional Neural Network Models for MNIST Digit Recognition

Authors: Sanghyeon An, Minjun Lee, Sanglee Park, Heerin Yang, Jungmin So

Abstract: We report that a very high accuracy on the MNIST test set can be achieved by using simple convolutional neural network (CNN) models. We use three different models with 3x3, 5x5, and 7x7 kernel size in the convolution layers. Each model consists of a set of convolution layers followed by a single fully connected layer. Every convolution layer uses batch normalization and ReLU activation, and poolin… ▽ More We report that a very high accuracy on the MNIST test set can be achieved by using simple convolutional neural network (CNN) models. We use three different models with 3x3, 5x5, and 7x7 kernel size in the convolution layers. Each model consists of a set of convolution layers followed by a single fully connected layer. Every convolution layer uses batch normalization and ReLU activation, and pooling is not used. Rotation and translation is used to augment training data, which is frequently used in most image classification tasks. A majority voting using the three models independently trained on the training data set can achieve up to 99.87% accuracy on the test set, which is one of the state-of-the-art results. A two-layer ensemble, a heterogeneous ensemble of three homogeneous ensemble networks, can achieve up to 99.91% test accuracy. The results can be reproduced by using the code at: https://github.com/ansh941/MnistSimpleCNN △ Less

Submitted 4 October, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

Comments: 10 pages, 12 figures, 7 tables

arXiv:2008.06659 [pdf]

doi 10.1038/s41565-021-00881-9

Electrically Reconfigurable Nonvolatile Metasurface Using Low-Loss Optical Phase Change Material

Authors: Yifei Zhang, Clayton Fowler, Junhao Liang, Bilal Azhar, Mikhail Y. Shalaginov, Skylar Deckoff-Jones, Sensong An, Jeffrey B. Chou, Christopher M. Roberts, Vladimir Liberman, Myungkoo Kang, Carlos Ríos, Kathleen A. Richardson, Clara Rivero-Baleine, Tian Gu, Hualiang Zhang, Juejun Hu

Abstract: Active metasurfaces promise reconfigurable optics with drastically improved compactness, ruggedness, manufacturability, and functionality compared to their traditional bulk counterparts. Optical phase change materials (O-PCMs) offer an appealing material solution for active metasurface devices with their large index contrast and nonvolatile switching characteristics. Here we report what we believe… ▽ More Active metasurfaces promise reconfigurable optics with drastically improved compactness, ruggedness, manufacturability, and functionality compared to their traditional bulk counterparts. Optical phase change materials (O-PCMs) offer an appealing material solution for active metasurface devices with their large index contrast and nonvolatile switching characteristics. Here we report what we believe to be the first electrically reconfigurable nonvolatile metasurfaces based on O-PCMs. The O-PCM alloy used in the devices, Ge2Sb2Se4Te1 (GSST), uniquely combines giant non-volatile index modulation capability, broadband low optical loss, and a large reversible switching volume, enabling significantly enhanced light-matter interactions within the active O-PCM medium. Capitalizing on these favorable attributes, we demonstrated continuously tunable active metasurfaces with record half-octave spectral tuning range and large optical contrast of over 400%. We further prototyped a polarization-insensitive phase-gradient metasurface to realize dynamic optical beam steering. △ Less

Submitted 2 September, 2020; v1 submitted 15 August, 2020; originally announced August 2020.

Comments: 12 pages, 5 figures

arXiv:2007.11256 [pdf, other]

Improving Monocular Depth Estimation by Leveraging Structural Awareness and Complementary Datasets

Authors: Tian Chen, Shijie An, Yuan Zhang, Chongyang Ma, Huayan Wang, Xiaoyan Guo, Wen Zheng

Abstract: Monocular depth estimation plays a crucial role in 3D recognition and understanding. One key limitation of existing approaches lies in their lack of structural information exploitation, which leads to inaccurate spatial layout, discontinuous surface, and ambiguous boundaries. In this paper, we tackle this problem in three aspects. First, to exploit the spatial relationship of visual features, we p… ▽ More Monocular depth estimation plays a crucial role in 3D recognition and understanding. One key limitation of existing approaches lies in their lack of structural information exploitation, which leads to inaccurate spatial layout, discontinuous surface, and ambiguous boundaries. In this paper, we tackle this problem in three aspects. First, to exploit the spatial relationship of visual features, we propose a structure-aware neural network with spatial attention blocks. These blocks guide the network attention to global structures or local details across different feature layers. Second, we introduce a global focal relative loss for uniform point pairs to enhance spatial constraint in the prediction, and explicitly increase the penalty on errors in depth-wise discontinuous regions, which helps preserve the sharpness of estimation results. Finally, based on analysis of failure cases for prior methods, we collect a new Hard Case (HC) Depth dataset of challenging scenes, such as special lighting conditions, dynamic objects, and tilted camera angles. The new dataset is leveraged by an informed learning curriculum that mixes training examples incrementally to handle diverse data distributions. Experimental results show that our method outperforms state-of-the-art approaches by a large margin in terms of both prediction accuracy on NYUDv2 dataset and generalization performance on unseen datasets. △ Less

Submitted 22 July, 2020; originally announced July 2020.

Comments: 14 pages, 8 figures

arXiv:2007.07944 [pdf]

Multi-level Electro-thermal Switching of Optical Phase-Change Materials Using Graphene

Authors: Carlos Ríos, Yifei Zhang, Mikhail Shalaginov, Skylar Deckoff-Jones, Haozhe Wang, Sensong An, Hualiang Zhang, Myungkoo Kang, Kathleen A. Richardson, Christopher Roberts, Jeffrey B. Chou, Vladimir Liberman, Steven A. Vitale, **g Kong, Tian Gu, Juejun Hu

Abstract: Reconfigurable photonic systems featuring minimal power consumption are crucial for integrated optical devices in real-world technology. Current active devices available in foundries, however, use volatile methods to modulate light, requiring a constant supply of power and significant form factors. Essential aspects to overcoming these issues are the development of nonvolatile optical reconfigurat… ▽ More Reconfigurable photonic systems featuring minimal power consumption are crucial for integrated optical devices in real-world technology. Current active devices available in foundries, however, use volatile methods to modulate light, requiring a constant supply of power and significant form factors. Essential aspects to overcoming these issues are the development of nonvolatile optical reconfiguration techniques which are compatible with on-chip integration with different photonic platforms and do not disrupt their optical performances. In this paper, a solution is demonstrated using an optoelectronic framework for nonvolatile tunable photonics that employs undoped-graphene microheaters to thermally and reversibly switch the optical phase-change material Ge$_2$Sb$_2$Se$_4$Te$_1$ (GSST). An in-situ Raman spectroscopy method is utilized to demonstrate, in real-time, reversible switching between four different levels of crystallinity. Moreover, a 3D computational model is developed to precisely interpret the switching characteristics, and to quantify the impact of current saturation on power dissipation, thermal diffusion, and switching speed. This model is used to inform the design of nonvolatile active photonic devices; namely, broadband Si$_3$N$_4$ integrated photonic circuits with small form-factor modulators and reconfigurable metasurfaces displaying 2$π$ phase coverage through neural-network-designed GSST meta-atoms. This framework will enable scalable, low-loss nonvolatile applications across a diverse range of photonics platforms. △ Less

Submitted 15 July, 2020; originally announced July 2020.

Comments: 22 pages, 5 Figures, 2 tables

arXiv:2007.04543 [pdf, other]

Blur Invariant Kernel-Adaptive Network for Single Image Blind deblurring

Authors: Sungkwon An, Hyungmin Roh, Myungjoo Kang

Abstract: We present a novel, blind, single image deblurring method that utilizes information regarding blur kernels. Our model solves the deblurring problem by dividing it into two successive tasks: (1) blur kernel estimation and (2) sharp image restoration. We first introduce a kernel estimation network that produces adaptive blur kernels based on the analysis of the blurred image. The network learns the… ▽ More We present a novel, blind, single image deblurring method that utilizes information regarding blur kernels. Our model solves the deblurring problem by dividing it into two successive tasks: (1) blur kernel estimation and (2) sharp image restoration. We first introduce a kernel estimation network that produces adaptive blur kernels based on the analysis of the blurred image. The network learns the blur pattern of the input image and trains to generate the estimation of image-specific blur kernels. Subsequently, we propose a deblurring network that restores sharp images using the estimated blur kernel. To use the kernel efficiently, we propose a kernel-adaptive AE block that encodes features from both blurred images and blur kernels into a low dimensional space and then decodes them simultaneously to obtain an appropriately synthesized feature representation. We evaluate our model on REDS, GOPRO and Flickr2K datasets using various Gaussian blur kernels. Experiments show that our model can achieve state-of-the-art results on each dataset. △ Less

Submitted 15 December, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

Comments: 9 pages, 7 figures

ACM Class: I.4.3

arXiv:2006.10627 [pdf, other]

Compositional Generalization by Learning Analytical Expressions

Authors: Qian Liu, Shengnan An, Jian-Guang Lou, Bei Chen, Zeqi Lin, Yan Gao, Bin Zhou, Nanning Zheng, Dongmei Zhang

Abstract: Compositional generalization is a basic and essential intellective capability of human beings, which allows us to recombine known parts readily. However, existing neural network based models have been proven to be extremely deficient in such a capability. Inspired by work in cognition which argues compositionality can be captured by variable slots with symbolic functions, we present a refreshing v… ▽ More Compositional generalization is a basic and essential intellective capability of human beings, which allows us to recombine known parts readily. However, existing neural network based models have been proven to be extremely deficient in such a capability. Inspired by work in cognition which argues compositionality can be captured by variable slots with symbolic functions, we present a refreshing view that connects a memory-augmented neural model with analytical expressions, to achieve compositional generalization. Our model consists of two cooperative neural modules, Composer and Solver, fitting well with the cognitive argument while being able to be trained in an end-to-end manner via a hierarchical reinforcement learning algorithm. Experiments on the well-known benchmark SCAN demonstrate that our model seizes a great ability of compositional generalization, solving all challenges addressed by previous works with 100% accuracies. △ Less

Submitted 23 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

Comments: To appear in NeurIPS 2020 (Spotlight)

arXiv:2005.06479

Living with Neighbors. III. Scrutinizing the Spin$-$Orbit Alignment of Interacting Dark Matter Halo Pairs

Authors: Sung-Ho An, Juhan Kim, Jun-Sung Moon, Suk-** Yoon

Abstract: We present that the spin$-$orbit alignment (SOA; i.e., the angular alignment between the spin vector of a halo and the orbital angular momentum vector of its neighbor) provides an important clue to how galactic angular momenta develop. In particular, we identify virial-radius-wise contact halo pairs with mass ratios from 1/3 to 3 in a set of cosmological $N$-body simulations, and divide them into… ▽ More We present that the spin$-$orbit alignment (SOA; i.e., the angular alignment between the spin vector of a halo and the orbital angular momentum vector of its neighbor) provides an important clue to how galactic angular momenta develop. In particular, we identify virial-radius-wise contact halo pairs with mass ratios from 1/3 to 3 in a set of cosmological $N$-body simulations, and divide them into merger and flyby subsamples according to their total (kinetic+potential) energy. In the spin$-$orbit angle distribution, we find a significant SOA in that $75.0\pm0.6$ % of merging neighbors and $58.7\pm0.6$ % of flybying neighbors are on the prograde orbit. The overall SOA of our sample is mainly driven by fast-rotating halos, corroborating that a well-aligned interaction spins a halo faster. More interestingly, we find for the first time a strong number excess of nearly perpendicular but still prograde interactions ($\sim75^{\circ}$) in the spin$-$orbit angle distribution for both the merger and flyby cases. Such prograde-polar interactions predominate for slow-rotating halos, testifying that misaligned interactions reduce the halos' spin. The frequency of the prograde-polar interactions correlates with the halo mass, yet anticorrelates with the large-scale density. This instantly invokes the spin-flip phenomenon that is conditional on the mass and environment. The prograde-polar interaction will soon flip the spin of a slow-rotator to align with its neighbor's orbital angular momentum. Finally, we propose a scenario that connects the SOA to the ambient large-scale structure based on the spin-flip argument. △ Less

Submitted 2 June, 2020; v1 submitted 13 May, 2020; originally announced May 2020.

Comments: Found significant mistakes in our results

arXiv:2004.12650 [pdf, other]

Tailoring high-TN interlayer antiferromagnetism in a van der Waals itinerant magnet

Authors: Junho Seo, Eun Su An, Taesu Park, Soo-Yoon Hwang, Gi-Yeop Kim, Kyung Song, Eunseok Oh, Minhyuk Choi, Kenji Watanabe, Takashi Taniguchi, Youn Jung Jo, Han Woong Yeom, Si-Young Choi, Ji Hoon Shim, Jun Sung Kim

Abstract: Antiferromagnetic (AFM) van der Waals (vdW) materials provide a novel platform for synthetic AFM spintronics, in which the spin-related functionalities are derived from manipulating spin configurations between the layers. Metallic vdW antiferromagnets are expected to have several advantages over the widely-studied insulating counterparts in switching and detecting the spin states through electrica… ▽ More Antiferromagnetic (AFM) van der Waals (vdW) materials provide a novel platform for synthetic AFM spintronics, in which the spin-related functionalities are derived from manipulating spin configurations between the layers. Metallic vdW antiferromagnets are expected to have several advantages over the widely-studied insulating counterparts in switching and detecting the spin states through electrical currents but have been much less explored due to the lack of suitable materials. Here, utilizing the extreme sensitivity of the vdW interlayer magnetism to material composition, we report the itinerant antiferromagnetism in Co-doped Fe4GeTe2 with TN ~ 210 K, an order of magnitude increased as compared to other known AFM vdW metals. The resulting spin configurations and orientations are sensitively controlled by do**, magnetic field, temperature, and thickness, which are effectively read out by electrical conduction. These findings manifest strong merits of metallic vdW magnets with tunable interlayer exchange interaction and magnetic anisotropy, suitable for AFM spintronic applications. △ Less

Submitted 27 April, 2020; originally announced April 2020.

Comments: 19 pages, 4 figures, submitted

arXiv:2004.07675 [pdf, ps, other]

Software Challenges For HL-LHC Data Analysis

Authors: ROOT Team, Kim Albertsson Brann, Guilherme Amadio, Sitong An, Bertrand Bellenot, Jakob Blomer, Philippe Canal, Olivier Couet, Massimiliano Galli, Enrico Guiraud, Stephan Hageboeck, Sergey Linev, Pere Mato Vila, Lorenzo Moneta, Axel Naumann, Alja Mrak Tadel, Vincenzo Eduardo Padulano, Fons Rademakers, Oksana Shadura, Matevz Tadel, Enric Tejedor Saavedra, Xavier Valls Pla, Vassil Vassilev, Stefan Wunsch

Abstract: The high energy physics community is discussing where investment is needed to prepare software for the HL-LHC and its unprecedented challenges. The ROOT project is one of the central software players in high energy physics since decades. From its experience and expectations, the ROOT team has distilled a comprehensive set of areas that should see research and development in the context of data ana… ▽ More The high energy physics community is discussing where investment is needed to prepare software for the HL-LHC and its unprecedented challenges. The ROOT project is one of the central software players in high energy physics since decades. From its experience and expectations, the ROOT team has distilled a comprehensive set of areas that should see research and development in the context of data analysis software, for making best use of HL-LHC's physics potential. This work shows what these areas could be, why the ROOT team believes investing in them is needed, which gains are expected, and where related work is ongoing. It can serve as an indication for future research proposals and cooperations. △ Less

Submitted 4 May, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

arXiv:2003.13762 [pdf]

Using VERA to explain the impact of social distancing on the spread of COVID-19

Authors: William Broniec, Sungeun An, Spencer Rugaber, Ashok K. Goel

Abstract: COVID-19 continues to spread across the country and around the world. Current strategies for managing the spread of COVID-19 include social distancing. We present VERA, an interactive AI tool, that first enables users to specify conceptual models of the impact of social distancing on the spread of COVID-19. Then, VERA automatically spawns agent-based simulations from the conceptual models, and, gi… ▽ More COVID-19 continues to spread across the country and around the world. Current strategies for managing the spread of COVID-19 include social distancing. We present VERA, an interactive AI tool, that first enables users to specify conceptual models of the impact of social distancing on the spread of COVID-19. Then, VERA automatically spawns agent-based simulations from the conceptual models, and, given a data set, automatically fills in the values of the simulation parameters from the data. Next, the user can view the simulation results, and, if needed, revise the simulation parameters and run another experimental trial, or build an alternative conceptual model. We describe the use VERA to develop a SIR model for the spread of COVID-19 and its relationship with healthcare capacity. △ Less

Submitted 30 March, 2020; originally announced March 2020.

Comments: 6 figures, 1 table

arXiv:2003.11603 [pdf, other]

Graph Neural Networks for Particle Reconstruction in High Energy Physics detectors

Authors: Xiangyang Ju, Steven Farrell, Paolo Calafiura, Daniel Murnane, Prabhat, Lindsey Gray, Thomas Klijnsma, Kevin Pedro, Giuseppe Cerati, Jim Kowalkowski, Gabriel Perdue, Panagiotis Spentzouris, Nhan Tran, Jean-Roch Vlimant, Alexander Zlokapa, Joosep Pata, Maria Spiropulu, Sitong An, Adam Aurisano, V Hewes, Aristeidis Tsaris, Kazuhiro Terao, Tracy Usher

Abstract: Pattern recognition problems in high energy physics are notably different from traditional machine learning applications in computer vision. Reconstruction algorithms identify and measure the kinematic properties of particles produced in high energy collisions and recorded with complex detector systems. Two critical applications are the reconstruction of charged particle trajectories in tracking d… ▽ More Pattern recognition problems in high energy physics are notably different from traditional machine learning applications in computer vision. Reconstruction algorithms identify and measure the kinematic properties of particles produced in high energy collisions and recorded with complex detector systems. Two critical applications are the reconstruction of charged particle trajectories in tracking detectors and the reconstruction of particle showers in calorimeters. These two problems have unique challenges and characteristics, but both have high dimensionality, high degree of sparsity, and complex geometric layouts. Graph Neural Networks (GNNs) are a relatively new class of deep learning architectures which can deal with such data effectively, allowing scientists to incorporate domain knowledge in a graph structure and learn powerful representations leveraging that structure to identify patterns of interest. In this work we demonstrate the applicability of GNNs to these two diverse particle reconstruction problems. △ Less

Submitted 3 June, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

Comments: Presented at NeurIPS 2019 Workshop "Machine Learning and the Physical Sciences"

arXiv:2003.01300 [pdf, other]

doi 10.1109/IROS45743.2020.9340933

Few-Shot Relation Learning with Attention for EEG-based Motor Imagery Classification

Authors: Sion An, Soopil Kim, Philip Chikontwe, Sang Hyun Park

Abstract: Brain-Computer Interfaces (BCI) based on Electroencephalography (EEG) signals, in particular motor imagery (MI) data have received a lot of attention and show the potential towards the design of key technologies both in healthcare and other industries. MI data is generated when a subject imagines movement of limbs and can be used to aid rehabilitation as well as in autonomous driving scenarios. Th… ▽ More Brain-Computer Interfaces (BCI) based on Electroencephalography (EEG) signals, in particular motor imagery (MI) data have received a lot of attention and show the potential towards the design of key technologies both in healthcare and other industries. MI data is generated when a subject imagines movement of limbs and can be used to aid rehabilitation as well as in autonomous driving scenarios. Thus, classification of MI signals is vital for EEG-based BCI systems. Recently, MI EEG classification techniques using deep learning have shown improved performance over conventional techniques. However, due to inter-subject variability, the scarcity of unseen subject data, and low signal-to-noise ratio, extracting robust features and improving accuracy is still challenging. In this context, we propose a novel two-way few shot network that is able to efficiently learn how to learn representative features of unseen subject categories and how to classify them with limited MI EEG data. The pipeline includes an embedding module that learns feature representations from a set of samples, an attention mechanism for key signal feature discovery, and a relation module for final classification based on relation scores between a support set and a query signal. In addition to the unified learning of feature similarity and a few shot classifier, our method leads to emphasize informative features in support data relevant to the query data, which generalizes better on unseen subjects. For evaluation, we used the BCI competition IV 2b dataset and achieved an 9.3% accuracy improvement in the 20-shot classification task with state-of-the-art performance. Experimental results demonstrate the effectiveness of employing attention and the overall generality of our method. △ Less

Submitted 19 August, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

Comments: 6 pages. This paper was accepted at IROS2020

ACM Class: I.2.6

Journal ref: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

arXiv:2001.00121 [pdf]

A Freeform Dielectric Metasurface Modeling Approach Based on Deep Neural Networks

Authors: Sensong An, Bowen Zheng, Mikhail Y. Shalaginov, Hong Tang, Hang Li, Li Zhou, Jun Ding, Anuradha Murthy Agarwal, Clara Rivero-Baleine, Myungkoo Kang, Kathleen A. Richardson, Tian Gu, Juejun Hu, Clayton Fowler, Hualiang Zhang

Abstract: Metasurfaces have shown promising potentials in sha** optical wavefronts while remaining compact compared to bulky geometric optics devices. Design of meta-atoms, the fundamental building blocks of metasurfaces, relies on trial-and-error method to achieve target electromagnetic responses. This process includes the characterization of an enormous amount of different meta-atom designs with differe… ▽ More Metasurfaces have shown promising potentials in sha** optical wavefronts while remaining compact compared to bulky geometric optics devices. Design of meta-atoms, the fundamental building blocks of metasurfaces, relies on trial-and-error method to achieve target electromagnetic responses. This process includes the characterization of an enormous amount of different meta-atom designs with different physical and geometric parameters, which normally demands huge computational resources. In this paper, a deep learning-based metasurface/meta-atom modeling approach is introduced to significantly reduce the characterization time while maintaining accuracy. Based on a convolutional neural network (CNN) structure, the proposed deep learning network is able to model meta-atoms with free-form 2D patterns and different lattice sizes, material refractive indexes and thicknesses. Moreover, the presented approach features the capability to predict meta-atoms' wide spectrum responses in the timescale of milliseconds, which makes it attractive for applications such as fast meta-atom/metasurface on-demand designs and optimizations. △ Less

Submitted 31 December, 2019; originally announced January 2020.

arXiv:1911.12970 [pdf]

doi 10.1038/s41467-021-21440-9

Reconfigurable all-dielectric metalens with diffraction limited performance

Authors: Mikhail Y. Shalaginov, Sensong An, Yifei Zhang, Fan Yang, Peter Su, Vladimir Liberman, Jeffrey B. Chou, Christopher M. Roberts, Myungkoo Kang, Carlos Rios, Qingyang Du, Clayton Fowler, Anuradha Agarwal, Kathleen Richardson, Clara Rivero-Baleine, Hualiang Zhang, Juejun Hu, Tian Gu

Abstract: Active metasurfaces, whose optical properties can be modulated post-fabrication, have emerged as an intensively explored field in recent years. The efforts to date, however, still face major performance limitations in tuning range, optical quality, and efficiency especially for non mechanical actuation mechanisms. In this paper, we introduce an active metasurface platform combining phase tuning co… ▽ More Active metasurfaces, whose optical properties can be modulated post-fabrication, have emerged as an intensively explored field in recent years. The efforts to date, however, still face major performance limitations in tuning range, optical quality, and efficiency especially for non mechanical actuation mechanisms. In this paper, we introduce an active metasurface platform combining phase tuning covering the full 2$π$ range and diffraction-limited performance using an all-dielectric, low-loss architecture based on optical phase change materials (O-PCMs). We present a generic design principle enabling switching of metasurfaces between two arbitrary phase profiles and propose a new figure-of-merit (FOM) tailored for active meta-optics. We implement the approach to realize a high-performance varifocal metalens operating at 5.2 $μ$m wavelength. The metalens is constructed using Ge2Sb2Se4Te1 (GSST), an O-PCM with a large refractive index contrast ($Δ$ n > 1) and unique broadband low-loss characteristics in both amorphous and crystalline states. The reconfigurable metalens features focusing efficiencies above 20% at both states for linearly polarized light and a record large switching contrast ratio of 29.5 dB. We further validated aberration-free imaging using the metalens at both optical states, which represents the first experimental demonstration of a non-mechanical active metalens with diffraction-limited performance. △ Less

Submitted 10 December, 2019; v1 submitted 29 November, 2019; originally announced November 2019.

arXiv:1911.11782 [pdf, other]

doi 10.3847/1538-4357/ab535f

Living with Neighbors. II. Statistical Analysis of Flybys and Mergers of Dark Matter Halos in Cosmological Simulations

Authors: Sung-Ho An, Juhan Kim, Jun-Sung Moon, Suk-** Yoon

Abstract: We present a statistical analysis of flybys of dark matter halos compared to mergers using cosmological $N$-body simulations. We mainly focus on gravitationally interacting target halos with mass of $10^{10.8}-10^{13.0}h^{-1}M_{\odot}$, and their neighbors are counted only when the mass ratio is 1:3$-$3:1 and the distance is less than the sum of the virial radii of target and neighbor. The neighbo… ▽ More We present a statistical analysis of flybys of dark matter halos compared to mergers using cosmological $N$-body simulations. We mainly focus on gravitationally interacting target halos with mass of $10^{10.8}-10^{13.0}h^{-1}M_{\odot}$, and their neighbors are counted only when the mass ratio is 1:3$-$3:1 and the distance is less than the sum of the virial radii of target and neighbor. The neighbors are divided into the flyby or merger samples if the pair's total energy is greater or smaller, respectively, than the capture criterion with consideration of dynamical friction. The main results are as follows: (a) The flyby fraction increases by up to a factor of 50 with decreasing halo mass and by up to a factor of 400 with increasing large-scale density, while the merger fraction does not show any significant dependencies on these two parameters; (b) The redshift evolution of the flyby fraction is twofold, increasing with redshift at $0<z<1$ and remaining constant at $z>1$, while the merger fraction increases monotonically with redshift at $z=0\sim4$; (c) The multiple interactions with two or more neighbors are on average flyby-dominated, and their fraction has a mass and environment dependence similar to that for the flyby fraction; (d) Given that flybys substantially outnumber mergers toward $z=0$ (by a factor of five) and the multiple interactions are flyby-dominated, the flyby's contribution to galactic evolution is stronger than ever at the present epoch, especially for less massive halos and in the higher density environment. We propose a scenario that connects the evolution of the flyby and merger fractions to the hierarchical structure formation process. △ Less

Submitted 26 November, 2019; originally announced November 2019.

Comments: 21 pages, 12 figures, and 1 table, accepted for publication in ApJ

arXiv:1911.10841 [pdf, other]

doi 10.1103/PhysRevLett.124.110501

High-rate, high-fidelity entanglement of qubits across an elementary quantum network

Authors: L J Stephenson, D P Nadlinger, B C Nichol, S An, P Drmota, T G Ballance, K Thirumalai, J F Goodwin, D M Lucas, C J Ballance

Abstract: We demonstrate remote entanglement of trapped-ion qubits via a quantum-optical fiber link with fidelity and rate approaching those of local operations. Two ${}^{88}$Sr${}^{+}$ qubits are entangled via the polarization degree of freedom of two photons which are coupled by high-numerical-aperture lenses into single-mode optical fibers and interfere on a beamsplitter. A novel geometry allows high-eff… ▽ More We demonstrate remote entanglement of trapped-ion qubits via a quantum-optical fiber link with fidelity and rate approaching those of local operations. Two ${}^{88}$Sr${}^{+}$ qubits are entangled via the polarization degree of freedom of two photons which are coupled by high-numerical-aperture lenses into single-mode optical fibers and interfere on a beamsplitter. A novel geometry allows high-efficiency photon collection while maintaining unit fidelity for ion-photon entanglement. We generate remote Bell pairs with fidelity $F=0.940(5)$ at an average rate $182\,\mathrm{s}^{-1}$ (success probability $2.18\times10^{-4}$). △ Less

Submitted 13 May, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

Comments: v2 updated to include responses to reviewers, as published in PRL

Journal ref: Phys. Rev. Lett. 124, 110501 (2020)

arXiv:1911.10752 [pdf, other]

Fast and Incremental Loop Closure Detection Using Proximity Graphs

Authors: Shan An, Guangfu Che, Fangru Zhou, Xianglong Liu, Xin Ma, Yu Chen

Abstract: Visual loop closure detection, which can be considered as an image retrieval task, is an important problem in SLAM (Simultaneous Localization and Map**) systems. The frequently used bag-of-words (BoW) models can achieve high precision and moderate recall. However, the requirement for lower time costs and fewer memory costs for mobile robot applications is not well satisfied. In this paper, we pr… ▽ More Visual loop closure detection, which can be considered as an image retrieval task, is an important problem in SLAM (Simultaneous Localization and Map**) systems. The frequently used bag-of-words (BoW) models can achieve high precision and moderate recall. However, the requirement for lower time costs and fewer memory costs for mobile robot applications is not well satisfied. In this paper, we propose a novel loop closure detection framework titled `FILD' (Fast and Incremental Loop closure Detection), which focuses on an on-line and incremental graph vocabulary construction for fast loop closure detection. The global and local features of frames are extracted using the Convolutional Neural Networks (CNN) and SURF on the GPU, which guarantee extremely fast extraction speeds. The graph vocabulary construction is based on one type of proximity graph, named Hierarchical Navigable Small World (HNSW) graphs, which is modified to adapt to this specific application. In addition, this process is coupled with a novel strategy for real-time geometrical verification, which only keeps binary hash codes and significantly saves on memory usage. Extensive experiments on several publicly available datasets show that the proposed approach can achieve fairly good recall at 100\% precision compared to other state-of-the-art methods. The source code can be downloaded at https://github.com/AnshanTJU/FILD for further studies. △ Less

Submitted 25 November, 2019; originally announced November 2019.

Comments: 8 pages, 6 figures, IROS 2019

arXiv:1910.10300 [pdf, other]

Prioritized Inverse Kinematics: Desired Task Trajectories in Nonsingular Task Spaces

Authors: Sang-ik An, Dongheui Lee

Abstract: A prioritized inverse kinematics (PIK) solution can be considered as a (regulation or output tracking) control law of a dynamical system with prioritized multiple outputs. We propose a method that guarantees that a joint trajectory generated from a class of PIK solutions exists uniquely in a nonsingular configuration space. We start by assuming that desired task trajectories stay in nonsingular ta… ▽ More A prioritized inverse kinematics (PIK) solution can be considered as a (regulation or output tracking) control law of a dynamical system with prioritized multiple outputs. We propose a method that guarantees that a joint trajectory generated from a class of PIK solutions exists uniquely in a nonsingular configuration space. We start by assuming that desired task trajectories stay in nonsingular task spaces and find conditions for task trajectories to stay in a neighborhood of desired task trajectories in which we can guarantee existence and uniqueness of a joint trajectory in a nonsingular configuration space. Based on this result, we find a sufficient condition for task convergence and analyze various stability notions such as stability, uniform stability, uniform asymptotic stability, and exponential stability in both continuous and discrete times. We discuss why the number of tasks is limited in discrete time and show how preconditioning can be used in order to overcome this limitation. △ Less

Submitted 22 October, 2019; originally announced October 2019.

Comments: 16 pages

arXiv:1910.07029 [pdf, other]

End-to-end particle and event identification at the Large Hadron Collider with CMS Open Data

Authors: John Alison, Sitong An, Michael Andrews, Patrick Bryant, Bjorn Burkle, Sergei Gleyzer, Ulrich Heintz, Meenakshi Narain, Manfred Paulini, Barnabas Poczos, Emanuele Usai

Abstract: From particle identification to the discovery of the Higgs boson, deep learning algorithms have become an increasingly important tool for data analysis at the Large Hadron Collider (LHC). We present an innovative end-to-end deep learning approach for jet identification at the Compact Muon Solenoid (CMS) experiment at the LHC. The method combines deep neural networks with low-level detector informa… ▽ More From particle identification to the discovery of the Higgs boson, deep learning algorithms have become an increasingly important tool for data analysis at the Large Hadron Collider (LHC). We present an innovative end-to-end deep learning approach for jet identification at the Compact Muon Solenoid (CMS) experiment at the LHC. The method combines deep neural networks with low-level detector information, such as calorimeter energy deposits and tracking information, to build a discriminator to identify different particle species. Using two physics examples as references: electron vs. photon discrimination and quark vs. gluon discrimination, we demonstrate the performance of the end-to-end approach on simulated events with full detector geometry as available in the CMS Open Data. We also offer insights into the importance of the information extracted from various sub-detectors and describe how end-to-end techniques can be extended to event-level classification using information from the whole CMS detector. △ Less

Submitted 15 October, 2019; originally announced October 2019.

Comments: Talk presented at the 2019 Meeting of the Division of Particles and Fields of the American Physical Society (DPF2019), July 29 - August 2, 2019, Northeastern University, Boston, C1907293

Showing 101–150 of 213 results for author: An, S