Contrastive Video-Language Learning with Fine-grained Frame Sampling

Authors: Zixu Wang, Yujie Zhong, Yishu Miao, Lin Ma, Lucia Specia

Abstract: Despite recent progress in video and language representation learning, the weak or sparse correspondence between the two modalities remains a bottleneck in the area. Most video-language models are trained via pair-level loss to predict whether a pair of video and text is aligned. However, even in paired video-text segments, only a subset of the frames are semantically relevant to the corresponding text, with the remainder representing noise; the ratio of noisy frames is higher for longer videos. We propose FineCo (Fine-grained Contrastive Loss for Frame Sampling), an approach to better learn video and language representations with a fine-grained contrastive objective operating on video frames. It helps distil a video by selecting the frames that are semantically equivalent to the text, improving cross-modal correspondence. Building on the well-established VideoCLIP model as a starting point, FineCo achieves state-of-the-art performance on YouCookII, a text-video retrieval benchmark with long videos. FineCo also achieves competitive results on text-video retrieval (MSR-VTT), and video question answering datasets (MSR-VTT QA and MSR-VTT MC) with shorter videos.

Submitted 10 October, 2022; originally announced October 2022.

Comments: AACL-IJCNLP 2022

arXiv:2210.03945 [pdf, other]

Understanding HTML with Large Language Models

Authors: Izzeddin Gur, Ofir Nachum, Yingjie Miao, Mustafa Safdari, Austin Huang, Aakanksha Chowdhery, Sharan Narang, Noah Fiedel, Aleksandra Faust

Abstract: Large language models (LLMs) have shown exceptional performance on a variety of natural language tasks. Yet, their capabilities for HTML understanding -- i.e., parsing the raw HTML of a webpage, with applications to automation of web-based tasks, crawling, and browser-assisted retrieval -- have not been fully explored. We contribute HTML understanding models (fine-tuned LLMs) and an in-depth analysis of their capabilities under three tasks: (i) Semantic Classification of HTML elements, (ii) Description Generation for HTML inputs, and (iii) Autonomous Web Navigation of HTML pages. While previous work has developed dedicated architectures and training procedures for HTML understanding, we show that LLMs pretrained on standard natural language corpora transfer remarkably well to HTML understanding tasks. For instance, fine-tuned LLMs are 12% more accurate at semantic classification compared to models trained exclusively on the task dataset. Moreover, when fine-tuned on data from the MiniWoB benchmark, LLMs successfully complete 50% more tasks using 192x less data compared to the previous best supervised model. Out of the LLMs we evaluate, we show evidence that T5-based models are ideal due to their bidirectional encoder-decoder architecture. To promote further research on LLMs for HTML understanding, we create and open-source a large-scale HTML dataset distilled and auto-labeled from CommonCrawl.

Submitted 19 May, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

arXiv:2209.13130 [pdf, other]

3D Scene Flow Estimation on Pseudo-LiDAR: Bridging the Gap on Estimating Point Motion

Authors: Chaokang Jiang, Guangming Wang, Yanzi Miao, Hesheng Wang

Abstract: 3D scene flow characterizes how the points at the current time flow to the next time in the 3D Euclidean space, which possesses the capacity to infer autonomously the non-rigid motion of all objects in the scene. The previous methods for estimating scene flow from images have limitations, which split the holistic nature of 3D scene flow by estimating optical flow and disparity separately. Learning 3D scene flow from point clouds also faces the difficulties of the gap between synthesized and real data and the sparsity of LiDAR point clouds. In this paper, the generated dense depth map is utilized to obtain explicit 3D coordinates, which achieves direct learning of 3D scene flow from 2D images. The stability of the predicted scene flow is improved by introducing the dense nature of 2D pixels into the 3D space. Outliers in the generated 3D point cloud are removed by statistical methods to weaken the impact of noisy points on the 3D scene flow estimation task. Disparity consistency loss is proposed to achieve more effective unsupervised learning of 3D scene flow. The proposed method of self-supervised learning of 3D scene flow on real-world images is compared with a variety of methods for learning on the synthesized dataset and learning on LiDAR point clouds. The comparisons of multiple scene flow metrics are shown to demonstrate the effectiveness and superiority of introducing pseudo-LiDAR point cloud to scene flow estimation.

Submitted 26 September, 2022; originally announced September 2022.

Comments: 9 pages, 5 figures; This paper has been accepted by IEEE Transactions on Industrial Informatics

arXiv:2209.08847 [pdf, other]

Multibeam Sparse Tiled Planar Array for Joint Communication and Sensing

Authors: Hadi Alidoustaghdam, André Kokkeler, Yang Miao

Abstract: Multibeam analog arrays have been proposed for millimeter-wave joint communication and sensing (JCAS). We study multibeam planar arrays for JCAS, providing time division duplex communication and full-duplex sensing with steerable beams. In order to have a large aperture with a narrow beamwidth in the radiation pattern, we propose to design a sparse tiled planar array (STPA) aperture with an affordable number of phase shifters. The modular tiling and sparse design of the array are non-convex optimization problems; however, we exploit the fact that the more irregular the antenna array geometry, the lower the side lobe level. We propose to first solve the optimization by the maximum entropy in the phase centers of tiles in the array; then we perform sparse subarray selection leveraging the geometry of the sunflower array. While maintaining the same spectral efficiency in the communication link as a conventional uniform planar array (CUPA), the STPA improves angle of arrival estimation when the line-of-sight path is dominant, e.g., the STPA with 125 elements distinguishes two adjacent targets with 20$^\circ$ difference in the proximity of boresight whereas the CUPA cannot. Moreover, the STPA has a 40$\%$ shorter blockage time compared to the CUPA when a blocker moves in the elevation angles.

Submitted 19 September, 2022; originally announced September 2022.

Comments: Manuscript submitted to IEEE Trans. Wireless Communication on August 25, 2022. 27 pages, 16 figures

arXiv:2209.07419 [pdf, other]

FFPA-Net: Efficient Feature Fusion with Projection Awareness for 3D Object Detection

Authors: Chaokang Jiang, Guangming Wang, Jinxing Wu, Yanzi Miao, Hesheng Wang

Abstract: Promising complementarity exists between the texture features of color images and the geometric information of LiDAR point clouds. However, many challenges remain for efficient and robust feature fusion in the field of 3D object detection. In this paper, first, unstructured 3D point clouds are filled in the 2D plane and 3D point cloud features are extracted faster using projection-aware convolution layers. Further, the corresponding indexes between different sensor signals are established in advance in the data preprocessing, which enables faster cross-modal feature fusion. To address the misalignment between LiDAR points and image pixels, two new plug-and-play fusion modules, LiCamFuse and BiLiCamFuse, are proposed. In LiCamFuse, soft query weights that perceive the Euclidean distance of bimodal features are proposed. In BiLiCamFuse, a fusion module with dual attention is proposed to deeply correlate the geometric and textural features of the scene. The quantitative results on the KITTI dataset demonstrate that the proposed method achieves better feature-level fusion. In addition, the proposed network shows a shorter running time compared to existing methods.

Submitted 15 September, 2022; originally announced September 2022.

Comments: 7 pages, 4 figures; under review

arXiv:2209.01567 [pdf, other]

Pseudo-LiDAR for Visual Odometry

Authors: Huiying Deng, Guangming Wang, Zhiheng Feng, Chaokang Jiang, Xinrui Wu, Yanzi Miao, Hesheng Wang

Abstract: In the existing methods, LiDAR odometry shows superior performance, but visual odometry is still widely used for its price advantage. Conventionally, the task of visual odometry mainly relies on the input of continuous images. However, it is very complicated for the odometry network to learn the epipolar geometry information provided by the images. In this paper, the concept of pseudo-LiDAR is introduced into the odometry to solve this problem. The pseudo-LiDAR point cloud back-projects the depth map generated by the image into the 3D point cloud, which changes the way of image representation. Compared with the stereo images, the pseudo-LiDAR point cloud generated by the stereo matching network can get the explicit 3D coordinates. Since the 6 Degrees of Freedom (DoF) pose transformation occurs in 3D space, the 3D structure information provided by the pseudo-LiDAR point cloud is more direct than the image. Compared with sparse LiDAR, the pseudo-LiDAR has a denser point cloud. In order to make full use of the rich point cloud information provided by the pseudo-LiDAR, a projection-aware dense odometry pipeline is adopted. Most previous LiDAR-based algorithms sampled 8192 points from the point cloud as input to the odometry network. The projection-aware dense odometry pipeline takes all the pseudo-LiDAR point clouds generated from the images except for the error points as the input to the network. While making full use of the 3D geometric information in the images, the semantic information in the images is also used in the odometry task. The fusion of 2D-3D is achieved in an image-only based odometry. Experiments on the KITTI dataset prove the effectiveness of our method. To the best of our knowledge, this is the first visual odometry method using pseudo-LiDAR.

Submitted 4 September, 2022; originally announced September 2022.

Comments: 8 pages, 7 figures

arXiv:2208.10863 [pdf, ps, other]

Contact-Free Multi-Target Tracking Using Distributed Massive MIMO-OFDM Communication System: Prototype and Analysis

Authors: Chenglong Li, Sibren De Bast, Yang Miao, Emmeric Tanghe, Sofie Pollin, Wout Joseph

Abstract: Wireless-based human activity recognition has become an essential technology that enables contact-free human-machine and human-environment interactions. In this paper, we consider contact-free multi-target tracking (MTT) based on available communication systems. A radar-like prototype is built upon a sub-6 GHz distributed massive multiple-input and multiple-output (MIMO) orthogonal frequency-division multiplexing communication system. Specifically, the raw channel state information (CSI) is calibrated in the frequency and antenna domain before being used for tracking. Then the targeted CSIs reflected or scattered from the moving pedestrians are extracted. To evade the complex association problem of distributed massive MIMO-based MTT, we propose to use a complex Bayesian compressive sensing (CBCS) algorithm to estimate the targets' locations based on the extracted target-of-interest CSI signal directly. The estimated locations from CBCS are fed to a Gaussian mixture probability hypothesis density filter for tracking. A multi-pedestrian tracking experiment is conducted in a room of size 6.5 m$\times$10 m to evaluate the performance of the proposed algorithm. According to experimental results, we achieve 75th and 95th percentile accuracy of 12.7 cm and 18.2 cm for single-person tracking and 28.9 cm and 45.7 cm for multi-person tracking, respectively. Furthermore, the proposed algorithm performs tracking in real time, which is promising for practical MTT use cases.

Submitted 1 January, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

arXiv:2208.06870 [pdf, other]

Guard Beam: Protecting mmWave Communication through In-Band Early Blockage Prediction

Authors: Rizqi Hersyandika, Yang Miao, Sofie Pollin

Abstract: Human blockage is one of the main challenges for mmWave communication networks in dynamic environments. The shadowing by a human body results in significant received power degradation and could occur abruptly and frequently. A shadowing period of a hundred milliseconds might interrupt the communication and cause significant data loss, considering the huge bandwidth utilized in mmWave communications. An even longer shadowing period might cause a long-duration link outage. Therefore, a blockage prediction mechanism is needed to detect the moving blocker within the vicinity of mmWave links. By detecting the potential blockage as early as possible, a user equipment can anticipate by establishing a new connection and performing beam training with an alternative base station before shadowing happens. This paper proposes an early moving blocker detection mechanism by leveraging an extra guard beam to protect the main communication beam. The guard beam is intended to sense the environment by expanding the field of view of a base station. The blockage can be detected early by observing received signal fluctuation resulting from the blocker's presence within the field of view. We derive a channel model for the pre-shadowing event, design a moving blockage detection algorithm for the guard beam, and evaluate the performance of the guard beam theoretically and experimentally based on the measurement campaign using our mmWave testbed. Our results demonstrate that the guard beam can extend the detection range and predict the blockage up to 360 ms before the shadowing occurs.

Submitted 14 August, 2022; originally announced August 2022.

arXiv:2208.01929 [pdf]

Physical essence of propagable fractional-strength optical vortices in free space

Authors: Xiaoyu Weng, Yu Miao, Yang Li, Xiangmei Dong, Xiumin Gao, Songlin Zhuang

Abstract: Fractional-order vector vortex beams were recently demonstrated to be new carriers of fractional-strength optical vortices. However, why can those new vortex beams formed by the combination of both unstable states propagate stably in free space? Here, we solve this scientific problem by revealing the physical essence of propagable fractional-strength optical vortices in free space. Three new understandings regarding those peculiar vortex beams are therefore proposed, namely Abbe diffraction limit, phase evolution of vortex beam, and phase binary time vector property. For the first one, owing to Abbe diffraction limit, the inherent polarization modes are intertwined together, thereby maintaining the entire peculiar vortex beams in free space. For the second one, we demonstrate the phase evolution of vortex beam, which is the physical reason of polarization rotation of fractional-order VVBs. For the third one, the phase is not merely a scalar attribute of light beam, but manifests a binary time vector property. This work provides entirely different physical viewpoints on the phase of vortex beam and Abbe diffraction limit, which may deepen our knowledge on the behavior of light beam in classical optics.

Submitted 3 August, 2022; originally announced August 2022.

arXiv:2207.03545 [pdf, ps, other]

A new type of results on probabilities of moderate deviations for i.i.d. random variables

Authors: Deli Li, Yu Miao, Yongcheng Qi

Abstract: Let $\{X, X_{n}; n \geq 1\}$ be a sequence of i.i.d. non-degenerate real-valued random variables with $\mathbb{E}X^{2} < \infty$. Let $S_{n} = \sum_{i=1}^{n} X_{i}$, $n \geq 1$. Let $g(\cdot): [0, \infty) \rightarrow [0, \infty)$ be a nondecreasing regularly varying function with index $ρ \geq 0$ and $\lim_{t \rightarrow \infty} g(t) = \infty$. Let $μ = \mathbb{E}X$ and $σ^{2} = \mathbb{E}(X - μ)^{2}$. In this paper, we obtain precise asymptotic estimates for probabilities of moderate deviations by showing that, for all $x > 0$, \[ \limsup_{n \rightarrow \infty} \frac{\log \mathbb{P}\left(S_{n} - n μ > x \sqrt{n g(\log n)} \right)}{g(\log n)} = - \left(\frac{x^{2}}{2σ^{2}} \wedge \frac{\overline{λ}_{1}}{2^{ρ}} \right), \] \[ \liminf_{n \rightarrow \infty} \frac{\log \mathbb{P}\left(S_{n} - n μ > x \sqrt{n g(\log n)} \right)}{g(\log n)} = - \left(\frac{x^{2}}{2σ^{2}} \wedge \frac{\underline{λ}_{1}}{2^{ρ}} \right), \] \[ \limsup_{n \rightarrow \infty} \frac{\log \mathbb{P}\left(S_{n} - n μ < -x \sqrt{n g(\log n)} \right)}{g(\log n)} = - \left(\frac{x^{2}}{2σ^{2}} \wedge \frac{\overline{λ}_{2}}{2^{ρ}} \right), \] and \[ \liminf_{n \rightarrow \infty} \frac{\log \mathbb{P}\left(S_{n} - n μ < -x \sqrt{n g(\log n)} \right)}{g(\log n)} = - \left(\frac{x^{2}}{2σ^{2}} \wedge \frac{\underline{λ}_{2}}{2^{ρ}} \right), \] where $\overline{λ}_{1}$ and $\underline{λ}_{1}$ are determined by the asymptotic behavior of $\mathbb{P}(X > t)$ and $\overline{λ}_{2}$ and $\underline{λ}_{2}$ are determined by the asymptotic behavior of $\mathbb{P}(X < -t)$. Unlike known results in the literature, the moderate deviation results established in this paper depend on both the variance and the asymptotic behavior of the tail distribution of $X$.

Submitted 7 July, 2022; originally announced July 2022.

MSC Class: Primary: 60F10; Secondary 60B12; 60F05; 60G50

arXiv:2206.15142 [pdf, other]

doi 10.21468/SciPostPhys.16.3.078

The Floquet Baxterisation

Authors: Yuan Miao, Vladimir Gritsev, Denis V. Kurlov

Abstract: Quantum integrability has proven to be a useful tool to study quantum many-body systems out of equilibrium. In this paper we construct a generic framework for integrable quantum circuits through the procedure of Floquet Baxterisation. The integrability is guaranteed by establishing a connection between Floquet evolution operators and inhomogeneous transfer matrices obtained from the Yang-Baxter relations. This allows us to construct integrable Floquet evolution operators with arbitrary depths and various boundary conditions. Furthermore, we focus on the example related to the staggered 6-vertex model. In the scaling limit we establish a connection of this Floquet protocol with a non-rational conformal field theory. Employing the properties of the underlying affine Temperley--Lieb algebraic structure, we demonstrate the dynamical anti-unitary symmetry breaking in the easy-plane regime. We also give an overview of integrability-related quantum circuits, highlighting future research directions.

Submitted 8 January, 2024; v1 submitted 30 June, 2022; originally announced June 2022.

Comments: 45 pages, 16 figures

Journal ref: SciPost Phys. 16, 078 (2024)

arXiv:2206.12189 [pdf, other]

doi 10.1088/1674-1137/ac8652

$Λ_b \to Λ_c$ Form Factors from QCD Light-Cone Sum Rules

Authors: Yan Miao, Hui Deng, Ke-Sheng Huang, Jing Gao, Yue-Long Shen

Abstract: In this work, we calculate the transition form factors of $Λ_b$ decaying into $Λ_c$ within the framework of light-cone sum rules with the distribution amplitudes (DAs) of the $Λ_b$ baryon. In the hadronic representation of the correlation function, we have isolated both the $Λ_c$ and the $Λ_c^*$ states so that the $Λ_b \rightarrow Λ_c$ form factors can be obtained without ambiguity. We investigate the P-type and A-type currents to interpolate the light baryons for a comparison, since the interpolation current for the baryon state is not unique. We also employ three parametrization models for the DAs of $Λ_b$ in the numerical calculation. We present the numerical predictions on the $Λ_b \rightarrow Λ_c$ form factors and the branching fractions, the averaged forward-backward asymmetry, the averaged final hadron polarization and the averaged lepton polarization of the $Λ_b \to Λ_c \ell μ$ decays, as well as the ratio of branching ratios $R_{Λ_c}$; the predicted $R_{Λ_c}$ can be consistent with the LHCb data.

Submitted 24 June, 2022; originally announced June 2022.

Comments: 22 pages, 2 figures

arXiv:2206.08694 [pdf, other]

doi 10.1088/1674-1137/acc1cd

Regular black holes with improved energy conditions and their analogues in fluids

Authors: Chen Lan, Yan-Gang Miao, Yi-Xiong Zang

Abstract: On the premise of the importance of energy conditions for regular black holes, we propose a method to remedy those models that break the dominant energy condition, e.g., the Bardeen and Hayward black holes. We modify the metrics but ensure their regularity at the same time, so that the weak, null, and dominant energy conditions are satisfied, with the exception of the strong energy condition. Likewise, we prove a no-go theorem for conformally related regular black holes, which states that the four energy conditions can never be met in this class of black holes. In order to seek evidence for distinguishing regular black holes from singular black holes, we resort to analogue gravity and regard it as a tool to mimic realistic regular black holes in a fluid. The equations of state for the fluid are solved via an asymptotic analysis associated with a numerical method, which provides a modus operandi for experimental observations, in particular, the conditions under which one can simulate realistic regular black holes in the fluid.

Submitted 13 April, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

Comments: 38 pages, 30 figures, published version in Chinese Physics C

Journal ref: Chin. Phys. C 47, no. 5, 052001 (2023)

arXiv:2206.05473 [pdf, other]

doi 10.18653/v1/2022.ecnlp-1.7

Comparative Snippet Generation

Authors: Saurabh Jain, Yisong Miao, Min-Yen Kan

Abstract: We model product reviews to generate comparative responses consisting of positive and negative experiences regarding the product. Specifically, we generate a single-sentence, comparative response from a given positive and a negative opinion. We contribute the first dataset for this task of Comparative Snippet Generation from contrasting opinions regarding a product, and a performance analysis of a pre-trained BERT model to generate such snippets.

Submitted 11 June, 2022; originally announced June 2022.

Journal ref: In Proceedings of The Fifth Workshop on e-Commerce and NLP (ECNLP 5), pages 49-57, Dublin, Ireland. Association for Computational Linguistics (2022)

arXiv:2206.00217 [pdf, ps, other]

Resilience in Industrial Internet of Things Systems: A Communication Perspective

Authors: Hao Wu, Yifan Miao, Peng Zhang, Yang Tian, Hui Tian

Abstract: Industrial Internet of Things is an ultra-large-scale system that is much more sophisticated and fragile than conventional industrial platforms. The effective management of such a system relies heavily on the resilience of the network, especially the communication part. Imperative as resilient communication is, it has not received enough attention in the literature and a standardized framework is still missing. With this in mind, this paper intends to provide a systematic overview of resilience in IIoT from a communication perspective, aiming to answer the questions of why we need it, what it is, how to enhance it, and where it can be applied. Specifically, we emphasize the urgency of resilience studies via examining existing literature and analyzing malfunction data from a real satellite communication system. Resilience-related concepts and metrics, together with standardization efforts, are then summarized and discussed, presenting a basic framework for analyzing the resilience of the system before, during, and after disruptive events. On the basis of the framework, key resilience concerns associated with the design, deployment, and operation of IIoT are briefly described to shed light on the methods for resilience enhancement. Promising resilient applications in different IIoT sectors are also introduced to highlight the opportunities and challenges in practical implementations.

Submitted 31 May, 2022; originally announced June 2022.

arXiv:2205.11719 [pdf, other]

Safe, Occlusion-Aware Manipulation for Online Object Reconstruction in Confined Spaces

Authors: Yinglong Miao, Rui Wang, Kostas Bekris

Abstract: Recent work in robotic manipulation focuses on object retrieval in cluttered spaces under occlusion. Nevertheless, the majority of efforts lack an analysis of conditions for the completeness of the approaches, or the methods apply only when objects can be removed from the workspace. This work formulates the general, occlusion-aware manipulation task, and focuses on safe object reconstruction in a confined space with in-place rearrangement. It proposes a framework that ensures safety with completeness guarantees. Furthermore, an algorithm, which is an instantiation of this abstract framework for monotone instances, is developed and evaluated empirically by comparing against a random and a greedy baseline on randomly generated experiments in simulation. Even for cluttered scenes with realistic objects, the proposed algorithm significantly outperforms the baselines and maintains a high success rate across experimental conditions.

Submitted 20 September, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

arXiv:2205.05675 [pdf, other]

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

Authors: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, et al. (86 additional authors not shown)

Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29.00dB on DIV2K validation set. IMDN is set as the baseline for efficiency measurement. The challenge had 3 tracks including the main track (runtime), sub-track one (model complexity), and sub-track two (overall performance). In the main track, the practical runtime performance of the submissions was evaluated. The ranking of the teams was determined directly by the absolute value of the average runtime on the validation set and test set. In sub-track one, the number of parameters and FLOPs were considered, and the individual rankings of the two metrics were summed up to determine a final ranking in this track. In sub-track two, all of the five metrics mentioned in the description of the challenge including runtime, parameter count, FLOPs, activations, and memory consumption were considered. Similar to sub-track one, the rankings of five metrics were summed up to determine a final ranking. The challenge had 303 registered participants, and 43 teams made valid submissions. They gauge the state-of-the-art in efficient single image super-resolution.

Submitted 11 May, 2022; originally announced May 2022.

Comments: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR

arXiv:2205.00047 [pdf, other]

Logically Consistent Adversarial Attacks for Soft Theorem Provers

Authors: Alexander Gaskell, Yishu Miao, Lucia Specia, Francesca Toni

Abstract: Recent efforts within the AI community have yielded impressive results towards "soft theorem proving" over natural language sentences using language models. We propose a novel, generative adversarial framework for probing and improving these models' reasoning capabilities. Adversarial attacks in this domain suffer from the logical inconsistency problem, whereby perturbations to the input may alter the label. Our Logically consistent AdVersarial Attacker, LAVA, addresses this by combining a structured generative process with a symbolic solver, guaranteeing logical consistency. Our framework successfully generates adversarial attacks and identifies global weaknesses common across multiple target models. Our analyses reveal naive heuristics and vulnerabilities in these models' reasoning capabilities, exposing an incomplete grasp of logical deduction under logic programs. Finally, in addition to effective probing of these models, we show that training on the generated samples improves the target model's performance.

Submitted 29 April, 2022; originally announced May 2022.

Comments: IJCAI-ECAI 2022

arXiv:2204.04292 [pdf, other]

Evolving Pareto-Optimal Actor-Critic Algorithms for Generalizability and Stability

Authors: Juan Jose Garau-Luis, Yingjie Miao, John D. Co-Reyes, Aaron Parisi, Jie Tan, Esteban Real, Aleksandra Faust

Abstract: Generalizability and stability are two key objectives for operating reinforcement learning (RL) agents in the real world. Designing RL algorithms that optimize these objectives can be a costly and painstaking process. This paper presents MetaPG, an evolutionary method for automated design of actor-critic loss functions. MetaPG explicitly optimizes for generalizability and performance, and implicitly optimizes the stability of both metrics. We initialize our loss function population with Soft Actor-Critic (SAC) and perform multi-objective optimization using fitness metrics encoding single-task performance, zero-shot generalizability to unseen environment configurations, and stability across independent runs with different random seeds. On a set of continuous control tasks from the Real-World RL Benchmark Suite, we find that our method, using a single environment during evolution, evolves algorithms that improve upon SAC's performance and generalizability by 4% and 20%, respectively, and reduce instability up to 67%. Then, we scale up to more complex environments from the Brax physics simulator and replicate generalizability tests encountered in practical settings, such as different friction coefficients. MetaPG evolves algorithms that can obtain 10% better generalizability without loss of performance within the same meta-training environment and obtain similar results to SAC when doing cross-domain evaluations in other Brax environments. The evolution results are interpretable; by analyzing the structure of the best algorithms we identify elements that help optimizing certain objectives, such as regularization terms for the critic loss.

Submitted 24 April, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

arXiv:2203.16594 [pdf, other]

doi 10.21468/SciPostPhys.13.3.070

Generalised Onsager Algebra in Quantum Lattice Models

Authors: Yuan Miao

Abstract: The Onsager algebra is one of the cornerstones of exactly solvable models in statistical mechanics. Starting from the generalised Clifford algebra, we demonstrate its relations to the graph Temperley-Lieb algebra, and a generalisation of the Onsager algebra. We present a series of quantum lattice models as representations of the generalised Clifford algebra, possessing the structure of a special type of the generalised Onsager algebra. The integrability of those models is presented, analogous to the free fermionic eight-vertex model. We also mention further extensions of the models and physical properties related to the generalised Onsager algebras, hinting at a general framework that includes families of quantum lattice models possessing the structure of the generalised Onsager algebras.

Submitted 17 August, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

Comments: 25 pages, 2 figures

Journal ref: SciPost Phys. 13, 070 (2022)

arXiv:2203.11749 [pdf, ps, other]

Global existence and blow-up for a stochastic transport equation with non-local velocity

Authors: Diego Alonso-Orán, Yingting Miao, Hao Tang

Abstract: In this paper we investigate a non-linear and non-local one dimensional transport equation under random perturbations on the real line. We first establish a local-in-time theory, i.e., existence, uniqueness and blow-up criterion for pathwise solutions in Sobolev spaces $H^{s}$ with $s>3$. Thereafter, we give a complete picture of the long time behavior of the solutions based on the type of noise we consider. On one hand, we identify a family of noises such that blow-up can be prevented with probability $1$, guaranteeing the existence and uniqueness of global solutions almost surely. On the other hand, in the particular linear noise case, we show that singularities occur in finite time with positive probability, and we derive lower bounds of these probabilities. To conclude, we introduce the notion of stability of exiting times and show that one cannot improve the stability of the exiting time and simultaneously improve the continuity of the dependence on initial data.

Submitted 22 March, 2022; originally announced March 2022.

Comments: 32 pages

arXiv:2203.08406 [pdf, ps, other]

Levenberg-Marquardt Method Based Cooperative Source Localization in SIMO Molecular Communication via Diffusion Systems

Authors: Yuqi Miao, Wence Zhang, Xu Bao

Abstract: Molecular communication underpins nano-scale communications in nanotechnology. The combination of multinanomachines to form nano-networks is one of the main enabling methods. Due to the importance of source localization in establishing nano-networks, this paper proposes a cooperative source localization method for Molecular Communication via Diffusion (MCvD) systems using multiple spherical absorption receivers. Since there is no exact mathematical expression of the channel impulse response for multiple absorbing receivers, we adopt an empirical expression and use Levenberg-Marquardt method to estimate the distance of the transmitter to each receiver, based on which the location of the transmitter is obtained using an iterative scheme where the initial point is obtained using a multi-point localization method. Particle based simulation is carried out to evaluate the performance of the proposed method. Simulation results show that the proposed method can accurately estimate the location of transmitter in short to medium communication ranges.

Submitted 16 March, 2022; originally announced March 2022.

arXiv:2203.05248 [pdf, other]

Look Backward and Forward: Self-Knowledge Distillation with Bidirectional Decoder for Neural Machine Translation

Authors: Xuanwei Zhang, Libin Shen, Disheng Pan, Liang Wang, Yanjun Miao

Abstract: Neural Machine Translation (NMT) models are usually trained via a unidirectional decoder, which corresponds to optimizing one-step-ahead prediction. However, this kind of unidirectional decoding framework may incline to focus on local structure rather than global coherence. To alleviate this problem, we propose a novel method, Self-Knowledge Distillation with Bidirectional Decoder for Neural Machine Translation (SBD-NMT). We deploy a backward decoder which can act as an effective regularization method to the forward decoder. By leveraging the backward decoder's information about the longer-term future, distilling knowledge learned in the backward decoder can encourage auto-regressive NMT models to plan ahead. Experiments show that our method is significantly better than the strong Transformer baselines on multiple machine translation data sets.

Submitted 10 March, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

arXiv:2203.03570 [pdf, other]

Kubric: A scalable dataset generator

Authors: Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti, Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi, et al. (10 additional authors not shown)

Abstract: Data is the driving force of machine learning, with the amount and quality of training data often being more important for the performance of a system than architecture and training details. But collecting, processing and annotating real data at scale is difficult, expensive, and frequently raises additional privacy, fairness and legal concerns. Synthetic data is a powerful tool with the potential to address these shortcomings: 1) it is cheap 2) supports rich ground-truth annotations 3) offers full control over data and 4) can circumvent or mitigate problems regarding bias, privacy and licensing. Unfortunately, software tools for effective data generation are less mature than those for architecture design and training, which leads to fragmented generation efforts. To address these problems we introduce Kubric, an open-source Python framework that interfaces with PyBullet and Blender to generate photo-realistic scenes, with rich annotations, and seamlessly scales to large jobs distributed over thousands of machines, and generating TBs of data. We demonstrate the effectiveness of Kubric by presenting a series of 13 different generated datasets for tasks ranging from studying 3D NeRF models to optical flow estimation. We release Kubric, the used assets, all of the generation code, as well as the rendered datasets for reuse and modification.

Submitted 7 March, 2022; originally announced March 2022.

Comments: 21 pages, CVPR2022

arXiv:2202.10280 [pdf, ps, other]

On the Information-theoretic Security of Combinatorial All-or-nothing Transforms

Authors: Yujie Gu, Sonata Akao, Navid Nasr Esfahani, Ying Miao, Kouichi Sakurai

Abstract: All-or-nothing transforms (AONT) were proposed by Rivest as a message preprocessing technique for encrypting data to protect against brute-force attacks, and have numerous applications in cryptography and information security. Later the unconditionally secure AONT and their combinatorial characterization were introduced by Stinson. Informally, a combinatorial AONT is an array with the unbiased requirements and its security properties in general depend on the prior probability distribution on the inputs $s$-tuples. Recently, it was shown by Esfahani and Stinson that a combinatorial AONT has perfect security provided that all the inputs $s$-tuples are equiprobable, and has weak security provided that all the inputs $s$-tuples are with non-zero probability. This paper aims to explore on the gap between perfect security and weak security for combinatorial $(t,s,v)$-AONTs. Concretely, we consider the typical scenario that all the $s$ inputs take values independently (but not necessarily identically) and quantify the amount of information $H(\mathcal{X}|\mathcal{Y})$ about any $t$ inputs $\mathcal{X}$ that is not revealed by any $s-t$ outputs $\mathcal{Y}$. In particular, we establish the general lower and upper bounds on $H(\mathcal{X}|\mathcal{Y})$ for combinatorial AONTs using information-theoretic techniques, and also show that the derived bounds can be attained in certain cases. Furthermore, the discussions are extended for the security properties of combinatorial asymmetric AONTs.

Submitted 21 February, 2022; originally announced February 2022.

Comments: 16 pages

arXiv:2202.08422 [pdf, ps, other]

Euler's scheme of Mckean-Vlasov SDEs with non-Lipschitz coefficients

Authors: Zhen Wang, Jie Ren, Yu Miao

Abstract: In this paper, we show the strong well-posedness of Mckean-Vlasov SDEs with non-Lipschitz coefficients. Moreover, propagation of chaos and the convergence rate for Euler's scheme of Mckean-Vlasov SDEs are also obtained.

Submitted 16 February, 2022; originally announced February 2022.

arXiv:2202.05469 [pdf, ps, other]

Privacy-preserving Generative Framework Against Membership Inference Attacks

Authors: Ruikang Yang, Jianfeng Ma, Yinbin Miao, Xindi Ma

Abstract: Artificial intelligence and machine learning have been integrated into all aspects of our lives and the privacy of personal data has attracted more and more attention. Since the generation of the model needs to extract the effective information of the training data, the model has the risk of leaking the privacy of the training data. Membership inference attacks can measure the model leakage of source data to a certain degree. In this paper, we design a privacy-preserving generative framework against membership inference attacks, through the information extraction and data generation capabilities of the generative model variational autoencoder (VAE) to generate synthetic data that meets the needs of differential privacy. Instead of adding noise to the model output or tampering with the training process of the target model, we directly process the original data. We first map the source data to the latent space through the VAE model to get the latent code, then perform noise process satisfying metric privacy on the latent code, and finally use the VAE model to reconstruct the synthetic data. Our experimental evaluation demonstrates that the machine learning model trained with newly generated synthetic data can effectively resist membership inference attacks and still maintain high utility.

Submitted 11 February, 2022; originally announced February 2022.

Comments: Under Review

arXiv:2202.05416 [pdf, other]

FAAG: Fast Adversarial Audio Generation through Interactive Attack Optimisation

Authors: Yuantian Miao, Chao Chen, Lei Pan, Jun Zhang, Yang Xiang

Abstract: Automatic Speech Recognition services (ASRs) inherit deep neural networks' vulnerabilities like crafted adversarial examples. Existing methods often suffer from low efficiency because the target phases are added to the entire audio sample, resulting in high demand for computational resources. This paper proposes a novel scheme named FAAG as an iterative optimization-based method to generate targeted adversarial examples quickly. By injecting the noise over the beginning part of the audio, FAAG generates high-quality adversarial audio with a high success rate in a timely manner. Specifically, we use audio's logits output to map each character in the transcription to an approximate position of the audio's frame. Thus, an adversarial example can be generated by FAAG in approximately two minutes using CPUs only and around ten seconds with one GPU while maintaining an average success rate over 85%. Specifically, the FAAG method can speed up around 60% compared with the baseline method during the adversarial example generation process. Furthermore, we found that appending benign audio to any suspicious examples can effectively defend against the targeted adversarial attack. We hope that this work paves the way for inventing new adversarial attacks against speech recognition with computational constraints.

Submitted 10 February, 2022; originally announced February 2022.

arXiv:2201.10583 [pdf]

doi 10.1063/5.0086321

High Gradient Silicon Carbide Immersion Lens Ultrafast Electron Sources

Authors: Kenneth J. Leedle, Uwe Niedermayer, Eric Skär, Karel Urbanek, Yu Miao, Payton Broaddus, Olav Solgaard, Robert L. Byer

Abstract: We present two compact ultrafast electron injector designs with integrated focusing that provide high peak brightness of up to $1.9*10^{12} A/m^2Sr^2$ with 10s of electrons per laser pulse using silicon carbide electrodes and silicon nanotip emitters. We demonstrate a few centimeter scale 96 keV immersion lens electron source and a 57 keV immersion lens electron source with a 19 kV/mm average acceleration gradient, nearly double the typical 10 kV/mm used in DC electron sources. The brightness of the electron sources is measured alongside start-to-end simulations including space charge effects. These sources are suitable for dielectric laser accelerator experiments, ultrafast electron diffraction, and other applications where a compact high brightness electron source is required.

Submitted 25 January, 2022; originally announced January 2022.

Comments: 8 pages, 5 figures

arXiv:2201.08896 [pdf, other]

Environment Generation for Zero-Shot Compositional Reinforcement Learning

Authors: Izzeddin Gur, Natasha Jaques, Yingjie Miao, Jongwook Choi, Manoj Tiwari, Honglak Lee, Aleksandra Faust

Abstract: Many real-world problems are compositional - solving them requires completing interdependent sub-tasks, either in series or in parallel, that can be represented as a dependency graph. Deep reinforcement learning (RL) agents often struggle to learn such complex tasks due to the long time horizons and sparse rewards. To address this problem, we present Compositional Design of Environments (CoDE), which trains a Generator agent to automatically build a series of compositional tasks tailored to the RL agent's current skill level. This automatic curriculum not only enables the agent to learn more complex tasks than it could have otherwise, but also selects tasks where the agent's performance is weak, enhancing its robustness and ability to generalize zero-shot to unseen tasks at test-time. We analyze why current environment generation techniques are insufficient for the problem of generating compositional tasks, and propose a new algorithm that addresses these issues. Our results assess learning and generalization across multiple compositional tasks, including the real-world problem of learning to navigate and interact with web pages. We learn to generate environments composed of multiple pages or rooms, and train RL agents capable of completing a wide range of complex tasks in those environments. We contribute two new benchmark frameworks for generating compositional tasks, compositional MiniGrid and gMiniWoB for web navigation. CoDE yields 4x higher success rate than the strongest baseline, and demonstrates strong performance of real websites learned on 3500 primitive tasks.

Submitted 21 January, 2022; originally announced January 2022.

Comments: Published in NeurIPS 2021

arXiv:2201.03916 [pdf, other]

doi 10.1613/jair.1.13596

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Authors: Jack Parker-Holder, Raghu Rajan, Xingyou Song, André Biedenkapp, Yingjie Miao, Theresa Eimer, Baohe Zhang, Vu Nguyen, Roberto Calandra, Aleksandra Faust, Frank Hutter, Marius Lindauer

Abstract: The combination of Reinforcement Learning (RL) with deep learning has led to a series of impressive feats, with many believing (deep) RL provides a path towards generally capable agents. However, the success of RL agents is often highly sensitive to design choices in the training process, which may require tedious and error-prone manual tuning. This makes it challenging to use RL for new problems, while also limits its full potential. In many other areas of machine learning, AutoML has shown it is possible to automate such design choices and has also yielded promising initial results when applied to RL. However, Automated Reinforcement Learning (AutoRL) involves not only standard applications of AutoML but also includes additional challenges unique to RL, that naturally produce a different set of methods. As such, AutoRL has been emerging as an important area of research in RL, providing promise in a variety of applications from RNA design to playing games such as Go. Given the diversity of methods and environments considered in RL, much of the research has been conducted in distinct subfields, ranging from meta-learning to evolution. In this survey we seek to unify the field of AutoRL, we provide a common taxonomy, discuss each area in detail and pose open problems which would be of interest to researchers going forward.

Submitted 2 June, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

Comments: Published in JAIR. Co-first authors and co-last authors are listed in alphabetical order

MSC Class: 68T01 ACM Class: I.2.6

Journal ref: Journal of Artificial Intelligence Research 74 (2022) 517-568

arXiv:2201.03859 [pdf, other]

On Exploring Pose Estimation as an Auxiliary Learning Task for Visible-Infrared Person Re-identification

Authors: Yunqi Miao, Nianchang Huang, Xiao Ma, Qiang Zhang, Jungong Han

Abstract: Visible-infrared person re-identification (VI-ReID) has been challenging due to the existence of large discrepancies between visible and infrared modalities. Most pioneering approaches reduce intra-class variations and inter-modality discrepancies by learning modality-shared and ID-related features. However, an explicit modality-shared cue, i.e., body keypoints, has not been fully exploited in VI-ReID. Additionally, existing feature learning paradigms imposed constraints on either global features or partitioned feature stripes, which neglect the prediction consistency of global and part features. To address the above problems, we exploit Pose Estimation as an auxiliary learning task to assist the VI-ReID task in an end-to-end framework. By jointly training these two tasks in a mutually beneficial manner, our model learns higher quality modality-shared and ID-related features. On top of it, the learnings of global features and local features are seamlessly synchronized by Hierarchical Feature Constraint (HFC), where the former supervises the latter using the knowledge distillation strategy. Experimental results on two benchmark VI-ReID datasets show that the proposed method consistently improves state-of-the-art methods by significant margins. Specifically, our method achieves nearly 20$\%$ mAP improvements against the state-of-the-art method on the RegDB dataset. Our intriguing findings highlight the usage of auxiliary task learning in VI-ReID.

Submitted 23 February, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

arXiv:2201.02971 [pdf, ps, other]

doi 10.1103/PhysRevD.106.124052

Bounce corrections to gravitational lensing, quasinormal spectral stability and gray-body factors of Reissner-Nordström black holes

Authors: Yang Guo, Chen Lan, Yan-Gang Miao

Abstract: Gravitational lensing in the weak field limit, quasinormal spectra, and gray-body factors are investigated in the Reissner-Nordström spacetime corrected by bounce parameters. Using the Gauss-Bonnet theorem, we analyze the effects of bounce corrections to the weak gravitational deflection angle and find that the divergence of the deflection angle can be suppressed by a bounce correction in the Reissner-Nordström spacetime. We also notice that the bounce correction plays the same role as the Morse potential in the deflection angle. Moreover, we derive the perturbation equations with the spin-dependent Regge-Wheeler potential and discuss the quasinormal spectral stability. We observe that the quasinormal spectra decrease for both the massless scalar and electromagnetic field perturbations. We further study the transmission probability of particles scattered by the Regge-Wheeler potential and reveal that the bounce correction introduced into the Reissner-Nordström spacetime increases the gray-body factors of perturbation fields.

Submitted 1 January, 2023; v1 submitted 9 January, 2022; originally announced January 2022.

Comments: v1: 8 pages, 2 figures, 2 tables; v2: references added; v3: 16 pages, four tables, one author, two appendixes, clarifications, and references added, final version to appear in Physical Review D

Journal ref: Phys. Rev. D 106, 124052 (2022)

arXiv:2201.00742 [pdf]

Property unification of inherent amplitude, phase and polarization within a light beam

Authors: Xiaoyu Weng, Yu Miao, Guanxue Wang, Yihui Wang, Qiufang Zhan, Xiangmei Dong, Junle Qu, Xiumin Gao, Songlin Zhuang

Abstract: Is it possible to modulate the inherent properties of a single light beam, namely amplitude, phase and polarization, simultaneously, by merely its phase? Here, we solve this scientific problem by unifying all three properties of a single light beam using phase vectorization and a phase version of Malus's law. A full-property spatial light modulator is therefore developed based on the unification of these fundamental links, which enables pixel-level polarization, amplitude and phase manipulation of light beams in a real-time dynamic way. This work not only implies that the amplitude, phase and polarization of a single light beam are interconnected, but also offers a solid answer on how to modulate these three natures of a single light beam simultaneously, which will deepen our understanding of the behavior of light beams and facilitate extensive developments in optics and related fields.

Submitted 3 January, 2022; originally announced January 2022.

arXiv:2112.14397 [pdf, other]

EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate

Authors: Xiaonan Nie, Xupeng Miao, Shijie Cao, Lingxiao Ma, Qibin Liu, Jilong Xue, Youshan Miao, Yi Liu, Zhi Yang, Bin Cui

Abstract: Mixture-of-experts (MoE) is becoming popular due to its success in improving the model quality, especially in Transformers. By routing tokens with a sparse gate to a few experts (i.e., a small piece of the full model), MoE can easily increase the model parameters to a very large scale while keeping the computation cost at a constant level. Most existing works just initialize some random experts, set a fixed gating strategy (e.g., Top-k), and train the model from scratch in an ad-hoc way. We identify that these MoE models suffer from immature experts and an unstable sparse gate, which are harmful to the convergence performance. In this paper, we propose an efficient end-to-end MoE training framework called EvoMoE. EvoMoE starts from training one single expert and gradually evolves into a large and sparse MoE structure. EvoMoE mainly contains two phases: the expert-diversify phase to train the base expert for a while and spawn multiple diverse experts from it, and the gate-sparsify phase to learn an adaptive sparse gate and activate a dynamic number of experts. EvoMoE naturally decouples the joint learning of both the experts and the sparse gate and focuses on learning the basic knowledge with a single expert at the early training stage. Then it diversifies the experts and continues to train the MoE with a novel Dense-to-Sparse gate (DTS-Gate). Specifically, instead of using a permanent sparse gate, DTS-Gate begins as a dense gate that routes tokens to all experts, then gradually and adaptively becomes sparser while routing to fewer experts. Evaluations are conducted on three popular models and tasks, including RoBERTa for masked language modeling task, GPT for language modeling task and Transformer for machine translation task. The results show that EvoMoE outperforms existing baselines, including Switch, BASE Layer, Hash Layer and StableMoE.

Submitted 9 October, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

arXiv:2112.01747 [pdf, other]

doi 10.1016/j.nuclphysb.2022.115938

Charged black-bounce spacetimes: Photon rings, shadows and observational appearances

Authors: Yang Guo, Yan-Gang Miao

Abstract: The photon ring, shadow and observational appearance of the emission originating near a charged black-bounce are investigated. Based on the geodesic analysis, we determine the upper and lower limits of critical impact parameters of a charged black-bounce. In particular, we find that the charged black-bounce shares the same critical impact parameter with the Reissner-Nordström black hole. In addition, we classify the light trajectories coming from the region near the charged black-bounce by utilizing the ray-tracing procedure, and then investigate the observational appearance of the emissions from a thin disk accretion and a spherically symmetric infalling accretion. We reveal that a large charge increases the observed intensity but decreases the apparent size of shadows, and that the photon ring presents the intrinsic property of a spacetime geometry, which is independent of the types of the two accretions. Our results are in good agreement with the recent observations.

Submitted 7 August, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

Comments: v1: 10 pages, 6 figures; v2: references added; v3: 12 pages, 7 figures, clarifications and references added, final version to appear in Nuclear Physics B

Journal ref: Nucl. Phys. B 983 (2022) 115938

arXiv:2110.14201 [pdf, other]

doi 10.1140/epjc/s10052-022-10458-y

The generalized holographic $c$-function for regular AdS black holes

Authors: Yang Li, Yan-Gang Miao

Abstract: We use the causal horizon entropy to study the asymptotic behaviors of regular AdS black holes. In some literature, the causal horizon entropy is regarded as a generalized holographic $c$-function. In this paper, we apply this idea to the case of regular AdS black holes. We show that the causal horizon entropy decreases to zero at the center of regular AdS black holes and in particular it is stationary because its derivative with respect to the affine parameter approaches zero asymptotically. Meanwhile, the asymptotic behavior of the metric of regular AdS black holes implies that the black hole center corresponds to an IR fixed point. Therefore, we conclude that the causal horizon entropy is a valid candidate for the holographic $c$-function of these regular AdS black holes.

Submitted 22 May, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

Comments: v1: 21 pages, 5 figures; v2: title changed, final version to appear in European Physical Journal C

Journal ref: Eur. Phys. J. C 82, 503 (2022)

arXiv:2110.12008 [pdf]

Super-Resolution imaging of plasmonic Near-fields: Overcoming Emitter Mislocalizations

Authors: Yuting Miao, Robert C. Boutelle, Anastasia Blake, Vigneshwaran Chandrasekaran, Jennifer Hollingsworth, Shimon Weiss

Abstract: Plasmonic nano-objects have shown great potential in enhancing biological and chemical sensing, light harvesting and energy transfer, and optical and quantum computing to name a few. Therefore, an extensive effort has been vested in optimizing plasmonic systems and exploiting their field enhancement properties. Super-resolution imaging with quantum dots (QDs) is a promising method to probe plasmonic near-fields, but is hindered by the distortion of the emission intensity and radiation pattern. Here we investigate the interaction between QDs and 'L-shaped' gold nanoantennas, and demonstrate both theoretically and experimentally that this strong interaction can induce polarization-dependent modifications to the apparent QD emission intensity, polarization and localization. Based on FDTD simulations and polarization-modulated single-molecule microscopy, we show that the displacement of the emitter's localization is due to the interference between the emitter and the induced dipole and can be up to 100 nm. We also discovered that the emission polarization can rotate towards the symmetry axis or one arm of the L-shape because of the scattering. Our results could assist in paving a pathway for higher precision plasmonic near-field mapping and its underlying applications.

Submitted 22 October, 2021; originally announced October 2021.

arXiv:2110.10358 [pdf, other]

Hierarchical Aspect-guided Explanation Generation for Explainable Recommendation

Authors: Yidan Hu, Yong Liu, Chunyan Miao, Gongqi Lin, Yuan Miao

Abstract: Explainable recommendation systems provide explanations for recommendation results to improve their transparency and persuasiveness. The existing explainable recommendation methods generate textual explanations without explicitly considering the user's preferences on different aspects of the item. In this paper, we propose a novel explanation generation framework, named Hierarchical Aspect-guided explanation Generation (HAG), for explainable recommendation. Specifically, HAG employs a review-based syntax graph to provide a unified view of the user/item details. An aspect-guided graph pooling operator is proposed to extract the aspect-relevant information from the review-based syntax graphs to model the user's preferences on an item at the aspect level. Then, a hierarchical explanation decoder is developed to generate aspects and aspect-relevant explanations based on the attention mechanism. The experimental results on three real datasets indicate that HAG outperforms state-of-the-art explanation generation methods in both single-aspect and multi-aspect explanation generation tasks, and also achieves comparable or even better preference prediction accuracy than strong baseline methods.

Submitted 22 October, 2021; v1 submitted 19 October, 2021; originally announced October 2021.

arXiv:2110.09998 [pdf, other]

Watch out for the risky actors: Assessing risk in dynamic environments for safe driving

Authors: Saurabh Jha, Yan Miao, Zbigniew Kalbarczyk, Ravishankar K. Iyer

Abstract: Driving in a dynamic environment that consists of other actors is inherently a risky task as each actor influences the driving decision and may significantly limit the number of choices in terms of navigation and safety plan. The risk encountered by the Ego actor depends on the driving scenario and the uncertainty associated with predicting the future trajectories of the other actors in the driving scenario. However, not all objects pose a similar risk. Depending on the object's type, trajectory, position, and the uncertainty associated with these quantities, some objects pose a much higher risk than others. The higher the risk associated with an actor, the more attention must be directed towards that actor in terms of resources and safety planning. In this paper, we propose a novel risk metric to calculate the importance of each actor in the world and demonstrate its usefulness through a case study.

Submitted 19 October, 2021; originally announced October 2021.

Comments: preprint version

arXiv:2110.08226 [pdf, other]

Guiding Visual Question Generation

Authors: Nihir Vedd, Zixu Wang, Marek Rei, Yishu Miao, Lucia Specia

Abstract: In traditional Visual Question Generation (VQG), most images have multiple concepts (e.g. objects and categories) for which a question could be generated, but models are trained to mimic an arbitrary choice of concept as given in their training data. This makes training difficult and also poses issues for evaluation -- multiple valid questions exist for most images but only one or a few are captured by the human references. We present Guiding Visual Question Generation - a variant of VQG which conditions the question generator on categorical information based on expectations on the type of question and the objects it should explore. We propose two variants: (i) an explicitly guided model that enables an actor (human or automated) to select which objects and categories to generate a question for; and (ii) an implicitly guided model that learns which objects and categories to condition on, based on discrete latent variables. The proposed models are evaluated on an answer-category augmented VQA dataset and our quantitative results show a substantial improvement over the current state of the art (over 9 BLEU-4 increase). Human evaluation validates that guidance helps the generation of questions that are grammatically coherent and relevant to the given image and objects.

Submitted 26 July, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

Comments: 14 pages including references and Appendix. 3 figures and 4 tables

arXiv:2110.02814 [pdf, other]

Efficient and High-quality Prehensile Rearrangement in Cluttered and Confined Spaces

Authors: Rui Wang, Yinglong Miao, Kostas E. Bekris

Abstract: Prehensile object rearrangement in cluttered and confined spaces has broad applications but is also challenging. For instance, rearranging products in a grocery shelf means that the robot cannot directly access all objects and has limited free space. This is harder than tabletop rearrangement where objects are easily accessible with top-down grasps, which simplifies robot-object interactions. This work focuses on problems where such interactions are critical for completing tasks. It proposes a new efficient and complete solver under general constraints for monotone instances, which can be solved by moving each object at most once. The monotone solver reasons about robot-object constraints and uses them to effectively prune the search space. The new monotone solver is integrated with a global planner to solve non-monotone instances with high-quality solutions fast. Furthermore, this work contributes an effective pre-processing tool to significantly speed up online motion planning queries for rearrangement in confined spaces. Experiments further demonstrate that the proposed monotone solver, equipped with the pre-processing tool, results in 57.3% faster computation and 3 times higher success rate than state-of-the-art methods. Similarly, the resulting global planner is computationally more efficient and has a higher success rate, while producing high-quality solutions for non-monotone instances (i.e., only 1.3 additional actions are needed on average). Videos of demonstrating solutions on a real robotic system and codes can be found at https://github.com/Rui1223/uniform_object_rearrangement.

Submitted 17 March, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

Comments: accepted to IEEE International Conference on Robotics and Automation (ICRA 2022)

arXiv:2110.01819 [pdf, ps, other]

doi 10.1103/PhysRevX.11.041048

Tuning the Parity Mixing of Singlet-Septet Pairing in a Half-Heusler Superconductor

Authors: K. Ishihara, T. Takenaka, Y. Miao, Y. Mizukami, K. Hashimoto, M. Yamashita, M. Konczykowski, R. Masuki, M. Hirayama, T. Nomoto, R. Arita, O. Pavlosiuk, P. Wisniewski, D. Kaczorowski, T. Shibauchi

Abstract: In superconductors, electrons with spin ${s=1/2}$ form Cooper pairs whose spin structure is usually singlet (${S=0}$) or triplet (${S=1}$). When the electronic structure near the Fermi level is characterized by fermions with angular momentum ${j=3/2}$ due to strong spin-orbit interactions, novel pairing states such as even-parity quintet (${J=2}$) and odd-parity septet (${J=3}$) states become allowed. Prime candidates for such exotic states are half-Heusler superconductors, which exhibit unconventional superconducting properties, but their pairing nature remains unsettled. Here we show that the superconductivity in the noncentrosymmetric half-Heusler LuPdBi can be consistently described by the admixture of isotropic even-parity singlet and anisotropic odd-parity septet pairing, whose ratio can be tuned by electron irradiation. From magnetotransport and penetration depth measurements, we find that carrier concentrations and impurity scattering both increase with irradiation, resulting in a nonmonotonic change of the superconducting gap structure. Our findings shed new light on our fundamental understanding of unconventional superconducting states in topological materials.

Submitted 2 December, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

Comments: 11 pages, 9 figures, to be published in Phys. Rev. X

Journal ref: Phys. Rev. X 11, 041048 (2021)

arXiv:2109.13910 [pdf, other]

Online Object Model Reconstruction and Reuse for Lifelong Improvement of Robot Manipulation

Authors: Shiyang Lu, Rui Wang, Yinglong Miao, Chaitanya Mitash, Kostas Bekris

Abstract: This work proposes a robotic pipeline for picking and constrained placement of objects without geometric shape priors. Compared to recent efforts developed for similar tasks, where every object was assumed to be novel, the proposed system recognizes previously manipulated objects and performs online model reconstruction and reuse. Over a lifelong manipulation process, the system keeps learning features of objects it has interacted with and updates their reconstructed models. Whenever an instance of a previously manipulated object reappears, the system aims to first recognize it and then register its previously reconstructed model given the current observation. This step greatly reduces object shape uncertainty allowing the system to even reason for parts of objects, which are currently not observable. This also results in better manipulation efficiency as it reduces the need for active perception of the target object during manipulation. To get a reusable reconstructed model, the proposed pipeline adopts: i) TSDF for object representation, and ii) a variant of the standard particle filter algorithm for pose estimation and tracking of the partial object model. Furthermore, an effective way to construct and maintain a dataset of manipulated objects is presented. A sequence of real-world manipulation experiments is performed. They show how future manipulation tasks become more effective and efficient by reusing reconstructed models of previously manipulated objects, which were generated during their prior manipulation, instead of treating objects as novel every time.

Submitted 22 May, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

arXiv:2109.13556 [pdf, other]

doi 10.1140/epjc/s10052-022-10200-8

Acoustic regular black hole in fluid and its similarity and diversity to a conformally related black hole

Authors: Chen Lan, Yan-Gang Miao, Yi-Xiong Zang

Abstract: We address an interesting question in the present paper that whether the acoustic gravity can be applied as a tool to the study of regular black holes. For this purpose, we construct a general acoustic regular black hole in the spherically symmetric fluid, where its regularity is verified from the perspective of finiteness of curvature invariants and completeness of geodesics. In particular, we find that the acoustic interval not only looks like a line element of a conformally related black hole in which the fluid density can be regarded as a conformal factor, but also gives rise to a non-vanishing partition function which coincides with that of a conformally related black hole. As an application, we provide a specific acoustic regular black hole model, investigate its energy conditions and compute its quasinormal modes. We note that the strong energy condition of our model is violated completely outside the horizon of the model but remains valid in some regions inside the horizon, which may give a new insight into the relation between the regularity and strong energy condition. Moreover, we analyze the oscillating and damping features of our model when it is perturbed.

Submitted 9 March, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

Comments: v1: 27 pages, 16 figures; v2: 28 pages, 17 figures, clarifications and references added; v3: clarifications and references added; v4: 33 pages, clarifications and references added, final version to appear in European Physical Journal C

Journal ref: Eur. Phys. J. C (2022) 82: 231

arXiv:2108.06470 [pdf, other]

doi 10.1103/PhysRevD.105.044031

Absorption cross section of regular black holes in scalar-tensor conformal gravity

Authors: Yang Li, Yan-Gang Miao

Abstract: In terms of the complex angular momentum method, we compute the absorption cross section by analyzing a massless test scalar field around conformally related black holes. At first, we investigate circular null geodesics and thereby prove a precondition for calculating the absorption cross section in the context of conformally related black holes. Then we use the WKB approximation method to derive the analytic expression of Regge frequency and the oscillation part of absorption cross sections. We find that this oscillation part depends on the scale factor of conformal transformations. By taking the conformally related Schwarzschild-Tangherlini black hole as an example, we show that this regular black hole has substantially distinctive absorption behavior compared with singular black holes. Our result provides a new approach to distinguish a regular black hole from a singular one.

Submitted 31 January, 2022; v1 submitted 14 August, 2021; originally announced August 2021.

Comments: v1: 27 pages, 3 figures, 2 appendixes; v2: 28 pages, clarifications added, final version to appear in Physical Review D

Journal ref: Phys. Rev. D 105 (2022) 044031

arXiv:2107.08709 [pdf, other]

ZIPPER: Exploiting Tile- and Operator-level Parallelism for General and Scalable Graph Neural Network Acceleration

Authors: Zhihui Zhang, Jingwen Leng, Shuwen Lu, Youshan Miao, Yijia Diao, Minyi Guo, Chao Li, Yuhao Zhu

Abstract: Graph neural networks (GNNs) start to gain momentum after showing significant performance improvement in a variety of domains including molecular science, recommendation, and transportation. Turning such performance improvement of GNNs into practical applications relies on effective and efficient execution, especially for inference. However, neither CPU nor GPU can meet these needs if considering both performance and energy efficiency. That's because accelerating GNNs is challenging due to their excessive memory usage and arbitrary interleaving of diverse operations. Besides, the semantics gap between the high-level GNN programming model and efficient hardware makes it difficult to accelerate general-domain GNNs. To address the challenge, we propose Zipper, an efficient yet general acceleration system for GNNs. The keys to Zipper include a graph-native intermediate representation (IR) and the associated compiler. By capturing GNN primitive operations and representing them with GNN IR, Zipper is able to fit GNN semantics into hardware structure for efficient execution. The IR also enables GNN-specific optimizations including sparse graph tiling and redundant operation elimination. We further present a hardware architecture design consisting of dedicated blocks for different primitive operations, along with a run-time scheduler to map an IR program to the hardware blocks. Our evaluation shows that Zipper achieves 93.6x speedup and 147x energy reduction over Intel Xeon CPU, and 1.56x speedup and 4.85x energy reduction over NVIDIA V100 GPU on average.

Submitted 19 July, 2021; originally announced July 2021.

Comments: 11 pages

arXiv:2107.08352 [pdf, other]

Can we know about black hole thermodynamics through shadows?

Authors: Xin-Chang Cai, Yan-Gang Miao

Abstract: We investigate the relationship between shadow radius and microstructure for a general static spherically symmetric black hole and confirm their close connection. In this regard, we take the Reissner-Nordström (AdS) black hole as an example to do the concrete analysis. On the other hand, we study for the Kerr (AdS) black hole the relationship between its shadow and thermodynamics in the aspects of phase transition and microstructure. Our results for the Kerr (AdS) black hole show that the shadow radius $r_{\rm sh}$, the deformation parameters $δ_s$ and $k_s$, and the circularity deviation $ΔC$ can reflect the black hole thermodynamics. In addition, we give the constraints to the relaxation time of the M$87^{*}$ black hole by combining its shadow data and the Bekenstein-Hod universal bounds when the M$87^{*}$ is regarded as the Reissner-Nordström or Kerr black hole. We predict that the minimum relaxation times of M$87^{*}$ black hole and Sgr $A^{*}$ black hole are approximately 3 days and 2.64 minutes, respectively. Finally, we draw the first graph of the minimum relaxation time $τ_{\rm min}$ with respect to the maximum shadow radius $ r_{\rm sh}^{\rm max}$ at different mass levels.

Submitted 12 November, 2021; v1 submitted 17 July, 2021; originally announced July 2021.

Comments: v1: 31 pages, 11 figures, a new paper regarding shadow and thermodynamics for both spherically symmetric and rotating black holes, substantial new results for rotating black holes, only a small part of discussions on shadow and scalar curvature for spherically symmetric black holes overlapping with arXiv:2101.10780; v4: clarifications added, typos corrected

arXiv:2107.01866 [pdf, other]

doi 10.1016/j.nuclphysb.2022.115839

Weinhold geometry and thermodynamics of Bardeen AdS black holes

Authors: Yang Guo, Yan-Gang Miao

Abstract: Thermodynamics of Bardeen AdS black holes has attracted a great deal of attention due to its intrinsic complications and rich phase structures. However, the entropy and thermodynamic volume are incorrect in some of the literature. In this paper we revisit the thermodynamics of Bardeen AdS black holes and provide the correct entropy and thermodynamic volume. Furthermore, thermodynamic geometries are a powerful tool to probe the microstructure of black holes. Based on the Hessian matrix of black hole mass, we introduce a thermodynamic metric and give its scalar curvature in Weinhold's geometry. The conformal relation between Weinhold's geometry and Ruppeiner's geometry will be changed due to the specific first law of thermodynamics for regular black holes like the Bardeen AdS black hole. We also investigate the critical behaviour of phase transitions in an extended phase space, and find that the critical behaviour of the Bardeen AdS black hole coincides with that of liquid-gas systems. In particular, based on the Weinhold geometry we unveil a repulsive interaction in the microstructure of the Bardeen AdS black hole under its small volume state, while it is known that only attractive interaction exists in the microstructure of the van der Waals fluid.

Submitted 13 May, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

Comments: v1: 11 pages, 4 figures; v2: clarifications and references added; v3: 12 pages, 5 figures, title changed, new contexts added, errors and typos corrected; v4: clarifications and references added, final version to appear in Nuclear Physics B

Journal ref: Nucl. Phys. B 980 (2022) 115839 (15 pages)

arXiv:2106.12833 [pdf, ps, other]

doi 10.1093/mnrasl/slab071

Light Bridges Can Suppress the Formation of Coronal Loops

Authors: Yuhu Miao, Libo Fu, Xian Du, Ding Yuan, Chaowei Jiang, Jiangtao Su, Mingyu Zhao, Sergey Anfinogentov

Abstract: A light bridge is a magnetic intrusion into a sunspot; it interacts with the main magnetic field and excites a variety of dynamical processes. In this letter, we studied the magnetic connectivity between a light bridge and coronal loops rooted at the sunspot. We used the data of the Atmospheric Imaging Assembly onboard the Solar Dynamics Observatory (SDO) to study the features of sunspots with light bridges. It is found that if a light bridge anchors at the umbra-penumbra boundary, coronal loops could not be formed around the anchoring point. If a light bridge becomes detached from the penumbra, the coronal loops start to form again. The vector magnetogram provided by the Helioseismic and Magnetic Imager onboard SDO shows that the anchoring region of a light bridge usually has accompanying opposite minor polarities. We conjecture that the magnetic field lines could connect to these opposite polarities and form short-range magnetic loops, and therefore coronal loops that extend to long range could not be formed. A model of the light bridge is proposed to explain the magnetic connectivity between a light bridge and the coronal loops. This model could explain many physical processes associated with light bridges.

Submitted 24 June, 2021; originally announced June 2021.

Comments: MNRAS, 5 pages, 3 figures

Showing 101–150 of 347 results for author: Miao, Y