-
IFNet: Deep Imaging and Focusing for Handheld SAR with Millimeter-wave Signals
Authors:
Yadong Li,
Dongheng Zhang,
Ruixu Geng,
**cheng Wu,
Yang Hu,
Qibin Sun,
Yan Chen
Abstract:
Recent advancements have showcased the potential of handheld millimeter-wave (mmWave) imaging, which applies synthetic aperture radar (SAR) principles in portable settings. However, existing studies addressing handheld motion errors either rely on costly tracking devices or employ simplified imaging models, leading to impractical deployment or limited performance. In this paper, we present IFNet,…
▽ More
Recent advancements have showcased the potential of handheld millimeter-wave (mmWave) imaging, which applies synthetic aperture radar (SAR) principles in portable settings. However, existing studies addressing handheld motion errors either rely on costly tracking devices or employ simplified imaging models, leading to impractical deployment or limited performance. In this paper, we present IFNet, a novel deep unfolding network that combines the strengths of signal processing models and deep neural networks to achieve robust imaging and focusing for handheld mmWave systems. We first formulate the handheld imaging model by integrating multiple priors about mmWave images and handheld phase errors. Furthermore, we transform the optimization processes into an iterative network structure for improved and efficient imaging performance. Extensive experiments demonstrate that IFNet effectively compensates for handheld phase errors and recovers high-fidelity images from severely distorted signals. In comparison with existing methods, IFNet can achieve at least 11.89 dB improvement in average peak signal-to-noise ratio (PSNR) and 64.91% improvement in average structural similarity index measure (SSIM) on a real-world dataset.
△ Less
Submitted 5 May, 2024; v1 submitted 3 May, 2024;
originally announced May 2024.
-
Towards Dense and Accurate Radar Perception Via Efficient Cross-Modal Diffusion Model
Authors:
Ruibin Zhang,
Donglai Xue,
Yuhan Wang,
Ruixu Geng,
Fei Gao
Abstract:
Millimeter wave (mmWave) radars have attracted significant attention from both academia and industry due to their capability to operate in extreme weather conditions. However, they face challenges in terms of sparsity and noise interference, which hinder their application in the field of micro aerial vehicle (MAV) autonomous navigation. To this end, this paper proposes a novel approach to dense an…
▽ More
Millimeter wave (mmWave) radars have attracted significant attention from both academia and industry due to their capability to operate in extreme weather conditions. However, they face challenges in terms of sparsity and noise interference, which hinder their application in the field of micro aerial vehicle (MAV) autonomous navigation. To this end, this paper proposes a novel approach to dense and accurate mmWave radar point cloud construction via cross-modal learning. Specifically, we introduce diffusion models, which possess state-of-the-art performance in generative modeling, to predict LiDAR-like point clouds from paired raw radar data. We also incorporate the most recent diffusion model inference accelerating techniques to ensure that the proposed method can be implemented on MAVs with limited computing resources.We validate the proposed method through extensive benchmark comparisons and real-world experiments, demonstrating its superior performance and generalization ability. Code and pretrained models will be available at https://github.com/ZJU-FAST-Lab/Radar-Diffusion.
△ Less
Submitted 19 March, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models
Authors:
Wei Zou,
Runpeng Geng,
Binghui Wang,
**yuan Jia
Abstract:
Large language models (LLMs) have achieved remarkable success due to their exceptional generative capabilities. Despite their success, they also have inherent limitations such as a lack of up-to-date knowledge and hallucination. Retrieval-Augmented Generation (RAG) is a state-of-the-art technique to mitigate those limitations. In particular, given a question, RAG retrieves relevant knowledge from…
▽ More
Large language models (LLMs) have achieved remarkable success due to their exceptional generative capabilities. Despite their success, they also have inherent limitations such as a lack of up-to-date knowledge and hallucination. Retrieval-Augmented Generation (RAG) is a state-of-the-art technique to mitigate those limitations. In particular, given a question, RAG retrieves relevant knowledge from a knowledge database to augment the input of the LLM. For instance, the retrieved knowledge could be a set of top-k texts that are most semantically similar to the given question when the knowledge database contains millions of texts collected from Wikipedia. As a result, the LLM could utilize the retrieved knowledge as the context to generate an answer for the given question. Existing studies mainly focus on improving the accuracy or efficiency of RAG, leaving its security largely unexplored. We aim to bridge the gap in this work. Particularly, we propose PoisonedRAG , a set of knowledge poisoning attacks to RAG, where an attacker could inject a few poisoned texts into the knowledge database such that the LLM generates an attacker-chosen target answer for an attacker-chosen target question. We formulate knowledge poisoning attacks as an optimization problem, whose solution is a set of poisoned texts. Depending on the background knowledge (e.g., black-box and white-box settings) of an attacker on the RAG, we propose two solutions to solve the optimization problem, respectively. Our results on multiple benchmark datasets and LLMs show our attacks could achieve 90% attack success rates when injecting 5 poisoned texts for each target question into a database with millions of texts. We also evaluate recent defenses and our results show they are insufficient to defend against our attacks, highlighting the need for new defenses.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
PALoc: Advancing SLAM Benchmarking with Prior-Assisted 6-DoF Trajectory Generation and Uncertainty Estimation
Authors:
Xiangcheng Hu,
Linwei Zheng,
** Wu,
Ruoyu Geng,
Yang Yu,
Hexiang Wei,
Xiaoyu Tang,
Lujia Wang,
Jianhao Jiao,
Ming Liu
Abstract:
Accurately generating ground truth (GT) trajectories is essential for Simultaneous Localization and Map** (SLAM) evaluation, particularly under varying environmental conditions. This study introduces a systematic approach employing a prior map-assisted framework for generating dense six-degree-of-freedom (6-DoF) GT poses for the first time, enhancing the fidelity of both indoor and outdoor SLAM…
▽ More
Accurately generating ground truth (GT) trajectories is essential for Simultaneous Localization and Map** (SLAM) evaluation, particularly under varying environmental conditions. This study introduces a systematic approach employing a prior map-assisted framework for generating dense six-degree-of-freedom (6-DoF) GT poses for the first time, enhancing the fidelity of both indoor and outdoor SLAM datasets. Our method excels in handling degenerate and stationary conditions frequently encountered in SLAM datasets, thereby increasing robustness and precision. A significant aspect of our approach is the detailed derivation of covariances within the factor graph, enabling an in-depth analysis of pose uncertainty propagation. This analysis crucially contributes to demonstrating specific pose uncertainties and enhancing trajectory reliability from both theoretical and empirical perspectives. Additionally, we provide an open-source toolbox (https://github.com/JokerJohn/Cloud_Map_Evaluation) for map evaluation criteria, facilitating the indirect assessment of overall trajectory precision. Experimental results show at least a 30\% improvement in map accuracy and a 20\% increase in direct trajectory accuracy compared to the Iterative Closest Point (ICP) \cite{sharp2002icp} algorithm across diverse campus environments, with substantially enhanced robustness. Our open-source solution (https://github.com/JokerJohn/PALoc), extensively applied in the FusionPortable\cite{Jiao2022Mar} dataset, is geared towards SLAM benchmark dataset augmentation and represents a significant advancement in SLAM evaluations.
△ Less
Submitted 6 February, 2024; v1 submitted 31 January, 2024;
originally announced January 2024.
-
DeFlow: Decoder of Scene Flow Network in Autonomous Driving
Authors:
Qingwen Zhang,
Yi Yang,
Heng Fang,
Ruoyu Geng,
Patric Jensfelt
Abstract:
Scene flow estimation determines a scene's 3D motion field, by predicting the motion of points in the scene, especially for aiding tasks in autonomous driving. Many networks with large-scale point clouds as input use voxelization to create a pseudo-image for real-time running. However, the voxelization process often results in the loss of point-specific features. This gives rise to a challenge in…
▽ More
Scene flow estimation determines a scene's 3D motion field, by predicting the motion of points in the scene, especially for aiding tasks in autonomous driving. Many networks with large-scale point clouds as input use voxelization to create a pseudo-image for real-time running. However, the voxelization process often results in the loss of point-specific features. This gives rise to a challenge in recovering those features for scene flow tasks. Our paper introduces DeFlow which enables a transition from voxel-based features to point features using Gated Recurrent Unit (GRU) refinement. To further enhance scene flow estimation performance, we formulate a novel loss function that accounts for the data imbalance between static and dynamic points. Evaluations on the Argoverse 2 scene flow task reveal that DeFlow achieves state-of-the-art results on large-scale point cloud data, demonstrating that our network has better performance and efficiency compared to others. The code is open-sourced at https://github.com/KTH-RPL/deflow.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Unifying Structured Data as Graph for Data-to-Text Pre-Training
Authors:
Shujie Li,
Liang Li,
Ruiying Geng,
Min Yang,
Binhua Li,
Guanghu Yuan,
Wanwei He,
Shao Yuan,
Can Ma,
Fei Huang,
Yongbin Li
Abstract:
Data-to-text (D2T) generation aims to transform structured data into natural language text. Data-to-text pre-training has proved to be powerful in enhancing D2T generation and yields impressive performances. However, previous pre-training methods either oversimplified structured data into a sequence without considering input structures or designed training objectives tailored for a specific data s…
▽ More
Data-to-text (D2T) generation aims to transform structured data into natural language text. Data-to-text pre-training has proved to be powerful in enhancing D2T generation and yields impressive performances. However, previous pre-training methods either oversimplified structured data into a sequence without considering input structures or designed training objectives tailored for a specific data structure (e.g., table or knowledge graph). In this paper, we unify different types of structured data (i.e., table, key-value data, knowledge graph) into the graph format and cast different data-to-text generation tasks as graph-to-text generation. To effectively exploit the structural information of the input graph, we propose a structure-enhanced pre-training method for D2T generation by designing a structure-enhanced Transformer. Concretely, we devise a position matrix for the Transformer, encoding relative positional information of connected nodes in the input graph. In addition, we propose a new attention matrix to incorporate graph structures into the original Transformer by taking the available explicit connectivity structure into account. Extensive experiments on six benchmark datasets show the effectiveness of our model. Our source codes are available at https://github.com/AlibabaResearch/DAMO-ConvAI/tree/main/unid2t.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
Passive Non-Line-of-Sight Imaging with Light Transport Modulation
Authors:
Jiarui Zhang,
Ruixu Geng,
Xiaolong Du,
Yan Chen,
Houqiang Li,
Yang Hu
Abstract:
Passive non-line-of-sight (NLOS) imaging has witnessed rapid development in recent years, due to its ability to image objects that are out of sight. The light transport condition plays an important role in this task since changing the conditions will lead to different imaging models. Existing learning-based NLOS methods usually train independent models for different light transport conditions, whi…
▽ More
Passive non-line-of-sight (NLOS) imaging has witnessed rapid development in recent years, due to its ability to image objects that are out of sight. The light transport condition plays an important role in this task since changing the conditions will lead to different imaging models. Existing learning-based NLOS methods usually train independent models for different light transport conditions, which is computationally inefficient and impairs the practicality of the models. In this work, we propose NLOS-LTM, a novel passive NLOS imaging method that effectively handles multiple light transport conditions with a single network. We achieve this by inferring a latent light transport representation from the projection image and using this representation to modulate the network that reconstructs the hidden image from the projection image. We train a light transport encoder together with a vector quantizer to obtain the light transport representation. To further regulate this representation, we jointly learn both the reconstruction network and the reprojection network during training. A set of light transport modulation blocks is used to modulate the two jointly trained networks in a multi-scale way. Extensive experiments on a large-scale passive NLOS dataset demonstrate the superiority of the proposed method. The code is available at https://github.com/JerryOctopus/NLOS-LTM.
△ Less
Submitted 26 March, 2024; v1 submitted 26 December, 2023;
originally announced December 2023.
-
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
Authors:
Yupei Liu,
Yuqi Jia,
Runpeng Geng,
**yuan Jia,
Neil Zhenqiang Gong
Abstract:
A prompt injection attack aims to inject malicious instruction/data into the input of an LLM-Integrated Application such that it produces results as an attacker desires. Existing works are limited to case studies. As a result, the literature lacks a systematic understanding of prompt injection attacks and their defenses. We aim to bridge the gap in this work. In particular, we propose a framework…
▽ More
A prompt injection attack aims to inject malicious instruction/data into the input of an LLM-Integrated Application such that it produces results as an attacker desires. Existing works are limited to case studies. As a result, the literature lacks a systematic understanding of prompt injection attacks and their defenses. We aim to bridge the gap in this work. In particular, we propose a framework to formalize prompt injection attacks. Existing attacks are special cases in our framework. Moreover, based on our framework, we design a new attack by combining existing ones. Using our framework, we conduct a systematic evaluation on 5 prompt injection attacks and 10 defenses with 10 LLMs and 7 tasks. Our work provides a common benchmark for quantitatively evaluating future prompt injection attacks and defenses. To facilitate research on this topic, we make our platform public at https://github.com/liu00222/Open-Prompt-Injection.
△ Less
Submitted 1 June, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
DREAM-PCD: Deep Reconstruction and Enhancement of mmWave Radar Pointcloud
Authors:
Ruixu Geng,
Yadong Li,
Dongheng Zhang,
**cheng Wu,
Yating Gao,
Yang Hu,
Yan Chen
Abstract:
Millimeter-wave (mmWave) radar pointcloud offers attractive potential for 3D sensing, thanks to its robustness in challenging conditions such as smoke and low illumination. However, existing methods failed to simultaneously address the three main challenges in mmWave radar pointcloud reconstruction: specular information lost, low angular resolution, and strong interference and noise. In this paper…
▽ More
Millimeter-wave (mmWave) radar pointcloud offers attractive potential for 3D sensing, thanks to its robustness in challenging conditions such as smoke and low illumination. However, existing methods failed to simultaneously address the three main challenges in mmWave radar pointcloud reconstruction: specular information lost, low angular resolution, and strong interference and noise. In this paper, we propose DREAM-PCD, a novel framework that combines signal processing and deep learning methods into three well-designed components to tackle all three challenges: Non-Coherent Accumulation for dense points, Synthetic Aperture Accumulation for improved angular resolution, and Real-Denoise Multiframe network for noise and interference removal. Moreover, the causal multiframe and "real-denoise" mechanisms in DREAM-PCD significantly enhance the generalization performance. We also introduce RadarEyes, the largest mmWave indoor dataset with over 1,000,000 frames, featuring a unique design incorporating two orthogonal single-chip radars, lidar, and camera, enriching dataset diversity and applications. Experimental results demonstrate that DREAM-PCD surpasses existing methods in reconstruction quality, and exhibits superior generalization and real-time capabilities, enabling high-quality real-time reconstruction of radar pointcloud under various parameters and scenarios. We believe that DREAM-PCD, along with the RadarEyes dataset, will significantly advance mmWave radar perception in future real-world applications.
△ Less
Submitted 26 September, 2023;
originally announced September 2023.
-
A Dynamic Points Removal Benchmark in Point Cloud Maps
Authors:
Qingwen Zhang,
Daniel Duberg,
Ruoyu Geng,
Mingkai Jia,
Lujia Wang,
Patric Jensfelt
Abstract:
In the field of robotics, the point cloud has become an essential map representation. From the perspective of downstream tasks like localization and global path planning, points corresponding to dynamic objects will adversely affect their performance. Existing methods for removing dynamic points in point clouds often lack clarity in comparative evaluations and comprehensive analysis. Therefore, we…
▽ More
In the field of robotics, the point cloud has become an essential map representation. From the perspective of downstream tasks like localization and global path planning, points corresponding to dynamic objects will adversely affect their performance. Existing methods for removing dynamic points in point clouds often lack clarity in comparative evaluations and comprehensive analysis. Therefore, we propose an easy-to-extend unified benchmarking framework for evaluating techniques for removing dynamic points in maps. It includes refactored state-of-art methods and novel metrics to analyze the limitations of these approaches. This enables researchers to dive deep into the underlying reasons behind these limitations. The benchmark makes use of several datasets with different sensor types. All the code and datasets related to our study are publicly available for further development and utilization.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
CATS: A Pragmatic Chinese Answer-to-Sequence Dataset with Large Scale and High Quality
Authors:
Liang Li,
Ruiying Geng,
Chengyang Fang,
Bing Li,
Can Ma,
Rongyu Cao,
Binhua Li,
Fei Huang,
Yongbin Li
Abstract:
There are three problems existing in the popular data-to-text datasets. First, the large-scale datasets either contain noise or lack real application scenarios. Second, the datasets close to real applications are relatively small in size. Last, current datasets bias in the English language while leaving other languages underexplored. To alleviate these limitations, in this paper, we present CATS,…
▽ More
There are three problems existing in the popular data-to-text datasets. First, the large-scale datasets either contain noise or lack real application scenarios. Second, the datasets close to real applications are relatively small in size. Last, current datasets bias in the English language while leaving other languages underexplored. To alleviate these limitations, in this paper, we present CATS, a pragmatic Chinese answer-to-sequence dataset with large scale and high quality. The dataset aims to generate textual descriptions for the answer in the practical TableQA system. Further, to bridge the structural gap between the input SQL and table and establish better semantic alignments, we propose a Unified Graph Transformation approach to establish a joint encoding space for the two hybrid knowledge resources and convert this task to a graph-to-text problem. The experiment results demonstrate the effectiveness of our proposed method. Further analysis on CATS attests to both the high quality and challenges of the dataset.
△ Less
Submitted 20 June, 2023;
originally announced June 2023.
-
Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation
Authors:
Weihao Zeng,
Lulu Zhao,
Keqing He,
Ruotong Geng,
**gang Wang,
Wei Wu,
Weiran Xu
Abstract:
Existing controllable dialogue generation work focuses on the single-attribute control and lacks generalization capability to out-of-distribution multiple attribute combinations. In this paper, we explore the compositional generalization for multi-attribute controllable dialogue generation where a model can learn from seen attribute values and generalize to unseen combinations. We propose a prompt…
▽ More
Existing controllable dialogue generation work focuses on the single-attribute control and lacks generalization capability to out-of-distribution multiple attribute combinations. In this paper, we explore the compositional generalization for multi-attribute controllable dialogue generation where a model can learn from seen attribute values and generalize to unseen combinations. We propose a prompt-based disentangled controllable dialogue generation model, DCG. It learns attribute concept composition by generating attribute-oriented prompt vectors and uses a disentanglement loss to disentangle different attributes for better generalization. Besides, we design a unified reference-free evaluation framework for multiple attributes with different levels of granularities. Experiment results on two benchmarks prove the effectiveness of our method and the evaluation metric.
△ Less
Submitted 17 June, 2023;
originally announced June 2023.
-
PALoc: Robust Prior-assisted Trajectory Generation for Benchmarking
Authors:
Xiangcheng Hu,
** Wu,
Jianhao Jiao,
Ruoyu Geng,
Ming Liu
Abstract:
Evaluating simultaneous localization and map** (SLAM) algorithms necessitates high-precision and dense ground truth (GT) trajectories. But obtaining desirable GT trajectories is sometimes challenging without GT tracking sensors. As an alternative, in this paper, we propose a novel prior-assisted SLAM system to generate a full six-degree-of-freedom ($6$-DOF) trajectory at around $10$Hz for benchm…
▽ More
Evaluating simultaneous localization and map** (SLAM) algorithms necessitates high-precision and dense ground truth (GT) trajectories. But obtaining desirable GT trajectories is sometimes challenging without GT tracking sensors. As an alternative, in this paper, we propose a novel prior-assisted SLAM system to generate a full six-degree-of-freedom ($6$-DOF) trajectory at around $10$Hz for benchmarking under the framework of the factor graph. Our degeneracy-aware map factor utilizes a prior point cloud map and LiDAR frame for point-to-plane optimization, simultaneously detecting degeneration cases to reduce drift and enhancing the consistency of pose estimation. Our system is seamlessly integrated with cutting-edge odometry via a loosely coupled scheme to generate high-rate and precise trajectories. Moreover, we propose a norm-constrained gravity factor for stationary cases, optimizing pose and gravity to boost performance. Extensive evaluations demonstrate our algorithm's superiority over existing SLAM or map-based methods in diverse scenarios in terms of precision, smoothness, and robustness. Our approach substantially advances reliable and accurate SLAM evaluation methods, fostering progress in robotics research.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Authors:
**yang Li,
Binyuan Hui,
Ge Qu,
Jiaxi Yang,
Binhua Li,
Bowen Li,
Bailin Wang,
Bowen Qin,
Rongyu Cao,
Ruiying Geng,
Nan Huo,
Xuanhe Zhou,
Chenhao Ma,
Guoliang Li,
Kevin C. C. Chang,
Fei Huang,
Reynold Cheng,
Yongbin Li
Abstract:
Text-to-SQL parsing, which aims at converting natural language instructions into executable SQLs, has gained increasing attention in recent years. In particular, Codex and ChatGPT have shown impressive results in this task. However, most of the prevalent benchmarks, i.e., Spider, and WikiSQL, focus on database schema with few rows of database contents leaving the gap between academic study and rea…
▽ More
Text-to-SQL parsing, which aims at converting natural language instructions into executable SQLs, has gained increasing attention in recent years. In particular, Codex and ChatGPT have shown impressive results in this task. However, most of the prevalent benchmarks, i.e., Spider, and WikiSQL, focus on database schema with few rows of database contents leaving the gap between academic study and real-world applications. To mitigate this gap, we present Bird, a big benchmark for large-scale database grounded in text-to-SQL tasks, containing 12,751 pairs of text-to-SQL data and 95 databases with a total size of 33.4 GB, spanning 37 professional domains. Our emphasis on database values highlights the new challenges of dirty database contents, external knowledge between NL questions and database contents, and SQL efficiency, particularly in the context of massive databases. To solve these problems, text-to-SQL models must feature database value comprehension in addition to semantic parsing. The experimental results demonstrate the significance of database values in generating accurate text-to-SQLs for big databases. Furthermore, even the most effective text-to-SQL models, i.e. ChatGPT, only achieves 40.08% in execution accuracy, which is still far from the human result of 92.96%, proving that challenges still stand. Besides, we also provide an efficiency analysis to offer insights into generating text-to-efficient-SQLs that are beneficial to industries. We believe that BIRD will contribute to advancing real-world applications of text-to-SQL research. The leaderboard and source code are available: https://bird-bench.github.io/.
△ Less
Submitted 14 November, 2023; v1 submitted 4 May, 2023;
originally announced May 2023.
-
CSP-free adaptive Kriging surrogate model method for reliability analysis with small failure probability
Authors:
Wenxiong Li,
Rong Geng,
Suiyin Chen
Abstract:
In the field of reliability engineering, the Active learning reliability method combining Kriging and Monte Carlo Simulation (AK-MCS) has been developed and demonstrated to be effective in reliability analysis. However, the performance of AK-MCS is sensitive to the size of Candidate Sample Pool (CSP), particularly for systems with small failure probabilities. To address the limitations of conventi…
▽ More
In the field of reliability engineering, the Active learning reliability method combining Kriging and Monte Carlo Simulation (AK-MCS) has been developed and demonstrated to be effective in reliability analysis. However, the performance of AK-MCS is sensitive to the size of Candidate Sample Pool (CSP), particularly for systems with small failure probabilities. To address the limitations of conventional AK-MCS that relies on CSP, this paper proposes a CSP-free AK-MCS. The proposed methodology consists of two stages: surrogate model construction and Monte Carlo simulation for estimating the failure probability. In the stage of surrogate model construction, the surrogate model is iteratively refined based on the representative samples selected by solving the optimization problem facilitated by Particle Swarm Optimization (PSO) algorithm. To achieve an optimal balance between solution accuracy and efficiency, the penalty intensity control and the density control for the experimental design points are introduced to modify the objective function in optimization. The performance of the proposed methodology is evaluated using numerical examples, and results indicate that by leveraging an optimization algorithm to select representative samples, the proposed CSP-free AK-MCS overcomes the limitations of conventional CSP-based AK-MCS and exhibits exceptional performance in addressing small failure probabilities.
△ Less
Submitted 1 August, 2023; v1 submitted 14 April, 2023;
originally announced April 2023.
-
Plan-then-Seam: Towards Efficient Table-to-Text Generation
Authors:
Liang Li,
Ruiying Geng,
Chengyang Fang,
Bing Li,
Can Ma,
Binhua Li,
Yongbin Li
Abstract:
Table-to-text generation aims at automatically generating text to help people conveniently obtain salient information in tables. Recent works explicitly decompose the generation process into content planning and surface generation stages, employing two autoregressive networks for them respectively. However, they are computationally expensive due to the non-parallelizable nature of autoregressive d…
▽ More
Table-to-text generation aims at automatically generating text to help people conveniently obtain salient information in tables. Recent works explicitly decompose the generation process into content planning and surface generation stages, employing two autoregressive networks for them respectively. However, they are computationally expensive due to the non-parallelizable nature of autoregressive decoding and the redundant parameters of two networks. In this paper, we propose the first totally non-autoregressive table-to-text model (Plan-then-Seam, PTS) that produces its outputs in parallel with one single network. PTS firstly writes and calibrates one plan of the content to be generated with a novel rethinking pointer predictor, and then takes the plan as the context for seaming to decode the description. These two steps share parameters and perform iteratively to capture token inter-dependency while kee** parallel decoding. Experiments on two public benchmarks show that PTS achieves 3.0~5.6 times speedup for inference time, reducing 50% parameters, while maintaining as least comparable performance against strong two-stage table-to-text competitors.
△ Less
Submitted 28 February, 2023; v1 submitted 10 February, 2023;
originally announced February 2023.
-
Families of Perfect Tensors
Authors:
Runshi Geng
Abstract:
Perfect tensors are the tensors corresponding to the absolutely maximally entangled states, a special type of quantum states of interest in quantum information theory. We establish a method to compute parameterized families of perfect tensors in $(\mathbb{C}^d)^{\otimes 4}$ using exponential maps from Lie theory. With this method, we find explicit examples of non-classical perfect tensors in…
▽ More
Perfect tensors are the tensors corresponding to the absolutely maximally entangled states, a special type of quantum states of interest in quantum information theory. We establish a method to compute parameterized families of perfect tensors in $(\mathbb{C}^d)^{\otimes 4}$ using exponential maps from Lie theory. With this method, we find explicit examples of non-classical perfect tensors in $(\mathbb{C}^3)^{\otimes 4}$. In particular, we answer an open question posted by Życzkowski et al.
△ Less
Submitted 6 December, 2022; v1 submitted 28 November, 2022;
originally announced November 2022.
-
Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems
Authors:
Weihao Zeng,
Keqing He,
Zechen Wang,
Dayuan Fu,
Guanting Dong,
Ruotong Geng,
Pei Wang,
**gang Wang,
Chaobo Sun,
Wei Wu,
Weiran Xu
Abstract:
Recent advances in neural approaches greatly improve task-oriented dialogue (TOD) systems which assist users to accomplish their goals. However, such systems rely on costly manually labeled dialogs which are not available in practical scenarios. In this paper, we present our models for Track 2 of the SereTOD 2022 challenge, which is the first challenge of building semi-supervised and reinforced TO…
▽ More
Recent advances in neural approaches greatly improve task-oriented dialogue (TOD) systems which assist users to accomplish their goals. However, such systems rely on costly manually labeled dialogs which are not available in practical scenarios. In this paper, we present our models for Track 2 of the SereTOD 2022 challenge, which is the first challenge of building semi-supervised and reinforced TOD systems on a large-scale real-world Chinese TOD dataset MobileCS. We build a knowledge-grounded dialog model to formulate dialog history and local KB as input and predict the system response. And we perform semi-supervised pre-training both on the labeled and unlabeled data. Our system achieves the first place both in the automatic evaluation and human interaction, especially with higher BLEU (+7.64) and Success (+13.6\%) than the second place.
△ Less
Submitted 23 December, 2022; v1 submitted 17 October, 2022;
originally announced October 2022.
-
Graph-to-Text Generation with Dynamic Structure Pruning
Authors:
Liang Li,
Ruiying Geng,
Bowen Li,
Can Ma,
Yinliang Yue,
Binhua Li,
Yongbin Li
Abstract:
Most graph-to-text works are built on the encoder-decoder framework with cross-attention mechanism. Recent studies have shown that explicitly modeling the input graph structure can significantly improve the performance. However, the vanilla structural encoder cannot capture all specialized information in a single forward pass for all decoding steps, resulting in inaccurate semantic representations…
▽ More
Most graph-to-text works are built on the encoder-decoder framework with cross-attention mechanism. Recent studies have shown that explicitly modeling the input graph structure can significantly improve the performance. However, the vanilla structural encoder cannot capture all specialized information in a single forward pass for all decoding steps, resulting in inaccurate semantic representations. Meanwhile, the input graph is flatted as an unordered sequence in the cross attention, ignoring the original graph structure. As a result, the obtained input graph context vector in the decoder may be flawed. To address these issues, we propose a Structure-Aware Cross-Attention (SACA) mechanism to re-encode the input graph representation conditioning on the newly generated context at each decoding step in a structure aware manner. We further adapt SACA and introduce its variant Dynamic Graph Pruning (DGP) mechanism to dynamically drop irrelevant nodes in the decoding process. We achieve new state-of-the-art results on two graph-to-text datasets, LDC2020T02 and ENT-DESC, with only minor increase on computational cost.
△ Less
Submitted 15 September, 2022;
originally announced September 2022.
-
A Survey on Text-to-SQL Parsing: Concepts, Methods, and Future Directions
Authors:
Bowen Qin,
Binyuan Hui,
Lihan Wang,
Min Yang,
**yang Li,
Binhua Li,
Ruiying Geng,
Rongyu Cao,
Jian Sun,
Luo Si,
Fei Huang,
Yongbin Li
Abstract:
Text-to-SQL parsing is an essential and challenging task. The goal of text-to-SQL parsing is to convert a natural language (NL) question to its corresponding structured query language (SQL) based on the evidences provided by relational databases. Early text-to-SQL parsing systems from the database community achieved a noticeable progress with the cost of heavy human engineering and user interactio…
▽ More
Text-to-SQL parsing is an essential and challenging task. The goal of text-to-SQL parsing is to convert a natural language (NL) question to its corresponding structured query language (SQL) based on the evidences provided by relational databases. Early text-to-SQL parsing systems from the database community achieved a noticeable progress with the cost of heavy human engineering and user interactions with the systems. In recent years, deep neural networks have significantly advanced this task by neural generation models, which automatically learn a map** function from an input NL question to an output SQL query. Subsequently, the large pre-trained language models have taken the state-of-the-art of the text-to-SQL parsing task to a new level. In this survey, we present a comprehensive review on deep learning approaches for text-to-SQL parsing. First, we introduce the text-to-SQL parsing corpora which can be categorized as single-turn and multi-turn. Second, we provide a systematical overview of pre-trained language models and existing methods for text-to-SQL parsing. Third, we present readers with the challenges faced by text-to-SQL parsing and explore some potential future directions in this field.
△ Less
Submitted 29 August, 2022;
originally announced August 2022.
-
RF Accelerator Technology R&D: Report of AF7-rf Topical Group to Snowmass 2021
Authors:
Sergey Belomestnykh,
Emilio A. Nanni,
Hans Weise,
Sergey V. Baryshev,
Pashupati Dhakal,
Rongli Geng,
Bianca Giaccone,
Chunguang **g,
Matthias Liepe,
Xueying Lu,
Tianhuan Luo,
Ganapati Myneni,
Alireza Nassiri,
David Neuffer,
Cho-Kuen Ng,
Sam Posen,
Sami Tantawi,
Anne-Marie Valente-Feliciano,
Jean-Luc Vay,
Brandon Weatherford,
Akira Yamamoto
Abstract:
Accelerator radio frequency (RF) technology has been and remains critical for modern high energy physics (HEP) experiments based on particle accelerators. Tremendous progress in advancing this technology has been achieved over the past decade in several areas highlighted in this report. These achievements and new results expected from continued R&D efforts could pave the way for upgrades of existi…
▽ More
Accelerator radio frequency (RF) technology has been and remains critical for modern high energy physics (HEP) experiments based on particle accelerators. Tremendous progress in advancing this technology has been achieved over the past decade in several areas highlighted in this report. These achievements and new results expected from continued R&D efforts could pave the way for upgrades of existing facilities, improvements to accelerators already under construction (e.g., PIP-II), well-developed proposals (e.g., ILC, CLIC), and/or enable concepts under development, such as FCC-ee, CEPC, C3, HELEN, multi-MW Fermilab Proton Intensity Upgrade, future Muon Colloder, etc. Advances in RF technology have impact beyond HEP on accelerators built for nuclear physics, basic energy sciences, and other areas. Recent examples of such accelerators are European XFEL, LCLS-II and LCLS-II-HE, SHINE, SNS, ESS, FRIB, and EIC. To support and enable new accelerator-based applications and even make some of them feasible, we must continue addressing their challenges via a comprehensive RF R&D program that would advance the existing RF technologies and explore the nascent ones.
△ Less
Submitted 25 August, 2022;
originally announced August 2022.
-
FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Map** Accuracy on Diverse Platforms
Authors:
Jianhao Jiao,
Hexiang Wei,
Tianshuai Hu,
Xiangcheng Hu,
Yilong Zhu,
Zhijian He,
** Wu,
**gwen Yu,
Xupeng Xie,
Huaiyang Huang,
Ruoyu Geng,
Lujia Wang,
Ming Liu
Abstract:
Combining multiple sensors enables a robot to maximize its perceptual awareness of environments and enhance its robustness to external disturbance, crucial to robotic navigation. This paper proposes the FusionPortable benchmark, a complete multi-sensor dataset with a diverse set of sequences for mobile robots. This paper presents three contributions. We first advance a portable and versatile multi…
▽ More
Combining multiple sensors enables a robot to maximize its perceptual awareness of environments and enhance its robustness to external disturbance, crucial to robotic navigation. This paper proposes the FusionPortable benchmark, a complete multi-sensor dataset with a diverse set of sequences for mobile robots. This paper presents three contributions. We first advance a portable and versatile multi-sensor suite that offers rich sensory measurements: 10Hz LiDAR point clouds, 20Hz stereo frame images, high-rate and asynchronous events from stereo event cameras, 200Hz inertial readings from an IMU, and 10Hz GPS signal. Sensors are already temporally synchronized in hardware. This device is lightweight, self-contained, and has plug-and-play support for mobile robots. Second, we construct a dataset by collecting 17 sequences that cover a variety of environments on the campus by exploiting multiple robot platforms for data collection. Some sequences are challenging to existing SLAM algorithms. Third, we provide ground truth for the decouple localization and map** performance evaluation. We additionally evaluate state-of-the-art SLAM approaches and identify their limitations. The dataset, consisting of raw sensor easurements, ground truth, calibration data, and evaluated algorithms, will be released: https://ram-lab.com/file/site/multi-sensor-dataset.
△ Less
Submitted 25 August, 2022;
originally announced August 2022.
-
Real-time Neural Dense Elevation Map** for Urban Terrain with Uncertainty Estimations
Authors:
Bowen Yang,
Qingwen Zhang,
Ruoyu Geng,
Lujia Wang,
Ming Liu
Abstract:
Having good knowledge of terrain information is essential for improving the performance of various downstream tasks on complex terrains, especially for the locomotion and navigation of legged robots. We present a novel framework for neural urban terrain reconstruction with uncertainty estimations. It generates dense robot-centric elevation maps online from sparse LiDAR observations. We design a no…
▽ More
Having good knowledge of terrain information is essential for improving the performance of various downstream tasks on complex terrains, especially for the locomotion and navigation of legged robots. We present a novel framework for neural urban terrain reconstruction with uncertainty estimations. It generates dense robot-centric elevation maps online from sparse LiDAR observations. We design a novel pre-processing and point features representation approach that ensures high robustness and computational efficiency when integrating multiple point cloud frames. A Bayesian-GAN model then recovers the detailed terrain structures while simultaneously providing the pixel-wise reconstruction uncertainty. We evaluate the proposed pipeline through extensive simulation and real-world experiments. It demonstrates efficient terrain reconstruction with high quality and real-time performance on a mobile platform, which further benefits the downstream tasks of legged robots. (See https://kin-zhang.github.io/ndem/ for more details.)
△ Less
Submitted 12 March, 2024; v1 submitted 6 August, 2022;
originally announced August 2022.
-
Analysis of the Spatio-temporal Dynamics of COVID-19 in Massachusetts via Spectral Graph Wavelet Theory
Authors:
Ru Geng,
Yixian Gao,
Hongkun Zhang,
Jian Zu
Abstract:
The rapid spread of COVID-19 disease has had a significant impact on the world. In this paper, we study COVID-19 data interpretation and visualization using open-data sources for 351 cities and towns in Massachusetts from December 6, 2020 to September 25, 2021. Because cities are embedded in rather complex transportation networks, we construct the spatio-temporal dynamic graph model, in which the…
▽ More
The rapid spread of COVID-19 disease has had a significant impact on the world. In this paper, we study COVID-19 data interpretation and visualization using open-data sources for 351 cities and towns in Massachusetts from December 6, 2020 to September 25, 2021. Because cities are embedded in rather complex transportation networks, we construct the spatio-temporal dynamic graph model, in which the graph attention neural network is utilized as a deep learning method to learn the pandemic transition probability among major cities in Massachusetts. Using the spectral graph wavelet transform (SGWT), we process the COVID-19 data on the dynamic graph, which enables us to design effective tools to analyze and detect spatio-temporal patterns in the pandemic spreading. We design a new node classification method, which effectively identifies the anomaly cities based on spectral graph wavelet coefficients. It can assist administrations or public health organizations in monitoring the spread of the pandemic and develo** preventive measures. Unlike most work focusing on the evolution of confirmed cases over time, we focus on the spatio-temporal patterns of pandemic evolution among cities. Through the data analysis and visualization, a better understanding of the epidemiological development at the city level is obtained and can be helpful with city-specific surveillance.
△ Less
Submitted 28 July, 2022;
originally announced August 2022.
-
Sub-micron spin-based magnetic field imaging with an organic light emitting diode
Authors:
Rugang Geng,
Adrian Mena,
William J. Pappas,
Dane R. McCamey
Abstract:
Quantum sensing and imaging of magnetic fields has attracted broad interests due to its potential for high sensitivity and spatial resolution. Common systems used for quantum sensing require either optical excitation (e.g., nitrogen-vacancy centres in diamond, atomic vapor magnetometers), or cryogenic temperatures (e.g., SQUIDs, superconducting qubits), which pose challenges for chip-scale integra…
▽ More
Quantum sensing and imaging of magnetic fields has attracted broad interests due to its potential for high sensitivity and spatial resolution. Common systems used for quantum sensing require either optical excitation (e.g., nitrogen-vacancy centres in diamond, atomic vapor magnetometers), or cryogenic temperatures (e.g., SQUIDs, superconducting qubits), which pose challenges for chip-scale integration and commercial scalability. Here, we demonstrate an integrated organic light emitting diode (OLED) based quantum sensor for magnetic field imaging, which employs spatially resolved magnetic resonance to provide a robust map** of magnetic fields. By considering the monolithic OLED as an array of individual virtual sensors, we achieve sub-micron magnetic field map** with field sensitivity of ~160 $μ$T Hz$^{-1/2}$ um$^{-2}$. Our work demonstrates a chip-scale OLED-based laser free magnetic field sensor and an approach to magnetic field map** built on a commercially relevant and manufacturable technology.
△ Less
Submitted 6 July, 2022;
originally announced July 2022.
-
MMFN: Multi-Modal-Fusion-Net for End-to-End Driving
Authors:
Qingwen Zhang,
Mingkai Tang,
Ruoyu Geng,
Feiyi Chen,
Ren Xin,
Lujia Wang
Abstract:
Inspired by the fact that humans use diverse sensory organs to perceive the world, sensors with different modalities are deployed in end-to-end driving to obtain the global context of the 3D scene. In previous works, camera and LiDAR inputs are fused through transformers for better driving performance. These inputs are normally further interpreted as high-level map information to assist navigation…
▽ More
Inspired by the fact that humans use diverse sensory organs to perceive the world, sensors with different modalities are deployed in end-to-end driving to obtain the global context of the 3D scene. In previous works, camera and LiDAR inputs are fused through transformers for better driving performance. These inputs are normally further interpreted as high-level map information to assist navigation tasks. Nevertheless, extracting useful information from the complex map input is challenging, for redundant information may mislead the agent and negatively affect driving performance. We propose a novel approach to efficiently extract features from vectorized High-Definition (HD) maps and utilize them in the end-to-end driving tasks. In addition, we design a new expert to further enhance the model performance by considering multi-road rules. Experimental results prove that both of the proposed improvements enable our agent to achieve superior performance compared with other methods.
△ Less
Submitted 3 August, 2022; v1 submitted 30 June, 2022;
originally announced July 2022.
-
Higgs-Energy LEptoN (HELEN) Collider based on advanced superconducting radio frequency technology
Authors:
S. Belomestnykh,
P. C. Bhat,
A. Grassellino,
M. Checchin,
D. Denisov,
R. L. Geng,
S. **dariani,
M. Liepe,
M. Martinello,
P. Merkel,
S. Nagaitsev,
H. Padamsee,
S. Posen,
R. A. Rimmer,
A. Romanenko,
V. Shiltsev,
A. Valishev,
V. Yakovlev
Abstract:
This Snowmass 2021 contributed paper discusses a Higgs-Energy LEptoN (HELEN) $e^+e^-$ linear collider based on advances superconducting radio frequency technology. The proposed collider offers cost and AC power savings, smaller footprint (relative to the ILC), and could be built at Fermilab with an Interaction Region within the site boundaries. After the initial physics run at 250 GeV, the collide…
▽ More
This Snowmass 2021 contributed paper discusses a Higgs-Energy LEptoN (HELEN) $e^+e^-$ linear collider based on advances superconducting radio frequency technology. The proposed collider offers cost and AC power savings, smaller footprint (relative to the ILC), and could be built at Fermilab with an Interaction Region within the site boundaries. After the initial physics run at 250 GeV, the collider could be upgraded either to higher luminosity or to higher (up to 500 GeV) energies. If the ILC could not be realized in Japan in a timely fashion, the HELEN collider would be a viable option to build a Higgs factory in the U.S.
△ Less
Submitted 15 March, 2022;
originally announced March 2022.
-
The International Linear Collider: Report to Snowmass 2021
Authors:
Alexander Aryshev,
Ties Behnke,
Mikael Berggren,
James Brau,
Nathaniel Craig,
Ayres Freitas,
Frank Gaede,
Spencer Gessner,
Stefania Gori,
Christophe Grojean,
Sven Heinemeyer,
Daniel Jeans,
Katja Kruger,
Benno List,
Jenny List,
Zhen Liu,
Shinichiro Michizono,
David W. Miller,
Ian Moult,
Hitoshi Murayama,
Tatsuya Nakada,
Emilio Nanni,
Mihoko Nojiri,
Hasan Padamsee,
Maxim Perelstein
, et al. (487 additional authors not shown)
Abstract:
The International Linear Collider (ILC) is on the table now as a new global energy-frontier accelerator laboratory taking data in the 2030s. The ILC addresses key questions for our current understanding of particle physics. It is based on a proven accelerator technology. Its experiments will challenge the Standard Model of particle physics and will provide a new window to look beyond it. This docu…
▽ More
The International Linear Collider (ILC) is on the table now as a new global energy-frontier accelerator laboratory taking data in the 2030s. The ILC addresses key questions for our current understanding of particle physics. It is based on a proven accelerator technology. Its experiments will challenge the Standard Model of particle physics and will provide a new window to look beyond it. This document brings the story of the ILC up to date, emphasizing its strong physics motivation, its readiness for construction, and the opportunity it presents to the US and the global particle physics community.
△ Less
Submitted 16 January, 2023; v1 submitted 14 March, 2022;
originally announced March 2022.
-
S$^2$SQL: Injecting Syntax to Question-Schema Interaction Graph Encoder for Text-to-SQL Parsers
Authors:
Binyuan Hui,
Ruiying Geng,
Lihan Wang,
Bowen Qin,
Bowen Li,
Jian Sun,
Yongbin Li
Abstract:
The task of converting a natural language question into an executable SQL query, known as text-to-SQL, is an important branch of semantic parsing. The state-of-the-art graph-based encoder has been successfully used in this task but does not model the question syntax well. In this paper, we propose S$^2$SQL, injecting Syntax to question-Schema graph encoder for Text-to-SQL parsers, which effectivel…
▽ More
The task of converting a natural language question into an executable SQL query, known as text-to-SQL, is an important branch of semantic parsing. The state-of-the-art graph-based encoder has been successfully used in this task but does not model the question syntax well. In this paper, we propose S$^2$SQL, injecting Syntax to question-Schema graph encoder for Text-to-SQL parsers, which effectively leverages the syntactic dependency information of questions in text-to-SQL to improve the performance. We also employ the decoupling constraint to induce diverse relational edge embedding, which further improves the network's performance. Experiments on the Spider and robustness setting Spider-Syn demonstrate that the proposed approach outperforms all existing methods when pre-training models are used, resulting in a performance ranks first on the Spider leaderboard.
△ Less
Submitted 14 March, 2022;
originally announced March 2022.
-
Geometric Rank and Linear Determinantal Varieties
Authors:
Runshi Geng
Abstract:
There are close relations between tripartite tensors with bounded geometric ranks and linear determinantal varieties with bounded codimensions. We study linear determinantal varieties with bounded codimensions, and prove upper bounds of the dimensions of the ambient spaces. Using those results, we classify tensors with geometric rank 3, find upper bounds of multilinear ranks of primitive tensors w…
▽ More
There are close relations between tripartite tensors with bounded geometric ranks and linear determinantal varieties with bounded codimensions. We study linear determinantal varieties with bounded codimensions, and prove upper bounds of the dimensions of the ambient spaces. Using those results, we classify tensors with geometric rank 3, find upper bounds of multilinear ranks of primitive tensors with geometric rank 4, and prove the existence of such upper bounds in general. We extend results of tripartite tensors to n-part tensors, showing the equivalence between geometric rank 1 and partition rank 1.
△ Less
Submitted 27 November, 2022; v1 submitted 10 January, 2022;
originally announced January 2022.
-
Linking-Enhanced Pre-Training for Table Semantic Parsing
Authors:
Bowen Qin,
Lihan Wang,
Binyuan Hui,
Ruiying Geng,
Zheng Cao,
Min Yang,
Jian Sun,
Yongbin Li
Abstract:
Recently pre-training models have significantly improved the performance of various NLP tasks by leveraging large-scale text corpora to improve the contextual representation ability of the neural network. The large pre-training language model has also been applied in the area of table semantic parsing. However, existing pre-training approaches have not carefully explored explicit interaction relat…
▽ More
Recently pre-training models have significantly improved the performance of various NLP tasks by leveraging large-scale text corpora to improve the contextual representation ability of the neural network. The large pre-training language model has also been applied in the area of table semantic parsing. However, existing pre-training approaches have not carefully explored explicit interaction relationships between a question and the corresponding database schema, which is a key ingredient for uncovering their semantic and structural correspondence. Furthermore, the question-aware representation learning in the schema grounding context has received less attention in pre-training objective.To alleviate these issues, this paper designs two novel pre-training objectives to impose the desired inductive bias into the learned representations for table pre-training. We further propose a schema-aware curriculum learning approach to mitigate the impact of noise and learn effectively from the pre-training data in an easy-to-hard manner. We evaluate our pre-trained framework by fine-tuning it on two benchmarks, Spider and SQUALL. The results demonstrate the effectiveness of our pre-training objective and curriculum compared to a variety of baselines.
△ Less
Submitted 14 February, 2022; v1 submitted 17 November, 2021;
originally announced November 2021.
-
Recent Advances on Non-Line-of-Sight Imaging: Conventional Physical Models, Deep Learning, and New Scenes
Authors:
Ruixu Geng,
Yang Hu,
Yan Chen
Abstract:
As an emerging technology that has attracted huge attention, non-line-of-sight (NLOS) imaging can reconstruct hidden objects by analyzing the diffuse reflection on a relay surface, with broad application prospects in the fields of autonomous driving, medical imaging, and defense. Despite the challenges of low signal-to-noise ratio (SNR) and high ill-posedness, NLOS imaging has been developed rapid…
▽ More
As an emerging technology that has attracted huge attention, non-line-of-sight (NLOS) imaging can reconstruct hidden objects by analyzing the diffuse reflection on a relay surface, with broad application prospects in the fields of autonomous driving, medical imaging, and defense. Despite the challenges of low signal-to-noise ratio (SNR) and high ill-posedness, NLOS imaging has been developed rapidly in recent years. Most current NLOS imaging technologies use conventional physical models, constructing imaging models through active or passive illumination and using reconstruction algorithms to restore hidden scenes. Moreover, deep learning algorithms for NLOS imaging have also received much attention recently. This paper presents a comprehensive overview of both conventional and deep learning-based NLOS imaging techniques. Besides, we also survey new proposed NLOS scenes, and discuss the challenges and prospects of existing technologies. Such a survey can help readers have an overview of different types of NLOS imaging, thus expediting the development of seeing around corners.
△ Less
Submitted 21 November, 2021; v1 submitted 28 April, 2021;
originally announced April 2021.
-
Relational Learning with Gated and Attentive Neighbor Aggregator for Few-Shot Knowledge Graph Completion
Authors:
Guanglin Niu,
Yang Li,
Chengguang Tang,
Ruiying Geng,
Jian Dai,
Qiao Liu,
Hao Wang,
Jian Sun,
Fei Huang,
Luo Si
Abstract:
Aiming at expanding few-shot relations' coverage in knowledge graphs (KGs), few-shot knowledge graph completion (FKGC) has recently gained more research interests. Some existing models employ a few-shot relation's multi-hop neighbor information to enhance its semantic representation. However, noise neighbor information might be amplified when the neighborhood is excessively sparse and no neighbor…
▽ More
Aiming at expanding few-shot relations' coverage in knowledge graphs (KGs), few-shot knowledge graph completion (FKGC) has recently gained more research interests. Some existing models employ a few-shot relation's multi-hop neighbor information to enhance its semantic representation. However, noise neighbor information might be amplified when the neighborhood is excessively sparse and no neighbor is available to represent the few-shot relation. Moreover, modeling and inferring complex relations of one-to-many (1-N), many-to-one (N-1), and many-to-many (N-N) by previous knowledge graph completion approaches requires high model complexity and a large amount of training instances. Thus, inferring complex relations in the few-shot scenario is difficult for FKGC models due to limited training instances. In this paper, we propose a few-shot relational learning with global-local framework to address the above issues. At the global stage, a novel gated and attentive neighbor aggregator is built for accurately integrating the semantics of a few-shot relation's neighborhood, which helps filtering the noise neighbors even if a KG contains extremely sparse neighborhoods. For the local stage, a meta-learning based TransH (MTransH) method is designed to model complex relations and train our model in a few-shot learning fashion. Extensive experiments show that our model outperforms the state-of-the-art FKGC approaches on the frequently-used benchmark datasets NELL-One and Wiki-One. Compared with the strong baseline model MetaR, our model achieves 5-shot FKGC performance improvements of 8.0% on NELL-One and 2.8% on Wiki-One by the metric Hits@10.
△ Less
Submitted 4 June, 2021; v1 submitted 27 April, 2021;
originally announced April 2021.
-
Spatial Variation and Correlation of Spin Properties in Organic Light-Emitting Diodes
Authors:
William J. Pappas,
Rugang Geng,
Adrian Mena,
Alexander Baldacchino,
Amir Asadpoordarvish,
Dane R. McCamey
Abstract:
Devices which exploit the quantum properties of materials are widespread, with quantum information processors and quantum sensors showing significant progress. Organic devices offer interesting opportunities for quantum technologies owing to their engineerable spin properties, with spintronic operation and spin resonance magnetic-field sensing demonstrated in research grade devices, as well as pro…
▽ More
Devices which exploit the quantum properties of materials are widespread, with quantum information processors and quantum sensors showing significant progress. Organic devices offer interesting opportunities for quantum technologies owing to their engineerable spin properties, with spintronic operation and spin resonance magnetic-field sensing demonstrated in research grade devices, as well as proven compatibility with large scale fabrication techniques. Yet several important challenges remain as we move toward scaling these proof-of-principle quantum devices to larger integrated logic systems or spatially smaller sensing elements, particularly those associated with the variation of quantum properties both within and between devices. Here, spatially resolved magnetoluminescence is used to provide the first two-dimensional map of a spin property - the Overhauser field - in an organic light-emitting diode. We find intra-device variabilities exceeding 20% while spatially correlated behaviour is exhibited on lengths beyond $7 \, \mathsfμm$, similar in size to pixels in state-of-the-art AMOLED arrays, which has implications for the reproducibility and integration of organic quantum devices.
△ Less
Submitted 1 April, 2021;
originally announced April 2021.
-
Improving Text-to-SQL with Schema Dependency Learning
Authors:
Binyuan Hui,
Xiang Shi,
Ruiying Geng,
Binhua Li,
Yongbin Li,
Jian Sun,
Xiaodan Zhu
Abstract:
Text-to-SQL aims to map natural language questions to SQL queries. The sketch-based method combined with execution-guided (EG) decoding strategy has shown a strong performance on the WikiSQL benchmark. However, execution-guided decoding relies on database execution, which significantly slows down the inference process and is hence unsatisfactory for many real-world applications. In this paper, we…
▽ More
Text-to-SQL aims to map natural language questions to SQL queries. The sketch-based method combined with execution-guided (EG) decoding strategy has shown a strong performance on the WikiSQL benchmark. However, execution-guided decoding relies on database execution, which significantly slows down the inference process and is hence unsatisfactory for many real-world applications. In this paper, we present the Schema Dependency guided multi-task Text-to-SQL model (SDSQL) to guide the network to effectively capture the interactions between questions and schemas. The proposed model outperforms all existing methods in both the settings with or without EG. We show the schema dependency learning partially cover the benefit from EG and alleviates the need for it. SDSQL without EG significantly reduces time consumption during inference, sacrificing only a small amount of performance and provides more flexibility for downstream applications.
△ Less
Submitted 10 December, 2021; v1 submitted 7 March, 2021;
originally announced March 2021.
-
Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing
Authors:
Binyuan Hui,
Ruiying Geng,
Qiyu Ren,
Binhua Li,
Yongbin Li,
Jian Sun,
Fei Huang,
Luo Si,
Pengfei Zhu,
Xiaodan Zhu
Abstract:
Semantic parsing has long been a fundamental problem in natural language processing. Recently, cross-domain context-dependent semantic parsing has become a new focus of research. Central to the problem is the challenge of leveraging contextual information of both natural language utterance and database schemas in the interaction history. In this paper, we present a dynamic graph framework that is…
▽ More
Semantic parsing has long been a fundamental problem in natural language processing. Recently, cross-domain context-dependent semantic parsing has become a new focus of research. Central to the problem is the challenge of leveraging contextual information of both natural language utterance and database schemas in the interaction history. In this paper, we present a dynamic graph framework that is capable of effectively modelling contextual utterances, tokens, database schemas, and their complicated interaction as the conversation proceeds. The framework employs a dynamic memory decay mechanism that incorporates inductive bias to integrate enriched contextual relation representation, which is further enhanced with a powerful reranking model. At the time of writing, we demonstrate that the proposed framework outperforms all existing models by large margins, achieving new state-of-the-art performance on two large-scale benchmarks, the SParC and CoSQL datasets. Specifically, the model attains a 55.8% question-match and 30.8% interaction-match accuracy on SParC, and a 46.8% question-match and 17.0% interaction-match accuracy on CoSQL.
△ Less
Submitted 5 January, 2021;
originally announced January 2021.
-
On the geometry of geometric rank
Authors:
Runshi Geng,
J. M. Landsberg
Abstract:
We make a geometric study of the Geometric Rank of tensors recently introduced by Kopparty et al. Results include classification of tensors with degenerate geometric rank in $C^3\otimes C^3\otimes C^3$, classification of tensors with geometric rank two, and showing that upper bounds on geometric rank imply lower bounds on tensor rank.
We make a geometric study of the Geometric Rank of tensors recently introduced by Kopparty et al. Results include classification of tensors with degenerate geometric rank in $C^3\otimes C^3\otimes C^3$, classification of tensors with geometric rank two, and showing that upper bounds on geometric rank imply lower bounds on tensor rank.
△ Less
Submitted 25 June, 2021; v1 submitted 8 December, 2020;
originally announced December 2020.
-
An improved helmet detection method for YOLOv3 on an unbalanced dataset
Authors:
Rui Geng,
Yixuan Ma,
Wanhong Huang
Abstract:
The YOLOv3 target detection algorithm is widely used in industry due to its high speed and high accuracy, but it has some limitations, such as the accuracy degradation of unbalanced datasets. The YOLOv3 target detection algorithm is based on a Gaussian fuzzy data augmentation approach to pre-process the data set and improve the YOLOv3 target detection algorithm. Through the efficient pre-processin…
▽ More
The YOLOv3 target detection algorithm is widely used in industry due to its high speed and high accuracy, but it has some limitations, such as the accuracy degradation of unbalanced datasets. The YOLOv3 target detection algorithm is based on a Gaussian fuzzy data augmentation approach to pre-process the data set and improve the YOLOv3 target detection algorithm. Through the efficient pre-processing, the confidence level of YOLOv3 is generally improved by 0.01-0.02 without changing the recognition speed of YOLOv3, and the processed images also perform better in image localization due to effective feature fusion, which is more in line with the requirement of recognition speed and accuracy in production.
△ Less
Submitted 30 November, 2020; v1 submitted 9 November, 2020;
originally announced November 2020.
-
Approaches of large-scale images recognition with more than 50,000 categoris
Authors:
Wanhong Huang,
Rui Geng
Abstract:
Though current CV models have been able to achieve high levels of accuracy on small-scale images classification dataset with hundreds or thousands of categories, many models become infeasible in computational or space consumption when it comes to large-scale dataset with more than 50,000 categories. In this paper, we provide a viable solution for classifying large-scale species datasets using trad…
▽ More
Though current CV models have been able to achieve high levels of accuracy on small-scale images classification dataset with hundreds or thousands of categories, many models become infeasible in computational or space consumption when it comes to large-scale dataset with more than 50,000 categories. In this paper, we provide a viable solution for classifying large-scale species datasets using traditional CV techniques such as.features extraction and processing, BOVW(Bag of Visual Words) and some statistical learning technics like Mini-Batch K-Means,SVM which are used in our works. And then mixed with a neural network model. When applying these techniques, we have done some optimization in time and memory consumption, so that it can be feasible for large-scale dataset. And we also use some technics to reduce the impact of mislabeling data. We use a dataset with more than 50, 000 categories, and all operations are done on common computer with l 6GB RAM and a CPU of 3. OGHz. Our contributions are: 1) analysis what problems may meet in the training processes, and presents several feasible ways to solve these problems. 2) Make traditional CV models combined with neural network models provide some feasible scenarios for training large-scale classified datasets within the constraints of time and spatial resources.
△ Less
Submitted 26 July, 2020;
originally announced July 2020.
-
Dynamic Memory Induction Networks for Few-Shot Text Classification
Authors:
Ruiying Geng,
Binhua Li,
Yongbin Li,
Jian Sun,
Xiaodan Zhu
Abstract:
This paper proposes Dynamic Memory Induction Networks (DMIN) for few-shot text classification. The model utilizes dynamic routing to provide more flexibility to memory-based few-shot learning in order to better adapt the support sets, which is a critical capacity of few-shot classification models. Based on that, we further develop induction models with query information, aiming to enhance the gene…
▽ More
This paper proposes Dynamic Memory Induction Networks (DMIN) for few-shot text classification. The model utilizes dynamic routing to provide more flexibility to memory-based few-shot learning in order to better adapt the support sets, which is a critical capacity of few-shot classification models. Based on that, we further develop induction models with query information, aiming to enhance the generalization ability of meta-learning. The proposed model achieves new state-of-the-art results on the miniRCV1 and ODIC dataset, improving the best performance (accuracy) by 2~4%. Detailed analysis is further performed to show the effectiveness of each component.
△ Less
Submitted 12 May, 2020;
originally announced May 2020.
-
Semantic Graph Convolutional Network for Implicit Discourse Relation Classification
Authors:
Yingxue Zhang,
** Jian,
Fandong Meng,
Ruiying Geng,
Wei Cheng,
Jie Zhou
Abstract:
Implicit discourse relation classification is of great importance for discourse parsing, but remains a challenging problem due to the absence of explicit discourse connectives communicating these relations. Modeling the semantic interactions between the two arguments of a relation has proven useful for detecting implicit discourse relations. However, most previous approaches model such semantic in…
▽ More
Implicit discourse relation classification is of great importance for discourse parsing, but remains a challenging problem due to the absence of explicit discourse connectives communicating these relations. Modeling the semantic interactions between the two arguments of a relation has proven useful for detecting implicit discourse relations. However, most previous approaches model such semantic interactions from a shallow interactive level, which is inadequate on capturing enough semantic information. In this paper, we propose a novel and effective Semantic Graph Convolutional Network (SGCN) to enhance the modeling of inter-argument semantics on a deeper interaction level for implicit discourse relation classification. We first build an interaction graph over representations of the two arguments, and then automatically extract in-depth semantic interactive information through graph convolution. Experimental results on the English corpus PDTB and the Chinese corpus CDTB both demonstrate the superiority of our model to previous state-of-the-art systems.
△ Less
Submitted 21 October, 2019;
originally announced October 2019.
-
Giant Spin Seebeck Effect through an Interface Organic Semiconductor
Authors:
V. Kalappattil,
R. Geng,
R. Das,
H. Luong,
M. Pham,
T. Nguyen,
A. Popescu,
L. M. Woods,
M. Kläui,
H. Srikanth,
M. H. Phan
Abstract:
Interfacing an organic semiconductor C60 with a non-magnetic metallic thin film (Cu or Pt) has created a novel heterostructure that is ferromagnetic at ambient temperature, while its interface with a magnetic metal (Fe or Co) can tune the anisotropic magnetic surface property of the material. Here, we demonstrate that sandwiching C60 in between a magnetic insulator (Y3Fe5O12: YIG) and a non-magnet…
▽ More
Interfacing an organic semiconductor C60 with a non-magnetic metallic thin film (Cu or Pt) has created a novel heterostructure that is ferromagnetic at ambient temperature, while its interface with a magnetic metal (Fe or Co) can tune the anisotropic magnetic surface property of the material. Here, we demonstrate that sandwiching C60 in between a magnetic insulator (Y3Fe5O12: YIG) and a non-magnetic, strong spin-orbit metal (Pt) promotes highly efficient spin current transport via the thermally driven spin Seebeck effect (SSE). Experiments and first principles calculations consistently show that the presence of C60 reduces significantly the conductivity mismatch between YIG and Pt and the surface perpendicular magnetic anisotropy of YIG, giving rise to enhanced spin mixing conductance across YIG/C60/Pt interfaces. As a result, a 600% increase in the SSE voltage (VLSSE) has been realized in YIG/C60/Pt relative to YIG/Pt. Temperature-dependent SSE voltage measurements on YIG/C60/Pt with varying C60 layer thicknesses also show an exponential increase in VLSSE at low temperatures below 200 K, resembling the temperature evolution of spin diffusion length of C60. Our study emphasizes the important roles of the magnetic anisotropy and the spin diffusion length of the intermediate layer in the SSE in YIG/C60/Pt structures, providing a new pathway for develo** novel spin-caloric materials.
△ Less
Submitted 11 May, 2019;
originally announced May 2019.
-
Induction Networks for Few-Shot Text Classification
Authors:
Ruiying Geng,
Binhua Li,
Yongbin Li,
Xiaodan Zhu,
** Jian,
Jian Sun
Abstract:
Text classification tends to struggle when data is deficient or when it needs to adapt to unseen classes. In such challenging scenarios, recent studies have used meta-learning to simulate the few-shot task, in which new queries are compared to a small support set at the sample-wise level. However, this sample-wise comparison may be severely disturbed by the various expressions in the same class. T…
▽ More
Text classification tends to struggle when data is deficient or when it needs to adapt to unseen classes. In such challenging scenarios, recent studies have used meta-learning to simulate the few-shot task, in which new queries are compared to a small support set at the sample-wise level. However, this sample-wise comparison may be severely disturbed by the various expressions in the same class. Therefore, we should be able to learn a general representation of each class in the support set and then compare it to new queries. In this paper, we propose a novel Induction Network to learn such a generalized class-wise representation, by innovatively leveraging the dynamic routing algorithm in meta-learning. In this way, we find the model is able to induce and generalize better. We evaluate the proposed model on a well-studied sentiment classification dataset (English) and a real-world dialogue intent classification dataset (Chinese). Experiment results show that on both datasets, the proposed model significantly outperforms the existing state-of-the-art approaches, proving the effectiveness of class-wise generalization in few-shot text classification.
△ Less
Submitted 29 September, 2019; v1 submitted 27 February, 2019;
originally announced February 2019.
-
Magnetically Tunable Organic Semiconductors with Superparamagnetic Nanoparticles
Authors:
Rugang Geng,
Hoang Mai Luong,
Raja Das,
Kristen Stojak,
Minh Thien Pham,
Joshua Robles-Garcia,
Tuan Anh Duong,
Huy Thanh Pham,
Thi Huong Au,
Ngoc Diep Lai,
George K. Larsen,
Manh-Huong Phan,
Tho Duc Nguyen
Abstract:
Magnetic nanoparticles (MNPs) exhibiting superparamagnetic properties might generate large magnetic dipole-dipole interaction with electron spins in organic semiconductors (OSECs). This concept could be considered analogous to the effect of hyperfine interaction (HFI). In order to investigate this model, Fe3O4 MNPs are used as a dopant for generating random hyperfine-like magnetic fields in a HFI-…
▽ More
Magnetic nanoparticles (MNPs) exhibiting superparamagnetic properties might generate large magnetic dipole-dipole interaction with electron spins in organic semiconductors (OSECs). This concept could be considered analogous to the effect of hyperfine interaction (HFI). In order to investigate this model, Fe3O4 MNPs are used as a dopant for generating random hyperfine-like magnetic fields in a HFI-dominant π-conjugated polymer host, poly(2-methoxy-5-(2-ethylhexyloxy)-1,4-phenylenevinylene) (MeH-PPV). The magnetoconductance (MC) response in organic light emitting diodes made by MeH-PPV/MNP blends is used to estimate the effective hyperfine field in the blends. Firstly, we find that the shape of the MC response essentially remains the same regardless of the MNP concentration, which is attributed to the similar functionality between the nuclear spins and the MNPs. Secondly, the width of MC increases with increasing MNP concentration. Magneto-optical Kerr effect experiments and micromagnetic simulation indicate that the additional increase of the MC width is associated with the strength of the magnetization of the blend. Finally, the MC broadening has the same temperature dependent trend as the magnetization of the MNPs where the unique effect of the MNPs in their superparamagnetic and ferromagnetic regimes on the MC response is observed. Magneto-photoinduced absorption (MPA) spectroscopy confirms that the MC broadening is not due to defects introduced by the MNPs, but is a result of unique superparamagnetic behavior. Our study yields a new pathway for tuning OSECs' magnetic functionality, which is essential to organic optoelectronic devices and magnetic sensor applications.
△ Less
Submitted 11 June, 2019; v1 submitted 18 February, 2019;
originally announced February 2019.
-
The International Linear Collider Technical Design Report - Volume 3.I: Accelerator R&D in the Technical Design Phase
Authors:
Chris Adolphsen,
Maura Barone,
Barry Barish,
Karsten Buesser,
Philip Burrows,
John Carwardine,
Jeffrey Clark,
Hélène Mainaud Durand,
Gerry Dugan,
Eckhard Elsen,
Atsushi Enomoto,
Brian Foster,
Shigeki Fukuda,
Wei Gai,
Martin Gastal,
Rongli Geng,
Camille Ginsburg,
Susanna Guiducci,
Mike Harrison,
Hitoshi Hayano,
Keith Kershaw,
Kiyoshi Kubo,
Victor Kuchler,
Benno List,
Wanming Liu
, et al. (19 additional authors not shown)
Abstract:
The International Linear Collider Technical Design Report (TDR) describes in four volumes the physics case and the design of a 500 GeV centre-of-mass energy linear electron-positron collider based on superconducting radio-frequency technology using Niobium cavities as the accelerating structures. The accelerator can be extended to 1 TeV and also run as a Higgs factory at around 250 GeV and on the…
▽ More
The International Linear Collider Technical Design Report (TDR) describes in four volumes the physics case and the design of a 500 GeV centre-of-mass energy linear electron-positron collider based on superconducting radio-frequency technology using Niobium cavities as the accelerating structures. The accelerator can be extended to 1 TeV and also run as a Higgs factory at around 250 GeV and on the Z0 pole. A comprehensive value estimate of the accelerator is give, together with associated uncertainties. It is shown that no significant technical issues remain to be solved. Once a site is selected and the necessary site-dependent engineering is carried out, construction can begin immediately. The TDR also gives baseline documentation for two high-performance detectors that can share the ILC luminosity by being moved into and out of the beam line in a "push-pull" configuration. These detectors, ILD and SiD, are described in detail. They form the basis for a world-class experimental programme that promises to increase significantly our understanding of the fundamental processes that govern the evolution of the Universe.
△ Less
Submitted 26 June, 2013;
originally announced June 2013.
-
The International Linear Collider Technical Design Report - Volume 3.II: Accelerator Baseline Design
Authors:
Chris Adolphsen,
Maura Barone,
Barry Barish,
Karsten Buesser,
Philip Burrows,
John Carwardine,
Jeffrey Clark,
Hélène Mainaud Durand,
Gerry Dugan,
Eckhard Elsen,
Atsushi Enomoto,
Brian Foster,
Shigeki Fukuda,
Wei Gai,
Martin Gastal,
Rongli Geng,
Camille Ginsburg,
Susanna Guiducci,
Mike Harrison,
Hitoshi Hayano,
Keith Kershaw,
Kiyoshi Kubo,
Victor Kuchler,
Benno List,
Wanming Liu
, et al. (19 additional authors not shown)
Abstract:
The International Linear Collider Technical Design Report (TDR) describes in four volumes the physics case and the design of a 500 GeV centre-of-mass energy linear electron-positron collider based on superconducting radio-frequency technology using Niobium cavities as the accelerating structures. The accelerator can be extended to 1 TeV and also run as a Higgs factory at around 250 GeV and on the…
▽ More
The International Linear Collider Technical Design Report (TDR) describes in four volumes the physics case and the design of a 500 GeV centre-of-mass energy linear electron-positron collider based on superconducting radio-frequency technology using Niobium cavities as the accelerating structures. The accelerator can be extended to 1 TeV and also run as a Higgs factory at around 250 GeV and on the Z0 pole. A comprehensive value estimate of the accelerator is give, together with associated uncertainties. It is shown that no significant technical issues remain to be solved. Once a site is selected and the necessary site-dependent engineering is carried out, construction can begin immediately. The TDR also gives baseline documentation for two high-performance detectors that can share the ILC luminosity by being moved into and out of the beam line in a "push-pull" configuration. These detectors, ILD and SiD, are described in detail. They form the basis for a world-class experimental programme that promises to increase significantly our understanding of the fundamental processes that govern the evolution of the Universe.
△ Less
Submitted 26 June, 2013;
originally announced June 2013.
-
Science Requirements and Conceptual Design for a Polarized Medium Energy Electron-Ion Collider at Jefferson Lab
Authors:
S. Abeyratne,
A. Accardi,
S. Ahmed,
D. Barber,
J. Bisognano,
A. Bogacz,
A. Castilla,
P. Chevtsov,
S. Corneliussen,
W. Deconinck,
P. Degtiarenko,
J. Delayen,
Ya. Derbenev,
S. DeSilva,
D. Douglas,
V. Dudnikov,
R. Ent,
B. Erdelyi,
P. Evtushenko,
Yu. Filatov,
D. Gaskell,
R. Geng,
V. Guzey,
T. Horn,
A. Hutton
, et al. (33 additional authors not shown)
Abstract:
This report presents a brief summary of the science opportunities and program of a polarized medium energy electron-ion collider at Jefferson Lab and a comprehensive description of the conceptual design of such a collider based on the CEBAF electron accelerator facility.
This report presents a brief summary of the science opportunities and program of a polarized medium energy electron-ion collider at Jefferson Lab and a comprehensive description of the conceptual design of such a collider based on the CEBAF electron accelerator facility.
△ Less
Submitted 5 September, 2012; v1 submitted 4 September, 2012;
originally announced September 2012.
-
Search for gravitational waves associated with gamma-ray bursts during LIGO science run 6 and Virgo science runs 2 and 3
Authors:
The LIGO Scientific Collaboration,
Virgo Collaboration,
J. Abadie,
B. P. Abbott,
R. Abbott,
T. D. Abbott,
M. Abernathy,
T. Accadia,
F. Acernese,
C. Adams,
R. Adhikari,
C. Affeldt,
M. Agathos,
K. Agatsuma,
P. Ajith,
B. Allen,
E. Amador Ceron,
D. Amariutei,
S. B. Anderson,
W. G. Anderson,
K. Arai,
M. A. Arain,
M. C. Araya,
S. M. Aston,
P. Astone
, et al. (785 additional authors not shown)
Abstract:
We present the results of a search for gravitational waves associated with 154 gamma-ray bursts (GRBs) that were detected by satellite-based gamma-ray experiments in 2009-2010, during the sixth LIGO science run and the second and third Virgo science runs. We perform two distinct searches: a modeled search for coalescences of either two neutron stars or a neutron star and black hole; and a search f…
▽ More
We present the results of a search for gravitational waves associated with 154 gamma-ray bursts (GRBs) that were detected by satellite-based gamma-ray experiments in 2009-2010, during the sixth LIGO science run and the second and third Virgo science runs. We perform two distinct searches: a modeled search for coalescences of either two neutron stars or a neutron star and black hole; and a search for generic, unmodeled gravitational-wave bursts. We find no evidence for gravitational-wave counterparts, either with any individual GRB in this sample or with the population as a whole. For all GRBs we place lower bounds on the distance to the progenitor, under the optimistic assumption of a gravitational-wave emission energy of 10^-2 M c^2 at 150 Hz, with a median limit of 17 Mpc. For short hard GRBs we place exclusion distances on binary neutron star and neutron star-black hole progenitors, using astrophysically motivated priors on the source parameters, with median values of 16 Mpc and 28 Mpc respectively. These distance limits, while significantly larger than for a search that is not aided by GRB satellite observations, are not large enough to expect a coincidence with a GRB. However, projecting these exclusions to the sensitivities of Advanced LIGO and Virgo, which should begin operation in 2015, we find that the detection of gravitational waves associated with GRBs will become quite possible.
△ Less
Submitted 24 September, 2012; v1 submitted 10 May, 2012;
originally announced May 2012.
-
All-sky search for gravitational-wave bursts in the second joint LIGO-Virgo run
Authors:
the LIGO Scientific Collaboration,
the Virgo Collaboration,
J. Abadie,
B. P. Abbott,
R. Abbott,
T. D. Abbott,
M. Abernathy,
T. Accadia,
F. Acernese,
C. Adams,
R. Adhikari,
C. Affeldt,
M. Agathos,
K. Agatsuma,
P. Ajith,
B. Allen,
E. Amador Ceron,
D. Amariutei,
S. B. Anderson,
W. G. Anderson,
K. Arai,
M. A. Arain,
M. C. Araya,
S. M. Aston,
P. Astone
, et al. (766 additional authors not shown)
Abstract:
We present results from a search for gravitational-wave bursts in the data collected by the LIGO and Virgo detectors between July 7, 2009 and October 20, 2010: data are analyzed when at least two of the three LIGO-Virgo detectors are in coincident operation, with a total observation time of 207 days. The analysis searches for transients of duration < 1 s over the frequency band 64-5000 Hz, without…
▽ More
We present results from a search for gravitational-wave bursts in the data collected by the LIGO and Virgo detectors between July 7, 2009 and October 20, 2010: data are analyzed when at least two of the three LIGO-Virgo detectors are in coincident operation, with a total observation time of 207 days. The analysis searches for transients of duration < 1 s over the frequency band 64-5000 Hz, without other assumptions on the signal waveform, polarization, direction or occurrence time. All identified events are consistent with the expected accidental background. We set frequentist upper limits on the rate of gravitational-wave bursts by combining this search with the previous LIGO-Virgo search on the data collected between November 2005 and October 2007. The upper limit on the rate of strong gravitational-wave bursts at the Earth is 1.3 events per year at 90% confidence. We also present upper limits on source rate density per year and Mpc^3 for sample populations of standard-candle sources. As in the previous joint run, typical sensitivities of the search in terms of the root-sum-squared strain amplitude for these waveforms lie in the range 5 10^-22 Hz^-1/2 to 1 10^-20 Hz^-1/2. The combination of the two joint runs entails the most sensitive all-sky search for generic gravitational-wave bursts and synthesizes the results achieved by the initial generation of interferometric detectors.
△ Less
Submitted 20 April, 2012; v1 submitted 13 February, 2012;
originally announced February 2012.
-
Search for Gravitational Waves from Intermediate Mass Binary Black Holes
Authors:
the LIGO Scientific Collaboration,
the Virgo Collaboration,
J. Abadie,
B. P. Abbott,
R. Abbott,
T. D. Abbott,
M. Abernathy,
T. Accadia,
F. Acernese,
C. Adams,
R. Adhikari,
C. Affeldt,
M. Agathos,
K. Agatsuma,
P. Ajith,
B. Allen,
E. Amador Ceron,
D. Amariutei,
S. B. Anderson,
W. G. Anderson,
K. Arai,
M. A. Arain,
M. C. Araya,
S. M. Aston,
P. Astone
, et al. (770 additional authors not shown)
Abstract:
We present the results of a weakly modeled burst search for gravitational waves from mergers of non-spinning intermediate mass black holes (IMBH) in the total mass range 100--450 solar masses and with the component mass ratios between 1:1 and 4:1. The search was conducted on data collected by the LIGO and Virgo detectors between November of 2005 and October of 2007. No plausible signals were obser…
▽ More
We present the results of a weakly modeled burst search for gravitational waves from mergers of non-spinning intermediate mass black holes (IMBH) in the total mass range 100--450 solar masses and with the component mass ratios between 1:1 and 4:1. The search was conducted on data collected by the LIGO and Virgo detectors between November of 2005 and October of 2007. No plausible signals were observed by the search which constrains the astrophysical rates of the IMBH mergers as a function of the component masses. In the most efficiently detected bin centered on 88+88 solar masses, for non-spinning sources, the rate density upper limit is 0.13 per Mpc^3 per Myr at the 90% confidence level.
△ Less
Submitted 25 April, 2012; v1 submitted 28 January, 2012;
originally announced January 2012.