-
Hypermultiplexed off-chip hologram by on-chip integrated metasurface
Authors:
Xian** Liu,
Zhanying Ma,
Dasen Zhang,
Qiwen Bao,
Zhenzhen Liu,
Jun-Jun Xiao
Abstract:
The waveguide-integrated metasurface introduces a novel photonic chip capable of converting guided modes into free-space light. This enables functions such as off-chip beam focusing, steering, and imaging. The challenge lies in achieving hypermultiplexing across diverse parameters, including guided-wave mode type, direction, polarization, and notably, multiple wavelengths. Here, we introduce a com…
▽ More
The waveguide-integrated metasurface introduces a novel photonic chip capable of converting guided modes into free-space light. This enables functions such as off-chip beam focusing, steering, and imaging. The challenge lies in achieving hypermultiplexing across diverse parameters, including guided-wave mode type, direction, polarization, and notably, multiple wavelengths. Here, we introduce a comprehensive end-to-end inverse design framework, rooted in a physical model, for the multifunctional design of on-chip metasurfaces. This framework allows for metasurface optimization through a target-field-driven iteration process. We demonstrate a hypermultiplexed on-chip metasurface capable of generating red-green-blue holograms at multiple target planes, with both independent and cooperative control over guided-wave direction. Significantly, the proposed method streamlines the design process utilizing only the positions of meta-atoms as the design variable. We demonstrate 9 independent holographic channels through a combination of wavelength and distance multiplexing. Moreover, by incorporating the excitation direction into the design, the metasurface produces a total of 36 distinct holograms. The robustness of these results against fabrication discrepancies is validated through 3D full-wave electromagnetic simulations, aligning well with advanced manufacturing techniques. Our research presents a universal design framework for the development of multifunctional on-chip metasurfaces, opening up new avenues for a wide range of applications.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Machine learning disentangles bias causes of shortwave cloud radiative effect in a climate model
Authors:
Hongtao Yang,
Guoxing Chen,
Wei-Chyung Wang,
Qing Bao,
Jiandong Li
Abstract:
Large bias exists in shortwave cloud radiative effect (SWCRE) of general circulation models (GCMs), attributed mainly to the combined effect of cloud fraction and water contents, whose representations in models remain challenging. Here we show an effective machine-learning approach to dissect the individual bias of relevant cloud parameters determining SWCRE. A surrogate model for calculating SWCR…
▽ More
Large bias exists in shortwave cloud radiative effect (SWCRE) of general circulation models (GCMs), attributed mainly to the combined effect of cloud fraction and water contents, whose representations in models remain challenging. Here we show an effective machine-learning approach to dissect the individual bias of relevant cloud parameters determining SWCRE. A surrogate model for calculating SWCRE was developed based on random forest using observations and FGOALS-f3-L simulation data of cloud fraction (CFR), cloud-solar concurrence ratio (CSC), cloud liquid and ice water paths (LWP and IWP), TOA upward clear-sky solar flux (SUC), and solar zenith angle. The model, which achieves high determination coefficient > 0.96 in the validation phase, was then used to quantify SWCRE bias associated with these parameters following the partial radiation perturbation method. The global-mean SWCRE bias (in W m-2) is contributed by CFR (+5.11), LWP (-6.58), IWP (-1.67), and CSC (+4.38), while SUC plays a minor role; the large CSC contribution highlights the importance of cloud diurnal variation. Regionally, the relative importance varies according to climate regimes. In Tropics, overestimated LWP and IWP exist over lands, while oceans exhibit underestimated CFR and CSC. In contrast, the extratropical lands and oceans have, respectively, too-small CSC and the 'too few, too bright' low-level clouds. We thus suggest that machine learning, in addition for develo** GCM physical parameterizations, can also be utilized for diagnosing and understanding complex cloud-climate interactions.
△ Less
Submitted 11 May, 2024;
originally announced May 2024.
-
Structured Model Pruning for Efficient Inference in Computational Pathology
Authors:
Mohammed Adnan,
Qinle Ba,
Nazim Shaikh,
Shivam Kalra,
Satarupa Mukherjee,
Auranuch Lorsakul
Abstract:
Recent years have seen significant efforts to adopt Artificial Intelligence (AI) in healthcare for various use cases, from computer-aided diagnosis to ICU triage. However, the size of AI models has been rapidly growing due to scaling laws and the success of foundational models, which poses an increasing challenge to leverage advanced models in practical applications. It is thus imperative to devel…
▽ More
Recent years have seen significant efforts to adopt Artificial Intelligence (AI) in healthcare for various use cases, from computer-aided diagnosis to ICU triage. However, the size of AI models has been rapidly growing due to scaling laws and the success of foundational models, which poses an increasing challenge to leverage advanced models in practical applications. It is thus imperative to develop efficient models, especially for deploying AI solutions under resource-constrains or with time sensitivity. One potential solution is to perform model compression, a set of techniques that remove less important model components or reduce parameter precision, to reduce model computation demand. In this work, we demonstrate that model pruning, as a model compression technique, can effectively reduce inference cost for computational and digital pathology based analysis with a negligible loss of analysis performance. To this end, we develop a methodology for pruning the widely used U-Net-style architectures in biomedical imaging, with which we evaluate multiple pruning heuristics on nuclei instance segmentation and classification, and empirically demonstrate that pruning can compress models by at least 70% with a negligible drop in performance.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Enhancing Kubernetes Automated Scheduling with Deep Learning and Reinforcement Techniques for Large-Scale Cloud Computing Optimization
Authors:
Zheng Xu,
Yulu Gong,
Yanlin Zhou,
Qiaozhi Bao,
Wenpin Qian
Abstract:
With the continuous expansion of the scale of cloud computing applications, artificial intelligence technologies such as Deep Learning and Reinforcement Learning have gradually become the key tools to solve the automated task scheduling of large-scale cloud computing systems. Aiming at the complexity and real-time requirement of task scheduling in large-scale cloud computing system, this paper pro…
▽ More
With the continuous expansion of the scale of cloud computing applications, artificial intelligence technologies such as Deep Learning and Reinforcement Learning have gradually become the key tools to solve the automated task scheduling of large-scale cloud computing systems. Aiming at the complexity and real-time requirement of task scheduling in large-scale cloud computing system, this paper proposes an automatic task scheduling scheme based on deep learning and reinforcement learning. Firstly, the deep learning technology is used to monitor and predict the parameters in the cloud computing system in real time to obtain the system status information. Then, combined with reinforcement learning algorithm, the task scheduling strategy is dynamically adjusted according to the real-time system state and task characteristics to achieve the optimal utilization of system resources and the maximum of task execution efficiency. This paper verifies the effectiveness and performance advantages of the proposed scheme in experiments, and proves the potential and application prospect of deep learning and reinforcement learning in automatic task scheduling in large-scale cloud computing systems.
△ Less
Submitted 26 February, 2024;
originally announced March 2024.
-
Waveform-Domain Complementary Signal Sets for Interrupted Sampling Repeater Jamming Suppression
Authors:
Hanning Su,
Qinglong Bao,
Jiameng Pan,
Fucheng Guo,
Weidong Hu
Abstract:
The interrupted-sampling repeater jamming (ISRJ) is coherent and has the characteristic of suppression and deception to degrade the radar detection capabilities. The study focuses on anti-ISRJ techniques in the waveform domain, primarily capitalizing on waveform design and and anti-jamming signal processing methods in the waveform domain. By exploring the relationship between waveform-domain adapt…
▽ More
The interrupted-sampling repeater jamming (ISRJ) is coherent and has the characteristic of suppression and deception to degrade the radar detection capabilities. The study focuses on anti-ISRJ techniques in the waveform domain, primarily capitalizing on waveform design and and anti-jamming signal processing methods in the waveform domain. By exploring the relationship between waveform-domain adaptive matched filtering (WD-AMF) output and waveform-domain signals, we demonstrate that ISRJ can be effectively suppressed when the transmitted waveform exhibits waveform-domain complementarity. We introduce a phase-coded (PC) waveform set with waveform-domain complementarity and propose a method for generating such waveform sets of arbitrary code lengths. The performance of WD-AMF are further developed due to the designed waveforms, and simulations affirm the superior adaptive anti-jamming capabilities of the designed waveforms compared to traditional ones. Remarkably, this improved performance is achieved without the need for prior knowledge of ISRJ interference parameters at either the transmitter or receiver stages.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Vietnamese Poem Generation & The Prospect Of Cross-Language Poem-To-Poem Translation
Authors:
Triet Minh Huynh,
Quan Le Bao
Abstract:
Poetry generation has been a challenging task in the field of Natural Language Processing, as it requires the model to understand the nuances of language, sentiment, and style. In this paper, we propose using Large Language Models to generate Vietnamese poems of various genres from natural language prompts, thereby facilitating an intuitive process with enhanced content control. Our most efficacio…
▽ More
Poetry generation has been a challenging task in the field of Natural Language Processing, as it requires the model to understand the nuances of language, sentiment, and style. In this paper, we propose using Large Language Models to generate Vietnamese poems of various genres from natural language prompts, thereby facilitating an intuitive process with enhanced content control. Our most efficacious model, the GPT-3 Babbage variant, achieves a custom evaluation score of 0.8, specifically tailored to the "luc bat" genre of Vietnamese poetry. Furthermore, we also explore the idea of paraphrasing poems into normal text prompts and yield a relatively high score of 0.781 in the "luc bat" genre. This experiment presents the potential for cross-Language poem-to-poem translation with translated poems as the inputs while concurrently maintaining complete control over the generated content.
△ Less
Submitted 4 January, 2024; v1 submitted 2 January, 2024;
originally announced January 2024.
-
Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning
Authors:
Qiming Bao,
Gael Gendron,
Alex Yuxuan Peng,
Wanjun Zhong,
Neset Tan,
Yang Chen,
Michael Witbrock,
Jiamou Liu
Abstract:
Large language models (LLMs), such as LLaMA, Alpaca, Vicuna, GPT-3.5 and GPT-4, have advanced the performance of AI systems on various natural language processing tasks to human-like levels. However, their generalisation and robustness when performing logical reasoning has not been sufficiently assessed. To comprehensively evaluate this ability, we develop three new logical reasoning datasets name…
▽ More
Large language models (LLMs), such as LLaMA, Alpaca, Vicuna, GPT-3.5 and GPT-4, have advanced the performance of AI systems on various natural language processing tasks to human-like levels. However, their generalisation and robustness when performing logical reasoning has not been sufficiently assessed. To comprehensively evaluate this ability, we develop three new logical reasoning datasets named "ReClor-plus", "LogiQA-plus" and "LogiQAv2-plus" that extend standard logical reasoning datasets to evaluate the robustness of the LLM's reasoning. For each, we create three subsets: the first with randomly shuffled options, the second with the correct choices replaced by "none of the other options is correct", and the third with a combination of shuffling and substitution. Experiments on these datasets show that these simple augmentations greatly hinder the models' performance. Despite their high performance on the original publicly available datasets, we find that all models perform poorly on these newly constructed datasets. We also demonstrate that introducing task variations into the training set can markedly improve the model's performance on both the original and our developed datasets. Finally, we show that applying logic-driven data augmentation for fine-tuning and prompting can enhance generalisation in both discriminative and generative models, offering a path to improving their robustness for tasks involving logical reasoning. Source code and data are made publicly available at https://github.com/Strong-AI-Lab/Logical-and-abstract-reasoning.
△ Less
Submitted 30 March, 2024; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language Models
Authors:
Qiming Bao,
Juho Leinonen,
Alex Yuxuan Peng,
Wanjun Zhong,
Gaël Gendron,
Timothy Pistotti,
Alice Huang,
Paul Denny,
Michael Witbrock,
Jiamou Liu
Abstract:
Large language models exhibit superior capabilities in processing and understanding language, yet their applications in educational contexts remain underexplored. Learnersourcing enhances learning by engaging students in creating their own educational content. When learnersourcing multiple-choice questions, creating explanations for the solution of a question is a crucial step; it helps other stud…
▽ More
Large language models exhibit superior capabilities in processing and understanding language, yet their applications in educational contexts remain underexplored. Learnersourcing enhances learning by engaging students in creating their own educational content. When learnersourcing multiple-choice questions, creating explanations for the solution of a question is a crucial step; it helps other students understand the solution and promotes a deeper understanding of related concepts. However, it is often difficult for students to craft effective solution explanations, due to limited subject understanding. To help scaffold the task of automated explanation generation, we present and evaluate a framework called "ILearner-LLM", that iteratively enhances the generated explanations for the given questions with large language models. Comprising an explanation generation model and an explanation evaluation model, the framework generates high-quality student-aligned explanations by iteratively feeding the quality rating score from the evaluation model back into the instruction prompt of the explanation generation model. Experimental results demonstrate the effectiveness of our ILearner-LLM on LLaMA2-13B and GPT-4 to generate higher quality explanations that are closer to those written by students on five PeerWise datasets. Our findings represent a promising path to enrich the learnersourcing experience for students and to enhance the capabilities of large language models for educational applications.
△ Less
Submitted 10 March, 2024; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Fast generation of Schrödinger cat states in a Kerr-tunable superconducting resonator
Authors:
X. L. He,
Yong Lu,
D. Q. Bao,
Hang Xue,
W. B. Jiang,
Zhen Wang,
A. F. Roudsari,
Per Delsing,
J. S. Tsai,
Z. R. Lin
Abstract:
Schrödinger cat states, quantum superpositions of macroscopically distinct classical states, are an important resource for quantum communication, quantum metrology and quantum computation. Especially, cat states in a phase space protected against phase-flip errors can be used as a logical qubit. However, cat states, normally generated in three-dimensional cavities, are facing the challenges of sca…
▽ More
Schrödinger cat states, quantum superpositions of macroscopically distinct classical states, are an important resource for quantum communication, quantum metrology and quantum computation. Especially, cat states in a phase space protected against phase-flip errors can be used as a logical qubit. However, cat states, normally generated in three-dimensional cavities, are facing the challenges of scalability and controllability. Here, we present a novel strategy to generate and store cat states in a coplanar superconducting circuit by the fast modulation of Kerr nonlinearity. At the Kerr-free work point, our cat states are passively preserved due to the vanishing Kerr effect. We are able to prepare a 2-component cat state in our chip-based device with a fidelity reaching 89.1% under a 96 ns gate time. Our scheme shows an excellent route to constructing a chip-based bosonic quantum processor.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Reliable Synthesis of Large-Area Monolayer WS2 Single Crystals, Films, and Heterostructures with Extraordinary Photoluminescence Induced by Water Intercalation
Authors:
Qianhui Zhang,
Jianfeng Lu,
Ziyu Wang,
Zhigao Dai,
Yupeng Zhang,
Fuzhi Huang,
Qiaoliang Bao,
Wenhui Duan,
Michael S. Fuhrer,
Changxi Zheng
Abstract:
Two-dimensional (2D) transition metal dichalcogenides (TMDs) hold great potential for future low-energy optoelectronics owing to their unique electronic, optical, and mechanical properties. Chemical vapor deposition (CVD) is the technique widely used for the synthesis of large-area TMDs. However, due to high sensitivity to the growth environment, reliable synthesis of monolayer TMDs via CVD remain…
▽ More
Two-dimensional (2D) transition metal dichalcogenides (TMDs) hold great potential for future low-energy optoelectronics owing to their unique electronic, optical, and mechanical properties. Chemical vapor deposition (CVD) is the technique widely used for the synthesis of large-area TMDs. However, due to high sensitivity to the growth environment, reliable synthesis of monolayer TMDs via CVD remains challenging. Here we develop a controllable CVD process for large-area synthesis of monolayer WS2 crystals, films, and in-plane graphene-WS2 heterostructures by cleaning the reaction tube with hydrochloric acid, sulfuric acid and aqua regia. The concise cleaning process can remove the residual contaminates attached to the CVD reaction tube and crucibles, reducing the nucleation density but enhancing the diffusion length of WS2 species. The photoluminescence (PL) map**s of a WS2 single crystal and film reveal that the extraordinary PL around the edges of a triangular single crystal is induced by ambient water intercalation at the WS2-sapphire interface. The extraordinary PL can be controlled by the choice of substrates with different wettabilities.
△ Less
Submitted 31 July, 2023;
originally announced July 2023.
-
Waveform-Domain Adaptive Matched Filtering for Suppressing Interrupted-Sampling Repeater Jamming
Authors:
Hanning Su,
Qinglong Bao,
Jiameng Pan,
Fucheng Guo,
Weidong Hu
Abstract:
The inadequate adaptability to flexible interference scenarios remains an unresolved challenge in the majority of techniques utilized for mitigating interrupted-sampling repeater jamming (ISRJ). Matched filtering system based methods is desirable to incorporate anti-ISRJ measures based on prior ISRJ modeling, either preceding or succeeding the matched filtering. Due to the partial matching nature…
▽ More
The inadequate adaptability to flexible interference scenarios remains an unresolved challenge in the majority of techniques utilized for mitigating interrupted-sampling repeater jamming (ISRJ). Matched filtering system based methods is desirable to incorporate anti-ISRJ measures based on prior ISRJ modeling, either preceding or succeeding the matched filtering. Due to the partial matching nature of ISRJ, its characteristics are revealed during the process of matched filtering. Therefore, this paper introduces an extended domain called the waveform domain within the matched filtering process. On this domain, an adaptive matched filtering model, known as the waveform-domain adaptive matched filtering (WD-AMF), is established to tackle the problem of ISRJ suppression without relying on a pre-existing ISRJ model. The output of the WD-AMF encompasses an adaptive filtering term and a compensation term. The adaptive filtering term encompasses the adaptive integration outcomes in the waveform domain, which are determined by an adaptive weighted function. This function, akin to a collection of bandpass filters, decomposes the integrated function into multiple components, some of which contain interference while others do not. The compensation term adheres to an integrated guideline for discerning the presence of signal components or noise within the integrated function. The integration results are then concatenated to reconstruct a compensated matched filter signal output. Simulations are conducted to showcase the exceptional capability of the proposed method in suppressing ISRJ in diverse interference scenarios, even in the absence of a pre-existing ISRJ model.
△ Less
Submitted 13 November, 2023; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Measures and Optimization for Robustness and Vulnerability in Disconnected Networks
Authors:
Liwang Zhu,
Qi Bao,
Zhongzhi Zhang
Abstract:
The function or performance of a network is strongly dependent on its robustness, quantifying the ability of the network to continue functioning under perturbations. While a wide variety of robustness metrics have been proposed, they have their respective limitations. In this paper, we propose to use the forest index as a measure of network robustness, which overcomes the deficiencies of existing…
▽ More
The function or performance of a network is strongly dependent on its robustness, quantifying the ability of the network to continue functioning under perturbations. While a wide variety of robustness metrics have been proposed, they have their respective limitations. In this paper, we propose to use the forest index as a measure of network robustness, which overcomes the deficiencies of existing metrics. Using such a measure as an optimization criterion, we propose and study the problem of breaking down a network by attacking some key edges. We show that the objective function of the problem is monotonic but not submodular, which impose more challenging on the problem. We thus resort to greedy algorithms extended for non-submodular functions by iteratively deleting the most promising edges. We first propose a simple greedy algorithm with a proved bound for the approximation ratio and cubic-time complexity. To confront the computation challenge for large networks, we further propose an improved nearly-linear time greedy algorithm, which significantly speeds up the process for edge selection but sacrifices little accuracy. Extensive experimental results for a large set of real-world networks verify the effectiveness and efficiency of our algorithms, demonstrating that our algorithms outperform several baseline schemes.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments
Authors:
Yu Sun,
Qian Bao,
Wu Liu,
Tao Mei,
Michael J. Black
Abstract:
Although the estimation of 3D human pose and shape (HPS) is rapidly progressing, current methods still cannot reliably estimate moving humans in global coordinates, which is critical for many applications. This is particularly challenging when the camera is also moving, entangling human and camera motion. To address these issues, we adopt a novel 5D representation (space, time, and identity) that…
▽ More
Although the estimation of 3D human pose and shape (HPS) is rapidly progressing, current methods still cannot reliably estimate moving humans in global coordinates, which is critical for many applications. This is particularly challenging when the camera is also moving, entangling human and camera motion. To address these issues, we adopt a novel 5D representation (space, time, and identity) that enables end-to-end reasoning about people in scenes. Our method, called TRACE, introduces several novel architectural components. Most importantly, it uses two new "maps" to reason about the 3D trajectory of people over time in camera, and world, coordinates. An additional memory unit enables persistent tracking of people even during long occlusions. TRACE is the first one-stage method to jointly recover and track 3D humans in global coordinates from dynamic cameras. By training it end-to-end, and using full image information, TRACE achieves state-of-the-art performance on tracking and HPS benchmarks. The code and dataset are released for research purposes.
△ Less
Submitted 20 November, 2023; v1 submitted 5 June, 2023;
originally announced June 2023.
-
Large Language Models Are Not Strong Abstract Reasoners
Authors:
Gaël Gendron,
Qiming Bao,
Michael Witbrock,
Gillian Dobbie
Abstract:
Large Language Models have shown tremendous performance on a large variety of natural language processing tasks, ranging from text comprehension to common sense reasoning. However, the mechanisms responsible for this success remain opaque, and it is unclear whether LLMs can achieve human-like cognitive capabilities or whether these models are still fundamentally circumscribed. Abstract reasoning i…
▽ More
Large Language Models have shown tremendous performance on a large variety of natural language processing tasks, ranging from text comprehension to common sense reasoning. However, the mechanisms responsible for this success remain opaque, and it is unclear whether LLMs can achieve human-like cognitive capabilities or whether these models are still fundamentally circumscribed. Abstract reasoning is a fundamental task for cognition, consisting of finding and applying a general pattern from few data. Evaluating deep neural architectures on this task could give insight into their potential limitations regarding reasoning and their broad generalisation abilities, yet this is currently an under-explored area. In this paper, we introduce a new benchmark for evaluating language models beyond memorization on abstract reasoning tasks. We perform extensive evaluations of state-of-the-art LLMs, showing that they currently achieve very limited performance in contrast with other natural language tasks, even when applying techniques that have been shown to improve performance on other NLP tasks. We argue that guiding LLM generation to follow causal paths could help improve the generalisation and reasoning abilities of LLMs.
△ Less
Submitted 2 January, 2024; v1 submitted 31 May, 2023;
originally announced May 2023.
-
Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning
Authors:
Qiming Bao,
Alex Yuxuan Peng,
Zhenyun Deng,
Wanjun Zhong,
Gael Gendron,
Timothy Pistotti,
Neset Tan,
Nathan Young,
Yang Chen,
Yonghua Zhu,
Paul Denny,
Michael Witbrock,
Jiamou Liu
Abstract:
Combining large language models with logical reasoning enhances their capacity to address problems in a robust and reliable manner. Nevertheless, the intricate nature of logical reasoning poses challenges when gathering reliable data from the web to build comprehensive training datasets, subsequently affecting performance on downstream tasks. To address this, we introduce a novel logic-driven data…
▽ More
Combining large language models with logical reasoning enhances their capacity to address problems in a robust and reliable manner. Nevertheless, the intricate nature of logical reasoning poses challenges when gathering reliable data from the web to build comprehensive training datasets, subsequently affecting performance on downstream tasks. To address this, we introduce a novel logic-driven data augmentation approach, AMR-LDA. AMR-LDA converts the original text into an Abstract Meaning Representation (AMR) graph, a structured semantic representation that encapsulates the logical structure of the sentence, upon which operations are performed to generate logically modified AMR graphs. The modified AMR graphs are subsequently converted back into text to create augmented data. Notably, our methodology is architecture-agnostic and enhances both generative large language models, such as GPT-3.5 and GPT-4, through prompt augmentation, and discriminative large language models through contrastive learning with logic-driven data augmentation. Empirical evidence underscores the efficacy of our proposed method with improvement in performance across seven downstream tasks, such as reading comprehension requiring logical reasoning, textual entailment, and natural language inference. Furthermore, our method leads on the ReClor leaderboard at https://eval.ai/web/challenges/challenge-page/503/leaderboard/1347. The source code and data are publicly available at https://github.com/Strong-AI-Lab/Logical-Equivalence-driven-AMR-Data-Augmentation-for-Representation-Learning.
△ Less
Submitted 6 June, 2024; v1 submitted 21 May, 2023;
originally announced May 2023.
-
Input-length-shortening and text generation via attention values
Authors:
Neşet Özkan Tan,
Alex Yuxuan Peng,
Joshua Bensemann,
Qiming Bao,
Tim Hartill,
Mark Gahegan,
Michael Witbrock
Abstract:
Identifying words that impact a task's performance more than others is a challenge in natural language processing. Transformers models have recently addressed this issue by incorporating an attention mechanism that assigns greater attention (i.e., relevance) scores to some words than others. Because of the attention mechanism's high computational cost, transformer models usually have an input-leng…
▽ More
Identifying words that impact a task's performance more than others is a challenge in natural language processing. Transformers models have recently addressed this issue by incorporating an attention mechanism that assigns greater attention (i.e., relevance) scores to some words than others. Because of the attention mechanism's high computational cost, transformer models usually have an input-length limitation caused by hardware constraints. This limitation applies to many transformers, including the well-known bidirectional encoder representations of the transformer (BERT) model. In this paper, we examined BERT's attention assignment mechanism, focusing on two questions: (1) How can attention be employed to reduce input length? (2) How can attention be used as a control mechanism for conditional text generation? We investigated these questions in the context of a text classification task. We discovered that BERT's early layers assign more critical attention scores for text classification tasks compared to later layers. We demonstrated that the first layer's attention sums could be used to filter tokens in a given sequence, considerably decreasing the input length while maintaining good test accuracy. We also applied filtering, which uses a compute-efficient semantic similarities algorithm, and discovered that retaining approximately 6\% of the original sequence is sufficient to obtain 86.5\% accuracy. Finally, we showed that we could generate data in a stable manner and indistinguishable from the original one by only using a small percentage (10\%) of the tokens with high attention scores according to BERT's first layer.
△ Less
Submitted 13 March, 2023;
originally announced March 2023.
-
Two $q$-operational equations and Hahn polynomials
Authors:
**g Gu,
DunKun Yang,
Qi Bao
Abstract:
Motivated by Liu's recent work in \cite{Liu2022}. We shall reveal the essential feature of Hahn polynomials by presenting two new $q$-exponential operators. These lead us to use a systematic method to study identities involving Hahn polynomials. As applications, we use the method of $q$-exponential operator to prove the bilinear generating function of Hahn polynomials and Heine's second transforma…
▽ More
Motivated by Liu's recent work in \cite{Liu2022}. We shall reveal the essential feature of Hahn polynomials by presenting two new $q$-exponential operators. These lead us to use a systematic method to study identities involving Hahn polynomials. As applications, we use the method of $q$-exponential operator to prove the bilinear generating function of Hahn polynomials and Heine's second transformation formula. Moreover, a generalization of $q$-Gaussian summation is given, too.
△ Less
Submitted 5 November, 2022; v1 submitted 25 October, 2022;
originally announced October 2022.
-
DPIT: Dual-Pipeline Integrated Transformer for Human Pose Estimation
Authors:
Shuaitao Zhao,
Kun Liu,
Yuhang Huang,
Qian Bao,
Dan Zeng,
Wu Liu
Abstract:
Human pose estimation aims to figure out the keypoints of all people in different scenes. Current approaches still face some challenges despite promising results. Existing top-down methods deal with a single person individually, without the interaction between different people and the scene they are situated in. Consequently, the performance of human detection degrades when serious occlusion happe…
▽ More
Human pose estimation aims to figure out the keypoints of all people in different scenes. Current approaches still face some challenges despite promising results. Existing top-down methods deal with a single person individually, without the interaction between different people and the scene they are situated in. Consequently, the performance of human detection degrades when serious occlusion happens. On the other hand, existing bottom-up methods consider all people at the same time and capture the global knowledge of the entire image. However, they are less accurate than the top-down methods due to the scale variation. To address these problems, we propose a novel Dual-Pipeline Integrated Transformer (DPIT) by integrating top-down and bottom-up pipelines to explore the visual clues of different receptive fields and achieve their complementarity. Specifically, DPIT consists of two branches, the bottom-up branch deals with the whole image to capture the global visual information, while the top-down branch extracts the feature representation of local vision from the single-human bounding box. Then, the extracted feature representations from bottom-up and top-down branches are fed into the transformer encoder to fuse the global and local knowledge interactively. Moreover, we define the keypoint queries to explore both full-scene and single-human posture visual clues to realize the mutual complementarity of the two pipelines. To the best of our knowledge, this is one of the first works to integrate the bottom-up and top-down pipelines with transformers for human pose estimation. Extensive experiments on COCO and MPII datasets demonstrate that our DPIT achieves comparable performance to the state-of-the-art methods.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
In-Place Gestures Classification via Long-term Memory Augmented Network
Authors:
Lizhi Zhao,
Xuequan Lu,
Qianyue Bao,
Meili Wang
Abstract:
In-place gesture-based virtual locomotion techniques enable users to control their viewpoint and intuitively move in the 3D virtual environment. A key research problem is to accurately and quickly recognize in-place gestures, since they can trigger specific movements of virtual viewpoints and enhance user experience. However, to achieve real-time experience, only short-term sensor sequence data (u…
▽ More
In-place gesture-based virtual locomotion techniques enable users to control their viewpoint and intuitively move in the 3D virtual environment. A key research problem is to accurately and quickly recognize in-place gestures, since they can trigger specific movements of virtual viewpoints and enhance user experience. However, to achieve real-time experience, only short-term sensor sequence data (up to about 300ms, 6 to 10 frames) can be taken as input, which actually affects the classification performance due to limited spatio-temporal information. In this paper, we propose a novel long-term memory augmented network for in-place gestures classification. It takes as input both short-term gesture sequence samples and their corresponding long-term sequence samples that provide extra relevant spatio-temporal information in the training phase. We store long-term sequence features with an external memory queue. In addition, we design a memory augmented loss to help cluster features of the same class and push apart features from different classes, thus enabling our memory queue to memorize more relevant long-term sequence features. In the inference phase, we input only short-term sequence samples to recall the stored features accordingly, and fuse them together to predict the gesture class. We create a large-scale in-place gestures dataset from 25 participants with 11 gestures. Our method achieves a promising accuracy of 95.1% with a latency of 192ms, and an accuracy of 97.3% with a latency of 312ms, and is demonstrated to be superior to recent in-place gesture classification techniques. User study also validates our approach. Our source code and dataset will be made available to the community.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
WOC: A Handy Webcam-based 3D Online Chatroom
Authors:
Chuanhang Yan,
Yu Sun,
Qian Bao,
**hui Pang,
Wu Liu,
Tao Mei
Abstract:
We develop WOC, a webcam-based 3D virtual online chatroom for multi-person interaction, which captures the 3D motion of users and drives their individual 3D virtual avatars in real-time. Compared to the existing wearable equipment-based solution, WOC offers convenient and low-cost 3D motion capture with a single camera. To promote the immersive chat experience, WOC provides high-fidelity virtual a…
▽ More
We develop WOC, a webcam-based 3D virtual online chatroom for multi-person interaction, which captures the 3D motion of users and drives their individual 3D virtual avatars in real-time. Compared to the existing wearable equipment-based solution, WOC offers convenient and low-cost 3D motion capture with a single camera. To promote the immersive chat experience, WOC provides high-fidelity virtual avatar manipulation, which also supports the user-defined characters. With the distributed data flow service, the system delivers highly synchronized motion and voice for all users. Deployed on the website and no installation required, users can freely experience the virtual online chat at https://yanch.cloud.
△ Less
Submitted 17 March, 2023; v1 submitted 1 September, 2022;
originally announced September 2022.
-
Continual Learning for Tumor Classification in Histopathology Images
Authors:
Veena Kaustaban,
Qinle Ba,
Ipshita Bhattacharya,
Nahil Sobh,
Satarupa Mukherjee,
Jim Martin,
Mohammad Saleh Miri,
Christoph Guetter,
Amal Chaturvedi
Abstract:
Recent years have seen great advancements in the development of deep learning models for histopathology image analysis in digital pathology applications, evidenced by the increasingly common deployment of these models in both research and clinical settings. Although such models have shown unprecedented performance in solving fundamental computational tasks in DP applications, they suffer from cata…
▽ More
Recent years have seen great advancements in the development of deep learning models for histopathology image analysis in digital pathology applications, evidenced by the increasingly common deployment of these models in both research and clinical settings. Although such models have shown unprecedented performance in solving fundamental computational tasks in DP applications, they suffer from catastrophic forgetting when adapted to unseen data with transfer learning. With an increasing need for deep learning models to handle ever changing data distributions, including evolving patient population and new diagnosis assays, continual learning models that alleviate model forgetting need to be introduced in DP based analysis. However, to our best knowledge, there is no systematic study of such models for DP-specific applications. Here, we propose CL scenarios in DP settings, where histopathology image data from different sources/distributions arrive sequentially, the knowledge of which is integrated into a single model without training all the data from scratch. We then established an augmented dataset for colorectal cancer H&E classification to simulate shifts of image appearance and evaluated CL model performance in the proposed CL scenarios. We leveraged a breast tumor H&E dataset along with the colorectal cancer to evaluate CL from different tumor types. In addition, we evaluated CL methods in an online few-shot setting under the constraints of annotation and computational resources. We revealed promising results of CL in DP applications, potentially paving the way for application of these methods in clinical practice.
△ Less
Submitted 6 August, 2022;
originally announced August 2022.
-
Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation
Authors:
Qiming Bao,
Alex Yuxuan Peng,
Tim Hartill,
Neset Tan,
Zhenyun Deng,
Michael Witbrock,
Jiamou Liu
Abstract:
Combining deep learning with symbolic logic reasoning aims to capitalize on the success of both fields and is drawing increasing attention. Inspired by DeepLogic, an end-to-end model trained to perform inference on logic programs, we introduce IMA-GloVe-GA, an iterative neural inference network for multi-step reasoning expressed in natural language. In our model, reasoning is performed using an it…
▽ More
Combining deep learning with symbolic logic reasoning aims to capitalize on the success of both fields and is drawing increasing attention. Inspired by DeepLogic, an end-to-end model trained to perform inference on logic programs, we introduce IMA-GloVe-GA, an iterative neural inference network for multi-step reasoning expressed in natural language. In our model, reasoning is performed using an iterative memory neural network based on RNN with a gated attention mechanism. We evaluate IMA-GloVe-GA on three datasets: PARARULES, CONCEPTRULES V1 and CONCEPTRULES V2. Experimental results show DeepLogic with gated attention can achieve higher test accuracy than DeepLogic and other RNN baseline models. Our model achieves better out-of-distribution generalisation than RoBERTa-Large when the rules have been shuffled. Furthermore, to address the issue of unbalanced distribution of reasoning depths in the current multi-step reasoning datasets, we develop PARARULE-Plus, a large dataset with more examples that require deeper reasoning steps. Experimental results show that the addition of PARARULE-Plus can increase the model's performance on examples requiring deeper reasoning depths. The source code and data are available at https://github.com/Strong-AI-Lab/Multi-Step-Deductive-Reasoning-Over-Natural-Language.
△ Less
Submitted 30 March, 2024; v1 submitted 28 July, 2022;
originally announced July 2022.
-
Notes on $q$-partial differential equations for $q$-Laguerre polynomials and little $q$-Jacobi polynomials
Authors:
Qi Bao,
DunKun Yang
Abstract:
We define two common $q$-orthogonal polynomials: homogeneous $q$-Laguerre polynomials and homogeneous little $q$-Jacobi polynomials. They can be viewed separately as solutions to two $q$-partial differential equations. Then, we proved that if an analytic function satisfies a certain system of $q$-partial differential equations, if and only if it can be expanded in terms of homogeneous $q$-Laguerre…
▽ More
We define two common $q$-orthogonal polynomials: homogeneous $q$-Laguerre polynomials and homogeneous little $q$-Jacobi polynomials. They can be viewed separately as solutions to two $q$-partial differential equations. Then, we proved that if an analytic function satisfies a certain system of $q$-partial differential equations, if and only if it can be expanded in terms of homogeneous $q$-Laguerre polynomials or homogeneous little $q$-Jacobi polynomials. As applications, we obtain generalizations of the Ramanujan $q$-beta integrals and Andrews-Askey integrals. Additionally, we present an operator representation of $q$-Laguerre polynomials that facilitates the computation of identities involving $q$-Laguerre polynomials.
△ Less
Submitted 6 May, 2023; v1 submitted 4 July, 2022;
originally announced July 2022.
-
Anisotropic polaritons in 2D vdW materials
Authors:
Babar Shabbir,
Weiliang Ma,
Qiaoliang Bao
Abstract:
Perhaps the most significant progress to the field of infrared optics and nanophotonics has been made through the real space realisation of polaritons in two-dimensional materials that provide maximum light confinement functionalities. The recent breakthrough discovery of in-plane hyperbolicity in the natural van der Waals material has revealed a most exciting optical property which enable an in-p…
▽ More
Perhaps the most significant progress to the field of infrared optics and nanophotonics has been made through the real space realisation of polaritons in two-dimensional materials that provide maximum light confinement functionalities. The recent breakthrough discovery of in-plane hyperbolicity in the natural van der Waals material has revealed a most exciting optical property which enable an in-plane anisotropic dispersion. Yet, the most intriguing feature of in-plane anisotropic dispersion is the manipulation of polaritons at the nano scale. This development has opened a new window of opportunity in order to develop unique nanophotonic devices with unprecedented controls. This chapter will cover these developments with focus on fundamental understandings and progress of real space visualisation of in-plane anisotropic polaritons in the near-field range. The last section will conclude with the future prospects of this rapidly emerging area.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
Conformal optical black hole for cavity
Authors:
Qingtao Ba,
Yangyang Zhou,
Jue Li,
Wen Xiao,
Longfang Ye,
Yineng Liu,
**-hui Chen,
Huanyang Chen
Abstract:
Whispering gallery mode (WGM) cavity is important for exploring physics of strong light-matter interaction. Yet it suffers from the notorious radiation loss universally due to the light tunneling effect through the curved boundary. In this work, we propose and demonstrate an optical black hole (OBH) cavity based on transformation optics. The radiation loss of all WGMs in OBH cavity is completely i…
▽ More
Whispering gallery mode (WGM) cavity is important for exploring physics of strong light-matter interaction. Yet it suffers from the notorious radiation loss universally due to the light tunneling effect through the curved boundary. In this work, we propose and demonstrate an optical black hole (OBH) cavity based on transformation optics. The radiation loss of all WGMs in OBH cavity is completely inhibited by an infinite wide potential barrier. Besides, the WGM field outside the cavity is revealed to follow $1/r^α$ decay rule based on conformal map**, which is fundamentally different from the conventional Hankel-function distributions in a homogeneous cavity. Experimentally, a truncated OBH cavity is achieved based on the effective medium theory, and both the Q-factor enhancement and tightly confined WGM field are measured in the microwave spectra which agree well with the theoretical results. The circular OBH cavity is further applied to the arbitrary-shaped cavities including single-core and multi-core structures with high-Q factor via the conformal map**. The OBH cavity design strategy can be generalized to resonant modes of various wave systems, such as acoustic and elastic waves, and finds applications in energy harvesting and optoelectronics.
△ Less
Submitted 22 May, 2022;
originally announced May 2022.
-
A Generalization of q-Binomial Theorem
Authors:
Qi Bao
Abstract:
By using Liu's $q$-partial differential equations theory, we prove that if an analytic function in several variables satisfies a system of $q$-partial differential equations, if and only if it can be expanded in terms of homogeneous $(q,c)$-Al-Salam-Carlitz polynomials. As an application, we proved that for $c\neq0$ and $\max \{|cq|,|x|\}<1$, \begin{align*} \sum_{n=0}^{\infty} \frac{ (a;q)_n }{(cq…
▽ More
By using Liu's $q$-partial differential equations theory, we prove that if an analytic function in several variables satisfies a system of $q$-partial differential equations, if and only if it can be expanded in terms of homogeneous $(q,c)$-Al-Salam-Carlitz polynomials. As an application, we proved that for $c\neq0$ and $\max \{|cq|,|x|\}<1$, \begin{align*} \sum_{n=0}^{\infty} \frac{ (a;q)_n }{(cq;q)_n}x^n=(ax/c;q)_{\infty} \sum_{n=0}^{\infty} \frac{x^n}{(cq;q)_n}, \end{align*} which is a generalization of famous $q$-binomial theorem or so-called Cauchy theorem.
△ Less
Submitted 30 April, 2022; v1 submitted 25 April, 2022;
originally announced April 2022.
-
AbductionRules: Training Transformers to Explain Unexpected Inputs
Authors:
Nathan Young,
Qiming Bao,
Joshua Bensemann,
Michael Witbrock
Abstract:
Transformers have recently been shown to be capable of reliably performing logical reasoning over facts and rules expressed in natural language, but abductive reasoning - inference to the best explanation of an unexpected observation - has been underexplored despite significant applications to scientific discovery, common-sense reasoning, and model interpretability.
We present AbductionRules, a…
▽ More
Transformers have recently been shown to be capable of reliably performing logical reasoning over facts and rules expressed in natural language, but abductive reasoning - inference to the best explanation of an unexpected observation - has been underexplored despite significant applications to scientific discovery, common-sense reasoning, and model interpretability.
We present AbductionRules, a group of natural language datasets designed to train and test generalisable abduction over natural-language knowledge bases. We use these datasets to finetune pretrained Transformers and discuss their performance, finding that our models learned generalisable abductive techniques but also learned to exploit the structure of our data. Finally, we discuss the viability of this approach to abductive reasoning and ways in which it may be improved in future work.
△ Less
Submitted 23 March, 2022;
originally announced March 2022.
-
Negative reflection of polaritons at the nanoscale in a low-loss natural medium
Authors:
Gonzalo Alvarez-Perez,
Jiahua Duan,
Javier Taboada-Gutierrez,
Qingdong Ou,
Elizaveta Nikulina,
Song Liu,
James H. Edgar,
Qiaoliang Bao,
Vincenzo Giannini,
Rainer Hillenbrand,
J. Martin-Sanchez,
Alexey Y. Nikitin,
Pablo Alonso-Gonzalez
Abstract:
Negative reflection occurs when light is reflected towards the same side of the normal to the boundary from which it is incident. This exotic optical phenomenon, which provides a new avenue towards light manipulation, is not only yet to be visualized in real space but remains largely unexplored both at the nanoscale and in natural media. Here, we directly visualize nanoscale-confined polaritons ne…
▽ More
Negative reflection occurs when light is reflected towards the same side of the normal to the boundary from which it is incident. This exotic optical phenomenon, which provides a new avenue towards light manipulation, is not only yet to be visualized in real space but remains largely unexplored both at the nanoscale and in natural media. Here, we directly visualize nanoscale-confined polaritons negatively reflecting on subwavelength mirrors fabricated in a low-loss van der Waals crystal. Our near-field nanoimaging results unveil an unconventional and broad tunability of both the polaritonic wavelength and direction of propagation upon negative reflection. Based on these findings, we introduce a novel device in nano-optics: a hyperbolic nanoresonator, in which hyperbolic polaritons with different momenta reflect back to a common point source, enhancing its intensity. These results pave the way to realize nanophotonics in low-loss natural media, providing a novel and efficient route to confine and control the flow of light at the nanoscale, key for future optical on-chip nanotechnologies.
△ Less
Submitted 28 February, 2022;
originally announced February 2022.
-
Submultiplicative and Power Submultiplicative Properties for Generalized Hersch-Pfluger Distortion Function
Authors:
Qi Bao
Abstract:
For $a\in(0,1/2]$ and $r\in(0,1)$, and for $K>0$, we investigate submultiplicative and power submultiplicative properties for generalized Hersch-Pfluger distortion function $\varphi_K^a(r)$, which generalize the recent results of Hersch-Pfluger distortion function $\varphi_K(r)$ obtained by Wang, Qiu and Chu.
For $a\in(0,1/2]$ and $r\in(0,1)$, and for $K>0$, we investigate submultiplicative and power submultiplicative properties for generalized Hersch-Pfluger distortion function $\varphi_K^a(r)$, which generalize the recent results of Hersch-Pfluger distortion function $\varphi_K(r)$ obtained by Wang, Qiu and Chu.
△ Less
Submitted 20 February, 2022;
originally announced February 2022.
-
Smart Director: An Event-Driven Directing System for Live Broadcasting
Authors:
Yingwei Pan,
Yue Chen,
Qian Bao,
Ning Zhang,
Ting Yao,
**gen Liu,
Tao Mei
Abstract:
Live video broadcasting normally requires a multitude of skills and expertise with domain knowledge to enable multi-camera productions. As the number of cameras keep increasing, directing a live sports broadcast has now become more complicated and challenging than ever before. The broadcast directors need to be much more concentrated, responsive, and knowledgeable, during the production. To reliev…
▽ More
Live video broadcasting normally requires a multitude of skills and expertise with domain knowledge to enable multi-camera productions. As the number of cameras keep increasing, directing a live sports broadcast has now become more complicated and challenging than ever before. The broadcast directors need to be much more concentrated, responsive, and knowledgeable, during the production. To relieve the directors from their intensive efforts, we develop an innovative automated sports broadcast directing system, called Smart Director, which aims at mimicking the typical human-in-the-loop broadcasting process to automatically create near-professional broadcasting programs in real-time by using a set of advanced multi-view video analysis algorithms. Inspired by the so-called "three-event" construction of sports broadcast, we build our system with an event-driven pipeline consisting of three consecutive novel components: 1) the Multi-view Event Localization to detect events by modeling multi-view correlations, 2) the Multi-view Highlight Detection to rank camera views by the visual importance for view selection, 3) the Auto-Broadcasting Scheduler to control the production of broadcasting videos. To our best knowledge, our system is the first end-to-end automated directing system for multi-camera sports broadcasting, completely driven by the semantic understanding of sports events. It is also the first system to solve the novel problem of multi-view joint event detection by cross-view relation modeling. We conduct both objective and subjective evaluations on a real-world multi-camera soccer dataset, which demonstrate the quality of our auto-generated videos is comparable to that of the human-directed. Thanks to its faster response, our system is able to capture more fast-passing and short-duration events which are usually missed by human directors.
△ Less
Submitted 11 January, 2022;
originally announced January 2022.
-
Tailoring topological transition of anisotropic polaritons by interface engineering in biaxial crystals
Authors:
Yali Zeng,
Qingdong Ou,
Lu Liu,
Chunqi Zheng,
Ziyu Wang,
Youning Gong,
Xiang Liang,
Yupeng Zhang,
Guangwei Hu,
Zhilin Yang,
Cheng-Wei Qiu,
Qiaoliang Bao,
Huanyang Chen,
Zhigao Dai
Abstract:
Polaritons in polar biaxial crystals with extreme anisotropy offer a promising route to manipulate nanoscale light-matter interactions. The dynamical modulation of their dispersion is great significance for future integrated nano-optics but remains challenging. Here, we report a momentum-directed strategy, a coupling between the modes with extra momentum supported by the interface and in-plane hyp…
▽ More
Polaritons in polar biaxial crystals with extreme anisotropy offer a promising route to manipulate nanoscale light-matter interactions. The dynamical modulation of their dispersion is great significance for future integrated nano-optics but remains challenging. Here, we report a momentum-directed strategy, a coupling between the modes with extra momentum supported by the interface and in-plane hyperbolic polaritons, to tailor topological transitions of anisotropic polaritons in biaxial crystals. We experimentally demonstrate such tailored polaritons at the interface of heterostructures between graphene and α-phase molybdenum trioxide (α-MoO3). The interlayer coupling can be electrically modulated by changing the Fermi level in graphene, enabling a dynamic topological transition. More interestingly, we found that the topological transition occurs at a constant Fermi level when tuning the thickness of α-MoO3. The momentum-directed strategy implemented by interface engineering offers new insights for optical topological transitions, which may shed new light for programmable polaritonics, energy transfer and neuromorphic photonics.
△ Less
Submitted 4 January, 2022;
originally announced January 2022.
-
Relating Blindsight and AI: A Review
Authors:
Joshua Bensemann,
Qiming Bao,
Gaël Gendron,
Tim Hartill,
Michael Witbrock
Abstract:
Processes occurring in brains, a.k.a. biological neural networks, can and have been modeled within artificial neural network architectures. Due to this, we have conducted a review of research on the phenomenon of blindsight in an attempt to generate ideas for artificial intelligence models. Blindsight can be considered as a diminished form of visual experience. If we assume that artificial network…
▽ More
Processes occurring in brains, a.k.a. biological neural networks, can and have been modeled within artificial neural network architectures. Due to this, we have conducted a review of research on the phenomenon of blindsight in an attempt to generate ideas for artificial intelligence models. Blindsight can be considered as a diminished form of visual experience. If we assume that artificial networks have no form of visual experience, then deficits caused by blindsight give us insights into the processes occurring within visual experience that we can incorporate into artificial neural networks. This article has been structured into three parts. Section 2 is a review of blindsight research, looking specifically at the errors occurring during this condition compared to normal vision. Section 3 identifies overall patterns from Section 2 to generate insights for computational models of vision. Section 4 demonstrates the utility of examining biological research to inform artificial intelligence research by examining computation models of visual attention relevant to one of the insights generated in Section 3. The research covered in Section 4 shows that incorporating one of our insights into computational vision does benefit those models. Future research will be required to determine whether our other insights are as valuable.
△ Less
Submitted 8 December, 2021;
originally announced January 2022.
-
RFormer: Transformer-based Generative Adversarial Network for Real Fundus Image Restoration on A New Clinical Benchmark
Authors:
Zhuo Deng,
Yuanhao Cai,
Lu Chen,
Zheng Gong,
Qiqi Bao,
Xue Yao,
Dong Fang,
Shaochong Zhang,
Lan Ma
Abstract:
Ophthalmologists have used fundus images to screen and diagnose eye diseases. However, different equipments and ophthalmologists pose large variations to the quality of fundus images. Low-quality (LQ) degraded fundus images easily lead to uncertainty in clinical screening and generally increase the risk of misdiagnosis. Thus, real fundus image restoration is worth studying. Unfortunately, real cli…
▽ More
Ophthalmologists have used fundus images to screen and diagnose eye diseases. However, different equipments and ophthalmologists pose large variations to the quality of fundus images. Low-quality (LQ) degraded fundus images easily lead to uncertainty in clinical screening and generally increase the risk of misdiagnosis. Thus, real fundus image restoration is worth studying. Unfortunately, real clinical benchmark has not been explored for this task so far. In this paper, we investigate the real clinical fundus image restoration problem. Firstly, We establish a clinical dataset, Real Fundus (RF), including 120 low- and high-quality (HQ) image pairs. Then we propose a novel Transformer-based Generative Adversarial Network (RFormer) to restore the real degradation of clinical fundus images. The key component in our network is the Window-based Self-Attention Block (WSAB) which captures non-local self-similarity and long-range dependencies. To produce more visually pleasant results, a Transformer-based discriminator is introduced. Extensive experiments on our clinical benchmark show that the proposed RFormer significantly outperforms the state-of-the-art (SOTA) methods. In addition, experiments of downstream tasks such as vessel segmentation and optic disc/cup detection demonstrate that our proposed RFormer benefits clinical fundus image analysis and applications. The dataset, code, and models are publicly available at https://github.com/dengzhuo-AI/Real-Fundus
△ Less
Submitted 3 August, 2022; v1 submitted 2 January, 2022;
originally announced January 2022.
-
Monotonicity Properties of Gaussian Hypergeometric Functions with Respect to the Parameter
Authors:
Qi Bao,
Miao-Kun Wang,
AND Song-Liang Qiu
Abstract:
The authors establish the necessary and sufficient conditions under which certain combinations of Gaussian hypergeometric function and elementary function are monotone in the parameter, which generalize the recent results of generalized elliptic integrals of the first and second kinds obtained by Qiu et al. Moreover, the authors also prove two monotonicity theorems of generalized elliptic integral…
▽ More
The authors establish the necessary and sufficient conditions under which certain combinations of Gaussian hypergeometric function and elementary function are monotone in the parameter, which generalize the recent results of generalized elliptic integrals of the first and second kinds obtained by Qiu et al. Moreover, the authors also prove two monotonicity theorems of generalized elliptic integrals from another point of view.
△ Less
Submitted 27 December, 2021;
originally announced December 2021.
-
Putting People in their Place: Monocular Regression of 3D People in Depth
Authors:
Yu Sun,
Wu Liu,
Qian Bao,
Yili Fu,
Tao Mei,
Michael J. Black
Abstract:
Given an image with multiple people, our goal is to directly regress the pose and shape of all the people as well as their relative depth. Inferring the depth of a person in an image, however, is fundamentally ambiguous without knowing their height. This is particularly problematic when the scene contains people of very different sizes, e.g. from infants to adults. To solve this, we need several t…
▽ More
Given an image with multiple people, our goal is to directly regress the pose and shape of all the people as well as their relative depth. Inferring the depth of a person in an image, however, is fundamentally ambiguous without knowing their height. This is particularly problematic when the scene contains people of very different sizes, e.g. from infants to adults. To solve this, we need several things. First, we develop a novel method to infer the poses and depth of multiple people in a single image. While previous work that estimates multiple people does so by reasoning in the image plane, our method, called BEV, adds an additional imaginary Bird's-Eye-View representation to explicitly reason about depth. BEV reasons simultaneously about body centers in the image and in depth and, by combing these, estimates 3D body position. Unlike prior work, BEV is a single-shot method that is end-to-end differentiable. Second, height varies with age, making it impossible to resolve depth without also estimating the age of people in the image. To do so, we exploit a 3D body model space that lets BEV infer shapes from infants to adults. Third, to train BEV, we need a new dataset. Specifically, we create a "Relative Human" (RH) dataset that includes age labels and relative depth relationships between the people in the images. Extensive experiments on RH and AGORA demonstrate the effectiveness of the model and training scheme. BEV outperforms existing methods on depth reasoning, child shape estimation, and robustness to occlusion. The code and dataset are released for research purposes.
△ Less
Submitted 19 April, 2022; v1 submitted 15 December, 2021;
originally announced December 2021.
-
Discussion on phase ambiguity and multiple beam generation in coherent beam combining system
Authors:
H. Jia,
J. Zuo,
Q. Bao,
C. Geng,
A. Tang,
Y. Luo,
Z. Li,
J. Jiang,
F. Li,
F. Zou,
X. Yang,
Z. Pan,
J. Jiang,
J. Ren,
X. Li
Abstract:
There exists the phase ambiguity problem in the coherent beam combining (CBC) system with centrosymmetric arrays, which means that multiple different piston aberrations may generate the same far-field image. This will cause that the far-field image can not correctly reflect the phase information, resulting in the performance degradation of image-based intelligent algorithms. In this paper, we make…
▽ More
There exists the phase ambiguity problem in the coherent beam combining (CBC) system with centrosymmetric arrays, which means that multiple different piston aberrations may generate the same far-field image. This will cause that the far-field image can not correctly reflect the phase information, resulting in the performance degradation of image-based intelligent algorithms. In this paper, we make a theoretical analysis on phase ambiguity. To the best of our knowledge, we give the number and descriptions of all solutions of the phase ambiguity problem in above system for the first time. A method to solve phase ambiguity is proposed, which requires no additional optical devices. We designed simulations to verify our conclusions and methods. We believe that our work solves the phase ambiguity problem in theory and is conducive to improving the performance of image-based algorithms. In addition, we designed a two-stage algorithm to generate Bi-beam, which have valuables application in laser propagation.
△ Less
Submitted 1 December, 2021; v1 submitted 24 November, 2021;
originally announced November 2021.
-
DeepQR: Neural-based Quality Ratings for Learnersourced Multiple-Choice Questions
Authors:
Lin Ni,
Qiming Bao,
Xiaoxuan Li,
Qianqian Qi,
Paul Denny,
Jim Warren,
Michael Witbrock,
Jiamou Liu
Abstract:
Automated question quality rating (AQQR) aims to evaluate question quality through computational means, thereby addressing emerging challenges in online learnersourced question repositories. Existing methods for AQQR rely solely on explicitly-defined criteria such as readability and word count, while not fully utilising the power of state-of-the-art deep-learning techniques. We propose DeepQR, a n…
▽ More
Automated question quality rating (AQQR) aims to evaluate question quality through computational means, thereby addressing emerging challenges in online learnersourced question repositories. Existing methods for AQQR rely solely on explicitly-defined criteria such as readability and word count, while not fully utilising the power of state-of-the-art deep-learning techniques. We propose DeepQR, a novel neural-network model for AQQR that is trained using multiple-choice-question (MCQ) datasets collected from PeerWise, a widely-used learnersourcing platform. Along with designing DeepQR, we investigate models based on explicitly-defined features, or semantic features, or both. We also introduce a self-attention mechanism to capture semantic correlations between MCQ components, and a contrastive-learning approach to acquire question representations using quality ratings. Extensive experiments on datasets collected from eight university-level courses illustrate that DeepQR has superior performance over six comparative models.
△ Less
Submitted 19 November, 2021;
originally announced November 2021.
-
Role Similarity Metric Based on Spanning Rooted Forest
Authors:
Qi Bao,
Zhongzhi Zhang,
Haibin Kan
Abstract:
As a fundamental issue in network analysis, structural node similarity has received much attention in academia and is adopted in a wide range of applications. Among these proposed structural node similarity measures, role similarity stands out because of satisfying several axiomatic properties including automorphism conformation. Existing role similarity metrics cannot handle top-k queries on larg…
▽ More
As a fundamental issue in network analysis, structural node similarity has received much attention in academia and is adopted in a wide range of applications. Among these proposed structural node similarity measures, role similarity stands out because of satisfying several axiomatic properties including automorphism conformation. Existing role similarity metrics cannot handle top-k queries on large real-world networks due to the high time and space cost. In this paper, we propose a new role similarity metric, namely \textsf{ForestSim}. We prove that \textsf{ForestSim} is an admissible role similarity metric and devise the corresponding top-k similarity search algorithm, namely \textsf{ForestSimSearch}, which is able to process a top-k query in $O(k)$ time once the precomputation is finished. Moreover, we speed up the precomputation by using a fast approximate algorithm to compute the diagonal entries of the forest matrix, which reduces the time and space complexity of the precomputation to $O(ε^{-2}m\log^5{n}\log{\frac{1}ε})$ and $O(m\log^3{n})$, respectively. Finally, we conduct extensive experiments on 26 real-world networks. The results show that \textsf{ForestSim} works efficiently on million-scale networks and achieves comparable performance to the state-of-art methods.
△ Less
Submitted 1 April, 2024; v1 submitted 15 October, 2021;
originally announced October 2021.
-
LUAI Challenge 2021 on Learning to Understand Aerial Images
Authors:
Gui-Song Xia,
Jian Ding,
Ming Qian,
Nan Xue,
Jiaming Han,
Xiang Bai,
Michael Ying Yang,
Shengyang Li,
Serge Belongie,
Jiebo Luo,
Mihai Datcu,
Marcello Pelillo,
Liangpei Zhang,
Qiang Zhou,
Chao-hui Yu,
Kaixuan Hu,
Yingjia Bu,
Wenming Tan,
Zhe Yang,
Wei Li,
Shang Liu,
Jiaxuan Zhao,
Tianzhi Ma,
Zi-han Gao,
Lingqi Wang
, et al. (11 additional authors not shown)
Abstract:
This report summarizes the results of Learning to Understand Aerial Images (LUAI) 2021 challenge held on ICCV 2021, which focuses on object detection and semantic segmentation in aerial images. Using DOTA-v2.0 and GID-15 datasets, this challenge proposes three tasks for oriented object detection, horizontal object detection, and semantic segmentation of common categories in aerial images. This cha…
▽ More
This report summarizes the results of Learning to Understand Aerial Images (LUAI) 2021 challenge held on ICCV 2021, which focuses on object detection and semantic segmentation in aerial images. Using DOTA-v2.0 and GID-15 datasets, this challenge proposes three tasks for oriented object detection, horizontal object detection, and semantic segmentation of common categories in aerial images. This challenge received a total of 146 registrations on the three tasks. Through the challenge, we hope to draw attention from a wide range of communities and call for more efforts on the problems of learning to understand aerial images.
△ Less
Submitted 17 September, 2021; v1 submitted 30 August, 2021;
originally announced August 2021.
-
Ideal type-II Weyl points in twisted one-dimensional dielectric photonic crystals
Authors:
Ying Chen,
Hai-xiao Wang,
Qiaoliang Bao,
Jian-Hua Jiang,
Huanyang Chen
Abstract:
Weyl points are the degenerate points in three-dimensional momentum space with nontrivial topological phase, which are usually realized in classical system with structure and symmetry designs. Here we proposed a one-dimensional layer-stacked photonic crystal using anisotropic materials to realize ideal type-II Weyl points without structure designs. The topological transition from two Dirac points…
▽ More
Weyl points are the degenerate points in three-dimensional momentum space with nontrivial topological phase, which are usually realized in classical system with structure and symmetry designs. Here we proposed a one-dimensional layer-stacked photonic crystal using anisotropic materials to realize ideal type-II Weyl points without structure designs. The topological transition from two Dirac points to four Weyl points can be clearly observed by tuning the twist angle between layers. Besides, on the interface between the photonic type-II Weyl material and air, gappless surface states have also been demonstrated in an incomplete bulk bandgap. By breaking parameter symmetry, these ideal type-II Weyl points at the same frequency would transform into the non-ideal ones, and exhibit topological surface states with single group velocity. Our work may provide a new idea for the realization of photonic Weyl points or other semimetal phases by utilizing naturally anisotropic materials.
△ Less
Submitted 18 July, 2021;
originally announced July 2021.
-
Sharp double inequality for complete elliptic integral of the first kind
Authors:
Qi Bao
Abstract:
For $r\in(0,1)$, the function $\K(r)=\int_0^{π/2}(1-r^2\sin^2t)^{-1/2}dt$ is known as the complete elliptic integral of the first kind. In this paper, we prove the absolute monotonicity of two functions involving $\K(r)$. As a consequence, we improve Alzer and Richards' result.
For $r\in(0,1)$, the function $\K(r)=\int_0^{π/2}(1-r^2\sin^2t)^{-1/2}dt$ is known as the complete elliptic integral of the first kind. In this paper, we prove the absolute monotonicity of two functions involving $\K(r)$. As a consequence, we improve Alzer and Richards' result.
△ Less
Submitted 17 March, 2021;
originally announced April 2021.
-
Recent Advances in Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective
Authors:
Wu Liu,
Qian Bao,
Yu Sun,
Tao Mei
Abstract:
Estimation of the human pose from a monocular camera has been an emerging research topic in the computer vision community with many applications. Recently, benefited from the deep learning technologies, a significant amount of research efforts have greatly advanced the monocular human pose estimation both in 2D and 3D areas. Although there have been some works to summarize the different approaches…
▽ More
Estimation of the human pose from a monocular camera has been an emerging research topic in the computer vision community with many applications. Recently, benefited from the deep learning technologies, a significant amount of research efforts have greatly advanced the monocular human pose estimation both in 2D and 3D areas. Although there have been some works to summarize the different approaches, it still remains challenging for researchers to have an in-depth view of how these approaches work. In this paper, we provide a comprehensive and holistic 2D-to-3D perspective to tackle this problem. We categorize the mainstream and milestone approaches since the year 2014 under unified frameworks. By systematically summarizing the differences and connections between these approaches, we further analyze the solutions for challenging cases, such as the lack of data, the inherent ambiguity between 2D and 3D, and the complex multi-person scenarios. We also summarize the pose representation styles, benchmarks, evaluation metrics, and the quantitative performance of popular approaches. Finally, we discuss the challenges and give deep thinking of promising directions for future research. We believe this survey will provide the readers with a deep and insightful understanding of monocular human pose estimation.
△ Less
Submitted 23 April, 2021;
originally announced April 2021.
-
Deep learning piston aberration control of fiber laser phased array by spiral phase modulation
Authors:
**g Zuo,
Haolong Jia,
Chao Geng,
Qiliang Bao,
Feng Li,
ZIQIANG LI,
**g Jiang,
Yunxia Xia,
Fan Zou,
Xinyang Li
Abstract:
The stochastic parallel gradient descent (SPGD) algorithm is usually employed as the control strategy for phase-locking in fiber laser phased array systems. However, the convergence speed of the SPGD algorithm will slow down as the number of array elements increases. To improve the control bandwidth, the convolutional neural network is introduced to quickly calculate the initial piston aberration…
▽ More
The stochastic parallel gradient descent (SPGD) algorithm is usually employed as the control strategy for phase-locking in fiber laser phased array systems. However, the convergence speed of the SPGD algorithm will slow down as the number of array elements increases. To improve the control bandwidth, the convolutional neural network is introduced to quickly calculate the initial piston aberration in a single step. In addition, the irrationality of the commonly used Mean Square Error (MSE) evaluation function in existing convolutional neural networks is analyzed. A new evaluation function NPCD (Normalized Phase Cosine Distance) is proposed to improve the accuracy of the neural networks. The results show that the piston aberration residual is 0.005 and the power in the bucket (PIB) is 0.993 after accurate preliminary compensation, which means that the system directly enters the co-phase state. We also demonstrate the robustness and scalability by adding additional disturbance and expanding the scale of the array.
△ Less
Submitted 20 June, 2021; v1 submitted 31 March, 2021;
originally announced March 2021.
-
Focusing of in-plane hyperbolic polaritons in van der Waals crystals with tailored infrared nanoantennas
Authors:
Javier Martín-Sánchez,
Jiahua Duan,
Javier Taboada-Gutiérrez,
Gonzalo Álvarez-Pérez,
Kirill V. Voronin,
Iván Prieto,
Weiliang Ma,
Qiaoliang Bao,
Valentyn S. Volkov,
Rainer Hillenbrand,
Alexey Y. Nikitin,
Pablo Alonso-González
Abstract:
Phonon polaritons (PhPs),light coupled to lattice vibrations,with in-plane hyperbolic dispersion exhibit ray-like propagation with large wavevectors and enhanced density of optical states along certain directions on a surface. As such, they have raised a surge of interest as they promise unprecedented possibilities for the manipulation of infrared light with planar circuitry and at the nanoscale.…
▽ More
Phonon polaritons (PhPs),light coupled to lattice vibrations,with in-plane hyperbolic dispersion exhibit ray-like propagation with large wavevectors and enhanced density of optical states along certain directions on a surface. As such, they have raised a surge of interest as they promise unprecedented possibilities for the manipulation of infrared light with planar circuitry and at the nanoscale. Here, we demonstrate, for the first time, the focusing of in-plane hyperbolic PhPs propagating along thin slabs of MoO3. To that end, we developed metallic nanoantennas of convex geometries for both the efficient launching and focusing of the polaritons. Remarkably, the foci obtained exhibit enhanced near-field confinement and absorption compared to foci produced by in-plane isotropic PhPs. More intriguingly, foci sizes as small as lamdap/5 =lamda0/50 were achieved (lamdap is the polariton wavelength and lamda0 the photon wavelength). Focusing of in-plane hyperbolic polaritons introduces a first and most basic building block develo** planar polariton optics utilizing in-plane anisotropic van der Waals materials and metasurfaces.
△ Less
Submitted 19 March, 2021;
originally announced March 2021.
-
Hybridized hyperbolic surface phonon polaritons at α-MoO3 and polar dielectric interfaces
Authors:
Qing Zhang,
Qingdong Ou,
Guangwei Hu,
**gying Liu,
Zhigao Dai,
Michael S. Fuhrer,
Qiaoliang Bao,
Cheng-Wei Qiu
Abstract:
Surface phonon polaritons (SPhPs) in polar dielectrics offer new opportunities for infrared nanophotonics due to sub-diffraction confinement with low optical losses. Though the polaritonic field confinement can be significantly improved by modifying the dielectric environment, it is challenging to break the fundamental limits in photon confinement and propagation behavior of SPhP modes. In particu…
▽ More
Surface phonon polaritons (SPhPs) in polar dielectrics offer new opportunities for infrared nanophotonics due to sub-diffraction confinement with low optical losses. Though the polaritonic field confinement can be significantly improved by modifying the dielectric environment, it is challenging to break the fundamental limits in photon confinement and propagation behavior of SPhP modes. In particular, as SPhPs inherently propagate isotropically in these bulk polar dielectrics, how to collectively realize ultra-large field confinement, in-plane hyperbolicity and unidirectional propagation remains elusive. Here, we report an approach to solve the aforementioned issues of bulk polar dielectric's SPhPs at one go by constructing a heterostructural interface between biaxial van der Waals material (e.g., MoO3) and bulk polar dielectric (e.g., SiC, AlN, and GaN). Due to anisotropy-oriented mode couplings at the interface, the hybridized SPhPs with a large confinement factor (>100) show in-plane hyperbolicity that has been switched to the orthogonal direction as compared to that in natural MoO3. More interestingly, this proof of concept allows steerable, angle-dependent and unidirectional polariton excitation by suspending MoO3 on patterned SiC air cavities. Our finding exemplifies a generalizable framework to manipulate the flow of nano-light and engineer unusual polaritonic responses in many other hybrid systems consisting of van der Waals materials and bulk polar dielectrics.
△ Less
Submitted 17 March, 2021;
originally announced March 2021.
-
On a Conjecture Concerning the Approximates of Complete Elliptic Integral of the First Kind by Inverse Hyperbolic Tangent
Authors:
Song-Liang Qiu,
Qi Bao,
Xiao-Yan Ma,
Hong-Biao Jiang
Abstract:
Let $\K$ be the complete elliptic integral of the first kind. In this paper, the authors prove that the function $r\mapsto r^{-2}\{[\log(2\K(r)/π)]/\log((\arth r)/r)-3/4\}$ is strictly increasing from $(0,1)$ onto $(1/320,1/4)$, so that $[(\arth r)/r]^{3/4+r^2/320}<2\K(r)/π<[(\arth r)/r]^{3/4+r^2/4}$ for $r\in(0,1)$, in which all the coefficients of the exponents of the two bounds are best possibl…
▽ More
Let $\K$ be the complete elliptic integral of the first kind. In this paper, the authors prove that the function $r\mapsto r^{-2}\{[\log(2\K(r)/π)]/\log((\arth r)/r)-3/4\}$ is strictly increasing from $(0,1)$ onto $(1/320,1/4)$, so that $[(\arth r)/r]^{3/4+r^2/320}<2\K(r)/π<[(\arth r)/r]^{3/4+r^2/4}$ for $r\in(0,1)$, in which all the coefficients of the exponents of the two bounds are best possible, thus proving a conjecture raised by Alzer and Qiu to be true, and giving better bounds of $\K(r)$ than those they conjectured and put in an open problem. Some other analytic properties of the complete elliptic integrals, including other kind of approximates for $\K(r)$, are obtained, too.
△ Less
Submitted 6 March, 2021;
originally announced March 2021.
-
Vector Optimization with Domination Structures: Variational Principles and Applications
Authors:
Truong Q. Bao,
Boris S. Mordukhovich,
Antoine Soubeyran,
Christiane Tammer
Abstract:
This paper addresses a large class of vector optimization problems in infinite-dimensional spaces with respect to two important binary relations derived from domination structures. Motivated by theoretical challenges as well as by applications to some models in behavioral sciences, we establish new variational principles that can be viewed as far-going extensions of the Ekeland variational princip…
▽ More
This paper addresses a large class of vector optimization problems in infinite-dimensional spaces with respect to two important binary relations derived from domination structures. Motivated by theoretical challenges as well as by applications to some models in behavioral sciences, we establish new variational principles that can be viewed as far-going extensions of the Ekeland variational principle to cover domination vector settings. Our approach combines advantages of both primal and dual techniques in variational analysis with providing useful suficient conditions for the existence of variational traps in behavioral science models with variable domination structures.
△ Less
Submitted 16 February, 2021;
originally announced February 2021.
-
Survey on Aerial Radio Access Networks: Toward a Comprehensive 6G Access Infrastructure
Authors:
Nhu-Ngoc Dao,
Quoc-Viet Pham,
Ngo Hoang Tu,
Tran Thien Thanh,
Vo Nguyen Quoc Bao,
Demeke Shumeye Lakew,
Sungrae Cho
Abstract:
Current network access infrastructures are characterized by heterogeneity, low latency, high throughput, and high computational capability, enabling massive concurrent connections and various services. Unfortunately, this design does not pay significant attention to mobile services in underserved areas. In this context, the use of aerial radio access networks (ARANs) is a promising strategy to com…
▽ More
Current network access infrastructures are characterized by heterogeneity, low latency, high throughput, and high computational capability, enabling massive concurrent connections and various services. Unfortunately, this design does not pay significant attention to mobile services in underserved areas. In this context, the use of aerial radio access networks (ARANs) is a promising strategy to complement existing terrestrial communication systems. Involving airborne components such as unmanned aerial vehicles, drones, and satellites, ARANs can quickly establish a flexible access infrastructure on demand. ARANs are expected to support the development of seamless mobile communication systems toward a comprehensive sixth-generation (6G) global access infrastructure. This paper provides an overview of recent studies regarding ARANs in the literature. First, we investigate related work to identify areas for further exploration in terms of recent knowledge advancements and analyses. Second, we define the scope and methodology of this study. Then, we describe ARAN architecture and its fundamental features for the development of 6G networks. In particular, we analyze the system model from several perspectives, including transmission propagation, energy consumption, communication latency, and network mobility. Furthermore, we introduce technologies that enable the success of ARAN implementations in terms of energy replenishment, operational management, and data delivery. Subsequently, we discuss application scenarios envisioned for these technologies. Finally, we highlight ongoing research efforts and trends toward 6G ARANs.
△ Less
Submitted 27 February, 2021; v1 submitted 14 February, 2021;
originally announced February 2021.
-
Fast Evaluation for Relevant Quantities of Opinion Dynamics
Authors:
Wanyue Xu,
Qi Bao,
Zhongzhi Zhang
Abstract:
One of the main subjects in the field of social networks is to quantify conflict, disagreement, controversy, and polarization, and some quantitative indicators have been developed to quantify these concepts. However, direct computation of these indicators involves the operations of matrix inversion and multiplication, which make it computationally infeasible for large-scale graphs with millions of…
▽ More
One of the main subjects in the field of social networks is to quantify conflict, disagreement, controversy, and polarization, and some quantitative indicators have been developed to quantify these concepts. However, direct computation of these indicators involves the operations of matrix inversion and multiplication, which make it computationally infeasible for large-scale graphs with millions of nodes. In this paper, by reducing the problem of computing relevant quantities to evaluating $\ell_2$ norms of some vectors, we present a nearly linear time algorithm to estimate all these quantities. Our algorithm is based on the Laplacian solvers, and has a proved theoretical guarantee of error for each quantity. We execute extensive numerical experiments on a variety of real networks, which demonstrate that our approximation algorithm is efficient and effective, scalable to large graphs having millions of nodes.
△ Less
Submitted 12 June, 2021; v1 submitted 20 January, 2021;
originally announced January 2021.
-
LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification
Authors:
Jiangjie Chen,
Qiaoben Bao,
Changzhi Sun,
Xinbo Zhang,
Jiaze Chen,
Hao Zhou,
Yanghua Xiao,
Lei Li
Abstract:
Given a natural language statement, how to verify its veracity against a large-scale textual knowledge source like Wikipedia? Most existing neural models make predictions without giving clues about which part of a false claim goes wrong. In this paper, we propose LOREN, an approach for interpretable fact verification. We decompose the verification of the whole claim at phrase-level, where the vera…
▽ More
Given a natural language statement, how to verify its veracity against a large-scale textual knowledge source like Wikipedia? Most existing neural models make predictions without giving clues about which part of a false claim goes wrong. In this paper, we propose LOREN, an approach for interpretable fact verification. We decompose the verification of the whole claim at phrase-level, where the veracity of the phrases serves as explanations and can be aggregated into the final verdict according to logical rules. The key insight of LOREN is to represent claim phrase veracity as three-valued latent variables, which are regularized by aggregation logical rules. The final claim verification is based on all latent variables. Thus, LOREN enjoys the additional benefit of interpretability -- it is easy to explain how it reaches certain results with claim phrase veracity. Experiments on a public fact verification benchmark show that LOREN is competitive against previous approaches while enjoying the merit of faithful and accurate interpretability. The resources of LOREN are available at: https://github.com/jiangjiechen/LOREN.
△ Less
Submitted 9 December, 2021; v1 submitted 25 December, 2020;
originally announced December 2020.