-
Brant-2: Foundation Model for Brain Signals
Authors:
Zhizhang Yuan,
Daoze Zhang,
Junru Chen,
Gefei Gu,
Yang Yang
Abstract:
Foundational models benefit from pre-training on large amounts of unlabeled data and enable strong performance in a wide variety of applications with a small amount of labeled data. Such models can be particularly effective in analyzing brain signals, as this field encompasses numerous application scenarios, and it is costly to perform large-scale annotation. In this work, we present the largest f…
▽ More
Foundational models benefit from pre-training on large amounts of unlabeled data and enable strong performance in a wide variety of applications with a small amount of labeled data. Such models can be particularly effective in analyzing brain signals, as this field encompasses numerous application scenarios, and it is costly to perform large-scale annotation. In this work, we present the largest foundation model in brain signals, Brant-2. Compared to Brant, a foundation model designed for intracranial neural signals, Brant-2 not only exhibits robustness towards data variations and modeling scales but also can be applied to a broader range of brain neural data. By experimenting on an extensive range of tasks, we demonstrate that Brant-2 is adaptive to various application scenarios in brain signals. Further analyses reveal the scalability of the Brant-2, validate each component's effectiveness, and showcase our model's ability to maintain performance in scenarios with scarce labels.
△ Less
Submitted 28 March, 2024; v1 submitted 15 February, 2024;
originally announced February 2024.
-
Twofold Structured Features-Based Siamese Network for Infrared Target Tracking
Authors:
Wei-Jie Yan,
Yun-Kai Xu,
Qian Chen,
Xiao-Fang Kong,
Guo-Hua Gu,
A-Jun Shao,
Min-Jie Wan
Abstract:
Nowadays, infrared target tracking has been a critical technology in the field of computer vision and has many applications, such as motion analysis, pedestrian surveillance, intelligent detection, and so forth. Unfortunately, due to the lack of color, texture and other detailed information, tracking drift often occurs when the tracker encounters infrared targets that vary in size or shape. To add…
▽ More
Nowadays, infrared target tracking has been a critical technology in the field of computer vision and has many applications, such as motion analysis, pedestrian surveillance, intelligent detection, and so forth. Unfortunately, due to the lack of color, texture and other detailed information, tracking drift often occurs when the tracker encounters infrared targets that vary in size or shape. To address this issue, we present a twofold structured features-based Siamese network for infrared target tracking. First of all, in order to improve the discriminative capacity for infrared targets, a novel feature fusion network is proposed to fuse both shallow spatial information and deep semantic information into the extracted features in a comprehensive manner. Then, a multi-template update module based on template update mechanism is designed to effectively deal with interferences from target appearance changes which are prone to cause early tracking failures. Finally, both qualitative and quantitative experiments are carried out on VOT-TIR 2016 dataset, which demonstrates that our method achieves the balance of promising tracking performance and real-time tracking speed against other out-of-the-art trackers.
△ Less
Submitted 26 June, 2024; v1 submitted 31 August, 2023;
originally announced August 2023.
-
Fast optical refocusing through multimode fiber bend using Cake-Cutting Hadamard encoding algorithm to improve robustness
Authors:
Chuncheng Zhang,
Zheyi Yao,
Zhengyue Qin,
Guohua Gu,
Qian Chen,
Zhihua Xie,
Guodong Liu,
Xiubao Sui
Abstract:
Multimode fibres offer the advantages of high resolution and miniaturization over single mode fibers in the field of optical imaging. However, multimode fibre's imaging is susceptible to perturbations of MMF that can lead to secondary spatial distortions in the transmitted image. Perturbations include random disturbances in the fiber as well as environmental noise. Here, we exploit the fast focusi…
▽ More
Multimode fibres offer the advantages of high resolution and miniaturization over single mode fibers in the field of optical imaging. However, multimode fibre's imaging is susceptible to perturbations of MMF that can lead to secondary spatial distortions in the transmitted image. Perturbations include random disturbances in the fiber as well as environmental noise. Here, we exploit the fast focusing capability of the Cake-Cutting Hadamard coding algorithm to counteract the effects of perturbations and improve the system's robustness. Simulation shows that it can approach the theoretical enhancement at 2000 measurements. Experimental results show that the algorithm can help the system to refocus in a short time when MMFs are perturbed. This research will further contribute to using multimode fibres in medicine, communication, and detection.
△ Less
Submitted 27 July, 2022;
originally announced July 2022.
-
Optimal Stationary State Estimation Over Multiple Markovian Packet Drop Channels
Authors:
Jiapeng Xu,
Guoxiang Gu,
Vijay Gupta,
Yang Tang
Abstract:
In this paper, we investigate the state estimation problem over multiple Markovian packet drop channels. In this problem setup, a remote estimator receives measurement data transmitted from multiple sensors over individual channels. By the method of Markovian jump linear systems, an optimal stationary estimator that minimizes the error variance in the steady state is obtained, based on the mean-sq…
▽ More
In this paper, we investigate the state estimation problem over multiple Markovian packet drop channels. In this problem setup, a remote estimator receives measurement data transmitted from multiple sensors over individual channels. By the method of Markovian jump linear systems, an optimal stationary estimator that minimizes the error variance in the steady state is obtained, based on the mean-square (MS) stabilizing solution to the coupled algebraic Riccati equations. An explicit necessary and sufficient condition is derived for the existence of the MS stabilizing solution, which coincides with that of the standard Kalman filter. More importantly, we provide a sufficient condition under which the MS detectability with multiple Markovian packet drop channels can be decoupled, and propose a locally optimal stationary estimator but computationally more tractable. Analytic sufficient and necessary MS detectability conditions are presented for the decoupled subsystems subsequently. Finally, numerical simulations are conducted to illustrate the results on the MS stabilizing solution, the MS detectability, and the performance of the optimal and locally optimal stationary estimators.
△ Less
Submitted 5 March, 2021;
originally announced March 2021.
-
Practical Speech Re-use Prevention in Voice-driven Services
Authors:
Yangyong Zhang,
Maliheh Shirvanian,
Sunpreet S. Arora,
Jianwei Huang,
Guofei Gu
Abstract:
Voice-driven services (VDS) are being used in a variety of applications ranging from smart home control to payments using digital assistants. The input to such services is often captured via an open voice channel, e.g., using a microphone, in an unsupervised setting. One of the key operational security requirements in such setting is the freshness of the input speech. We present AEOLUS, a security…
▽ More
Voice-driven services (VDS) are being used in a variety of applications ranging from smart home control to payments using digital assistants. The input to such services is often captured via an open voice channel, e.g., using a microphone, in an unsupervised setting. One of the key operational security requirements in such setting is the freshness of the input speech. We present AEOLUS, a security overlay that proactively embeds a dynamic acoustic nonce at the time of user interaction, and detects the presence of the embedded nonce in the recorded speech to ensure freshness. We demonstrate that acoustic nonce can (i) be reliably embedded and retrieved, and (ii) be non-disruptive (and even imperceptible) to a VDS user. Optimal parameters (acoustic nonce's operating frequency, amplitude, and bitrate) are determined for (i) and (ii) from a practical perspective. Experimental results show that AEOLUS yields 0.5% FRR at 0% FAR for speech re-use prevention upto a distance of 4 meters in three real-world environments with different background noise levels. We also conduct a user study with 120 participants, which shows that the acoustic nonce does not degrade overall user experience for 94.16% of speech samples, on average, in these environments. AEOLUS can therefore be used in practice to prevent speech re-use and ensure the freshness of speech input.
△ Less
Submitted 12 January, 2021;
originally announced January 2021.
-
Fringe pattern analysis using deep learning
Authors:
Shijie Feng,
Qian Chen,
Guohua Gu,
Tianyang Tao,
Liang Zhang,
Yan Hu,
Wei Yin,
Chao Zuo
Abstract:
In many optical metrology techniques, fringe pattern analysis is the central algorithm for recovering the underlying phase distribution from the recorded fringe patterns. Despite extensive research efforts for decades, how to extract the desired phase information, with the highest possible accuracy, from the minimum number of fringe patterns remains one of the most challenging open problems. Inspi…
▽ More
In many optical metrology techniques, fringe pattern analysis is the central algorithm for recovering the underlying phase distribution from the recorded fringe patterns. Despite extensive research efforts for decades, how to extract the desired phase information, with the highest possible accuracy, from the minimum number of fringe patterns remains one of the most challenging open problems. Inspired by recent successes of deep learning techniques for computer vision and other applications, here, we demonstrate for the first time, to our knowledge, that the deep neural networks can be trained to perform fringe analysis, which substantially enhances the accuracy of phase demodulation from a single fringe pattern. The effectiveness of the proposed method is experimentally verified using carrier fringe patterns under the scenario of fringe projection profilometry. Experimental results demonstrate its superior performance in terms of high accuracy and edge-preserving over two representative single-frame techniques: Fourier transform profilometry and Windowed Fourier profilometry.
△ Less
Submitted 8 July, 2018;
originally announced July 2018.