-
A self-supervised CNN for image watermark removal
Authors:
Chunwei Tian,
Menghua Zheng,
Tiancai Jiao,
Wangmeng Zuo,
Yanning Zhang,
Chia-Wen Lin
Abstract:
Popular convolutional neural networks mainly use paired images in a supervised way for image watermark removal. However, watermarked images do not have reference images in the real world, which results in poor robustness of image watermark removal techniques. In this paper, we propose a self-supervised convolutional neural network (CNN) for image watermark removal (SWCNN). SWCNN constructs reference watermarked images in a self-supervised way, according to the watermark distribution, rather than relying on given paired training samples. A heterogeneous U-Net architecture is used to extract more complementary structural information via simple components for image watermark removal. Taking texture information into account, a mixed loss is exploited to improve the visual effects of image watermark removal. In addition, a watermark dataset is constructed. Experimental results show that the proposed SWCNN is superior to popular CNNs in image watermark removal.
Submitted 9 March, 2024;
originally announced March 2024.
-
Multi-Agent Robust Control Synthesis from Global Temporal Logic Tasks
Authors:
Tiange Yang,
Yuanyuan Zou,
Jinfeng Liu,
Tianyu Jia,
Shaoyuan Li
Abstract:
This paper focuses on the heterogeneous multi-agent control problem under global temporal logic tasks. We define a specification language, called extended capacity temporal logic (ECaTL), to describe the required global tasks, including the number of times that a local or coupled signal temporal logic (STL) task needs to be satisfied and the synchronous requirements on task satisfaction. The robustness measure for ECaTL is formally designed. In particular, the robustness for synchronous tasks is evaluated from both the temporal and spatial perspectives. Mixed-integer linear constraints are designed to encode ECaTL specifications, and a two-step optimization framework is further proposed to realize task-satisfied motion planning with high spatial robustness and synchronicity. Simulations are conducted to demonstrate the expressivity of ECaTL and the efficiency of the proposed control synthesis approach.
Submitted 17 November, 2023; v1 submitted 20 September, 2023;
originally announced September 2023.
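The abstract above refers to a robustness measure built on signal temporal logic (STL). As a rough illustration only, the sketch below computes the standard spatial robustness of two basic STL formulas over a sampled signal: "always x > c" takes the worst-case margin, "eventually x > c" the best-case margin. It does not implement ECaTL or the paper's synchronicity measure; the function names and the sample signal are assumptions made for this example.

```python
from typing import Sequence


def robustness_always_greater(signal: Sequence[float], threshold: float) -> float:
    """Spatial robustness of 'always (x > threshold)' over a sampled signal.

    The formula holds with margin rho > 0 only if every sample exceeds the
    threshold, so the robustness is the smallest margin along the horizon.
    """
    return min(x - threshold for x in signal)


def robustness_eventually_greater(signal: Sequence[float], threshold: float) -> float:
    """Spatial robustness of 'eventually (x > threshold)'.

    One sample above the threshold is enough, so the robustness is the
    largest margin along the horizon.
    """
    return max(x - threshold for x in signal)


# Hypothetical sampled signal, used only for illustration.
signal = [1.5, 2.0, 0.75, 3.0]
always_margin = robustness_always_greater(signal, 0.5)          # positive: satisfied
eventually_margin = robustness_eventually_greater(signal, 3.5)  # negative: violated
```

A positive value means the formula is satisfied and indicates how far the signal can be perturbed before it is violated; a negative value means it is violated.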
-
Hierarchical-level rain image generative model based on GAN
Authors:
Zhenyuan Liu,
Tong Jia,
Xingyu Xing,
Jianfeng Wu,
Junyi Chen
Abstract:
Autonomous vehicles are exposed to various weather conditions during operation, which are likely to trigger the performance limitations of the perception system, leading to safety of the intended functionality (SOTIF) problems. To efficiently generate data for testing the performance of visual perception algorithms under various weather conditions, a hierarchical-level rain image generative model, rain conditional CycleGAN (RCCycleGAN), is constructed. RCCycleGAN is based on the generative adversarial network (GAN) and can generate images of light, medium, and heavy rain. Different rain intensities are introduced as labels in conditional GAN (CGAN). Meanwhile, the model structure is optimized and the training strategy is adjusted to alleviate the problem of mode collapse. In addition, natural rain images of different intensities are collected and processed for model training and validation. Compared with the two baseline models, CycleGAN and DerainCycleGAN, the peak signal-to-noise ratio (PSNR) of RCCycleGAN on the test dataset is improved by 2.58 dB and 0.74 dB, and the structural similarity (SSIM) is improved by 18% and 8%, respectively. Ablation experiments are also carried out to validate the effectiveness of the model tuning.
Submitted 6 September, 2023;
originally announced September 2023.
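The abstract above reports gains in peak signal-to-noise ratio (PSNR). For readers unfamiliar with the metric, here is a minimal sketch of the usual definition, PSNR = 10 log10(MAX^2 / MSE), applied to flattened pixel lists. The function name, the 8-bit peak value of 255, and the list-based image representation are assumptions for this example, not details taken from the paper.

```python
import math
from typing import Sequence


def psnr(reference: Sequence[float], test: Sequence[float], max_value: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB between two equally sized pixel lists."""
    if len(reference) != len(test) or len(reference) == 0:
        raise ValueError("images must be non-empty and have the same number of pixels")
    # Mean squared error over all pixels.
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


# Hypothetical 2x2 grayscale images, flattened row by row.
clean = [10, 20, 30, 40]
generated = [11, 21, 31, 41]
score_db = psnr(clean, generated)
```

Because the scale is logarithmic, a gain of 2.58 dB at a fixed peak value corresponds to reducing the mean squared error by a factor of roughly 1.8.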
-
Multi-feature concatenation and multi-classifier stacking: an interpretable and generalizable machine learning method for MDD discrimination with rsfMRI
Authors:
Yunsong Luo,
Wenyu Chen,
Ling Zhan,
Jiang Qiu,
Tao Jia
Abstract:
Major depressive disorder is a serious and heterogeneous psychiatric disorder that needs accurate diagnosis. Resting-state functional MRI (rsfMRI), which captures multiple perspectives on brain structure, function, and connectivity, is increasingly applied in the diagnosis and pathological research of mental diseases. Different machine learning algorithms are then developed to exploit the rich information in rsfMRI and discriminate MDD patients from normal controls. Despite recent advances reported, the discrimination accuracy has room for further improvement. The generalizability and interpretability of the method are not sufficiently addressed either. Here, we propose a machine learning method (MFMC) for MDD discrimination by concatenating multiple features and stacking multiple classifiers. MFMC is tested on the REST-meta-MDD data set that contains 2428 subjects collected from 25 different sites. MFMC yields 96.9% MDD discrimination accuracy, demonstrating a significant improvement over existing methods. In addition, the generalizability of MFMC is validated by the good performance when the training and testing subjects are from independent sites. The use of XGBoost as the meta classifier allows us to probe the decision process of MFMC. We identify 13 feature values related to 9 brain regions including the posterior cingulate gyrus, superior frontal gyrus orbital part, and angular gyrus, which contribute most to the classification and also demonstrate significant differences at the group level. The use of these 13 feature values alone can reach 87% of MFMC's full performance when taking all feature values. These features may serve as clinically useful diagnostic and prognostic biomarkers for mental disorders in the future.
Submitted 18 August, 2023;
originally announced August 2023.
-
An Ontology-based Method to Identify Triggering Conditions for Perception Insufficiency of Autonomous Vehicles
Authors:
Xingyu Xing,
Tong Jia,
Junyi Chen,
Lu Xiong,
Zhuoping Yu
Abstract:
The autonomous vehicle (AV) is a safety-critical system relying on complex sensors and algorithms. The AV may confront risk conditions if these sensors and algorithms misunderstand the environment and situation, even though all components are fault-free. ISO 21448 defines the safety of the intended functionality (SOTIF), aiming to enhance the AV's safety by specifying the AV's development and validation process. As required in ISO 21448, the triggering conditions, which may lead to the vehicle's functional insufficiencies, should be analyzed and verified. However, there is not yet a method for comprehensive and systematic identification of triggering conditions. This paper proposes an analysis framework of triggering conditions for the perception system based on the propagation chain of events model, which consists of triggering source, influenced perception stage, and triggering effect. According to the analysis framework, ontologies of triggering source and perception stage are constructed, and the relationships between concepts in the ontologies are defined. According to these ontologies, triggering conditions can be generated comprehensively and systematically. The proposed method was applied to an L3 autonomous vehicle, and 20 of the 87 triggering conditions identified were tested in the field, among which eight triggered risky behaviors of the vehicle.
Submitted 16 October, 2022;
originally announced October 2022.
-
Multi-channel Nuclear Norm Minus Frobenius Norm Minimization for Color Image Denoising
Authors:
Yiwen Shan,
Dong Hu,
Zhi Wang,
Tao Jia
Abstract:
Color image denoising is frequently encountered in various image processing and computer vision tasks. One traditional strategy is to convert the RGB image to a less correlated color space and denoise each channel of the new space separately. However, such a strategy cannot fully exploit the correlated information between channels and is inadequate to obtain satisfactory results. To address this issue, this paper proposes a new multi-channel optimization model for color image denoising under the nuclear norm minus Frobenius norm minimization framework. Specifically, based on block-matching, the color image is decomposed into overlapping RGB patches. For each patch, we stack its similar neighbors to form the corresponding patch matrix. The proposed model is performed on the patch matrix to recover its noise-free version. During the recovery process, a) a weight matrix is introduced to fully utilize the noise difference between channels; b) the singular values are shrunk adaptively without additionally assigning weights. With them, the proposed model can achieve promising results while keeping simplicity. To solve the proposed model, an accurate and effective algorithm is built based on the alternating direction method of multipliers framework. The solution of each updating step can be analytically expressed in closed form. Rigorous theoretical analysis proves that the solution sequences generated by the proposed algorithm converge to their respective stationary points. Experimental results on both synthetic and real noise datasets demonstrate that the proposed model outperforms state-of-the-art models.
Submitted 16 September, 2022;
originally announced September 2022.
-
$ω-$nonblocking supervisory control of discrete-event systems with infinite behavior
Authors:
Ting Jiao,
Renyuan Zhang,
Kai Cai
Abstract:
In the supervisory control framework of discrete-event systems (DES) with infinite behavior initiated by Thistle and Wonham, a supervisor satisfying the minimal acceptable specification and the maximal legal specification is synthesized. However, this supervisor may incur livelocks as it cannot ensure that the infinite behavior under supervision will always visit some marker states. To tackle this problem, we propose the definition of markability by requiring that all infinite cycles include at least one marker state. Then we formulate the problem of $ω-$nonblocking supervisory control of DES with infinite behavior to synthesize an $ω-$nonblocking (i.e. nonblocking, deadlock-free and livelock-free) supervisor. An algorithm is proposed to achieve $ω-$nonblockingness by computing the supremal $*-$controllable, $*-$closed, $ω-$controllable and markable sublanguage. We utilize the example of a robot as a running example.
Submitted 15 March, 2021;
originally announced March 2021.
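The markability condition in the abstract above requires that every infinite cycle include at least one marker state. On a finite transition graph, one simple way to test this property is to delete the marker states and check that the remaining graph is acyclic. The sketch below does exactly that with a depth-first search. It is an illustration of the graph property only, not the paper's synthesis algorithm; the function name, the dictionary encoding of transitions, and the robot-like example states are assumptions, and the check is not restricted to reachable states.

```python
from typing import Dict, Hashable, Iterable, Set


def every_cycle_hits_marker(
    successors: Dict[Hashable, Iterable[Hashable]],
    markers: Set[Hashable],
) -> bool:
    """Return True if every cycle of the graph contains at least one marker state.

    Equivalent test: the subgraph induced by the non-marker states is acyclic.
    """
    states = set(successors)
    for targets in successors.values():
        states.update(targets)
    unmarked = states - set(markers)

    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on the DFS path, finished
    color = {s: WHITE for s in unmarked}

    for root in unmarked:
        if color[root] != WHITE:
            continue
        color[root] = GREY
        stack = [(root, iter(successors.get(root, ())))]
        while stack:
            state, remaining = stack[-1]
            for nxt in remaining:
                if nxt not in color:
                    continue  # edge into a marker state: any cycle through it is fine
                if color[nxt] == GREY:
                    return False  # back edge: a cycle that avoids all marker states
                if color[nxt] == WHITE:
                    color[nxt] = GREY
                    stack.append((nxt, iter(successors.get(nxt, ()))))
                    break
            else:
                color[state] = BLACK  # all successors explored
                stack.pop()
    return True


# Hypothetical three-state example: "spin" has a self-loop.
graph = {"idle": ["work"], "work": ["idle", "spin"], "spin": ["spin"]}
livelock_possible = not every_cycle_hits_marker(graph, {"idle"})
```

With only "idle" marked, the self-loop on "spin" is a cycle that never visits a marker state, so the check fails; marking "spin" as well makes it pass.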