NDELS: A Novel Approach for Nighttime Dehazing, Low-Light Enhancement, and Light Suppression

Silvano A. Bernabel and Sos S. Agaian (Fellow, IEEE)
The Graduate Center, City University of New York, New York, NY USA

Abstract

This paper tackles the intricate challenge of improving the quality of nighttime images under hazy and low-light conditions. Overcoming issues including nonuniform illumination glows, texture blurring, glow effects, color distortion, noise disturbance, and overall, low light have proven daunting. Despite the inherent difficulties, this paper introduces a pioneering solution named Nighttime Dehazing, Low-Light Enhancement, and Light Suppression (NDELS). NDELS utilizes a unique network that combines three essential processes to enhance visibility, brighten low-light regions, and effectively suppress glare from bright light sources. In contrast to limited progress in nighttime dehazing, unlike its daytime counterpart, NDELS presents a comprehensive and innovative approach. The efficacy of NDELS is rigorously validated through extensive comparisons with eight state-of-the-art algorithms across four diverse datasets. Experimental results showcase the superior performance of our method, demonstrating its outperformance in terms of overall image quality, including color and edge enhancement. Quantitative (PSNR, SSIM) and qualitative metrics (CLIPIQA, MANIQA, TRES), measure these results.

Index Terms:

Single-image nighttime dehazing, non-uniform haze, bright light suppression, low-light enhancement, multiscale retinex

This work was supported in part by the U.S. Department of Transportation, Federal Highway Administration (FHWA), under Contract 693JJ320C000023

Introduction

Nighttime dehazing is a crucial and dynamic research field that addresses a pressing issue in low-light image capture. Poor quality images captured under low-light conditions can negatively impact applications, ascribable to non-uniform illumination, texture blurring, glow effects, color distortion, haze density, noise disturbance, and other light sources. These degradations render images useless, reducing user experience and limiting the extent to which applications may benefit from higher image quality.

By improving the quality through dehazing techniques, images can be effectively utilized in computer vision, surveillance, and autonomous systems [1, 2]. Despite atmospheric conditions, including haze and foggy weather, Qiu et al. [3], demonstrated that effect suppression improves autonomous driving. In addition, Liu et al. [4], showed advantages of decreasing haze density, one of which is increasing object detection accuracy.

Despite having achieved satisfying dehazing results in daytime settings, in nighttime, daytime methods have been delivering poor quality, at least due to differences in degradation characteristics. Nevertheless, there have been developments. One strategy, first transformed color characteristics between night and day domains, then used prior-based dehazing methods [5]. Another, had first reduced the glowing effects by estimating and removing a glow layer, then applied dehazing [6]. The remaining daytime dehazing algorithms are either prior-based [7, 8, 9, 10, 11, 12, 13, 14] or learning-based [15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 6]. These methods have shown good daytime dehazing. But, when faced with nighttime dehazing, quality still eludes them.

Nighttime-dehazing methods are naturally better, but not perfect. Recently, Zhang et al. [32, 33], developed maximum reflection-prior and parameter estimation to achieve dehazing. Similarly, Wang et al., designed gray haze line prior (GHLP) [34], and a variation method to attain dehazing. Later, Zhang et al. [35], created an optimal-scale-fusion method and a novel, benchmark dataset of nighttime images with synthetic haze. These methods improve dehazing, increase visibility and reduce light glow. Despite this, they amplify noise and, as a result, reduce the visual quality.

This paper addresses the challenge of enhancing the quality of hazy-nighttime images in low light. Quality that has been degraded by non-uniform illumination, texture blurring, glow effects, color distortion, noise disturbance, and low light. The paper proposes a novel nighttime dehazing, enhancement, and light-suppression network (NDELS) to engage the problem. This contribution provides the following:

1.

Introduction of an end-to-end single-image, nighttime-dehazing, convolutional neural network, providing a holistic solution to the challenges posed by nighttime conditions.
2.

We propose a novel approach for generating training data, crucial for effectively suppressing light effects, and enhancing the adaptability and robustness of the NDELS architecture.
3.

Conducting detailed computer simulations to validate the effectiveness of the NDELS architecture, ensuring its practical applicability and superior performance in real-world scenarios.

The remainder of this paper is organized as follows. The background section presents existing dehazing studies with their limitations and advantages. Second, the proposed-method section describes the NDELS architecture. Third, the experimental results, including quantitative and qualitative comparisons with state-of-the-art, nighttime-dehazing and enhancement methods, an ablation study, a real-world application, a subjective study, runtime estimates, and limitations. Finally, we conclude the paper with final remarks.

Background

This section briefly reviews existing daytime and nighttime-dehazing methods.

Methods Base on a Physical Model

These methods have relied primarily on the atmospheric scattering model (ASM) [36], tuning prior-based estimation parameters to approximate transmission and atmospheric light.

Daytime

By estimation and optimization, this kind of dehazing has yielded good results. He et al. [7], developed a dark channel prior (DCP) based on statistically analyzing many haze-free outdoor images. Then, an image is dehazed by finding the inverse of the statistical model’s prediction. Fang et al. [37], built a variational model and the alternating direction method of multipliers to do single-image dehazing. Nevertheless, the more complex the haze, present in real-world imagery, the more challenging it’s for these methods to produce a convincing result.

Nighttime

Even more complex, nighttime haze, has led researchers to devise new approaches. Zhang et al. [32, 35], created a prior, that estimates ambient illumination called maximum-reflectance prior (MRP), accounting for degradations. Liu et al. [38], removed noise and light glow using a linear model to decompose an image into four components. Pei and Lee [5], used a reference image to map, blue to gray, air-light color, then applied DCP to dehaze the images. Wang et al. [34], exploited the gray-haze line to project, between color spaces, the concentration of haze in RGB to the Y channel in YUV. And used a variational structure, estimating the inverted radiance and transmission. Finally, Liu et al. [39], developed a variational-decomposition model to decompose a hazy image into structure, detail, and noise layers. These systems may fix multiple degradation problems, but introduce undesirable noise.

Methods Based on Enhancement:

Daytime

As the field has developed, daytime-dehazing methods have been incorporating fusion, retinex, and contrast schemes to enhance hazy images. By observing weather effects in hazy images, Mi et al. [40], deduced that degradation is concentrated in contrast and color, which the authors compensated by means of contrast enhancement via a multiscale-gradient domain. Strategically, Wang et al. [41], mixed enhancement and a physical-model, namely multiscale-retinex with color restoration and transmission-map estimation, improving image quality lost due to weather degradation. To overcome the limitations of dehazing with physical models, such as imprecise depth information, Zhu et al. [42], proposed image fusion, to fuse gamma-corrected images via pixel-weight maps. Lastly, Liu et al. [43], proposed an efficient framework to contrast enhance and fuse exposures. These methods have yielded good daytime dehazing, but can’t be directly applied to nighttime images.

Nighttime

In similar fashion, other researchers have tackled the night. Utilizing multiscale fusion, Ancuti et al. [44, 45, 46], computed the air light component on different size patches and blended the multiple images using Laplacian-pyramid decomposition. Finally, Yu et al. [47], estimated the transmission map by combining dark and bright-channel priors. Enhancement-based approaches are multifaceted, cleverly combining enhancement techniques, achieving good dehazing. Despite this, they require considerable amount of finetuning, reducing their generalization.

Methods Based on Deep-Neural Networks

Daytime

Some approaches with Deep-Neural Networks (DNN) have tried predicting the transmission map. For example, in DehazeNet [15], and MSCNN [22], neural-networks are used to estimate the transmission parameters, thereby attaining a haze-free image. Analogously, Li et al. [17], designed AOD-Net to learn multiple parameters of the ASM. Similarly, Yang et al. [48], proposed a region-detection network to predict a transmission map. Moreover, Zhang et al. [26], estimated a transmission map, with regularization to control noises. But, there is noise. In daytime there is need for noise reduction, but more so in nighttime.

Other approaches have jointly used enhancement and DNNs. Using a network to dehaze and adaptively enhance, AED-Net, Hovhannisyan et al. [49], applied gray-level dehazing, modified gamma-correction with region awareness and channel attention. And, combined low and high order features, removing significant haze.

The list of methods is extensive, some of the notable ones: GFN [18], PDN [19], DRCDN [25], NLDN [27], FFA-Net [24], MSBDN [23], DA-Net [28], PDR-Net [50], PSD-Net [29], MSAFF-Net [51], FSDGN [52], Dehamer [53], and AIDED-Net [31].

Observe the images in Figure 2, and note the results of the daytime-dehazing methods FSDGN [52] and Dehamer [53]. Compared to NDELS, these two methods show very little improvement in the visibility of the hazy image. In general, this class of methods have performed well in the daytime, but incompletely dehaze in the nighttime, due to non-uniform illumination, haze, low light, light glow, and other degradation factors.

Nighttime

Another approach has been to isolate light glow. For instance, Koo and Kim [54], proposed a glow-decomposition network to extract glow effects from a hazy image, obtaining atmospheric light and transmission maps. And, Kuanar et al. [55], proposed DeGlow, a model to remove glow effects, and DeHaze, a network for removing haze effects.

Others have tried separating frequency elements. Yan et al. [56], build a framework to decompose a grayscale component of a hazy image into low and high-frequency layers. Then, enhancing the textures in the high-frequency layer, and removing glow and fog in the lower-frequency layer. In summary, these nighttime dehazing methods don’t provide consistent quality for real-world image dehazing.

As of late, instance normalization is hel** bridge the gap between daytime and nighttime-dehazing methods. Instance normalization is showing good results in image restoration tasks. For example, HINet proposed in [57], a half-instance, normalization block, replacing batch normalization due to its inefficiency in gathering accurate feature statistics, as a consequence of using small batches of data during training. Instead, HINet improves a networks ability to capture more low-level features by concatenating inputs with their normalization, which the authors call half-instance normalization.

Narrowing the gap between overexposed and underexposed images by instance normalizing and map** into a common space, is the exposure-invariant space (ENC) module [58]. It was designed to overcome data imbalance, by training with different exposures. But, there’s a worst performing exposure, improved by parameter regularization. It’s plausible, that ENC could benefit daytime and nighttime dehazing. We could exploit an exposure-invariant space to accomplish dehazing, then map into the day or night domain. In future work, we will explore using instance normalization.

This paper proposes a nighttime dehazing, enhancement, and light-suppression network (NDELS). This network combines (i) a multiscale, attention-guided, feature-fusion network, designed to brighten poorly illuminated zones, thereby exposing concealed haze and atmospheric effects, (ii) a modified Res2Net encoder with residual-channel-attention modules to amplify low-light regions, eliminate haze and sharpen edges, while mitigating the impact of excessive illumination, and (iii) a module extending multiscale Retinex, augmenting color clarity and constancy.

Contributions

Our paper contributes the following: (a) An end-to-end single-image, nighttime-dehazing, convolutional, neural network. Its functionality is independent of external image references, transmission-map estimation, atmospheric-light analysis, and gathering sequences of images from a single scene with varying weather effects. (b) A novel approach of generating training data, crucial for suppressing night-light effects. (c) The results of detailed computer simulations. And, upon their analysis, the proposed method outperforms all the state-of-the-art, single-image, nighttime-dehazing algorithms, namely NDIM [32], GS [59], MRP and MRPF [33], OSFD [35], FDGCN [54], GHLP [34], and UVD [39] for real-world images in the NHRW and synthetic NHM datasets [35]. Qualitatively, the proposed method does better than non-dehazing, nighttime-enhancement methods, such as DRSL [60], HDRCNN [61], Zero-DCE [62], EnlightenGAN [63], and SingleHDR [64].

Table I, summarizes the strengths and weaknesses of nighttime-dehazing and enhancement methods.

Table I: Comparison of state-of-the-art single-image night dehazing methods.

	NDIM[32]	GS[59]	MRP[33]	MRPF[33]	OSFD[35]	FDGCN[54]	GHLP[34]	UVD[39]	NDELS
No assumptions					$\checkmark$	$\checkmark$			$\checkmark$
Noise reduction								$\checkmark$	$\checkmark$
Light suppression								$\checkmark$	$\checkmark$
Glow removal		$\checkmark$				$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$
Non-uniform illumination	$\checkmark$	$\checkmark$			$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$
Low-light enhancement	$\checkmark$							$\checkmark$	$\checkmark$
Nighttime haze removal	$\checkmark$		$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$	$\checkmark$		$\checkmark$
Edge enhancement						$\checkmark$		$\checkmark$	$\checkmark$

Proposed Method

This section provides details on the NDELS architecture, comprised of three key modules: the low-light, dehazing, and extended-multiscale-retinex. Furthermore, we discuss the loss functions needed to train the low-light and dehazing modules.

Overview of the Network Architecture

Our NDELS network architecture has three essential components:

(i) A multiscale, attention-guided, feature-fusion network, designed to brighten poorly illuminated zones, exposing concealed haze and atmospheric effects.

(ii) A modified Res2Net encoder with residual-channel-attention modules to amplify low-light regions, eliminate haze, and sharpen edges, while mitigating the impact of excessive illumination.

(iii) A module extending multiscale Retinex, augmenting color clarity and constancy.

These components are carefully crafted to enhance image quality under varying illumination conditions, while maintaining the integrity of the light source.

Showcasing the connectivity of the network’s modules, Figure 3, depicts NDELS as a block diagram with high-level details. The low-light module shines light on the image, but haze is now apparent. The dehazing module focuses in on the haze features, that are then faded away. With adjustment of contrast, and an extended-multiscale retinex applied, original [65], the image is considerably brightened and color enhanced. Our extended-multiscale retinex greatly improves the visual quality of the image, but overexposes the bright regions. Thus, we combine the retinex and dehazed images, producing an image of higher quality.

Low-Light Module (LLM): The low-light module, part (a) of Figure 4, takes in a low-light image with haze. We have implemented our own version of the Attention-Guided, Multiscale, Fusion Network (AGMFN) [66]. The key components are the resblocks, convolutional-block-attention (CBAM), cascade-fusion (CFM), attention-fusion (AFM), feature-enhancement (FEM), and feature-calibration modules (FCM). The input image passes through a set of resblocks and convolutional layers, then through the CBAM. It guides the low-light module in enhancing poorly lit regions, and restricting the over-enhancement of bright areas.

The two remaining encoder-decoder branches pass a downscaled image through shallow feature (SFM) and FEM modules. The SFM convolves the image through a series of low-order kernels to capture low-level features. Given the encoded features, the FEM combines them with those of the SFM. Fusing the low-level and multiscale-feature maps, the AFM uses global-average pooling to amplify channel information and strengthen semantic dependencies. The CFM concatenates the encoded data from the three scales to compensate for the loss of information due to the downscaling. The remaining feature-encoded information is fed back into the adjacent higher-scale branch through decoding via deconvolution, and linked by the FCM. Improving degradation and detail enhancement is accomplished by the FCM, which passes the important lower-scale features to the AFM.

Dehazing Module (DHM): To tackle the issue of haze present in hazy nighttime images, we employ the dehazing module proposed in [38]. Although this network is meant to dehaze daytime images, we prove that by using a low-light module to enhance the low-light regions and the haze, it can be trained to dehaze nighttime images with the appropriate training data. Part (b) of Figure 4, depicts the dehazing module with two branches: the upper pretrained and the lower trained branches.

The upper, consists of a Res2Net encoder-decoder module [67]. It uses feature-fusion attention [24] as the attention block, PixelShuffle [68] for upscaling, and an enhancement module [21]. ImageNet weights are loaded during training, allowing the network to extract more robust features rather than randomly initializing the weights. The lower branch captures finer details, avoiding rescaling, and adjusts the network to the dehazed image. The branch comprises residual-channel-attention blocks (RCAB) [69]. Each residual-channel-attention module (RCAM) is composed of RCABs with a long skip connection, reducing the gradient vanishing problem [70]. The two branches are concatenated, passing through a convolutional layer and $tanh$ activation.

Discriminator: We use a discriminator network-based generative model to differentiate between real and generated data, which can improve the dehazing module’s overall performance and its ability to produce more robust features that resemble real data, making the generated data lifelike [38, 71]. The following structure is adopted: The network starts by taking an image as input, which is then scaled up to 512 output channels using multiple convolutional layers with a kernel size of $3\times 3$ . These convolutional layers are followed by batch normalization and leaky-ReLU activation, stabilizing the training process and avoiding vanishing gradients—the negative activation slope allows small gradients to flow through during training. Then adaptive-average pooling is applied to the feature maps, reducing spatial dimensions and improving translation robustness. Two convolutional layers follow, with kernel size $1\times 1$ , and output channels of 1024 and 1. These layers reduce the number of channels to one. Finally, a sigmoid activation normalizes the probability, that an image is real or synthetic.

Extended-Multiscale-Retinex (EMSR): The multiscale-retinex method [72], is a human-perception based, color-image, enhancement algorithm. EMSR utilizes the Fast-Fourier-Transform (FFT) to filter each color channel, using Gaussian kernels with scales of 5, 130, and 255. Low and high pixel intensities are analyzed, kee** the most frequent, with frequency less than 10% the frequency of pixels with 0 intensity. The color channels are combined yielding a contrast and brightness-improved image, visually more appealing, easier to interpret, and better color constancy.

Loss Functions: To train the LLM, we compute the content loss calculated using the $L_{1}$ loss and multiscale-frequency-domain loss (MSFD) [66, 73]. The total loss for the LLM is the linear combination of the content loss and MSFD loss given by

L_{\text{total}}=\gamma_{1}L_{\text{content}}+\gamma_{2}L_{\text{MSFD}},

where the coefficients $\gamma_{1}$ and $\gamma_{2}$ are hyperparameters. To train the DHM, we compute the perceptual[74, 75], smoothness $L_{1}$ [76], multiscale, structural-similarity-index measure (MS-SSIM) [13], and adversarial losses [71]. The total loss for the DHM is the linear combination of the four losses defined as:

L_{\text{total}}=\gamma_{1}L_{l1}+\gamma_{2}L_{\text{MS-SSIM}}+\gamma_{3}L_{% \text{perc}}+\gamma_{4}L_{\text{adv}},

where the coefficients $\gamma_{1},\gamma_{2},\gamma_{3}$ and $\gamma_{4}$ are hyperparameters.

We believe that the MSFD losses are better suited for the type of task that requires for the restoration of hidden or degradated details in an image, which is typical of nighttime or low-light imagery. Additionally, this loss helps reduce and suppress the light effects. For the DHM, the smoothness $L_{1}$ , perceptual, and MS-SSIM, ensure that the dehazed image is close to the target perceptually and structurally, based on human-visual perception and hidden features from a neural network. As for the adversarial loss, it ensures the final dehazed image is realistic.

Experiments

This section provides quantitative and qualitative comparisons of NDELS with state-of-the-art, nighttime-dehazing algorithms.

Datasets

We choose the following four datasets to train, validate, and test our proposed method.

Nighttime Hazy Middlebury (NHM): The NHM dataset [35], contains 350 synthetic images, taken at night, with low, medium, and high-haze densities. With the dataset we quantitatively compare our method with state-of-the-art methods available at the time of this writing. In addition, we select five samples of varying haze density, and measure the quality relative to the ground truth.

Transient-Attributes Dataset (TA): The TA dataset [77], contains 8,571 images from 101 webcams with 40 attribute labels. We select images with dark and bright attributes, such that the attribute confidence is at least 80%.

Nighttime Hazy Real-World (NHRW): The NHRW dataset [35], contains 150 real-world, nighttime images with low, medium, and high-haze densities. Qualitatively, with this dataset we compare our method with leading others, measuring their effectiveness in dehazing and enhancing nighttime images.

Light-Effects Dataset (LE): The LE dataset [60], contains 501 nighttime, haze-free images having light glare, glow, and flood lights. We compare, qualitatively with the dataset, our model’s light-suppressing capabilities to other state-of-the-art methods for nighttime enhancement.

Table II: Breakdown of training and validation data generated from a subset of the TA dataset.

Task	Cameras	Bright	Dark	Generated Pairs
Training	52	550	331	3056
Validation	9	60	76	385

Implementation Details

To enable the network to effectively learn the processes of dehazing and low-light enhancement, we have devised a strategy that utilizes both night and day imagery. The implementation involves the following steps:

•
Data Gathering:
- –
  
  Utilizing camera data from the TA dataset.
- –
  
  Alternatively, recommending the use of fixed-view, internet webcams capturing day and night outdoor scenes with elements such as vehicles, buildings, and night lights. For efficient data gathering, we suggest capturing one image every 12 hours.
•
Image Composition:
- –
  
  Obtaining an image pair, bright image $(B)$ and dark image $(D)$ , from the TA dataset.
- –
  
  Generating a new bright image $(B^{\prime})$ through linear combination with the dark image $(D)$ : $B^{\prime}=0.7B+0.3D$ .
- –
  
  Generating a new dark image $(D^{\prime})$ : $D^{\prime}=0.3B+0.7D$ .
•
Synthetic-Haze Addition:
- –
  
  Adding random synthetic haze $(H)$ to each pair $B^{\prime}$ and $D^{\prime}$ using functions from [78].
- –
  
  Resulting in images: bright, bright-hazy, and dark-hazy ( $B^{\prime},B^{\prime}+H,D^{\prime}+H$ , respectively).
•
Image Enhancement:
- –
  
  Sharpening and contrast enhancing the bright image $(B^{\prime})$ further by clip** the bottom and top 5% of intensity values in each color channel.

Observe the presence of lights inside windows in the bright image $(B^{\prime})$ in Figure 5, which constrains the low-light enhancement and dehazing modules from over enhancing light sources. During training, each training image is resized to $512\times 256$ pixels and randomly cropped to snippets of $256\times 256$ pixels, randomly rotated and flipped. The validation images are resized to $512\times 256$ pixels. The low-light enhancement module learns to map images that are dark-hazy to bright-hazy. And, the dehazing module learns to map images that are bright-hazy to bright-dehazed.

The low-light, enhancement module uses the total-loss function described in the previous section. Reiterating, the total loss is the sum of the content loss and the MSFD loss. In addition, the dehazing module uses a perceptual loss through VGG16 [75]. Along with MS-SSIM, smooth, and adversarial losses. During training, the learning rate is initialized to $10^{-4}$ , and decreased every 14 epochs by 10, for 42 epochs. Approximately 9 hours of training are needed when performed on an NVIDIA RTX A6000 with PyTorch 1.13.

Qualitative Evaluation

In Figure 6, we show a qualitative comparison of NDELS with other algorithms. In the first image, all methods reduce haze. The brightest appears to be (b) NDIM [32], but at the cost of high noise and further brightening the bright lamps. Methods (g) FDGCN [56] and (h) GHLP [34] barely illuminate the input, but don’t worsen the light effects. However, visible haze remains. The second best, qualitatively is (i) UVD [39], which produces a good image, but with blur and some haze remaining. In comparison, (j) NDELS has a sharp, bright, and dehazed image, with light-effects suppression.

Shown in Figure 7, is a qualitative comparison of night-enhancement algorithms with NDELS. The DRSL [60] algorithm does a good job of removing light effects, but doesn’t enhance all low-light areas. Others, such as Zero-DCE [62], EnlightenGAN [63], and SingleHDR [64], distort the input image by expanding the red-color lights and adding some haze to the image. Remaining methods blur and over enhance bright regions or increase haze. But, NDELS successfully suppresses light effects, glow, glare, and flood lights, sharpening and enhancing low-light areas, with the added benefit of removing nighttime haze.

Quantitative Evaluation

In our evaluation to quantify the NDELS method, we adopt the peak-signal-to-noise ratio (PSNR) and the structural-similarity-index measure (SSIM) [79]. Five representative images from the NHM dataset are shown in Figure 8. The scores of these images are given in the top rows of Table III. The table demonstrates that NDELS consistently scores higher for both metrics. The last two rows are for the entire NHM dataset. NDELS has a higher PSNR and SSIM than all the other methods. These quantitative results are for NDELS without the EMSR module. We find that the module works best on real-world images.

Table III: PSNR (top) and SSIM (bottom) scores for a subset and full NHM dataset.

Subset	Image	NDIM[32]	GS[59]	MRP[33]	MRPF[33]	OSFD[35]	FDGCN[54]	GHLP[34]	UVD[39]	NDELS
	1	11.0509	14.8753	12.2937	13.9119	12.7518	9.5976	14.5704	15.5548	16.2492
	1	0.6215	0.6767	0.6868	0.6780	0.6913	0.4908	0.7761	0.8193	0.7996
	2	16.1221	13.9817	14.7126	14.617	14.6873	13.3735	16.0579	16.6188	16.7865
	2	0.6927	0.6782	0.6892	0.6818	0.7084	0.4087	0.7494	0.7414	0.7529
	3	12.6267	12.9039	12.0158	13.7679	12.0900	10.2708	13.7826	14.7216	17.4966
	3	0.6080	0.5818	0.6177	0.6171	0.6331	0.2611	0.6937	0.7387	0.7653
	4	13.1032	13.3063	12.1928	14.6411	11.8457	10.8901	11.6252	14.8185	17.4809
	4	0.6422	0.6626	0.6465	0.6654	0.6562	0.4491	0.6709	0.7007	0.7492
	5	14.0529	14.8528	16.0453	15.0434	16.0152	14.5347	16.1686	17.2239	16.6297
	5	0.6225	0.6640	0.6996	0.6366	0.7078	0.4501	0.7185	0.7233	0.7395
	Avg	13.3912	13.9840	13.4520	14.3963	13.4780	11.7333	14.4409	15.7875	16.9286
	Avg	0.6374	0.6527	0.6680	0.6558	0.6794	0.4120	0.7217	0.7447	0.7613
Full	-	12.4924	11.8963	12.9928	13.1847	13.3027	11.5466	13.1523	13.4679	14.6458
Full	-	0.5752	0.5899	0.6299	0.6164	0.6435	0.3684	0.6735	0.6630	0.7035

Object Detection

In order to assess the real-world applicability of NDELS, particularly in the context of object detection, we selected a test image from the NHRW dataset. Subsequently, we process the image through our network and, for comparative analysis, through an established object detection network, DETR [80]. As illustrated in Figure 9, the application of NDELS results in a significant improvement in both accuracy and the number of objects detected, outperforming MRP, MRPF, and OSFD. It’s noteworthy that certain methods, such as the GHLP, couldn’t be easily compared due to the unavailability of public code. Even when implementing GHLP based on information provided in the published paper, the results were unfavorable.

Subjective Study

The results of the questionnaire align with expectations. We randomly selected ten images from the NHRW dataset and surveyed 142 individuals. Participants were tasked with selecting the image that demonstrated the highest clarity, visibility, and detail. Each question featured one original image and the outputs of four models. As depicted in Figure 10, our analysis reveals that, on average, NDELS outperforms the other models by at least $18.9\pm 7.7$ votes. In conclusion, the survey findings imply that NDELS generally produces a subjectively better image.

Figure 10: Survey on image quality of MRP[33], MRPF[33], OSFD[35], and NDELS. Votes are average number of people that chose a model’s output image.

Ablation Study

To better understand the interactions of the modules to effectively dehaze nighttime images, we train each independently and simultaneously. For a sample image, we evaluate the CLIPIQA , MANIQA, and TRES no-reference, quality measures [81, 82, 83]. For all three quality measures, the higher the score the better the image quality. In Figure 11, the DHM appears to score better than the LLM, as well as combined modules for two quality measures. However, the TRES quality measure tells us that the DHM+LLM produces a more pleasing image evidenced by the higher scores, in particular the alpha-blended image. We can observe that the DHM creates a partial-vignette effect around the image, while the combined modules don’t.

Table IV: Ablation study examining the importance of NDELS modules tested on the NHM dataset.

Modules	Base		EMSR		Enhancement
Modules	PSNR	SSIM	PSNR	SSIM	PSNR	SSIM
LLM	13.8821	0.7013	11.3590	0.4731	12.9873	0.5995
DHM	14.5164	0.7036	11.7811	0.4706	13.5823	0.6024
LLM+DHM	14.6458	0.7035	11.6899	0.5078	13.4844	0.6250

Furthermore, upon evaluating each module on the NHM dataset it’s evident that the combination DHM+LLM produces the best result, a PSNR of 14.6458. In Table IV, we have also broken down the scores based on whether or not the application of EMSR and Enhancement are applied after the module. And, as noted earlier, the EMSR and enhancement don’t work well on synthetic images, as the results indicate.

Runtime Measurement

The runtime values in Table V, show that NDELS isn’t faster than MRPF. However, MRP and MRPF are both implemented in C++, making the algorithms faster than their non-compiled versions. Yet, NDELS only using Python, is 3-4 times faster than MRP and 1.5-2 times faster than OSFD. Network loading time isn’t included, since in a real-world application the model would be initialized once, then waits for input. It remains to be seen, in future work, how fast is a compiled NDELS algorithm.

Table V: Runtimes of nighttime-dehazing methods on a

800\times 600

-sized color image, averaged over 5 runs.

Method	MRP[33]	MRPF[33]	OSFD[35]	NDELS
Time (s)	1.460	0.241	0.623	0.397

Limitation

During our testing, it became evident that the NDELS network faced challenges in effectively dehazing daytime images. Notably, this limitation is not unique to our method, as other nighttime-dehazing approaches exhibit similar issues, as illustrated in Figure 12. While NDELS does reduce some haze, background objects may appear blurred, and colors may fade. One potential explanation could be the lack of training samples with daytime hazy.

For example, in Figure 12, OSFD attains the highest PSNR score, but the resulting image appears dark with bright edges. On the other hand, the TRES scores align more closely with human-visual perception. As expected, NDELS achieves higher scores, although counterintuitively, OSFD registers the highest score. Further investigation is necessary to understand the underlying factors contributing to this behavior.

Conclusion

We present a cutting-edge approach for dehazing and enhancing single-nighttime-hazy images. The presented method leverages a low-light module (LLM) that has been trained to suppress light glare and enhance low-light regions and a dehazing module (DHM) that effectively removes haze, restores natural color, and sharpens images. Our approach, dubbed NDELS, outperforms state-of-the-art, single-image nighttime-dehazing, and nighttime-enhancement methods by better suppressing light effects and enhancing low-light areas without causing further distortion. Quantitative results on the NHM dataset show the superiority of NDELS over other approaches by employing non-reference quality measures to validate the effectiveness of our extended-multiscale retinex (EMSR) approach. In future work, we plan to integrate a deraining and desnowing module into the NDELS network, creating a comprehensive system for nighttime dehazing and removal of weather elements.

References

[1] V. S. Saravanarajan, R.-C. Chen, C.-H. Hsieh, and L.-S. Chen, “Improving Semantic Segmentation Under Hazy Weather for Autonomous Vehicles Using Explainable Artificial Intelligence and Adaptive Dehazing Approach,” IEEE Access, vol. 11, pp. 38 194–38 207, 2023.
[2] Y. Guo, Y. Lu, R. W. Liu, L. Wang, and F. Zhu, “Heterogeneous Twin Dehazing Network for Visibility Enhancement in Maritime Video Surveillance,” in 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), Sep. 2021, pp. 2875–2880.
[3] Y. Qiu, Y. Lu, Y. Wang, and H. Jiang, “IDOD-YOLOV7: Image-Dehazing YOLOV7 for Object Detection in Low-Light Foggy Traffic Environments,” Sensors, vol. 23, no. 3, p. 1347, Jan. 2023.
[4] Z. Liu, Y. He, C. Wang, and R. Song, “Analysis of the Influence of Foggy Weather Environment on the Detection Effect of Machine Vision Obstacles,” Sensors (Basel, Switzerland), vol. 20, 2020.
[5] S.-C. Pei and T.-Y. Lee, “Nighttime haze removal using color transfer pre-processing and Dark Channel Prior,” 2012 19th IEEE Inter. Conf. Image Process., pp. 957–960, 2012.
[6] T. Trongtirakul and S. Agaian, “Transmission map optimization for single image dehazing,” in Multimodal Image Exploitation and Learning 2022, vol. 12100. SPIE, 2022, pp. 124–134.
[7] Kaiming He, Jian Sun, and Xiaoou Tang, “Single Image Haze Removal Using Dark Channel Prior,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 33, no. 12, pp. 2341–2353, Dec. 2011.
[8] R. Fattal, “Dehazing Using Color-Lines,” ACM Transactions on Graphics (TOG), vol. 34, pp. 1 – 14, 2014.
[9] Q. Zhu, J. Mai, and L. Shao, “A Fast Single Image Haze Removal Algorithm Using Color Attenuation Prior,” IEEE Transactions on Image Processing, vol. 24, no. 11, pp. 3522–3533, Nov. 2015.
[10] T. M. Bui and W. Kim, “Single Image Dehazing Using Color Ellipsoid Prior,” IEEE Transactions on Image Processing, vol. 27, pp. 999–1009, 2018.
[11] M. Ju, C. Ding, Y. J. Guo, and D. Zhang, “IDGCP: Image dehazing based on gamma correction prior,” IEEE Transactions on Image Processing, vol. 29, pp. 3104–3118, 2019.
[12] M. Ju, C. Ding, C. A. Guo, W. Ren, and D. Tao, “Idrlp: Image Dehazing Using Region Line Prior,” IEEE Transactions on Image Processing, vol. 30, pp. 9043–9057, 2021.
[13] T. Trongtirakul, W. Chiracharit, and S. S. Agaian, “Single Backlit Image Enhancement,” IEEE Access, vol. 8, pp. 71 940–71 950, 2020.
[14] D. Berman, T. Treibitz, and S. Avidan, “Single Image Dehazing Using Haze-Lines,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 3, pp. 720–734, Mar. 2020.
[15] B. Cai, X. Xu, K. Jia, C. Qing, and D. Tao, “DehazeNet: An End-to-End System for Single Image Haze Removal,” IEEE Transactions on Image Processing, vol. 25, pp. 5187–5198, 2016.
[16] W. Ren, J. Pan, H. Zhang, X. Cao, and M.-H. Yang, “Single image dehazing via multi-scale convolutional neural networks with holistic edges,” International Journal of Computer Vision, vol. 128, pp. 240–259, 2020.
[17] B. Li, X. Peng, Z. Wang, J. Xu, and D. Feng, “AOD-Net: All-in-One Dehazing Network,” 2017 IEEE International Conference on Computer Vision (ICCV), pp. 4780–4788, 2017.
[18] W. Ren, L. Ma, J. Zhang, J. Pan, X. Cao, W. Liu, and M.-H. Yang, “Gated Fusion Network for Single Image Dehazing,” 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3253–3261, 2018.
[19] D. Yang and J. Sun, “Proximal Dehaze-Net: A Prior Learning-Based Deep Network for Single Image Dehazing,” in European Conf. on Computer Vision, 2018.
[20] R. Li, J. Pan, Z. Li, and J. Tang, “Single image dehazing via conditional generative adversarial network,” in Proceedings of the IEEE Conf. on Computer Vision and Pattern Recog., 2018, pp. 8202–8211.
[21] Y. Qu, Y. Chen, J. Huang, and Y. Xie, “Enhanced Pix2pix Dehazing Network,” 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 8152–8160, 2019.
[22] W. Ren, S. Liu, H. Zhang, J. Pan, X. Cao, and M.-H. Yang, “Single Image Dehazing via Multi-scale Convolutional Neural Networks,” in European Conf. on Computer Vision, 2016.
[23] H. Dong, J. Pan, L. Xiang, Z. Hu, X. Zhang, F. Wang, and M.-H. Yang, “Multi-scale boosted dehazing network with dense feature fusion,” in Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recog., 2020, pp. 2157–2167.
[24] X. Qin, Z. Wang, Y. Bai, X. Xie, and H. Jia, “FFA-Net: Feature Fusion Attention Network for Single Image Dehazing,” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 07, pp. 11 908–11 915, Apr. 2020.
[25] S. Zhang and F. He, “DRCDN: learning deep residual convolutional dehazing networks,” The Visual Computer, vol. 36, no. 9, pp. 1797–1808, 2020, publisher: Springer.
[26] S. Zhang, F. He, W. Ren, and J. Yao, “Joint learning of image detail and transmission map for single image dehazing,” The Visual Computer, vol. 36, pp. 305–316, 2020.
[27] S. Zhang, F. He, and W. Ren, “NLDN: Non-local dehazing network for dense haze removal,” Neurocomputing, vol. 410, pp. 363–373, 2020.
[28] Y. Shao, L. Li, W. Ren, C. Gao, and N. Sang, “Domain adaptation for image dehazing,” in Proceedings of the IEEE/CVF Conf. Comp. Vision and Pat. Recog., 2020, pp. 2808–2817.
[29] Z. Chen, Y. Wang, Y. Yang, and D. Liu, “PSD: Principled synthetic-to-real dehazing guided by physical priors,” in Proceedings of the IEEE/CVF Conf. on Comp. Vision and Pat. Recog., 2021, pp. 7180–7189.
[30] H. Wu, Y. Qu, S. Lin, J. Zhou, R. Qiao, Z. Zhang, Y. Xie, and L. Ma, “Contrastive learning for compact single image dehazing,” in Proceedings of the IEEE/CVF Conf. on Comp. Vision and Pat. Recog., 2021, pp. 10 551–10 560.
[31] J. Zhang, F. He, Y. Duan, and S. Yang, “AIDEDNet: Anti-interference and detail enhancement dehazing network for real-world scenes,” Frontiers of Comp. Sci., vol. 17, no. 2, p. 172703, 2023.
[32] J. Zhang, Y. Cao, and Z. Wang, “Nighttime haze removal based on a new imaging model,” in 2014 IEEE International Conference on Image Processing (ICIP), Oct. 2014, pp. 4557–4561.
[33] J. Zhang, Y. Cao, S. Fang, Y. Kang, and C. W. Chen, “Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, HI: IEEE, Jul. 2017, pp. 7016–7024.
[34] W. Wang, A. Wang, and C. Liu, “Variational Single Nighttime Image Haze Removal with a Gray Haze-Line Prior,” IEEE Transactions on Image Processing, vol. 31, pp. 1349–1363, 2022.
[35] J. Zhang, Y. Cao, Z. Zha, and D. Tao, “Nighttime Dehazing with a Synthetic Benchmark,” Proceedings of the 28th ACM International Conference on Multimedia, 2020.
[36] H. Koschmieder, “Theorie der horizontalen Sichtweite,” Beitrage zur Physik der freien Atmosphare, pp. 33–53, 1924.
[37] F. Fang, T. Wang, Y. Wang, T. Zeng, and G. Zhang, “Variational Single Image Dehazing for Enhanced Visualization,” IEEE Transactions on Multimedia, vol. 22, no. 10, pp. 2537–2550, Oct. 2020.
[38] Y. Yu, H. Liu, M. Fu, J. Chen, X. Wang, and K. Wang, “A Two-branch Neural Network for Non-homogeneous Dehazing via Ensemble Learning,” Apr. 2021.
[39] Y. Liu, Z. Yan, T. Ye, A. Wu, and Y. Li, “Single nighttime image dehazing based on unified variational decomposition model and multi-scale contrast enhancement,” Engineering Applications of Artificial Intelligence, vol. 116, p. 105373, Nov. 2022.
[40] Z. Mi, H. Zhou, Y.-J. Zheng, and M. Wang, “Single image dehazing via multi-scale gradient domain contrast enhancement,” IET Image Process., vol. 10, pp. 206–214, 2016.
[41] J.-B. Wang, K. Lu, J. Xue, N. He, and L. Shao, “Single Image Dehazing Based on the Physical Model and MSRCR Algorithm,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 28, pp. 2190–2199, 2018.
[42] Z. Zhu, H. Wei, G. Hu, Y. Li, G. Qi, and N. Mazur, “A Novel Fast Single Image Dehazing Algorithm Based on Artificial Multiexposure Image Fusion,” IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1–23, 2021.
[43] X. Liu, H. Li, and C. Zhu, “Joint Contrast Enhancement and Exposure Fusion for Real-World Image Dehazing,” IEEE Transactions on Multimedia, vol. 24, pp. 3934–3946, 2022.
[44] C. Ancuti, C. O. Ancuti, C. D. Vleeschouwer, and A. C. Bovik, “Night-time dehazing by fusion,” 2016 IEEE International Conference on Image Processing (ICIP), pp. 2256–2260, 2016.
[45] C. O. Ancuti, C. Ancuti, and C. D. Vleeschouwer, “Effective Local Airlight Estimation for Image Dehazing,” 2018 25th IEEE International Conference on Image Processing (ICIP), pp. 2850–2854, 2018.
[46] C. Ancuti, C. O. Ancuti, C. D. Vleeschouwer, and A. C. Bovik, “Day and Night-Time Dehazing by Local Airlight Estimation,” IEEE Transactions on Image Processing, vol. 29, pp. 6264–6275, 2020.
[47] T. Yu, K. Song, P. Miao, G. Yang, H. Yang, and C. Chen, “Nighttime Single Image Dehazing via Pixel-Wise Alpha Blending,” IEEE Access, vol. 7, pp. 114 619–114 630, 2019.
[48] X. Yang, H. Li, Y.-L. Fan, and R. Chen, “Single Image Haze Removal via Region Detection Network,” IEEE Transactions on Multimedia, vol. 21, no. 10, pp. 2545–2560, Oct. 2019.
[49] S. A. Hovhannisyan, H. A. Gasparyan, S. S. Agaian, and A. Ghazaryan, “AED-Net: A Single Image Dehazing,” IEEE Access, vol. 10, pp. 12 465–12 474, 2022.
[50] C. Li, C. Guo, J. Guo, P. Han, H. Fu, and R. Cong, “PDR-Net: Perception-Inspired Single Image Dehazing Network with Refinement,” IEEE Transactions on Multimedia, vol. 22, no. 3, pp. 704–716, Mar. 2020.
[51] C. Lin, X. Rong, and X. Yu, “MSAFF-Net: Multiscale Attention Feature Fusion Networks for Single Image Dehazing and Beyond,” IEEE Transactions on Multimedia, pp. 1–1, 2022.
[52] H. Yu, N. Zheng, M. Zhou, J. Huang, Z. Xiao, and F. Zhao, “Frequency and Spatial Dual Guidance for Image Dehazing,” in Computer Vision – ECCV 2022, S. Avidan, G. Brostow, M. Cissé, G. M. Farinella, and T. Hassner, Eds. Cham: Springer Nature Switzerland, 2022, pp. 181–198.
[53] C. Guo, Q. Yan, S. Anwar, R. Cong, W. Ren, and C. Li, “Image Dehazing Transformer with Transmission-Aware 3D Position Embedding,” in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 5802–5810.
[54] B. Koo and G. Kim, “Nighttime haze removal with glow decomposition using GAN,” in Pattern Recognition: 5th Asian Conference, ACPR 2019, Auckland, New Zealand, November 26–29, 2019, Revised Selected Papers, Part I 5. Springer, 2020, pp. 807–820.
[55] S. Kuanar and M. Bilas, “Night Time Haze and Glow Removal using Deep Dilated Convolutional Network,” p. 13.
[56] W. Yan, R. T. Tan, and D. Dai, “Nighttime Defogging Using High-Low Frequency Decomposition and Grayscale-Color Networks,” in European Conference on Computer Vision, 2020.
[57] L. Chen, X. Lu, J. Zhang, X. Chu, and C. Chen, “HINet: Half Instance Normalization Network for Image Restoration,” in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Nashville, TN, USA: IEEE, Jun. 2021, pp. 182–192.
[58] J. Huang, Y. Liu, X. Fu, M. Zhou, Y. Wang, F. Zhao, and Z. Xiong, “Exposure Normalization and Compensation for Multiple-Exposure Correction,” in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun. 2022, pp. 6033–6042.
[59] Y. Li, R. T. Tan, and M. S. Brown, “Nighttime haze removal with glow and multiple light colors,” … of the IEEE international conference on …, 2015.
[60] A. Sharma and R. T. Tan, “Nighttime Visibility Enhancement by Increasing the Dynamic Range and Suppression of Light Effects,” in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Nashville, TN, USA: IEEE, Jun. 2021, pp. 11 972–11 981.
[61] G. Eilertsen, J. Kronander, G. Denes, R. K. Mantiuk, and J. Unger, “HDR image reconstruction from a single exposure using deep CNNs,” ACM transactions on graphics (TOG), vol. 36, no. 6, pp. 1–15, 2017, publisher: ACM New York, NY, USA.
[62] C. Guo, C. Li, J. Guo, C. C. Loy, J. Hou, S. T. W. Kwong, and R. Cong, “Zero-Reference Deep Curve Estimation for Low-Light Image Enhancement,” 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1777–1786, 2020.
[63] Y. Jiang, X. Gong, D. Liu, Y. Cheng, C. Fang, X. Shen, J. Yang, P. Zhou, and Z. Wang, “EnlightenGAN: Deep Light Enhancement Without Paired Supervision,” IEEE Transactions on Image Processing, vol. 30, pp. 2340–2349, 2019.
[64] Y.-L. Liu, W.-S. Lai, Y.-S. Chen, Y.-L. Kao, M.-H. Yang, Y.-Y. Chuang, and J.-B. Huang, “Single-Image HDR Reconstruction by Learning to Reverse the Camera Pipeline,” 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1648–1657, 2020.
[65] A. S. Krishnan, “Retinex Image Enhancement,” Dec. 2022, original-date: 2020-08-17T09:58:05Z. [Online]. Available: https://github.com/aravindskrishnan/Retinex-Image-Enhancement
[66] H. Cui, J. Li, Z. Hua, and L. Fan, “Attention-Guided Multi-Scale Feature Fusion Network for Low-Light Image Enhancement,” Frontiers in Neurorobotics, vol. 16, p. 837208, Mar. 2022.
[67] S. Gao, M.-M. Cheng, K. Zhao, X. Zhang, M.-H. Yang, and P. H. S. Torr, “Res2Net: A New Multi-Scale Backbone Architecture,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 43, pp. 652–662, 2019.
[68] W. Shi, J. Caballero, F. Huszár, J. Totz, A. P. Aitken, R. Bishop, D. Rueckert, and Z. Wang, “Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1874–1883, 2016.
[69] Y. Zhang, K. Li, K. Li, L. Wang, B. Zhong, and Y. R. Fu, “Image Super-Resolution Using Very Deep Residual Channel Attention Networks,” in European Conference on Computer Vision, 2018.
[70] R. Pascanu, T. Mikolov, and Y. Bengio, “On the difficulty of training recurrent neural networks,” in International Conference on Machine Learning, 2012.
[71] C. Ledig, L. Theis, F. Huszár, J. Caballero, A. P. Aitken, A. Tejani, J. Totz, Z. Wang, and W. Shi, “Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network,” 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 105–114, 2016.
[72] A. B. Petro, C. Sbert, and J.-M. Morel, “Multiscale Retinex,” Image Process. Line, vol. 4, pp. 71–88, 2014.
[73] J. Li, J. Li, F. Fang, F. Li, and G. Zhang, “Luminance-Aware Pyramid Network for Low-Light Image Enhancement,” IEEE Transactions on Multimedia, vol. 23, pp. 3153–3165, 2021.
[74] J. Johnson, A. Alahi, and L. Fei-Fei, “Perceptual Losses for Real-Time Style Transfer and Super-Resolution,” in Computer Vision – ECCV 2016, B. Leibe, J. Matas, N. Sebe, and M. Welling, Eds. Cham: Springer International Publishing, 2016, pp. 694–711.
[75] K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,” CoRR, vol. abs/1409.1556, 2014.
[76] R. B. Girshick, “Fast R-CNN,” 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1440–1448, 2015.
[77] P.-Y. Laffont, Z. Ren, X. Tao, C. Qian, and J. Hays, “Transient Attributes for High-Level Understanding and Editing of Outdoor Scenes,” ACM Transactions on Graphics (proceedings of SIGGRAPH), vol. 33, no. 4, 2014.
[78] A. Buslaev, V. I. Iglovikov, E. Khvedchenya, A. Parinov, M. Druzhinin, and A. A. Kalinin, “Albumentations: Fast and Flexible Image Augmentations,” Information, vol. 11, no. 2, 2020.
[79] Z. Wang, A. C. Bovik, H. R. Sheikh, and E. P. Simoncelli, “Image quality assessment: from error visibility to structural similarity,” IEEE Transactions on Image Processing, vol. 13, pp. 600–612, 2004.
[80] N. Carion, F. Massa, G. Synnaeve, N. Usunier, A. Kirillov, and S. Zagoruyko, “End-to-End Object Detection with Transformers,” CoRR, vol. abs/2005.12872, 2020, _eprint: 2005.12872.
[81] J. Wang, K. C. K. Chan, and C. C. Loy, “Exploring CLIP for Assessing the Look and Feel of Images,” Nov. 2022.
[82] S. Yang, T. Wu, S. Shi, S. Lao, Y. Gong, M. Cao, J. Wang, and Y. Yang, “MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment,” Apr. 2022.
[83] S. A. Golestaneh, S. Dadsetan, and K. M. Kitani, “No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consistency,” in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022, pp. 3209–3218.
[84] C. O. Ancuti, C. Ancuti, R. Timofte, and C. D. Vleeschouwer, “O-HAZE: a dehazing benchmark with real hazy and haze-free outdoor images,” in IEEE Conference on Computer Vision and Pattern Recognition, NTIRE Workshop, ser. NTIRE CVPR’18, 2018, event-place: Salt Lake City, Utah, USA.

PSNR: 11.8033, TRES: 74.7440 (a) hazy \@finalstrut	PSNR: 14.5052, TRES: 68.1582 (b) MRP[33] \@finalstrut	PSNR: 10.4543, TRES: 67.5944 (c) MRPF[33] \@finalstrut
PSNR: 14.8811, TRES: 81.4415 (d) OSFD[35] \@finalstrut	PSNR: 11.6259, TRES: 81.1322 (e) NDELS \@finalstrut	TRES: 95.9874 (f) GT \@finalstrut