-
Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey
Authors:
Marcos V. Conde,
Zhijun Lei,
Wen Li,
Cosmin Stejerean,
Ioannis Katsavounidis,
Radu Timofte,
Kihwan Yoon,
Ganzorig Gankhuyag,
Jiangtao Lv,
Long Sun,
**shan Pan,
Jiangxin Dong,
**hui Tang,
Zhiyuan Li,
Hao Wei,
Chenyang Ge,
Dongyang Zhang,
Tianle Liu,
Huaian Chen,
Yi **,
Menghan Zhou,
Yiqiang Yan,
Si Gao,
Biao Wu,
Shaoli Liu
, et al. (50 additional authors not shown)
Abstract:
This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod…
▽ More
This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF codec, instead of JPEG. All the proposed methods improve PSNR fidelity over Lanczos interpolation, and process images under 10ms. Out of the 160 participants, 25 teams submitted their code and models. The solutions present novel designs tailored for memory-efficiency and runtime on edge devices. This survey describes the best solutions for real-time SR of compressed high-resolution images.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Suppression of Antiferromagnetic Order by Strain in Honeycomb Cobaltate: Implication for Quantum Spin Liquid
Authors:
Gye-Hyeon Kim,
Miju Park,
Uksam Choi,
Baekjune Kang,
Uihyeon Seo,
GwangCheol Ji,
Seunghyeon Noh,
Deok-Yong Cho,
Jung-Woo Yoo,
Jong Mok Ok,
Changhee Sohn
Abstract:
Recently, layered honeycomb cobaltates have been predicted as a new promising system for realizing the Kitaev quantum spin liquid, a many-body quantum entangled ground state characterized by fractional excitations. However, these cobaltates, similar to other candidate materials, exhibit classical antiferromagnetic ordering at low temperatures, which impedes the formation of the expected quantum st…
▽ More
Recently, layered honeycomb cobaltates have been predicted as a new promising system for realizing the Kitaev quantum spin liquid, a many-body quantum entangled ground state characterized by fractional excitations. However, these cobaltates, similar to other candidate materials, exhibit classical antiferromagnetic ordering at low temperatures, which impedes the formation of the expected quantum state. Here, we demonstrate that the control of the trigonal crystal field of Co ions is crucial to suppress classical antiferromagnetic ordering and to locate its ground state in closer vicinity to quantum spin liquid in layered honeycomb cobaltates. By utilizing heterostructure engineering on Cu3Co2SbO6 thin films, we adjust the trigonal distortion of CoO6 octahedra and the associated trigonal crystal field. The original Néel temperature of 16 K in bulk Cu3Co2SbO6 decreases (increases) to 7.8 K (22.7 K) in strained Cu3Co2SbO6 films by decreasing (increasing) the magnitude of the trigonal crystal fields. Our experimental finding substantiates the potential of layered honeycomb cobaltate heterostructures and strain engineering to accomplish the extremely elusive quantum phase of matter.
△ Less
Submitted 20 December, 2023; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Variational non-Bayesian inference of the Probability Density Function in the Wiener Algebra
Authors:
U ** Choi,
Kyung Soo Rim
Abstract:
This paper presents a research study focused on uncovering the hidden population distribution from the viewpoint of a variational non-Bayesian approach. It asserts that if the hidden probability density function (PDF) has continuous partial derivatives of at least half the dimension's order, it can be perfectly reconstructed from a stationary ergodic process: First, we establish that if the PDF be…
▽ More
This paper presents a research study focused on uncovering the hidden population distribution from the viewpoint of a variational non-Bayesian approach. It asserts that if the hidden probability density function (PDF) has continuous partial derivatives of at least half the dimension's order, it can be perfectly reconstructed from a stationary ergodic process: First, we establish that if the PDF belongs to the Wiener algebra, its canonical ensemble form is uniquely determined through the Fréchet differentiation of the Kullback-Leibler divergence, aiming to minimize their cross-entropy. Second, we utilize the result that the differentiability of the PDF implies its membership in the Wiener algebra. Third, as the energy function of the canonical ensemble is defined as a series, the problem transforms into finding solutions to the equations of analytic series for the coefficients in the energy function. Naturally, through the use of truncated polynomial series and by demonstrating the convergence of partial sums of the energy function, we ensure the efficiency of approximation with a finite number of data points. Finally, through numerical experiments, we approximate the PDF from a random sample obtained from a bivariate normal distribution and also provide approximations for the mean and covariance from the PDF. This study substantiates the excellence of its results and their practical applicability.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
Optical detection of bond-dependent and frustrated spin in the two-dimensional cobalt-based honeycomb antiferromagnet Cu3Co2SbO6
Authors:
Baekjune Kang,
Uksam Choi,
Taek Sun Jung,
Seunghyeon Noh,
Gye-Hyeon Kim,
UiHyeon Seo,
Miju Park,
**-Hyun Choi,
Minjae Kim,
GwangCheol Ji,
Sehwan Song,
Hyesung Jo,
Seokjo Hong,
Nguyen Xuan Duong,
Tae Heon Kim,
Yongsoo Yang,
Sungkyun Park,
Jong Mok Ok,
Jung-Woo Yoo,
Jae Hoon Kim,
Changhee Sohn
Abstract:
Two-dimensional honeycomb antiferromagnet becomes an important class of materials as it can provide a route to Kitaev quantum spin liquid, characterized by massive quantum entanglement and fractional excitations. The signatures of its proximity to Kitaev quantum spin liquid in the honeycomb antiferromagnet includes anisotropic bond-dependent magnetic responses and persistent fluctuation by frustra…
▽ More
Two-dimensional honeycomb antiferromagnet becomes an important class of materials as it can provide a route to Kitaev quantum spin liquid, characterized by massive quantum entanglement and fractional excitations. The signatures of its proximity to Kitaev quantum spin liquid in the honeycomb antiferromagnet includes anisotropic bond-dependent magnetic responses and persistent fluctuation by frustration in paramagnetic regime. Here, we propose Cu3Co2SbO6 heterostructures as an intriguing honeycomb antiferromagnet for quantum spin liquid, wherein bond-dependent and frustrated spins interact with optical excitons. This system exhibits antiferromagnetism at 16 K with different spin-flip magnetic fields between a bond-parallel and bond-perpendicular directions, aligning more closely with the generalized Heisenberg-Kitaev than the XXZ model. Optical spectroscopy reveals a strong excitonic transition coupled to the antiferromagnetism, enabling optical detection of its spin states. Particularly, such spin-exciton coupling presents anisotropic responses between bond-parallel and bond-perpendicular magnetic field as well as a finite spin-spin correlation function around 40 K, higher than twice its Néel temperature. The characteristic temperature that remains barely changed even under strong magnetic fields highlights the robustness of the spin-fluctuation region. Our results demonstrate Cu3Co2SbO6 as a unique candidate for the quantum spin liquid phase, where the spin Hamiltonian and quasiparticle excitations can be probed and potentially controlled by light.
△ Less
Submitted 27 September, 2023;
originally announced September 2023.
-
Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report
Authors:
Andrey Ignatov,
Radu Timofte,
Shuai Liu,
Chaoyu Feng,
Furui Bai,
Xiaotao Wang,
Lei Lei,
Ziyao Yi,
Yan Xiang,
Zibin Liu,
Shaoqing Li,
Keming Shi,
Dehui Kong,
Ke Xu,
Minsu Kwon,
Yaqi Wu,
Jiesi Zheng,
Zhihao Fan,
Xun Wu,
Feng Zhang,
Albert No,
Minhyeok Cho,
Zewen Chen,
Xiaze Zhang,
Ran Li
, et al. (13 additional authors not shown)
Abstract:
The role of mobile cameras increased dramatically over the past few years, leading to more and more research in automatic image quality enhancement and RAW photo processing. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based image signal processing (ISP) pipeline replacing the standard mobile ISPs that can run on modern smartphone GPUs using TensorFlow Lite. Th…
▽ More
The role of mobile cameras increased dramatically over the past few years, leading to more and more research in automatic image quality enhancement and RAW photo processing. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based image signal processing (ISP) pipeline replacing the standard mobile ISPs that can run on modern smartphone GPUs using TensorFlow Lite. The participants were provided with a large-scale Fujifilm UltraISP dataset consisting of thousands of paired photos captured with a normal mobile camera sensor and a professional 102MP medium-format FujiFilm GFX100 camera. The runtime of the resulting models was evaluated on the Snapdragon's 8 Gen 1 GPU that provides excellent acceleration results for the majority of common deep learning ops. The proposed solutions are compatible with all recent mobile GPUs, being able to process Full HD photos in less than 20-50 milliseconds while achieving high fidelity results. A detailed description of all models developed in this challenge is provided in this paper.
△ Less
Submitted 7 November, 2022;
originally announced November 2022.
-
Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration
Authors:
Marcos V. Conde,
Ui-** Choi,
Maxime Burchi,
Radu Timofte
Abstract:
Compression plays an important role on the efficient transmission and storage of images and videos through band-limited systems such as streaming services, virtual reality or videogames. However, compression unavoidably leads to artifacts and the loss of the original information, which may severely degrade the visual quality. For these reasons, quality enhancement of compressed images has become a…
▽ More
Compression plays an important role on the efficient transmission and storage of images and videos through band-limited systems such as streaming services, virtual reality or videogames. However, compression unavoidably leads to artifacts and the loss of the original information, which may severely degrade the visual quality. For these reasons, quality enhancement of compressed images has become a popular research topic. While most state-of-the-art image restoration methods are based on convolutional neural networks, other transformers-based methods such as SwinIR, show impressive performance on these tasks.
In this paper, we explore the novel Swin Transformer V2, to improve SwinIR for image super-resolution, and in particular, the compressed input scenario. Using this method we can tackle the major issues in training transformer vision models, such as training instability, resolution gaps between pre-training and fine-tuning, and hunger on data. We conduct experiments on three representative tasks: JPEG compression artifacts removal, image super-resolution (classical and lightweight), and compressed image super-resolution. Experimental results demonstrate that our method, Swin2SR, can improve the training convergence and performance of SwinIR, and is a top-5 solution at the "AIM 2022 Challenge on Super-Resolution of Compressed Image and Video".
△ Less
Submitted 22 September, 2022;
originally announced September 2022.
-
The PWLR Graph Representation: A Persistent Weisfeiler-Lehman scheme with Random Walks for Graph Classification
Authors:
Sun Woo Park,
Yun Young Choi,
Dosang Joe,
U ** Choi,
Youngho Woo
Abstract:
This paper presents the Persistent Weisfeiler-Lehman Random walk scheme (abbreviated as PWLR) for graph representations, a novel mathematical framework which produces a collection of explainable low-dimensional representations of graphs with discrete and continuous node features. The proposed scheme effectively incorporates normalized Weisfeiler-Lehman procedure, random walks on graphs, and persis…
▽ More
This paper presents the Persistent Weisfeiler-Lehman Random walk scheme (abbreviated as PWLR) for graph representations, a novel mathematical framework which produces a collection of explainable low-dimensional representations of graphs with discrete and continuous node features. The proposed scheme effectively incorporates normalized Weisfeiler-Lehman procedure, random walks on graphs, and persistent homology. We thereby integrate three distinct properties of graphs, which are local topological features, node degrees, and global topological invariants, while preserving stability from graph perturbations. This generalizes many variants of Weisfeiler-Lehman procedures, which are primarily used to embed graphs with discrete node labels. Empirical results suggest that these representations can be efficiently utilized to produce comparable results to state-of-the-art techniques in classifying graphs with discrete node labels, and enhanced performances in classifying those with continuous node features.
△ Less
Submitted 29 August, 2022;
originally announced August 2022.
-
AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results
Authors:
Ren Yang,
Radu Timofte,
Xin Li,
Qi Zhang,
Lin Zhang,
Fanglong Liu,
Dongliang He,
Fu li,
He Zheng,
Weihang Yuan,
Pavel Ostyakov,
Dmitry Vyal,
Magauiya Zhussip,
Xueyi Zou,
Youliang Yan,
Lei Li,
**gzhu Tang,
Ming Chen,
Shijie Zhao,
Yu Zhu,
Xiaoran Qin,
Chenghua Li,
Cong Leng,
Jian Cheng,
Claudio Rota
, et al. (28 additional authors not shown)
Abstract:
This paper reviews the Challenge on Super-Resolution of Compressed Image and Video at AIM 2022. This challenge includes two tracks. Track 1 aims at the super-resolution of compressed image, and Track~2 targets the super-resolution of compressed video. In Track 1, we use the popular dataset DIV2K as the training, validation and test sets. In Track 2, we propose the LDV 3.0 dataset, which contains 3…
▽ More
This paper reviews the Challenge on Super-Resolution of Compressed Image and Video at AIM 2022. This challenge includes two tracks. Track 1 aims at the super-resolution of compressed image, and Track~2 targets the super-resolution of compressed video. In Track 1, we use the popular dataset DIV2K as the training, validation and test sets. In Track 2, we propose the LDV 3.0 dataset, which contains 365 videos, including the LDV 2.0 dataset (335 videos) and 30 additional videos. In this challenge, there are 12 teams and 2 teams that submitted the final results to Track 1 and Track 2, respectively. The proposed methods and solutions gauge the state-of-the-art of super-resolution on compressed image and video. The proposed LDV 3.0 dataset is available at https://github.com/RenYang-home/LDV_dataset. The homepage of this challenge is at https://github.com/RenYang-home/AIM22_CompressSR.
△ Less
Submitted 25 August, 2022; v1 submitted 23 August, 2022;
originally announced August 2022.
-
Few-shot Long-Tailed Bird Audio Recognition
Authors:
Marcos V. Conde,
Ui-** Choi
Abstract:
It is easier to hear birds than see them. However, they still play an essential role in nature and are excellent indicators of deteriorating environmental quality and pollution. Recent advances in Deep Neural Networks allow us to process audio data to detect and classify birds. This technology can assist researchers in monitoring bird populations and biodiversity. We propose a sound detection and…
▽ More
It is easier to hear birds than see them. However, they still play an essential role in nature and are excellent indicators of deteriorating environmental quality and pollution. Recent advances in Deep Neural Networks allow us to process audio data to detect and classify birds. This technology can assist researchers in monitoring bird populations and biodiversity. We propose a sound detection and classification pipeline to analyze complex soundscape recordings and identify birdcalls in the background. Our method learns from weak labels and few data and acoustically recognizes the bird species. Our solution achieved 18th place of 807 teams at the BirdCLEF 2022 Challenge hosted on Kaggle.
△ Less
Submitted 4 July, 2022; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Probabilistic Neural Network: Frequency and Moment Learnings
Authors:
Kyung Soo Rim,
U ** Choi
Abstract:
We introduce probabilistic neural networks that describe unsupervised synchronous learning on an atomic Hardy space and space of bounded real analytic functions, respectively. For a stationary ergodic vector process, we prove that the probabilistic neural network yields a unique collection of neurons in global optimization without initialization and back-propagation. During learning, we show that…
▽ More
We introduce probabilistic neural networks that describe unsupervised synchronous learning on an atomic Hardy space and space of bounded real analytic functions, respectively. For a stationary ergodic vector process, we prove that the probabilistic neural network yields a unique collection of neurons in global optimization without initialization and back-propagation. During learning, we show that all neurons communicate with each other, in the sense of linear combinations, until the learning is finished. Also, we give convergence results for the stability of neurons, estimation methods, and topological statistics to appreciate unsupervised estimation of a probabilistic neural network. As application, we attach numerical experiments on samples drawn by a standing wave.
△ Less
Submitted 22 April, 2020;
originally announced April 2020.