-
Evaluating the Usability of Differential Privacy Tools with Data Practitioners
Authors:
Ivoline C. Ngong,
Brad Stenger,
Joseph P. Near,
Yuanyuan Feng
Abstract:
Differential privacy (DP) has become the gold standard in privacy-preserving data analytics, but implementing it in real-world datasets and systems remains challenging. Recently developed DP tools aim to make DP implementation easier, but limited research has investigated these DP tools' usability. Through a usability study with 24 US data practitioners with varying prior DP knowledge, we evaluated the usability of four Python-based open-source DP tools: DiffPrivLib, Tumult Analytics, PipelineDP, and OpenDP. Our results suggest that using DP tools in this study may help DP novices better understand DP; that Application Programming Interface (API) design and documentation are vital for successful DP implementation; and that user satisfaction correlates with how well participants completed study tasks with these DP tools. We provide evidence-based recommendations to improve DP tools' usability to broaden DP adoption.
Submitted 19 February, 2024; v1 submitted 23 September, 2023;
originally announced September 2023.
-
Age Prediction From Face Images Via Contrastive Learning
Authors:
Yeongnam Chae,
Poulami Raha,
Mijung Kim,
Bjorn Stenger
Abstract:
This paper presents a novel approach for accurately estimating age from face images, which overcomes the challenge of collecting a large dataset of individuals with the same identity at different ages. Instead, we leverage readily available face datasets of different people at different ages and aim to extract age-related features using contrastive learning. Our method emphasizes these relevant features while suppressing identity-related features using a combination of cosine similarity and triplet margin losses. We demonstrate the effectiveness of our proposed approach by achieving state-of-the-art performance on two public datasets, FG-NET and MORPH-II.
Submitted 22 August, 2023;
originally announced August 2023.
-
LLDiffusion: Learning Degradation Representations in Diffusion Models for Low-Light Image Enhancement
Authors:
Tao Wang,
Kaihao Zhang,
Ziqian Shao,
Wenhan Luo,
Bjorn Stenger,
Tae-Kyun Kim,
Wei Liu,
Hongdong Li
Abstract:
Current deep learning methods for low-light image enhancement (LLIE) typically rely on pixel-wise mapping learned from paired data. However, these methods often overlook the importance of considering degradation representations, which can lead to sub-optimal outcomes. In this paper, we address this limitation by proposing a degradation-aware learning scheme for LLIE using diffusion models, which effectively integrates degradation and image priors into the diffusion process, resulting in improved image enhancement. Our proposed degradation-aware learning scheme is based on the understanding that degradation representations play a crucial role in accurately modeling and capturing the specific degradation patterns present in low-light images. To this end, first, a joint learning framework for both image generation and image enhancement is presented to learn the degradation representations. Second, to leverage the learned degradation representations, we develop a Low-Light Diffusion model (LLDiffusion) with a well-designed dynamic diffusion module. This module takes into account both the color map and the latent degradation representations to guide the diffusion process. By incorporating these conditioning factors, the proposed LLDiffusion can effectively enhance low-light images, considering both the inherent degradation patterns and the desired color fidelity. Finally, we evaluate our proposed method on several well-known benchmark datasets, including synthetic and real-world unpaired datasets. Extensive experiments on public benchmarks demonstrate that our LLDiffusion outperforms state-of-the-art LLIE methods both quantitatively and qualitatively. The source code and pre-trained models are available at https://github.com/TaoWangzj/LLDiffusion.
Submitted 27 July, 2023;
originally announced July 2023.
-
GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions
Authors:
Tao Wang,
Kaihao Zhang,
Ziqian Shao,
Wenhan Luo,
Bjorn Stenger,
Tong Lu,
Tae-Kyun Kim,
Wei Liu,
Hongdong Li
Abstract:
Image restoration in adverse weather conditions is a difficult task in computer vision. In this paper, we propose a novel transformer-based framework called GridFormer which serves as a backbone for image restoration under adverse weather conditions. GridFormer is designed in a grid structure using a residual dense transformer block, and it introduces two core designs. First, it uses an enhanced attention mechanism in the transformer layer. The mechanism includes stages of the sampler and compact self-attention to improve efficiency, and a local enhancement stage to strengthen local information. Second, we introduce a residual dense transformer block (RDTB) as the final GridFormer layer. This design further improves the network's ability to learn effective features from both preceding and current local features. The GridFormer framework achieves state-of-the-art results on five diverse image restoration tasks in adverse weather conditions, including image deraining, dehazing, deraining & dehazing, desnowing, and multi-weather restoration. The source code and pre-trained models are available at https://github.com/TaoWangzj/GridFormer.
Submitted 21 June, 2024; v1 submitted 28 May, 2023;
originally announced May 2023.
-
Faint but not forgotten. I. First results from a search for astrospheres around AGB stars in the far-ultraviolet
Authors:
Raghvendra Sahai,
Benjamin Stenger
Abstract:
Using the GALEX archive, we have discovered extended structures around ten asymptotic giant branch (AGB) stars (out of a total of 92 searched) emitting in the far-ultraviolet (FUV) band. In all but one, we find the typical morphology expected for a spherical wind moving relative to, and interacting with, the ISM to produce an astrosphere. The exception is V Hya, whose mass-ejection is known to be highly aspherical, where we find evidence of its large parabolic outflows interacting with the ISM, and its collimated, extreme velocity outflows interacting with the circumstellar medium. For 8 objects with relatively large proper motions, we find (as expected) that the termination-shock region lies in a hemisphere that contains the proper motion vector. Radial intensity cuts for each source have been used to locate the termination shock and the astropause's outer edge. In a few objects, the cuts also reveal faint emission just outside the astropause that likely arises in shocked ISM material. We have used these data, together with published mass-loss rates and wind expansion velocities, to determine the total mass lost and duration for each source: we find that the duration of and total mass in the shocked wind are significantly larger than their corresponding values for the unshocked wind. The combination of FUV and far-IR data on AGB astrospheres provides a unique database for theoretical studies (numerical simulations) of wind-ISM interactions. We show that a Cyclical Spatial Heterodyne Spectrometer on a small space-based telescope can provide high-resolution spectra of astrospheres to confirm the emission mechanism.
Submitted 12 May, 2023;
originally announced May 2023.
-
Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method
Authors:
Tao Wang,
Kaihao Zhang,
Tianrun Shen,
Wenhan Luo,
Bjorn Stenger,
Tong Lu
Abstract:
As the quality of optical sensors improves, there is a need for processing large-scale images. In particular, the ability of devices to capture ultra-high definition (UHD) images and video places new demands on the image processing pipeline. In this paper, we consider the task of low-light image enhancement (LLIE) and introduce a large-scale database consisting of images at 4K and 8K resolution. We conduct systematic benchmarking studies and provide a comparison of current LLIE algorithms. As a second contribution, we introduce LLFormer, a transformer-based low-light enhancement method. The core components of LLFormer are the axis-based multi-head self-attention and cross-layer attention fusion block, which significantly reduces the linear complexity. Extensive experiments on the new dataset and existing public datasets show that LLFormer outperforms state-of-the-art methods. We also show that employing existing LLIE methods trained on our benchmark as a pre-processing step significantly improves the performance of downstream tasks, e.g., face detection in low-light conditions. The source code and pre-trained models are available at https://github.com/TaoWangzj/LLFormer.
Submitted 22 December, 2022;
originally announced December 2022.
-
UserBERT: Modeling Long- and Short-Term User Preferences via Self-Supervision
Authors:
Tianyu Li,
Ali Cevahir,
Derek Cho,
Hao Gong,
DuyKhuong Nguyen,
Bjorn Stenger
Abstract:
E-commerce platforms generate vast amounts of customer behavior data, such as clicks and purchases, from millions of unique users every day. However, effectively using this data for behavior understanding tasks is challenging because there are usually not enough labels to learn from all users in a supervised manner. This paper extends the BERT model to e-commerce user data for pre-training representations in a self-supervised manner. By viewing user actions in sequences as analogous to words in sentences, we extend the existing BERT model to user behavior data. Further, our model adopts a unified structure to simultaneously learn from long-term and short-term user behavior, as well as user attributes. We propose methods for the tokenization of different types of user behavior sequences, the generation of input representation vectors, and a novel pretext task to enable the pre-trained model to learn from its own input, eliminating the need for labeled training data. Extensive experiments demonstrate that the learned representations result in significant improvements when transferred to three different real-world tasks, particularly compared to task-specific modeling and multi-task representation learning.
Submitted 14 February, 2022;
originally announced February 2022.
-
Deep Image Deblurring: A Survey
Authors:
Kaihao Zhang,
Wenqi Ren,
Wenhan Luo,
Wei-Sheng Lai,
Bjorn Stenger,
Ming-Hsuan Yang,
Hongdong Li
Abstract:
Image deblurring is a classic problem in low-level computer vision with the aim to recover a sharp image from a blurred input image. Advances in deep learning have led to significant progress in solving this problem, and a large number of deblurring networks have been proposed. This paper presents a comprehensive and timely survey of recently published deep-learning based image deblurring approaches, aiming to serve the community as a useful literature review. We start by discussing common causes of image blur, introduce benchmark datasets and performance metrics, and summarize different problem formulations. Next, we present a taxonomy of methods using convolutional neural networks (CNN) based on architecture, loss function, and application, offering a detailed review and comparison. In addition, we discuss some domain-specific deblurring applications including face images, text, and stereo image pairs. We conclude by discussing key challenges and future research directions.
Submitted 27 May, 2022; v1 submitted 25 January, 2022;
originally announced January 2022.
-
MC-Blur: A Comprehensive Benchmark for Image Deblurring
Authors:
Kaihao Zhang,
Tao Wang,
Wenhan Luo,
Boheng Chen,
Wenqi Ren,
Bjorn Stenger,
Wei Liu,
Hongdong Li,
Ming-Hsuan Yang
Abstract:
Blur artifacts can seriously degrade the visual quality of images, and numerous deblurring methods have been proposed for specific scenarios. However, in most real-world images, blur is caused by different factors, e.g., motion and defocus. In this paper, we address how different deblurring methods perform in the case of multiple types of blur. For in-depth performance evaluation, we construct a new large-scale multi-cause image deblurring dataset (called MC-Blur), including real-world and synthesized blurry images with mixed factors of blurs. The images in the proposed MC-Blur dataset are collected using different techniques: averaging sharp images captured by a 1000-fps high-speed camera, convolving Ultra-High-Definition (UHD) sharp images with large-size kernels, adding defocus to images, and real-world blurry images captured by various camera models. Based on the MC-Blur dataset, we conduct extensive benchmarking studies to compare SOTA methods in different scenarios, analyze their efficiency, and investigate the built dataset's capacity. These benchmarking results provide a comprehensive overview of the advantages and limitations of current deblurring methods, and reveal the advances of our dataset.
Submitted 11 September, 2023; v1 submitted 30 November, 2021;
originally announced December 2021.
-
Deblurring by Realistic Blurring
Authors:
Kaihao Zhang,
Wenhan Luo,
Yiran Zhong,
Lin Ma,
Bjorn Stenger,
Wei Liu,
Hongdong Li
Abstract:
Existing deep learning methods for image deblurring typically train models using pairs of sharp images and their blurred counterparts. However, synthetically blurring images does not necessarily model the genuine blurring process in real-world scenarios with sufficient accuracy. To address this problem, we propose a new method which combines two GAN models, i.e., a learning-to-Blur GAN (BGAN) and learning-to-DeBlur GAN (DBGAN), in order to learn a better model for image deblurring by primarily learning how to blur images. The first model, BGAN, learns how to blur sharp images with unpaired sharp and blurry image sets, and then guides the second model, DBGAN, to learn how to correctly deblur such images. In order to reduce the discrepancy between real blur and synthesized blur, a relativistic blur loss is leveraged. As an additional contribution, this paper also introduces a Real-World Blurred Image (RWBI) dataset including diverse blurry images. Our experiments show that the proposed method achieves consistently superior quantitative performance as well as higher perceptual quality on both the newly proposed dataset and the public GOPRO dataset.
Submitted 6 May, 2020; v1 submitted 4 April, 2020;
originally announced April 2020.
-
Learning Classifiers on Positive and Unlabeled Data with Policy Gradient
Authors:
Tianyu Li,
Chien-Chih Wang,
Yukun Ma,
Patricia Ortal,
Qifang Zhao,
Bjorn Stenger,
Yu Hirate
Abstract:
Existing algorithms aiming to learn a binary classifier from positive (P) and unlabeled (U) data generally require estimating the class prior or label noises ahead of building a classification model. However, the estimation and classifier learning are normally conducted in a pipeline instead of being jointly optimized. In this paper, we propose to alternately train the two steps using reinforcement learning. Our proposal adopts a policy network to adaptively make assumptions on the labels of unlabeled data, while a classifier is built upon the output of the policy network and provides rewards to learn a better strategy. The dynamic and interactive training between the policy maker and the classifier can exploit the unlabeled data in a more effective manner and yield a significant improvement on the classification performance. Furthermore, we present two different approaches to represent the actions sampled from the policy. The first approach considers continuous actions as soft labels, while the other uses discrete actions as hard assignment of labels for unlabeled examples. We validate the effectiveness of the proposed method on two benchmark datasets as well as one e-commerce dataset. The result shows the proposed method is able to consistently outperform state-of-the-art methods in various settings.
Submitted 29 August, 2020; v1 submitted 15 October, 2019;
originally announced October 2019.
-
Deep Heterogeneous Autoencoders for Collaborative Filtering
Authors:
Tianyu Li,
Yukun Ma,
Jiu Xu,
Bjorn Stenger,
Chen Liu,
Yu Hirate
Abstract:
This paper leverages heterogeneous auxiliary information to address the data sparsity problem of recommender systems. We propose a model that learns a shared feature space from heterogeneous data, such as item descriptions, product tags and online purchase history, to obtain better predictions. Our model consists of autoencoders, not only for numerical and categorical data, but also for sequential data, which enables capturing user tastes, item characteristics and the recent dynamics of user preference. We learn the autoencoder architecture for each data source independently in order to better model their statistical properties. Our evaluation on two MovieLens datasets and an e-commerce dataset shows that mean average precision and recall improve over state-of-the-art methods.
Submitted 16 December, 2018;
originally announced December 2018.
-
RGB-based 3D Hand Pose Estimation via Privileged Learning with Depth Images
Authors:
Shanxin Yuan,
Bjorn Stenger,
Tae-Kyun Kim
Abstract:
This paper proposes a method for hand pose estimation from RGB images that uses both external large-scale depth image datasets and paired depth and RGB images as privileged information at training time. We show that providing depth information during training significantly improves performance of pose estimation from RGB images during testing. We explore different ways of using this privileged information: (1) using depth data to initially train a depth-based network, (2) using the features from the depth-based network of the paired depth images to constrain mid-level RGB network weights, and (3) using the foreground mask, obtained from the depth data, to suppress the responses from the background area. By using paired RGB and depth images, we are able to supervise the RGB-based network to learn middle layer features that mimic that of the corresponding depth-based network, which is trained on large-scale, accurately annotated depth data. During testing, when only an RGB image is available, our method produces accurate 3D hand pose predictions. Our method is also tested on 2D hand pose estimation. Experiments on three public datasets show that the method outperforms the state-of-the-art methods for hand pose estimation using RGB image input.
Submitted 18 November, 2018;
originally announced November 2018.
-
Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals
Authors:
Shanxin Yuan,
Guillermo Garcia-Hernando,
Bjorn Stenger,
Gyeongsik Moon,
Ju Yong Chang,
Kyoung Mu Lee,
Pavlo Molchanov,
Jan Kautz,
Sina Honari,
Liuhao Ge,
Junsong Yuan,
Xinghao Chen,
Guijin Wang,
Fan Yang,
Kai Akiyama,
Yang Wu,
Qingfu Wan,
Meysam Madadi,
Sergio Escalera,
Shile Li,
Dongheui Lee,
Iason Oikonomidis,
Antonis Argyros,
Tae-Kyun Kim
Abstract:
In this paper, we strive to answer two questions: What is the current state of 3D hand pose estimation from depth images? And, what are the next challenges that need to be tackled? Following the successful Hands In the Million Challenge (HIM2017), we investigate the top 10 state-of-the-art methods on three tasks: single frame 3D pose estimation, 3D hand tracking, and hand pose estimation during object interaction. We analyze the performance of different CNN structures with regard to hand shape, joint visibility, view point and articulation distributions. Our findings include: (1) isolated 3D hand pose estimation achieves low mean errors (10 mm) in the view point range of [70, 120] degrees, but it is far from being solved for extreme view points; (2) 3D volumetric representations outperform 2D CNNs, better capturing the spatial structure of the depth data; (3) Discriminative methods still generalize poorly to unseen hand shapes; (4) While joint occlusions pose a challenge for most methods, explicit modeling of structure constraints can significantly narrow the gap between errors on visible and occluded joints.
Submitted 29 March, 2018; v1 submitted 11 December, 2017;
originally announced December 2017.
-
BigHand2.2M Benchmark: Hand Pose Dataset and State of the Art Analysis
Authors:
Shanxin Yuan,
Qi Ye,
Bjorn Stenger,
Siddhant Jain,
Tae-Kyun Kim
Abstract:
In this paper we introduce a large-scale hand pose dataset, collected using a novel capture method. Existing datasets are either generated synthetically or captured using depth sensors: synthetic datasets exhibit a certain level of appearance difference from real depth images, and real datasets are limited in quantity and coverage, mainly due to the difficulty to annotate them. We propose a tracking system with six 6D magnetic sensors and inverse kinematics to automatically obtain 21-joints hand pose annotations of depth maps captured with minimal restriction on the range of motion. The capture protocol aims to fully cover the natural hand pose space. As shown in embedding plots, the new dataset exhibits a significantly wider and denser range of hand poses compared to existing benchmarks. Current state-of-the-art methods are evaluated on the dataset, and we demonstrate significant improvements in cross-benchmark performance. We also show significant improvements in egocentric hand pose estimation with a CNN trained on the new dataset.
Submitted 9 December, 2017; v1 submitted 9 April, 2017;
originally announced April 2017.
-
Pano2CAD: Room Layout From A Single Panorama Image
Authors:
Jiu Xu,
Bjorn Stenger,
Tommi Kerola,
Tony Tung
Abstract:
This paper presents a method of estimating the geometry of a room and the 3D pose of objects from a single 360-degree panorama image. Assuming Manhattan World geometry, we formulate the task as a Bayesian inference problem in which we estimate positions and orientations of walls and objects. The method combines surface normal estimation, 2D object detection and 3D object pose estimation. Quantitat…
▽ More
This paper presents a method of estimating the geometry of a room and the 3D pose of objects from a single 360-degree panorama image. Assuming Manhattan World geometry, we formulate the task as a Bayesian inference problem in which we estimate positions and orientations of walls and objects. The method combines surface normal estimation, 2D object detection and 3D object pose estimation. Quantitative results are presented on a dataset of synthetically generated 3D rooms containing objects, as well as on a subset of hand-labeled images from the public SUN360 dataset.
△ Less
Submitted 30 September, 2016; v1 submitted 29 September, 2016;
originally announced September 2016.