Skip to main content

Showing 1–34 of 34 results for author: Callet, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.01024  [pdf, other

    cs.CV eess.IV

    AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images

    Authors: Liu Yang, Huiyu Duan, Long Teng, Yucheng Zhu, Xiaohong Liu, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet

    Abstract: In recent years, the rapid advancement of Artificial Intelligence Generated Content (AIGC) has attracted widespread attention. Among the AIGC, AI generated omnidirectional images hold significant potential for Virtual Reality (VR) and Augmented Reality (AR) applications, hence omnidirectional AIGC techniques have also been widely studied. AI-generated omnidirectional images exhibit unique distorti… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  2. Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low latency Encoding

    Authors: Vignesh V Menon, **gwen Zhu, Prajit T Rajendran, Samira Afzal, Klaus Schoeffmann, Patrick Le Callet, Christian Timmerer

    Abstract: In HTTP adaptive live streaming applications, video segments are encoded at a fixed set of bitrate-resolution pairs known as bitrate ladder. Live encoders use the fastest available encoding configuration, referred to as preset, to ensure the minimum possible latency in video encoding. However, an optimized preset and optimized number of CPU threads for each encoding instance may result in (i) incr… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 2024 Mile High Video (MHV)

  3. arXiv:2305.00225  [pdf, other

    cs.MM

    Just Noticeable Difference-aware Per-Scene Bitrate-laddering for Adaptive Video Streaming

    Authors: Vignesh V Menon, **gwen Zhu, Prajit T Rajendran, Hadi Amirpour, Patrick Le Callet, Christian Timmerer

    Abstract: In video streaming applications, a fixed set of bitrate-resolution pairs (known as a bitrate ladder) is typically used during the entire streaming session. However, an optimized bitrate ladder per scene may result in (i) decreased storage or delivery costs or/and (ii) increased Quality of Experience. This paper introduces a Just Noticeable Difference (JND)-aware per-scene bitrate ladder prediction… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

    Comments: 2023 IEEE International Conference on Multimedia and Expo (ICME)

  4. arXiv:2304.09064  [pdf, other

    cs.HC cs.AI

    LLM-based Interaction for Content Generation: A Case Study on the Perception of Employees in an IT department

    Authors: Alexandre Agossah, Frédérique Krupa, Matthieu Perreira Da Silva, Patrick Le Callet

    Abstract: In the past years, AI has seen many advances in the field of NLP. This has led to the emergence of LLMs, such as the now famous GPT-3.5, which revolutionise the way humans can access or generate content. Current studies on LLM-based generative tools are mainly interested in the performance of such tools in generating relevant content (code, text or image). However, ethical concerns related to the… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 14 pages (bibliography inclued), 6 figures, preprint submitted to Work-In-Progress session of ACM IMX'23 Interactive Media Experience

    ACM Class: I.2.7; J.7

  5. arXiv:2302.04796  [pdf, other

    cs.MM cs.GR eess.IV

    BASICS: Broad quality Assessment of Static point clouds In Compression Scenarios

    Authors: Ali Ak, Emin Zerman, Maurice Quach, Aladine Chetouani, Aljosa Smolic, Giuseppe Valenzise, Patrick Le Callet

    Abstract: Point clouds are now commonly used to represent 3D scenes in virtual world, in addition to 3D meshes. Their ease of capture enable various applications on mobile devices, such as smartphones or other microcontrollers. Point cloud compression is now at an advanced level and being standardized. Nevertheless, quality assessment databases, which is needed to develop better objective quality metrics, a… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Manuscript in preparation, 11 pages, 8 figures

  6. arXiv:2205.08007  [pdf, other

    cs.MM cs.SD eess.AS eess.IV

    Perceptual Evaluation on Audio-visual Dataset of 360 Content

    Authors: Randy F Fela, Andréas Pastor, Patrick Le Callet, Nick Zacharov, Toinon Vigier, Søren Forchhammer

    Abstract: To open up new possibilities to assess the multimodal perceptual quality of omnidirectional media formats, we proposed a novel open source 360 audiovisual (AV) quality dataset. The dataset consists of high-quality 360 video clips in equirectangular (ERP) format and higher-order ambisonic (4th order) along with the subjective scores. Three subjective quality experiments were conducted for audio, vi… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: 6 pages, 5 figures, International Conference on Multimedia and Expo 2022

  7. arXiv:2205.03574  [pdf, other

    cs.CV eess.IV

    Utility-Oriented Underwater Image Quality Assessment Based on Transfer Learning

    Authors: Weiling Chen, Rongfu Lin, Honggang Liao, Tiesong Zhao, Ke Gu, Patrick Le Callet

    Abstract: The widespread image applications have greatly promoted the vision-based tasks, in which the Image Quality Assessment (IQA) technique has become an increasingly significant issue. For user enjoyment in multimedia systems, the IQA exploits image fidelity and aesthetics to characterize user experience; while for other tasks such as popular object recognition, there exists a low correlation between u… ▽ More

    Submitted 7 May, 2022; originally announced May 2022.

  8. Confusing Image Quality Assessment: Towards Better Augmented Reality Experience

    Authors: Huiyu Duan, Xiongkuo Min, Yucheng Zhu, Guangtao Zhai, Xiaokang Yang, Patrick Le Callet

    Abstract: With the development of multimedia technology, Augmented Reality (AR) has become a promising next-generation mobile platform. The primary value of AR is to promote the fusion of digital contents and real-world environments, however, studies on how this fusion will influence the Quality of Experience (QoE) of these two components are lacking. To achieve better QoE of AR, whose two layers are influe… ▽ More

    Submitted 31 October, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  9. arXiv:2203.13186  [pdf, other

    q-bio.NC cs.LG

    Improving Maximum Likelihood Difference Scaling method to measure inter content scale

    Authors: Pastor Andréas, Lukáš Krasula, Xiaoqing Zhu, Zhi Li, Patrick Le Callet

    Abstract: The goal of most subjective studies is to place a set of stimuli on a perceptual scale. This is mostly done directly by rating, e.g. using single or double stimulus methodologies, or indirectly by ranking or pairwise comparison. All these methods estimate the perceptual magnitudes of the stimuli on a scale. However, procedures such as Maximum Likelihood Difference Scaling (MLDS) have shown that co… ▽ More

    Submitted 25 February, 2022; originally announced March 2022.

    Comments: Difference scaling, supra-threshold estimation, human perception, subjective experiment

  10. arXiv:2202.02397  [pdf, other

    cs.GR

    Textured Mesh Quality Assessment: Large-Scale Dataset and Deep Learning-based Quality Metric

    Authors: Yana Nehmé, Johanna Delanoy, Florent Dupont, Jean-Philippe Farrugia, Patrick Le Callet, Guillaume Lavoué

    Abstract: Over the past decade, 3D graphics have become highly detailed to mimic the real world, exploding their size and complexity. Certain applications and device constraints necessitate their simplification and/or lossy compression, which can degrade their visual quality. Thus, to ensure the best Quality of Experience (QoE), it is important to evaluate the visual quality to accurately drive the compress… ▽ More

    Submitted 8 May, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

    ACM Class: I.3

  11. arXiv:2110.06956  [pdf, other

    cs.CV cs.AI

    Considering user agreement in learning to predict the aesthetic quality

    Authors: Suiyi Ling, Andreas Pastor, Junle Wang, Patrick Le Callet

    Abstract: How to robustly rank the aesthetic quality of given images has been a long-standing ill-posed topic. Such challenge stems mainly from the diverse subjective opinions of different observers about the varied types of content. There is a growing interest in estimating the user agreement by considering the standard deviation of the scores, instead of only predicting the mean aesthetic opinion score. N… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: 5 pages

    MSC Class: 68T07 ACM Class: I.4.0

  12. arXiv:2103.05099  [pdf, other

    cs.CV cs.LG eess.IV

    Subjective and Objective Quality Assessment of Mobile Gaming Video

    Authors: Shaoguo Wen, Suiyi Ling, Junle Wang, Ximing Chen, Lizhi Fang, Yanqing **g, Patrick Le Callet

    Abstract: Nowadays, with the vigorous expansion and development of gaming video streaming techniques and services, the expectation of users, especially the mobile phone users, for higher quality of experience is also growing swiftly. As most of the existing research focuses on traditional video streaming, there is a clear lack of both subjective study and objective quality models that are tailored for quali… ▽ More

    Submitted 27 January, 2021; originally announced March 2021.

    Comments: 5 pages

    MSC Class: 68U10 ACM Class: J.0

  13. arXiv:2102.07599  [pdf, other

    cs.AI

    Seeing by haptic glance: reinforcement learning-based 3D object Recognition

    Authors: Kevin Riou, Suiyi Ling, Guillaume Gallot, Patrick Le Callet

    Abstract: Human is able to conduct 3D recognition by a limited number of haptic contacts between the target object and his/her fingers without seeing the object. This capability is defined as `haptic glance' in cognitive neuroscience. Most of the existing 3D recognition models were developed based on dense 3D data. Nonetheless, in many real-life use cases, where robots are used to collect 3D data by haptic… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: 5 pages

    MSC Class: 68T07 ACM Class: I.2

  14. arXiv:2101.11700  [pdf, other

    cs.CV cs.AI

    Multi-Modal Aesthetic Assessment for MObile Gaming Image

    Authors: Zhenyu Lei, Ye**g Xie, Suiyi Ling, Andreas Pastor, Junle Wang, Patrick Le Callet

    Abstract: With the proliferation of various gaming technology, services, game styles, and platforms, multi-dimensional aesthetic assessment of the gaming contents is becoming more and more important for the gaming industry. Depending on the diverse needs of diversified game players, game designers, graphical developers, etc. in particular conditions, multi-modal aesthetic assessment is required to consider… ▽ More

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: 5 pages

    MSC Class: 68U10 ACM Class: J.0

  15. Wide Color Gamut Image Content Characterization: Method, Evaluation, and Applications

    Authors: Junghyuk Lee, Toinon Vigier, Patrick Le Callet, Jong-Seok Lee

    Abstract: In this paper, we propose a novel framework to characterize a wide color gamut image content based on perceived quality due to the processes that change color gamut, and demonstrate two practical use cases where the framework can be applied. We first introduce the main framework and implementation details. Then, we provide analysis for understanding of existing wide color gamut datasets with quant… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

    Journal ref: IEEE Transactions on Multimedia (2020)

  16. Ambiguity of Objective Image Quality Metrics: A New Methodology for Performance Evaluation

    Authors: Manri Cheon, Toinon Vigier, Lukáš Krasula, Junghyuk Lee, Patrick Le Callet, Jong-Seok Lee

    Abstract: Objective image quality metrics try to estimate the perceptual quality of the given image by considering the characteristics of the human visual system. However, it is possible that the metrics produce different quality scores even for two images that are perceptually indistinguishable by human viewers, which have not been considered in the existing studies related to objective quality assessment.… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

    Journal ref: Signal Processing: Image Communication (2021)

  17. arXiv:2011.02719  [pdf, other

    cs.CV cs.LG

    Few-Shot Object Detection in Real Life: Case Study on Auto-Harvest

    Authors: Kevin Riou, **gwen Zhu, Suiyi Ling, Mathis Piquet, Vincent Truffault, Patrick Le Callet

    Abstract: Confinement during COVID-19 has caused serious effects on agriculture all over the world. As one of the efficient solutions, mechanical harvest/auto-harvest that is based on object detection and robotic harvester becomes an urgent need. Within the auto-harvest system, robust few-shot object detection model is one of the bottlenecks, since the system is required to deal with new vegetable/fruit cat… ▽ More

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: 6 pages

  18. arXiv:2010.00370  [pdf, other

    cs.AI

    Strategy for Boosting Pair Comparison and Improving Quality Assessment Accuracy

    Authors: Suiyi Ling, **g Li, Anne Flore Perrin, Zhi Li, Lukáš Krasula, Patrick Le Callet

    Abstract: The development of rigorous quality assessment model relies on the collection of reliable subjective data, where the perceived quality of visual multimedia is rated by the human observers. Different subjective assessment protocols can be used according to the objectives, which determine the discriminability and accuracy of the subjective data. Single stimulus methodology, e.g., the Absolute Cate… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: 8 pages, 11 figures

    MSC Class: 68T05 ACM Class: I.2.6

  19. arXiv:2003.10810  [pdf, other

    cs.LG stat.ML

    Capturing and Explaining Trajectory Singularities using Composite Signal Neural Networks

    Authors: Hippolyte Dubois, Patrick Le Callet, Michael Hornberger, Hugo J. Spiers, Antoine Coutrot

    Abstract: Spatial trajectories are ubiquitous and complex signals. Their analysis is crucial in many research fields, from urban planning to neuroscience. Several approaches have been proposed to cluster trajectories. They rely on hand-crafted features, which struggle to capture the spatio-temporal complexity of the signal, or on Artificial Neural Networks (ANNs) which can be more efficient but less interpr… ▽ More

    Submitted 7 May, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

    Comments: 5 pages, 9 figures, submitted to Eusipco2020 conference

  20. arXiv:2003.00475  [pdf, other

    cs.AI

    GPM: A Generic Probabilistic Model to Recover Annotator's Behavior and Ground Truth Labeling

    Authors: **g Li, Suiyi Ling, Junle Wang, Zhi Li, Patrick Le Callet

    Abstract: In the big data era, data labeling can be obtained through crowdsourcing. Nevertheless, the obtained labels are generally noisy, unreliable or even adversarial. In this paper, we propose a probabilistic graphical annotation model to infer the underlying ground truth and annotator's behavior. To accommodate both discrete and continuous application scenarios (e.g., classifying scenes vs. rating vide… ▽ More

    Submitted 1 March, 2020; originally announced March 2020.

  21. Spectral domain decomposition method for physically-based rendering of photochromic/electrochromic glass windows

    Authors: Guillaume Gbikpi-Benissan, Patrick Callet, Frederic Magoules

    Abstract: This paper covers the time consuming issues intrinsic to physically-based image rendering algorithms. First, glass materials optical properties were measured on samples of real glasses and other objects materials inside an hotel room were characterized by deducing spectral data from multiple trichromatic images. We then present the rendering model and ray-tracing algorithm implemented in Virtueliu… ▽ More

    Submitted 9 December, 2019; originally announced December 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1912.05494

  22. Spectral Domain Decomposition Method for Natural Lighting and Medieval Glass Rendering

    Authors: Guillaume Gbikpi-Benissan, Remi Cerise, Patrick Callet, Frederic Magoules

    Abstract: In this paper, we use an original ray-tracing domain decomposition method to address image rendering of naturally lighted scenes. This new method allows to particularly analyze rendering problems on parallel architectures, in the case of interactions between light-rays and glass material. Numerical experiments, for medieval glass rendering within the church of the Royaumont abbey, illustrate the p… ▽ More

    Submitted 9 December, 2019; originally announced December 2019.

  23. Spectral domain decomposition method for physically-based rendering of Royaumont abbey

    Authors: Guillaume Gbikpi-Benissan, Patrick Callet, Frederic Magoules

    Abstract: In the context of a virtual reconstitution of the destroyed Royaumont abbey church, this paper investigates computer sciences issues intrinsic to the physically-based image rendering. First, a virtual model was designed from historical sources and archaeological descriptions. Then some materials physical properties were measured on remains of the church and on pieces from similar ancient churches.… ▽ More

    Submitted 9 December, 2019; originally announced December 2019.

  24. arXiv:1911.07682  [pdf, other

    cs.LG stat.ML

    A New Ensemble Adversarial Attack Powered by Long-term Gradient Memories

    Authors: Zhaohui Che, Ali Borji, Guangtao Zhai, Suiyi Ling, **g Li, Patrick Le Callet

    Abstract: Deep neural networks are vulnerable to adversarial attacks.

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI2020

  25. arXiv:1909.01738  [pdf, other

    cs.MM eess.IV

    Binocular Rivalry Oriented Predictive Auto-Encoding Network for Blind Stereoscopic Image Quality Measurement

    Authors: Jiahua Xu, Wei Zhou, Zhibo Chen, Suiyi Ling, Patrick Le Callet

    Abstract: Stereoscopic image quality measurement (SIQM) has become increasingly important for guiding stereo image processing and commutation systems due to the widespread usage of 3D contents. Compared with conventional methods which are relied on hand-crafted features, deep learning oriented measurements have achieved remarkable performance in recent years. However, most existing deep SIQM evaluators are… ▽ More

    Submitted 1 November, 2020; v1 submitted 4 September, 2019; originally announced September 2019.

  26. How is Gaze Influenced by Image Transformations? Dataset and Model

    Authors: Zhaohui Che, Ali Borji, Guangtao Zhai, Xiongkuo Min, Guodong Guo, Patrick Le Callet

    Abstract: Data size is the bottleneck for develo** deep saliency models, because collecting eye-movement data is very time consuming and expensive. Most of current studies on human attention and saliency modeling have used high quality stereotype stimuli. In real world, however, captured images undergo various types of transformations. Can we use these transformations to augment existing saliency datasets… ▽ More

    Submitted 3 October, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

  27. arXiv:1905.00161  [pdf, other

    eess.IV cs.MM

    State-of-the-art in 360° Video/Image Processing: Perception, Assessment and Compression

    Authors: Chen Li, Mai Xu, Shanyi Zhang, Patrick Le Callet

    Abstract: Nowadays, 360° video/image has been increasingly popular and drawn great attention. The spherical viewing range of 360° video/image accounts for huge data, which pose the challenges to 360° video/image processing in solving the bottleneck of storage, transmission, etc. Accordingly, the recent years have witnessed the explosive emergence of works on 360° video/image processing. In this paper, we re… ▽ More

    Submitted 28 October, 2019; v1 submitted 30 April, 2019; originally announced May 2019.

    Comments: Submitted to IEEE J-STSP SI of Perception-driven 360-degree video processing as an Invited Overview Paper

  28. arXiv:1904.01231  [pdf, other

    cs.CV

    Adversarial Attacks against Deep Saliency Models

    Authors: Zhaohui Che, Ali Borji, Guangtao Zhai, Suiyi Ling, Guodong Guo, Patrick Le Callet

    Abstract: Currently, a plethora of saliency models based on deep neural networks have led great breakthroughs in many complex high-level vision tasks (e.g. scene description, object detection). The robustness of these models, however, has not yet been studied. In this paper, we propose a sparse feature-space adversarial attack method against deep saliency models for the first time. The proposed attack only… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

  29. arXiv:1903.12107  [pdf, other

    cs.MM

    Quality Assessment of Free-viewpoint Videos by Quantifying the Elastic Changes of Multi-Scale Motion Trajectories

    Authors: Suiyi Ling, **g Li, Zhaohui Che, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet

    Abstract: Virtual viewpoints synthesis is an essential process for many immersive applications including Free-viewpoint TV (FTV). A widely used technique for viewpoints synthesis is Depth-Image-Based-Rendering (DIBR) technique. However, such techniques may introduce challenging non-uniform spatial-temporal structure-related distortions. Most of the existing state-of-the-art quality metrics fail to handle th… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: 13 pages

  30. arXiv:1903.12088  [pdf, other

    cs.MM cs.CV

    GANs-NQM: A Generative Adversarial Networks based No Reference Quality Assessment Metric for RGB-D Synthesized Views

    Authors: Suiyi Ling, **g Li, Junle Wang, Patrick Le Callet

    Abstract: In this paper, we proposed a no-reference (NR) quality metric for RGB plus image-depth (RGB-D) synthesis images based on Generative Adversarial Networks (GANs), namely GANs-NQM. Due to the failure of the inpainting on dis-occluded regions in RGB-D synthesis process, to capture the non-uniformly distributed local distortions and to learn their impact on perceptual quality are challenging tasks for… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

  31. arXiv:1810.08851  [pdf, other

    cs.LG stat.ML

    Hybrid-MST: A Hybrid Active Sampling Strategy for Pairwise Preference Aggregation

    Authors: **g Li, Rafal K. Mantiuk, Junle Wang, Suiyi Ling, Patrick Le Callet

    Abstract: In this paper we present a hybrid active sampling strategy for pairwise preference aggregation, which aims at recovering the underlying rating of the test candidates from sparse and noisy pairwise labelling. Our method employs Bayesian optimization framework and Bradley-Terry model to construct the utility function, then to obtain the Expected Information Gain (EIG) of each pair. For computational… ▽ More

    Submitted 20 October, 2018; originally announced October 2018.

    Comments: NIPS 2018

  32. arXiv:1810.04409  [pdf, other

    cs.CV

    Prediction of the Influence of Navigation Scan-path on Perceived Quality of Free-Viewpoint Videos

    Authors: Suiyi Ling, Jesús Gutiérrez, Gu Ke, Patrick Le Callet

    Abstract: Free-Viewpoint Video (FVV) systems allow the viewers to freely change the viewpoints of the scene. In such systems, view synthesis and compression are the two main sources of artifacts influencing the perceived quality. To assess this influence, quality evaluation studies are often carried out using conventional displays and generating predefined navigation trajectories mimicking the possible move… ▽ More

    Submitted 10 October, 2018; originally announced October 2018.

    Comments: 11 pages, 7 figures

  33. Data Analysis in Multimedia Quality Assessment: Revisiting the Statistical Tests

    Authors: Manish Narwaria, Lukas Krasula, Patrick Le Callet

    Abstract: Assessment of multimedia quality relies heavily on subjective assessment, and is typically done by human subjects in the form of preferences or continuous ratings. Such data is crucial for analysis of different multimedia processing algorithms as well as validation of objective (computational) methods for the said purpose. To that end, statistical testing provides a theoretical framework towards d… ▽ More

    Submitted 1 June, 2017; originally announced June 2017.

    Journal ref: IEEE Transactions on Multimedia 2018

  34. arXiv:1612.07872  [pdf, other

    cs.MM

    Object Shape Approximation & Contour Adaptive Depth Image Coding for Virtual View Synthesis

    Authors: Yuan Yuan, Gene Cheung, Patrick Le Callet, Pascal Frossard, Hong Vicky Zhao

    Abstract: A depth image provides partial geometric information of a 3D scene, namely the shapes of physical objects as observed from a particular viewpoint. This information is important when synthesizing images of different virtual camera viewpoints via depth-image-based rendering (DIBR). It has been shown that depth images can be efficiently coded using contour-adaptive codecs that preserve edge sharpness… ▽ More

    Submitted 22 December, 2016; originally announced December 2016.

    Comments: 13 pages, submitted to IEEE Transactions on Circuits and Systems for Video Technology