
Showing 1–11 of 11 results for author: Doumanoglou, A

  1. arXiv:2303.10523  [pdf, other]

    cs.CV cs.AI cs.LG

    Unsupervised Interpretable Basis Extraction for Concept-Based Visual Explanations

    Authors: Alexandros Doumanoglou, Stylianos Asteriadis, Dimitrios Zarpalas

    Abstract: An important line of research attempts to explain CNN image classifier predictions and intermediate layer representations in terms of human understandable concepts. In this work, we expand on previous works in the literature that use annotated concept datasets to extract interpretable feature space directions and propose an unsupervised post-hoc method to extract a disentangling interpretable basi…

    Submitted 25 September, 2023; v1 submitted 18 March, 2023; originally announced March 2023.

    Comments: 15 pages, Accepted in IEEE Transactions on Artificial Intelligence, Special Issue on New Developments in Explainable and Interpretable AI

  2. arXiv:2211.08290  [pdf, other]

    cs.CV

    Cross-Stitched Multi-task Dual Recursive Networks for Unified Single Image Deraining and Desnowing

    Authors: Sotiris Karavarsamis, Alexandros Doumanoglou, Konstantinos Konstantoudakis, Dimitrios Zarpalas

    Abstract: We present the Cross-stitched Multi-task Unified Dual Recursive Network (CMUDRN) model targeting the task of unified deraining and desnowing in a multi-task learning setting. This unified model borrows from the basic Dual Recursive Network (DRN) architecture developed by Cai et al. The proposed model makes use of cross-stitch units that enable multi-task learning across two separate DRN models, ea…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 6 pages, 4 figures, conference

  3. arXiv:2112.05396  [pdf, other]

    cs.CV

    Towards Full-to-Empty Room Generation with Structure-Aware Feature Encoding and Soft Semantic Region-Adaptive Normalization

    Authors: Vasileios Gkitsas, Nikolaos Zioulis, Vladimiros Sterzentsenko, Alexandros Doumanoglou, Dimitrios Zarpalas

    Abstract: The task of transforming a furnished room image into a background-only is extremely challenging since it requires making large changes regarding the scene context while still preserving the overall layout and style. In order to acquire photo-realistic and structural consistent background, existing deep learning methods either employ image inpainting approaches or incorporate the learning of the sc…

    Submitted 10 December, 2021; originally announced December 2021.

  4. Serverless Streaming for Emerging Media: Towards 5G Network-Driven Cost Optimization

    Authors: Konstantinos Konstantoudakis, David Breitgand, Alexandros Doumanoglou, Nikolaos Zioulis, Avi Weit, Kyriaki Christaki, Petros Drakoulis, Emmanouil Christakis, Dimitrios Zarpalas, Petros Daras

    Abstract: Immersive 3D media is an emerging type of media that captures, encodes and reconstructs the 3D appearance of people and objects, with applications in tele-presence, teleconference, entertainment, gaming and other fields. In this paper, we discuss a novel concept of live 3D immersive media streaming in a serverless setting. In particular, we present a novel network-centric adaptive streaming framew…

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 32 pages, 12 figures, preprint: to appear in "Multimedia Tools and Applications: 5G Multimedia Communications" special issue

  5. arXiv:2003.10176  [pdf, other]

    cs.CV cs.LG

    Deep Soft Procrustes for Markerless Volumetric Sensor Alignment

    Authors: Vladimiros Sterzentsenko, Alexandros Doumanoglou, Spyridon Thermos, Nikolaos Zioulis, Dimitrios Zarpalas, Petros Daras

    Abstract: With the advent of consumer grade depth sensors, low-cost volumetric capture systems are easier to deploy. Their wider adoption though depends on their usability and by extension on the practicality of spatially aligning multiple sensors. Most existing alignment approaches employ visual patterns, e.g. checkerboards, or markers and require high user involvement and technical knowledge. More user-fr…

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: 10 pages, 7 figures, to appear in IEEE VR 2020. Code and models at https://vcl3d.github.io/StructureNet/

  6. A Low-Cost, Flexible and Portable Volumetric Capturing System

    Authors: Vladimiros Sterzentsenko, Antonis Karakottas, Alexandros Papachristou, Nikolaos Zioulis, Alexandros Doumanoglou, Dimitrios Zarpalas, Petros Daras

    Abstract: Multi-view capture systems are complex systems to engineer. They require technical knowledge to install and intricate processes to setup related mainly to the sensors' spatial alignment (i.e. external calibration). However, with the ongoing developments in new production methods, we are now at a position where the production of high quality realistic 3D assets is possible even with commodity senso…

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: System available at https://github.com/VCL3D/VolumetricCapture

  7. arXiv:1909.01193  [pdf, other]

    cs.CV

    Self-Supervised Deep Depth Denoising

    Authors: Vladimiros Sterzentsenko, Leonidas Saroglou, Anargyros Chatzitofis, Spyridon Thermos, Nikolaos Zioulis, Alexandros Doumanoglou, Dimitrios Zarpalas, Petros Daras

    Abstract: Depth perception is considered an invaluable source of information for various vision tasks. However, depth maps acquired using consumer-level sensors still suffer from non-negligible noise. This fact has recently motivated researchers to exploit traditional filters, as well as the deep learning paradigm, in order to suppress the aforementioned non-uniform noise, while preserving geometric details…

    Submitted 4 September, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: 18 pages, 15 figures, ICCV 2019

  8. arXiv:1708.07038  [pdf, other]

    cs.CV cs.AI

    Non-linear Convolution Filters for CNN-based Learning

    Authors: Georgios Zoumpourlis, Alexandros Doumanoglou, Nicholas Vretos, Petros Daras

    Abstract: During the last years, Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance in image classification. Their architectures have largely drawn inspiration by models of the primate visual system. However, while recent research results of neuroscience prove the existence of non-linear operations in the response of complex visual cells, little effort has been devoted to extend…

    Submitted 23 August, 2017; originally announced August 2017.

    Comments: 9 pages, 5 figures, code link, ICCV 2017

  9. arXiv:1607.02257  [pdf, other]

    cs.CV

    Siamese Regression Networks with Efficient mid-level Feature Extraction for 3D Object Pose Estimation

    Authors: Andreas Doumanoglou, Vassileios Balntas, Rigas Kouskouridas, Tae-Kyun Kim

    Abstract: In this paper we tackle the problem of estimating the 3D pose of object instances, using convolutional neural networks. State of the art methods usually solve the challenging problem of regression in angle space indirectly, focusing on learning discriminative features that are later fed into a separate architecture for 3D pose estimation. In contrast, we propose an end-to-end learning framework fo…

    Submitted 8 July, 2016; originally announced July 2016.

    Comments: 9 pages, paper submitted to NIPS 2016, project page: http://www.iis.ee.ic.ac.uk/rkouskou/research/SRN.html

  10. arXiv:1602.01464  [pdf, other]

    cs.CV

    Latent-Class Hough Forests for 6 DoF Object Pose Estimation

    Authors: Rigas Kouskouridas, Alykhan Tejani, Andreas Doumanoglou, Danhang Tang, Tae-Kyun Kim

    Abstract: In this paper we present Latent-Class Hough Forests, a method for object detection and 6 DoF pose estimation in heavily cluttered and occluded scenarios. We adapt a state of the art template matching feature into a scale-invariant patch descriptor and integrate it into a regression forest using a novel template-based split function. We train with positive samples only and we treat class distributi…

    Submitted 3 February, 2016; originally announced February 2016.

    Comments: PAMI submission, project page: http://www.iis.ee.ic.ac.uk/rkouskou/research/LCHF.html

  11. arXiv:1512.07506  [pdf, other]

    cs.CV

    Recovering 6D Object Pose and Predicting Next-Best-View in the Crowd

    Authors: Andreas Doumanoglou, Rigas Kouskouridas, Sotiris Malassiotis, Tae-Kyun Kim

    Abstract: Object detection and 6D pose estimation in the crowd (scenes with multiple object instances, severe foreground occlusions and background distractors), has become an important problem in many rapidly evolving technological areas such as robotics and augmented reality. Single shot-based 6D pose estimators with manually designed features are still unable to tackle the above challenges, motivating the…

    Submitted 19 April, 2016; v1 submitted 23 December, 2015; originally announced December 2015.

    Comments: CVPR 2016 accepted paper, project page: http://www.iis.ee.ic.ac.uk/rkouskou/6D_NBV.html