Showing 1–15 of 15 results for author: Thermos, S

  1. arXiv:2406.01136  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    Towards Practical Single-shot Motion Synthesis

    Authors: Konstantinos Roditakis, Spyridon Thermos, Nikolaos Zioulis

    Abstract: Despite the recent advances in the so-called "cold start" generation from text prompts, their needs in data and computing resources, as well as the ambiguities around intellectual property and privacy concerns pose certain counterarguments for their utility. An interesting and relatively unexplored alternative has been the introduction of unconditional synthesis from a single sample, which has led…

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: CVPR 2024, AI for 3D Generation Workshop, Project page: https://moverseai.github.io/single-shot

  2. arXiv:2309.14330  [pdf, other]

    cs.CV cs.GR cs.LG

    Noise-in, Bias-out: Balanced and Real-time MoCap Solving

    Authors: Georgios Albanis, Nikolaos Zioulis, Spyridon Thermos, Anargyros Chatzitofis, Kostas Kolomvatsos

    Abstract: Real-time optical Motion Capture (MoCap) systems have not benefited from the advances in modern data-driven modeling. In this work we apply machine learning to solve noisy unstructured marker estimates in real-time and deliver robust marker-based MoCap even when using sparse affordable sensors. To achieve this we focus on a number of challenges related to model training, namely the sourcing of tra…

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: Project page: https://moverseai.github.io/noise-tail

  3. arXiv:2306.07783  [pdf, other]

    cs.CV cs.LG

    Compositionally Equivariant Representation Learning

    Authors: Xiao Liu, Pedro Sanchez, Spyridon Thermos, Alison Q. O'Neil, Sotirios A. Tsaftaris

    Abstract: Deep learning models often need sufficient supervision (i.e. labelled data) in order to be trained effectively. By contrast, humans can swiftly learn to identify important anatomy in medical images like MRI and CT scans, with minimal guidance. This recognition capability easily generalises to new images from different medical facilities and to new tasks in different settings. This rapid and genera…

    Submitted 17 June, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Submitted. 10 pages. arXiv admin note: text overlap with arXiv:2206.14538

  4. arXiv:2208.03563  [pdf, other]

    cs.CV

    HSIC-InfoGAN: Learning Unsupervised Disentangled Representations by Maximising Approximated Mutual Information

    Authors: Xiao Liu, Spyridon Thermos, Pedro Sanchez, Alison Q. O'Neil, Sotirios A. Tsaftaris

    Abstract: Learning disentangled representations requires either supervision or the introduction of specific model designs and learning constraints as biases. InfoGAN is a popular disentanglement framework that learns unsupervised disentangled representations by maximising the mutual information between latent representations and their corresponding generated images. Maximisation of mutual information is ach…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: MICCAI MAD Workshop 2022

  5. arXiv:2206.14538  [pdf, other]

    cs.CV

    vMFNet: Compositionality Meets Domain-generalised Segmentation

    Authors: Xiao Liu, Spyridon Thermos, Pedro Sanchez, Alison Q. O'Neil, Sotirios A. Tsaftaris

    Abstract: Training medical image segmentation models usually requires a large amount of labeled data. By contrast, humans can quickly learn to accurately recognise anatomy of interest from medical (e.g. MRI and CT) images with some limited guidance. Such recognition ability can easily generalise to new images from different clinical centres. This rapid and generalisable learning ability is mostly due to the…

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Accepted by MICCAI 2022

  6. Learning Disentangled Representations in the Imaging Domain

    Authors: Xiao Liu, Pedro Sanchez, Spyridon Thermos, Alison Q. O'Neil, Sotirios A. Tsaftaris

    Abstract: Disentangled representation learning has been proposed as an approach to learning general representations even in the absence of, or with limited, supervision. A good general representation can be fine-tuned for new target tasks using modest amounts of data, or used directly in unseen domains achieving remarkable performance in the corresponding task. This alleviation of the data and annotation re…

    Submitted 29 July, 2022; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: Accepted by Medical Image Analysis. This paper follows a tutorial style but also surveys a considerable (more than 260 citations) number of works

  7. arXiv:2107.01748  [pdf, other]

    eess.IV cs.CV

    Controllable cardiac synthesis via disentangled anatomy arithmetic

    Authors: Spyridon Thermos, Xiao Liu, Alison O'Neil, Sotirios A. Tsaftaris

    Abstract: Acquiring annotated data at scale with rare diseases or conditions remains a challenge. It would be extremely useful to have a method that controllably synthesizes images that can correct such underrepresentation. Assuming a proper latent representation, the idea of a "latent vector arithmetic" could offer the means of achieving such synthesis. A proper representation must encode the fidelity of t…

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: Accepted for publication in MICCAI 2021

  8. arXiv:2106.13292  [pdf, other]

    cs.CV

    Semi-supervised Meta-learning with Disentanglement for Domain-generalised Medical Image Segmentation

    Authors: Xiao Liu, Spyridon Thermos, Alison O'Neil, Sotirios A. Tsaftaris

    Abstract: Generalising deep models to new data from new centres (termed here domains) remains a challenge. This is largely attributed to shifts in data statistics (domain shifts) between source and unseen domains. Recently, gradient-based meta-learning approaches where the training data are split into meta-train and meta-test sets to simulate and handle the domain shifts during training have shown improved…

    Submitted 1 October, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: MICCAI 2021 Oral

  9. GaNDLF: A Generally Nuanced Deep Learning Framework for Scalable End-to-End Clinical Workflows in Medical Imaging

    Authors: Sarthak Pati, Siddhesh P. Thakur, İbrahim Ethem Hamamcı, Ujjwal Baid, Bhakti Baheti, Megh Bhalerao, Orhun Güley, Sofia Mouchtaris, David Lang, Spyridon Thermos, Karol Gotkowski, Camila González, Caleb Grenko, Alexander Getka, Brandon Edwards, Micah Sheller, Junwen Wu, Deepthi Karkada, Ravi Panchumarthy, Vinayak Ahluwalia, Chunrui Zou, Vishnu Bashyam, Yuemeng Li, Babak Haghighi, Rhea Chitalia, et al. (17 additional authors not shown)

    Abstract: Deep Learning (DL) has the potential to optimize machine learning in both the scientific and clinical communities. However, greater expertise is required to develop DL algorithms, and the variability of implementations hinders their reproducibility, translation, and deployment. Here we present the community-driven Generally Nuanced Deep Learning Framework (GaNDLF), with the goal of lowering these…

    Submitted 16 May, 2023; v1 submitted 25 February, 2021; originally announced March 2021.

    Comments: Deep Learning, Framework, Segmentation, Regression, Classification, Cross-validation, Data augmentation, Deployment, Clinical, Workflows

    Journal ref: Commun Eng 2, 23 (2023)

  10. arXiv:2008.12378  [pdf, other]

    cs.CV

    Measuring the Biases and Effectiveness of Content-Style Disentanglement

    Authors: Xiao Liu, Spyridon Thermos, Gabriele Valvano, Agisilaos Chartsias, Alison O'Neil, Sotirios A. Tsaftaris

    Abstract: A recent spate of state-of-the-art semi- and un-supervised solutions disentangle and encode image "content" into a spatial tensor and image appearance or "style" into a vector, to achieve good performance in spatially equivariant tasks (e.g. image-to-image translation). To achieve this, they employ different model design, learning objective, and data biases. While considerable effort has been made…

    Submitted 15 September, 2021; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: 28 pages, 10 figures

  11. arXiv:2008.11514  [pdf, other]

    eess.IV cs.CV

    Disentangled Representations for Domain-generalized Cardiac Segmentation

    Authors: Xiao Liu, Spyridon Thermos, Agisilaos Chartsias, Alison O'Neil, Sotirios A. Tsaftaris

    Abstract: Robust cardiac image segmentation is still an open challenge due to the inability of the existing methods to achieve satisfactory performance on unseen data of different domains. Since the acquisition and annotation of medical data are costly and time-consuming, recent work focuses on domain adaptation and generalization to bridge the gap between data from different populations and scanners. In th…

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: Accepted by STACOM 2020

  12. A Deep Learning Approach to Object Affordance Segmentation

    Authors: Spyridon Thermos, Petros Daras, Gerasimos Potamianos

    Abstract: Learning to understand and infer object functionalities is an important step towards robust visual intelligence. Significant research efforts have recently focused on segmenting the object parts that enable specific types of human-object interaction, the so-called "object affordances". However, most works treat it as a static semantic segmentation problem, focusing solely on object appearance and…

    Submitted 18 April, 2020; originally announced April 2020.

    Comments: 5 pages, 4 figures, ICASSP 2020

  13. arXiv:2003.10176  [pdf, other]

    cs.CV cs.LG

    Deep Soft Procrustes for Markerless Volumetric Sensor Alignment

    Authors: Vladimiros Sterzentsenko, Alexandros Doumanoglou, Spyridon Thermos, Nikolaos Zioulis, Dimitrios Zarpalas, Petros Daras

    Abstract: With the advent of consumer grade depth sensors, low-cost volumetric capture systems are easier to deploy. Their wider adoption though depends on their usability and by extension on the practicality of spatially aligning multiple sensors. Most existing alignment approaches employ visual patterns, e.g. checkerboards, or markers and require high user involvement and technical knowledge. More user-fr…

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: 10 pages, 7 figures, to appear in IEEE VR 2020. Code and models at https://vcl3d.github.io/StructureNet/

  14. arXiv:1909.01193  [pdf, other]

    cs.CV

    Self-Supervised Deep Depth Denoising

    Authors: Vladimiros Sterzentsenko, Leonidas Saroglou, Anargyros Chatzitofis, Spyridon Thermos, Nikolaos Zioulis, Alexandros Doumanoglou, Dimitrios Zarpalas, Petros Daras

    Abstract: Depth perception is considered an invaluable source of information for various vision tasks. However, depth maps acquired using consumer-level sensors still suffer from non-negligible noise. This fact has recently motivated researchers to exploit traditional filters, as well as the deep learning paradigm, in order to suppress the aforementioned non-uniform noise, while preserving geometric details…

    Submitted 4 September, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: 18 pages, 15 figures, ICCV 2019

  15. arXiv:1704.02787  [pdf, other]

    cs.CV

    Deep Affordance-grounded Sensorimotor Object Recognition

    Authors: Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos

    Abstract: It is well-established by cognitive neuroscience that human perception of objects constitutes a complex process, where object appearance information is combined with evidence about the so-called object "affordances", namely the types of actions that humans typically perform when interacting with them. This fact has recently motivated the "sensorimotor" approach to the challenging task of automatic…

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: 9 pages, 7 figures, dataset link included, accepted to CVPR 2017