Skip to main content

Showing 1–25 of 25 results for author: Georgescu, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09326  [pdf, other

    cs.CV cs.AI cs.LG

    Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers

    Authors: Diana-Nicoleta Grigore, Mariana-Iuliana Georgescu, Jon Alvarez Justo, Tor Johansen, Andreea Iuliana Ionescu, Radu Tudor Ionescu

    Abstract: Few-shot knowledge distillation recently emerged as a viable approach to harness the knowledge of large-scale pre-trained models, using limited data and computational resources. In this paper, we propose a novel few-shot feature distillation approach for vision transformers. Our approach is based on two key steps. Leveraging the fact that vision transformers have a consistent depth-wise structure,… ▽ More

    Submitted 17 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

  2. arXiv:2310.16210  [pdf, other

    cs.CV eess.IV

    Sea-Land-Cloud Segmentation in Satellite Hyperspectral Imagery by Deep Learning

    Authors: Jon Alvarez Justo, Joseph L. Garrett, Mariana-Iuliana Georgescu, Jesus Gonzalez-Llorente, Radu Tudor Ionescu, Tor Arne Johansen

    Abstract: Satellites are increasingly adopting on-board AI for enhanced autonomy through in-orbit inference. In this context, the use of deep learning (DL) techniques for segmentation in hyperspectral (HS) satellite imagery offers advantages for remote sensing applications, and therefore, we train 16 different models, whose codes are made available through our study, which we consider to be relevant for on-… ▽ More

    Submitted 28 December, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Remote Sensing, Satellite Imagery, Hyperspectral Imaging, Deep Learning, Segmentation

  3. arXiv:2309.15238  [pdf, other

    cs.CL cs.AI cs.LG

    Learning Using Generated Privileged Information by Text-to-Image Diffusion Models

    Authors: Rafael-Edy Menadil, Mariana-Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: Learning Using Privileged Information is a particular type of knowledge distillation where the teacher model benefits from an additional data representation during training, called privileged information, improving the student model, which does not see the extra representation. However, privileged information is rarely available in practice. To this end, we propose a text classification framework… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  4. arXiv:2307.07534  [pdf, other

    eess.IV cs.CV

    Masked Autoencoders for Unsupervised Anomaly Detection in Medical Images

    Authors: Mariana-Iuliana Georgescu

    Abstract: Pathological anomalies exhibit diverse appearances in medical imaging, making it difficult to collect and annotate a representative amount of data required to train deep learning models in a supervised setting. Therefore, in this work, we tackle anomaly detection in medical images training our framework using only healthy samples. We propose to use the Masked Autoencoder model to learn the structu… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  5. arXiv:2212.05922  [pdf, other

    cs.CV cs.SD

    Audiovisual Masked Autoencoders

    Authors: Mariana-Iuliana Georgescu, Eduardo Fonseca, Radu Tudor Ionescu, Mario Lucic, Cordelia Schmid, Anurag Arnab

    Abstract: Can we leverage the audiovisual information already present in video to improve self-supervised representation learning? To answer this question, we study various pretraining architectures and objectives within the masked autoencoding framework, motivated by the success of similar methods in natural language and image understanding. We show that we can achieve significant improvements on audiovisu… ▽ More

    Submitted 4 January, 2024; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: ICCV 2023

  6. arXiv:2210.12388  [pdf, other

    eess.IV cs.CV cs.LG

    Diversity-Promoting Ensemble for Medical Image Segmentation

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron

    Abstract: Medical image segmentation is an actively studied task in medical imaging, where the precision of the annotations is of utter importance towards accurate diagnosis and treatment. In recent years, the task has been approached with various deep learning systems, among the most popular models being U-Net. In this work, we propose a novel strategy to generate ensembles of different architectures for m… ▽ More

    Submitted 21 December, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: Accepted at SAC 2023

  7. arXiv:2207.08003  [pdf, other

    cs.CV cs.LG

    SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection

    Authors: Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: A self-supervised multi-task learning (SSMTL) framework for video anomaly detection was recently introduced in literature. Due to its highly accurate results, the method attracted the attention of many researchers. In this work, we revisit the self-supervised multi-task learning framework, proposing several updates to the original method. First, we study various detection methods, e.g. based on de… ▽ More

    Submitted 12 February, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: Accepted in Computer Vision and Image Understanding

  8. arXiv:2204.04218  [pdf, other

    eess.IV cs.CV cs.LG

    Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae-Catalin Ristea, Nicolae Verga, Fahad Shahbaz Khan

    Abstract: Super-resolving medical images can help physicians in providing more accurate diagnostics. In many situations, computed tomography (CT) or magnetic resonance imaging (MRI) techniques capture several scans (modes) during a single investigation, which can jointly be used (in a multimodal fashion) to further boost the quality of super-resolution results. To this end, we propose a novel multimodal mul… ▽ More

    Submitted 12 October, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted at WACV 2023 (main paper + supplementary)

  9. arXiv:2202.05152  [pdf, other

    cs.CV cs.LG

    Feature-level augmentation to improve robustness of deep neural networks to affine transformations

    Authors: Adrian Sandru, Mariana-Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: Recent studies revealed that convolutional neural networks do not generalize well to small image transformations, e.g. rotations by a few degrees or translations of a few pixels. To improve the robustness to such transformations, we propose to introduce data augmentation at intermediate layers of the neural architecture, in addition to the common data augmentation applied on the input images. By i… ▽ More

    Submitted 20 August, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: Accepted at ECCV Workshop on Adversarial Robustness in the Real World (AROW 2022)

  10. arXiv:2111.10561  [pdf, other

    cs.CV cs.LG

    Teacher-Student Training and Triplet Loss to Reduce the Effect of Drastic Face Occlusion

    Authors: Mariana-Iuliana Georgescu, Georgian Duta, Radu Tudor Ionescu

    Abstract: We study a series of recognition tasks in two realistic scenarios requiring the analysis of faces under strong occlusion. On the one hand, we aim to recognize facial expressions of people wearing Virtual Reality (VR) headsets. On the other hand, we aim to estimate the age and identify the gender of people wearing surgical masks. For all these tasks, the common ground is that half of the face is oc… ▽ More

    Submitted 20 November, 2021; originally announced November 2021.

    Comments: Accepted in Machine Vision and Applications. arXiv admin note: text overlap with arXiv:2008.01003

  11. arXiv:2111.08644  [pdf, other

    cs.CV cs.LG

    UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection

    Authors: Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: Detecting abnormal events in video is commonly framed as a one-class classification task, where training videos contain only normal events, while test videos encompass both normal and abnormal events. In this scenario, anomaly detection is an open-set problem. However, some studies assimilate anomaly detection to action recognition. This is a closed-set scenario that fails to test the capability o… ▽ More

    Submitted 7 April, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted at CVPR 2022. Paper + supplementary (15 pages, 9 figures)

  12. CyTran: A Cycle-Consistent Transformer with Multi-Level Consistency for Non-Contrast to Contrast CT Translation

    Authors: Nicolae-Catalin Ristea, Andreea-Iuliana Miron, Olivian Savencu, Mariana-Iuliana Georgescu, Nicolae Verga, Fahad Shahbaz Khan, Radu Tudor Ionescu

    Abstract: We propose a novel approach to translate unpaired contrast computed tomography (CT) scans to non-contrast CT scans and the other way around. Solving this task has two important applications: (i) to automatically generate contrast CT scans for patients for whom injecting contrast substance is not an option, and (ii) to enhance the alignment between contrast and non-contrast CT by reducing the diffe… ▽ More

    Submitted 5 April, 2023; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in Neurocomputing

  13. arXiv:2109.01745  [pdf, other

    cs.CV cs.LG

    A realistic approach to generate masked faces applied on two novel masked face recognition data sets

    Authors: Tudor Mare, Georgian Duta, Mariana-Iuliana Georgescu, Adrian Sandru, Bogdan Alexe, Marius Popescu, Radu Tudor Ionescu

    Abstract: The COVID-19 pandemic raises the problem of adapting face recognition systems to the new reality, where people may wear surgical masks to cover their noses and mouths. Traditional data sets (e.g., CelebA, CASIA-WebFace) used for training these systems were released before the pandemic, so they now seem unsuited due to the lack of examples of people wearing masks. We propose a method for enhancing… ▽ More

    Submitted 25 October, 2021; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: Accepted at NeurIPS 2021

  14. arXiv:2108.07387  [pdf, other

    cs.CV cs.LG

    Contextual Convolutional Neural Networks

    Authors: Ionut Cosmin Duta, Mariana Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: We propose contextual convolution (CoConv) for visual recognition. CoConv is a direct replacement of the standard convolution, which is the core component of convolutional neural networks. CoConv is implicitly equipped with the capability of incorporating contextual information while maintaining a similar number of parameters and computational cost compared to the standard convolution. CoConv is i… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: Accepted at ICCV Workshop on Neural Architectures (NeurArch 2021)

  15. arXiv:2011.07491  [pdf, other

    cs.CV cs.LG eess.IV

    Anomaly Detection in Video via Self-Supervised and Multi-Task Learning

    Authors: Mariana-Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, Mubarak Shah

    Abstract: Anomaly detection in video is a challenging computer vision problem. Due to the lack of anomalous events at training time, anomaly detection requires the design of learning methods without full supervision. In this paper, we approach anomalous event detection in video through self-supervised and multi-task learning at the object level. We first utilize a pre-trained detector to detect objects. The… ▽ More

    Submitted 10 September, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

    Comments: Accepted at CVPR 2021. Main paper and supplementary are both included

  16. arXiv:2009.12339  [pdf, other

    cs.CV

    SuPEr-SAM: Using the Supervision Signal from a Pose Estimator to Train a Spatial Attention Module for Personal Protective Equipment Recognition

    Authors: Adrian Sandru, Georgian-Emilian Duta, Mariana-Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: We propose a deep learning method to automatically detect personal protective equipment (PPE), such as helmets, surgical masks, reflective vests, boots and so on, in images of people. Typical approaches for PPE detection based on deep learning are (i) to train an object detector for items such as those listed above or (ii) to train a person detector and a classifier that takes the bounding boxes p… ▽ More

    Submitted 2 November, 2020; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: Accepted at WACV 2021

  17. A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, Mubarak Shah

    Abstract: Abnormal event detection in video is a complex computer vision problem that has attracted significant attention in recent years. The complexity of the task arises from the commonly-adopted definition of an abnormal event, that is, a rarely occurring event that typically depends on the surrounding context. Following the standard formulation of abnormal event detection as outlier detection, we propo… ▽ More

    Submitted 6 April, 2023; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence

  18. arXiv:2008.01003  [pdf, other

    cs.CV cs.LG

    Teacher-Student Training and Triplet Loss for Facial Expression Recognition under Occlusion

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: In this paper, we study the task of facial expression recognition under strong occlusion. We are particularly interested in cases where 50% of the face is occluded, e.g. when the subject wears a Virtual Reality (VR) headset. While previous studies show that pre-training convolutional neural networks (CNNs) on fully-visible (non-occluded) faces improves the accuracy, we propose to employ knowledge… ▽ More

    Submitted 25 February, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted at ICPR 2020

  19. arXiv:2003.03229  [pdf, other

    cs.NE cs.CV cs.LG stat.ML

    Non-linear Neurons with Human-like Apical Dendrite Activations

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Nicolae-Catalin Ristea, Nicu Sebe

    Abstract: In order to classify linearly non-separable data, neurons are typically organized into multi-layer neural networks that are equipped with at least one hidden layer. Inspired by some recent discoveries in neuroscience, we propose a new model of artificial neuron along with a novel activation function enabling the learning of nonlinear decision boundaries using a single neuron. We show that a standa… ▽ More

    Submitted 10 August, 2023; v1 submitted 2 February, 2020; originally announced March 2020.

    Comments: Accepted for publication in Applied Intelligence

  20. arXiv:2001.01330  [pdf, other

    cs.CV cs.LG eess.IV

    Convolutional Neural Networks with Intermediate Loss for 3D Super-Resolution of CT and MRI Scans

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Nicolae Verga

    Abstract: CT scanners that are commonly-used in hospitals nowadays produce low-resolution images, up to 512 pixels in size. One pixel in the image corresponds to a one millimeter piece of tissue. In order to accurately segment tumors and make treatment plans, doctors need CT scans of higher resolution. The same problem appears in MRI. In this paper, we propose an approach for the single-image super-resoluti… ▽ More

    Submitted 6 April, 2023; v1 submitted 5 January, 2020; originally announced January 2020.

    Comments: Accepted in IEEE Access

  21. arXiv:1911.04852  [pdf, other

    cs.CV cs.LG

    Recognizing Facial Expressions of Occluded Faces using Convolutional Neural Networks

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: In this paper, we present an approach based on convolutional neural networks (CNNs) for facial expression recognition in a difficult setting with severe occlusions. More specifically, our task is to recognize the facial expression of a person wearing a virtual reality (VR) headset which essentially occludes the upper part of the face. In order to accurately train neural networks for this setting,… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.

    Comments: Accepted at ICONIP 2019

  22. arXiv:1905.00773  [pdf, other

    cs.CV cs.LG

    Clustering Images by Unmasking - A New Baseline

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: We propose a novel agglomerative clustering method based on unmasking, a technique that was previously used for authorship verification of text documents and for abnormal event detection in videos. In order to join two clusters, we alternate between (i) training a binary classifier to distinguish between the samples from one cluster and the samples from the other cluster, and (ii) removing at each… ▽ More

    Submitted 2 May, 2019; originally announced May 2019.

    Comments: Accepted at ICIP 2019

  23. arXiv:1812.04960  [pdf, other

    cs.CV cs.LG

    Object-centric Auto-encoders and Dummy Anomalies for Abnormal Event Detection in Video

    Authors: Radu Tudor Ionescu, Fahad Shahbaz Khan, Mariana-Iuliana Georgescu, Ling Shao

    Abstract: Abnormal event detection in video is a challenging vision problem. Most existing approaches formulate abnormal event detection as an outlier detection task, due to the scarcity of anomalous data during training. Because of the lack of prior information regarding abnormal events, these methods are not fully-equipped to differentiate between normal and abnormal events. In this work, we formalize abn… ▽ More

    Submitted 7 April, 2019; v1 submitted 11 December, 2018; originally announced December 2018.

    Comments: Accepted at CVPR 2019

  24. Local Learning with Deep and Handcrafted Features for Facial Expression Recognition

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Marius Popescu

    Abstract: We present an approach that combines automatic features learned by convolutional neural networks (CNN) and handcrafted features computed by the bag-of-visual-words (BOVW) model in order to achieve state-of-the-art results in facial expression recognition. To obtain automatic features, we experiment with multiple CNN architectures, pre-trained models and training procedures, e.g. Dense-Sparse-Dense… ▽ More

    Submitted 12 March, 2020; v1 submitted 29 April, 2018; originally announced April 2018.

    Comments: Accepted in IEEE Access

    Journal ref: in IEEE Access, vol. 7, pp. 64827-64836, 2019

  25. arXiv:1310.1540  [pdf, other

    cs.CR cs.HC

    Three-Way Dissection of a Game-CAPTCHA: Automated Attacks, Relay Attacks, and Usability

    Authors: Manar Mohamed, Niharika Sachdeva, Michael Georgescu, Song Gao, Nitesh Saxena, Chengcui Zhang, Ponnurangam Kumaraguru, Paul C. van Oorschot, Wei-Bang Chen

    Abstract: Existing captcha solutions on the Internet are a major source of user frustration. Game captchas are an interesting and, to date, little-studied approach claiming to make captcha solving a fun activity for the users. One broad form of such captchas -- called Dynamic Cognitive Game (DCG) captchas -- challenge the user to perform a game-like cognitive task interacting with a series of dynamic images… ▽ More

    Submitted 6 October, 2013; originally announced October 2013.

    Comments: 16 pages, 10 figures