Skip to main content

Showing 1–50 of 106 results for author: Ionescu, R

.
  1. arXiv:2407.04541  [pdf, ps, other

    cs.CL cs.AI cs.LG

    PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Posts

    Authors: Ana-Cristina Rogoz, Maria Ilinca Nechita, Radu Tudor Ionescu

    Abstract: We introduce PoPreRo, the first dataset for Popularity Prediction of Romanian posts collected from Reddit. The PoPreRo dataset includes a varied compilation of post samples from five distinct subreddits of Romania, totaling 28,107 data samples. Along with our novel dataset, we introduce a set of competitive models to be used as baselines for future research. Interestingly, the top-scoring model ac… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Accepted at ICPR 2024

  2. arXiv:2406.04746  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction

    Authors: Eduard Poesina, Adriana Valentina Costache, Adrian-Gabriel Chifu, Josiane Mothe, Radu Tudor Ionescu

    Abstract: Text-to-image generation has recently emerged as a viable alternative to text-to-image retrieval, due to the visually impressive results of generative diffusion models. Although query performance prediction is an active research topic in information retrieval, to the best of our knowledge, there is no prior study that analyzes the difficulty of queries (prompts) in text-to-image generation, based… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2405.13637  [pdf, other

    cs.CV cs.AI cs.LG

    Curriculum Direct Preference Optimization for Diffusion and Consistency Models

    Authors: Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Nicu Sebe, Mubarak Shah

    Abstract: Direct Preference Optimization (DPO) has been proposed as an effective and efficient alternative to reinforcement learning from human feedback (RLHF). In this paper, we propose a novel and enhanced version of DPO based on curriculum learning for text-to-image generation. Our method is divided into two training stages. First, a ranking of the examples generated for each prompt is obtained by employ… ▽ More

    Submitted 24 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2405.11877  [pdf, other

    cs.CL cs.AI cs.LG

    A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus

    Authors: Eduard Poesina, Cornelia Caragea, Radu Tudor Ionescu

    Abstract: Natural language inference (NLI), the task of recognizing the entailment relationship in sentence pairs, is an actively studied topic serving as a proxy for natural language understanding. Despite the relevance of the task in building conversational agents and improving text classification, machine translation and other NLP tasks, to the best of our knowledge, there is no publicly available NLI co… ▽ More

    Submitted 22 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: Accepted at ACL 2024 (Main)

  5. arXiv:2404.13343  [pdf, other

    cs.CL cs.AI cs.LG

    UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions

    Authors: Ana-Cristina Rogoz, Radu Tudor Ionescu

    Abstract: This work explores a novel data augmentation method based on Large Language Models (LLMs) for predicting item difficulty and response time of retired USMLE Multiple-Choice Questions (MCQs) in the BEA 2024 Shared Task. Our approach is based on augmenting the dataset with answers from zero-shot LLMs (Falcon, Meditron, Mistral) and employing transformer-based models based on six alternative feature c… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: Accepted at BEA 2024 (NAACL Workshop)

  6. arXiv:2404.09326  [pdf, other

    cs.CV cs.AI cs.LG

    Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers

    Authors: Diana-Nicoleta Grigore, Mariana-Iuliana Georgescu, Jon Alvarez Justo, Tor Johansen, Andreea Iuliana Ionescu, Radu Tudor Ionescu

    Abstract: Few-shot knowledge distillation recently emerged as a viable approach to harness the knowledge of large-scale pre-trained models, using limited data and computational resources. In this paper, we propose a novel few-shot feature distillation approach for vision transformers. Our approach is based on two key steps. Leveraging the fact that vision transformers have a consistent depth-wise structure,… ▽ More

    Submitted 17 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

  7. arXiv:2401.07575  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Cascaded Cross-Modal Transformer for Audio-Textual Classification

    Authors: Nicolae-Catalin Ristea, Andrei Anghel, Radu Tudor Ionescu

    Abstract: Speech classification tasks often require powerful language understanding models to grasp useful features, which becomes problematic when limited training data is available. To attain superior classification performance, we propose to harness the inherent value of multimodal representations by transcribing speech using automatic speech recognition (ASR) models and translating the transcripts into… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  8. arXiv:2310.16210  [pdf, other

    cs.CV eess.IV

    Sea-Land-Cloud Segmentation in Satellite Hyperspectral Imagery by Deep Learning

    Authors: Jon Alvarez Justo, Joseph L. Garrett, Mariana-Iuliana Georgescu, Jesus Gonzalez-Llorente, Radu Tudor Ionescu, Tor Arne Johansen

    Abstract: Satellites are increasingly adopting on-board AI for enhanced autonomy through in-orbit inference. In this context, the use of deep learning (DL) techniques for segmentation in hyperspectral (HS) satellite imagery offers advantages for remote sensing applications, and therefore, we train 16 different models, whose codes are made available through our study, which we consider to be relevant for on-… ▽ More

    Submitted 28 December, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Remote Sensing, Satellite Imagery, Hyperspectral Imaging, Deep Learning, Segmentation

  9. arXiv:2310.06540  [pdf, other

    cs.CL cs.AI cs.LG

    A Novel Contrastive Learning Method for Clickbait Detection on RoCliCo: A Romanian Clickbait Corpus of News Articles

    Authors: Daria-Mihaela Broscoteanu, Radu Tudor Ionescu

    Abstract: To increase revenue, news websites often resort to using deceptive news titles, luring users into clicking on the title and reading the full news. Clickbait detection is the task that aims to automatically detect this form of false advertisement and avoid wasting the precious time of online users. Despite the importance of the task, to the best of our knowledge, there is no publicly available clic… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023

  10. arXiv:2310.06476  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Accounting for localized deformation: a simple computation of true stress in micropillar compression experiments

    Authors: Jalal Smiri, Oguz Umut Salman, Matteo Ghidelli, Ioan R. Ionescu

    Abstract: Compression experiments are widely used to study the mechanical properties of materials at micro- and nanoscale. However, the conventional engineering stress measurement method used in these experiments neglects to account for the alterations in the material's shape during loading. This can lead to inaccurate stress values and potentially misleading conclusions about the material's mechanical beha… ▽ More

    Submitted 19 June, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2012.12780

  11. arXiv:2310.00096  [pdf, other

    cs.CV cs.LG

    Towards Few-Call Model Stealing via Active Self-Paced Knowledge Distillation and Diffusion-Based Image Generation

    Authors: Vlad Hondru, Radu Tudor Ionescu

    Abstract: Diffusion models showcased strong capabilities in image synthesis, being used in many computer vision tasks with great success. To this end, we propose to explore a new use case, namely to copy black-box classification models without having access to the original training data, the architecture, and the weights of the model, \ie~the model is only exposed through an inference API. More specifically… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  12. arXiv:2309.15238  [pdf, other

    cs.CL cs.AI cs.LG

    Learning Using Generated Privileged Information by Text-to-Image Diffusion Models

    Authors: Rafael-Edy Menadil, Mariana-Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: Learning Using Privileged Information is a particular type of knowledge distillation where the teacher model benefits from an additional data representation during training, called privileged information, improving the student model, which does not see the extra representation. However, privileged information is rarely available in practice. To this end, we propose a text classification framework… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  13. arXiv:2309.03378  [pdf, other

    cs.CL cs.SD eess.AS

    RoDia: A New Dataset for Romanian Dialect Identification from Speech

    Authors: Codrut Rotaru, Nicolae-Catalin Ristea, Radu Tudor Ionescu

    Abstract: We introduce RoDia, the first dataset for Romanian dialect identification from speech. The RoDia dataset includes a varied compilation of speech samples from five distinct regions of Romania, covering both urban and rural environments, totaling 2 hours of manually annotated speech data. Along with our dataset, we introduce a set of competitive models to be used as baselines for future research. Th… ▽ More

    Submitted 20 March, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted at NAACL 2024

  14. arXiv:2308.16572  [pdf, other

    cs.CV cs.AI cs.LG

    CL-MAE: Curriculum-Learned Masked Autoencoders

    Authors: Neelu Madan, Nicolae-Catalin Ristea, Kamal Nasrollahi, Thomas B. Moeslund, Radu Tudor Ionescu

    Abstract: Masked image modeling has been demonstrated as a powerful pretext task for generating robust representations that can be effectively generalized across multiple downstream tasks. Typically, this approach involves randomly masking patches (tokens) in input images, with the masking strategy remaining unchanged during training. In this paper, we propose a curriculum learning approach that updates the… ▽ More

    Submitted 28 February, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted at WACV 2024

  15. arXiv:2308.13679  [pdf, other

    cs.CV cs.AI

    An Open Hyperspectral Dataset with Sea-Land-Cloud Ground-Truth from the HYPSO-1 Satellite

    Authors: Jon A. Justo, Joseph Garrett, Dennis D. Langer, Marie B. Henriksen, Radu T. Ionescu, Tor A. Johansen

    Abstract: Hyperspectral Imaging, employed in satellites for space remote sensing, like HYPSO-1, faces constraints due to few labeled data sets, affecting the training of AI models demanding these ground-truth annotations. In this work, we introduce The HYPSO-1 Sea-Land-Cloud-Labeled Dataset, an open dataset with 200 diverse hyperspectral images from the HYPSO-1 mission, available in both raw and calibrated… ▽ More

    Submitted 3 September, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Computer Vision, Artificial Intelligence, Remote Sensing, Earth Observation, Hyperspectral Imaging, Classification, Labeled Data

  16. arXiv:2308.04934  [pdf, other

    cs.CV cs.LG

    JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition

    Authors: Lucian Bicsi, Bogdan Alexe, Radu Tudor Ionescu, Marius Leordeanu

    Abstract: We propose JEDI, a multi-dataset semi-supervised learning method, which efficiently combines knowledge from multiple experts, learned on different datasets, to train and improve the performance of individual, per dataset, student models. Our approach achieves this by addressing two important problems in current machine learning research: generalization across datasets and limitations of supervised… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted in ICCV 2023 Workshops

  17. arXiv:2308.01472  [pdf, other

    cs.CV cs.CL cs.LG

    Reverse Stable Diffusion: What prompt was used to generate this image?

    Authors: Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Mubarak Shah

    Abstract: Text-to-image diffusion models such as Stable Diffusion have recently attracted the interest of many researchers, and inverting the diffusion process can play an important role in better understanding the generative process and how to engineer prompts in order to obtain the desired images. To this end, we introduce the new task of predicting the text prompt given an image generated by a generative… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  18. arXiv:2307.15097  [pdf, other

    cs.CL cs.LG cs.MM eess.AS

    Cascaded Cross-Modal Transformer for Request and Complaint Detection

    Authors: Nicolae-Catalin Ristea, Radu Tudor Ionescu

    Abstract: We propose a novel cascaded cross-modal transformer (CCMT) that combines speech and text transcripts to detect customer requests and complaints in phone conversations. Our approach leverages a multimodal paradigm by transcribing the speech using automatic speech recognition (ASR) models and translating the transcripts into different languages. Subsequently, we combine language-specific BERT-based… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted at ACMMM 2023

  19. arXiv:2307.09315  [pdf, other

    hep-ph quant-ph

    The Strong Field QED approach of the vacuum interaction processes at ELI-NP

    Authors: M. Pentia, C. R. Badita, D. Dumitriu, A. R. Ionescu, H. Petrascu

    Abstract: The commissioning of the high power laser facility Extreme Light Infrastructure - Nuclear Physics (ELI-NP) at Bucharest-Magurele (Romania) allows the in-depth study of nonlinear interactions in Strong Field Quantum Electrodynamics (SF-QED). The present paper analyzes the SF-QED processes possible to study at ELI-NP. Carrying out such experiments will allow finding answers to many fundamental QED q… ▽ More

    Submitted 31 August, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: 14 pages, 20 figures

    MSC Class: 81V10; 81T18

  20. arXiv:2306.12041  [pdf, other

    cs.CV cs.LG

    Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors

    Authors: Nicolae-Catalin Ristea, Florinel-Alin Croitoru, Radu Tudor Ionescu, Marius Popescu, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: We propose an efficient abnormal event detection model based on a lightweight masked auto-encoder (AE) applied at the video frame level. The novelty of the proposed model is threefold. First, we introduce an approach to weight tokens based on motion gradients, thus shifting the focus from the static background scene to the foreground objects. Second, we integrate a teacher decoder and a student de… ▽ More

    Submitted 9 March, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: Accepted at CVPR 2024

  21. arXiv:2306.00630  [pdf, other

    cs.CV cs.IR cs.LG

    Class Anchor Margin Loss for Content-Based Image Retrieval

    Authors: Alexandru Ghita, Radu Tudor Ionescu

    Abstract: The performance of neural networks in content-based image retrieval (CBIR) is highly influenced by the chosen loss (objective) function. The majority of objective functions for neural models can be divided into metric learning and statistical learning. Metric learning approaches require a pair mining strategy that often lacks efficiency, while statistical learning approaches are not generating hig… ▽ More

    Submitted 3 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  22. arXiv:2302.10126  [pdf, other

    cs.CV cs.IR

    iQPP: A Benchmark for Image Query Performance Prediction

    Authors: Eduard Poesina, Radu Tudor Ionescu, Josiane Mothe

    Abstract: To date, query performance prediction (QPP) in the context of content-based image retrieval remains a largely unexplored task, especially in the query-by-example scenario, where the query is an image. To boost the exploration of the QPP task in image retrieval, we propose the first benchmark for image query performance prediction (iQPP). First, we establish a set of four data sets (PASCAL VOC 2012… ▽ More

    Submitted 10 April, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted at SIGIR 2023

  23. arXiv:2212.07707  [pdf, other

    cs.CL cs.LG

    FreCDo: A Large Corpus for French Cross-Domain Dialect Identification

    Authors: Mihaela Gaman, Adrian-Gabriel Chifu, William Domingues, Radu Tudor Ionescu

    Abstract: We present a novel corpus for French dialect identification comprising 413,522 French text samples collected from public news websites in Belgium, Canada, France and Switzerland. To ensure an accurate estimation of the dialect identification performance of models, we designed the corpus to eliminate potential biases related to topic, writing style, and publication source. More precisely, the train… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  24. arXiv:2212.05922  [pdf, other

    cs.CV cs.SD

    Audiovisual Masked Autoencoders

    Authors: Mariana-Iuliana Georgescu, Eduardo Fonseca, Radu Tudor Ionescu, Mario Lucic, Cordelia Schmid, Anurag Arnab

    Abstract: Can we leverage the audiovisual information already present in video to improve self-supervised representation learning? To answer this question, we study various pretraining architectures and objectives within the masked autoencoding framework, motivated by the success of similar methods in natural language and image understanding. We show that we can achieve significant improvements on audiovisu… ▽ More

    Submitted 4 January, 2024; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: ICCV 2023

  25. arXiv:2211.15597  [pdf, other

    cs.CV cs.AI cs.LG cs.MM stat.ML

    Lightning Fast Video Anomaly Detection via Adversarial Knowledge Distillation

    Authors: Nicolae-Catalin Ristea, Florinel-Alin Croitoru, Dana Dascalescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: We propose a very fast frame-level model for anomaly detection in video, which learns to detect anomalies by distilling knowledge from multiple highly accurate object-level teacher models. To improve the fidelity of our student, we distill the low-resolution anomaly maps of the teachers by jointly applying standard and adversarial distillation, introducing an adversarial discriminator for each tea… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  26. arXiv:2210.12388  [pdf, other

    eess.IV cs.CV cs.LG

    Diversity-Promoting Ensemble for Medical Image Segmentation

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron

    Abstract: Medical image segmentation is an actively studied task in medical imaging, where the precision of the annotations is of utter importance towards accurate diagnosis and treatment. In recent years, the task has been approached with various deep learning systems, among the most popular models being U-Net. In this work, we propose a novel strategy to generate ensembles of different architectures for m… ▽ More

    Submitted 21 December, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: Accepted at SAC 2023

  27. Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection

    Authors: Neelu Madan, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: Anomaly detection has recently gained increasing attention in the field of computer vision, likely due to its broad set of applications ranging from product fault detection on industrial production lines and impending event detection in video surveillance to finding lesions in medical scans. Regardless of the domain, anomaly detection is typically framed as a one-class classification task, where t… ▽ More

    Submitted 5 October, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence

  28. Diffusion Models in Vision: A Survey

    Authors: Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Mubarak Shah

    Abstract: Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and a reverse diffusion stage. In the forward diffusion stage, the input data is gradually perturbed over several steps by adding Gaussian noise. In t… ▽ More

    Submitted 1 April, 2023; v1 submitted 10 September, 2022; originally announced September 2022.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence. 25 pages, 3 figures

  29. arXiv:2209.02057  [pdf, other

    stat.ML cs.CY cs.LG stat.AP

    Applying Machine Learning to Life Insurance: some knowledge sharing to master it

    Authors: Antoine Chancel, Laura Bradier, Antoine Ly, Razvan Ionescu, Laurene Martin, Marguerite Sauce

    Abstract: Machine Learning permeates many industries, which brings new source of benefits for companies. However within the life insurance industry, Machine Learning is not widely used in practice as over the past years statistical models have shown their efficiency for risk assessment. Thus insurers may face difficulties to assess the value of the artificial intelligence. Focusing on the modification of th… ▽ More

    Submitted 27 September, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

  30. arXiv:2207.08003  [pdf, other

    cs.CV cs.LG

    SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection

    Authors: Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: A self-supervised multi-task learning (SSMTL) framework for video anomaly detection was recently introduced in literature. Due to its highly accurate results, the method attracted the attention of many researchers. In this work, we revisit the self-supervised multi-task learning framework, proposing several updates to the original method. First, we study various detection methods, e.g. based on de… ▽ More

    Submitted 12 February, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: Accepted in Computer Vision and Image Understanding

  31. arXiv:2207.03477  [pdf, other

    cs.CL

    VeriDark: A Large-Scale Benchmark for Authorship Verification on the Dark Web

    Authors: Andrei Manolache, Florin Brad, Antonio Barbalau, Radu Tudor Ionescu, Marius Popescu

    Abstract: The DarkWeb represents a hotbed for illicit activity, where users communicate on different market forums in order to exchange goods and services. Law enforcement agencies benefit from forensic tools that perform authorship analysis, in order to identify and profile users based on their textual content. However, authorship analysis has been traditionally studied using corpora featuring literary tex… ▽ More

    Submitted 1 November, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks. 21 pages, 4 figures, 11 tables

  32. arXiv:2205.09180  [pdf, other

    cs.LG cs.CL cs.CV

    Learning Rate Curriculum

    Authors: Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Nicu Sebe

    Abstract: Most curriculum learning methods require an approach to sort the data samples by difficulty, which is often cumbersome to perform. In this work, we propose a novel curriculum learning approach termed Learning Rate Curriculum (LeRaC), which leverages the use of a different learning rate for each layer of a neural network to create a data-agnostic curriculum during the initial training epochs. More… ▽ More

    Submitted 5 July, 2024; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted at the International Journal of Computer Vision

  33. arXiv:2204.04218  [pdf, other

    eess.IV cs.CV cs.LG

    Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae-Catalin Ristea, Nicolae Verga, Fahad Shahbaz Khan

    Abstract: Super-resolving medical images can help physicians in providing more accurate diagnostics. In many situations, computed tomography (CT) or magnetic resonance imaging (MRI) techniques capture several scans (modes) during a single investigation, which can jointly be used (in a multimodal fashion) to further boost the quality of super-resolution results. To this end, we propose a novel multimodal mul… ▽ More

    Submitted 12 October, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted at WACV 2023 (main paper + supplementary)

  34. arXiv:2203.09581  [pdf, other

    cs.CV cs.LG

    SepTr: Separable Transformer for Audio Spectrogram Processing

    Authors: Nicolae-Catalin Ristea, Radu Tudor Ionescu, Fahad Shahbaz Khan

    Abstract: Following the successful application of vision transformers in multiple computer vision tasks, these models have drawn the attention of the signal processing community. This is because signals are often represented as spectrograms (e.g. through Discrete Fourier Transform) which can be directly provided as input to vision transformers. However, naively applying transformers to spectrograms is subop… ▽ More

    Submitted 20 June, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted at INTERSPEECH 2022

  35. arXiv:2202.07073  [pdf, other

    cs.CV cs.LG

    Discriminability-enforcing loss to improve representation learning

    Authors: Florinel-Alin Croitoru, Diana-Nicoleta Grigore, Radu Tudor Ionescu

    Abstract: During the training process, deep neural networks implicitly learn to represent the input data samples through a hierarchy of features, where the size of the hierarchy is determined by the number of layers. In this paper, we focus on enforcing the discriminative power of the high-level representations, that are typically learned by the deeper layers (closer to the output). To this end, we introduc… ▽ More

    Submitted 7 April, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted in CVPR Workshops

  36. arXiv:2202.05152  [pdf, other

    cs.CV cs.LG

    Feature-level augmentation to improve robustness of deep neural networks to affine transformations

    Authors: Adrian Sandru, Mariana-Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: Recent studies revealed that convolutional neural networks do not generalize well to small image transformations, e.g. rotations by a few degrees or translations of a few pixels. To improve the robustness to such transformations, we propose to introduce data augmentation at intermediate layers of the neural architecture, in addition to the common data augmentation applied on the input images. By i… ▽ More

    Submitted 20 August, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: Accepted at ECCV Workshop on Adversarial Robustness in the Real World (AROW 2022)

  37. arXiv:2201.12216  [pdf, other

    cs.CV cs.LG

    Self-paced learning to improve text row detection in historical documents with missing labels

    Authors: Mihaela Gaman, Lida Ghadamiyan, Radu Tudor Ionescu, Marius Popescu

    Abstract: An important preliminary step of optical character recognition systems is the detection of text rows. To address this task in the context of historical data with missing labels, we propose a self-paced learning algorithm capable of improving the row detection performance. We conjecture that pages with more ground-truth bounding boxes are less likely to have missing annotations. Based on this hypot… ▽ More

    Submitted 15 August, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: Accepted at ECCV Workshop on Text in Everything (TiE 2022)

  38. arXiv:2112.05125  [pdf, other

    cs.CL

    Rethinking the Authorship Verification Experimental Setups

    Authors: Florin Brad, Andrei Manolache, Elena Burceanu, Antonio Barbalau, Radu Ionescu, Marius Popescu

    Abstract: One of the main drivers of the recent advances in authorship verification is the PAN large-scale authorship dataset. Despite generating significant progress in the field, inconsistent performance differences between the closed and open test sets have been reported. To this end, we improve the experimental setup by proposing five new public splits over the PAN dataset, specifically designed to isol… ▽ More

    Submitted 1 November, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted as a short paper at the EMNLP 2022 conference. 10 pages, 5 figures, 9 tables

  39. arXiv:2111.11253  [pdf, other

    cond-mat.mtrl-sci

    Covering a surface with pre-stressed ribbons : from theory to nano-structures fabrication

    Authors: Alexandre Danescu, Philippe Regreny, Pierre Cremillieu, Jean-Louis Leclercq, Ioan R. Ionescu

    Abstract: The paper deals with the fabrication of nano-shells from pre-stressed nano-plates release. Due to geometrical and technological restrictions we have to cover a given surface with three-dimensional thin ribbons. We discuss the key role of the geodesic curvature in the design of such shell-ribbons. We show that including small-strains but large rotations we are able to control the metric tensor of b… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

  40. arXiv:2111.10561  [pdf, other

    cs.CV cs.LG

    Teacher-Student Training and Triplet Loss to Reduce the Effect of Drastic Face Occlusion

    Authors: Mariana-Iuliana Georgescu, Georgian Duta, Radu Tudor Ionescu

    Abstract: We study a series of recognition tasks in two realistic scenarios requiring the analysis of faces under strong occlusion. On the one hand, we aim to recognize facial expressions of people wearing Virtual Reality (VR) headsets. On the other hand, we aim to estimate the age and identify the gender of people wearing surgical masks. For all these tasks, the common ground is that half of the face is oc… ▽ More

    Submitted 20 November, 2021; originally announced November 2021.

    Comments: Accepted in Machine Vision and Applications. arXiv admin note: text overlap with arXiv:2008.01003

  41. arXiv:2111.10373  [pdf, other

    physics.class-ph math-ph

    Design of pre-stressed plate-strips to cover non-developable shells

    Authors: Alexandre Danescu, Ioan R. Ionescu

    Abstract: In this paper we address the following design problem: what is the shape of a plate and the associated pre-stress that relaxes toward a given three-dimensional shell? As isometric transformations conserve the gaussian curvature, three-dimensional non-developable shells cannot be obtained from the relaxation of pre-strained plates by using isometric transformations only. Overcoming this geometric r… ▽ More

    Submitted 26 January, 2022; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: Minor changes in Introduction, Sections 3 and 4, Conclusions and References

  42. arXiv:2111.09099  [pdf, other

    cs.CV cs.LG

    Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection

    Authors: Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: Anomaly detection is commonly pursued as a one-class classification problem, where models can only learn from normal training samples, while being evaluated on both normal and abnormal test samples. Among the successful approaches for anomaly detection, a distinguished category of methods relies on predicting masked information (e.g. patches, future frames, etc.) and leveraging the reconstruction… ▽ More

    Submitted 14 March, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: Accepted at CVPR 2022. Paper + supplementary (14 pages, 9 figures)

  43. arXiv:2111.08644  [pdf, other

    cs.CV cs.LG

    UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection

    Authors: Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: Detecting abnormal events in video is commonly framed as a one-class classification task, where training videos contain only normal events, while test videos encompass both normal and abnormal events. In this scenario, anomaly detection is an open-set problem. However, some studies assimilate anomaly detection to action recognition. This is a closed-set scenario that fails to test the capability o… ▽ More

    Submitted 7 April, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted at CVPR 2022. Paper + supplementary (15 pages, 9 figures)

  44. CyTran: A Cycle-Consistent Transformer with Multi-Level Consistency for Non-Contrast to Contrast CT Translation

    Authors: Nicolae-Catalin Ristea, Andreea-Iuliana Miron, Olivian Savencu, Mariana-Iuliana Georgescu, Nicolae Verga, Fahad Shahbaz Khan, Radu Tudor Ionescu

    Abstract: We propose a novel approach to translate unpaired contrast computed tomography (CT) scans to non-contrast CT scans and the other way around. Solving this task has two important applications: (i) to automatically generate contrast CT scans for patients for whom injecting contrast substance is not an option, and (ii) to enhance the alignment between contrast and non-contrast CT by reducing the diffe… ▽ More

    Submitted 5 April, 2023; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in Neurocomputing

  45. arXiv:2109.01745  [pdf, other

    cs.CV cs.LG

    A realistic approach to generate masked faces applied on two novel masked face recognition data sets

    Authors: Tudor Mare, Georgian Duta, Mariana-Iuliana Georgescu, Adrian Sandru, Bogdan Alexe, Marius Popescu, Radu Tudor Ionescu

    Abstract: The COVID-19 pandemic raises the problem of adapting face recognition systems to the new reality, where people may wear surgical masks to cover their noses and mouths. Traditional data sets (e.g., CelebA, CASIA-WebFace) used for training these systems were released before the pandemic, so they now seem unsuited due to the lack of examples of people wearing masks. We propose a method for enhancing… ▽ More

    Submitted 25 October, 2021; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: Accepted at NeurIPS 2021

  46. arXiv:2108.07387  [pdf, other

    cs.CV cs.LG

    Contextual Convolutional Neural Networks

    Authors: Ionut Cosmin Duta, Mariana Iuliana Georgescu, Radu Tudor Ionescu

    Abstract: We propose contextual convolution (CoConv) for visual recognition. CoConv is a direct replacement of the standard convolution, which is the core component of convolutional neural networks. CoConv is implicitly equipped with the capability of incorporating contextual information while maintaining a similar number of parameters and computational cost compared to the standard convolution. CoConv is i… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: Accepted at ICCV Workshop on Neural Architectures (NeurArch 2021)

  47. arXiv:2107.10536  [pdf, other

    cs.CR cs.LG

    Improving the Authentication with Built-in Camera Protocol Using Built-in Motion Sensors: A Deep Learning Solution

    Authors: Cezara Benegui, Radu Tudor Ionescu

    Abstract: We propose an enhanced version of the Authentication with Built-in Camera (ABC) protocol by employing a deep learning solution based on built-in motion sensors. The standard ABC protocol identifies mobile devices based on the photo-response non-uniformity (PRNU) of the camera sensor, while also considering QR-code-based meta-information. During authentication, the user is required to take two phot… ▽ More

    Submitted 27 July, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: Accepted for publication in Mathematics

  48. arXiv:2105.06456  [pdf, ps, other

    cs.CL cs.LG

    SaRoCo: Detecting Satire in a Novel Romanian Corpus of News Articles

    Authors: Ana-Cristina Rogoz, Mihaela Gaman, Radu Tudor Ionescu

    Abstract: In this work, we introduce a corpus for satire detection in Romanian news. We gathered 55,608 public news articles from multiple real and satirical news sources, composing one of the largest corpora for satire detection regardless of language and the only one for the Romanian language. We provide an official split of the text samples, such that training news articles belong to different sources th… ▽ More

    Submitted 30 June, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: Accepted at ACL 2021

  49. arXiv:2104.04828  [pdf, other

    cs.CL cs.LG

    FreSaDa: A French Satire Data Set for Cross-Domain Satire Detection

    Authors: Radu Tudor Ionescu, Adrian Gabriel Chifu

    Abstract: In this paper, we introduce FreSaDa, a French Satire Data Set, which is composed of 11,570 articles from the news domain. In order to avoid reporting unreasonably high accuracy rates due to the learning of characteristics specific to publication sources, we divided our samples into training, validation and test, such that the training publication sources are distinct from the validation and test p… ▽ More

    Submitted 16 May, 2021; v1 submitted 10 April, 2021; originally announced April 2021.

    Comments: Accepted at IJCNN 2021

  50. arXiv:2103.11988  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Self-paced ensemble learning for speech and audio classification

    Authors: Nicolae-Catalin Ristea, Radu Tudor Ionescu

    Abstract: Combining multiple machine learning models into an ensemble is known to provide superior performance levels compared to the individual components forming the ensemble. This is because models can complement each other in taking better decisions. Instead of just combining the models, we propose a self-paced ensemble learning scheme in which models learn from each other over several iterations. Durin… ▽ More

    Submitted 8 June, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Accepted at INTERSPEECH 2021