Skip to main content

Showing 1–50 of 60 results for author: Dimitrov, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14227  [pdf, other

    cs.PL

    Modular Synthesis of Efficient Quantum Uncomputation

    Authors: Hristo Venev, Timon Gehr, Dimitar Dimitrov, Martin Vechev

    Abstract: A key challenge of quantum programming is uncomputation: the reversible deallocation of qubits. And while there has been much recent progress on automating uncomputation, state-of-the-art methods are insufficient for handling today's expressive quantum programming languages. A core reason is that they operate on primitive quantum circuits, while quantum programs express computations beyond circuit… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 25 pages, 9 figures

    ACM Class: D.3.1

  2. arXiv:2405.15586  [pdf, other

    cs.LG cs.DC

    DAGER: Exact Gradient Inversion for Large Language Models

    Authors: Ivo Petrov, Dimitar I. Dimitrov, Maximilian Baader, Mark Niklas Müller, Martin Vechev

    Abstract: Federated learning works by aggregating locally computed gradients from multiple clients, thus enabling collaborative training without sharing private client data. However, prior work has shown that the data can actually be recovered by the server using so-called gradient inversion attacks. While these attacks perform well when applied on images, they are limited in the text domain and only permit… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    ACM Class: I.2.7; I.2.11

  3. arXiv:2405.12250  [pdf, other

    cs.LG cs.AI cs.CL

    Your Transformer is Secretly Linear

    Authors: Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova, Nikolai Gerasimenko, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov

    Abstract: This paper reveals a novel linear characteristic exclusive to transformer decoders, including models such as GPT, LLaMA, OPT, BLOOM and others. We analyze embedding transformations between sequential layers, uncovering a near-perfect linear relationship (Procrustes similarity score of 0.99). However, linearity decreases when the residual component is removed due to a consistently low output norm o… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 9 pages, 9 figures

  4. arXiv:2404.06212  [pdf, other

    cs.CV cs.AI cs.LG

    OmniFusion Technical Report

    Authors: Elizaveta Goncharova, Anton Razzhigaev, Matvey Mikhalchuk, Maxim Kurkin, Irina Abdullaeva, Matvey Skripkin, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov

    Abstract: Last year, multimodal architectures served up a revolution in AI-based approaches and solutions, extending the capabilities of large language models (LLM). We propose an \textit{OmniFusion} model based on a pretrained LLM and adapters for visual modality. We evaluated and compared several architecture design principles for better text and visual data coupling: MLP and transformer adapters, various… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 17 pages, 4 figures, 9 tables, 2 appendices

    MSC Class: 6804; 68T50 (Primary) ACM Class: I.2.7; I.2.10; I.4.9

  5. arXiv:2404.01992  [pdf, other

    cs.CL

    Dissecting Paraphrases: The Impact of Prompt Syntax and supplementary Information on Knowledge Retrieval from Pretrained Language Models

    Authors: Stephan Linzbach, Dimitar Dimitrov, Laura Kallmeyer, Kilian Evang, Hajira Jabeen, Stefan Dietze

    Abstract: Pre-trained Language Models (PLMs) are known to contain various kinds of knowledge. One method to infer relational knowledge is through the use of cloze-style prompts, where a model is tasked to predict missing subjects or objects. Typically, designing these prompts is a tedious task because small differences in syntax or semantics can have a substantial impact on knowledge retrieval performance.… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted for NAACL 2024

  6. arXiv:2403.10378  [pdf, other

    cs.CL cs.CV

    EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models

    Authors: Rocktim Jyoti Das, Simeon Emilov Hristov, Haonan Li, Dimitar Iliyanov Dimitrov, Ivan Koychev, Preslav Nakov

    Abstract: We introduce EXAMS-V, a new challenging multi-discipline multimodal multilingual exam benchmark for evaluating vision language models. It consists of 20,932 multiple-choice questions across 20 school disciplines covering natural science, social science, and other miscellaneous studies, e.g., religion, fine arts, business, etc. EXAMS-V includes a variety of multimodal features such as text, images,… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  7. arXiv:2403.03945  [pdf, other

    cs.LG cs.CR cs.DC

    SPEAR:Exact Gradient Inversion of Batches in Federated Learning

    Authors: Dimitar I. Dimitrov, Maximilian Baader, Mark Niklas Müller, Martin Vechev

    Abstract: Federated learning is a framework for collaborative machine learning where clients only share gradient updates and not their private data with a server. However, it was recently shown that gradient inversion attacks can reconstruct this data from the shared gradients. In the important honest-but-curious setting, existing attacks enable exact reconstruction only for a batch size of $b=1$, with larg… ▽ More

    Submitted 3 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    ACM Class: I.2.11

  8. arXiv:2401.04531  [pdf, other

    cs.CL cs.AI

    MERA: A Comprehensive LLM Evaluation in Russian

    Authors: Alena Fenogenova, Artem Chervyakov, Nikita Martynov, Anastasia Kozlova, Maria Tikhonova, Albina Akhmetgareeva, Anton Emelyanov, Denis Shevelev, Pavel Lebedev, Leonid Sinev, Ulyana Isaeva, Katerina Kolomeytseva, Daniil Moskovskiy, Elizaveta Goncharova, Nikita Savushkin, Polina Mikhailova, Denis Dimitrov, Alexander Panchenko, Sergei Markov

    Abstract: Over the past few years, one of the most notable advancements in AI research has been in foundation models (FMs), headlined by the rise of language models (LMs). As the models' size increases, LMs demonstrate enhancements in measurable aspects and the development of new qualitative features. However, despite researchers' attention and the rapid growth in LM application, the capabilities, limitatio… ▽ More

    Submitted 12 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: The paper version comparable with the release code v.1.1.0 of the benchmark. The links and scores are updated

  9. arXiv:2312.03511  [pdf, other

    cs.CV cs.LG cs.MM

    Kandinsky 3.0 Technical Report

    Authors: Vladimir Arkhipkin, Andrei Filatov, Viacheslav Vasilev, Anastasia Maltseva, Said Azizov, Igor Pavlov, Julia Agafonova, Andrey Kuznetsov, Denis Dimitrov

    Abstract: We present Kandinsky 3.0, a large-scale text-to-image generation model based on latent diffusion, continuing the series of text-to-image Kandinsky models and reflecting our progress to achieve higher quality and realism of image generation. In this report we describe the architecture of the model, the data collection procedure, the training technique, and the production system for user interaction… ▽ More

    Submitted 28 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Project page: https://ai-forever.github.io/Kandinsky-3

  10. arXiv:2311.13073  [pdf, other

    cs.CV cs.LG cs.MM

    FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

    Authors: Vladimir Arkhipkin, Zein Shaheen, Viacheslav Vasilev, Elizaveta Dakhova, Andrey Kuznetsov, Denis Dimitrov

    Abstract: Multimedia generation approaches occupy a prominent place in artificial intelligence research. Text-to-image models achieved high-quality results over the last few years. However, video synthesis methods recently started to develop. This paper presents a new two-stage latent diffusion text-to-video generation architecture based on the text-to-image diffusion model. The first stage concerns keyfram… ▽ More

    Submitted 20 December, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: Project page: https://ai-forever.github.io/kandinsky-video/

  11. arXiv:2311.05928  [pdf, other

    cs.CL cs.AI cs.IT cs.LG math.GN

    The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models

    Authors: Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov

    Abstract: In this study, we present an investigation into the anisotropy dynamics and intrinsic dimension of embeddings in transformer architectures, focusing on the dichotomy between encoders and decoders. Our findings reveal that the anisotropy profile in transformer decoders exhibits a distinct bell-shaped curve, with the highest anisotropy concentrations in the middle layers. This pattern diverges from… ▽ More

    Submitted 26 February, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: Accepted to EACL-2024

  12. arXiv:2310.03502  [pdf, other

    cs.CV

    Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion

    Authors: Anton Razzhigaev, Arseniy Shakhmatov, Anastasia Maltseva, Vladimir Arkhipkin, Igor Pavlov, Ilya Ryabov, Angelina Kuts, Alexander Panchenko, Andrey Kuznetsov, Denis Dimitrov

    Abstract: Text-to-image generation is a significant domain in modern computer vision and has achieved substantial improvements through the evolution of generative architectures. Among these, there are diffusion-based models that have demonstrated essential quality enhancements. These models are generally split into two categories: pixel-level and latent-level approaches. We present Kandinsky1, a novel explo… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  13. arXiv:2309.06844  [pdf, other

    cs.CL cs.AI cs.MM

    Gpachov at CheckThat! 2023: A Diverse Multi-Approach Ensemble for Subjectivity Detection in News Articles

    Authors: Georgi Pachov, Dimitar Dimitrov, Ivan Koychev, Preslav Nakov

    Abstract: The wide-spread use of social networks has given rise to subjective, misleading, and even false information on the Internet. Thus, subjectivity detection can play an important role in ensuring the objectiveness and the quality of a piece of information. This paper presents the solution built by the Gpachov team for the CLEF-2023 CheckThat! lab Task~2 on subjectivity detection. Three different rese… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  14. arXiv:2306.03013  [pdf, other

    cs.CR cs.LG

    Hiding in Plain Sight: Disguising Data Stealing Attacks in Federated Learning

    Authors: Kostadin Garov, Dimitar I. Dimitrov, Nikola Jovanović, Martin Vechev

    Abstract: Malicious server (MS) attacks have enabled the scaling of data stealing in federated learning to large batch sizes and secure aggregation, settings previously considered private. However, many concerns regarding the client-side detectability of MS attacks were raised, questioning their practicality. In this work, for the first time, we thoroughly study client-side detectability. We first demonstra… ▽ More

    Submitted 15 April, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    ACM Class: I.2.11

  15. arXiv:2303.16531  [pdf, other

    cs.CV

    RusTitW: Russian Language Text Dataset for Visual Text in-the-Wild Recognition

    Authors: Igor Markov, Sergey Nesteruk, Andrey Kuznetsov, Denis Dimitrov

    Abstract: Information surrounds people in modern life. Text is a very efficient type of information that people use for communication for centuries. However, automated text-in-the-wild recognition remains a challenging problem. The major limitation for a DL system is the lack of training data. For the competitive performance, training set must contain many samples that replicate the real-world cases. While… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: 5 pages, 6 figures, 2 tables

  16. arXiv:2210.07213  [pdf, other

    cs.LG cs.AI cs.CY

    FARE: Provably Fair Representation Learning with Practical Certificates

    Authors: Nikola Jovanović, Mislav Balunović, Dimitar I. Dimitrov, Martin Vechev

    Abstract: Fair representation learning (FRL) is a popular class of methods aiming to produce fair classifiers via data preprocessing. Recent regulatory directives stress the need for FRL methods that provide practical certificates, i.e., provable upper bounds on the unfairness of any downstream classifier trained on preprocessed data, which directly provides assurance in a practical scenario. Creating such… ▽ More

    Submitted 8 June, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: ICML 2023

  17. arXiv:2210.01785  [pdf, other

    cs.LG cs.CR cs.DC

    TabLeak: Tabular Data Leakage in Federated Learning

    Authors: Mark Vero, Mislav Balunović, Dimitar I. Dimitrov, Martin Vechev

    Abstract: While federated learning (FL) promises to preserve privacy, recent works in the image and text domains have shown that training updates leak private client data. However, most high-stakes applications of FL (e.g., in healthcare and finance) use tabular data, where the risk of data leakage has not yet been explored. A successful attack for tabular data must address two key challenges unique to the… ▽ More

    Submitted 7 July, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    ACM Class: I.2.11

  18. arXiv:2208.00406  [pdf, other

    cs.LG cs.AI cs.CE cs.CY

    Eco2AI: carbon emissions tracking of machine learning models as the first step towards sustainable AI

    Authors: Semen Budennyy, Vladimir Lazarev, Nikita Zakharenko, Alexey Korovin, Olga Plosskaya, Denis Dimitrov, Vladimir Arkhipkin, Ivan Oseledets, Ivan Barsola, Ilya Egorov, Aleksandra Kosterina, Leonid Zhukov

    Abstract: The size and complexity of deep neural networks continue to grow exponentially, significantly increasing energy consumption for training and inference by these models. We introduce an open-source package eco2AI to help data scientists and researchers to track energy consumption and equivalent CO2 emissions of their models in a straightforward way. In eco2AI we put emphasis on accuracy of energy co… ▽ More

    Submitted 3 August, 2022; v1 submitted 31 July, 2022; originally announced August 2022.

    Comments: Source code for eco2AI package (energy consumption and carbon emission tracker of code in python) is available at: https://github.com/sb-ai-lab/Eco2AI , the package is also available at PyPi: https://pypi.org/project/eco2ai/

  19. arXiv:2206.12395  [pdf, other

    cs.LG cs.CR cs.DC

    Data Leakage in Federated Averaging

    Authors: Dimitar I. Dimitrov, Mislav Balunović, Nikola Konstantinov, Martin Vechev

    Abstract: Recent attacks have shown that user data can be recovered from FedSGD updates, thus breaking privacy. However, these attacks are of limited practical relevance as federated learning typically uses the FedAvg algorithm. Compared to FedSGD, recovering data from FedAvg updates is much harder as: (i) the updates are computed at unobserved intermediate network weights, (ii) a large number of batches ar… ▽ More

    Submitted 1 November, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

    ACM Class: I.2.11

  20. arXiv:2205.04274  [pdf, other

    cs.CL cs.AI cs.CV

    Detecting and Understanding Harmful Memes: A Survey

    Authors: Shivam Sharma, Firoj Alam, Md. Shad Akhtar, Dimitar Dimitrov, Giovanni Da San Martino, Hamed Firooz, Alon Halevy, Fabrizio Silvestri, Preslav Nakov, Tanmoy Chakraborty

    Abstract: The automatic identification of harmful content online is of major concern for social media platforms, policymakers, and society. Researchers have studied textual, visual, and audio content, but typically in isolation. Yet, harmful content often combines multiple modalities, as in the case of memes, which are of particular interest due to their viral nature. With this in mind, here we offer a comp… ▽ More

    Submitted 29 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepted at IJCAI-ECAI 2022 (Survey Track) - Editorial Feedback Revised, 9 pages (7 main + 2 reference pages)

  21. arXiv:2203.16872  [pdf, ps, other

    cs.GT

    Group Control for Procedural Rules: Parameterized Complexity and Consecutive Domains

    Authors: Yongjie Yang, Dinko Dimitrov

    Abstract: We consider Group Control by Adding Individuals (GCAI) in the setting of group identification for two procedural rules -- the consensus-start-respecting rule and the liberal-start-respecting rule. It is known that GCAI for both rules are NP-hard, but whether they are fixed-parameter tractable with respect to the number of distinguished individuals remained open. We resolve both open problems in th… ▽ More

    Submitted 26 January, 2023; v1 submitted 31 March, 2022; originally announced March 2022.

  22. arXiv:2202.10784  [pdf, other

    cs.CV cs.AI

    RuCLIP -- new models and experiments: a technical report

    Authors: Alex Shonenkov, Andrey Kuznetsov, Denis Dimitrov, Tatyana Shavrina, Daniil Chesakov, Anastasia Maltseva, Alena Fenogenova, Igor Pavlov, Anton Emelyanov, Sergey Markov, Daria Bakshandaeva, Vera Shybaeva, Andrey Chertok

    Abstract: In the report we propose six new implementations of ruCLIP model trained on our 240M pairs. The accuracy results are compared with original CLIP model with Ru-En translation (OPUS-MT) on 16 datasets from different domains. Our best implementations outperform CLIP + OPUS-MT solution on most of the datasets in few-show and zero-shot tasks. In the report we briefly describe the implementations and co… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  23. arXiv:2202.10435  [pdf, ps, other

    cs.LG cs.AI

    Survey on Large Scale Neural Network Training

    Authors: Julia Gusak, Daria Cherniuk, Alena Shilova, Alexander Katrutsa, Daniel Bershatsky, Xunyi Zhao, Lionel Eyraud-Dubois, Oleg Shlyazhko, Denis Dimitrov, Ivan Oseledets, Olivier Beaumont

    Abstract: Modern Deep Neural Networks (DNNs) require significant memory to store weight, activations, and other intermediate tensors during training. Hence, many models do not fit one GPU device or can be trained using only a small per-GPU batch size. This survey provides a systematic overview of the approaches that enable more efficient DNNs training. We analyze techniques that save memory and make good us… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

  24. arXiv:2202.08827  [pdf, other

    cs.LG cs.DC

    LAMP: Extracting Text from Gradients with Language Model Priors

    Authors: Mislav Balunović, Dimitar I. Dimitrov, Nikola Jovanović, Martin Vechev

    Abstract: Recent work shows that sensitive user data can be reconstructed from gradient updates, breaking the key privacy promise of federated learning. While success was demonstrated primarily on image data, these methods do not directly transfer to other domains such as text. In this work, we propose LAMP, a novel attack tailored to textual data, that successfully reconstructs original text from gradients… ▽ More

    Submitted 19 October, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    ACM Class: I.2.7; I.2.11

  25. arXiv:2202.03046  [pdf, other

    cs.CV

    A new face swap method for image and video domains: a technical report

    Authors: Daniil Chesakov, Anastasia Maltseva, Alexander Groshev, Andrey Kuznetsov, Denis Dimitrov

    Abstract: Deep fake technology became a hot field of research in the last few years. Researchers investigate sophisticated Generative Adversarial Networks (GAN), autoencoders, and other approaches to establish precise and robust algorithms for face swap**. Achieved results show that the deep fake unsupervised synthesis task has problems in terms of the visual quality of generated data. These problems usua… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  26. arXiv:2202.00441  [pdf, other

    cs.LG cs.AI

    Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction

    Authors: Georgii Novikov, Daniel Bershatsky, Julia Gusak, Alex Shonenkov, Denis Dimitrov, Ivan Oseledets

    Abstract: Memory footprint is one of the main limiting factors for large neural network training. In backpropagation, one needs to store the input to each operation in the computational graph. Every modern neural network model has quite a few pointwise nonlinearities in its architecture, and such operation induces additional memory costs which -- as we show -- can be significantly reduced by quantization of… ▽ More

    Submitted 2 February, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted

  27. arXiv:2112.07395  [pdf, other

    cs.CV

    Handwritten text generation and strikethrough characters augmentation

    Authors: Alex Shonenkov, Denis Karachev, Max Novopoltsev, Mark Potanin, Denis Dimitrov, Andrey Chertok

    Abstract: We introduce two data augmentation techniques, which, used with a Resnet-BiLSTM-CTC network, significantly reduce Word Error Rate (WER) and Character Error Rate (CER) beyond best-reported results on handwriting text recognition (HTR) tasks. We apply a novel augmentation that simulates strikethrough text (HandWritten Blots) and a handwritten text generation method based on printed text (StackMix),… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 16 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2108.11667

    MSC Class: 68-04 ACM Class: I.7.5; I.4.6

  28. arXiv:2112.02448  [pdf, other

    cs.CL cs.AI cs.LG

    Emojich -- zero-shot emoji generation using Russian language: a technical report

    Authors: Alex Shonenkov, Daria Bakshandaeva, Denis Dimitrov, Aleksandr Nikolich

    Abstract: This technical report presents a text-to-image neural network "Emojich" that generates emojis using captions in Russian language as a condition. We aim to keep the generalization ability of a pretrained big model ruDALL-E Malevich (XL) 1.3B parameters at the fine-tuning stage, while giving special style to the images generated. Here are presented some engineering methods, code realization, all hyp… ▽ More

    Submitted 12 January, 2022; v1 submitted 4 December, 2021; originally announced December 2021.

    Comments: 5 pages, 4 figures and big figure at appendix, technical report

  29. arXiv:2111.10974  [pdf, other

    cs.CV cs.AI cs.CL

    Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture

    Authors: Daria Bakshandaeva, Denis Dimitrov, Vladimir Arkhipkin, Alex Shonenkov, Mark Potanin, Denis Karachev, Andrey Kuznetsov, Anton Voronov, Vera Davydova, Elena Tutubalina, Aleksandr Petiushko

    Abstract: Supporting the current trend in the AI community, we present the AI Journey 2021 Challenge called Fusion Brain, the first competition which is targeted to make the universal architecture which could process different modalities (in this case, images, texts, and code) and solve multiple tasks for vision and language. The Fusion Brain Challenge combines the following specific tasks: Code2code Transl… ▽ More

    Submitted 28 December, 2022; v1 submitted 21 November, 2021; originally announced November 2021.

  30. arXiv:2111.04706  [pdf, other

    cs.LG cs.CR

    Bayesian Framework for Gradient Leakage

    Authors: Mislav Balunović, Dimitar I. Dimitrov, Robin Staab, Martin Vechev

    Abstract: Federated learning is an established method for training machine learning models without sharing training data. However, recent work has shown that it cannot guarantee data privacy as shared gradients can still leak sensitive information. To formalize the problem of gradient leakage, we propose a theoretical framework that enables, for the first time, analysis of the Bayes optimal adversary phrase… ▽ More

    Submitted 17 March, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

  31. arXiv:2110.00413  [pdf, other

    cs.CL cs.LG cs.MM cs.SI

    Detecting Harmful Memes and Their Targets

    Authors: Shraman Pramanick, Dimitar Dimitrov, Rituparna Mukherjee, Shivam Sharma, Md. Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty

    Abstract: Among the various modes of communication in social media, the use of Internet memes has emerged as a powerful means to convey political, psychological, and socio-cultural opinions. Although memes are typically humorous in nature, recent days have witnessed a proliferation of harmful memes targeted to abuse various social entities. As most harmful memes are highly satirical and abstruse without app… ▽ More

    Submitted 24 September, 2021; originally announced October 2021.

    Comments: harmful memes, multimodality, social media

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

    Journal ref: ACL-2021 (Findings)

  32. arXiv:2109.08013  [pdf, other

    cs.CV cs.CL cs.LG cs.MM

    Detecting Propaganda Techniques in Memes

    Authors: Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino

    Abstract: Propaganda can be defined as a form of communication that aims to influence the opinions or the actions of people towards a specific goal; this is achieved by means of well-defined rhetorical and psychological devices. Propaganda, in the form we know it today, can be dated back to the beginning of the 17th century. However, it is with the advent of the Internet and the social media that it has sta… ▽ More

    Submitted 7 August, 2021; originally announced September 2021.

    Comments: propaganda, disinformation, fake news, memes, multimodality. arXiv admin note: text overlap with arXiv:2105.09284

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: ACL-2021

  33. arXiv:2109.05184  [pdf, other

    cs.MM cs.CL

    MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets

    Authors: Shraman Pramanick, Shivam Sharma, Dimitar Dimitrov, Md Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty

    Abstract: Internet memes have become powerful means to transmit political, psychological, and socio-cultural ideas. Although memes are typically humorous, recent days have witnessed an escalation of harmful memes used for trolling, cyberbullying, and abuse. Detecting such memes is challenging as they can be highly satirical and cryptic. Moreover, while previous work has focused on specific aspects of memes… ▽ More

    Submitted 22 September, 2021; v1 submitted 11 September, 2021; originally announced September 2021.

    Comments: The paper has been accepted in the Findings of Empirical Methods in Natural Language Processing (EMNLP), 2021

  34. arXiv:2109.00542  [pdf, other

    cs.LG cs.CR

    Shared Certificates for Neural Network Verification

    Authors: Marc Fischer, Christian Sprecher, Dimitar I. Dimitrov, Gagandeep Singh, Martin Vechev

    Abstract: Existing neural network verifiers compute a proof that each input is handled correctly under a given perturbation by propagating a symbolic abstraction of reachable values at each layer. This process is repeated from scratch independently for each input (e.g., image) and perturbation (e.g., rotation), leading to an expensive overall proof effort when handling an entire dataset. In this work, we in… ▽ More

    Submitted 23 November, 2023; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: Extended version of our CAV'22 paper

  35. arXiv:2108.11667  [pdf, other

    cs.CV

    StackMix and Blot Augmentations for Handwritten Text Recognition

    Authors: Alex Shonenkov, Denis Karachev, Maxim Novopoltsev, Mark Potanin, Denis Dimitrov

    Abstract: This paper proposes a handwritten text recognition(HTR) system that outperforms current state-of-the-artmethods. The comparison was carried out on three of themost frequently used in HTR task datasets, namely Ben-tham, IAM, and Saint Gall. In addition, the results on tworecently presented datasets, Peter the Greats manuscriptsand HKR Dataset, are provided.The paper describes the architecture of th… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 17 pages, 9 figures

    MSC Class: 68-04 ACM Class: I.7.5; I.4.6

  36. arXiv:2105.09284  [pdf, other

    cs.MM cs.CL cs.LG

    SemEval-2021 Task 6: Detection of Persuasion Techniques in Texts and Images

    Authors: Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, Giovanni Da San Martino

    Abstract: We describe SemEval-2021 task 6 on Detection of Persuasion Techniques in Texts and Images: the data, the annotation guidelines, the evaluation setup, the results, and the participating systems. The task focused on memes and had three subtasks: (i) detecting the techniques in the text, (ii) detecting the text spans where the techniques are used, and (iii) detecting techniques in the entire meme, i.… ▽ More

    Submitted 25 April, 2021; originally announced May 2021.

    Comments: propaganda, disinformation, misinformation, fake news, memes, multimodality

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

    Journal ref: SemEval-2021

  37. arXiv:2103.12541  [pdf, other

    cs.MM cs.AI cs.CL cs.CR cs.CY cs.LG cs.SI

    A Survey on Multimodal Disinformation Detection

    Authors: Firoj Alam, Stefano Cresci, Tanmoy Chakraborty, Fabrizio Silvestri, Dimiter Dimitrov, Giovanni Da San Martino, Shaden Shaar, Hamed Firooz, Preslav Nakov

    Abstract: Recent years have witnessed the proliferation of offensive content online such as fake news, propaganda, misinformation, and disinformation. While initially this was mostly about textual content, over time images and videos gained popularity, as they are much easier to consume, attract more attention, and spread further than text. As a result, researchers started leveraging different modalities an… ▽ More

    Submitted 28 September, 2022; v1 submitted 13 March, 2021; originally announced March 2021.

    Comments: Accepted at COLING-2022, disinformation, misinformation, factuality, harmfulness, fake news, propaganda, multimodality, text, images, videos, network structure, temporality

    MSC Class: 68T50 ACM Class: I.2.7

  38. arXiv:2103.09354  [pdf, other

    cs.CV cs.AI cs.LG

    Digital Peter: Dataset, Competition and Handwriting Recognition Methods

    Authors: Mark Potanin, Denis Dimitrov, Alex Shonenkov, Vladimir Bataev, Denis Karachev, Maxim Novopoltsev

    Abstract: This paper presents a new dataset of Peter the Great's manuscripts and describes a segmentation procedure that converts initial images of documents into the lines. The new dataset may be useful for researchers to train handwriting text recognition models as a benchmark for comparing different models. It consists of 9 694 images and text files corresponding to lines in historical documents. The ope… ▽ More

    Submitted 27 August, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: 17 pages, 7 figures, submitted to ICDAR 2021

    ACM Class: I.7.5; I.4.6

  39. arXiv:2007.12133  [pdf, other

    cs.LG stat.ML

    Provably Robust Adversarial Examples

    Authors: Dimitar I. Dimitrov, Gagandeep Singh, Timon Gehr, Martin Vechev

    Abstract: We introduce the concept of provably robust adversarial examples for deep neural networks - connected input regions constructed from standard adversarial examples which are guaranteed to be robust to a set of real-world perturbations (such as changes in pixel intensity and geometric transformations). We present a novel method called PARADE for generating these regions in a scalable manner which wo… ▽ More

    Submitted 17 March, 2022; v1 submitted 23 July, 2020; originally announced July 2020.

  40. TweetsCOV19 -- A Knowledge Base of Semantically Annotated Tweets about the COVID-19 Pandemic

    Authors: Dimitar Dimitrov, Erdal Baran, Pavlos Fafalios, Ran Yu, Xiaofei Zhu, Matthäus Zloch, Stefan Dietze

    Abstract: Publicly available social media archives facilitate research in the social sciences and provide corpora for training and testing a wide range of machine learning and natural language processing methods. With respect to the recent outbreak of the Coronavirus disease 2019 (COVID-19), online discourse on Twitter reflects public opinion and perception related to the pandemic itself as well as mitigati… ▽ More

    Submitted 15 August, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

  41. arXiv:2006.02168  [pdf

    cs.SE cs.AI

    A Mixed Initiative Semantic Web Framework for Process Composition

    Authors: **ghai Rao, Dimitar Dimitrov, Paul Hofmann, Norman Sadeh

    Abstract: Semantic Web technologies offer the prospect of significantly reducing the amount of effort required to integrate existing enterprise functionality in support of new composite processes; whether within a given organization or across multiple ones. A significant body of work in this area has aimed to fully automate this process, while assuming that all functionality has already been encapsulated in… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

    Journal ref: IEEE International Semantic Web Conference 2006, 873-886, Springer, Berlin, Heidelberg

  42. arXiv:2001.05703  [pdf, other

    cs.CV cs.LG cs.RO

    A Markerless Deep Learning-based 6 Degrees of Freedom PoseEstimation for with Mobile Robots using RGB Data

    Authors: Linh Kästner, Daniel Dimitrov, Jens Lambrecht

    Abstract: Augmented Reality has been subject to various integration efforts within industries due to its ability to enhance human machine interaction and understanding. Neural networks have achieved remarkable results in areas of computer vision, which bear great potential to assist and facilitate an enhanced Augmented Reality experience. However, most neural networks are computationally intensive and deman… ▽ More

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: 6 pages,5 figures. arXiv admin note: text overlap with arXiv:1912.12101

  43. Query for Architecture, Click through Military: Comparing the Roles of Search and Navigation on Wikipedia

    Authors: Dimitar Dimitrov, Florian Lemmerich, Fabian Flöck, Markus Strohmaier

    Abstract: As one of the richest sources of encyclopedic information on the Web, Wikipedia generates an enormous amount of traffic. In this paper, we study large-scale article access data of the English Wikipedia in order to compare articles with respect to the two main paradigms of information seeking, i.e., search by formulating a query, and navigation by following hyperlinks. To this end, we propose and e… ▽ More

    Submitted 10 May, 2018; originally announced May 2018.

  44. Forbidden branches in trees with minimal atom-bond connectivity index

    Authors: Darko Dimitrov, Zhibin Du, Carlos M. da Fonseca

    Abstract: The atom-bond connectivity (ABC) index has been, in recent years, one of the most actively studied vertex-degree-based graph invariants in chemical graph theory. For a given graph $G$, the ABC index is defined as $\sum_{uv\in E}\sqrt{\frac{d(u) +d(v)-2}{d(u)d(v)}}$, where $d(u)$ is the degree of vertex $u$ in $G$ and $E(G)$ denotes the set of edges of $G$. In this paper we present some new structu… ▽ More

    Submitted 27 June, 2017; originally announced June 2017.

    Journal ref: Applied Mathematics and Computation, Volume 313, 15 November 2017, Pages 418 - 430

  45. arXiv:1706.08587  [pdf, other

    cs.DM

    On structural properties of trees with minimal atom-bond connectivity index IV: Solving a conjecture about the pendent paths of length three

    Authors: Darko Dimitrov

    Abstract: The atom-bond connectivity (ABC) index is one of the most investigated degree-based molecular structure descriptors with a variety of chemical applications. It is known that among all connected graphs, the trees minimize the ABC index. However, a full characterization of trees with a minimal ABC index is still an open problem. By now, one of the proved properties is that a tree with a minimal ABC… ▽ More

    Submitted 26 June, 2017; originally announced June 2017.

  46. arXiv:1611.03995  [pdf, other

    cs.GT

    The Complexity of Shelflisting

    Authors: Yongjie Yang, Dinko Dimitrov

    Abstract: Optimal shelflisting invites profit maximization to become sensitive to the ways in which purchasing decisions are order-dependent. We study the computational complexity of the corresponding product arrangement problem when consumers are either rational maximizers, use a satisficing procedure, or apply successive choice. The complexity results we report are shown to crucially depend on the size of… ▽ More

    Submitted 12 November, 2016; originally announced November 2016.

    Comments: 10 pages

  47. arXiv:1611.02508  [pdf, other

    cs.SI physics.soc-ph

    What Makes a Link Successful on Wikipedia?

    Authors: Dimitar Dimitrov, Philipp Singer, Florian Lemmerich, Markus Strohmaier

    Abstract: While a plethora of hypertext links exist on the Web, only a small amount of them are regularly clicked. Starting from this observation, we set out to study large-scale click data from Wikipedia in order to understand what makes a link successful. We systematically analyze effects of link properties on the popularity of links. By utilizing mixed-effects hurdle models supplemented with descriptive… ▽ More

    Submitted 20 February, 2017; v1 submitted 8 November, 2016; originally announced November 2016.

  48. arXiv:1610.02574  [pdf, other

    math.CO cs.DM

    Remarks on the maximum atom-bond connectivity index of graphs with given parameters

    Authors: Darko Dimitrov, Barbara Ikica, Riste Škrekovski

    Abstract: The atom-bond connectivity (ABC) index is a degree-based molecular structure descriptor that can be used for modelling thermodynamic properties of organic chemical compounds. Motivated by its applicable potential, a series of investigations have been carried out in the past several years. In this note we first consider graphs with given edge-connectivity that attain the maximum ABC index. In parti… ▽ More

    Submitted 8 October, 2016; originally announced October 2016.

  49. arXiv:1608.05235  [pdf, ps, other

    cs.DM

    On the Irregularity of Some Molecular Structures

    Authors: Hosam Abdo, Darko Dimitrov, Wei Gao

    Abstract: Measures of the irregularity of chemical graphs could be helpful for QSAR/QSPR studies and for the descriptive purposes of biological and chemical properties, such as melting and boiling points, toxicity and resistance. Here we consider the following four established irregularity measures: the irregularity index by Albertson, the total irregularity, the variance of vertex degrees and the Collatz-S… ▽ More

    Submitted 18 August, 2016; originally announced August 2016.

    Comments: 23 pages, 15 figures

  50. arXiv:1606.03366  [pdf, ps, other

    cs.GT

    How Hard Is It to Control A Group?

    Authors: Yongjie Yang, Dinko Dimitrov

    Abstract: We consider group identification models in which the aggregation of individual opinions concerning who is qualified in a given society determines the set of socially qualified persons. In this setting, we study the extent to which social qualification can be changed when societies expand, shrink, or partition themselves. The answers we provide are with respect to the computational complexity of th… ▽ More

    Submitted 28 April, 2018; v1 submitted 10 June, 2016; originally announced June 2016.

    Comments: 25 pages