Showing 1–17 of 17 results for author: Nassar, A

Searching in archive cs.
  1. arXiv:2405.00505  [pdf, other]

    cs.IR cs.LG

    KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents

    Authors: Oshri Naparstek, Roi Pony, Inbar Shapira, Foad Abo Dahood, Ophir Azulai, Yevgeny Yaroker, Nadav Rubinstein, Maksym Lysak, Peter Staar, Ahmed Nassar, Nikolaos Livathinos, Christoph Auer, Elad Amrani, Idan Friedman, Orit Prince, Yevgeny Burshtein, Adi Raz Goldfarb, Udi Barzelay

    Abstract: In recent years, the challenge of extracting information from business documents has emerged as a critical task, finding applications across numerous domains. This effort has attracted substantial interest from both industry and academy, highlighting its significance in the current technological landscape. Most datasets in this area are primarily focused on Key Information Extraction (KIE), where…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: accepted ICDAR2024

  2. ESG Accountability Made Easy: DocQA at Your Service

    Authors: Lokesh Mishra, Cesar Berrospi, Kasper Dinkla, Diego Antognini, Francesco Fusco, Benedikt Bothur, Maksym Lysak, Nikolaos Livathinos, Ahmed Nassar, Panagiotis Vagenas, Lucas Morin, Christoph Auer, Michele Dolfi, Peter Staar

    Abstract: We present Deep Search DocQA. This application enables information extraction from documents via a question-answering conversational assistant. The system integrates several technologies from different AI disciplines consisting of document conversion to machine-readable format (via computer vision), finding relevant data (via natural language processing), and formulating an eloquent response (via…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted at the Demonstration Track of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI 24)

    Journal ref: AAAI 2024, 38, 23814-23816

  3. arXiv:2308.12234  [pdf, other]

    cs.CV

    MolGrapher: Graph-based Visual Recognition of Chemical Structures

    Authors: Lucas Morin, Martin Danelljan, Maria Isabel Agea, Ahmed Nassar, Valery Weber, Ingmar Meijer, Peter Staar, Fisher Yu

    Abstract: The automatic analysis of chemical literature has immense potential to accelerate the discovery of new materials and drugs. Much of the critical information in patent documents and scientific articles is contained in figures, depicting the molecule structures. However, automatically parsing the exact chemical structure is a formidable challenge, due to the amount of detailed information, the diver…

    Submitted 23 August, 2023; originally announced August 2023.

  4. ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents

    Authors: Christoph Auer, Ahmed Nassar, Maksym Lysak, Michele Dolfi, Nikolaos Livathinos, Peter Staar

    Abstract: Transforming documents into machine-processable representations is a challenging task due to their complex structures and variability in formats. Recovering the layout structure and content from PDF files or scanned material has remained a key problem for decades. ICDAR has a long tradition in hosting competitions to benchmark the state-of-the-art and encourage the development of novel solutions t…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: ICDAR 2023, 10 pages, 4 figures

  5. arXiv:2305.04927  [pdf, other]

    cs.CL cs.AI cs.CY

    Detecting and Reasoning of Deleted Tweets before they are Posted

    Authors: Hamdy Mubarak, Samir Abdaljalil, Azza Nassar, Firoj Alam

    Abstract: Social media platforms empower us in several ways, from information dissemination to consumption. While these platforms are useful in promoting citizen journalism, public awareness etc., they have misuse potentials. Malicious users use them to disseminate hate-speech, offensive content, rumor etc. to gain social and political agendas or to harm individuals, entities and organizations. Often times,…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: disinformation, misinformation, fake news

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

  6. arXiv:2305.03393  [pdf, other]

    cs.CV

    Optimized Table Tokenization for Table Structure Recognition

    Authors: Maksym Lysak, Ahmed Nassar, Nikolaos Livathinos, Christoph Auer, Peter Staar

    Abstract: Extracting tables from documents is a crucial task in any document conversion pipeline. Recently, transformer-based models have demonstrated that table-structure can be recognized with impressive accuracy using Image-to-Markup-Sequence (Im2Seq) approaches. Taking only the image of a table, such models predict a sequence of tokens (e.g. in HTML, LaTeX) which represent the structure of the table. Si…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted to ICDAR 2023, 12 pages, 6 figures

  7. DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

    Authors: Birgit Pfitzmann, Christoph Auer, Michele Dolfi, Ahmed S Nassar, Peter W J Staar

    Abstract: Accurate document layout analysis is a key requirement for high-quality PDF document conversion. With the recent availability of public, large ground-truth datasets such as PubLayNet and DocBank, deep-learning models have proven to be very effective at layout detection and segmentation. While these datasets are of adequate size to train such models, they severely lack in layout variability since t…

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 9 pages, 6 figures, 5 tables. Accepted paper at SIGKDD 2022 conference

  8. arXiv:2205.12328  [pdf]

    cs.CL cs.AI

    Multilevel sentiment analysis in Arabic

    Authors: Ahmed Nassar, Ebru Sezer

    Abstract: In this study, we aimed to improve the performance results of Arabic sentiment analysis. This can be achieved by investigating the most successful machine learning method and the most useful feature vector to classify sentiments in both term and document levels into two (positive or negative) categories. Moreover, specification of one polarity degree for the term that has more than one is investig…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 10 pages, 3 figures, Published in: 2019 IEEE 7th Palestinian International Conference on Electrical and Computer Engineering (PICECE), Date of Conference: 26-27 March 2019

    Report number: INSPEC Accession Number: 18793641

  9. arXiv:2203.01017  [pdf, other]

    cs.CV cs.LG

    TableFormer: Table Structure Understanding with Transformers

    Authors: Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar

    Abstract: Tables organize valuable content in a concise and compact representation. This content is extremely valuable for systems such as search engines, Knowledge Graph's, etc, since they enhance their predictive capabilities. Unfortunately, tables come in a large variety of shapes and sizes. Furthermore, they can have complex column/row-header configurations, multiline rows, different variety of separati…

    Submitted 11 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  10. arXiv:2107.12940  [pdf, other]

    cs.LG stat.ML

    Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

    Authors: Mark Koren, Ahmed Nassar, Mykel J. Kochenderfer

    Abstract: Validating the safety of autonomous systems generally requires the use of high-fidelity simulators that adequately capture the variability of real-world scenarios. However, it is generally not feasible to exhaustively search the space of simulation scenarios for failures. Adaptive stress testing (AST) is a method that uses reinforcement learning to find the most likely failure of a system. AST wit…

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: Accepted to IROS 2021

  11. Simulation by Rounds of Letter-to-Letter Transducers

    Authors: Antonio Abu Nassar, Shaull Almagor

    Abstract: Letter-to-letter transducers are a standard formalism for modeling reactive systems. Often, two transducers that model similar systems differ locally from one another, by behaving similarly, up to permutations of the input and output letters within "rounds". In this work, we introduce and study notions of simulation by rounds and equivalence by rounds of transducers. In our setting, words are part…

    Submitted 4 December, 2023; v1 submitted 4 May, 2021; originally announced May 2021.

    Journal ref: Logical Methods in Computer Science, Volume 19, Issue 4 (December 5, 2023) lmcs:9920

  12. arXiv:2102.09395  [pdf, other]

    cs.LG cs.CV cs.IR

    Robust PDF Document Conversion Using Recurrent Neural Networks

    Authors: Nikolaos Livathinos, Cesar Berrospi, Maksym Lysak, Viktor Kuropiatnyk, Ahmed Nassar, Andre Carvalho, Michele Dolfi, Christoph Auer, Kasper Dinkla, Peter Staar

    Abstract: The number of published PDF documents has increased exponentially in recent decades. There is a growing need to make their rich content discoverable to information retrieval tools. In this paper, we present a novel approach to document structure recovery in PDF using recurrent neural networks to process the low-level PDF data representation directly, instead of relying on a visual re-interpretatio…

    Submitted 18 February, 2021; originally announced February 2021.

    Comments: 9 pages, 2 tables, 4 figures, uses aaai21.sty. Accepted at the "Thirty-Third Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-21)". Received the "IAAI-21 Innovative Application Award"

    ACM Class: I.7.5; I.5.1; I.5.2; I.5.4; I.5.5; I.2.1

  13. arXiv:2010.09916  [pdf, other]

    cs.NI cs.AI cs.LG

    Deep Reinforcement Learning for Adaptive Network Slicing in 5G for Intelligent Vehicular Systems and Smart Cities

    Authors: Almuthanna Nassar, Yasin Yilmaz

    Abstract: Intelligent vehicular systems and smart city applications are the fastest growing Internet of things (IoT) implementations at a compound annual growth rate of 30%. In view of the recent advances in IoT devices and the emerging new breed of IoT applications driven by artificial intelligence (AI), fog radio access network (F-RAN) has been recently introduced for the fifth generation (5G) wireless co…

    Submitted 19 October, 2020; originally announced October 2020.

  14. arXiv:2003.10151  [pdf, other]

    cs.CV

    GeoGraph: Learning graph-based multi-view object detection with geometric cues end-to-end

    Authors: Ahmed Samy Nassar, Stefano D'Aronco, Sébastien Lefèvre, Jan D. Wegner

    Abstract: In this paper we propose an end-to-end learnable approach that detects static urban objects from multiple views, re-identifies instances, and finally assigns a geographic position per object. Our method relies on a Graph Neural Network (GNN) to, detect all objects and output their geographic positions given images and approximate camera poses as input. Our GNN simultaneously models relative pose a…

    Submitted 24 March, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

  15. arXiv:1907.10892  [pdf, other]

    cs.LG cs.CV stat.ML

    Simultaneous multi-view instance detection with learned geometric soft-constraints

    Authors: Ahmed Samy Nassar, Sebastien Lefevre, Jan D. Wegner

    Abstract: We propose to jointly learn multi-view geometry and warping between views of the same object instances for robust cross-view object detection. What makes multi-view object instance detection difficult are strong changes in viewpoint, lighting conditions, high similarity of neighbouring objects, and strong variability in scale. By turning object detection and instance re-identification in different…

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: International Conference on Computer Vision 2019 (ICCV 19)

  16. arXiv:1806.04582  [pdf, other]

    cs.NI

    Reinforcement Learning-based Resource Allocation in Fog RAN for IoT with Heterogeneous Latency Requirements

    Authors: Almuthanna T. Nassar, Yasin Yilmaz

    Abstract: In light of the quick proliferation of Internet of things (IoT) devices and applications, fog radio access network (Fog-RAN) has been recently proposed for fifth generation (5G) wireless communications to assure the requirements of ultra-reliable low-latency communication (URLLC) for the IoT applications which cannot accommodate large delays. Hence, fog nodes (FNs) are equipped with computing, sig…

    Submitted 15 January, 2019; v1 submitted 27 May, 2018; originally announced June 2018.

  17. Towards seamless multi-view scene analysis from satellite to street-level

    Authors: Sébastien Lefèvre, Devis Tuia, Jan Dirk Wegner, Timothée Produit, Ahmed Samy Nassar

    Abstract: In this paper, we discuss and review how combined multi-view imagery from satellite to street-level can benefit scene analysis. Numerous works exist that merge information from remote sensing and images acquired from the ground for tasks like land cover mapping, object detection, or scene understanding. What makes the combination of overhead and street-level images challenging, is the strongly var…

    Submitted 23 May, 2017; originally announced May 2017.

    Journal ref: Proceedings of the IEEE, 105, pp. 1884-1899, 2017