Skip to main content

Showing 1–8 of 8 results for author: Maguitman, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.11832  [pdf, other

    cs.IR cs.CL

    QuOTeS: Query-Oriented Technical Summarization

    Authors: Juan Ramirez-Orta, Eduardo Xamena, Ana Maguitman, Axel J. Soto, Flavia P. Zanoto, Evangelos Milios

    Abstract: Abstract. When writing an academic paper, researchers often spend considerable time reviewing and summarizing papers to extract relevant citations and data to compose the Introduction and Related Work sections. To address this problem, we propose QuOTeS, an interactive system designed to retrieve sentences related to a summary of the research from a collection of potential references and hence ass… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted at ICDAR 2023

  2. arXiv:2109.06264  [pdf, other

    cs.CL

    Post-OCR Document Correction with large Ensembles of Character Sequence-to-Sequence Models

    Authors: Juan Ramirez-Orta, Eduardo Xamena, Ana Maguitman, Evangelos Milios, Axel J. Soto

    Abstract: In this paper, we propose a novel method based on character sequence-to-sequence models to correct documents already processed with Optical Character Recognition (OCR) systems. The main contribution of this paper is a set of strategies to accurately process strings much longer than the ones used to train the sequence model while being sample- and resource-efficient, supported by thorough experimen… ▽ More

    Submitted 24 January, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

  3. arXiv:2007.06616  [pdf, other

    cs.IR

    Assessing the behavior and performance of a supervised term-weighting technique for topic-based retrieval

    Authors: Mariano Maisonnave, Fernando Delbianco, Fernando Tohmé, Ana Maguitman

    Abstract: This article analyses and evaluates FDD\b{eta}, a supervised term-weighting scheme that can be applied for query-term selection in topic-based retrieval. FDD\b{eta} weights terms based on two factors representing the descriptive and discriminating power of the terms with respect to the given topic. It then combines these two factor through the use of an adjustable parameter that allows to favor di… ▽ More

    Submitted 16 July, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

  4. arXiv:2007.01379  [pdf, other

    cs.CL cs.LG

    Detecting Ongoing Events Using Contextual Word and Sentence Embeddings

    Authors: Mariano Maisonnave, Fernando Delbianco, Fernando Tohmé, Ana Maguitman, Evangelos Milios

    Abstract: This paper introduces the Ongoing Event Detection (OED) task, which is a specific Event Detection task where the goal is to detect ongoing event mentions only, as opposed to historical, future, hypothetical, or other forms or events that are neither fresh nor current. Any application that needs to extract structured information about ongoing events from unstructured texts can take advantage of an… ▽ More

    Submitted 5 February, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

  5. arXiv:1006.0289  [pdf, ps, other

    cs.IR cs.AI

    Métodos para la Selección y el Ajuste de Características en el Problema de la Detección de Spam

    Authors: Carlos M. Lorenzetti, Rocío L. Cecchini, Ana G. Maguitman, András A. Benczúr

    Abstract: The email is used daily by millions of people to communicate around the globe and it is a mission-critical application for many businesses. Over the last decade, unsolicited bulk email has become a major problem for email users. An overwhelming amount of spam is flowing into users' mailboxes daily. In 2004, an estimated 62% of all email was attributed to spam. Spam is not only frustrating for most… ▽ More

    Submitted 14 October, 2010; v1 submitted 1 June, 2010; originally announced June 2010.

    Comments: 5 pages, 1 figure, Workshop de Investigadores en Ciencias de la Computación, WICC 2010, pp 48-52

    MSC Class: 68P20 ACM Class: H.3.3

    Journal ref: Workshop de Investigadores en Ciencias de la Computacion, WICC 2010, El Calafate, Santa Cruz, Argentina

  6. arXiv:1004.3478  [pdf, ps, other

    cs.IR cs.AI

    Learning Better Context Characterizations: An Intelligent Information Retrieval Approach

    Authors: Carlos M. Lorenzetti, Ana G. Maguitman

    Abstract: This paper proposes an incremental method that can be used by an intelligent system to learn better descriptions of a thematic context. The method starts with a small number of terms selected from a simple description of the topic under analysis and uses this description as the initial search context. Using these terms, a set of queries are built and submitted to a search engine. New documents and… ▽ More

    Submitted 27 April, 2010; v1 submitted 20 April, 2010; originally announced April 2010.

    Comments: 10 pages, 3 figures, CLEI 2008

    MSC Class: 68P20 ACM Class: H.3.3

    Journal ref: XXXIV Conferencia Latinoamericana de Informática, pp. 200-209, 2008

  7. Uncovering protein interaction in abstracts and text using a novel linear model and word proximity networks

    Authors: Alaa Abi-Haidar, Jasleen Kaur, Ana G. Maguitman, Predrag Radivojac, Andreas Retchsteiner, Karin Verspoor, Zhi** Wang, Luis M. Rocha

    Abstract: We participated in three of the protein-protein interaction subtasks of the Second BioCreative Challenge: classification of abstracts relevant for protein-protein interaction (IAS), discovery of protein pairs (IPS) and text passages characterizing protein interaction (ISS) in full text documents. We approached the abstract classification task with a novel, lightweight linear model inspired by sp… ▽ More

    Submitted 4 December, 2008; originally announced December 2008.

    Journal ref: Genome Biology 2008, 9(Suppl 2):S11

  8. arXiv:cs/0511035  [pdf, ps, other

    cs.NI cond-mat.dis-nn physics.soc-ph

    Decoding the structure of the WWW: facts versus sampling biases

    Authors: M. Angeles Serrano, Ana Maguitman, Marian Boguna, Santo Fortunato, Alessandro Vespignani

    Abstract: The understanding of the immense and intricate topological structure of the World Wide Web (WWW) is a major scientific and technological challenge. This has been tackled recently by characterizing the properties of its representative graphs in which vertices and directed edges are identified with web-pages and hyperlinks, respectively. Data gathered in large scale crawls have been analyzed by se… ▽ More

    Submitted 14 February, 2006; v1 submitted 8 November, 2005; originally announced November 2005.

    Comments: 10 pages 19 figures. Values in Table 2 and Figure 1 corrected. Figure 7 updated. Minor changes in the text

    ACM Class: H.4.m; G.3

    Journal ref: ACM Transactions on the Web (TWEB) 1, 10 (2007)