
Showing 1–11 of 11 results for author: Neto, J P

Searching in archive cs.
  1. arXiv:1905.13516  [pdf, other]

    cs.AI

    Foundations of Digital Archæoludology

    Authors: Cameron Browne, Dennis J. N. J. Soemers, Éric Piette, Matthew Stephenson, Michael Conrad, Walter Crist, Thierry Depaulis, Eddie Duggan, Fred Horn, Steven Kelk, Simon M. Lucas, João Pedro Neto, David Parlett, Abdallah Saffidine, Ulrich Schädler, Jorge Nuno Silva, Alex de Voogt, Mark H. M. Winands

    Abstract: Digital Archaeoludology (DAL) is a new field of study involving the analysis and reconstruction of ancient games from incomplete descriptions and archaeological evidence using modern computational techniques. The aim is to provide digital tools and methods to help game historians and other researchers better understand traditional games, their development throughout recorded human history, and the…

    Submitted 31 May, 2019; originally announced May 2019.

    Comments: Report on Dagstuhl Research Meeting. Authored/edited by all participants. Appendices by Thierry Depaulis

  2. arXiv:1508.01420  [pdf, other]

    cs.IR cs.CL cs.CR

    Privacy-Preserving Multi-Document Summarization

    Authors: Luís Marujo, José Portêlo, Wang Ling, David Martins de Matos, João P. Neto, Anatole Gershman, Jaime Carbonell, Isabel Trancoso, Bhiksha Raj

    Abstract: State-of-the-art extractive multi-document summarization systems are usually designed without any concern about privacy issues, meaning that all documents are open to third parties. In this paper we propose a privacy-preserving approach to multi-document summarization. Our approach enables other parties to obtain summaries without learning anything else about the original documents' content. We us…

    Submitted 6 August, 2015; originally announced August 2015.

    Comments: 4 pages, In Proceedings of 2nd ACM SIGIR Workshop on Privacy-Preserving Information Retrieval, August 2015. arXiv admin note: text overlap with arXiv:1407.5416

    ACM Class: H.3; I.2.7; K.4.1

  3. arXiv:1507.02907  [pdf, other]

    cs.IR cs.CL

    Extending a Single-Document Summarizer to Multi-Document: a Hierarchical Approach

    Authors: Luís Marujo, Ricardo Ribeiro, David Martins de Matos, João P. Neto, Anatole Gershman, Jaime Carbonell

    Abstract: The increasing amount of online content motivated the development of multi-document summarization methods. In this work, we explore straightforward approaches to extend single-document summarization methods to multi-document summarization. The proposed methods are based on the hierarchical combination of single-document summaries and achieve state-of-the-art results.

    Submitted 10 July, 2015; originally announced July 2015.

    Comments: 6 pages, Please cite: Proceedings of *SEM: the 4th Joint Conference on Lexical and Computational Semantics (bibtex: http://aclweb.org/anthology/S/S15/S15-1020.bib)

  4. arXiv:1407.5416  [pdf, other]

    cs.IR cs.CR

    Privacy-Preserving Important Passage Retrieval

    Authors: Luis Marujo, José Portêlo, David Martins de Matos, João P. Neto, Anatole Gershman, Jaime Carbonell, Isabel Trancoso, Bhiksha Raj

    Abstract: State-of-the-art important passage retrieval methods obtain very good results, but do not take into account privacy issues. In this paper, we present a privacy-preserving method that relies on creating secure representations of documents. Our approach allows third parties to retrieve important passages from documents without learning anything regarding their content. We use a hashing scheme kn…

    Submitted 21 July, 2014; originally announced July 2014.

    Comments: Secure Passage Retrieval, Important Passage Retrieval, KP-Centrality, Secure Binary Embeddings, Data Privacy, Automatic Key Phrase Extraction, Proceedings of SIGIR 2014 Workshop Privacy-Preserving IR: When Information Retrieval Meets Privacy and Security

    ACM Class: H.3; I.2.7; K.4.1; D.2.8

  5. arXiv:1403.6023  [pdf]

    cs.CL cs.LG

    Ensemble Detection of Single & Multiple Events at Sentence-Level

    Authors: Luís Marujo, Anatole Gershman, Jaime Carbonell, João P. Neto, David Martins de Matos

    Abstract: Event classification at sentence level is an important Information Extraction task with applications in several NLP, IR, and personalization systems. Multi-label binary relevance (BR) methods are the state of the art. In this work, we explored new multi-label methods known for capturing relations between event types. These new methods, such as the ensemble Chain of Classifiers, improve the F1 on avera…

    Submitted 24 March, 2014; originally announced March 2014.

    Comments: Preliminary version of the paper

  6. arXiv:1312.6597  [pdf, other]

    cs.LG cs.IR

    Co-Multistage of Multiple Classifiers for Imbalanced Multiclass Learning

    Authors: Luis Marujo, Anatole Gershman, Jaime Carbonell, David Martins de Matos, João P. Neto

    Abstract: In this work, we propose two stochastic architectural models (CMC and CMC-M) with two layers of classifiers applicable to datasets with one and multiple skewed classes. This distinction becomes important when the datasets have a large number of classes. Therefore, we present a novel solution to imbalanced multiclass learning with several skewed majority classes, which improves minority classes ide…

    Submitted 24 January, 2014; v1 submitted 23 December, 2013; originally announced December 2013.

    Comments: Preliminary version of the paper

  7. arXiv:1306.4908  [pdf]

    cs.CL cs.IR

    Recognition of Named-Event Passages in News Articles

    Authors: Luis Marujo, Wang Ling, Anatole Gershman, Jaime Carbonell, João P. Neto, David Matos

    Abstract: We extend the concept of Named Entities to Named Events - commonly occurring events such as battles and earthquakes. We propose a method for finding specific passages in news articles that contain information about such events and report our preliminary evaluation results. Collecting "Gold Standard" data presents many problems, both practical and conceptual. We present a method for obtaining such…

    Submitted 20 June, 2013; originally announced June 2013.

    Comments: In 25th International Conference on Computational Linguistics (COLING 2012)

  8. arXiv:1306.4890  [pdf, other]

    cs.CL cs.IR

    Key Phrase Extraction of Lightly Filtered Broadcast News

    Authors: Luis Marujo, Ricardo Ribeiro, David Martins de Matos, João P. Neto, Anatole Gershman, Jaime Carbonell

    Abstract: This paper explores the impact of light filtering on automatic key phrase extraction (AKE) applied to Broadcast News (BN). Key phrases are words and expressions that best characterize the content of a document. Key phrases are often used to index the document or as features in further processing. This makes improvements in AKE accuracy particularly important. We hypothesized that filtering out mar…

    Submitted 20 June, 2013; originally announced June 2013.

    Comments: In 15th International Conference on Text, Speech and Dialogue (TSD 2012)

  9. arXiv:1306.4886  [pdf]

    cs.CL cs.IR

    Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization

    Authors: Luis Marujo, Anatole Gershman, Jaime Carbonell, Robert Frederking, João P. Neto

    Abstract: Fast and effective automated indexing is critical for search and personalized services. Key phrases that consist of one or more words and represent the main concepts of the document are often used for the purpose of indexing. In this paper, we investigate the use of additional semantic features and pre-processing steps to improve automatic key phrase extraction. These features include the use of s…

    Submitted 20 June, 2013; originally announced June 2013.

    Comments: In 8th International Conference on Language Resources and Evaluation (LREC 2012)

  10. arXiv:1306.4608  [pdf]

    cs.IR

    Hourly Traffic Prediction of News Stories

    Authors: Luis Marujo, Miguel Bugalho, João Paulo da Silva Neto, Anatole Gershman, Jaime Carbonell

    Abstract: Predicting the popularity of news stories from several news sources has become a challenge of great importance for both news producers and readers. In this paper, we investigate methods for automatically predicting the number of clicks on a news story during one hour. Our approach is a combination of additive regression and bagging applied over an M5P regression tree using a logarithmic sca…

    Submitted 19 June, 2013; originally announced June 2013.

    Comments: In 3rd International Workshop on Context-Aware Recommender Systems held as part of the 5th ACM RecSys Conference 2011

    ACM Class: H.3.3

  11. arXiv:1306.4606  [pdf, other]

    cs.IR

    Keyphrase Cloud Generation of Broadcast News

    Authors: Luis Marujo, Márcio Viveiros, João Paulo da Silva Neto

    Abstract: This paper describes an enhanced automatic keyphrase extraction method applied to Broadcast News. The keyphrase extraction process is used to create a concept level for each news item, on top of the words resulting from a speech recognition system output and news indexation, and it contributes to the generation of a tag/keyphrase cloud of the top news included in a Multimedia Monitoring Solution system for…

    Submitted 19 June, 2013; originally announced June 2013.

    Comments: In Proceedings of Interspeech 2011: 12th Annual Conference of the International Speech Communication Association