
Showing 1–16 of 16 results for author: Herrera, S

Searching in archive cs.
  1. arXiv:2405.10004  [pdf, other]

    eess.IV cs.CV cs.LG

    ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset

    Authors: Johannes Rückert, Louise Bloch, Raphael Brüngel, Ahmad Idrissi-Yaghir, Henning Schäfer, Cynthia S. Schmidt, Sven Koitka, Obioma Pelka, Asma Ben Abacha, Alba G. Seco de Herrera, Henning Müller, Peter A. Horn, Felix Nensa, Christoph M. Friedrich

    Abstract: Automated medical image analysis systems often require large amounts of training data with high quality labels, which are difficult and time consuming to generate. This paper introduces Radiology Object in COntext version 2 (ROCOv2), a multimodal dataset consisting of radiological images and associated medical concepts and captions extracted from the PMC Open Access subset. It is an updated versio…

    Submitted 18 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted for Scientific Data

  2. arXiv:2404.19100  [pdf, other]

    cs.SE cs.AI cs.CY cs.LG

    Predicting Fairness of ML Software Configurations

    Authors: Salvador Robles Herrera, Verya Monjezi, Vladik Kreinovich, Ashutosh Trivedi, Saeid Tizpaz-Niari

    Abstract: This paper investigates the relationships between hyperparameters of machine learning and fairness. Data-driven solutions are increasingly used in critical socio-technical applications where ensuring fairness is important. Rather than explicitly encoding decision logic via control and data structures, the ML developers provide input data, perform some pre-processing, choose ML algorithms, and tune…

    Submitted 1 July, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: To Appear in the 20th International Conference on Predictive Models and Data Analytics in Software Engineering (PROMISE'24)

  3. arXiv:2403.17748  [pdf, other]

    cs.CL

    UCxn: Typologically Informed Annotation of Constructions Atop Universal Dependencies

    Authors: Leonie Weissweiler, Nina Böbel, Kirian Guiller, Santiago Herrera, Wesley Scivetti, Arthur Lorenzi, Nurit Melnik, Archna Bhatia, Hinrich Schütze, Lori Levin, Amir Zeldes, Joakim Nivre, William Croft, Nathan Schneider

    Abstract: The Universal Dependencies (UD) project has created an invaluable collection of treebanks with contributions in over 140 languages. However, the UD annotations do not tell the full story. Grammatical constructions that convey meaning through a particular combination of several morphosyntactic elements -- for example, interrogative sentences with special markers and/or word orders -- are not labele…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  4. arXiv:2403.17534  [pdf, other]

    cs.CL

    Sparse Logistic Regression with High-order Features for Automatic Grammar Rule Extraction from Treebanks

    Authors: Santiago Herrera, Caio Corro, Sylvain Kahane

    Abstract: Descriptive grammars are highly valuable, but writing them is time-consuming and difficult. Furthermore, while linguists typically use corpora to create them, grammar descriptions often lack quantitative data. As for formal grammars, they can be challenging to interpret. In this paper, we propose a new method to extract and explore significant fine-grained grammar patterns and potential syntactic…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Published in LREC-Coling 2024 proceedings

  5. arXiv:2307.04427  [pdf, other]

    astro-ph.HE astro-ph.GA cs.LG

    Observation of high-energy neutrinos from the Galactic plane

    Authors: R. Abbasi, M. Ackermann, J. Adams, J. A. Aguilar, M. Ahlers, M. Ahrens, J. M. Alameddine, A. A. Alves Jr., N. M. Amin, K. Andeen, T. Anderson, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, S. Axani, X. Bai, A. Balagopal V., S. W. Barwick, V. Basu, S. Baur, R. Bay, J. J. Beatty, K. -H. Becker, J. Becker Tjus, et al. (364 additional authors not shown)

    Abstract: The origin of high-energy cosmic rays, atomic nuclei that continuously impact Earth's atmosphere, has been a mystery for over a century. Due to deflection in interstellar magnetic fields, cosmic rays from the Milky Way arrive at Earth from random directions. However, near their sources and during propagation, cosmic rays interact with matter and produce high-energy neutrinos. We search for neutrin…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Submitted on May 12th, 2022; Accepted on May 4th, 2023

    Journal ref: Science 380, 6652, 1338-1343 (2023)

  6. arXiv:2301.04339  [pdf, other]

    cs.CL cs.IR

    Topics in Contextualised Attention Embeddings

    Authors: Mozhgan Talebpour, Alba Garcia Seco de Herrera, Shoaib Jameel

    Abstract: Contextualised word vectors obtained via pre-trained language models encode a variety of knowledge that has already been exploited in applications. Complementary to these language models are probabilistic topic models that learn thematic patterns from the text. Recent work has demonstrated that conducting clustering on the word-level contextual representations from a language model emulates word c…

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Accepted at the 45th European Conference on Information Retrieval (ECIR) 2023

  7. arXiv:2212.06516  [pdf, other]

    cs.CV cs.AI cs.MM

    Overview of The MediaEval 2022 Predicting Video Memorability Task

    Authors: Lorin Sweeney, Mihai Gabriel Constantin, Claire-Hélène Demarty, Camilo Fosco, Alba G. Seco de Herrera, Sebastian Halder, Graham Healy, Bogdan Ionescu, Ana Matran-Fernandez, Alan F. Smeaton, Mushfika Sultana

    Abstract: This paper describes the 5th edition of the Predicting Video Memorability Task as part of MediaEval2022. This year we have reorganised and simplified the task in order to lubricate a greater depth of inquiry. Similar to last year, two datasets are provided in order to facilitate generalisation, however, this year we have replaced the TRECVid2019 Video-to-Text dataset with the VideoMem dataset in o…

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: 6 pages. In: MediaEval Multimedia Benchmark Workshop Working Notes, 2022

  8. arXiv:2209.03042  [pdf, other]

    hep-ex astro-ph.IM cs.LG physics.data-an physics.ins-det

    Graph Neural Networks for Low-Energy Event Classification & Reconstruction in IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, N. Aggarwal, J. A. Aguilar, M. Ahlers, M. Ahrens, J. M. Alameddine, A. A. Alves Jr., N. M. Amin, K. Andeen, T. Anderson, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, S. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, V. Basu, R. Bay, J. J. Beatty, K. -H. Becker, et al. (359 additional authors not shown)

    Abstract: IceCube, a cubic-kilometer array of optical sensors built to detect atmospheric and astrophysical neutrinos between 1 GeV and 1 PeV, is deployed 1.45 km to 2.45 km below the surface of the ice sheet at the South Pole. The classification and reconstruction of events from the in-ice detectors play a central role in the analysis of data from IceCube. Reconstructing and classifying events is a challen…

    Submitted 11 October, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: Prepared for submission to JINST

  9. arXiv:2201.00620  [pdf, other]

    q-bio.NC cs.HC cs.LG eess.SP

    Overview of the EEG Pilot Subtask at MediaEval 2021: Predicting Media Memorability

    Authors: Lorin Sweeney, Ana Matran-Fernandez, Sebastian Halder, Alba G. Seco de Herrera, Alan Smeaton, Graham Healy

    Abstract: The aim of the Memorability-EEG pilot subtask at MediaEval'2021 is to promote interest in the use of neural signals -- either alone or in combination with other data sources -- in the context of predicting video memorability by highlighting the utility of EEG data. The dataset created consists of pre-extracted features from EEG recordings of subjects while watching a subset of videos from Predicti…

    Submitted 15 December, 2021; originally announced January 2022.

    Comments: 3 pages

  10. arXiv:2112.05982  [pdf, ps, other]

    cs.CV cs.AI cs.MM

    Overview of The MediaEval 2021 Predicting Media Memorability Task

    Authors: Rukiye Savran Kiziltepe, Mihai Gabriel Constantin, Claire-Helene Demarty, Graham Healy, Camilo Fosco, Alba Garcia Seco de Herrera, Sebastian Halder, Bogdan Ionescu, Ana Matran-Fernandez, Alan F. Smeaton, Lorin Sweeney

    Abstract: This paper describes the MediaEval 2021 Predicting Media Memorability task, which is in its 4th edition this year, as the prediction of short-term and long-term video memorability remains a challenging task. In 2021, two datasets of videos are used: first, a subset of the TRECVid 2019 Video-to-Text dataset; second, the Memento10K dataset in order to provide opportunities to explore cross-dataset g…

    Submitted 11 December, 2021; originally announced December 2021.

    Comments: 3 pages, to appear in Proceedings of MediaEval 2021, December 13-15 2021, Online

  11. An Annotated Video Dataset for Computing Video Memorability

    Authors: Rukiye Savran Kiziltepe, Lorin Sweeney, Mihai Gabriel Constantin, Faiyaz Doctor, Alba Garcia Seco de Herrera, Claire-Helene Demarty, Graham Healy, Bogdan Ionescu, Alan F. Smeaton

    Abstract: Using a collection of publicly available links to short form video clips of an average of 6 seconds duration each, 1,275 users manually annotated each video multiple times to indicate both long-term and short-term memorability of the videos. The annotations were gathered as part of an online memory game and measured a participant's ability to recall having seen the video previously when shown a co…

    Submitted 4 December, 2021; originally announced December 2021.

    Comments: 11 pages

    Journal ref: Data in Brief, Volume 39, 107671, (2021), ISSN 2352-3409

  12. A Convolutional Neural Network based Cascade Reconstruction for the IceCube Neutrino Observatory

    Authors: R. Abbasi, M. Ackermann, J. Adams, J. A. Aguilar, M. Ahlers, M. Ahrens, C. Alispach, A. A. Alves Jr., N. M. Amin, R. An, K. Andeen, T. Anderson, I. Ansseau, G. Anton, C. Argüelles, S. Axani, X. Bai, A. Balagopal V., A. Barbano, S. W. Barwick, B. Bastian, V. Basu, V. Baum, S. Baur, R. Bay, et al. (343 additional authors not shown)

    Abstract: Continued improvements on existing reconstruction methods are vital to the success of high-energy physics experiments, such as the IceCube Neutrino Observatory. In IceCube, further challenges arise as the detector is situated at the geographic South Pole where computational resources are limited. However, to perform real-time analyses and to issue alerts to telescopes around the world, powerful an…

    Submitted 26 July, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: 39 pages, 15 figures, submitted to Journal of Instrumentation; added references

    Journal ref: JINST 16 (2021) P07041

  13. arXiv:2012.15650  [pdf, other]

    cs.MM cs.AI cs.CV

    Overview of MediaEval 2020 Predicting Media Memorability Task: What Makes a Video Memorable?

    Authors: Alba García Seco De Herrera, Rukiye Savran Kiziltepe, Jon Chamberlain, Mihai Gabriel Constantin, Claire-Hélène Demarty, Faiyaz Doctor, Bogdan Ionescu, Alan F. Smeaton

    Abstract: This paper describes the MediaEval 2020 Predicting Media Memorability task. After first being proposed at MediaEval 2018, the Predicting Media Memorability task is in its 3rd edition this year, as the prediction of short-term and long-term video memorability (VM) remains a challenging task. In 2020, the format remained the same as in previous editions. This year the videos are a subset of…

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 3 pages, 1 Figure

    Journal ref: MediaEval Multimedia Benchmark Workshop Working Notes, 14-15 December 2020

  14. arXiv:2004.07454  [pdf]

    cs.SI stat.AP

    Sustainable Recipes. A Food Recipe Sourcing and Recommendation System to Minimize Food Miles

    Authors: Juan C. S. Herrera

    Abstract: Sustainable Recipes is a tool that (1) connects food recipes ingredient lists with the closest organic providers to minimize the distance that food travels from farm to food preparation site and (2) recommends recipes given a GPS coordinate to minimize food miles. Sustainable Recipes provides consumers, entrepreneurs, cooking enthusiasts, and restauranteurs in the United States and elsewhere with…

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: 9 pages, 3 figures, 1 table, 2 algorithms

    ACM Class: J.0; J.4

  15. arXiv:1901.00229  [pdf]

    cs.CE math.NA physics.comp-ph

    The Divide-and-Conquer Framework: A Suitable Setting for the DDM of the Future

    Authors: Ismael Herrera-Revilla, Iván Contreras, Graciela S. Herrera

    Abstract: This paper was prompted by numerical experiments we performed, in which algorithms already available in the literature (DVS-BDDM) yielded accelerations (or speedups) many times larger (more than seventy in some examples already treated, but probably often much larger) than the number of processors used. Based on these outstanding results, here it is shown that believing in the standard ideal speed…

    Submitted 1 January, 2019; originally announced January 2019.

    Comments: 14 pages without figures

  16. arXiv:1701.05596  [pdf, other]

    cs.IR

    The Parallel Distributed Image Search Engine (ParaDISE)

    Authors: Dimitrios Markonis, Roger Schaer, Alba García Seco de Herrera, Henning Müller

    Abstract: Image retrieval is a complex task that differs according to the context and the user requirements in any specific field, for example in a medical environment. Search by text is often not possible or optimal and retrieval by the visual content does not always succeed in modelling high-level concepts that a user is looking for. Modern image retrieval techniques consist of multiple steps and aim to r…

    Submitted 19 January, 2017; originally announced January 2017.

    Comments: 23 pages, 9 figures

    MSC Class: 68P20