Showing 1–6 of 6 results for author: Romanou, A

  1. arXiv:2311.16079

    cs.CL cs.AI cs.LG

    MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

    Authors: Zeming Chen, Alejandro Hernández Cano, Angelika Romanou, Antoine Bonnet, Kyle Matoba, Francesco Salvi, Matteo Pagliardini, Simin Fan, Andreas Köpf, Amirkeivan Mohtashami, Alexandre Sallinen, Alireza Sakhaeirad, Vinitra Swamy, Igor Krawczuk, Deniz Bayazit, Axel Marmet, Syrielle Montariol, Mary-Anne Hartley, Martin Jaggi, Antoine Bosselut

    Abstract: Large language models (LLMs) can potentially democratize access to medical knowledge. While many efforts have been made to harness and improve LLMs' medical knowledge and reasoning capacities, the resulting models are either closed-source (e.g., PaLM, GPT-4) or limited in scale (<= 13B parameters), which restricts their abilities. In this work, we improve access to large-scale medical LLMs by rele…

    Submitted 27 November, 2023; originally announced November 2023.

  2. arXiv:2311.04284

    cs.CL cs.AI

    CRAB: Assessing the Strength of Causal Relationships Between Real-world Events

    Authors: Angelika Romanou, Syrielle Montariol, Debjit Paul, Leo Laugier, Karl Aberer, Antoine Bosselut

    Abstract: Understanding narratives requires reasoning about the cause-and-effect relationships between events mentioned in the text. While existing foundation models yield impressive results in many NLP tasks requiring reasoning, it is unclear whether they understand the complexity of the underlying network of causal relationships of events in narratives. In this work, we present CRAB, a new Causal Reasonin…

    Submitted 7 November, 2023; originally announced November 2023.

  3. arXiv:2211.15334

    cs.CY cs.LG

    Beyond S-curves: Recurrent Neural Networks for Technology Forecasting

    Authors: Alexander Glavackij, Dimitri Percia David, Alain Mermoud, Angelika Romanou, Karl Aberer

    Abstract: Because of the considerable heterogeneity and complexity of the technological landscape, building accurate models to forecast is a challenging endeavor. Due to their high prevalence in many complex systems, S-curves are a popular forecasting approach in previous work. However, their forecasting performance has not been directly compared to other technology forecasting approaches. Additionally, rec…

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: 16 pages, 8 figures

  4. arXiv:2111.08546

    cs.LG cs.CL

    Interpreting Language Models Through Knowledge Graph Extraction

    Authors: Vinitra Swamy, Angelika Romanou, Martin Jaggi

    Abstract: Transformer-based language models trained on large text corpora have enjoyed immense popularity in the natural language processing community and are commonly used as a starting point for downstream tasks. While these models are undeniably useful, it is a challenge to quantify their performance beyond traditional accuracy metrics. In this paper, we compare BERT-based language models through snapsho…

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Published at NeurIPS 2021: eXplainable AI for Debugging and Diagnosis Workshop

  5. On Representation Learning for Scientific News Articles Using Heterogeneous Knowledge Graphs

    Authors: Angelika Romanou, Panayiotis Smeros, Karl Aberer

    Abstract: In the era of misinformation and information inflation, the credibility assessment of the produced news is of the essence. However, fact-checking can be challenging considering the limited references presented in the news. This challenge can be transcended by utilizing the knowledge graph that is related to the news articles. In this work, we present a methodology for creating scientific news arti…

    Submitted 12 April, 2021; originally announced April 2021.

  6. SciLens News Platform: A System for Real-Time Evaluation of News Articles

    Authors: Angelika Romanou, Panayiotis Smeros, Carlos Castillo, Karl Aberer

    Abstract: We demonstrate the SciLens News Platform, a novel system for evaluating the quality of news articles. The SciLens News Platform automatically collects contextual information about news articles in real-time and provides quality indicators about their validity and trustworthiness. These quality indicators derive from i) social media discussions regarding news articles, showcasing the reach and stan…

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: Conference demo paper, 4 pages, 5 figures

    Journal ref: Proceedings of the 46th International Conference on Very Large Data Bases, Tokyo, Japan, Aug 31-Sept 4, 2020