Showing 1–7 of 7 results for author: Nováček, V

  1. Unsupervised extraction, labelling and clustering of segments from clinical notes

    Authors: Petr Zelina, Jana Halámková, Vít Nováček

    Abstract: This work is motivated by the scarcity of tools for accurate, unsupervised information extraction from unstructured clinical notes in computationally underrepresented languages, such as Czech. We introduce a stepping stone to a broad array of downstream tasks such as summarisation or integration of individual patient records, extraction of structured information for national cancer registry report…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: To be published at the IEEE BIBM 2022 conference

    Journal ref: IEEE BIBM; 2022; pages 1362-1368

  2. arXiv:2211.09856  [pdf, other]

    cs.LG q-bio.QM

    Machine Learning-Assisted Recurrence Prediction for Early-Stage Non-Small-Cell Lung Cancer Patients

    Authors: Adrianna Janik, Maria Torrente, Luca Costabello, Virginia Calvo, Brian Walsh, Carlos Camps, Sameh K. Mohamed, Ana L. Ortega, Vít Nováček, Bartomeu Massutí, Pasquale Minervini, M. Rosario Garcia Campelo, Edel del Barco, Joaquim Bosch-Barrera, Ernestina Menasalvas, Mohan Timilsina, Mariano Provencio

    Abstract: Background: Stratifying cancer patients according to risk of relapse can personalize their care. In this work, we provide an answer to the following research question: How to utilize machine learning to estimate probability of relapse in early-stage non-small-cell lung cancer patients? Methods: For predicting relapse in 1,387 early-stage (I-II), non-small-cell lung cancer (NSCLC) patients from t…

    Submitted 17 November, 2022; originally announced November 2022.

  3. arXiv:1809.07685  [pdf, ps, other]

    cs.SI

    Finding Explanations of Entity Relatedness in Graphs: A Survey

    Authors: Raoul Biagioni, Pierre-Yves Vandenbussche, Vit Novacek

    Abstract: Analysing and explaining relationships between entities in a graph is a fundamental problem associated with many practical applications. For example, a graph of biological pathways can be used for discovering a previously unknown relationship between two proteins. Domain experts, however, may be reluctant to trust such a discovery without a detailed explanation as to why exactly the two proteins a…

    Submitted 9 August, 2018; originally announced September 2018.

    Comments: 10 pages, 9 Equations, Survey Paper

  4. arXiv:1503.09137  [pdf, other]

    cs.AI

    Formalising Hypothesis Virtues in Knowledge Graphs: A General Theoretical Framework and its Validation in Literature-Based Discovery Experiments

    Authors: Vit Novacek

    Abstract: We introduce an approach to discovery informatics that uses so-called knowledge graphs as the essential representation structure. Knowledge graph is an umbrella term that subsumes various approaches to tractable representation of large volumes of loosely structured knowledge in a graph form. It has been used primarily in the Web and Linked Open Data contexts, but is applicable to any other area de…

    Submitted 28 April, 2015; v1 submitted 31 March, 2015; originally announced March 2015.

    Comments: Pre-print of an article submitted to Artificial Intelligence Journal (after the manuscript has been refused by the editors of Journal of Web Semantics before the peer review process due to being out of scope for that journal)

  5. arXiv:1406.1061  [pdf, other]

    cs.AI cs.SI

    A Methodology for Empirical Analysis of LOD Datasets

    Authors: Vit Novacek

    Abstract: CoCoE stands for Complexity, Coherence and Entropy, and presents an extensible methodology for empirical analysis of Linked Open Data (i.e., RDF graphs). CoCoE can offer answers to questions like: Is dataset A better than B for knowledge discovery since it is more complex and informative?, Is dataset X better than Y for simple value lookups due to its flatter structure?, etc. In order to address such…

    Submitted 4 June, 2014; originally announced June 2014.

    Comments: A current working draft of the paper submitted to the ISWC'14 conference (track information available here: http://iswc2014.semanticweb.org/call-replication-benchmark-data-software-papers)

  6. arXiv:1304.6473  [pdf, other]

    cs.CY cs.DB cs.DL

    Technical report: Linking the scientific and clinical data with KI2NA-LHC

    Authors: Vit Novacek, Aisha Naseer

    Abstract: We introduce a use case and propose a system for data and knowledge integration in life sciences. In particular, we focus on linking clinical resources (electronic patient records) with scientific documents and data (research articles, biomedical ontologies and databases). Our motivation is two-fold. Firstly, we aim to instantly provide scientific context of particular patient cases for clinicians…

    Submitted 23 April, 2013; originally announced April 2013.

    Comments: A longer version of a paper originally published at the IEEE conference on Computer-Based Medical Systems (CBMS'13), under the name: Linking the Scientific and Clinical Data with KI2NA-LHC - An Outline (authors are the same)

  7. arXiv:1210.3241  [pdf, ps, other]

    cs.AI cs.IR

    Distributional Framework for Emergent Knowledge Acquisition and its Application to Automated Document Annotation

    Authors: Vit Novacek

    Abstract: The paper introduces a framework for representation and acquisition of knowledge emerging from large samples of textual data. We utilise a tensor-based, distributional representation of simple statements extracted from text, and show how one can use the representation to infer emergent knowledge patterns from the textual data in an unsupervised manner. Examples of the patterns we investigate in th…

    Submitted 11 October, 2012; originally announced October 2012.

    ACM Class: I.2.6; I.2.7; H.2.8