Showing 1–3 of 3 results for author: Torvik, V I

  1. arXiv:2301.04770  [pdf, other]

    cs.CL cs.DB cs.LG

    KAER: A Knowledge Augmented Pre-Trained Language Model for Entity Resolution

    Authors: Liri Fang, Lan Li, Yiren Liu, Vetle I. Torvik, Bertram Ludäscher

    Abstract: Entity resolution has been an essential and well-studied task in data cleaning research for decades. Existing work has discussed the feasibility of utilizing pre-trained language models to perform entity resolution and achieved promising results. However, few works have discussed injecting domain knowledge to improve the performance of pre-trained language models on entity resolution tasks. In thi…

    Submitted 11 January, 2023; originally announced January 2023.

  2. arXiv:2005.04308  [pdf]

    cs.DL

    Building a PubMed knowledge graph

    Authors: Jian Xu, Sunkyu Kim, Min Song, Minbyul Jeong, Donghyeon Kim, Jaewoo Kang, Justin F. Rousseau, Xin Li, Weijia Xu, Vetle I. Torvik, Yi Bu, Chongyan Chen, Islam Akef Ebeid, Daifeng Li, Ying Ding

    Abstract: PubMed is an essential resource for the medical domain, but useful concepts are either difficult to extract or are ambiguated, which has significantly hindered knowledge discovery. To address this issue, we constructed a PubMed knowledge graph (PKG) by extracting bio-entities from 29 million PubMed abstracts, disambiguating author names, integrating funding data through the National Institutes of…

    Submitted 15 May, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: 19 pages, 5 figures, 14 tables

  3. Geographical Distribution of Biomedical Research in the USA and China

    Authors: Yingjun Guan, Jing Du, Vetle I. Torvik

    Abstract: We analyze nearly 20 million geocoded PubMed articles with author affiliations. Using K-means clustering for the lower 48 US states and mainland China, we find that the average published paper is within a relatively short distance of a few centroids. These centroids have shifted very little over the past 30 years, and the distribution of distances to these centroids has not changed much either. Th…

    Submitted 11 July, 2019; originally announced July 2019.