Showing 1–11 of 11 results for author: Nielsen, F Å

Searching in archive cs.
  1. arXiv:2404.07008  [pdf, other]

    cs.LG cs.AI

    Knowledge graphs for empirical concept retrieval

    Authors: Lenka Tětková, Teresa Karen Scheidt, Maria Mandrup Fogh, Ellen Marie Gaunby Jørgensen, Finn Årup Nielsen, Lars Kai Hansen

    Abstract: Concept-based explainable AI is promising as a tool to improve the understanding of complex models at the premises of a given user, viz. as a tool for personalized explainability. An important class of concept-based explainability methods is constructed with empirically defined concepts, indirectly defined through a set of positive and negative examples, as in the TCAV approach (Kim et al., 2018)…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Preprint. Accepted to The 2nd World Conference on eXplainable Artificial Intelligence

  2. arXiv:2303.15133  [pdf, other]

    cs.DL

    Synia: Displaying data from Wikibases

    Authors: Finn Årup Nielsen

    Abstract: I present an agile method and a tool to display data from Wikidata and other Wikibase instances via SPARQL queries. The work-in-progress combines ideas from the Scholia Web application and the Listeria tool.

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: 3 pages, 2 tables, 3 figures, submitted to Wiki Workshop (10th edition)

    ACM Class: H.5.4

  3. arXiv:2005.03521  [pdf, other]

    cs.CL

    The Danish Gigaword Project

    Authors: Leon Strømberg-Derczynski, Manuel R. Ciosici, Rebekah Baglini, Morten H. Christiansen, Jacob Aarup Dalsgaard, Riccardo Fusaroli, Peter Juel Henrichsen, Rasmus Hvingelby, Andreas Kirkedal, Alex Speed Kjeldsen, Claus Ladefoged, Finn Årup Nielsen, Malte Lau Petersen, Jonathan Hvithamar Rystrøm, Daniel Varab

    Abstract: Danish language technology has been hindered by a lack of broad-coverage corpora at the scale modern NLP prefers. This paper describes the Danish Gigaword Corpus, the result of a focused effort to provide a diverse and freely-available one billion word corpus of Danish text. The Danish Gigaword corpus covers a wide array of time periods, domains, speakers' socio-economic status, and Danish dialect…

    Submitted 12 May, 2021; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: Identical to the NoDaLiDa 2021 version

  4. arXiv:1803.04349  [pdf, other]

    cs.DL cs.CL

    Linking ImageNet WordNet Synsets with Wikidata

    Authors: Finn Årup Nielsen

    Abstract: The linkage of ImageNet WordNet synsets to Wikidata items will leverage deep learning algorithm with access to a rich multilingual knowledge graph. Here I will describe our on-going efforts in linking the two resources and issues faced in matching the Wikidata and WordNet knowledge graphs. I show an example on how the linkage can be used in a deep learning setting with real-time image classificati…

    Submitted 5 March, 2018; originally announced March 2018.

    Comments: 6 pages, Wiki Workshop 2018

  5. arXiv:1710.04099  [pdf, other]

    stat.ML cs.CL cs.LG

    Wembedder: Wikidata entity embedding web service

    Authors: Finn Årup Nielsen

    Abstract: I present a web service for querying an embedding of entities in the Wikidata knowledge graph. The embedding is trained on the Wikidata dump using Gensim's Word2Vec implementation and a simple graph walk. A REST API is implemented. Together with the Wikidata API the web service exposes a multilingual resource for over 600'000 Wikidata items and properties.

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: 3 pages, 2 figures

    ACM Class: I.2.4; H.3.5

  6. arXiv:1703.04222  [pdf, other]

    cs.DL

    Scholia and scientometrics with Wikidata

    Authors: Finn Årup Nielsen, Daniel Mietchen, Egon Willighagen

    Abstract: Scholia is a tool to handle scientific bibliographic information in Wikidata. The Scholia Web service creates on-the-fly scholarly profiles for researchers, organizations, journals, publishers, individual scholarly works, and for research topics. To collect the data, it queries the SPARQL-based Wikidata Query Service. Among several display formats available in Scholia are lists of publications for…

    Submitted 13 April, 2017; v1 submitted 12 March, 2017; originally announced March 2017.

    Comments: 16 pages, 5 figures, Scientometrics 2017

    Journal ref: Joint Proceedings of the 1st International Workshop on Scientometrics and 1st International Workshop on Enabling Decentralised Scholarly Communication (2017)

  7. arXiv:1206.2742  [pdf, other]

    cs.DL cs.AI stat.AP

    Online open neuroimaging mass meta-analysis

    Authors: Finn Årup Nielsen, Matthew J. Kempton, Steven C. R. Williams

    Abstract: We describe a system for meta-analysis where a wiki stores numerical data in a simple format and a web service performs the numerical computation. We initially apply the system on multiple meta-analyses of structural neuroimaging data results. The described system allows for mass meta-analysis, e.g., meta-analysis across multiple brain regions and multiple mental disorders.

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: 5 pages, 4 figures. SePublica 2012, ESWC 2012 Workshop, 28 May 2012, Heraklion, Greece

    MSC Class: 68U35

    ACM Class: H.5.4; J.3; G.3

  8. arXiv:1103.2903  [pdf, ps, other]

    cs.IR cs.CL

    A new ANEW: Evaluation of a word list for sentiment analysis in microblogs

    Authors: Finn Årup Nielsen

    Abstract: Sentiment analysis of microblogs such as Twitter has recently gained a fair amount of attention. One of the simplest sentiment analysis approaches compares the words of a posting against a labeled word list, where each word has been scored for valence, -- a 'sentiment lexicon' or 'affective word lists'. There exist several affective word lists, e.g., ANEW (Affective Norms for English Words) develo…

    Submitted 15 March, 2011; originally announced March 2011.

    Comments: 6 pages, 4 figures, 1 table, Submitted to "Making Sense of Microposts (#MSM2011)"

    MSC Class: 68M11

    ACM Class: H.4.3; J.4

    Journal ref: Proceedings of the ESWC2011 Workshop on 'Making Sense of Microposts': Big things come in small packages (2011) 93-98

  9. arXiv:1101.0510  [pdf, ps, other]

    cs.SI cs.CL physics.soc-ph

    Good Friends, Bad News - Affect and Virality in Twitter

    Authors: Lars Kai Hansen, Adam Arvidsson, Finn Årup Nielsen, Elanor Colleoni, Michael Etter

    Abstract: The link between affect, defined as the capacity for sentimental arousal on the part of a message, and virality, defined as the probability that it be sent along, is of significant theoretical and practical importance, e.g. for viral marketing. A quantitative study of emailing of articles from the NY Times finds a strong link between positive affect and virality, and, based on psychological theori…

    Submitted 3 January, 2011; originally announced January 2011.

    Comments: 14 pages, 1 table. Submitted to The 2011 International Workshop on Social Computing, Network, and Services (SocialComNet 2011)

    MSC Class: 1D30

    ACM Class: H.4.3; J.4

  10. arXiv:0805.1154  [pdf, ps, other]

    cs.DL cs.NE

    Clustering of scientific citations in Wikipedia

    Authors: Finn Aarup Nielsen

    Abstract: The instances of templates in Wikipedia form an interesting data set of structured information. Here I focus on the cite journal template that is primarily used for citation to articles in scientific journals. These citations can be extracted and analyzed: Non-negative matrix factorization is performed on a (article x journal) matrix resulting in a soft clustering of Wikipedia articles and scien…

    Submitted 12 June, 2008; v1 submitted 8 May, 2008; originally announced May 2008.

    Comments: 7 pages; 2 figures, Wikimania 2008; Corrected typos

    ACM Class: G.1.10; G.2.3; H.2.8

  11. arXiv:0705.2106  [pdf, ps, other]

    cs.DL cs.IR

    Scientific citations in Wikipedia

    Authors: Finn Aarup Nielsen

    Abstract: The Internet-based encyclopaedia Wikipedia has grown to become one of the most visited web-sites on the Internet. However, critics have questioned the quality of entries, and an empirical study has shown Wikipedia to contain errors in a 2005 sample of science entries. Biased coverage and lack of sources are among the "Wikipedia risks". The present work describes a simple assessment of these aspe…

    Submitted 15 May, 2007; originally announced May 2007.

    Comments: 5 pages, 2 figures

    ACM Class: H.3.7; H.3.5; H.3.1

    Journal ref: First Monday, 12(8), 2007 August