-
Luminoso at SemEval-2018 Task 10: Distinguishing Attributes Using Text Corpora and Relational Knowledge
Abstract: Luminoso participated in the SemEval 2018 task on "Capturing Discriminative Attributes" with a system based on ConceptNet, an open knowledge graph focused on general knowledge. In this paper, we describe how we trained a linear classifier on a small number of semantically-informed features to achieve an $F_1$ score of 0.7368 on the task, close to the task's high score of 0.75.
Submitted 11 December, 2018; v1 submitted 5 June, 2018; originally announced June 2018.
Comments: SemEval 2018, 5 pages
Journal ref: Proceedings of The 12th International Workshop on Semantic Evaluation (2018), p. 985-989
-
arXiv:1704.03560 [pdf, ps, other]
ConceptNet at SemEval-2017 Task 2: Extending Word Embeddings with Multilingual Relational Knowledge
Abstract: This paper describes Luminoso's participation in SemEval 2017 Task 2, "Multilingual and Cross-lingual Semantic Word Similarity", with a system based on ConceptNet. ConceptNet is an open, multilingual knowledge graph that focuses on general knowledge that relates the meanings of words and phrases. Our submission to SemEval was an update of previous work that builds high-quality, multilingual word e… ▽ More
Submitted 11 December, 2018; v1 submitted 11 April, 2017; originally announced April 2017.
Comments: 5 pages, accepted to the SemEval workshop at ACL 2017
ACM Class: I.2.7
Journal ref: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), p. 85-89
-
ConceptNet 5.5: An Open Multilingual Graph of General Knowledge
Abstract: Machine learning about language can be improved by supplying it with specific knowledge and sources of external information. We present here a new version of the linked open data resource ConceptNet that is particularly well suited to be used with modern NLP techniques such as word embeddings. ConceptNet is a knowledge graph that connects words and phrases of natural language with labeled edges.… ▽ More
Submitted 11 December, 2018; v1 submitted 12 December, 2016; originally announced December 2016.
ACM Class: I.2.7
Journal ref: AAAI 31 (2017) 4444-4451
-
An Ensemble Method to Produce High-Quality Word Embeddings (2016)
Abstract: A currently successful approach to computational semantics is to represent words as embeddings in a machine-learned vector space. We present an ensemble method that combines embeddings produced by GloVe (Pennington et al., 2014) and word2vec (Mikolov et al., 2013) with structured knowledge from the semantic networks ConceptNet (Speer and Havasi, 2012) and PPDB (Ganitkevitch et al., 2013), merging… ▽ More
Submitted 19 December, 2019; v1 submitted 6 April, 2016; originally announced April 2016.
Comments: Corrected author name, revised reproducibility instructions that didn't work anymore. 12 pages, 3 figures
MSC Class: I.2.7 ACM Class: I.2.7