webLyzard Publications

A Regional News Corpora for Contextualized Entity Discovery and Linking

Brasoveanu, Adrian M. P. and Nixon, Lyndon J.B. and Weichselbraun, Albert and Scharl, Arno (2016) A Regional News Corpora for Contextualized Entity Discovery and Linking. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), May 23-28, 2016, Portoroz, Slovenia.

[img]
Preview
PDF (A Regional News Corpora for Contextualized Entity Discovery and Linking) - Published Version
248kB

Official URL: http://www.lrec-conf.org/proceedings/lrec2016/pdf/...

Abstract

This paper presents a German corpus for Named Entity Linking (NEL) and Knowledge Base Population (KBP) tasks. We describe the annotation guideline, the annotation process, NIL clustering techniques and conversion to popular NEL formats such as NIF and TAC that have been used to construct this corpus based on news transcripts from the German regional broadcaster RBB (Rundfunk Berlin Brandenburg). Since creating such language resources requires significant effort, the paper also discusses how to derive additional evaluation resources for tasks like named entity contextualization or ontology enrichment by exploiting the links between named entities from the annotated corpus. The paper concludes with an evaluation that shows how several well-known NEL tools perform on the corpus, a discussion of the evaluation results, and with suggestions on how to keep evaluation corpora and datasets up to date.

Item Type:Conference or Workshop Item (Paper)
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
ID Code:96
Deposited By: Adrian
Deposited On:13 Jun 2016 10:47
Last Modified:25 Oct 2017 13:59

Repository Staff Only: item control page