Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kiril Ivanov Simov

Feature-Rich Named Entity Recognition for Bulgarian Using Conditional Random Fields

Sep 26, 2021

Georgi Georgiev, Preslav Nakov, Kuzman Ganchev, Petya Osenova, Kiril Ivanov Simov

Figure 1 for Feature-Rich Named Entity Recognition for Bulgarian Using Conditional Random Fields

Figure 2 for Feature-Rich Named Entity Recognition for Bulgarian Using Conditional Random Fields

Figure 3 for Feature-Rich Named Entity Recognition for Bulgarian Using Conditional Random Fields

Abstract:The paper presents a feature-rich approach to the automatic recognition and categorization of named entities (persons, organizations, locations, and miscellaneous) in news text for Bulgarian. We combine well-established features used for other languages with language-specific lexical, syntactic and morphological information. In particular, we make use of the rich tagset annotation of the BulTreeBank (680 morpho-syntactic tags), from which we derive suitable task-specific tagsets (local and nonlocal). We further add domain-specific gazetteers and additional unlabeled data, achieving F1=89.4%, which is comparable to the state-of-the-art results for English.

* RANLP-2009
* named entity recognition, NER, conditional random fields, CRF, Bulgarian, BulTreeBank

Via

Access Paper or Ask Questions