ArabiaNer: A System to Extract Named Entities from Arabic Content
Publication Type
Conference Paper
Authors

The extraction of named entities from unstructured text is a crucial component in numerous Natural LanguageProcessing (NLP) applications such as information retrieval, question answering, machine translation, to namebut a few. Named-entity Recognition (NER) aims at locating proper nouns from unstructured text and clas-sifying them into a predefined set of types, such as persons, locations, and organizations. There has beenextensive research on improving the accuracy of NER in English text. For other languages such as Arabic,extracting Named-entities is quite challenging due to its morphological structure. In this paper, we introduceArabiaNer, a system employing Conditional Random Field (CRF) learning algorithm with extensive featureengineering steps to effectively extract Arabic named Entities.ArabiaNerproduced state-of-the-art resultswith f1-score of 91.31% when applied on the ANERcrop dataset.

Conference
Conference Title
The 13th International Conference on Agents and Artificial Intelligence (ICAART)
Conference Country
Portugal
Conference Date
Feb. 2, 2021 - Feb. 4, 2021
Conference Sponsor
https://portal.insticc.org/
Additional Info
Conference Website