Home > Articles > All Issues > 2020 > Volume 9, No. 2, February 2020 >

Using Conditional Random Field in Named Entity Recognition for Crime Location Identification

Quintin Jackson Goraseb and Nathar Shah
Faculty of Computing and Informatics, Multimedia University, Cyberjaya, Malaysia

Abstract— Electronic data or information comes in different forms, some are structured data and others unstructured data. The act of collecting such data is known as data mining. This paper will discuss the mining of crime data from electronic news sources in Malaysia, and how this data is further transformed to extract meaningful information from it. Furthermore, the paper will demonstrate how crime locations can be identified within the various news articles. This is significant because there are cases where a location name is mentioned in the news article but that is not the true crime location. To help achieve this, the system makes use of Named Entity Recognition (NER) algorithms. They are task with identifying locations in various sentences. To bring more accuracy to the work, the system will employ machine learning technique known as Conditional Random Field (CRF) to recognize if a sentence is referring to a crime location. 

Index Terms—data mining, machine learning, text extraction

Cite: Quintin Jackson Goraseb and Nathar Shah, "Using Conditional Random Field in Named Entity Recognition for Crime Location Identification" International Journal of Mechanical Engineering and Robotics Research, Vol. 9, No. 2, pp. 252-257, February 2020. DOI: 10.18178/ijmerr.9.2.252-257

Copyright © 2020 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.