Research Article Open Access

Arabic Personal Name Matching: Names Written using Latin Alphabet

Attia Nehar1, Slimane Bellaouar2, Djelloul Ziadi3 and Khaled Moulay Omar4
  • 1 Ziane Achour University - Djelfa, Algeria
  • 2 Universite de Laghouat, Algeria
  • 3 Groupe de Recherche Rouennais en Informatique Fondamentale (GR2IF), Algeria
  • 4 Universite de Ghardaia, Algeria

Abstract

Abstract: In many Arab countries’ public administrations, Arabic personal names are written with Latin alphabet, generally, in various ways by different writers. This has led to many problems when it comes to connecting these administrations. The aim of this study was to propose two new approaches for the pairwise matching of Arabic personal names. The first approach is based on string alignment and phonetic transcription. Appropriate scoring functions were defined to catch similarity between Arabic personal names. In the second approach, we use machine learning techniques to derive a suitable model for this problem. Precisely, we suggest using a Multi-Layer Perceptron (MLP) architecture and experiment with different configurations. Performances of the new models compare well with the best-performing similarity measures (Jaro, Jaro-Winkler, Double Metaphone and Edit Distance) in terms of precision, recall and F1. Even though the work was carried out for the (Algeria/French Alphabet) case, it can be adapted to any other (country/script) case, like (Egypt/English).

Journal of Computer Science
Volume 17 No. 9, 2021, 776-788

DOI: https://doi.org/10.3844/jcssp.2021.776.788

Submitted On: 11 May 2021 Published On: 29 September 2021

How to Cite: Nehar, A., Bellaouar, S., Ziadi, D. & Omar, K. M. (2021). Arabic Personal Name Matching: Names Written using Latin Alphabet. Journal of Computer Science, 17(9), 776-788. https://doi.org/10.3844/jcssp.2021.776.788

  • 2,212 Views
  • 1,008 Downloads
  • 0 Citations

Download

Keywords

  • Personal Name Matching
  • Phonetic Transcription
  • Phonetic Encoding
  • Sequence Alignment
  • Machine Learning