Research Article Open Access

A CNN-KNN Based Recognition of Online Handwritten Symbols within Physics Expressions Using Contour-Based Bounding Box (CBBS) Segmentation Technique

Ujwala Kolte1, Sachin Naik1 and Vidya Kumbhar2
  • 1 Symbiosis Institute of Computer Studies and Research, Symbiosis International (Deemed University), Pune, India
  • 2 Symbiosis Institute of Geoinformatics, Symbiosis International (Deemed University), Pune, India

Abstract

The task of recognizing symbols poses a significant challenge owing to the wide variability in human handwriting. Complexity in terms of the structural representation of symbols used in physics expressions is a major challenge in the recognition process The emergence of online handwriting, fueled by the widespread adoption of handheld digital devices, particularly in educational contexts, highlights the critical importance of precise symbol recognition, especially in the teaching and learning process. In contemporary literature, there is a notable emphasis on LaTex sequencing, symbol recognition and parsing. However, deep learning continues to yield promising results in this domain. The convenience of user input provides benefits to e-learning applications. In this study, we propose three approaches for the recognition of physics symbols within physics expressions (1) A proposed Java user interface for taking input from the user, as convenience of user input provides benefits to e-learning applications. (2) Contour-based bounding box segmentation algorithm, which deals with broken symbols within physics expressions. (3) For recognition, we propose a Convolution Neural Network-K-Nearest Neighbor (CNN-KNN) recognition model, as CNN plays an important role in extracting features, which are further provided as input to the K-NN classifier using the dropout method. Combining these three approaches into a symbol recognition model provides state-of-arts results. Handwritten physics symbols were collected from 20 different writers and each writer has written 5 types of physics expressions under different categories like electric flux, Maxwell’s equations, inductance and pointing vector and moment of Interia. There were 25 classes identified from the 780 samples collected from the users. The recognition rate is identified using (1) Using CNN model, which shows an accuracy of 91.48 and (2) Using the proposed hybrid CNN-KNN model the accuracy reported is 98.06.

Journal of Computer Science
Volume 20 No. 7, 2024, 783-792

DOI: https://doi.org/10.3844/jcssp.2024.783.792

Submitted On: 21 December 2023 Published On: 7 May 2024

How to Cite: Kolte, U., Naik, S. & Kumbhar, V. (2024). A CNN-KNN Based Recognition of Online Handwritten Symbols within Physics Expressions Using Contour-Based Bounding Box (CBBS) Segmentation Technique. Journal of Computer Science, 20(7), 783-792. https://doi.org/10.3844/jcssp.2024.783.792

  • 459 Views
  • 307 Downloads
  • 0 Citations

Download

Keywords

  • CNN
  • Contour Based Bounding Box Segmentation (CBBS)
  • K-NN
  • Physics Expression
  • Symbol Recognition