arrangement of various approaches, algorithms, and
strategies in this work. It is envisioned that this
comprehensive assessment would give insight into
the ideas involved and, maybe, stimulate future
progress in the field. To begin, we discussed the
major challenges of OCR, followed by a detailed
discussion of the main important phases,
architecture, proposed algorithms, and techniques of
OCR. We emphasize that when designing any
application related to OCR, one must pay close
attention to each phase to achieve a highly accurate
character recognition rate. However, we are unable
to provide detailed methods for each phase since
they are dependent on datasets, application
particulars, and parameter specifications. Finally,
important OCR applications and a brief OCR history
are presented.
Although state-of-the-art OCR allows for high-
accuracy text recognition, we believe that OCR has
many more useful uses. We want to employ OCR
in the future for such practical applications for
everyday personal usage. We intend to combine
mobile devices with OCR in a single OCR solution.
Some of our upcoming OCR-based applications
include an automatic book reader and a receipt
tracker, [15], [17]. OCR is no longer just matching
or seeing, with new technology of deep learning it is
now entered into a new phase and it can recognize
the text after scan and then convert it to different
meaning in full applications[15]. With deep learning
software, it provides more robust extraction of
information and high-quality insight and it also can
be used in the robotics field and integrated for
artificial intelligence applications like Chat-GPT,
[16], [17].
References:
[1] Satti, D. A., Offline Urdu Nastaliq OCR for
Printed Text using Analytical Approach. MS
thesis report Quaid- i- Azam University:
Islamabad, Pakistan. 2013, [Online].
https://www.cle.org.pk/research/theses.htm
(Accessed Date: January 24, 2023).
[2] Mahmoud, S.A., & Al-Badr, B., Survey and
bibliography of Arabic optical text
recognition. Signal processing, 41(1), 2017,
pp 49-77.
[3] Bhansali, M., & Kumar, P, An Alternative
Method for Facilitating Cheque Clearance
Using Smart Phones Application.
International Journal of Application or
Innovation in Engineering & Management
(IJAIEM), 2(1), 2018, pp 211-217.
[4] Qadri, M.T., & Asif, M, Automatic Number
Plate Recognition System for Vehicle
Identification Using Optical Character
Recognition presented at International IEEE
Conference on Education Technology and
Computer, Singapore, 2019. Singapore:
[5] A.S. Abutaleb. Automatic thresholding of
gray-level pictures using two-dimensional
entropy. Computer Vision, Graphics, and
Image Processing, 47 (1): pp 22–32, 2014.
[6] A.M. AL-Shatnawi, K. Omar, and A.M. Zeki.
Challenges in thinning of Arabic text. In
ICGST International Conference on Artificial
Intelligence and Machine Learning (AIML-
11), Dubai. United Arab Emirates, pages 127–
133, 2016.
[7] Lazaro, J., Martín, J.L, Arias, J., Astarloa, A.,
& Cuadrado, C, Neuro semantic thresholding
using OCR software for high precision OCR
applications. Image and Vision Computing,
28(4), 2016, pp.571-578.
[8] Lund, W.B., Kennard, D.J., & Ringger, E.K.,
Combining Multiple Thresholding
Binarization Values to Improve OCR Output
presented in Document Recognition and
Retrieval XX Conference, California, USA,
2015. USA: SPIE.
[9] Shaikh, N.A., & Shaikh, Z.A, A generalized
thinning algorithm for cursive and non-cursive
language scripts presented in 9th International
Multitopic Conference IEEE INMIC, Pakistan,
2015. Pakistan.
[10] Shaikh, N.A., Shaikh, Z.A., & Ali, G,
Segmentation of Arabic text into characters
for recognition presented in International
Multi Topic Conference, IMTIC, Jamshoro,
Pakistan, 2018.
[11] A.P. Dempster, N.M. Laird, and D.B. Rubin.
Maximum likelihood from incomplete data via
the em algorithm. Journal of the Royal
Statistical Society. Series B (Methodological),
pages 1–38, 2019.
[12] Mochurad, L. “A New Approach for Text
Recognition on a Video CARD.” Computer
systems and information technologies, 2022,
https://doi.org/10.31891/csit-2022-3-3.
[13] Karthick, K., Ravindrakumar, K.B., Francis,
R., & Ilankannan, S., Steps Involved in Test
Recognition and Recent Research in OCR ; A
Study, 2019, Corpus ID: 212448703.
[14] Faizullah, S., Ayub, M.S., Hussain, S., &
Khan, M.A., A Survey of OCR in Arabic
Language: Applications, Techniques, and
Challenges. Applied Sciences, 2023, 13(7),
4584; https://doi.org/10.3390/app13074584.
[15] Adyanthaya, S.K., Text Recognition from
WSEAS TRANSACTIONS on SIGNAL PROCESSING
DOI: 10.37394/232014.2023.19.20
Shahid Manzoor, Nimra Wahab, M. K. A. Ahamed Khan