OCR can not distinguish between zero and two of consolas font.

8 views (last 30 days)
,,,,,,
I tried using OCR to train ocr to recognise numbers from 1 to 50
The font is consolas. However zero (0) and two (2) are always conflicting.
zero(0) always reads as 2 and sometimes as forward slash when using them in the matlab.
this is the onedrive link to a copy of data. I have also added the code to it
Please what am i doing wrong?
Best regard,
Isaac
  2 Comments
Image Analyst
Image Analyst on 9 Aug 2020
Edited: Image Analyst on 9 Aug 2020
Attach an image where the 2 is determined to be a 0 and another image where it's determined to be a /. I mean here, not some third party site.
ISAAC DOUGHAN
ISAAC DOUGHAN on 9 Aug 2020
Hi, I have attached the two situations to the question in addition to the Data files

Sign in to comment.

Answers (1)

Rishik Ramena
Rishik Ramena on 13 Aug 2020
To correctly identify the Zeros, you can change the value of the TextLayout parameter to ‘Block’ as OCR model is just trying to identify numbers which can be considered as a single block of text.
The documentation for OCR TextLayout parameter says: "If you get poor OCR results, try a different layout that matches the text in your image."

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!