Description

How to Extract Text from Scanned Documents with MATLAB

From the series: “How To” Video Series for Biomedical and Pharmaceutical Applications

Learn how to quickly extract text from your scanned documents using MATLAB^®. To do so, you’ll first see how to convert a PDF to an image, which will then allow you to perform optical character recognition (OCR) in just one line of code. The ocr function also returns properties about the individually recognized words and characters. You’ll see how these properties can then be used to locate and assess the confidence of the recognized text.

Published: 20 Mar 2022

Full Transcript

Related Resources