How to Extract Text from Scanned Documents with MATLAB
From the series: “How To” Video Series for Biomedical and Pharmaceutical Applications
Learn how to quickly extract text from your scanned documents using MATLAB®. To do so, you’ll first see how to convert a PDF to an image, which will then allow you to perform optical character recognition (OCR) in just one line of code. The ocr
function also returns properties about the individually recognized words and characters. You’ll see how these properties can then be used to locate and assess the confidence of the recognized text.
Select a Web Site
Choose a web site to get translated content where available and see local events and offers. Based on your location, we recommend that you select: .
You can also select a web site from the following list
How to Get Best Site Performance
Select the China site (in Chinese or English) for best site performance. Other MathWorks country sites are not optimized for visits from your location.
Americas
- América Latina (Español)
- Canada (English)
- United States (English)
Europe
- Belgium (English)
- Denmark (English)
- Deutschland (Deutsch)
- España (Español)
- Finland (English)
- France (Français)
- Ireland (English)
- Italia (Italiano)
- Luxembourg (English)
- Netherlands (English)
- Norway (English)
- Österreich (Deutsch)
- Portugal (English)
- Sweden (English)
- Switzerland
- United Kingdom (English)
Asia Pacific
- Australia (English)
- India (English)
- New Zealand (English)
- 中国
- 日本Japanese (日本語)
- 한국Korean (한국어)