How can I read text from pdf in Matlab?

10 views (last 30 days)
How can I read text from pdf in Matlab? I tried this code
a=OCR(unit8(test.pdf);
After the execution of the above code value of a is empty.
There is any other option which one I can use.

Accepted Answer

Walter Roberson
Walter Roberson on 25 Apr 2020
  7 Comments
Walter Roberson
Walter Roberson on 1 May 2020
the image file itself is gone, converted to pixel representation.
https://www.mathworks.com/matlabcentral/answers/436679-how-to-read-an-image-into-matlab-stored-as-pdf#comment_654452
It is probably easier to invoke external software to extract the images. Once you have the images you can ocr them. http://www.peteryu.ca/tutorials/publishing/pdf_manipulation_tips2
Balkar Singh
Balkar Singh on 1 May 2020
I tried this but not able to find the exact solution. I also tried OCR but it works for a small image.
Thanks

Sign in to comment.

More Answers (0)

Categories

Find more on Text Analytics Toolbox in Help Center and File Exchange

Tags

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!