How can I read text from pdf in Matlab?
Balkar Singh
on 25 Apr 2020
Commented: Balkar Singh
on 1 May 2020
How can I read text from pdf in Matlab? I tried this code
a = ocr(uint8('test.pdf'));
After executing the above code, the value of a is empty.
Is there any other option I can use?
Accepted Answer
Walter Roberson
on 25 Apr 2020
7 Comments
Walter Roberson
on 1 May 2020
The image file itself is gone; it has been converted to a pixel representation.
https://www.mathworks.com/matlabcentral/answers/436679-how-to-read-an-image-into-matlab-stored-as-pdf#comment_654452
It is probably easier to invoke external software to extract the images from the PDF. Once you have the images, you can run ocr on them. http://www.peteryu.ca/tutorials/publishing/pdf_manipulation_tips2
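A rough sketch of that workflow, in case it helps. It is not taken from the linked tutorial. It assumes the pdfimages command-line tool from Poppler is installed and on the system path, that the Computer Vision Toolbox (for ocr) is available, and that the PDF is the test.pdf from the question. The scratch folder and the 'img' prefix are just illustrative names.

```matlab
% Sketch: extract the images embedded in a PDF with an external tool,
% then run OCR on each extracted image.
% Assumptions: Poppler's pdfimages is on the system path, and the
% Computer Vision Toolbox (ocr) is installed.
pdfFile = 'test.pdf';            % file name taken from the question
outDir  = tempname;              % scratch folder for the extracted images
mkdir(outDir);

% pdfimages writes files named <prefix>-000.png, <prefix>-001.png, ...
cmd = sprintf('pdfimages -png "%s" "%s"', pdfFile, fullfile(outDir, 'img'));
[status, msg] = system(cmd);
if status ~= 0
    error('pdfimages failed: %s', msg);
end

% Read each extracted image and collect the recognized text.
files = dir(fullfile(outDir, 'img-*.png'));
pageText = cell(1, numel(files));
for k = 1:numel(files)
    I = imread(fullfile(outDir, files(k).name));
    result = ocr(I);             % ocr returns an ocrText object
    pageText{k} = result.Text;   % recognized characters as a char vector
end

allText = strjoin(pageText, newline);
disp(allText);
```

This only finds text that is stored as images inside the PDF (for example, scanned pages). If the PDF has a real text layer, extractFileText from Text Analytics Toolbox may be able to read it directly, with no OCR step.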
More Answers (0)