Extracting text from a PDF file

Is it possible to extract the text from a PDF file using a MATLAB script?
I need to parse the PDF and extract a particular piece of text from it.
Is there any way to do it?

Accepted Answer

Stephen23 on 9 Jul 2015
Edited: Stephen23 on 9 Jul 2015
"Is there any way to do it?"
Of course: in principle, any data with a known specification can be parsed by MATLAB.
Is there an easy way of reading a PDF into MATLAB?
Not really, because PDFs are not sequentially organized text, although they might look like that when they are displayed or printed. This topic has also been covered before on this forum, and a simple search will bring up several informative discussions.
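
For readers on a newer release, here is a minimal sketch of one possible route, not a method from the original answer. It assumes the Text Analytics Toolbox is installed, since that toolbox provides extractFileText (as far as I know, from R2017b onward). The file name report.pdf and the keyword Total are hypothetical placeholders, and the two anonymous helpers are illustrative names of my own. Because a PDF has no guaranteed reading order, the extracted text may come out in a different order than it appears on the page.

```matlab
% Sketch: extract the text of a PDF, then keep only the lines containing a keyword.
% Assumption: Text Analytics Toolbox is available (needed for extractFileText).
pdfFile = 'report.pdf';   % hypothetical file name
keyword = 'Total';        % hypothetical search term

% Illustrative helpers (not built-ins):
% toLines splits text into trimmed lines; linesContaining filters them by keyword.
toLines = @(txt) strtrim(splitlines(string(txt)));
keepMatching = @(lines, kw) lines(contains(lines, kw));
linesContaining = @(txt, kw) keepMatching(toLines(txt), kw);

if isfile(pdfFile)
    txt = extractFileText(pdfFile);       % whole document as one string
    hits = linesContaining(txt, keyword); % string array of matching lines
    disp(hits)
else
    fprintf('File not found: %s\n', pdfFile);
end
```

If the text you need follows a fixed pattern (for example a label followed by a number), regexp on the extracted string is a reasonable next step. For scanned PDFs that contain only images, this approach will return little or no text.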

More Answers (0)
