Trouble opening file with Java to use with PDFBox
10 views (last 30 days)
Show older comments
Michael Boeckel
on 9 May 2016
Commented: Michael Boeckel
on 10 May 2016
I am trying to use the PDFBox library to read the contents of PDF files, but I can't seem to open any of the files in the correct format for PDFBox to use. I'm using the following code to open each document:
javaaddpath('...\pdfParseDemo\pdfbox-2.0.0.jar')
javaaddpath('...\FontBox-0.1.0\FontBox-0.1.0\lib\FontBox-0.1.0.jar')
pdfname = '...\example.pdf';
import java.io.*;
pdfdoc = org.apache.pdfbox.pdmodel.PDDocument; %Define a PDDocument object placeholder
pdfdoc.load(FileInputStream(pdfname)); %Load the PDF file
However, this seems to return an empty object. When I try to query any of the file's properties or contents, it always returns an empty or zero value. I suspect the problem is with how I'm opening the file, because I know PDFBox has been successfully used natively with Java in many cases. Unfortunately the documentation for interfacing with Matlab is very sparse, so I'm not sure what I should be doing differently. Is there some kind of weirdness with how Matlab handles Java file input calls?
0 Comments
Accepted Answer
Elias Gule
on 9 May 2016
Try wrapping your pdfname variable in a java.lang.String variable. This sometimes works:
pdfname = java.lang.String('...\example.pdf');
More Answers (0)
See Also
Categories
Find more on Call Java from MATLAB in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!