PDFBox comes with a series of command-line utilities. They are available as standard Java applications.
See the Dependencies page for instructions on how to set your classpath in order to run PDFBox tools as Java applications.
This application will decrypt a PDF document.
NOTE: You must have the owner password to decrypt the document!
Usage: java -jar pdfbox-app-3.y.z.jar decrypt [OPTIONS] -i=<infile>
Command-Line Parameter | Description |
---|---|
-alias= |
The alias to the certificate in the keystore. |
-h, --help | Show help message and exit. |
-i, --input=<infile> | The PDF file to decrypt. |
-keyStore=<keyStore> | Path to keystore that holds certificate to decrypt the document. This is only required if the document is encrypted with a certificate, otherwise only the password is required. |
-o, --output=<outfile> | The file to save the decrypted document to. If left blank then it will be the same as the input file. |
-password=[<password>] | Password to the PDF or certificate in keystore. |
-V, --version | Print version information and exit. |
This application will encrypt a PDF document.
Usage: java -jar pdfbox-app-3.y.z.jar encrypt [OPTIONS] -i=<infile>
Command-Line Parameter | Default | Description |
---|---|---|
-canAssemble | true | Set the assemble permission. |
-canExtractContent | true | Set the extraction permission. |
-canExtractForAccessibility | true | Set the extraction permission. |
-canFillInForm | true | Set the fill in form permission. |
-canModify | true | Set the modify permission. |
-canModifyAnnotations | true | Set the modify annots permission. |
-canPrint | true | Set the print permission. |
-canPrintFaithful | true | Set the print faithful permission. |
-certFile=<certFile> | Path to X.509 cert file. | |
-h, --help | Show help message and exit. | |
-i, --input=<infile> | The PDF file to encrypt. | |
-keyLength | 256 | Key length in bits (valid values: 40, 128 or 256) |
-o, --output=<outfile> | the encrypted PDF file. If omitted the original file is overwritten. | |
-O=[<ownerPassword>] | set the owner password (ignored if certFile is set) | |
-U=[<userPassword>] | set the user password (ignored if certFile is set) | |
-V, --version | Print version information and exit. |
This application will extract all images from the given PDF document.
Usage: java -jar pdfbox-app-3.y.z.jar export:images [OPTIONS] -i=<infile>
Command-Line Parameter | Description |
---|---|
-h, --help | Show help message and exit. |
-i, --input=<infile> | The PDF file to decrypt. |
-noColorConvert | Images are extracted with their original colorspace if possible. |
-password=[<password>] | Password for the PDF or certificate in keystore. |
-prefix=<prefix> | the image prefix (default to pdf name). |
-useDirectJPEG | Forces the direct extraction of JPEG/JPX images regardless of colorspace or masking. |
-V, --version | Print version information and exit. |
This application will extract all text from the given PDF document.
Usage: java -jar pdfbox-app-3.y.z.jar export:text [OPTIONS] -i=<infile>
Command-Line Parameter | Default | Description |
---|---|---|
-alwaysNext | false | Process next page (if applicable) despite IOException (ignored when -html) |
-console | false | Send text to console instead of file. |
-debug | false | Enables debug output about the time consumption of every stage. |
-encoding=<encoding> | UTF-8 | The encoding type of the text file, e.g. UTF-8 or ISO-8859-1, UTF-16BE, UTF-16LE, etc. |
-endPage=<endPage> | Integer.MAX_INT | The last page to extract (1 based, inclusive) |
-h, --help | Show help message and exit. | |
-html | false | Output in HTML format instead of raw text. |
-md | false | Output in Markdown format instead of raw text. (since 3.0.4) |
-i, --input=<infile> | The PDF file to encrypt. | |
-ignoreBeads | false | Disables the separation by beads. |
-o, --output=<outfile> | the exported text file. | |
-password=[<password>] | Password for the PDF or certificate in keystore. | |
-rotationMagic | false | Analyze each page for rotated/skewed text, rotate to 0° and extract separately (slower, and ignored when -html) |
-sort | false | Sort the text before writing. |
-startPage=<startPage> | 1 | The first page to start extraction (1 based, inclusive) |
-V, --version | Print version information and exit. |
This application will overlay one document with the content of another document
Usage: java -jar pdfbox-app-3.y.z.jar overlay [OPTIONS] -i=<infile> -o=<outfile>
Command-Line Parameter | Default | Description |
---|---|---|
-default=<defaultOverlay> | the default overlay file. | |
-even=<evenPageOverlay> | overlay file used for even pages. | |
-first=<firstPageOverlay | overlay file used for the first page. | |
-h, --help | Show help message and exit. | |
-i, --input=<infile> | the PDF file to be overlayed. | |
-last=<lastPageOverlay> | Overlay file used for the last pages. | |
-o, --output=<outfile> | the resulting PDF file. | |
-odd=<oddPageOverlay> | overlay file used for odd pages. | |
-page=<Integer=specificPageOverlay> | overlay file used for the given page number, may occur more than once. | |
-position=<position> | BACKGROUND | Where to put the overlay, FOREGROUND or BACKGROUND. |
-useAllPages=<useAllPagesOverlay | overlay file used for overlay, all pages are used by simply repeating them | |
-V, --version | Print version information and exit. |
Examples:
This application will take an existing PDF document and allows to analyze and inspect the internal structure. It is used as replacement for the PDFReader which was removed in 2.0.0.
Usage: java -jar pdfbox-app-3.y.z.jar debug [inputfile]
Command-Line Parameter | Description |
---|---|
inputfile | the name of an optional PDF file to open. |
-h, --help | Show help message and exit. |
-password=[<password>] | password to derypt the PDF. |
-viewstructure | Activates the "view structure" view on startup. |
This application will take a list of pdf documents and merge them, saving the result in a new document.
Usage: java -jar pdfbox-app-3.y.z.jar merge [-hV] -o=outfile -i=<infile> [-i=<infile>]
Command-Line Parameter | Description |
---|---|
-h, --help | Show help message and exit. |
-i, --input=<infile> | the PDF files to merge |
-o, --output=<outfile> | the merged PDF file. |
-V, --version | Print version information and exit. |
This application will take an existing PDF document and split it into a number of new documents.
Per default the resulting files will be named after the original filenmame with -<nr>
appended before the suffix.
To override the filename use the outputPrefix
option.
Usage: java -jar pdfbox-app-3.y.z.jar split [OPTIONS] -i=<infile>
Command-Line Parameter | Description |
---|---|
-endPage=<endPage> | end page. |
-h, --help | Show help message and exit. |
-i, --input=<infile> | the PDF file to split |
--outputPrefix=<outputPrefix> | the filename prefix for split files. |
-password=[<password>] | Password to the PDF. |
-split=<split> | split after this many pages (default 1, if startPage and endPage are unset). |
-startPage=<startPage> | start page. |
-endPage | The page to stop at. |
-V, --version | Print version information and exit. |
Examples:
This application will create an image for every page in the PDF document.
Usage: java -jar pdfbox-app-3.y.z.jar render [OPTIONS] -i=<infile>
Command-Line Parameter | Default | Description |
---|---|---|
-color=<imageType> | rgb | The color depth (valid: BINARY, GRAY, RGB, ARGB, BGR) |
-cropbox=<int> <int> <int> <int> | The page area to export. | |
-dpi, -resolution=<dpi> | detected from screen (or 96 if headless) | the DPI of the output image |
-endPage=<endPage> | Integer.MAX_INT | The last page to convert, (one based, inclusive). |
-format=<imageFormat> | jpg | The image file format. |
-h, --help | Show help message and exit. | |
-i, --input=<infile> | the PDF file to convert. | |
-page=<page> | The only page to extract (1-based). | |
-password=[<password>] | Password for the PDF. | |
-prefix, -outputPrefix=<outputPrefix> | Name of PDF document | the filename prefix for image files |
-quality=<quality> | 0 for PNG and 1 for other formats | the quality to be used when compressing the image (0 <= quality <= 1). |
-startPage=<startPage> | 1 | the first page to start extraction (one based) |
-subSampling | activate subsampling (for PDFs with huge images) | |
-time | print timing information to stdout. | |
-V, --version | print version information and exit. |
This application will send a pdf document to the printer.
You must have the correct permissions to print the document!
Usage: java -jar pdfbox-app-3.y.z.jar print [OPTIONS] -i=<infile>
Command-Line Parameter | Default | Description |
---|---|---|
-border | Print with border. | |
-dpi=<dpi> | render into intermediate image with specific dpi and then print. Use -1 for automatic detection of printer resolution. | |
-duplex=<duplex> | DOCUMENT | print using duplex (SIMPLEX, DUPLEX, TUMBLE, DOCUMENT). |
-h, --help | Show help message and exit. | |
-i, --input=<infile> | the PDF file to print. | |
-mediaSize=<mediaSize> | print using media size name. | |
-noColorOpt | disable color optimizations (useful when printing barcodes). | |
-orientation | AUTO | print using orientation (AUTO, LANDSCAPE, PORTRAIT). |
-password=[<password>] | Password for the PDF. | |
-printerName=<printerName> | print to specified printer. | |
-silentPrint | print without printer dialog box. | |
-tray=<tray> | print using tray. | |
-V, --version | print version information and exit. |
This application will create a PDF document from a text file.
Usage: java -jar pdfbox-app-3.y.z.jar fromText [OPTIONS] -i=<infile> -o=<outfile>
Command-Line Parameter | Default | Description |
---|---|---|
-charset=<charset> | UTF-8 | The charset to use. |
-fontSize=<fontSize> | 10.0 | The size of the font to use. |
-lineSpacing=<factor> | 1.05 | The factor of the font size for the line height. |
-margins <left> <right> <top> <bottom> | 40.0 40.0 40.0 40.0 | The page margins. |
-h, --help | Show help message and exit. | |
-i, --input=<infile> | The text file to convert. | |
-landscape | Set orientation to landscape. | |
-o, --output=<outfile> | The generated PDF file. | |
-pageSize=<pageSize> | LETTER | The page size to use (LETTER, LEGAL, A0, A1, A2, A3, A4, A5, A6). |
-standardFont=<standardFont> | Helvetica | The font to use for the text. Either this or -ttf should be specified but not both. |
-ttf=<ttfFile> | The TrueType font to use for the text. Either this or -standardFont should be specified but not both. | |
-V, --version | Print version information and exit. |
The following font names can be used for the parameter standardFont
:
An application to decompress PDF documents.
Usage: java -jar pdfbox-app-3.y.z.jar decode [OPTIONS] <input-file> <output-file>
Command-Line Parameter | Description |
---|---|
input-file | the PDF document to decompress |
output-file | the PDF file top save to |
-h, --help | Show help message and exit. |
-password=[<password>] | Password for the PDF. |
-skipImages | don't uncompress images |
-V, --version | print version information and exit. |