Metadata-Version: 2.1
Name: python-pdfbox
Version: 0.1.1
Summary: Python interface to Apache PDFBox command-line tools.
Home-page: https://github.com/lebedov/python-pdfbox/
Author: Lev E. Givon
Author-email: lev@columbia.edu
License: Apache
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Software Development
Requires-Python: >=3
Requires-Dist: appdirs
Requires-Dist: sarge

Package Description
-------------------
Provides a simple Python 3 interface to the `Apache PDFBox <https://pdfbox.apache.org/>`_
command-line tools.

.. image:: https://img.shields.io/pypi/v/python-pdfbox.svg
    :target: https://pypi.python.org/pypi/python-pdfbox
    :alt: Latest Version

Requirements
------------
Aside from Python 3 and those packages specified in
`setup.py <https://github.com/lebedov/python-pdfbox/blob/master/setup.py>`_,
python-pdfbox requires ``java`` to be present in the system path.

Installation
------------
The package may be installed as follows: ::

    pip install python-pdfbox

One may specify the location of the PDFBox jar file via the ``PDFBOX``
environmental variable. If not set, python-pdfbox looks for the jar file
in the platform-specific user cache directory and automatically downloads
and caches it if not present.

Usage
-----
The interface currently exposes the text extraction feature of PDFBox only: ::

    import pdfbox
    p = pdfbox.PDFBox()
    text = p.extract_text('/path/to/my_file.pdf')

Development
-----------
The latest release of the package may be obtained from
`GitHub <https://github.com/lebedov/python-pdfbox>`_.

Author
------
See the included `AUTHORS.rst 
<https://github.com/lebedov/python-pdfbox/blob/master/AUTHORS.rst>`_ file for more 
information.

License
-------
This software is licensed under the
`Apache 2.0 License <https://opensource.org/licenses/Apache-2.0>`_.
See the included `LICENSE.rst 
<https://github.com/lebedov/python-pdfbox/blob/master/LICENSE.rst>`_ file for more 
information.


