Requirements¶
You need a Linux machine or Unix-like setup that has the following software installed:
- Python >= 3.7
- Django >= 3.0
- Tesseract - because of OCR
- Imagemagick - Image operations
- Poppler - PDF operations
Python¶
Papermerge is a Python 3 application.
Imagemagick¶
Papermerge uses Imagemagick to convert between images format
Poppler¶
More exactly poppler utils are used. For exampple pdfinfo command line utility is used to find out number of page in PDF document.
Tesseract¶
If you never heard of Tesseract software - it is google’s open source Optical Character Recognition software. It extracts text from images. It works fantastically well for wide range of languages.
In addition to the above, there are a number of Python requirements, all of which are listed in a file called requirements/base.txt
in the project root directory.