You need a Linux machine or Unix-like setup that has the following software installed:


Papermerge is a Python 3 application.


Papermerge uses Imagemagick to convert between images format


More exactly poppler utils are used. For exampple pdfinfo command line utility is used to find out number of page in PDF document.


If you never heard of Tesseract software - it is google’s open source Optical Character Recognition software. It extracts text from images. It works fantastically well for wide range of languages.

In addition to the above, there are a number of Python requirements, all of which are listed in a file called requirements/base.txt in the project root directory.