This is a fuzzy receipt parser written in Python. It extracts information like the shop, the date, and the total from scanned receipts. It can work as a standalone script or as part of our IOS and Android application.
receipt-parser-core library depend on
imagemagick. Please install
with your favorite package manager.
To convert all images from the
data/img/ folder to text using tesseract and parse the resulting text files, run
Dockerfile is available with all dependencies needed to run the program.
To build the image, run
To run it on the sample files, try
By default, running the image will execute the
make run command. To use with your own images, run the following:
docker run -v <path_to_input_images>:/usr/src/app/data/img mre0/receipt_parser