Frankly...

Follow @frankm on Micro.blog.

Parsr, is a minimal-footprint document (image, pdf) cleaning, parsing and extraction toolchain which generates readily available, organized and usable data for data scientists and developers. Available as a Docker image. Looks like it could be a useful tool.

Surprise Me
Participate in the conversation
Linkblog of sources
See What Else I Am Doing