axa-group/Parsr: Transforms PDF, Documents and Images into Enriched Structured Data


38 bookmarks. First posted by mac 6 weeks ago.


by Axa

Parsr, is a minimal-footprint document (image, pdf) cleaning, parsing and extraction toolchain which generates readily available, organized and usable data for data scientists and developers.

It provides users with clean structured and label-enriched information set for ready-to-use applications ranging from data entry and document analysis automation, archval, and many others.
scraping  library  tool  data-extraction  data-munging 
3 days ago by davidbenque
Transforms PDF, Documents and Images into Enriched Structured Data - axa-group/Parsr
data  parsing  pdf  extraction  python 
3 days ago by kwbr
Parsr, is a minimal-footprint document (image, pdf) cleaning, parsing and extraction toolchain which generates readily available, organized and usable data for data scientists and developers.
tools  parsr  parser  tool 
3 days ago by temikus
Transforms PDF and Images into Enriched Structured Data - axa-group/Parsr
4 days ago by trailoop
Transforms PDF and Images into Enriched Structured Data - axa-group/Parsr
4 days ago by sshappell
undefined
undefined
image  text  parser  scraper  ocr  tools 
6 days ago by michaelfox
Transforms PDF, Documents and Images into Enriched Structured Data - axa-group/Parsr
parser  markdown  image  data 
6 days ago by jgornick
Hmmm... also from AXA, "Transforms PDF, Documents and Images into Enriched Structured Data" [@benosteen @miaridge]
pdf  textExtraction  pdf2text  pdf2txt  textExtractor  pdf2data  tabula 
6 days ago by psychemedia
A document parsing and extraction tool that generates usable data for data scientists and developers. It can perform document hierarchy regression, page number detection, whitespace removal, link detection, and more. It takes an image or PDF as input and outputs JSON, Markdown, text, CSV, or PDF.
parser  conversion  PDF 
7 days ago by liqweed
Transforms PDF, Documents and Images into Enriched Structured Data - axa-group/Parsr
pdf  extraction  structured-data  parsing 
7 days ago by mjlassila
Transforms PDF, Documents and Images into Enriched Structured Data - axa-group/Parsr
axa  image  data  extraction  structured  table  tool  opensource  floss  pdf 
8 days ago by gilberto5757
Parsr takes as input an image (.JPG, .PNG, .TIFF, ...) or a PDF generates the following output formats: JSON, Markdown, Text, CSV (for tables)...
data  pdf  text  ocr 
9 days ago by marshallim
Transforms PDF, Documents and Images into Enriched Structured Data
pdf  image  data  extraction 
9 days ago by steffenfiedler
Transforms PDF, Documents and Images into Enriched Structured Data
pdf 
9 days ago by raygrasso
Transforms PDF and Images into Enriched Structured Data - axa-group/Parsr
pdf  parsing  extraction 
6 weeks ago by mac