A Unified Toolkit for Deep Learning Based Document Image Analysis
-
Updated
Mar 7, 2024 - Python
A Unified Toolkit for Deep Learning Based Document Image Analysis
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
Read and extract text and other content from PDFs in C# (port of PDFBox)
OCR engine for all the languages
MinerU is a one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
Document Layout Analysis resources repos for development with PdfPig.
A toolbox of ocr models and algorithms based on MindSpore
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
A Large Dataset of Historical Japanese Documents with Complex Layouts
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
Analysis of Chinese and English layouts 中英文版面分析
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.
利用java-yolov8实现版面检测(Chinese layout detection),java-yolov8 is used to detect the layout of Chinese document images
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
OCR-D compliant toolset for optical layout recognition on historical german-language documents published in Brazil
A more complete example of programming with PDFMiner, which continues where the default documentation stops
Add a description, image, and links to the layout-analysis topic page so that developers can more easily learn about it.
To associate your repository with the layout-analysis topic, visit your repo's landing page and select "manage topics."