document-ai

Here are 274 public repositories matching this topic...

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Updated Jan 23, 2026
Python

clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

nlp ocr computer-vision document-ai multimodal-pre-trained-model eccv-2022

Updated Jul 11, 2024
Python

deepdoctection / deepdoctection

A Repo For Document AI

python nlp ocr tensorflow pytorch document-parser document-layout-analysis table-recognition table-detection document-understanding publaynet layoutlm document-ai document-image-analysis pubtabnet

Updated May 2, 2026
Python

tstanislawek / awesome-document-understanding

A curated list of resources on Document Understanding (DU)

Updated Jun 2, 2023

wxyhgk / retain-pdf

PDF translation that preserves layout, formulas, and structure; suited to scientific and technical documents

pdf ocr translation document-processing scientific-papers typst document-ai layout-preserving

Updated May 3, 2026
Python

yigitkonur / api-llm-ocr

PDF to markdown using vision LLMs — tables, layouts, and structure preserved

python ocr text-extraction table-extraction fastapi document-ai pdf-to-markdown vision-llm

Updated Feb 21, 2026
Python

run-llama / ParseBench

ParseBench - A Document Parsing Benchmark for AI Agents

benchmark machine-learning ocr evaluation pdf-parsing table-extraction document-ai llm document-parsing llamaindex vision-language-models

Updated May 1, 2026
Python

jpWang / LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

nlp information-extraction document-analysis document-understanding multilingual-models document-ai multimodal-pre-trained-model

Updated Oct 31, 2022
Python

aiptimizer / TurboOCR

Fast GPU OCR server. 270 img/s on FUNSD. TensorRT FP16, PP-OCRv5, HTTP + gRPC.

ocr grpc nvidia text-recognition text-detection inference-server fp16 tensorrt rag fastapi pdf-extraction paddleocr easyocr document-ai document-parsing qwen-vl gpu-ocr

Updated May 1, 2026
C++

SCUT-DLVCLab / Document-AI-Recommendations

Algorithms, papers, datasets, performance comparisons for Document AI.

document-understanding table-structure-recognition key-information-extraction document-ai visual-information-extraction

Updated Mar 1, 2025

harumiWeb / exstruct

Conversion from Excel to structured JSON (tables, shapes, charts) for LLM/RAG pipelines, and autonomous Excel reading/writing by AI agents via CLI and MCP integration.

skills excel python-library structured-data xlwings rag excel-automation data-ex document-ai llm mcp-server excel-parsing

Updated Apr 22, 2026
Python

doc-analysis / ReadingBank

ReadingBank: A Benchmark Dataset for Reading Order Detection

nlp natural-language-processing ocr document-understanding document-ai document-intelligence

Updated Aug 26, 2024

clovaai / webvicob

Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023

nlp ocr document-ai icdar2023

Updated Oct 24, 2023
Python

Keyvanhardani / german-ocr

German-OCR is specifically trained to extract text from German documents including invoices, receipts, forms, and other business documents.

ocr lora vlm fine-tuning apache-2 document-ai llm vision-language-model ollama gguf invoice-extraction qwen3 german-ai german-ocr

Updated Apr 23, 2026
Python

nttmdlab-nlp / SlideVQA

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

nlp ocr computer-vision document-ai aaai2023

Updated Mar 31, 2025
Python

PSPDFKit / ai-assistant-demo

The AI Document Assistant for PSPDFKit demo shows how to interact with PDFs using natural-language commands powered by AI, integrated with PSPDFKit for Web.

chat pdf ai natural-language web-sdk pspdfkit document-processing nutrient document-ai ai-assistant llm

Updated Mar 5, 2026
JavaScript

nttmdlab-nlp / VDocRAG

[CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents

nlp ocr computer-vision document-ai cvpr2025

Updated May 26, 2025
Python

PSPDFKit / nutrient-dws-mcp-server

A Model Context Protocol (MCP) server implementation that integrates with the Nutrient Document Web Service (DWS) Processor API, providing powerful PDF processing capabilities for AI assistants.

pdf mcp openai ai-agents claude document-processing nutrient pdf-processing document-ai llm langchain model-context-protocol mcp-server

Updated Mar 25, 2026
TypeScript

googleapis / python-documentai-toolbox

This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-documentai-toolbox

ai gcp google-cloud google-cloud-platform document-ai vertex-ai generative-ai

Updated Mar 6, 2026
Python

ZeningLin / ViBERTgrid-PyTorch

An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"

information-extraction document-analysis key-information-extraction document-ai visual-information-extraction

Updated Jan 9, 2024
Python
