DEV Community

# ocr

Posts

2025 Complete Guide: How to Build End-to-End OCR with HunyuanOCR

6 min read
Stop Typing That Image Text: PaddleOCR Makes AI-Powered Text Extraction Effortless

3 min read
What is Intelligent Document Processing?

3 min read
Optical Clear Adhesive (OCA): Why It Matters in Modern Display Assembly

4 min read
Paddle OCR-VL & DeepSeek-OCR

2 min read
DeepSeek OCR in Automation Pipelines: Practical Engineering Insights and Integration Patterns

4 min read
DeepSeek-OCR: When a Picture Is Actually Worth 10 Fewer Tokens

9 min read
I am building a document api suite that gives you coordinates for every answer

1 min read
Complete Guide 2025: How DeepSeek OCR Reduces AI Costs by 20x Through "Visual Compression"

10 min read
PaddleOCR: Revolutionizing OCR with AI-Powered Document Understanding

3 min read
Kreuzberg: Revolutionizing Document Intelligence in Python

3 min read
2025 Complete Guide: PaddleOCR-VL-0.9B — Baidu's Ultra-Lightweight Document Parsing Powerhouse

9 min read
Farsi Image generator

2 min read
Building Purchase Tracker: The MVP That Eats Your Receipts (So You Don’t Have To)

2 min read
Step-by-Step Guide to Translating Documents Online Without Breaking Formatting

3 min read
The OCR Model That Outranks GPT-4o

16 min read
Generating Synthetic RTL OCR Data for Donut with SynthDoG-RTL

2 min read
Major Challenges in Document Processing & How AI Solves Them | 2025 Guide

4 min read
NuMarkdown-8B-Thinking: The Open-Source Reasoning OCR that Converts PDFs to Auditable Markdown for Enterprise RAG Pipelines

10 min read
Building an iOS ID Scanner with Face, Document, OCR and MRZ Detection

10 min read
OCR in Healthcare – Comparing Technical Approaches

2 min read
Kreuzberg: The Python Document Intelligence Framework That Will Blow Your Mind!

3 min read
OCR Automation: Streamlining Document Processing Efficiently

4 min read
K-shot training with LLMs

1 min read
Implementing OCR in Azure: A Comparison of Logic App Connectors and Function Apps

3 min read