AIFuture
Back to news
AI ResearchMarkTechPost·

Baidu Releases Unlimited OCR, a 3B Model That Keeps the KV Cache Flat for Long-Document Parsing

Baidu Releases Unlimited OCR, a 3B Model That Keeps the KV Cache Flat for Long-Document Parsing

Baidu open-sourced Unlimited OCR, a 3B-parameter MoE model that parses dozens of document pages in a single forward pass. Its Reference Sliding Window Attention (R-SWA) holds the KV cache constant, so memory and latency stay flat as output grows. It scores 93.23 on OmniDocBench v1.5, beating the DeepSeek OCR baseline by 6.22 points, under an MIT license. The post Baidu Releases Unlimited OCR, a…

This is a summary curated by AIFuture. Read the complete article at the original source:

Read the full story on MarkTechPost

Build the skills behind the headlines

Data ScienceCoursera

Machine Learning Specialization

Andrew Ng's flagship program covering supervised and unsupervised learning, neural networks, and best practices for real-world ML.

Beginner·Subscription
View Course
Generative AICoursera

Generative AI for Everyone

Andrew Ng explains how generative AI works and how to apply it in your work and life — no coding required.

Beginner·Subscription
View Course
Data ScienceCoursera

Deep Learning Specialization

Five-course series on neural networks, CNNs, sequence models, and transformers from DeepLearning.AI.

Intermediate·Subscription
View Course

Never miss what matters in AI

Get the most important AI news and course picks in your inbox.