Baidu Introduces Unlimited-OCR To Read Long Documents At Constant Speed

    
        By vramkickedin    
     | 
    
            June 30, 2026 at 9:14 pm        
    
     | 
    
        2 min read

Baidu has introduced Unlimited-OCR, a new tool designed to read and transcribe long documents without losing speed. This model processes dozens of pages in a single pass by maintaining a constant memory size instead of slowing down as the text gets longer. It achieves this by using a special attention method that mimics how humans read and copy text efficiently over long periods.

The development team at Baidu built this system by modifying a baseline model and replacing its attention layers to keep memory usage flat. They also combined this new memory design with a highly compressed image encoder to handle large documents effectively. Additionally, the developers recently added support for faster processing frameworks to help users run the model more efficiently.

Features for document processing

Key Features

Reads dozens of pages in one pass.
Maintains constant memory during text generation.
Uses a highly compressed image encoder.
Supports standard processing length of thirty-two thousand.
Processes PDFs and multiple page images.
Applies to translation and audio transcription.

This tool is built for users who need to scan and convert massive amounts of text from images or PDFs. People working with extensive technical documents will benefit from the ability to process multiple pages without running out of memory. Anyone needing local text extraction can use the provided methods to run the model on their own hardware.

Project notes and development details

The developers note that while standard models slow down as output sequences lengthen, this new approach keeps the memory cache constant. They designed the attention mechanism to be general purpose, meaning it could also improve audio transcription and translation tasks in the future. Users can deploy the model using provided container images tailored for specific graphics card setups to ensure smooth operation.