OCR

allenai/olmocr is an open-source toolkit from the Allen Institute for AI (AI2), designed to efficiently convert PDFs and other documents into structured plaintext while maintaining the natural reading order.

Last updated