Last updated 3 days ago
allenai/olmocr is an open-source toolkit from the Allen Institute for AI (AI2), designed to efficiently convert PDFs and other documents into structured plaintext while maintaining the natural reading order.