Using VLLM (like GPT-4o) to parse PDF into markdown. Our approach is very simple (only 293 lines of code), but can almost perfectly parse typography, math formulas, tables, pictures, charts, etc.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Abstract: This paper presents Laminar 2.0, an enhanced serverless framework for running dispel4py streaming work-flows. Building on Laminar 1.0, this version introduces improved dependency management, ...