extraction for petabytes of PDFs created annually

Customizable & Scalable Document Ingestion: any format, with any modality, of any size

Built on NVIDIA NIM
Supports docx, pptx, png, jpg, infographics (future: html, xlsx)
Extracts text, structured charts, tables (future: flow charts, block diagrams, infographics)
Customizable extraction operations
GPU-accelerated linear scaling

Multimodal Data Extraction Throughput: Extraction of Enterprise Documents
Higher throughput: 15x improved throughput
NIM On: 12 pages/sec
NIM Off: 0.81 pages/sec
Pages per second, evaluated on a publicly available dataset of PDFs consisting of text, charts, and tables.
NIM On: nv-yolox-structured-image-v1, nemoretriever-page-elements-v1, nemoretriever-graphic-elements-v1, nemoretriever-table-structure-v1, PaddleOCR, nv-llama3.2-embedqa-1b-v2.
NIM Off: open-source alternative. HW: 1xH100.

Multimodal Retrieval Accuracy: Retrieval of Enterprise Documents
NeMo Retriever Extraction, Recall@5 accuracy: 50% fewer incorrect answers
NIM On: 91%
NIM Off: 81%
Evaluated on a publicly available dataset of PDFs consisting of text, charts, tables, and infographics.
NIM On: nemoretriever-page-elements-v2, nemoretriever-table-structure-v1, nemoretriever-graphic-elements-v1, paddle-ocr.
NIM Off: open-source alternative. HW: 1xH100.
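
The two headline claims (15x throughput, 50% fewer incorrect answers) can be checked against the reported figures with a little arithmetic. The sketch below is illustrative only. It assumes that 12 pages/sec and 91% are the NIM On results, and that "incorrect answers" means 1 minus Recall@5. The function names are hypothetical. The slide does not define Recall@5, so `recall_at_k` uses one common reading: the fraction of queries whose single relevant item appears in the top k results.

```python
def speedup(accelerated: float, baseline: float) -> float:
    """Ratio of accelerated throughput to baseline throughput."""
    return accelerated / baseline


def relative_error_reduction(acc_new: float, acc_old: float) -> float:
    """Fraction by which the error rate (1 - accuracy) shrinks."""
    err_old = 1.0 - acc_old
    err_new = 1.0 - acc_new
    return (err_old - err_new) / err_old


def recall_at_k(ranked_results, relevant_ids, k=5):
    """Assumed definition: share of queries whose one relevant item
    is among the top-k retrieved items."""
    hits = sum(
        1 for ranked, relevant in zip(ranked_results, relevant_ids)
        if relevant in ranked[:k]
    )
    return hits / len(relevant_ids)


# Throughput: 12 pages/sec (NIM On) vs 0.81 pages/sec (NIM Off)
print(round(speedup(12, 0.81), 1))                      # 14.8, rounded to 15x on the slide

# Recall@5: 91% (assumed NIM On) vs 81% (assumed NIM Off)
print(round(relative_error_reduction(0.91, 0.81), 3))   # 0.526, i.e. roughly 50% fewer misses
```

Under these assumptions the error rate drops from 19% to 9%, which is where the "50% fewer incorrect answers" figure plausibly comes from.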