Image Extraction Fixtures

PNG, JPEG, TIFF, and scan-style PDF fixtures for OCR, document-photo capture, scan preprocessing, and image-based extraction workflows.

Warum dieser Workflow wichtig ist

  • Mix clean scans, noisy OCR inputs, mobile-photo captures, and TIFF archival sources in one workflow.
  • Use image and PDF fixtures together to compare scan ingestion against fixed-layout document extraction.
  • Anchored to a downloadable pack so teams can seed OCR and preprocessing suites quickly.

Empfohlene Pakete

Image Extraction Fixture Pack

Bundle of real PNG, JPEG, TIFF, and scan-style PDF fixtures for OCR, scan ingestion, and document-photo extraction workflows.

image_extraction_fixture_pack.zip · 382.3 KB

Dokumentenextraktions-Fixture-Paket

Paket mit echten PDF- und TXT-Fixtures fuer Extraktion, Layout-Analyse, OCR-aehnliche Validierung, geschuetzte Dokumente und beschaedigte Dateien.

document_extraction_fixture_pack.zip · 18.9 KB

Fixture-Matrizen

PNG OCR Fixture Matrix

Pick PNG fixtures for clean receipt scans, grayscale OCR preprocessing, and image-based extraction smoke tests.

JPEG Document Capture Matrix

Choose JPEG fixtures for mobile document photos, compressed receipt captures, and OCR workflows that start from camera images.

TIFF Scan Fixture Matrix

Use TIFF fixtures for archival scans, fax-like OCR noise, and image-based extraction workflows that prefer TIFF sources.

Fixture-Matrix fuer PDF-Extraktion

Nutzen Sie die PDF-Matrix, um zwischen textreichen, fixed-layout-, formularartigen oder beschaedigten Fixtures in Preview- und Extraktions-Pipelines zu waehlen.

Empfohlene Fixtures

Dateiname Format Groesse Aktionen
png_receipt_scan_sample.png PNG 4.4 KB
png_ocr_noise_sample.png PNG 42.0 KB
jpeg_mobile_document_capture_sample.jpeg JPEG 36.9 KB
jpeg_receipt_photo_sample.jpeg JPEG 282.7 KB
tiff_archival_scan_sample.tiff TIFF 65.7 KB
tiff_fax_noise_sample.tiff TIFF 540.2 KB
pdf_scan_like_image_sample.pdf PDF 3.7 KB
pdf_ocr_noise_sample.pdf PDF 7.9 KB