Matrice di fixture per estrazione PDF

Usa la matrice PDF per scegliere tra fixture ricche di testo, a layout fisso, tipo form o corrotte.

10 Righe di fixture
3 Come usare questa matrice
Use the matrix when the validation target is a set of variants rather than one canonical sample.
Come usare questa matrice

Coverage

  • Copre PDF a una pagina, multipagina, layout complessi e file corrotti.
  • Pensata per anteprima, estrazione testo, mapping campi e percorsi di errore parser.
  • Utile per fatture, report e workflow documentali dove il layout conta.
Righe di fixture

Available Variants

Variante Profilo Focus del test File Dimensione Scarica
Single-Page Text
Best default sanity check for renderers and PDF text extraction.
Valid baseline Simple rendering and extraction pdf_single_page_text_sample.pdf
.pdf SHA256 3426bbfe53be...
725 B Scarica
Multi-Page Report
Useful for multi-page previews, extraction batching, and document splitting.
Valid document Pagination and page count pdf_multi_page_report_sample.pdf
.pdf SHA256 a22424930c98...
1.3 KB Scarica
Invoice Layout
Targets invoice parsers and structured extraction pipelines.
Layout-driven fixture Field extraction from fixed layouts pdf_invoice_layout_sample.pdf
.pdf SHA256 45c10f35ba18...
774 B Scarica
Scan-Style PDF
Useful for pipelines that distinguish text PDFs from scan-like pages.
Image-heavy fixture OCR-style extraction pdf_scan_like_image_sample.pdf
.pdf SHA256 22a2cb26d64c...
3.7 KB Scarica
OCR-Noise PDF
Targets extraction robustness when scan quality or contrast is poor.
Image-heavy edge Noisy OCR fallback pdf_ocr_noise_sample.pdf
.pdf SHA256 19097c94fe1a...
7.9 KB Scarica
Form-Like PDF
Useful for OCR-adjacent field mapping and fixed-position extraction logic.
Structured layout Form field and box detection pdf_form_like_sample.pdf
.pdf SHA256 6b5c49113a70...
773 B Scarica
Landscape Report
Targets preview rotation, table extraction, and page-fit UI handling.
Orientation variant Wide-table rendering pdf_landscape_report_sample.pdf
.pdf SHA256 927df1c7e742...
743 B Scarica
Multi-Column Report
Useful for column segmentation and reading-order extraction tests.
Layout complexity Column-aware reading order pdf_multi_column_report_sample.pdf
.pdf SHA256 6c5d36e07e3d...
3.3 KB Scarica
Password-Protected PDF
Use password `samplefile` for protected-document handling and UX checks.
Protected document Unlock flow and restricted parsing pdf_password_protected_sample.pdf
.pdf SHA256 37f22291ff8b...
3.2 KB Scarica
Truncated PDF
Good for parser failures, preview fallback, and corrupt-download handling.
Broken fixture Damaged file recovery pdf_truncated_edge_case_sample.pdf
.pdf SHA256 537de4efe227...
701 B Scarica
Pagine strategiche correlate

Related Packs and Workflows

Pack correlati

Pack di fixture per estrazione documenti

Pack di fixture PDF e TXT reali per estrazione, analisi layout, validazione OCR, documenti protetti e file corrotti.

document_extraction_fixture_pack.zip · 18.9 KB

Flussi di lavoro correlati

Fixture per la validazione degli upload

File di test e pack per verificare limiti di upload, validazione MIME, ricezione ZIP e flussi a contenuto misto.

Apri workflow

Fixture per regressione parser

Fixture stabili e casi limite per parser di documenti, dati e archivi che richiedono copertura di regressione deterministica.

Apri workflow

Fixture per estrazione documenti

Fixture PDF e TXT per analisi del layout, estrazione tipo OCR, gestione documenti protetti e normalizzazione testo.

Apri workflow
Pagine strategiche correlate

Related Pages

Best Format Guides

Use-Case Recommendations

How to Convert

Comparisons