Matrice di fixture per estrazione PDF
Usa la matrice PDF per scegliere tra fixture ricche di testo, a layout fisso, tipo form o corrotte.
10
Righe di fixture
3
Come usare questa matrice
Use the matrix when the validation target is a set of variants rather than one canonical sample.
Come usare questa matrice
Coverage
- Copre PDF a una pagina, multipagina, layout complessi e file corrotti.
- Pensata per anteprima, estrazione testo, mapping campi e percorsi di errore parser.
- Utile per fatture, report e workflow documentali dove il layout conta.
Righe di fixture
Available Variants
| Variante | Profilo | Focus del test | File | Dimensione | Scarica |
|---|---|---|---|---|---|
|
Single-Page Text
Best default sanity check for renderers and PDF text extraction.
|
Valid baseline | Simple rendering and extraction |
pdf_single_page_text_sample.pdf
|
725 B | Scarica |
|
Multi-Page Report
Useful for multi-page previews, extraction batching, and document splitting.
|
Valid document | Pagination and page count |
pdf_multi_page_report_sample.pdf
|
1.3 KB | Scarica |
|
Invoice Layout
Targets invoice parsers and structured extraction pipelines.
|
Layout-driven fixture | Field extraction from fixed layouts |
pdf_invoice_layout_sample.pdf
|
774 B | Scarica |
|
Scan-Style PDF
Useful for pipelines that distinguish text PDFs from scan-like pages.
|
Image-heavy fixture | OCR-style extraction |
pdf_scan_like_image_sample.pdf
|
3.7 KB | Scarica |
|
OCR-Noise PDF
Targets extraction robustness when scan quality or contrast is poor.
|
Image-heavy edge | Noisy OCR fallback |
pdf_ocr_noise_sample.pdf
|
7.9 KB | Scarica |
|
Form-Like PDF
Useful for OCR-adjacent field mapping and fixed-position extraction logic.
|
Structured layout | Form field and box detection |
pdf_form_like_sample.pdf
|
773 B | Scarica |
|
Landscape Report
Targets preview rotation, table extraction, and page-fit UI handling.
|
Orientation variant | Wide-table rendering |
pdf_landscape_report_sample.pdf
|
743 B | Scarica |
|
Multi-Column Report
Useful for column segmentation and reading-order extraction tests.
|
Layout complexity | Column-aware reading order |
pdf_multi_column_report_sample.pdf
|
3.3 KB | Scarica |
|
Password-Protected PDF
Use password `samplefile` for protected-document handling and UX checks.
|
Protected document | Unlock flow and restricted parsing |
pdf_password_protected_sample.pdf
|
3.2 KB | Scarica |
|
Truncated PDF
Good for parser failures, preview fallback, and corrupt-download handling.
|
Broken fixture | Damaged file recovery |
pdf_truncated_edge_case_sample.pdf
|
701 B | Scarica |
Pagine strategiche correlate
Related Packs and Workflows
Pack correlati
Pack di fixture per estrazione documenti
Flussi di lavoro correlati
Fixture per la validazione degli upload
Apri workflowFixture per regressione parser
Apri workflowFixture per estrazione documenti
Apri workflowPagine strategiche correlate
Related Pages
Best Format Guides
Use-Case Recommendations
Miglior formato per archivio documentale di lungo periodo
Miglior formato per editing documentale collaborativo
How to Convert
How to Convert DOCX to PDF
How to Convert EPUB to PDF
How to Convert PDF to DOCX
How to Convert PDF to EPUB