Office Document Parsing Fixtures

DOCX and companion document fixtures for office parsing, text extraction, table handling, and document-ingestion workflows.

Why this workflow matters

  • Covers multi-section DOCX files, table-bearing documents, and office-style narrative content.
  • Use with PDF companions to compare office parsing against fixed-layout extraction outputs.
  • Ships with a single download pack so office-ingestion suites can start from one bundle.
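
As a rough illustration of the text-extraction side of this workflow, the sketch below pulls paragraph text out of a DOCX file using only the Python standard library. It assumes the fixture is a standard WordprocessingML package with its body in word/document.xml; the helper name docx_paragraphs is hypothetical and not part of any pack tooling.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_paragraphs(source):
    """Return the non-empty paragraph texts of a DOCX (path or file-like object).

    Hypothetical helper for illustration. Paragraphs inside table cells are
    included, because every w:p element in the document body is visited.
    """
    with zipfile.ZipFile(source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    paragraphs = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # A paragraph's text is split across runs; join every w:t node.
        text = "".join(node.text or "" for node in para.iter(f"{{{W_NS}}}t"))
        if text.strip():
            paragraphs.append(text)
    return paragraphs
```

Calling docx_paragraphs("docx_project_brief_sample.docx") on a downloaded fixture would give a list of strings that can be compared against the text a PDF extractor produces for a companion file.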

Recommended packs

Office Document Parsing Pack

Bundle of real DOCX and related document fixtures for office-document parsing, text extraction, and structured-content QA.

office_document_parsing_pack.zip · 12.1 KB
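
Before wiring the bundle into a test suite, it can help to inventory what it contains. The following is a minimal sketch that groups the entries of a ZIP archive by file extension; the internal layout of office_document_parsing_pack.zip is not documented here, so the function makes no assumption about folder structure, and the name fixtures_by_format is hypothetical.

```python
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath


def fixtures_by_format(bundle):
    """Group the file names inside a ZIP bundle by lowercase extension.

    Hypothetical helper for illustration; accepts a path or file-like object.
    """
    groups = defaultdict(list)
    with zipfile.ZipFile(bundle) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue  # skip directory entries, keep only files
            suffix = PurePosixPath(info.filename).suffix.lower().lstrip(".")
            groups[suffix or "(none)"].append(info.filename)
    return {fmt: sorted(names) for fmt, names in groups.items()}
```

A suite could then parametrize its DOCX tests over fixtures_by_format("office_document_parsing_pack.zip").get("docx", []).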

Fixture matrices

DOCX Office Fixture Matrix

Choose DOCX fixtures for office-document parsing, section extraction, table handling, and office-ingestion workflows.
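
For the table-handling case, one possible check is to read each table as rows of cell text. The sketch below again uses only the standard library and assumes standard WordprocessingML markup (w:tbl, w:tr, w:tc); docx_tables is a hypothetical name, and rows wrapped in content controls are not handled.

```python
import zipfile
import xml.etree.ElementTree as ET

# Clark-notation prefix for the WordprocessingML main namespace.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_tables(source):
    """Return each table in a DOCX as a list of rows of cell strings.

    Hypothetical helper for illustration. Nested tables appear as separate
    entries, since every w:tbl element in the document is visited.
    """
    with zipfile.ZipFile(source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    tables = []
    for tbl in root.iter(W + "tbl"):
        rows = []
        for tr in tbl.findall(W + "tr"):  # direct child rows only
            row = []
            for tc in tr.findall(W + "tc"):
                # Join the cell's direct paragraphs with newlines.
                cell = "\n".join(
                    "".join(t.text or "" for t in p.iter(W + "t"))
                    for p in tc.findall(W + "p")
                )
                row.append(cell)
            rows.append(row)
        tables.append(rows)
    return tables
```

Run against a table-bearing fixture such as docx_table_report_sample.docx, the result can be asserted on row and column counts without depending on exact cell wording.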

PDF Extraction Fixture Matrix

Use the PDF matrix to choose among text-rich, fixed-layout, form-style, or damaged fixtures for preview and extraction pipelines.

Suggested fixtures

File name                            Format  Size
docx_project_brief_sample.docx       DOCX    2.6 KB
docx_meeting_notes_sample.docx       DOCX    2.7 KB
docx_table_report_sample.docx        DOCX    2.7 KB
docx_policy_manual_sample.docx       DOCX    2.6 KB
pdf_multi_column_report_sample.pdf   PDF     3.3 KB