Columnar Format Benchmarking Fixtures

Arrow, ORC, and Parquet fixtures for reader benchmarking, schema inspection, and columnar compatibility validation.

Why This Workflow Matters

  • Focuses on columnar file compatibility across Arrow IPC, ORC, and Parquet readers.
  • Useful for benchmarking scan behavior, schema handling, and metadata-aware readers.
  • Built around a downloadable pack, so columnar-comparison suites can set up all of their fixtures in one click.
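
Before benchmarking, it can help to confirm that each fixture really is the format its extension claims. The sketch below is one way to do that with only the standard library, by checking the leading magic bytes each format defines (`ARROW1` for the Arrow IPC file format, `ORC` for ORC, `PAR1` for Parquet). The function name `sniff_columnar_format` is hypothetical, and the check assumes the `.arrow` fixtures use the Arrow IPC file format; Arrow IPC streams carry no leading magic and would come back as None.

```python
# Leading magic bytes defined by each format's specification.
MAGIC_PREFIXES = {
    b"ARROW1": "ARROW",    # Arrow IPC file (random-access) format
    b"ORC": "ORC",         # ORC file header
    b"PAR1": "PARQUET",    # Parquet file header
}


def sniff_columnar_format(path):
    """Return 'ARROW', 'ORC', 'PARQUET', or None from the file's first bytes.

    Hypothetical helper for illustration; it inspects the header only and
    does not validate the rest of the file.
    """
    with open(path, "rb") as handle:
        head = handle.read(8)
    for prefix, name in MAGIC_PREFIXES.items():
        if head.startswith(prefix):
            return name
    return None
```

A suite could call this on every fixture during setup and skip or flag any file whose detected format does not match the Format column in the fixture table.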

Recommended Packs

Columnar Compatibility Fixture Pack

Bundle of real Arrow, ORC, and Parquet fixtures for columnar reader compatibility, schema inspection, and warehouse scan validation.

columnar_compatibility_fixture_pack.zip · 128.0 KB

Warehouse Format Comparison Fixture Pack

Bundle of real Arrow, ORC, Parquet, and Avro fixtures for warehouse migration planning, reader compatibility, and cross-format ingestion validation.

warehouse_format_comparison_fixture_pack.zip · 24.6 KB
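
Since both packs ship as zip archives, a quick inventory by file extension shows which formats a pack actually contains before any reader runs. This is a minimal sketch using the standard-library `zipfile` module; the helper name `group_pack_members` is hypothetical, and the internal directory layout of the packs is not documented here, so the code makes no assumption about folder names.

```python
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath


def group_pack_members(zip_source):
    """Map lowercase file extension -> sorted member names inside a pack.

    `zip_source` may be a path or a binary file-like object.
    Hypothetical helper; directory entries are skipped.
    """
    groups = defaultdict(list)
    with zipfile.ZipFile(zip_source) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            suffix = PurePosixPath(info.filename).suffix.lower()
            groups[suffix].append(info.filename)
    return {suffix: sorted(names) for suffix, names in groups.items()}
```

For example, `group_pack_members("columnar_compatibility_fixture_pack.zip")` would return a dictionary keyed by extensions such as `.arrow`, `.orc`, and `.parquet`, assuming the pack has been downloaded to the working directory.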

Fixture Matrices

Arrow IPC Fixture Matrix

Choose Arrow fixtures for IPC reader validation, nested column handling, and warehouse-format interoperability checks.

ORC Columnar Fixture Matrix

Pick ORC fixtures for warehouse scans, predicate pushdown checks, and metadata-aware reader validation.

Parquet Ingestion Fixture Matrix

Choose Parquet fixtures for columnar ingestion, warehouse imports, nested-column handling, and batch-load validation.

Suggested Fixtures

Filename                              Format    Size
arrow_decimal_ipc_sample.arrow        ARROW     250.6 KB
arrow_nested_ipc_sample.arrow         ARROW     2.6 KB
orc_metadata_sample.orc               ORC       41.0 KB
orc_test1_sample.orc                  ORC       1.7 KB
parquet_list_columns_sample.parquet   PARQUET   2.5 KB
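
For reader benchmarking against these fixtures, a small timing harness is often enough to compare scan behavior across formats. The sketch below is an assumption about how such a harness might look, not part of the fixture packs: `time_reader` is a hypothetical name, and `read_fn` stands for whatever reader is under test. It uses only the standard library.

```python
import statistics
import time


def time_reader(read_fn, path, repeats=5):
    """Call read_fn(path) `repeats` times and return timings in seconds.

    Hypothetical harness: `read_fn` is any callable that fully reads the
    fixture at `path`. The first run may include cold-cache effects.
    """
    if repeats < 1:
        raise ValueError("repeats must be at least 1")
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        read_fn(path)
        samples.append(time.perf_counter() - start)
    return {
        "runs": repeats,
        "min": min(samples),
        "median": statistics.median(samples),
        "max": max(samples),
    }
```

If pyarrow is installed, `pyarrow.parquet.read_table` or `pyarrow.orc.read_table` could be passed as `read_fn` for the Parquet and ORC fixtures. The fixtures listed here are small (1.7 KB to 250.6 KB), so timings will mostly reflect per-file overhead such as metadata parsing rather than sustained scan throughput.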