Data format
PARQUET Sample Files
Parquet (.parquet) files store columnar analytic datasets optimized for batch ingestion and warehouse-style reads. Use sample Parquet files to test ETL pipelines, schema handling, and columnar import workflows.
3 files
All up to 50 KB
SHA256 verified
Manifest included
Quick facts
Files first
PARQUET Sample Files — Download
Use cases
PARQUET Testing Workflows
Columnar Preview and Schema Review
Binary and List Column Validation
Compare and decide
PARQUET Format Comparisons
FAQ and reference
PARQUET File FAQ
Checksum Verification
Use checksums to confirm file integrity after download.
shasum -a 256 your_file_name_here
# Compare output with SHA256 values listed above.
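For automated checks, the same comparison can be scripted. The sketch below is one possible approach using only Python's standard library; the function names `sha256_of_file` and `matches_checksum` are illustrative and do not come from this page.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=65536):
    """Return the lowercase hex SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # Read in chunks so large files never need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_hex):
    """Compare a file's digest with a published value, ignoring case and padding."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

Call `matches_checksum("your_file_name_here", "<published SHA256 value>")` and treat a `False` result as a corrupted or incomplete download.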
Where is the machine-readable manifest?
Use the manifest when you need stable names, SHA256 values, and URLs for automation.
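As a rough illustration of manifest-driven automation, the sketch below checks a download folder against a manifest. The manifest layout is an assumption: this page does not describe it, so the JSON shape (`files`, `name`, `sha256`) and the function name `verify_against_manifest` are hypothetical and should be adapted to the real file.

```python
import hashlib
import json
from pathlib import Path


def verify_against_manifest(manifest_path, download_dir):
    """Return {file name: "ok" | "mismatch" | "missing"} for each manifest entry.

    ASSUMPTION: the manifest is JSON shaped like
    {"files": [{"name": "...", "sha256": "...", "url": "..."}]}.
    The real manifest may use different keys; adjust them to match.
    """
    manifest = json.loads(Path(manifest_path).read_text(encoding="utf-8"))
    results = {}
    for entry in manifest["files"]:
        target = Path(download_dir) / entry["name"]
        if not target.is_file():
            results[entry["name"]] = "missing"
            continue
        # Sample files are small, so reading each one whole is acceptable here.
        actual = hashlib.sha256(target.read_bytes()).hexdigest()
        matched = actual == entry["sha256"].strip().lower()
        results[entry["name"]] = "ok" if matched else "mismatch"
    return results
```

Returning a status per file, rather than raising on the first failure, lets a CI job report every damaged or missing fixture in one run.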
Validation Methodology
- Validate extension and MIME detection before processing.
- Benchmark performance with both small and large files.
- Test malformed-input handling and error messaging.
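The first and last checks above can be approximated with a cheap structural test before handing a file to a full reader. A Parquet file starts with the 4-byte magic `PAR1` and ends with a 4-byte little-endian footer length followed by `PAR1` again. The sketch below relies only on that layout; the function name `check_parquet_container` and the reason strings are illustrative. It does not parse the footer, and files with an encrypted footer (which use a different magic) would be rejected.

```python
import struct
from pathlib import Path

PARQUET_MAGIC = b"PAR1"


def check_parquet_container(path):
    """Return (ok, reason) from a cheap structural check of a Parquet file."""
    data_path = Path(path)
    if data_path.suffix.lower() != ".parquet":
        return False, "unexpected extension"
    size = data_path.stat().st_size
    # Smallest possible layout: magic (4) + footer length (4) + magic (4).
    if size < 12:
        return False, "file too small"
    with data_path.open("rb") as handle:
        head = handle.read(4)
        handle.seek(-8, 2)  # 8 bytes before end of file
        tail = handle.read(8)
    if head != PARQUET_MAGIC:
        return False, "missing leading magic"
    if tail[4:] != PARQUET_MAGIC:
        return False, "missing trailing magic"
    (footer_len,) = struct.unpack("<I", tail[:4])
    # The footer must fit between the leading magic and the trailing 8 bytes.
    if footer_len == 0 or footer_len > size - 12:
        return False, "footer length out of range"
    return True, "ok"
```

Returning a reason string alongside the boolean makes it easier to assert on specific error messages when testing broken fixtures.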
Fixture Matrix
Use the curated PARQUET matrix to choose the right clean, edge-case, and broken fixtures for this format.
Workflow Packs