Data format
ORC Sample Files
ORC (.orc) files store optimized columnar datasets for analytics and warehouse workloads. Use sample ORC files to test reader compatibility, predicate pushdown behavior, and columnar migration workflows.
3 files
All to 50KB
SHA256 verified
Manifest included
Quick facts
Files first
ORC Sample Files — Download
Use cases
ORC Testing Workflows
Schema and Metadata Review
Open Format ORCPredicate Pushdown Validation
Open Format ORCCompare and decide
ORC Format Comparisons
FAQ and reference
ORC File FAQ
Checksum Verification
Use checksums to confirm file integrity after download.
shasum -a 256 your_file_name_here
# Compare output with SHA256 values listed above.
Where is the machine-readable manifest?
Use the manifest when you need stable names, SHA256 values, and URLs for automation.
Use in code — curl, Python, Node, wget
Copy any snippet directly into scripts, test suites, or CI pipelines. All URLs are stable and publicly accessible with no auth required.
# Download orc_metadata_sample.orc
curl -L -o orc_metadata_sample.orc \
https://samplefile.com/samples/download/data/orc/orc_metadata_sample.orc/
# Or fetch a random ORC file
curl -s "https://samplefile.com/samples/api/random?format=orc" | jq -r '.download_url'
# Download orc_metadata_sample.orc
wget -O orc_metadata_sample.orc \
https://samplefile.com/samples/download/data/orc/orc_metadata_sample.orc/
import requests
# Download a specific file
url = "https://samplefile.com/samples/download/data/orc/orc_metadata_sample.orc/"
resp = requests.get(url)
with open("orc_metadata_sample.orc", "wb") as f:
f.write(resp.content)
# Or fetch a random ORC file via API
meta = requests.get("https://samplefile.com/samples/api/random?format=orc").json()
resp = requests.get(meta["download_url"])
with open(meta["name"], "wb") as f:
f.write(resp.content)
// Download a specific file
const fs = require("fs");
const https = require("https");
const url = "https://samplefile.com/samples/download/data/orc/orc_metadata_sample.orc/";
https.get(url, (res) => {
res.pipe(fs.createWriteStream("orc_metadata_sample.orc"));
});
// Or fetch a random ORC via the API
const meta = await fetch("https://samplefile.com/samples/api/random?format=orc").then(r => r.json());
const file = await fetch(meta.download_url);
// use file.arrayBuffer(), file.body, etc.
# Random ORC file (JSON response)
GET https://samplefile.com/samples/api/random?format=orc
# All ORC files
GET https://samplefile.com/samples/api/files?format=orc
# Manifest with SHA256 checksums
GET https://samplefile.com/samples/data/orc/manifest.json
# Response includes: name, size_bytes, mime_type, sha256, download_url
Validation Methodology
- Validate extension and MIME detection before processing.
- Benchmark performance with small and larger files.
- Test malformed-input handling and error messaging.
Fixture Matrix
Use the curated ORC matrix to choose the right clean, edge-case, and broken fixtures for this format.
Open Matrix