Data format
Archivos de muestra PARQUET
Parquet (.parquet) files store columnar analytic datasets optimized for batch ingestion and warehouse-style reads. Use sample Parquet files to test ETL pipelines, schema handling, and columnar import workflows.
3 archivos
All to 50KB
SHA256 verified
Manifiesto included
Quick facts
Files first
PARQUET Sample Files — Download
Use cases
PARQUET Testing Workflows
Columnar Preview and Schema Review
Abrir formato PARQUETBinary and List Column Validation
Abrir formato PARQUETCompare and decide
PARQUET Format Comparisons
FAQ and reference
PARQUET File FAQ
Verificacion de checksum
Usa checksums para confirmar la integridad del archivo despues de descargarlo.
shasum -a 256 your_file_name_here
# Compare output with SHA256 values listed above.
Where is the machine-readable manifest?
Use the manifest when you need stable names, SHA256 values, and URLs for automation.
Use in code — curl, Python, Node, wget
Copy any snippet directly into scripts, test suites, or CI pipelines. All URLs are stable and publicly accessible with no auth required.
# Download parquet_alltypes_plain_sample.parquet
curl -L -o parquet_alltypes_plain_sample.parquet \
https://samplefile.com/samples/download/data/parquet/parquet_alltypes_plain_sample.parquet/
# Or fetch a random PARQUET file
curl -s "https://samplefile.com/samples/api/random?format=parquet" | jq -r '.download_url'
# Download parquet_alltypes_plain_sample.parquet
wget -O parquet_alltypes_plain_sample.parquet \
https://samplefile.com/samples/download/data/parquet/parquet_alltypes_plain_sample.parquet/
import requests
# Download a specific file
url = "https://samplefile.com/samples/download/data/parquet/parquet_alltypes_plain_sample.parquet/"
resp = requests.get(url)
with open("parquet_alltypes_plain_sample.parquet", "wb") as f:
f.write(resp.content)
# Or fetch a random PARQUET file via API
meta = requests.get("https://samplefile.com/samples/api/random?format=parquet").json()
resp = requests.get(meta["download_url"])
with open(meta["name"], "wb") as f:
f.write(resp.content)
// Download a specific file
const fs = require("fs");
const https = require("https");
const url = "https://samplefile.com/samples/download/data/parquet/parquet_alltypes_plain_sample.parquet/";
https.get(url, (res) => {
res.pipe(fs.createWriteStream("parquet_alltypes_plain_sample.parquet"));
});
// Or fetch a random PARQUET via the API
const meta = await fetch("https://samplefile.com/samples/api/random?format=parquet").then(r => r.json());
const file = await fetch(meta.download_url);
// use file.arrayBuffer(), file.body, etc.
# Random PARQUET file (JSON response)
GET https://samplefile.com/samples/api/random?format=parquet
# All PARQUET files
GET https://samplefile.com/samples/api/files?format=parquet
# Manifest with SHA256 checksums
GET https://samplefile.com/samples/data/parquet/manifest.json
# Response includes: name, size_bytes, mime_type, sha256, download_url
Metodologia de validacion
- Validate extension and MIME detection before processing.
- Benchmark performance with small and larger files.
- Test malformed-input handling and error messaging.
Matriz de fixtures
Usa la matriz curada de PARQUET para elegir fixtures limpios, limite y rotos para este formato.
Abrir matriz
Packs de flujo