Intelligent Data Curation
From Raw to Analytics-Ready
Transform, mask, and prepare your data with automated SCD support, PII protection, and curated data marts — all without writing a single line of code.
How It Works
Four steps to analytics-ready data
Step 1
Enable Curation on Your Tables
Select which ingested tables to curate. Autolake lets you enable curation per table and choose your source layer — raw ingestion data or an existing curated mart — so you always start from the right foundation.

And just like that, your data is analytics-ready.
Curated, masked, and governed — with full SCD support, PII protection, and automated quality checks.
Less Manual SQL
Faster Mart Deployment
Lines of Code Required
PII Compliance
Capabilities
Everything you need for
enterprise-grade curation
A complete suite of tools to transform, govern, and prepare your data for analytics.
PII Masking & Privacy
Column-level PII detection and masking. Mark sensitive fields, apply obfuscation rules, and auto-generate secure stage views that enforce your privacy policies across every mart.
SCD Type 1, 2, 3
One-click Slowly Changing Dimension support. Choose Type 1 (overwrite), Type 2 (history), or Type 3 (previous value) — Autolake handles the merge logic.
Auto-generated dbt Models
Autolake generates production-ready dbt transformation models from your mart configuration. No manual SQL scripting required.
Iceberg Table Format
All curated marts use Apache Iceberg for time-travel queries, schema evolution, and efficient partition pruning.
Schema Evolution
Automatic handling of schema changes. New columns, type changes, and renamed fields are detected and propagated through the curation pipeline.
Real-time Monitoring
Track curation job status, records processed, failures, and data quality metrics from a unified operations dashboard.
Who Benefits
Built for every role
See how Autolake curation transforms data preparation across your organization.
Key Features
- Auto-generated dbt transformation models
- Iceberg table management with schema evolution
- Pipeline orchestration with dependency chaining
Impact
- Eliminate 80% of manual SQL scripting
- Deploy SCD marts in minutes, not weeks
- Automated schema drift detection & handling
