Intelligent Data Curation
From Raw to Analytics-Ready

Transform, mask, and prepare your data with automated SCD support, PII protection, and curated data marts — all without writing a single line of code.

How It Works

Four steps to analytics-ready data

1

Step 1

Enable Curation on Your Tables

Select which ingested tables to curate. Autolake lets you enable curation per table and choose your source layer — raw ingestion data or an existing curated mart — so you always start from the right foundation.

Curation source selection interface

And just like that, your data is analytics-ready.

Curated, masked, and governed — with full SCD support, PII protection, and automated quality checks.

0%

Less Manual SQL

0x

Faster Mart Deployment

0

Lines of Code Required

0%

PII Compliance

Capabilities

Everything you need for
enterprise-grade curation

A complete suite of tools to transform, govern, and prepare your data for analytics.

PII Masking & Privacy

Column-level PII detection and masking. Mark sensitive fields, apply obfuscation rules, and auto-generate secure stage views that enforce your privacy policies across every mart.

GDPR CompliantHIPAA ReadyColumn-level ControlAuto-detection

SCD Type 1, 2, 3

One-click Slowly Changing Dimension support. Choose Type 1 (overwrite), Type 2 (history), or Type 3 (previous value) — Autolake handles the merge logic.

Type 1Overwrite
Type 2Full History
Type 3Previous Value

Auto-generated dbt Models

Autolake generates production-ready dbt transformation models from your mart configuration. No manual SQL scripting required.

-- auto-generated by Autolake
SELECT *
FROM {{ ref('stg_customers') }}
WHERE _scd_active = true

Iceberg Table Format

All curated marts use Apache Iceberg for time-travel queries, schema evolution, and efficient partition pruning.

Schema Evolution

Automatic handling of schema changes. New columns, type changes, and renamed fields are detected and propagated through the curation pipeline.

Real-time Monitoring

Track curation job status, records processed, failures, and data quality metrics from a unified operations dashboard.

Real-time
Jobs Monitored
Automated
Quality Checks
Multi-channel
Alert Channels

Who Benefits

Built for every role

See how Autolake curation transforms data preparation across your organization.

Key Features

  • Auto-generated dbt transformation models
  • Iceberg table management with schema evolution
  • Pipeline orchestration with dependency chaining

Impact

  • Eliminate 80% of manual SQL scripting
  • Deploy SCD marts in minutes, not weeks
  • Automated schema drift detection & handling

Ready to curate your data lake?

Transform raw data into governed, analytics-ready assets with Autolake's automated curation pipeline.