visibility Static layout preview of Automated Workflows (Pipeline Runner), shown with a file imported, a four-step pipeline configured, and a completed run (results + per-step summary). All pages →

Automated Workflows

Run several tools in a row — save the steps once, reuse them anytime.

upload_file Drag and drop file here Up to 1.5 GB · CSV, TSV, XLSX, XLS · encoding & delimiter auto-detected
customers_export.csv 2.1 MB
Preview: customers_export.csv

18,442 rows, 6 columns

nameemailcityphonesignup_date
0 Jane Doe jane@acme.ioAustin512-555-01902024-01-04
1jane doeJANE@ACME.IOaustin(512) 555-019001/04/2024
2Bob Smithbob@globex.comDenver720.555.77812024-02-11
3R. Smithbob@globex.com720-555-7781Feb 11 2024

Options
Use the recommended default (text-clean → format → missing → dedup) Build interactively Import a saved pipeline JSON

Edit the table to add, remove, reorder (drag the row index), enable, or configure each step. Tool order is recommended, not enforced — violations surface as warnings below the table.

Tool Enabled Options (JSON)
≡ 0 text_clean expand_more check {"trim": true, "collapse_whitespace": true}
≡ 1 format_standardize expand_more check {"column_types": {"phone": "phone", "signup_date": "date"}}
≡ 2 missing expand_more check {"strategy": "flag", "sentinels": ["N/A", "—"]}
≡ 3 dedup expand_more check {"survivor_rule": "most_complete", "merge": true}
Add row
Recommended tool order — why each step belongs where it does

text_clean before format_standardize — format parsers (phone / currency / date) fail on smart-quote-contaminated or NBSP-padded input — clean text first

text_clean before missing — sentinel detection misses cells padded with NBSP / zero-width characters — clean text first

text_clean before dedup — fuzzy matching treats NBSP-padded values as different — clean text first

format_standardize before missing — numeric imputation needs numeric dtypes; canonical phones / currencies improve sentinel detection

format_standardize before dedup — canonical phones / lowercase emails enable cross-format duplicate matching

missing before dedup — deduping rows with mixed NaN sentinels produces brittle merges — resolve missing values first



Results

Initial rows
18,442
Final rows
18,130
Steps run
4
Elapsed
1.84 s

Per-step summary

stepstatuselapsed_mssummaryerror
text_clean ok 214 {"cells_changed": 1204, "columns": ["name", "city"]}
format_standardize ok 388 {"phone": 18301, "signup_date": 17996}
missing ok 121 {"flagged_cells": 642, "sentinels_found": ["—"]}
dedup ok 911 {"input_rows": 18442, "output_rows": 18130, "duplicates_removed": 312, "groups": 147}

Output preview (first 10 rows)

nameemailcityphonesignup_date
0Jane Doejane@acme.ioAustin+1 512-555-01902024-01-04
1Bob Smithbob@globex.comDenver+1 720-555-77812024-02-11
2Carla Reyescarla@initech.coPhoenix+1 480-555-33202024-03-02
3Dan Okafordan@umbrella.net⚑ missing+1 206-555-77452024-03-18
4Emily Tranemily@hooli.comSeattle+1 206-555-11822024-04-05