diff --git a/layout-review/02_text_cleaner.html b/layout-review/02_text_cleaner.html
index 6f49c95..3473806 100644
--- a/layout-review/02_text_cleaner.html
+++ b/layout-review/02_text_cleaner.html
@@ -54,7 +54,12 @@ Preview: contacts_messy.csv

4,120 rows, 4 columns

-
check Show hidden characters in preview
+
check Show hidden characters
+
+ · Whitespace
+ Smart / special
+ Control
+
@@ -82,7 +87,11 @@ minimal paranoid
-excel-hygiene: trim, collapse whitespace, fold smart quotes, strip invisible chars, normalize line endings, NFC.
+
+ minimal: trim and collapse whitespace only — no character substitutions.
+ excel-hygiene: trim, collapse whitespace, fold smart quotes, strip invisible chars, normalize line endings, NFC.
+ paranoid: everything in excel-hygiene plus strip control characters, strip BOM, and NFKC compatibility fold (lossy).
+
@@ -143,17 +152,20 @@
Columns processed
4
-
check Show hidden characters (NBSP, ZWSP, smart quotes, control chars…)
+
+
check Show hidden characters (NBSP, ZWSP, smart quotes, control chars…)
+
Same setting as “Show hidden characters” in the preview above — toggling either updates both.
+

Changes by column

name email company notes
-          cells_changed
+column    cells_changed
-company   1,604
-name      1,210
-notes     982
-email     151
+company   1,604
+name      1,210
+notes     982
+email     151
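
The three preset descriptions added in this file's second hunk read as cumulative pipelines. A minimal Python sketch of that reading follows. The function name `clean_cell`, the character sets, and the step order are assumptions made for illustration; the diff does not show the tool's actual implementation.

```python
import re
import unicodedata

# Hypothetical character sets; the tool's real lists are not shown in the diff.
SMART_QUOTES = {"\u2018": "'", "\u2019": "'", "\u201c": '"', "\u201d": '"'}
INVISIBLE = "\u200b\u200c\u200d\u2060"  # zero-width space/joiners, word joiner


def clean_cell(text: str, preset: str = "excel-hygiene") -> str:
    """Apply one of the three presets described in the diff (sketch only)."""
    if preset not in ("minimal", "excel-hygiene", "paranoid"):
        raise ValueError(f"unknown preset: {preset}")

    if preset != "minimal":
        # excel-hygiene: normalize line endings, fold quotes, strip invisibles, NFC
        text = text.replace("\r\n", "\n").replace("\r", "\n")
        for smart, plain in SMART_QUOTES.items():
            text = text.replace(smart, plain)
        text = text.translate({ord(ch): None for ch in INVISIBLE})
        text = unicodedata.normalize("NFC", text)

    if preset == "paranoid":
        # paranoid: also strip BOM and control characters, then NFKC (lossy)
        text = text.replace("\ufeff", "")
        text = "".join(
            ch for ch in text
            if ch in "\n\t" or unicodedata.category(ch) != "Cc"
        )
        text = unicodedata.normalize("NFKC", text)

    # All presets: collapse runs of spaces/tabs, then trim
    text = re.sub(r"[ \t]+", " ", text)
    return text.strip()
```

Under these assumptions, `minimal` leaves smart quotes untouched, `excel-hygiene` folds them to ASCII, and only `paranoid` expands compatibility characters such as the `ﬁ` ligature, which is why the description marks it as lossy.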
diff --git a/layout-review/03_format_standardizer.html b/layout-review/03_format_standardizer.html
index a0bff95..8faacb3 100644
--- a/layout-review/03_format_standardizer.html
+++ b/layout-review/03_format_standardizer.html
@@ -76,18 +76,23 @@

Format options

- +
- US (default) — ISO 8601 dates · E.164 phones · USD
- European — DMY input · INTL phones · EUR comma decimal
+ US (default) — ISO 8601 dates · E.164 phones · USD
+ European — DMY input · INTL phones · EUR comma decimal base
UK — DD/MM/YYYY · GB phones · Yes/No booleans
ISO Strict — ISO 8601 · bare-number currency · true/false
Legacy US — MM/DD/YYYY · National phones · Yes/No
- Custom — keep current settings
+ Custom — based on European, 2 controls changed modified
-
Pick a published standard or regional convention as the baseline. Every option below is still individually overridable.
+
+ rule
+ Individual controls win over the preset. You started from European, then changed Ambiguous input order and Decimal separator below — so the preset is now Custom. The controls' current values are what actually run.
+
+
Pick a published standard or regional convention as the baseline. Every option below is still individually overridable; overriding any one switches the preset to Custom.
@@ -97,11 +102,12 @@

Dates

YYYY-MM-DD (ISO)
- +
MDY (US) DMY (EU)
+
Winning value: MDY. Overrides the European base (DMY) — 01/02/2024 reads as 2024-01-02.

Phones

@@ -117,11 +123,12 @@

Currency

- +
dot (1,234.56) comma (1.234,56)
+
Winning value: dot. Overrides the European base (comma) — $1,234.5 reads as 1234.50.
2
Preserve original precision (don't round)
@@ -154,9 +161,30 @@
info
- 47 cell(s) in typed columns didn't match a recognizable shape and were left as-is. Check the changes audit below to find them, or re-classify the column to (skip).
+ 47 cell(s) in typed columns didn't match a recognizable shape and were left as-is. See Unparseable cells below to review them, or re-classify the column to (skip). (They aren't in the changes audit — nothing was changed.)
+ +
+ Unparseable cells (47) +
+

Cells in typed columns that didn't match a recognizable shape and were left unchanged.

+
+row     column        field_type   value (left as-is)
+318     signup_date   date         soon
+902     phone         phone        ext. 4471
+1,544   amount        currency     TBD
+2,087   active        boolean      maybe
+3,610   signup_date   date         00/00/0000
+
+

… and 42 more.

+
+
+

Changes by column

@@ -194,6 +222,7 @@

Standardized preview (first 10 rows)

+

Showing 5 of 6 columns — notes is set to (skip), so it's omitted here.
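
The "individual controls win over the preset" rule added in this file, together with its two "Winning value" notes, can be sketched in Python as below. The `PRESETS` table and the names `resolve_settings`, `parse_ambiguous_date`, and `parse_amount` are hypothetical, and only the two controls named in the diff are modeled. This is an illustration of the stated rule, not the tool's code.

```python
from datetime import date
from decimal import Decimal

# Hypothetical preset table; only the two controls named in the diff are modeled.
PRESETS = {
    "US": {"ambiguous_order": "MDY", "decimal_sep": "."},
    "European": {"ambiguous_order": "DMY", "decimal_sep": ","},
}


def resolve_settings(base: str, overrides: dict) -> tuple:
    """Overlay individual controls on a base preset; overrides win."""
    settings = dict(PRESETS[base])
    changed = {k: v for k, v in overrides.items() if settings.get(k) != v}
    settings.update(changed)
    # Any control that differs from the base switches the label to Custom.
    label = "Custom" if changed else base
    return label, settings


def parse_ambiguous_date(text: str, order: str) -> date:
    """Parse NN/NN/YYYY using the winning ambiguous-input order."""
    first, second, year = (int(part) for part in text.split("/"))
    if order == "MDY":
        month, day = first, second
    else:
        day, month = first, second
    return date(year, month, day)


def parse_amount(text: str, decimal_sep: str) -> Decimal:
    """Parse a currency string using the winning decimal separator."""
    group_sep = "," if decimal_sep == "." else "."
    digits = text.strip().lstrip("$€£").replace(group_sep, "")
    return Decimal(digits.replace(decimal_sep, "."))
```

Starting from `European` and overriding both controls yields the label `Custom` with `MDY` and dot as the running values, so `01/02/2024` parses to `2024-01-02` and `$1,234.5` parses to `1234.5` (shown as `1234.50` at two decimal places), matching the examples in the hunk.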

diff --git a/layout-review/06_outlier_detector.html b/layout-review/06_outlier_detector.html
index 546a847..4d73b4c 100644
--- a/layout-review/06_outlier_detector.html
+++ b/layout-review/06_outlier_detector.html
@@ -12,7 +12,7 @@
visibility
- Static layout preview of Find Unusual Values — a Coming Soon tool. The page is a stub/teaser: an "under development" notice, a list of planned features, and disabled placeholder controls (only the file uploader is live). All pages →
+ Static layout preview of Find Unusual Values — a Coming Soon tool. The page is a stub/teaser: an "under development" notice and disabled placeholder controls. All pages →
@@ -31,40 +31,14 @@ This tool is under development.
- -

-Features:
-  • Z-score detection (configurable threshold)
-  • IQR (interquartile range) detection
-  • MAD (median absolute deviation) detection
-  • Domain-rule violations (e.g., age < 0, price > $1M)
-  • Visual outlier highlighting in data preview
-  • Handling: flag only, remove, cap/winsorize to bounds
- - -
-
- upload_file Drag and drop file here
- CSV, TSV, XLSX, XLS · Import a file to preview. Processing is not yet available.
-
- -
-

Detection Method

-
Z-Score
-
- -
- -
3.0
+
IQR (interquartile range)
full_name   phone   amount   signup_date   active