docs+code: rename tool labels everywhere

Sweep follow-up to 93e43fc. Display labels now consistent across docs,
landing pages, CLI output, code comments, docstrings, and test prose.

Five parallel surfaces touched:
- docs (EN + ES): README, USER-GUIDE, CLI-REFERENCE, and 11 internal
  design/planning docs
- landing pages: index + bookkeeper/revops/shopify-pet
- src: CLI module docstrings, _TOOL_DISPLAY dicts in cli_analyze.py
  and gui/components/_legacy.py, core module headers, every tool
  page's module docstring
- tests: class/method/module docstrings and section-header comments
- test-cases READMEs

Page slugs (1_Deduplicator etc.), tool_id strings (01_deduplicator
etc.), Python class names (TestDeduplicatorWorkflow, FeatureFlag.*),
URL paths, anchor IDs, CSS classes, and asset filenames were left
intact since they're code identifiers / structural references.

All 2033 tests pass.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
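
The label/identifier split described above can be sketched roughly as follows. This is an illustration only: the diff does not show the real shape of `_TOOL_DISPLAY`, so the dict layout and the `display_label` helper are assumptions, and every `tool_id` other than `01_deduplicator` is a hypothetical name chosen by analogy.

```python
# Hypothetical sketch: the stable tool_id is the key, the display label is
# the value. Only "01_deduplicator" and the three labels come from the commit.
_TOOL_DISPLAY = {
    "01_deduplicator": "Find Duplicates",
    "02_text_cleaner": "Clean Text",                  # assumed tool_id
    "03_format_standardizer": "Standardize Formats",  # assumed tool_id
}


def display_label(tool_id: str) -> str:
    """Return the user-facing label, falling back to the raw tool_id."""
    return _TOOL_DISPLAY.get(tool_id, tool_id)


if __name__ == "__main__":
    # The identifier is unchanged; only the text shown to users differs.
    print(display_label("01_deduplicator"))
```

Keeping the identifiers as keys means a rename like this one only touches the values, which is why slugs, class names, and URL paths could stay intact.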
@@ -14,7 +14,7 @@ Enter your full name + email, paste the license blob from your purchase email (s
 
 | Tier | Tools |
 |---|---|
-| **Lite** | Deduplicator · Text Cleaner · Format Standardizer |
+| **Lite** | Find Duplicates · Clean Text · Standardize Formats |
 | **Core** | All 9 tools |
 
 A Lite user opening a Core-only tool sees an "Upgrade your license" prompt. The home page also shows a 🔒 Locked badge on tool cards your tier doesn't unlock. To upgrade, paste a Core blob on the Activate page.
@@ -53,15 +53,15 @@ Full numbered support matrix: [REQUIREMENTS.md](REQUIREMENTS.md).
 
 | # | Tool | Purpose | Status |
 |---|------|---------|--------|
-| 01 | Deduplicator | Exact + fuzzy match, 5 normalizers, audit | Ready |
-| 02 | Text Cleaner | Whitespace, smart chars, BOM, line endings, case ops | Ready |
-| 03 | Format Standardizer | Dates / phones / emails / addresses / names / currencies / booleans | Ready |
-| 04 | Missing Value Handler | Disguised nulls, imputation, drop-by-threshold | Coming Soon |
-| 05 | Column Mapper | Rename + enforce schema | Coming Soon |
-| 06 | Outlier Detector | z-score, IQR, multivariate | Coming Soon |
-| 07 | Multi-File Merger | Combine multiple files | Coming Soon |
-| 08 | Validator & Reporter | Rules + PDF/Excel report | Coming Soon |
-| 09 | Pipeline Runner | One-click multi-tool launcher | Coming Soon |
+| 01 | Find Duplicates | Exact + fuzzy match, 5 normalizers, audit | Ready |
+| 02 | Clean Text | Whitespace, smart chars, BOM, line endings, case ops | Ready |
+| 03 | Standardize Formats | Dates / phones / emails / addresses / names / currencies / booleans | Ready |
+| 04 | Fix Missing Values | Disguised nulls, imputation, drop-by-threshold | Coming Soon |
+| 05 | Map Columns | Rename + enforce schema | Coming Soon |
+| 06 | Find Unusual Values | z-score, IQR, multivariate | Coming Soon |
+| 07 | Combine Files | Combine multiple files | Coming Soon |
+| 08 | Quality Check | Rules + PDF/Excel report | Coming Soon |
+| 09 | Automated Workflows | One-click multi-tool launcher | Coming Soon |
 
 **Sample data** (`samples/`): `messy_sales.csv`, `bank_export.xlsx`.
 
@@ -89,17 +89,17 @@ Get help: `deduplicator --help`. Full reference: [CLI-REFERENCE.md](CLI-REFERENC
 
 ### 3.3 Run order (when running tools manually)
 
-If you skip the Pipeline Runner, follow this order:
+If you skip Automated Workflows, follow this order:
 
-1. **02 Text Cleaner** first — normalizes whitespace + special chars.
-2. **03 Format Standardizer** — dates, phones, etc. need cleaned text.
-3. **04 Missing Value Handler** — sentinel codes hide as numbers.
-4. **05 Column Mapper** — schema before outlier stats.
-5. **06 Outlier Detector** — needs clean numerics. Stats on data with `NaN` or `-999` are mathematically poisoned.
-6. **07 Multi-File Merger**, **08 Validator** as needed.
-7. **01 Deduplicator** is order-flexible (normalizes internally for matching).
+1. **02 Clean Text** first — normalizes whitespace + special chars.
+2. **03 Standardize Formats** — dates, phones, etc. need cleaned text.
+3. **04 Fix Missing Values** — sentinel codes hide as numbers.
+4. **05 Map Columns** — schema before outlier stats.
+5. **06 Find Unusual Values** — needs clean numerics. Stats on data with `NaN` or `-999` are mathematically poisoned.
+6. **07 Combine Files**, **08 Quality Check** as needed.
+7. **01 Find Duplicates** is order-flexible (normalizes internally for matching).
 
-The Pipeline Runner enforces this automatically.
+Automated Workflows enforces this automatically.
 
 ### 3.4 Language
 
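
The manual run order in section 3.3 of the hunk above can be sketched as a small sort. This is a hedged illustration, not the project's code: `RUN_ORDER` and `order_tools` are hypothetical names, tools are identified by their two-digit numbers from the table, and tool 01 is placed last only because the list puts it there (the doc calls it order-flexible).

```python
# Hypothetical sketch of the section 3.3 run order, keyed by tool number.
# 09 (the multi-tool launcher) is left out because it runs the others.
RUN_ORDER = ["02", "03", "04", "05", "06", "07", "08", "01"]


def order_tools(selected):
    """Sort the selected tool numbers into the recommended manual order."""
    rank = {num: position for position, num in enumerate(RUN_ORDER)}
    unknown = sorted(set(selected) - set(rank))
    if unknown:
        raise ValueError(f"unknown tool numbers: {unknown}")
    # set() drops duplicates so each tool runs once.
    return sorted(set(selected), key=rank.__getitem__)


if __name__ == "__main__":
    print(order_tools(["01", "06", "02"]))
```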