DataTools / for 1099 prep
$49 — one-time, no subscription Get DataTools →
For accounts payable · 1099-NEC season · vendor master cleanup

Build a clean 1099 vendor list —
with the missing EINs filled in.

The same vendor got entered three times across the year — one row has the EIN, another the address, another the phone — and now it's January and you can't file because the numbers are scattered. DataTools consolidates each vendor to one row and backfills the gaps from the duplicates: in our sample, 24 messy records become 8 complete vendors with 7 missing EINs recovered from duplicate rows. Your data never leaves your computer.

Get DataTools for Accounting — $49 → Try the live demo ↓ One-time payment · cross-platform · runs offline
24→8
messy records to complete vendors
7
missing EINs recovered
0
cloud uploads ever
If any of these sound like your January

Five pains DataTools fixes in one pass

🧾

The same vendor is in the list two or three times

Different staff entered "Acme LLC", "Acme, L.L.C.", and "ACME Llc" across the year. Each is a separate row in the vendor master, and each only holds part of the story — so your 1099 totals split across three near-duplicate spellings.

What it costs: hours of manual matching, plus the risk of filing the wrong total.

🔢

The EIN is on a different row than the rest of the details

One record captured the EIN at onboarding; the row you actually paid against doesn't have it. At 1099 time the field is blank even though you collected it months ago — it's just sitting on a duplicate.

What it costs: chasing W-9s you already have on file.

📵

Phones, addresses, and amounts are formatted five different ways

Remittance phone as (212) 555-0147 on one row and 212.555.0147 on another. Amounts with stray $ and commas. The export won't reconcile and the 1099-NEC box totals don't tie out.

What it costs: a half-day reconciling before you can even start filing.

You don't know which EINs are genuinely missing

Some EINs are recoverable from a duplicate row. Some you never collected. Until the list is consolidated you can't tell the two apart — so you either over-chase vendors or under-file.

What it costs: late filings and TIN-mismatch penalties.

📤

Your QuickBooks vendor export doesn't match your AP ledger

The vendor master in QuickBooks, the payments spreadsheet, and the W-9 tracker each use different column names for "vendor name" / "Tax ID" / "amount paid." Merging them is an afternoon of manual rename before any analysis begins.

What it costs: 4–8 hours per filing season manually merging exports.

🔒

Cloud cleaners want you to upload your vendor master

Your vendor master holds EINs, remittance addresses, and payment history — exactly the data you should not be uploading to a SaaS to clean. DataTools is desktop-only — your vendor list never leaves your computer.

What it costs: nothing — and that's the point.

Live demo · runs in your browser

Try it on a real-looking vendor master export

The demo below loads a sample 24-row vendor file with the pollution we've seen in real AP systems: the same vendor entered two or three times under slightly different spellings, EINs that live on one duplicate row but not the one you paid against, phones and amounts formatted five ways, and the usual mess of N/A / (blank) / ? sentinels. Click Run pipeline and watch the 24 records collapse to 8 complete vendors with 7 EINs recovered in under a second.

Demo runs on free hosting (Streamlit Community Cloud). Capped at 100 input rows · output watermarked with one trailing row. The paid product has no caps and runs entirely offline.
Built for the accounts-payable team

Five workflows you do every filing season

🧹

Vendor-master consolidation

Catches the same vendor that shows up as Acme LLC, Acme, L.L.C., and ACME Llc. Fuzzy match merges the spellings; the dedup merge collapses them to one row and backfills the gaps from each duplicate.

🔢

EIN backfill & missing-EIN flagging

Pulls the EIN off whichever duplicate row captured it and fills it into the survivor. The EINs that are genuinely missing get flagged so you know exactly which W-9s to chase.

💵

1099-NEC amount roll-up

Before filing: standardize amounts, drop sentinels-as-missing, and merge so each vendor's total paid lands on one row and ties to your AP ledger.

📥

QuickBooks vendor export cleanup

Whitespace in Tax IDs, near-identical vendor names, copy-paste smart quotes in remittance addresses — gone. Audit log shows every change for your reviewer.

🔗

Merging the W-9 tracker into the AP ledger

The vendor master, the payments spreadsheet, and the W-9 tracker each name "Tax ID" differently. Map Columns aligns them; the dedup merge consolidates across all three sources.

⚙️

Repeatable pipeline

Save the cleanup as a JSON file. Drop next year's vendor export on it. Same consolidation, zero re-configuration. Automatable via the CLI.

The thing every cloud cleaner can't say

Your vendor master never leaves your computer.

DataTools is a desktop app. There's no upload step, no SaaS account, no subscription, no "trust our security policy." The first thing you can do after install is open your browser's network tab, run the cleaner on your real vendor file, and verify zero outbound requests.

Why it matters for AP: your vendor master holds EINs, remittance addresses, and payment history. Cloud cleaners require you to upload it. We don't.
$ python -m src.cli_pipeline vendor_1099.csv --pipeline vendor_1099_pipeline.json --apply Reading vendor_1099.csv... 24 rows, 9 columns Executing pipeline: text_clean (38 ms) {cells_changed: 41} format_standardize (62 ms) {cells_changed: 36} # phones, EINs, amounts missing (11 ms) {sentinels_standardized: 9} dedup (140 ms) {groups_merged: 8, rows_removed: 16, eins_backfilled: 7} Initial rows: 24 → Final rows: 8 (8 complete vendors) EINs recovered from duplicate rows: 7 | Still missing (flagged): 1 Unparseable cells: 0 Total elapsed: 0.25 s $ # zero network calls. zero. promise.
For when your reviewer asks "what changed?"

Every change auditable. Every cell logged.

Every modification is recorded with the original value, the new value, and which rule fired. Hand the audit CSV to your controller, your reviewer, or the IRS-ready workpaper file along with the cleaned vendor list. No "I trust the AI" hand-waving — they see exactly which EIN came from which duplicate row.

Real example: the demo above merged 24 records into 8 vendors and backfilled 7 EINs. The dedup audit lists every vendor group with the survivor, its merged-in duplicates, and the source row each recovered EIN was pulled from. The standardize audit lists every phone, amount, and Tax ID it reformatted.
If your vendors are messy — most AP files are

EINs, phones, addresses, and amounts in every shape.

One row has the EIN as 12-3456789, another as 123456789. The remittance phone is (212) 555-0147 on one and 212.555.0147 on the next. An amount reads $12,410.75 with a stray space. Excel treats half of these as text errors. DataTools normalizes every one — EINs to a single format, phones to E.164, amounts to clean numerics — so the file reconciles and the 1099 box totals tie out.

In the bundle

Six tools. One pipeline. One $49 download.

1 · Find Duplicates

Fuzzy match (Jaro-Winkler), 5 normalizers, survivor rules, gap-backfill merge, interactive review.

2 · Clean Text

Whitespace, smart chars, NBSP, BOM, line endings, case ops.

3 · Standardize Formats

EINs, amounts, dates, phones, emails, addresses, names, booleans.

4 · Fix Missing Values

Disguised-null detection, profile, flag genuinely-missing fields, drop strategies.

5 · Map Columns

Fuzzy auto-rename, target schema, type coercion, required-field defaults.

6 · Automated Workflows

Chain tools in recommended order, save/load JSON, automate next year's vendor cleanup.

Pricing — pay once, own it

$49. No subscription. No ceiling on rows or files.

$149
one-time

Full DataTools Suite

Available when 3+ bundles ship. Includes everything in the 1099-prep pack plus the Bookkeeper and Accounts-Receivable bundles. Save $48.

Coming when ready

Questions

Does this work with my QuickBooks vendor export?

Yes — the input is just CSV / Excel from any source. Your QuickBooks vendor export works the same as a Xero export, a Bill.com download, or a vendor spreadsheet you maintain by hand. The cleaner doesn't care where the file came from.

How does this compare to Excel's "Remove Duplicates"?

Excel does exact deduplication and only deletes — it never backfills. Acme LLC and Acme, L.L.C. are different vendors to Excel, and even when it does catch a duplicate it throws the extra row away, taking the EIN with it. DataTools fuzzy-matches across spelling drift, merges the group to one survivor, and pulls the missing EIN, phone, and address off the rows it merges in.

How does it recover a missing EIN?

When it merges a group of duplicate vendor rows, it keeps the survivor and backfills any empty field — including the EIN — from whichever duplicate row had it. In the sample file, 7 of the 8 vendors had their EIN recovered this way; the 1 that's truly missing gets flagged so you know to chase the W-9.

Do I need to know Python to use it?

No. The GUI is a browser interface that opens automatically when you double-click the app. It loads your vendor file, you click Run, you download the cleaned list. The CLI is there for power users who want to script next year's cleanup.

What about my data privacy?

Your vendor master — EINs, remittance addresses, payment history — never leaves your computer. There is no cloud component, no telemetry, no "anonymous usage stats." When the app is running you can confirm zero outbound network requests in your browser's developer tools.

What's your refund policy?

Try the live demo above on the sample vendor dataset before you buy. If you still find DataTools doesn't fit your workflow within 14 days, email for a refund — no questions asked.

Will there be updates?

Yes. The v1.x line is included free for everyone who buys DataTools today. We ship a patch every 30 days adding format support, edge-case fixes, and small features.

Stop chasing scattered EINs by hand.

One $49 download. Mac, Windows, or Linux. Runs offline. Consolidates 24 messy records into 8 complete vendors, recovers the 7 EINs hiding on duplicate rows, flags the ones genuinely missing, and saves a pipeline you can re-run on next year's vendor export.

Get DataTools for Accounting — $49 →