HomeBlog › Building a local AI tax document organizer

Why I Built a Local, Offline AI to Organize Tax Documents

By Jacob Lavoie · June 2026 · ~6 min read · Build in public

The short version: I built PDF Insight — a desktop app that turns a client's messy pile of T4, RL-1, T5 and other slips into one correctly-ordered, merged PDF — and I made it run entirely on the accountant's own computer. The AI, the OCR, the merging: all on-device. No client tax document ever leaves the machine. This is the story of why I chose local over cloud, and what the tool actually does.

I'm an accountant. Every tax season I do the same unglamorous thing hundreds of times: a client drops off (or emails, or photographs) a stack of slips, and before I can do any real work I have to sort it. Which page is the T4 and which is the matching RL-1? Where's the T5, the T4A, the RRSP receipt? Then I scan, rename, reorder and merge it all into one clean file. It's an hour of paper-shuffling that nobody bills well for and nobody enjoys.

The moment I almost pasted a client's T4 into ChatGPT

Like a lot of people, my first instinct was to throw the problem at a chat AI. I had a folder open, a scanned slip in front of me, and my cursor was hovering over the upload button. Then I stopped. That document had my client's name, their SIN region, their employer, their income. I had no idea where that file would land, who could see it, or how long it would sit on someone else's server.

For a Canadian accountant that's not a vague worry — it's the core of the job. We hold the most sensitive financial data people have, and we're expected to guard it. The mainstream tools (TaxDome, SmartVault, Canopy, Dext) are genuinely good, but they're cloud-first: the files go up to be processed. Pasting slips into a general chatbot is worse. I didn't want a better cloud. I wanted the problem to never leave my desk.

So I built it to run on my own machine

The decision that shaped everything: PDF Insight is local-first. The AI model that reads and classifies a slip, the OCR that handles a crooked phone photo of a paper T4, the logic that orders the bundle and merges it — all of it runs on the accountant's own Mac, Windows or Linux computer. Once it's installed, it works offline. There's no upload, no account that holds your files, no server round-trip.

This isn't a marketing absolute about being "more secure than" some competitor — that's not a claim I can prove about your setup. It's a simpler, checkable fact: the file stays on the machine you control. For a solo accountant or a small firm, that's an answer you can actually give a client when they ask where their documents go.

What it actually does

You point it at a client folder. From there it:

  1. Reads every page, including scans and photos, using on-device OCR.
  2. Classifies each slip — T4, T4A, T4A(P), T4E, T5, T3, T5008, T4RSP, and the Québec relevés like RL-1 and RL-31.
  3. Orders the bundle consistently, keeping federal/provincial pairs together (T4 with RL-1, T5 with RL-3).
  4. Exports one merged PDF per client, ready for your working file.

It's bilingual EN/FR by design, because I built it in Quebec where half my documents arrive in French. A relevé labelled "Relevé 1" and a T4 are the same person's employment income, and the tool treats them that way. On my 16GB Mac, a full 11-document bundle goes from pile to ordered, merged PDF in about 100 seconds.

The old wayWith PDF Insight
Open each scan, guess the slip type, rename itLocal AI classifies each slip automatically
Drag pages into the right order by handOrdered to a consistent rule, RL/T pairs kept together
Merge into one PDF in a separate toolOne merged PDF exported in a single pass
Upload to a cloud service to use AIEverything runs on your own machine, offline

Always verify before filing. PDF Insight organizes and merges the file; it doesn't replace your review. Slip names and box numbers change yearly — confirm specifics against current CRA and Revenu Québec guidance, and check the final PDF.

Building it in public

I'm shipping this the way I wish more accounting software was made: honest claims, real numbers, and a free trial so you can judge it on your own hardware instead of a demo video. If you want the full list of what to collect before you even run it, I wrote a companion Canadian & Québec tax document checklist, and there's a walkthrough of the accountant workflow for organizing client documents.

If you've ever hesitated over that upload button the way I did, I built this for you. Download the trial, run it on a folder of your own slips, and see whether 100 seconds of local processing beats your current hour of shuffling.

Try it on your own machine

PDF Insight classifies, orders and merges Canadian and Québec tax slips locally — no client data leaves your computer. 14-day free trial, no card required. Or skip the subscription with Founder Lifetime, $399 CAD once.

Download the free trial Get Founder Lifetime — $399

FAQ

Does PDF Insight send client tax documents to the cloud?

No. The AI model, OCR and merging all run on your own Mac, Windows or Linux computer. The files never leave the machine — that's the entire reason I built it as a local app instead of a web tool.

What slips can it organize for a Canadian or Québec return?

Common federal slips like T4, T4A, T5, T3 and T4RSP, plus Québec RL slips such as RL-1 and RL-31. It's bilingual, so French labels and relevés are handled natively. Verify the final file before you file.

How fast is it, and do I need a powerful computer?

On a 16GB Mac it organizes an 11-document bundle into one merged PDF in roughly 100 seconds, and it works offline once installed. The free trial lets you test it on your own hardware first.