Beyond Tax Slips: 5 Ways to Use a Local-First PDF Sort-and-Merge Tool
Short answer: PDF Insight was built so accountants could turn a pile of Canadian tax slips into one ordered, cited PDF — on their own machine, with nothing uploaded. But the engine underneath isn't accounting-specific. You tell it the order you want in plain English, it classifies your PDFs, sorts them, and merges them into a single file with page citations — all on-device. If your job involves assembling messy stacks of PDFs into one clean document, this post is for you. It is a narrow tool that does one thing well; it is not a case-management or storage system.
If you have ever inherited a folder of forty PDFs and had to drag them into the "right" order by hand — by date, by type, by person, by case — you know the tax-slip problem isn't really a tax problem. It's a sorting and merging problem that shows up in law, medicine, real estate, research, and HR every single day.
Here's the whole idea: point PDF Insight at a folder of PDFs, type the order you want the way you'd explain it to a colleague — "chronological, contracts first, then correspondence" — and it sorts and merges everything into one ordered PDF with a citation telling you which source page each section came from. The classification, the OCR for scanned pages (Tesseract, English and French), and the merge all run locally by default via on-device AI (Ollama). Nothing leaves your machine in the local tier. There is an optional paid cloud lane for speed, but it's off until you turn it on.
Below are five jobs outside accounting where that matters.
1. Lawyers and paralegals — building exhibit and disclosure bundles
A litigation file is a pile of PDFs that has to become one paginated bundle in a defensible order. Doing it by hand in Acrobat is slow and error-prone, and the moment a new document arrives, the whole sequence shifts.
"Order these chronologically by document date. Put the signed contract first, then all correspondence in date order, then invoices, then the photographs last."
The before is a paralegal dragging thumbnails for an afternoon and re-checking every page number. The after is a single merged PDF in the order you specified, with page citations that tell you exactly which source file each section came from — so when opposing counsel asks where exhibit 12 starts, you can answer in seconds.
Why local-first is non-negotiable here: these are privileged client documents. Settlement drafts, witness statements, financial disclosure. With PDF Insight's local tier, none of it is uploaded to a vendor's servers to be sorted — the data stays on the one machine you already control and secure. You are not adding a third party to the list of people who have touched a privileged file.
2. Medical clinics and allied health — assembling a patient record for referral
When you refer a patient out, or respond to a records request, you assemble a chronological packet: intake forms, lab results, imaging reports, consult letters, scanned handwritten notes. They arrive as separate PDFs in no order at all.
"Sort by service date, oldest first. Group lab results together, then imaging, then consult letters. OCR the scanned pages."
The before is a front-desk staffer manually ordering a 30-page record and hoping nothing is out of sequence. The after is one merged, date-ordered PDF, with scanned pages made searchable by on-device OCR, and citations pointing back to each source.
Why local-first is the killer angle for clinics: patient records are among the most sensitive documents there are, and uploading them to a cloud tool just to sort them widens your breach surface and raises real privacy questions. PDF Insight processes everything on the device in front of you — classification, OCR, merge — so the record never leaves the clinic during assembly. (This is a statement about where the data flows, not legal or compliance advice; confirm against your own obligations.)
3. Real-estate agents and brokerages — one clean closing package
A single transaction generates a blizzard of PDFs: listing agreement, offers and counteroffers, inspection report, disclosures, financing letter, the signed agreement of purchase and sale. Buyers, lawyers, and lenders all want them in a sensible order.
"Put the signed purchase agreement first, then disclosures, then the inspection report, then financing documents, then all correspondence by date."
Instead of emailing fourteen separate attachments and hoping the recipient can reassemble the story, you hand over one ordered PDF. The page citations mean everyone can find the inspection report or the financing letter without scrolling through the whole thing.
4. Researchers and grad students — a single ordered reading or evidence file
If you collect PDFs — papers, primary-source scans, interview transcripts, government reports — you eventually need them bound into one ordered document: a literature packet for a supervisor, an appendix for a thesis, an evidence file for a systematic review.
"Order these by publication year, oldest to newest. Put peer-reviewed articles first, then preprints, then government reports. OCR any scanned documents."
The before is renaming files 01_, 02_, 03_ so they merge in the right order, then redoing it when you add a source. The after is describing the order once, in plain English, and getting a single cited PDF where every section points back to its origin — which is exactly what you want when you need to trace a claim to its source page.
5. HR and people teams — assembling an employee or onboarding file
Hiring and offboarding both produce a stack: signed offer, tax forms, ID copies, certifications, policy acknowledgements, performance documents. They need to live as one ordered file per person.
"Group everything by employee, then within each person sort by date. Put the signed offer letter first, then tax and banking forms, then signed policy acknowledgements."
The after is one merged file per employee in a consistent order, instead of a shared drive folder where the offer letter and the NDA are three clicks apart.
Why local-first matters here too: HR files carry SINs, banking details, and ID scans. Keeping the sort-and-merge step on-device means that sensitive personal data isn't shipped to a third-party processor just to be put in order.
What this is not
PDF Insight is a deliberately narrow tool. It is a smart sort + merge wedge — it classifies the PDFs you already have, orders them the way you describe, and merges them into one cited file.
It is not a case-management system, a practice-management suite, a patient or matter database, a CRM, a document-management system, or cloud storage. It will not chase signatures, track deadlines, manage a listing, or store your records for you. It takes files you already have on disk and turns the pile into one clean, ordered, cited document. If that's the bottleneck, it helps a lot. If you need a system of record, this isn't it — and we'd rather tell you that up front.
Try it on your own messy folder
Download PDF Insight, point it at a stack of PDFs, type the order you want in plain English, and see whether the merged file comes out the way you'd have assembled it by hand. It runs locally — nothing is uploaded in the default local tier — so you can try it on real documents without sending them anywhere. 14-day free trial, no card required.
Download the free trialReady to commit? Founder Lifetime — $399 CAD once.
FAQ
Does it work on documents that aren't tax slips?
Yes. The engine classifies, orders, and merges any PDFs based on the plain-English instruction you give it. Tax slips were the first use case, not the only one.
Do my files get uploaded anywhere?
Not in the local tier, which is the default. Classification, OCR (Tesseract, English and French), and merging all run on your own machine via on-device AI. There is an optional paid cloud lane for extra speed, but it's off unless you deliberately opt in.
What do the "page citations" actually do?
For each section of the merged PDF, you get a reference back to the source file and page it came from — so you can always trace a part of the final document to its original.