Why 95% of BSA Alerts Being False Positives Is an AI Problem, Not a Staffing Problem
Your compliance team spends most of their time investigating transactions that turn out to be nothing. AI can fix the signal-to-noise ratio.
Sean Hsieh
Founder & CEO, Runline
Article 6 Outline
Track: The Market Thesis (deep blue) | Arc: Market Thesis | Target: CEOs, Compliance Officers, Ops Leaders | Length: ~2,200 words
Opening Hook
Here's a number that should terrify every credit union CEO: 95% of BSA alerts are false positives. Your compliance team — the 2-3 people keeping your institution out of regulatory trouble — spends the vast majority of their time investigating transactions that turn out to be nothing. Meanwhile, only 2% of illicit financial flows globally are actually caught. The system is broken at both ends: too much noise, too few catches. And the answer isn't hiring more people to process more noise. It's using AI to separate signal from noise so your people can do the work that actually requires human judgment.
Act 1 — The 95% Problem (By the Numbers)
- 95%+ of BSA/AML alerts are false positives — confirmed by McKinsey (90%+), HSBC internal data (95%+), and LexisNexis Risk Solutions reporting across the financial industry. This isn't a CU-specific stat — it's industry-wide, which means CUs are suffering under the same alert fatigue as JPMorgan but with 1/10,000th the compliance staff
- Why it's so high: Current monitoring systems are overwhelmingly rules-based, not intelligence-based. They fire on patterns: cash transaction over $X, wire to country Y, account opened less than Z days ago. These rules catch everything that looks suspicious but can't distinguish between a contractor who deposits cash every Friday and a money launderer structuring deposits. Rules see patterns. They don't see people.
- The downstream cost:
- 21.4 hours per SAR (Bank Policy Institute study) — that's the real-world time cost, not FinCEN's absurdly low estimate of 1.98 hours
- A mid-size CU filing 50-70 SARs per month: that's 1,000-1,500 hours/month just on SAR preparation
- Only 4% of filed SARs result in any law enforcement action (BPI, 2020), meaning 96% of the work product — all those hours investigating, documenting, and filing — goes into a FinCEN database and is never acted on
- $11,600 average cost to originate a single loan (Freddie Mac, 2024), with 67% being personnel costs. Compliance is a significant chunk of that
- Reference: TD Bank just paid $3.09 billion — the largest BSA/AML penalty in US history — for systemic failures in their transaction monitoring. The fine wasn't for lack of staff. It was for inadequate monitoring systems. The regulators are telling you: this is a technology problem.
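The rules-vs-context failure mode above is easy to see in miniature. Here's a sketch of a rules-based monitor (thresholds and field names are illustrative, not any vendor's actual rule set) — note that it fires identically on a member with a 11-year history of routine deposits and on a brand-new structurer, because the rules never look at the person:

```python
# A minimal sketch of rules-based transaction monitoring.
# Thresholds and field names are hypothetical, not any vendor's actual rules.

def rules_based_alerts(txn):
    """Fire an alert whenever a transaction matches a static pattern."""
    alerts = []
    if txn["type"] == "cash_deposit" and txn["amount"] >= 9000:
        alerts.append("near-CTR-threshold cash deposit")
    if txn["type"] == "wire" and txn["country"] in {"XX", "YY"}:
        alerts.append("wire to high-risk country")
    if txn["account_age_days"] < 30:
        alerts.append("new account activity")
    return alerts

# The florist making her routine weekly deposit...
maria = {"type": "cash_deposit", "amount": 9200,
         "country": "US", "account_age_days": 4100}
# ...and someone structuring deposits produce the exact same alert.
structurer = {"type": "cash_deposit", "amount": 9200,
              "country": "US", "account_age_days": 45}

print(rules_based_alerts(maria))       # ['near-CTR-threshold cash deposit']
print(rules_based_alerts(structurer))  # ['near-CTR-threshold cash deposit']
```

Rules see patterns; they don't see people — and every one of those pattern matches lands in a human's queue.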
Act 2 — A Day in Your BSA Analyst's Life (I've Seen It)
- I spent time embedded with the fraud and compliance team at a credit union partner. Here's what I watched:
- 6:30 AM: The BSA Detail file from last night's batch processing lands. Overnight, the core processor dumped every transaction from the previous day. Not real-time — T+1 data, meaning anything suspicious yesterday is only visible today
- 7:00 AM: The analyst opens 5-6 separate systems to begin her day:
- Core processor (CU*Answers GOLD) for member data
- Verafin for AML alerts and fraud alerts (separate pages — AML and fraud aren't unified)
- IDOC for check images
- A separate tool for SSN aggregation (because the BSA Detail aggregates by account, not individual — joint account holders appear as a single entry)
- An internal spreadsheet tracker because the systems don't talk to each other
- 8:00 AM-12:00 PM: She works through the alert queue. Each alert requires:
- Pull the member profile from the core
- Check transaction history across accounts
- Cross-reference with previous alerts and filed SARs
- Check for related accounts (joint holders, business accounts)
- Look up negative news (manually, in a browser)
- Determine: is this suspicious, or is this just Maria the florist making her weekly cash deposit?
- For 95 out of 100 alerts, it's Maria. But she has to check every single one.
- 1:00 PM: A SAR needs to be filed. The narrative — the story of why this activity is suspicious — is entirely manual. She synthesizes notes from multiple systems, transaction records, prior correspondence, and external research into a coherent document. A manual SAR takes 1-3 hours. Even the semi-automated ones (structuring SARs with some Verafin help) take 10-20 minutes.
- The math: This team — 2-3 people — handles 400+ CTRs and 50-70 SARs per month. They're operating at 125% capacity, averaging 60-hour weeks. And the transaction volume is growing.
- The punchline: These are smart, dedicated, experienced professionals. They're not slow — they're drowning. And the answer isn't to hire a fourth person to drown alongside them.
Act 3 — Why More Staff Doesn't Fix a Systems Problem
- 67% of credit unions report they can't hire enough qualified staff (CUNA workforce survey). BSA analysts are particularly scarce — the role requires a rare combination of financial expertise, regulatory knowledge, and investigative instinct
- The retirement cliff: The most experienced BSA officers — the ones with 20+ years of institutional knowledge about your membership patterns, your examiner relationships, your community's economic rhythms — are retiring. When they leave, they take irreplaceable context with them
- The training gap: A new BSA analyst takes 12-18 months to become truly effective. Not because they're slow — because understanding your credit union's patterns, your membership base, your risk profile requires time with the data that no certification can shortcut
- The salary competition: Banks and fintechs are bidding up compliance talent. A BSA analyst at a $500M credit union makes $55-75K. The same analyst at a regional bank makes $80-110K. At a money center bank or fintech: $120K+. You're fighting for talent with one hand tied behind your back.
- The scaling problem: If transaction volume grows 15% per year (a healthy CU), and your alert rules stay the same, your false positive volume compounds at 15% per year too. Hiring adds capacity in discrete, expensive steps; alert volume compounds with your success. This math never balances.
- Reference: The real problem isn't people — it's that you're asking humans to do work that machines should do (sorting signal from noise) instead of work that only humans can do (exercising judgment about complex situations). The 95% false positive rate isn't a staffing failure. It's a technology failure that creates a staffing crisis.
Act 4 — What AI Actually Changes (And What It Doesn't)
- Let's be precise about what AI does and doesn't do here, because the hype is real and so is the risk of overpromising:
What AI does:
- Triages the 95%. An AI agent that understands your membership patterns — Maria's weekly cash deposits, the construction company's seasonal revenue cycles, the retired teacher's pension schedule — can flag alerts as "routine pattern, consistent with member history" before a human ever sees them. The analyst reviews the AI's reasoning, not the raw transaction.
- Drafts SAR narratives. The agent pulls member data, transaction history, prior alerts, account relationships, and external references — then drafts a coherent narrative. The BSA officer reviews, edits, and approves. 1-3 hours becomes 10-15 minutes of review.
- Creates tracker notes automatically. 80% of tracker note content follows a standard template: transaction details, member demographics, risk assessment factors. The AI generates the note; the analyst validates the judgment call.
- Monitors in real-time. CDC pipelines (Article 5) replace nightly batch with real-time data. Suspicious patterns are detected as they happen, not 24 hours later.
- Reference: HSBC deployed AI-powered AML monitoring and saw 60% fewer false positives while detecting 2-4x more real financial crime. JPMorgan reported a 95% reduction in false positives after AI deployment. These aren't hypotheticals.
What AI doesn't do:
- Make the judgment call. Is this activity actually suspicious? That requires understanding context, exercising discretion, and applying institutional knowledge. That's human work. FinCEN and NCUA require human sign-off on all AI-assisted compliance actions. This isn't a limitation — it's the right design.
- Replace your BSA officer. It replaces the noise that buries your BSA officer. The goal isn't a smaller compliance team — it's a compliance team that spends 80% of their time on the 5% of alerts that actually matter, instead of 80% of their time on the 95% that don't.
Act 5 — The Examiner Conversation (Closing)
- Here's the part that makes compliance officers nervous: "What will the examiner say?"
- FinCEN has explicitly encouraged the use of AI and innovative technologies for BSA/AML compliance. Their Innovation Hours program invites financial institutions to discuss how technology can improve suspicious activity monitoring.
- NCUA's AI Compliance Plan (September 2025) isn't anti-AI — it's pro-AI-with-guardrails. The requirements (monitoring, control, termination) are a design spec, not a prohibition.
- The examiner question that should worry you isn't "Why are you using AI?" — it's "Why are you still using the same rules-based system that generates 95% false positives while TD Bank just paid $3 billion for inadequate monitoring?"
- Reference: The FFIEC BSA/AML Examination Manual has been updated to acknowledge technology-assisted monitoring. The regulatory wind is blowing clearly: use better technology, document how it works, keep humans in the loop, and be able to demonstrate it to an examiner. That's exactly how Runline builds.
- Callback to Articles 2 & 5: SEC examination muscle (Article 2) → audit-trail-first architecture. Core processor data unlocked via CDC (Article 5) → real-time monitoring instead of T+1 batch. Those building blocks converge here: AI agents that triage BSA alerts using real-time data, with full audit trails your examiner can review.
- Closing line direction: "Your BSA team isn't failing. Your monitoring systems are failing your BSA team. The 95% false positive rate is a technology problem with a technology solution — one that doesn't replace your best people but finally lets them do the work they were trained to do."
Key References
- 95%+ BSA false positive rate — McKinsey, HSBC, LexisNexis
- 21.4 hours per SAR — Bank Policy Institute (vs. FinCEN's 1.98-hour estimate)
- 4% of SARs result in law enforcement action — BPI, 2020
- TD Bank — $3.09B BSA/AML penalty (largest in history)
- $11,600 average cost per loan origination — Freddie Mac, 2024
- 67% of CUs can't hire enough staff — CUNA workforce survey
- HSBC — 60% fewer false positives, 2-4x more real crime detected with AI
- JPMorgan — 95% false positive reduction after AI deployment
- Only 2% of illicit financial flows caught globally
- FinCEN Innovation Hours program — explicit encouragement of technology solutions
- NCUA AI Compliance Plan (Sept 2025) — monitoring, control, termination requirements
- FFIEC BSA/AML Examination Manual — technology-assisted monitoring updates
- First-hand observations: FCU compliance team, 5-6 systems, 60-hour weeks, 125% capacity
- Verafin: ~$125K+/year for mid-size CU; AuditLink: ~$18K for similar scope
- BSA analyst training gap: 12-18 months to effectiveness
Tone Calibration
- Empathy above all: This article is for the compliance officer reading it at 7 PM on a Tuesday after a 12-hour day of alert processing. She needs to feel seen, not lectured. "Your BSA team isn't failing. Your systems are failing your BSA team."
- Curiosity: Fascinated by the structural economics — the financial industry spends billions on compliance and catches only 2% of illicit flows. That's not a people problem. That's a system design problem. AI doesn't just make compliance cheaper — it makes compliance work.
- Silicon Valley lesson: HSBC and JPMorgan have already proven that AI reduces false positives by 60-95% while increasing real crime detection. This isn't frontier research — it's deployed technology at the world's largest banks. The question for CUs is why they're still using the rules-based approach that those banks have already moved past.
- Spicy take: "96% of your SAR work product is never acted on by law enforcement. Not because your analysts did bad work — because the system generates so much noise that even FinCEN can't process it all. You're not fighting financial crime. You're fighting an alert queue."
- Vulnerability / lived experience: The Act 2 "I've seen it" section is the article's emotional center. Sean didn't read a report about BSA analyst workflows — he sat next to the analysts, watched them toggle between 5-6 systems, and saw the 60-hour weeks firsthand. That's the credibility that makes this article different from a vendor whitepaper.