AI Meeting Notes for Wealth Management: What Works and What Hallucinates
There is a specific moment, about ten seconds into reviewing an AI-generated meeting note, where the advisor's eyes narrow. They have just spotted a recommendation in the draft that was not actually made in the meeting - a 60/40 rebalance the client asked about, that the assistant has confidently transcribed as a recommendation the advisor agreed to. The advisor strikes the line, mutters "that is not what I said," and the trust-tax begins. Hallucination in wealth-management notes is not a quirky AI failure mode. It is the difference between a tool that gets adopted and one that gets uninstalled within a quarter.
Picture the screen most advisors are looking at when they review these drafts: a split view with the meeting transcript on the left and a note-draft sidebar on the right, each claim in the draft tied back to a citation tooltip pointing at the moment in the conversation it came from. That citation pattern is the single biggest difference between AI notes that work in this vertical and AI notes that do not. This piece walks through what we have seen work, what hallucinates, and the design choices that separate the two.
Generic AI meeting notes - the ones built for sales calls or product reviews - tend to summarize. They produce a tidy three-paragraph recap with action items at the bottom. That format is wrong for wealth management for two reasons. First, examiners read by field, not by paragraph; a Reg BI reviewer wants to find the recommendation field, the rationale field, and the conflicts field, in that order. Second, summarization is where hallucinations breed. The model fills in narrative connective tissue that the conversation did not contain.
Structured extraction is the alternative that has held up across our pilot. Instead of asking the model for a summary, the system prompts it to fill specific fields - investment profile changes, recommendations made, alternatives considered, conflicts disclosed, action items - and to leave any field blank if the conversation did not address it. Empty fields are a feature, not a bug. They tell the advisor what the meeting did not cover, which often surfaces a follow-up that should happen.
Every claim in the draft note should carry a citation pointing back to the moment in the transcript it came from. Per claim, not per section. The advisors in our pilot adopted this pattern faster than any other product feature, because it changed the review task from "is this draft correct" to "is this specific sentence supported by what was said." Citation tooltips also turn out to be the cheapest way to catch hallucinations. If a sentence has no citation, or the citation does not actually support the claim, the advisor strikes it.
Three categories of content drift consistently across LLM-generated wealth-management notes:
The two-strike rule we use during pilots. If an advisor flags two hallucinations in the same note, the draft is not edited - it is regenerated with stricter extraction prompts. Editing around drift is how drift sneaks into the record.
The Care Obligation under Reg BI and the fiduciary standard under the Advisers Act both ask whether the recommendation was in the client's best interest given the profile. That is a judgment, not an extraction. Within our pilot, the suitability rationale field stays empty in the AI draft and is filled in by the advisor as the last step of the review. This is the single design choice that has the most defensive value during examinations - the record cannot be challenged as "AI-generated suitability" because the suitability narrative is human-authored and timestamped separately.
Two areas where the current generation of AI meeting notes still falls short for wealth-management use. First, multi-speaker disambiguation when more than two people are in the room - couples meetings and family-meeting transcripts produce more attribution errors than single-client meetings. Second, jargon density. "Roth conversion ladder," "529 superfund," "step-up in basis," and the dozens of similar terms specific to U.S. wealth planning are well-handled by the better models but not uniformly. The pragmatic answer for now is a per-firm glossary the model is grounded against.
Across our early advisor cohort, mean documentation time fell from 52 minutes per meeting to roughly 8 minutes of editing per meeting after a four-week ramp. The asterisk is that the eight minutes of editing has to actually happen. A draft that goes into the CRM unedited is not a productivity gain; it is a compliance liability. The advisors who treat the AI draft as a starting position, not a finishing one, get the time back. The advisors who try to bypass review do not last in any pilot we have run.