Template-First Documentation vs Ambient AI Scribes: Which Approach Fits Your Practice?

An educational guide comparing the two main approaches to AI-powered clinical documentation: template-first tools that fill structured notes from your input, and ambient AI scribes that listen during sessions and generate notes automatically. Covers accuracy, privacy, cost, and workflow fit to help you choose.

Two Different Ideas About How AI Should Help You Write Notes

There are now dozens of AI documentation tools in the clinical market, and they do not all work the same way. Underneath the similar marketing language ("save time," "AI-powered," "HIPAA compliant"), the products fall into two fundamentally different architectural approaches.

The first approach asks you to bring your session to the AI after it ends. You capture what happened, drop it into a structured template, and the AI fills in the note from your input. The AI is only working with what you gave it.

The second approach asks the AI to attend your session with you. An ambient scribe listens while you and your patient or client talk, transcribes the conversation in real time, and generates a note from the transcript. You review the output afterward.

Both approaches produce AI-assisted clinical notes. But they make very different tradeoffs around accuracy, privacy, cost, and fit for different practice types. This guide explains how each approach works and gives you a framework for deciding which fits your situation.

How Template-First Documentation Works

Template-first documentation starts with a structure you define or select before you see the output. The workflow typically looks like this:

You run your session as you normally would, without any recording device active.
After the session ends, you type or dictate your notes: what the client reported, what you observed, what interventions you used, how they responded.
You paste or submit that raw input to the tool.
The AI reads your input and populates a pre-defined note template (SOAP, DAP, BIRP, GIRP, or a custom format) with the content you provided.
You review the formatted note, make edits, and finalize it.

The critical constraint: the AI can only put into the note what you put into the input. If you did not mention the client's mood, the mood field does not get invented. It either stays blank or flags as incomplete.

Tools built on this approach include NotuDocs, as well as note generators that accept typed session summaries rather than audio recordings.

What Template-First Gets Right

The architecture enforces a hard limit on fabrication. Hallucination in AI systems refers to content the model generates that is plausible-sounding but not based on provided information. In a template-first system, hallucination in the clinical sense (fabricating a diagnosis, inventing symptom history, adding an intervention you never mentioned) is structurally constrained. The AI is filling blanks from your words, not generating a narrative from scratch.

This matters for practitioners whose notes face audit, insurance review, or potential litigation. A note that says a client reported suicidal ideation, when the client did not and you did not document it, is not a minor error. It is a clinically and legally significant problem. Template-first tools reduce that risk by construction, not by policy.

Template-first also gives you precise control over note format. If your payer requires a specific SOAP structure, or your agency uses a custom intake format, you define the template once and every note follows it. The AI adapts to your structure rather than generating a note in whatever format it finds most natural.

Where Template-First Has Friction

The workflow requires a post-session step. After you finish with a client, you need to write up your session input before the tool can do anything. If you are running back-to-back sessions in a busy clinic, that deferred step can pile up.

For practitioners who are already good at writing session notes, template-first may feel like an extra layer rather than a shortcut. The real value shows up when you are staring at a blank SOAP note at 8 PM with five more to write. The template structure and AI formatting turn your rough session notes into a completed, properly formatted note faster than you could write it from scratch.

How Ambient AI Scribes Work

Ambient AI scribe tools work differently. Instead of processing your written input after a session, they capture audio during the session itself.

The typical workflow:

You start the scribe app before your session begins (or the tool runs passively in the background on your device).
The tool listens to or records the conversation between you and your client.
After the session, the tool processes the audio, generates a transcript, and produces a structured clinical note from the transcript.
You review the note, make edits, and approve it.

Tools in this category include Freed, JotPsych, Berries, and Heidi Health, among others. ICANotes AI Scribe, Mentalyc, and Upheal also use audio-based generation, though they differ in whether audio is stored, how long transcripts are retained, and what compliance frameworks apply.

What Ambient Scribes Get Right

For high-volume clinical settings, ambient scribes reduce the total documentation burden more dramatically than template-first tools can. If you are seeing 20 to 30 patients per day in a primary care, urgent care, or psychiatry setting, there is often not time to write detailed session notes between appointments. The ambient scribe is capturing everything that happens in the room so you do not have to reconstruct it later from memory.

The output from a good ambient scribe can be detailed in a way that is hard to replicate from a brief post-session write-up. It captures exact phrasing, specific symptoms the patient mentioned, dosage discussions, and the actual back-and-forth of a clinical conversation. For a physician taking a patient history, that detail can be genuinely valuable.

Ambient scribes also remove the blank-page problem almost entirely. The note is a first draft when you open it. Your job is review and correction rather than composition.

Where Ambient Scribes Have Friction

The accuracy problem. Ambient scribes generate notes from speech, and speech is ambiguous in ways that written input is not. The model must infer meaning from what was said, fill in clinical language around colloquial conversation, and decide what is clinically relevant versus incidental. That inference process is where hallucination risk concentrates.

The risk is not theoretical. There have been documented cases of ambient scribes fabricating clinical content: adding diagnoses not discussed, attributing statements to patients that were not made, filling in treatment plan sections with interventions that were never mentioned. The r/therapists community has discussed specific incidents, including one widely cited case involving an AI tool fabricating a history of abuse in a clinical note. These errors are particularly dangerous because a generated note looks authoritative. It does not flag its own uncertainty.

A published accuracy study of Talkspace Smart Notes (internal, 2025 data) reported 97.7% of generated notes were accepted as accurate by providers. That number is often cited favorably, and it should be: it represents meaningful progress. But at 20 sessions per day, a 2.3% error rate is roughly one fabricated or significantly inaccurate note every two days. That is worth understanding before committing to an ambient-first workflow.

The privacy problem. Recording a clinical session is not a neutral act. You are capturing the audio of a therapeutic conversation, a psychiatric evaluation, or a medical history discussion. Clients must consent to this, and many do not. In some states and jurisdictions, recording consent requirements are more stringent than a simple verbal acknowledgment.

Beyond client consent, there is the question of data handling. Where is the audio processed? Is the transcript stored? Is the audio deleted immediately after transcription, retained for quality assurance, or used in any capacity to improve the model? Different tools have different policies, and those policies can change. Berries and TheraPro advertise that audio is not stored. ICANotes discards audio immediately after processing. JotPsych retains the transcript for review, then deletes it. These distinctions matter and require you to read each vendor's policy, not just their marketing materials.

For therapists specifically, the therapeutic relationship depends on clients feeling safe to speak openly. Telling a client that an AI is listening to and transcribing the session can change the dynamic of the conversation itself. Some clients will accept this readily. Others will find it inhibiting, particularly clients discussing trauma, legal exposure, or stigmatized conditions.

The cost problem. Ambient scribes are substantially more expensive than template-first tools. The tools that use ambient recording and produce detailed clinical notes typically price at $90 to $150 per month for individual practitioners. Freed is priced around $99 per month. JotPsych is $150 per month for individual users. Heidi Health's paid plan is $99 per month. Berries is $99 per month.

The price difference is not arbitrary. Real-time audio processing, transcription infrastructure, and the compute required to generate detailed notes from long-form clinical conversations cost more to run than template-filling from a few hundred words of typed input. But from the practitioner's perspective, the cost difference is real: $99 to $150 per month versus $19 to $25 per month for template-first tools that require no audio capture.

Accuracy and Hallucination Risk: A Closer Look

This is the single most important practical difference between the two approaches, and it is worth spending more time on.

When a template-first tool generates a note, the generation space is bounded. The AI has a defined input (your session summary) and a defined output structure (your template). It is mapping your content into a format. If your input says "client reported low mood and sleep disruption," the Subjective section of the SOAP note will say something close to that. The AI is not going to add that the client also reported suicidal ideation because the language patterns around mood disturbance often co-occur with ideation in training data. That kind of inference is blocked by the template constraint.

When an ambient scribe generates a note from a transcript, the generation space is wider. The model is reading a full conversation transcript and deciding what to include, how to frame it clinically, what diagnoses or concerns the conversation implies, and how to organize it into a note. That is a more creative and more error-prone process. The model's priors (its training on clinical notes) will influence what it writes, not just what was actually said.

This does not mean ambient scribes are inaccurate across the board. High-quality tools with careful prompt engineering and clinical training data perform well most of the time. But the architecture creates higher ceiling risk: when they are wrong, they can be meaningfully wrong in ways that affect the clinical record.

For practitioners who document high-acuity work (crisis assessments, mandated reporting situations, forensic evaluations, medication management with controlled substances), the ceiling risk of ambient scribe errors deserves serious weight.

Privacy Implications: Recording vs Not Recording

If you are using a template-first tool, there is nothing to record. The AI sees your written session summary. Your client's voice, your voice, and the content of the therapeutic conversation are not captured by the tool.

If you are using an ambient scribe, someone is recording. Even if the vendor's policy is to discard audio immediately after transcription, the recording happens. Your client is being recorded. That fact needs to be disclosed, consented to, and documented.

The consent mechanics vary by jurisdiction. In most US states, one-party consent is sufficient for recordings in clinical settings, meaning your consent as the clinician is enough. But some states require two-party or all-party consent. Telehealth introduces additional complexity: if your client is in a different state, the more restrictive state's law may apply.

Beyond the legal minimum, there is the clinical question: how does your client population relate to being recorded? Clients with histories of surveillance, incarceration, or institutional distrust may have heightened sensitivity to recordings. Clients navigating divorce, custody, or legal proceedings may have specific concerns about what is captured and how it could be accessed. Even if the vendor's privacy policy is excellent, a subpoena to a therapy transcript is a different kind of exposure than a subpoena to a therapist's written notes.

None of this makes ambient scribes clinically inappropriate. Many practitioners use them responsibly and with full client consent. But the recording question is not a technical footnote. It is a clinical and ethical decision that affects the therapeutic relationship and the clients you serve.

Cost Comparison

Here is a practical comparison of current market pricing across both approaches:

Template-first tools: $19 to $25 per month. NotuDocs is $25 per month. Most tools in this category do not use note volume limits at entry-tier pricing, or offer generous free tiers.

Ambient scribes: $90 to $150 per month per clinician at the individual tier. Some tools (Heidi Health) have a free tier with limited monthly sessions. Most ambient scribes price at $99 or above for unlimited or high-volume use.

For a solo practitioner seeing 20 clients per week, the annual cost difference is roughly $840 to $1,500 per year. For a group practice with five clinicians, the cost difference across approaches can exceed $5,000 per year.

Whether that difference is worth it depends on how much time the ambient approach actually saves versus the template-first approach, and on the session volume and documentation complexity of the practice. For a solo therapist doing 50-minute individual sessions with moderate documentation requirements, the cost premium for an ambient scribe is hard to justify unless the workflow fit is genuinely better. For a physician doing 15-minute primary care visits with high daily patient volume, the ambient approach may save enough time to justify the premium easily.

Workflow Fit by Practice Type

Neither approach is universally better. They fit different practice contexts.

High-Volume Medical Settings

Physicians, NPs, and PAs in primary care, urgent care, and hospital settings see large numbers of patients with shorter visit windows. There is rarely time between appointments to write detailed notes from scratch. Ambient scribes are well-matched to this context: they capture the full clinical encounter without requiring a separate documentation step.

Tools like Freed, Heidi Health, and Athelas Scribe are designed with this workflow in mind. The value proposition is straightforward: the note happens automatically while you work, and you review a draft instead of composing one.

Therapy and Mental Health Private Practice

Individual and couples therapists, psychologists, and licensed counselors in private practice generally see fewer clients per day (typically 6 to 10), with 45 to 90 minutes per session. Post-session documentation time is built into the schedule, at least in theory.

For this population, template-first tools often fit as well as or better than ambient scribes. The documentation burden is real but comes in smaller batches. The note complexity is moderate (progress notes, treatment plan updates, risk documentation). And the client relationship considerations around recording are often more significant than in medical settings.

The choice between ambient and template-first for therapists tends to come down to three things: how much they trust the output without heavy editing, how their clients would respond to disclosure of recording, and whether the cost premium is justified by their actual time savings.

Behavioral Health Prescribers (PMHNPs, Psychiatrists)

This is a context where ambient scribes are increasingly popular, and where the case for them is stronger. Psychiatric evaluations involve detailed medication histories, symptom timelines, and safety assessments that are genuinely easier to capture from a real-time transcript than from a post-session reconstruction.

JotPsych and Berries both target this segment explicitly, with features for ICD-10 coding, medication documentation, and CPT billing code suggestions. If you are doing high-volume psychiatric evaluations, the ambient approach may save enough time to justify both the cost and the recording disclosure workflow.

Telehealth Practices

Telehealth adds a layer of complexity for ambient scribes. The audio is already passing through a video platform, and now you are also running a scribe app. Device permissions, audio quality, and the patient's comfort with multiple technologies matter.

Some ambient scribes are designed to work with major telehealth platforms (Zoom, Google Meet, Teams). But the setup complexity is higher than in-person, and audio quality issues in video calls can reduce transcription accuracy.

Template-first tools are agnostic to whether the session was in-person or telehealth. You write up your session notes afterward regardless of the modality.

EHR Integration Considerations

Neither approach has a universally better EHR integration story, but the picture is different for each.

Many ambient scribe tools are building direct EHR integrations as a core feature. Freed, JotPsych, and Heidi Health all advertise integrations with major EHR platforms. The appeal is obvious: the note goes from the scribe directly into the record without copy-paste.

Template-first tools typically require you to copy the generated note into your EHR. This adds a step but is not a serious friction point for most solo practitioners. Where it becomes more relevant is in multi-clinician practices where notes need to be routed, reviewed, or co-signed inside the EHR.

If your EHR is a core part of your workflow and you want a frictionless path from session to signed note, it is worth checking whether the specific tool you are evaluating has a native integration with your specific EHR, not just a general claim of "EHR compatibility." Copy-paste into any EHR is not integration.

Choosing Between the Two Approaches

There is no single right answer, but there is a decision structure that helps.

Choose template-first if:

You work in therapy, counseling, social work, or another setting where the therapeutic relationship would be affected by recording disclosure
Your client population includes people with heightened sensitivity to surveillance or recording
You need precise control over note format (specific SOAP variant, agency-required structure, custom intake format)
Your documentation volume is moderate (fewer than 15 sessions per day)
You want the lowest hallucination risk possible given the clinical nature of your work
Cost is a meaningful factor and the $90 to $150 per month range is hard to justify

Choose an ambient scribe if:

You are in a high-volume medical setting (primary care, urgent care, hospital) where documentation volume is genuinely overwhelming
Your clients or patients are comfortable with recording disclosure and the consent workflow is manageable
You have already budgeted for or are currently paying for a higher-tier documentation tool
You are a psychiatric prescriber who needs detailed medication and symptom documentation that is difficult to reconstruct post-session
You have tested an ambient tool's accuracy against your actual note types and found the editing time acceptable

Run a real test before committing either way. Take a week of real sessions, de-identify the content, and evaluate the output of the tool you are considering. For template-first, the test is: does my rough session summary produce a formatted note that I would have written myself? For ambient scribes, the test is: is the generated note accurate enough that my editing time is less than my current note-writing time?

Decision Checklist

Practice Context

What is my average number of clinical sessions per day?
How long is a typical session?
How much post-session documentation time is currently built into my schedule?
Is my practice setting in-person, telehealth, or both?

Client and Relationship Considerations

Would recording disclosure affect the therapeutic relationship with my specific client population?
Does my client population include people with particular sensitivity to recordings?
Do I see clients in jurisdictions with two-party consent requirements?

Accuracy and Format Requirements

What note format does my payer or agency require?
How much editing would I accept in a generated note before it takes longer than writing from scratch?
What is the highest-risk content I document regularly (crisis notes, mandated reports, medication management)?
Am I willing to accept the ceiling risk of ambient scribe hallucination for my clinical context?

Privacy and Data Handling

If evaluating an ambient scribe: has the vendor provided clear written documentation of their audio retention policy?
Does the vendor confirm that session audio or transcripts are not used to train AI models?
Is the vendor's BAA or equivalent compliance documentation available before I enter any client data?

Cost and Commitment

What is the all-in monthly cost at my actual session volume?
What is the annual cost difference between the ambient and template-first options I am considering?
Is the cost premium justified by the time savings I actually experienced in a real trial?
What are the cancellation terms and data export procedures?

Trial Evaluation

Have I tested the tool with de-identified real session content (not vendor demo notes)?
Have I tested what happens when input is incomplete or ambiguous?
Have I measured my actual editing time, not just the note generation time?
Have I evaluated the output after at least one full week of use, not just the first session?

Related reading: