Guide · 12 min read · Updated May 2026

Voice notes for Obsidian — the four real options in 2026

Q: What's the simplest way to add voice notes to Obsidian?

If you're on macOS 14 or later, the simplest path is Apple Dictation: nothing to install, runs locally, dictates directly into the active note. The downside is it's purely transcription — no titles, tags, or backlinks generated automatically. For more structure, the Obsidian Whisper plugin by Nik Danilov is the community standard.

You want to talk into your Obsidian vault. There are four serious tools for doing it on Mac, and they make very different tradeoffs — local vs cloud, in-editor vs system-wide, raw text vs structured notes. Here's an honest map of all four, and how to pick.

Why voice + PKM is harder than it looks

Most "voice notes" tools are built for people who want to dictate text — emails, Slack messages, drafts. The PKM use case is fundamentally different. You're not trying to produce a finished message; you're trying to capture a thought before it evaporates, drop it into a system you've already curated, and have it be findable when you need it weeks later.

That has three consequences a generic dictation tool doesn't handle:

Output destination matters. Text at the cursor is fine for messaging. For PKM, the destination needs to be your vault, with a filename, a frontmatter block, and tags that fit your existing taxonomy.
Structure beats raw text. A 90-second voice memo is a stream of half-sentences and false starts. A vault note that's still useful in three months is structured — title, summary, links to related notes. Something has to do that work.
Capture without review is a junk drawer. Most voice-to-vault workflows fail at the curation step, not the capture step. After two weeks of "I'll review my inbox later," you have 40 captured notes and zero curated ones. The tool needs to make review fast, or capture stops being useful.

With those constraints in mind, here are the four tools serious Obsidian users actually pick on Mac in 2026.

The four real options

Apple Dictation

Free · built into macOS

The "just works" baseline. No install, no setup. macOS 14+ runs it locally.

Output: Text at the cursor — including inside Obsidian's editor
Privacy: Local on macOS 14+ (Sonoma) and later. Older versions cloud.
Speed: Real-time, streaming
Structure: None. You name the file, you add tags, you write frontmatter.
Best for: Quick dictation directly into a note you've already created

Obsidian Whisper plugin (Nik Danilov)

Free · OpenAI API key required

The Obsidian community standard for in-editor voice. Better-than-Apple transcription quality.

Output: Whisper transcript inserted into the active note
Privacy: Audio sent to OpenAI's Whisper API (cloud)
Speed: Cloud streaming via OpenAI
Structure: Raw text only. No titles, tags, or backlinks.
Best for: Obsidian users who want quality transcription and don't mind cloud + per-use cost (typically a few dollars a month)

MacWhisper

$5–$20 one-time · local

Different category — built for transcribing audio files (podcasts, recorded meetings), not live capture.

Output: Transcript file you save into your vault manually
Privacy: 100% local Whisper
Speed: Batch — feed it a file, get a transcript
Structure: None. You manually move the transcript into your vault and add structure.
Best for: Cleanly transcribing existing audio (recorded calls, podcast episodes you want to reference) into your vault. Not live thought-capture.

ThoughtMic

Free up to 2k words/wk · $8/mo Pro

Voice-to-vault pipeline. Local Whisper plus auto-titles, auto-tags from your taxonomy, auto-backlinks, and a Friday-review surface.

Output: Text at cursor in any app, AND a titled/tagged/linked Markdown note in your vault
Privacy: Local Whisper. Optional cloud rephrase is text-only via Zero Data Retention — never audio, never vault contents.
Speed: Local batch — Whisper Large V3 Turbo on Apple Silicon
Structure: Auto-generated title, suggested tags from your existing vault tags, backlinks resolved against existing notes
Best for: Obsidian / Logseq / Foam / plain-Markdown users who want voice capture that actually integrates with their PKM, not just transcribes around it. See thoughtmic.com.

How to pick — three honest questions

1. Does audio need to stay on your machine?

If yes — for compliance, philosophy, sensitive content, or just preference — your shortlist is Apple Dictation (macOS 14+), MacWhisper (for files), and ThoughtMic (for live capture). Skip the Whisper plugin; it sends audio to OpenAI.

If you don't care, the Whisper plugin's transcription quality on technical vocabulary and proper nouns is excellent, and the per-use cost is genuinely small for typical PKM use (a few dollars a month).

2. Do you want voice capture to also work outside Obsidian?

The Whisper plugin only runs inside Obsidian. If you also dictate into Slack, your IDE, email, and everywhere else on your Mac, you need a system-wide tool — Apple Dictation, SuperWhisper, Wispr Flow, or ThoughtMic. Of those, only ThoughtMic also writes a structured note into your vault when you dictate from anywhere on your machine; the others give you text at the cursor and stop there.

3. Do you want voice notes to organize themselves?

This is the question people don't ask early enough. After 30 voice notes piled up in your vault, do you want to spend 20 minutes triaging, retitling, and tagging them — or do you want them to arrive with titles, tags, and backlinks already in place? If the answer is the second one, you've narrowed the field to ThoughtMic; nothing else in the Mac ecosystem does this today. If you're happy curating manually, any of the four works.

The Friday-review problem (the part everyone misses)

Here's the failure mode every voice-into-vault workflow hits within a few weeks of starting:

You set up voice capture. It works great. You dictate forty short notes over three weeks — meeting recaps, ideas, todos, half-formed thoughts. Every one of them lands in your #inbox folder. You promise yourself you'll review them on Friday. Friday comes. Reviewing forty unstructured notes feels like work, so you don't. Now there are eighty. By next month, the inbox is write-only. Voice capture has produced a junk drawer.

This is the Collector's Fallacy: the act of capturing feels productive even when it isn't. The cure is not "discipline." The cure is to make review take five minutes instead of an hour, by giving each note a four-key decision (Discard / Keep / Promote to a structured template / Archive) and walking through them keyboard-driven, two-pane, no mouse.

Apple Dictation, the Whisper plugin, and MacWhisper are all capture tools. None of them have an opinion on what happens after the text lands. ThoughtMic ships with a dedicated Friday-review surface (⌥⇧ R) that turns the "I'll review on Friday" promise into a five-minute keyboard sweep. If you've fallen into the junk-drawer pattern with a previous voice tool, the review surface is what makes the next attempt different.

You can also build this manually with templates and Obsidian queries — it just takes meaningful setup work and ongoing discipline. Worth knowing the pattern either way.

Setup notes for each option

Apple Dictation

System Settings → Keyboard → Dictation → On
Pick a hotkey (default is press Fn twice; Globe key on newer keyboards)
In Obsidian, click into a note, hit the hotkey, talk
That's it. No Obsidian-side setup.

Obsidian Whisper plugin

Settings → Community plugins → Browse → search "Whisper" by Nik Danilov
Install + enable
Get an OpenAI API key from platform.openai.com/api-keys (you'll need a card on file)
Plugin settings → paste the API key, set a hotkey
Click into a note, hit the hotkey, talk, release

MacWhisper

Buy from goodsnooze.gumroad.com/l/macwhisper (one-time price tier)
Open the app, drag in an audio file
Pick a Whisper model size (Small for speed, Large for accuracy)
Wait for transcription, copy result, paste into a new vault note

ThoughtMic

Join the waitlist (launches summer 2026)
At launch: download, point it at your vault folder, set the hotkey
Press ⌥ Space anywhere on your Mac, talk — text appears at your cursor and a structured note lands in your vault
Hit ⌥⇧ R on Fridays to clear your #inbox

Free starter resource (regardless of which tool you pick)

We maintain a free Obsidian starter vault built specifically for voice-first capture — voice-memo template, daily-note scaffolding, a tag taxonomy that survives the way you actually think out loud, and the folder structure we use ourselves. It works whether or not you ever install ThoughtMic. If you're starting voice notes in Obsidian for the first time, grab it as a baseline.

Frequently asked

What's the simplest way to add voice notes to Obsidian?

If you're on macOS 14 or later, Apple Dictation is the simplest path: nothing to install, runs locally, dictates directly into the active note. The downside is it's purely transcription — no titles, tags, or backlinks generated automatically. For better quality, the Obsidian Whisper plugin by Nik Danilov is the community standard.

Is Whisper actually local on Mac, or does it call OpenAI?

Both options exist. Nik Danilov's Whisper plugin uses the OpenAI Whisper API (cloud, requires your own API key, audio leaves your machine). Apps like MacWhisper, VoiceInk, and ThoughtMic run Whisper locally on Apple Silicon — no API key, no audio leaves the device. Pick based on whether you want zero install cost (cloud, pay-per-use) or zero data leaving (local).

Can I dictate into Obsidian from outside Obsidian?

Plugin-based tools (Nik's Whisper plugin) only work inside Obsidian. System-wide dictation tools (Apple Dictation, ThoughtMic, SuperWhisper, Wispr Flow) work in any app — the difference is whether the output also writes a Markdown note into your vault automatically. ThoughtMic does both: text at the cursor anywhere AND a structured note in your vault.

Will voice notes mess up my tag taxonomy?

Only if your tool generates tags blindly. Tools that suggest tags from your existing vault tag list (rather than inventing new ones from the text) preserve your taxonomy. Plain dictation tools insert raw text and leave tagging to you. ThoughtMic specifically biases toward tags that already exist in your vault, only suggesting new ones when nothing close fits.

What about the Friday-review problem — do voice notes pile up?

Yes — this is the failure mode most voice-into-vault workflows hit within a few weeks. Captured #inbox notes accumulate; nobody re-reads them; the vault becomes write-only. The cure is a dedicated review surface (a keyboard-driven walkthrough where every inbox note gets a Discard / Keep / Promote / Archive decision in one keystroke). Without that, even the best voice capture tool degrades into a junk drawer over time.

What about iPhone capture?

None of the four tools above ship a native iOS companion today. Workarounds: Apple Dictation works on iPhone if you dictate into Obsidian Mobile directly. The Whisper plugin doesn't run on mobile. For audio you record on phone, you can sync to Mac and run MacWhisper. ThoughtMic has iOS planned 6–8 weeks after the macOS launch.

I don't use Mac. Does any of this work on Windows or Linux?

Apple Dictation is Mac-only. The Whisper plugin runs on whatever OS Obsidian runs on (Win, Linux, mobile) — but it's cloud-based, so the privacy tradeoff is the same everywhere. MacWhisper is Mac-only. ThoughtMic is Mac-only at launch. For local Whisper on Windows specifically, look at WhisperWriter or the Whisper.cpp ecosystem, though no comparable vault-native tool exists yet outside Mac.

Want voice that lands as structured notes?

ThoughtMic ships summer 2026. Local Whisper, vault-native output, Friday-review built in. Join the waitlist for a launch-day download link and the Founder's Deal ($49 lifetime, first 50 only).

We'll email you at launch. No newsletter. Unsubscribe any time.

Works with Obsidian · Logseq · Foam · VS Code · any .md folder