Local LLM Helper

by Mani Mohan
5
4
3
2
1
Score: 63/100

Description

Category: Productivity Tools

The Local LLM Helper plugin integrates local Large Language Models (LLMs) with Obsidian, offering a suite of tools for text processing, interactive AI chat, and personalized responses. Users can perform tasks such as summarizing text, generating action items, adjusting tone, and chatting directly with indexed notes. The plugin supports privacy by working with offline LLM servers like Ollama and LM Studio, and it allows full customization of prompts and response formats. Additionally, the plugin includes a user-friendly chat interface and integration with ribbon and status bar for quick access. It offers seamless interaction with LLMs for both productivity tasks and dynamic conversations within notes.

Reviews

No reviews yet.

Stats

74
stars
9,014
downloads
12
forks
772
days
20
days
29
days
3
total PRs
0
open PRs
0
closed PRs
3
merged PRs
11
total issues
0
open issues
11
closed issues
64
commits

Latest Version

a month ago

Changelog

What's Changed

  • Added optional Embedding server URL and Embedding API key settings so RAG indexing can use a separate OpenAI-compatible embedding server.
  • Kept the current single-server setup as the default: blank embedding server inherits the chat/default server, and blank embedding key inherits the chat/default key.
  • Added support for not-needed on the embedding API key to suppress the Authorization header.
  • Updated chat and embedding model browsing so each picker queries the correct effective server.
  • Stored embedding metadata now records the effective embedding server and model without storing API keys; old indexes remain loadable and still recommend re-indexing when the embedding backend changes.

Verification

  • npm run build
  • git diff --check
  • GitHub Actions release workflow passed and uploaded main.js, manifest.json, and styles.css.

README file from

Github

Why Use It?

Local LLM Helper turns Obsidian into a private AI workspace without forcing your notes into someone else's app. It works with Ollama, LM Studio, vLLM, LocalAI, OpenAI, and most servers that expose OpenAI-compatible endpoints.

  • Keep your notes in Obsidian - ask questions, summarize, rewrite, and review without changing tools.
  • Use local models when privacy matters - point the plugin at your own chat and embedding server.
  • Search by meaning, not just keywords - index your vault and retrieve the notes that actually matter.
  • Move from chat to action - generate review notes, task updates, and project summaries behind explicit approval cards.

Highlights

Capability What you get
RAG chat over your vault Ask questions against indexed notes with clickable source references.
Scoped note chat Limit chat to the current note, current folder, selected notes, tags, or the whole vault.
Markdown chat rendering Render assistant responses in chat views with Obsidian markdown, or switch back to plain text.
Related Notes sidebar Keep semantically nearby notes visible while writing or researching.
Manual workflow runner Draft weekly reviews, meeting tasks, and project status updates from note context.
Text commands Summarize, rewrite, extract action items, use custom prompts, and run saved prompts.
Vault Actions Let chat propose note writes, then approve or reject each write before it touches your vault.
Personas and model picker Customize assistant behavior and browse models directly from your server.

See It In Action

Chat With Your Notes

Ask questions against your indexed vault, inspect sources, and narrow the context when you need a more precise answer.

Fast Commands From Obsidian

Run text transformations, chats, searches, note indexing, and workflows from the ribbon menu or command palette.

Settings Built For Real Model Setups

Configure providers, chat models, embedding models, personas, reasoning extraction, RAG indexing, saved prompts, workflow defaults, and web search.

Features

🔌 Multi-Provider Support

Works with any server that supports the OpenAI API format (/v1/chat/completions):

  • Ollama - Run open-source models locally
  • OpenAI - Use GPT models with your API key
  • LM Studio - Local inference with a GUI
  • vLLM, LocalAI, text-generation-webui, and more

Switch providers anytime from settings. Configure temperature, max tokens, and streaming.

📚 Notes RAG

Index your vault for semantic search and AI-powered Q&A:

  • Smart chunking - Documents split with overlap for better context.
  • Incremental indexing - Only re-indexes changed files.
  • Content preprocessing - Strips frontmatter and cleans markdown.
  • Attachment-aware retrieval - Index PDF attachments by default, with opt-in local OCR for images and scanned PDFs.
  • Multiple embedding providers - Works with OpenAI-compatible embedding endpoints.
  • Configurable retrieval - Tune how many chunks are sent to the model.

Keep semantic context visible while you write:

  • Uses selected text when you highlight a passage.
  • Falls back to the full active note when nothing is selected.
  • Shows related notes with scores, paths, and previews.
  • Opens any match instantly or starts scoped RAG chat with selected notes.

🧩 Manual Workflow Automation

Run useful review recipes without giving the model autonomous write access:

  • Weekly review - Turn recent or scoped notes into wins, open loops, and next actions.
  • Meeting notes to tasks - Extract action items and follow-ups into a target note.
  • Project status summary - Summarize progress, risks, blockers, and next steps.
  • Approval-first writes - Every generated note change is reviewed before it is applied.

✍️ Text Processing

Transform selected text with AI-powered commands:

Command What it does
Summarize Condense text while preserving key information
Make Professional Rewrite in a formal, polished tone
Generate Action Items Extract actionable tasks from text
Custom Prompt Run your single saved custom prompt
Run Saved Prompt... Fuzzy-search and run any saved prompt
Use as Prompt Send selection directly to your model
Edit with Prompt Choose from presets or write custom instructions

All commands are available via Command Palette (Cmd/Ctrl + P) or the ribbon menu.

💬 Chat Interfaces

LLM Chat for general conversation:

  • Remembers recent exchanges for context.
  • Supports all configured personas.
  • Can propose approval-gated Vault Actions when enabled.
  • Can render assistant responses with Obsidian markdown from the sidebar toggle or settings.
  • Opens in a docked sidebar so your notes stay visible while you chat.

RAG Chat for note-grounded answers:

  • Searches your indexed notes semantically.
  • Shows clickable sources that open notes and indexed attachments.
  • Supports vault, note, folder, tag, and selected-note scopes.
  • Uses the same sidebar as general chat; popup variants remain available from the command palette.

🌐 Web Integration

  • Web Search - Search the web using Tavily, Brave API, or a self-hosted SearXNG instance.
  • News Search - Get recent news on any topic.
  • SearXNG Support - Configure an instance URL in settings; the instance must enable JSON output with json in search.formats.

🎭 Personas

Customize AI behavior with 12 built-in personas — or create your own:

  • Software Developer, Physics Expert, Fitness Expert, Stoic Philosopher
  • Product Manager, Technical Writer, Creative Writer
  • TPM, Engineering Manager, Executive, Office Assistant

Edit any persona's system prompt directly in settings. Create custom personas with your own name and instructions. Restore defaults anytime.

📋 Saved Prompts

Save frequently-used prompts and run them instantly:

  • Each saved prompt registers as a command in the palette — assign hotkeys to your favorites
  • Fuzzy search picker to quickly find and run any saved prompt
  • Full CRUD in settings: create, edit, rename, delete
  • Existing single "Custom Prompt" field still works alongside saved prompts

🧠 Reasoning Extraction

Models like DeepSeek and Qwen wrap their thinking process in <think> blocks. Enable reasoning extraction to automatically strip these from output:

  • Strips <think>, <reasoning>, and <thought> blocks by default
  • Configurable markers via JSON for custom formats
  • Applied to text commands, chat, and RAG responses

🔍 Model Picker

Browse available models directly from your server:

  • Browse button next to Chat and Embedding model fields
  • Fetches from /v1/models endpoint (chat browse uses the chat/default server; embedding browse uses the embedding server override when set)
  • Searchable picker modal — no more guessing model names
  • Manual text entry always available as fallback

⚙️ Organized Settings

Settings organized into clear sections:

  • Connection & Provider
  • Models (with Browse)
  • Chat & Personas
  • Output & Reasoning
  • Custom Prompt & Saved Prompts
  • Notes Index (RAG)
  • Workflow Automation
  • Integrations
  • About & Changelog

🎯 Command Organization

All commands use clear prefixes for easy discovery:

  • Text: - Text transformation commands
  • Prompt: - Your saved custom prompts (hotkey-assignable)
  • Chat: - Open chat interfaces
  • Workflow: - Run review workflows
  • Web: - Web and news search
  • Notes: - RAG indexing and management
  • Settings: - Plugin configuration

Installation

  1. Open Obsidian Settings → Community Plugins
  2. Click "Browse" and search for "Local LLM Helper"
  3. Install and enable the plugin

Manual Installation

  1. Download the latest release from GitHub Releases
  2. Extract to your vault's .obsidian/plugins/ folder
  3. Enable in Settings → Community Plugins

Quick Start

1. Configure Your Provider

Go to Settings → Local LLM Helper and choose your provider:

For Ollama:

  • Server: http://localhost:11434
  • Model: Click Browse to pick from available models, or type llama3.2
  • Embedding Model: mxbai-embed-large

For OpenAI:

  • Server: https://api.openai.com
  • API Key: Your OpenAI API key
  • Model: gpt-4 or gpt-3.5-turbo

For LM Studio:

  • Server: http://localhost:1234
  • Model: Click Browse to see loaded models, or leave blank for default

2. Try Text Commands

  1. Select some text in a note
  2. Open Command Palette (Cmd/Ctrl + P)
  3. Type "Text:" to see available commands
  4. Choose an action like "Summarize" or "Make Professional"

3. Index Your Notes

  1. Command Palette → "Notes: Index notes for RAG"
  2. Wait for indexing to complete
  3. Command Palette → "Chat: Notes (RAG)" to chat with your notes in the sidebar
  4. Command Palette → "Notes: Open related notes" to keep nearby notes visible while writing

4. Run a Workflow

  1. Command Palette → "Workflow: Run workflow..."
  2. Choose a built-in recipe
  3. Confirm the note scope and output target
  4. Review the generated approval card before writing anything to your vault

Notes:

  • Workflows use the same indexed-note context as RAG chat, so run Notes: Index notes for RAG first if your vault changed.
  • PDF attachments are indexed by default; image OCR and scanned-PDF OCR can be enabled in Settings → Notes Index (RAG).
  • Meeting notes to tasks and Project status summary require a target note to append into.
  • Workflow approvals still work even if the chat-only Vault Actions toggle is off.

Changelog

v2.4.7

Chat Rendering

  • Added optional Obsidian markdown rendering for assistant responses across chat views.
  • Added a sidebar toggle and matching setting to switch between rendered markdown and plain escaped text.

v2.4.5

Dockable Chat Sidebar

  • Moved general and notes chat into a right-sidebar workspace view so notes stay visible while chatting.
  • Added a multiline composer with target context, stop state, inline readiness/error banners, and a verification reminder.
  • Improved source citations with note titles and paths that open in the main editor.

v2.4.2

Plugin Review Cleanup

  • Published a fresh release with matching manifest metadata.
  • Added build provenance attestations for release assets.
  • Clarified optional auto-index behavior and kept it disabled by default.

Workflow Automation

  • Added a manual workflow runner for weekly review, meeting notes to tasks, and project status summary recipes
  • Reused note, folder, and tag scopes across RAG chat and workflow runs
  • Kept all workflow note writes behind explicit approval cards

Related Notes Sidebar

  • Added a persistent "Related Notes" workspace view
  • The sidebar updates from the current note or selected text
  • Open matching notes directly or start RAG chat with selected related notes

v2.4.0

Saved Prompts

  • Save frequently-used prompts with title and text
  • Each saved prompt registers as a command — assign hotkeys to your favorites
  • Fuzzy-searchable picker modal (Text: Run saved prompt...)
  • Full CRUD in settings: create, edit, rename, delete

Persona Editing

  • View and edit any persona's system prompt in settings
  • Create fully custom personas with your own name and instructions
  • Delete custom personas, restore all defaults with one click
  • Same backward-compatible persona keys — existing settings just work

Reasoning Extraction

  • Toggle to strip <think>, <reasoning>, <thought> blocks from LLM output
  • Useful for DeepSeek, Qwen, and other models that expose chain-of-thought
  • Configurable markers via JSON for custom formats
  • Applied to text commands, general chat, and RAG chat

Model Picker

  • Browse button next to Chat and Embedding model fields
  • Fetches chat models from the chat/default server and embedding models from the embedding server override when set
  • Searchable picker modal — works with Ollama, LM Studio, vLLM, and others
  • Manual text entry always available as fallback

Bug Fixes

  • Fixed server URL normalization to prevent missing protocol errors
  • Resolved security vulnerabilities and cleaned up dependencies

v2.3.1

New Features

  • Redesigned RAG Chat: New interface with welcome message, example queries, and clickable sources
  • Changelog in Settings: View version history anytime from Settings → About

RAG Improvements

  • Smarter chunking with overlap for better context preservation
  • Incremental indexing - only re-indexes changed files
  • Content preprocessing - strips frontmatter and cleans markdown
  • Better error messages when notes aren't indexed

UI/UX

  • Commands organized with prefixes (Text:, Chat:, Web:, Notes:)
  • Ribbon menu grouped logically with separators
  • Settings page organized into 7 clear sections
  • All prompts improved for better LLM output
  • Persona prompts rewritten to be more actionable

v2.3.0

  • Edit with Prompt: Edit selected text with preset or custom prompts
  • Security Updates: Fixed dependency vulnerabilities
  • Better Error Messages: Clearer embedding failure messages

v2.2.1

  • Fixed re-embedding issue on every restart
  • Proper persistent storage for embeddings
  • Storage diagnostics command

v2.2.0

  • Multi-provider support (Ollama, OpenAI, LM Studio)
  • Easy provider switching in settings
  • Configurable temperature and max tokens

v1.1.3

  • Chat history stored (3 previous exchanges)
  • Response formatting options

v1.1.1 - v1.1.2

  • LLM chat functionality with personas
  • Modern chat interface UI

v1.0.10

  • Ollama support
  • OpenAI API compatibility

v1.0.9

  • Added persona selection

v1.0.8

  • Replace/append output options

v1.0.7

  • Command palette integration

v1.0.6

  • Custom prompt capability
  • Action items generation

v1.0.5

  • Streaming output support

v1.0.4

  • Initial release with summarize, rephrase, and generate

Requirements

  • Obsidian 1.7.2 or later
  • Any LLM server with OpenAI-compatible API (Ollama, LM Studio, OpenAI, vLLM, etc.)

Support

License

MIT License - see LICENSE for details.

Similar Plugins

info
• Similar plugins are suggested based on the common tags between the plugins.
ChatGPT MD
3 years ago by Bram Adams
A (nearly) seamless integration of ChatGPT into Obsidian.
YOLO
5 months ago by Lapis0x0
Smart, snappy, and multilingual AI assistant for your vault.
BMO Chatbot
3 years ago by Longy2k
Generate and brainstorm ideas while creating your notes using Large Language Models (LLMs) from Ollama, LM Studio, Anthropic, Google Gemini, Mistral AI, OpenAI, and more for Obsidian.
SystemSculpt AI
2 years ago by SystemSculpt.com
Enhance your Obsidian App experience with AI-powered tools for note-taking, task management, and much, MUCH more.
Ollama
3 years ago by hinterdupfinger
Prompt ChatGPT
2 years ago by Coduhuey
Chat with Bard
3 years ago by Artel250
An obsidian plugin that enables you to talk to Google Gemnini directly
Copilot auto completion
3 years ago by Jordi Smit
Ollama Chat
2 years ago by Brumik
A plugin for chatting with you obsidian notes trough local Ollama LLM instead of Chat GTP.
Cannoli
3 years ago by blindmansion
Cannoli allows you to build and run no-code LLM scripts using the Obsidian Canvas editor.
Large Language Models
a year ago by eharris128, r-mahoney, & jsmorabito
The LLM plugin gives Obsidian users access to local and web-based, large language models via several chat interfaces: modal, widget, FAB window, and commands.
Nexus AI Chat Importer
2 years ago by Superkikim
Caret
2 years ago by Jake Colling
Caret, an Obsidian Plugin
Flashcards LLM
3 years ago by Marco Pampaloni
Use Large Language Models (such as ChatGPT) to automatically generate flashcards from obsidian notes
Chat Stream
3 years ago by Ryan P Smith
Obsidian canvas plugin for using AI completion with threads of canvas nodes
LLM Summary
2 years ago by QSun
wip
AI Agent
8 months ago by Manuel Magaña López
Empower your Obsidian vault with Google Gemini.
Augmented Canvas
2 years ago by Léopold Szabatura
Supercharge your Obsidian canvas experience with AI features.
AI Editor
3 years ago by Zekun Shen
LLM Test Generator
a year ago by Aldo E George
Gladdis
3 years ago by Aurélien Stébé
Gladdis (Generative Language Artificial Dedicated & Diligent Intelligence System) - it's an AI chatbot.
Steward
8 months ago by Dang Nguyen
A vault-specific agent equipped with agentic capacity, fast search, flexible commands, vault management, and terminals to "jump" into other CLI agents, such as Claude, Gemini, etc.
Smart Export
3 months ago by Iván Sotillo
Plugin that follows wikilinks to a configurable depth, joining the notes into a single export.
Reason
2 years ago by Joshua Pham
Digest your Obsidian notes
LLM workspace
2 years ago by Olivér Falvai
Canvas LLM
8 months ago by Mike Farlenkov
A canvas-like UI to talk with LLMs in Obsidian.
LLM Tagger
a year ago by David Jayatillake
AI Chat
2 years ago by arenasys
Pure Chat LLM
a year ago by Justice Vellacott
Turn notes into conversations with chatGPT or better yet Ollama
Markpilot
2 years ago by Taichi Maeda
AI-powered inline completions and chat view for Obsidian
OpenAugi
a year ago by Chris Lettieri
The self-organizing second brain. Augmented intelligence for thinkers.
AI Chat as Markdown
2 years ago by Charl P. Botha
Simple Prompt
2 years ago by David Zachariae
Simple Prompt Plugin is a plugin for Obsidian that allows you generate content in your notes using LLMs.
WhatsApp backup importer
a year ago by Luigi Cerone
Obsidian plugin that allows you to import exported WhatsApp chat archives (in .zip format) into your vault.
packUp4AI
10 months ago by Jeffry
An Obsidian plugin that helps you manage context for external LLM apps.
LLM docs
a year ago by Shane Lamb
Chat with LLM in regular markdown files in Obsidian
LLM Shortcut
10 months ago by Viktor Chernodub
A plugin for Obsidian that provides a way to create shortcuts for commands powered by LLM capabilities.
Image to notes by Photes.IO
2 years ago by Kanaries Data Inc.
AI Image to text notes plugin in obsidian
Ayanite
2 years ago by jemstelos
PromptCrafter
2 years ago by Fabrice Hong
Create reusable, modular prompts in Obsidian
Notes Refresher
2 years ago by Connor Park
Obsidian plugin for AI-generated note refreshers
NoteSmith
9 months ago by csteamengine
Chat clips
a year ago by sleepingraven
Record chat in ordinary markdown list.
On This Day I
a year ago by Ben Stuart
AI tools for Daily Notes, an Obsidian Plugin