Current

Access Devices / AppsHomeserverCloud Services
Laptop + Obsidian + Browser

- iCloud > Resilio > Homeserver sync
- Gemini / Mammouth @ Browser
Monitoring Script: Triggers the Quartz build and pushes to Cloudflare when Resilio syncsQuartz @ GitHub

Cloudflare Pages

- Web Analytics
iPhone/iPad + Obsidian + Browser

- iCloud for Sync OR
- Syncthing
- Gemini / Mammouth (Web) Apps

Issues

  1. AI exchanges are separate apps and I need to copy and paste notes across
  2. Vector Databases are stored on the AI platform.
  3. These are essentially my personal information in a form that is extensive and easily accessed (if the cloud provider authorises access)
  4. If I wish to Reprompt in Mammouth then each AI will need to re-vectorize attached files / images which is wasted CPU
  5. Prompt context needs to be hand stitched and/or via Gemini “Personal Context” feature
  6. Mammouth includes API usage in its current subscription plan

Business Requirement

I want the ability to effectively and efficiently integrate with LLM using Obsidian on all my devices and publish notes for given folder(s) to my domain using reliable tools

Solution Option(s) and Functional Requirements

Initial

  1. Continue to host my Vault in markdown files / folders

  2. Continuing to use Obsidian from my access devices

  3. Pivot to Syncthing to synchronize direct to Homeserver, bypassing Laptop Resilio bridge

  4. Obsidian Sync has a 1GB ceiling for a vault and given its pay for this seems very constrained

  5. Parallel run my main vault on the current solution until I build confidence with Syncthing

  6. Continue to publish one or more folders to one or more sites using Quartz

  7. This can be as is with Quartz building my site on my homeserver

  8. Though this will involve me manually upgrading the Git files

  9. Alternatively I can publish to a private vault

  10. Introduce the ability to prompt an AI LLM using one or more files from my Vault

  11. This builds upon existing CLI based solutions that exist and get invoked from inside Obsidian

  12. Ability to reprompt to another AI LLM leveraging vault files and existing vectorized databases

  13. Self host the vectorized database to reduce Mammouth API consumption

  14. Usage will be by family of 3 based on their own individual vaults

  • Evaluate the ability to implement this in phases? and rate the difficulty for a retired IT Business Analyst?

Alternate

  • Are there alternate approaches that can help provide context from one or more vault files / folders and flexibility to interface with multiple AI LLMs seamlessly?

claude response

Solution Evaluation: Obsidian + LLM Integration

Architecture Overview

┌─────────────────────────────────────────────────────────────┐
│                    ACCESS DEVICES                           │
│  Laptop/iPhone/iPad                                         │
│  Obsidian Vault (Markdown)                                  │
│  + Obsidian Plugin Layer                                    │
└──────────────────┬──────────────────────────────────────────┘
                   │ Syncthing (Direct)
┌──────────────────▼──────────────────────────────────────────┐
│                   HOMESERVER                                │
│  ┌─────────────┐  ┌──────────────┐  ┌──────────────────┐  │
│  │  Syncthing  │  │  Vector DB   │  │  Quartz Build    │  │
│  │  (Sync Hub) │  │  (ChromaDB/  │  │  + Cloudflare    │  │
│  │             │  │   Qdrant)    │  │  Deploy          │  │
│  └─────────────┘  └──────────────┘  └──────────────────┘  │
│  ┌─────────────────────────────────────────────────────┐   │
│  │           LLM Proxy / Orchestration Layer           │   │
│  │         (LiteLLM or Open WebUI)                     │   │
│  └─────────────────────────────────────────────────────┘   │
└─────────────────────────────────────────────────────────────┘
                   │
┌──────────────────▼──────────────────────────────────────────┐
│              CLOUD / EXTERNAL                               │
│  Cloudflare Pages  │  OpenAI/Gemini/Anthropic APIs          │
└─────────────────────────────────────────────────────────────┘

Phase Analysis

Phase 1: Sync Migration (Syncthing)

Difficulty: 2/5 — Low

TaskNotes
Install Syncthing on HomeserverDocker container, straightforward
Install Syncthing on LaptopReplace Resilio bridge role
Install Syncthing on iPhone/iPadMöbius Sync app (paid, ~£5) or Syncthing-iOS
Parallel run with current iCloud/ResilioRun both during confidence period
Vault per family memberSeparate Syncthing folders per person

Key Decisions:

  • Möbius Sync is the practical iOS option — background sync has iOS limitations
  • Recommend keeping iCloud as a backup sync path, not primary
  • One Syncthing instance on Homeserver serves all 3 family vaults with folder-level isolation

Risk: Low. Syncthing is mature, open source, and well documented


Phase 2: Quartz Publishing (Incremental Improvement)

Difficulty: 2/5 — Low

Current:  Resilio → Homeserver → Monitoring Script → Quartz Build → Cloudflare
Target:   Syncthing → Homeserver → Watch Script → Quartz Build → Cloudflare
Improvement OptionDifficultyNotes
Keep as-is, swap Resilio trigger for Syncthing trigger1/5Minimal change
Auto Git pull for Quartz upgrades via cron2/5Small script addition
Private vault (second Quartz instance)2/5Second folder, second Cloudflare project
Per-family-member published site3/5Three Quartz builds, three Cloudflare projects

Recommendation: Swap trigger script only. Add a monthly cron job:

cd /path/to/quartz && git pull && npm install

This removes the manual upgrade burden at minimal effort


Phase 3: LLM Integration via Obsidian

Difficulty: 3/5 — Moderate

Option A: Obsidian Plugin Route (Lowest friction)

PluginFunctionAPI Key Needed
CopilotChat with vault files, multi-LLMYes (your keys)
Smart ConnectionsLocal vector embeddings + chatOptional
OmnisearchFast local semantic searchNo
Text GeneratorPrompt templates, file contextYes

Copilot Plugin is the strongest candidate:

  • Supports OpenAI, Gemini, Anthropic, local Ollama
  • Can index your vault locally
  • Conversations stay within Obsidian
  • No copy-paste between apps
  • API keys stored locally

This directly solves Issues 1, 4, and 5 from your problem statement

Option B: CLI + Obsidian (Your existing approach, enhanced)

# Example: shell command invoked from Obsidian via Shell Commands plugin
obsidian-ai-query.sh --files "Health/2026-targets.md" \
                     --model "gemini-pro" \
                     --prompt "Analyse my progress"

Shell Commands plugin calls your existing CLI tools from within Obsidian — keeps your current workflow but removes app switching


Phase 4: Self-Hosted Vector Database

Difficulty: 4/5 — Moderate-High

Why this matters for you:

  • Vectorize once, query many times across different LLMs
  • Keeps your personal health/life data off commercial platforms
  • Shared infrastructure for family of 3
┌─────────────────────────────────────────┐
│           Homeserver                    │
│                                         │
│  ┌─────────────┐   ┌─────────────────┐ │
│  │   Qdrant    │   │   Open WebUI    │ │
│  │  (Vector DB)│   │  (LLM Frontend) │ │
│  │  Docker     │   │  Docker         │ │
│  └──────┬──────┘   └────────┬────────┘ │
│         │                   │          │
│  ┌──────▼───────────────────▼────────┐ │
│  │         LiteLLM Proxy             │ │
│  │  Routes to OpenAI/Gemini/Anthropic│ │
│  │  Single API endpoint              │ │
│  │  Usage tracking per family member │ │
│  └───────────────────────────────────┘ │
└─────────────────────────────────────────┘
ComponentPurposeDocker?Difficulty
QdrantVector storageYes2/5
LiteLLMMulti-LLM proxy, single endpointYes3/5
Open WebUIBrowser-based chat UI, vault file uploadYes2/5
OllamaLocal LLM (optional, no API cost)Yes2/5

LiteLLM specifically addresses Issues 2, 3, 4, and 6:

  • One API key management point
  • Vectors stored on your homeserver (Qdrant)
  • Route the same query to multiple LLMs
  • Track and control Mammouth/API spend per user

Vectorisation Workflow

Obsidian Vault (Markdown)
        │
        ▼
  Python Script (runs on Homeserver, watches Syncthing folder)
  - Detects new/modified .md files
  - Chunks content
  - Generates embeddings (via local model or API)
        │
        ▼
     Qdrant
  (self-hosted vector store)
        │
        ▼
  LiteLLM Proxy
  (augments prompts with relevant vault context)
        │
        ▼
  Your chosen LLM (Gemini/Claude/GPT)

This is the hardest phase — Python scripting required. As a retired IT BA, the concepts will be familiar but the implementation requires comfort with:

  • Docker Compose files
  • Basic Python scripting or adapting existing scripts
  • REST API concepts

Phase 5: Family Multi-Vault

Difficulty: 3/5 — Moderate

# Syncthing folder structure
/homeserver/vaults/
  ├── vault-you/          # Your Syncthing folder ID
  ├── vault-member2/      # Separate folder, separate devices
  └── vault-member3/      # Isolated, no cross-access
 
# LiteLLM user isolation
/litellm/config.yaml
  users:
    - user: "you"
      vault_path: "/vaults/vault-you"
      budget_monthly_usd: 20
    - user: "member2"
      ...

Qdrant supports named collections — one collection per family member keeps vectors isolated


Q1 2026          Q2 2026          Q3 2026          Q4 2026
─────────────────────────────────────────────────────────
Phase 1          Phase 2          Phase 3          Phase 4+5
Syncthing        Quartz           LLM Plugin       Vector DB
Migration        Tidy-up          (Copilot)        + LiteLLM
                                                   + Family
[2/5]            [2/5]            [3/5]            [4/5]
2-4 weeks        1-2 weeks        2-3 weeks        4-8 weeks

Start with Copilot plugin in Phase 3 before committing to Phase 4 — it may satisfy 80% of your needs at 20% of the effort


Alternate Approaches

Alt 1: Open WebUI (Fastest path to multi-LLM)

  • Single Docker container on Homeserver
  • Browser-based, works on all devices
  • Upload vault files per session (not auto-indexed)
  • Supports Ollama local + OpenAI/Gemini APIs
  • Difficulty: 2/5 — good stepping stone before full vector DB

Alt 2: Notion AI / Anytype (Abandon Obsidian)

  • Not recommended — loses your existing vault investment and publishing workflow

Alt 3: RAGFlow (All-in-one RAG platform)

  • Docker-based, includes vector DB + LLM proxy + UI
  • Designed for document knowledge bases
  • More opinionated than the modular stack
  • Difficulty: 3/5 — worth evaluating if Phase 4 feels too complex

Alt 4: n8n Workflow Automation

  • Self-hosted on Homeserver (Docker)
  • Orchestrates: file watch → vectorise → LLM query → write back to vault
  • Visual workflow builder — suited to BA thinking
  • Bridges Obsidian, LLMs, and vector DB without heavy coding
  • Difficulty: 3/5 — recommended complement to Phase 4

Summary Recommendation

PriorityActionEffortPayoff
Do firstPhase 1: SyncthingLowRemoves Resilio dependency
Do nextPhase 3: Copilot PluginLow-MedSolves LLM integration immediately
ThenPhase 2: Quartz cronLowRemoves manual upgrade task
Plan forPhase 4: LiteLLM + QdrantHighFull data sovereignty + family use
Considern8n as orchestration layerMedReduces coding need in Phase 4

Your BA background is a genuine asset here — system decomposition, requirements, phasing — the technical gaps are Docker familiarity and basic Python, both learnable incrementally alongside each phase