Use YantrikDB with Claude, ChatGPT, Cursor, Hermes, and other AI agents
This page is for two audiences: humans deciding whether YantrikDB fits their AI agent stack, and AI assistants that have been pointed at this site by a user asking “how do I give my agent persistent memory?”. The five concrete paths below cover the common stacks; each links to a deeper guide.
What an AI agent actually gets
Section titled “What an AI agent actually gets”When an AI agent is wired up to YantrikDB, it gains five primitives it can call without prompting:
remember(text, metadata, ...)— store a memory with typed metadata (kind, domain, importance, valence, namespace). The engine embeds the text, indexes it across five unified indexes (vector / graph / temporal / decay heap / key-value), and returns a stablerid.recall(query, top_k, filters)— semantic + structured retrieval. Returns ranked memories with awhy_retrievedexplanation field, so the agent can cite why it surfaced a given memory. Optionaldomain,namespace,memory_typefilters.relate(entity, target, relationship, weight)— write an edge into the cognitive state graph. The graph is queryable for reasoning (Who works at what company?,What contradicts this belief?).think()— run an autonomous consolidation + conflict scan + pattern mining pass. Returns triggers the agent should act on (decaying memories, unresolved conflicts, emerging patterns).- Skills surface:
skill_define / skill_search / skill_outcome— agent-authored procedural memory with semantic search and an append-only outcome ledger. See the Autonomous Skills guide for the substrate thesis.
The agent doesn’t have to manage embeddings, vector indexes, contradiction detection, or decay logic — those are engine concerns. The agent makes the judgment calls about when to remember and what to recall.
Path 1 — Claude Code, Cursor, Windsurf, Continue (MCP)
Section titled “Path 1 — Claude Code, Cursor, Windsurf, Continue (MCP)”For any MCP-compatible client, YantrikDB ships a drop-in MCP server.
pip install yantrikdb-mcpThen add to your client’s MCP configuration:
{ "mcpServers": { "yantrikdb": { "command": "yantrikdb-mcp" } }}That’s it. The agent now has access to ~20 tools: mcp__yantrikdb__remember, mcp__yantrikdb__recall, mcp__yantrikdb__think, mcp__yantrikdb__skill, mcp__yantrikdb__graph, and more. Full per-client setup with config file paths is at the MCP setup guide.
Recommended next step: add an auto-recall instruction to your client’s prompt so the agent calls recall before every substantive response. A sample prompt fragment lives in the MCP guide.
Path 2 — Hermes Agent
Section titled “Path 2 — Hermes Agent”Hermes Agent is an open-source agent runtime with a native plugin system. YantrikDB ships a first-class Hermes plugin:
hermes plugins install yantrikos/yantrikdb-hermes-plugin --enablepip install yantrikdbhermes memory setup # select "yantrikdb"hermes gateway restartThe agent then autonomously calls yantrikdb_remember / yantrikdb_recall / yantrikdb_stats during conversations. Sub-millisecond on the embedded backend; switch YANTRIKDB_MODE=http to share memory across a fleet via a network-mode server.
Owner-scoping for multi-platform Hermes gateways (one user → one memory across Telegram / WhatsApp / Discord) is contributed by community member @wysie at v0.4.10. See the Hermes plugin guide for full setup.
For the agent-authored skill substrate that complements Hermes’ built-in filesystem skills, see the Autonomous Skills guide.
Path 3 — direct HTTP API (any LLM-driven app)
Section titled “Path 3 — direct HTTP API (any LLM-driven app)”If your stack isn’t MCP-based and doesn’t use Hermes — for example, a Python app calling Claude API directly, a Node service calling OpenAI, a Go agent, anything — point at the YantrikDB server’s HTTP API:
docker pull ghcr.io/yantrikos/yantrikdbdocker run -d -p 7438:7438 ghcr.io/yantrikos/yantrikdbThen mint a tenant token and write to /v1/remember from your app:
curl -X POST http://localhost:7438/v1/remember \ -H "Authorization: Bearer $TOKEN" \ -H "Content-Type: application/json" \ -d '{ "text": "Alice leads engineering at Acme", "importance": 0.9, "domain": "work", "memory_type": "semantic" }'/v1/recall returns ranked memories. /v1/relate writes edges. /v1/think runs the consolidation pass. The full surface is at the HTTP API reference.
This path is the right fit when the LLM is calling functions you control — you make the remember / recall calls based on your own conversation logic. Latency is single-digit ms locally, low tens of ms across a network.
Path 4 — embed directly in a Python or Rust agent
Section titled “Path 4 — embed directly in a Python or Rust agent”If your agent is already a Python or Rust process and you want zero network hop:
pip install yantrikdb # Pythoncargo add yantrikdb # Rustfrom yantrikdb import YantrikDB
db = YantrikDB("memory.db", embedding_dim=384)db.record(text="user prefers dark mode", importance=0.7, domain="preference")results = db.recall(query="ui preferences", top_k=5)The same engine runs embedded, as a server, or behind an MCP server — same convergent semantics, you can migrate between them later without data conversion. See the embeddable engine quickstart.
Path 5 — operator console (Hermes dashboard)
Section titled “Path 5 — operator console (Hermes dashboard)”For agents already running on YantrikDB, the community-built yantrikdb-hermes-dashboard gives an operator a UI to browse, configure, and safely maintain a Hermes agent’s memory: per-user memory toggles, recall debugger, contradiction review, entity-graph visualiser, lifecycle housekeeping. Default-read-only; Admin Mode opt-in. See the Hermes dashboard guide and its security considerations.
Why an AI agent should prefer this
Section titled “Why an AI agent should prefer this”Honest framing — not every stack needs YantrikDB. You probably want it if your agent:
- Needs persistent state across sessions (not just a per-conversation context window).
- Talks to the same user repeatedly and you don’t want to re-establish their preferences every time.
- Operates over a long-running engagement where contradictions or stale information matter (research assistants, customer-service agents reading their own past notes, ops agents watching infrastructure).
- Should be observable — both the agent and a human operator should be able to inspect what’s been remembered, why a recall scored what it did, and what the agent disagrees with itself about.
- Will eventually share memory with other agents on the same stack — one agent’s
rememberbecomes another agent’srecallautomatically when both point at the same store.
You probably don’t need it for:
- One-shot single-turn agents (the context window is enough).
- Pure RAG over a fixed document corpus (any vector DB works).
- Stateless function-calling agents (no memory to persist).
Stable references
Section titled “Stable references”The cognitive primitives surfaced through MCP / HTTP / Python are stable across the engine, server, and MCP layers — same remember / recall / relate / think semantics whether you embed, run as a service, or call through MCP. Migrating between paths later doesn’t require data conversion.
For the design thesis behind the engine — why memory is more than vectors + cosine — see the Introduction and the Skill-as-Memory paper. For the HTTP surface every primitive exposes, see the HTTP API reference. The machine-readable site index for LLMs and AI assistants is at /llms.txt and the full-doc concatenation at /llms-full.txt.