The Dakera Playground Is Live — Try Agent Memory in Your Browser

Five guided scenarios against a live Dakera API — memory store, semantic recall, temporal reasoning, knowledge graph, and decay — from your browser, no account required.

5
live scenarios
<60ms
embedding latency
4
SDK quickstarts
0
setup steps

Most developer tools have a demo. Dakera has a playground — because the only honest benchmark for memory infrastructure is what it does with your data, against a real engine, right now. Open the playground and in under thirty seconds you'll be storing memories, recalling them with sub-60ms latency, and watching the importance scorer rank your results. No account, no Docker, no API key. Just the real thing.

Try the Playground
Live API · No signup · Five guided scenarios
Open Playground

Why a playground?

When developers first encounter Dakera, the most common question is: "What does hybrid retrieval actually feel like at this latency?" A README answers it in theory. The playground answers it in practice.

Every request in the playground goes to a real Dakera instance — the same engine that runs in production, the same ONNX embedding models, the same HNSW index. There's no mock API, no pre-recorded responses, no sleight of hand. When you store a memory, it's stored. When you recall it, the engine runs the full retrieval pipeline: embed your query, traverse the vector index, run BM25 full-text scoring, merge and rerank by importance. You see the actual result, including the importance score the engine computed.

Real calls, real latency. The latency figures you see in the playground are real round-trip times from your browser to the Dakera engine — embedding, indexing, retrieval, and serialization included. The sub-60ms you'll observe is the same performance your agents get in production.

Five scenarios, one complete picture

The playground is organized around five scenarios, each targeting a distinct capability of the Dakera memory model. Work through them in order and you'll have covered the full API surface in under ten minutes.

01
Memory Store & Recall
Store memories with importance scores and recall them by semantic query. Covers the core POST /v1/memories and POST /v1/memories/recall endpoints. See how importance scoring shapes what surfaces at the top.
02
Temporal Reasoning
Store the same entity multiple times across different points in time, then query it. Observe how Dakera weights recency alongside importance to surface what's most relevant now — not just what was most frequently accessed.
03
Knowledge Graph
Create memories that link to named entities, then traverse the knowledge graph to find related context. The same multi-hop reasoning your agents use when a user asks "what did we decide about Alice last month?"
04
Memory Decay
Observe Dakera's importance decay model in action. See how the engine tracks access patterns and adjusts importance over time — keeping frequently-recalled context sharp while fading noise that's never retrieved.
05
Session Scoping
Create a session, tag memories to it, and query in session-scoped vs. cross-session mode. Covers the isolation model that lets you build agents with clean per-conversation context without losing cross-session knowledge.

Sandbox limits — and why they're the right defaults

The playground runs in a sandboxed environment designed to let you explore safely. Each browser session gets its own isolated namespace on the engine — your memories don't mix with anyone else's. Sandbox limits are set to be generous enough for meaningful exploration while protecting the shared infrastructure:

50
memories per session
30 min
session TTL
30/min
request rate limit
100%
session isolation

When your session expires, all data is automatically deleted. Nothing persists between playground sessions — which also means you can explore freely without worrying about leaving test data behind.

SDK quickstarts in four languages

Once you've run the scenarios, the playground surfaces the equivalent SDK code for each operation in your language of choice. Copy it into your project and you're storing your first real memory in under a minute.

🐍
Python
🟦
TypeScript
🦀
Rust
🐹
Go

The SDK examples use the same operations you ran in the playground, against your own self-hosted Dakera instance. Swap in your endpoint and API key, and the code runs unchanged.

# Python — the same recall you ran in the playground
from dakera import AsyncDakeraClient
import asyncio

async def main():
    client = AsyncDakeraClient(
        url="http://localhost:3300",
        api_key="your-key"
    )

    # Store memory with importance
    mem = await client.memories.store(
        agent_id="my-agent",
        content="User prefers async Python patterns and type hints",
        importance=0.85
    )

    # Hybrid recall — same engine as the playground
    results = await client.memories.recall(
        agent_id="my-agent",
        query="coding preferences",
        top_k=5
    )

asyncio.run(main())

From playground to production

You just ran the same memory engine that production Dakera instances use — hybrid retrieval at 88.2% LoCoMo recall accuracy, real importance scoring, real vector search. That's not a simulation. It's the actual system.

The path from playground to production is shorter than you'd expect. There are three ways to go further, in order of time-to-value:

1
Self-host with Docker — 5 minutes
A single docker run dakera/dakera starts a local Dakera server. Then swap the playground's endpoint for localhost:3300 and the code you just ran works unchanged. The quickstart guide walks it step by step.
2
Connect via MCP — 3 config lines
Already using Claude Desktop, Cursor, or Windsurf? The MCP server wires Dakera into your AI tools with a three-line config block. No restart. No new code. Your IDE gets persistent memory immediately.
3
Integrate via SDK — 4 lines of code
Python, TypeScript, Rust, or Go — pick your SDK and you're running the same store/recall pattern you just tested, against your own Dakera instance, in under a minute. The SDK docs show the full API surface.

The playground stays open. Come back to prototype a retrieval pattern, test a query, or benchmark latency before deploying a change. It's a live API, not a snapshot — every request you make in the playground is real.

Ready to run your own?
Self-host in five minutes — Docker or binary
Get Started

Try the Dakera Playground

Five guided scenarios. Live API. No setup required. See what real agent memory looks like in under ten minutes.

Build with Dakera

Give your AI agents persistent memory — self-hosted, production-ready, zero dependencies.