Product · What you get

Everything your
agents were missing.

One Docker container. Connects to any storage you already have. Gives every agent, every model, and every person on your team a governed, persistent, RAG-enabled API to collaborate on your real data — without moving a byte.

docker pull olympusio/olympus:latest
< 5m
Time to first agent API callFrom docker pull to live RAG on your files
0 MB
Data moved to any cloudInference runs on your hardware. Full stop.
Any
LLM works with OlympusClaude, GPT-4o, Gemini, LLaMA, Ollama models
Agent sessions with persistent memoryState survives restarts, compute changes, model swaps
Works on MacBook / macOSWindows ServerLinux / NAS NetApp ONTAPDell PowerScaleHPE AlletraPure Storage SMB / NFSS3-compatibleAmazon FSxNAzure NetApp Files
Sound familiar?

The six things that are
killing your agent deployments

These aren't edge cases. Every developer building agentic AI hits all six. Olympus was built specifically to eliminate them.

Session continuity

Every session starts cold

You build a capable agent. It does excellent work. You close the session — and it's gone. No memory, no context, no continuity. Tomorrow you start over. This makes agents unreliable for anything real.

Olympus persists agent state to your storage on every write
Data access

Agents can't touch your actual files

You end up copy-pasting docs into prompts, writing brittle file-reading hacks, or just keeping agents away from real data entirely. There's no clean, governed path from an agent to a filesystem. So agents stay in the sandbox.

Olympus gives every agent a secure REST API to your real files
Vendor risk

One API change and you start over

Every developer building on a single LLM knows this anxiety. OpenAI changes pricing. Anthropic deprecates a model. Your entire agent's memory, prompt architecture, and tool integrations are coupled to that vendor. There's no abstraction layer underneath you.

Olympus is model-agnostic — swap any LLM in one config line
Multi-agent coordination

Multi-agent handoff is duct tape

Getting two agents to hand off work, share context, or build on each other's output requires custom orchestration code that's fragile and hard to debug. There's no shared workspace, so every team reinvents it — badly.

Olympus API is the shared workspace every agent reads and writes to
Enterprise deployment

Compliance kills every interesting deployment

You build something genuinely useful, try to deploy it inside a real company, and immediately hit the wall. Security review. Data governance. "You can't send company files to OpenAI." The use case dies — not because AI failed, but because there's no governed path to production.

Olympus runs on-prem, enforces AD permissions, logs every access
RAG in production

RAG is harder than it looks

Everyone starts with RAG thinking it's solved. Then you hit chunking problems, stale indexes, permission leakage between users, and models confidently hallucinating from the wrong document. Building RAG that actually works at scale is a serious engineering project most teams badly underestimate.

Olympus handles indexing, freshness, and per-user permission enforcement
What you get

Five things that work
the moment you install

Not a roadmap. Not a beta. Every one of these is live in the container you pull today.

Feature 01
● Live on install

Instant AI API on your existing storage

The moment you run the container and point it at your storage, every file gets a REST API. No migration. No new hardware. No new storage contracts. Your NAS didn't change — it just became agent-ready.

  • Works on MacBook, Windows Server, Linux NAS, or enterprise storage
  • Auto-indexes directories on connect — no manual setup
  • Every file accessible to any agent via standard REST calls
  • Upload, download, search, and chat on files through the same API
GET /api/v1/directory/get-root-directories
# Your NAS directories — instantly accessible
curl https://localhost:3000/api/v1/directory/get-root-directories \
  -H "Authorization: Bearer <token>"

✓ 200 OK — directories indexed
iddir_8841929READY
name/contracts/Q3_2025
files847 files · 12.4 GB
indexed100% · last sync 4s ago
curl -X POST .../api/v1/file/upload \
  -F "file=@Q4_forecast.xlsx" \
  -F "directoryId=dir_8841929"
Feature 02
● Permission-aware

RAG on your real documents — out of the box

Point any agent at any directory and ask questions against your actual files. Contracts, engineering specs, financials, research — all queryable by any LLM. Permission-aware, in place, always current.

  • Ask natural language questions across any directory of files
  • Works with any LLM — Claude, GPT-4o, Gemini, or local Ollama models
  • Agents only see files the authenticated user is authorized to access
  • Index stays current — new files are queryable within seconds
POST /api/v1/genai/chat-on-directory
curl -X POST .../api/v1/genai/chat-on-directory \
  -d '{
    "directoryId": "dir_8841929",
    "question": "Which contracts renew in Q1 > $500k?",
    "selectedModel": "claude-3-5-sonnet@anthropic"
  }'

# Same call — swap any model
"selectedModel": "gpt-4o@openai"
"selectedModel": "llama3.1@ollama"
✓ RAG — sources cited, permissions enforced
answer"3 contracts match: Acme Corp ($820k)..."
sourcesAcme_MSA_2024.pdf · GlobalCo_Renewal.docx
egressed0 bytesSECURE
Feature 03
● Survives restarts

Persistent agent memory and state

Agents stop starting cold. Every session picks up where the last one left off. Memory, work products, and session context are written back through the API to your storage — surviving compute restarts, model swaps, and provider changes.

  • Agents write memories as files — readable by any subsequent session
  • Shut down GPU between tasks — restart anywhere with full context
  • Work products versioned and stored alongside source documents
  • Dramatically reduces re-contextualization cost across long-running tasks
Agent memory — write & resume
# Agent saves memory after every task
curl -X POST .../api/v1/file/upload \
  -F "file=@session_memory.json" \
  -F "directoryId=agent_state_dir"

# Next session — resume with full context
curl -X POST .../api/v1/genai/chat-on-directory \
  -d '{ "directoryId": "agent_state_dir",
        "question": "Resume my last task" }'
✓ Agent resumed — zero context lost
memory_files12 files · sessions 42–53
last_task"Contract review — Acme Q3 renewal"
statusResumed · compute was off 14hREADY
Feature 04
● AD / SSO integrated

One governed layer for every agent and every person

Claude, GPT-4o, Gemini, LLaMA, your custom models, and your human team all go through the same Olympus API. Every call is permission-checked against Active Directory. Every access is logged. IT finally has visibility.

  • Integrates with Active Directory and LDAP — your existing user directory
  • SSO support — agents authenticate as scoped service accounts
  • Per-user permission enforcement — agents only see what their user sees
  • Full audit log of every file access, every model call, every write
Auth & permissions
curl -X PUT .../api/v1/entrance/login \
  -d '{
    "email": "agent-svc@company.com",
    "apiKey": "svc_key_xyz"
  }'
✓ Auth — AD groups resolved
tokeneyJhbGc... (scoped to AD groups)
groupslegal-team · contracts-read · finance-ro
access14 of 38 directoriesGOVERNED
# Agents cannot access dirs outside AD groups
# Every call appended to immutable audit log
Feature 05
● Zero vendor lock-in

Full model freedom — swap any LLM, any time

Because Olympus sits between your agents and your data, you are never married to a model vendor. Run Claude and GPT simultaneously on the same directory. Switch models with a single parameter.

  • Works with Anthropic, OpenAI, Google, Mistral, Meta, and any Ollama model
  • Run multiple models simultaneously on the same data
  • Switch providers mid-project — agent memory and RAG unaffected
  • Local inference via Ollama for fully air-gapped deployments
Model freedom — one API, every LLM
# Cloud models — identical API call
"selectedModel": "claude-3-5-sonnet@anthropic"
"selectedModel": "gpt-4o@openai"
"selectedModel": "gemini-1.5-pro@google"

# Local / air-gapped (via Ollama)
"selectedModel": "llama3.1@ollama"
"selectedModel": "deepseek-r1@ollama"

✓ Your data. Any model. Always.
data egressed0 bytes — inference on your hardware
vendor_locknone — swap in a single parameterFREE
How it works

Three steps from
zero to agent-ready

No new storage to provision. No data to migrate. No cloud accounts to configure. Deploy Olympus as a container and point it at what you already have.

01

Run the container

Pull the Github deploy script. Self-contained — no external dependencies, no cloud calls required to start.

docker run · 60 seconds
02

Connect your storage

Point Olympus at your existing storage — SMB share, NFS mount, S3 bucket, or native NetApp/Dell/HPE API. Olympus auto-indexes your directories. Every file immediately RAG-ready through the API.

auto-indexed · no migration
03

Your agents connect

Any agent — Claude, OpenAI, Gemini, your custom model — authenticates with the Olympus API and immediately has governed, permission-aware, persistent access to all of your real data. Memory persists. Lock-in is gone.

any model · live immediately
terminal — full deployment walkthrough
# ── Step 1: Pull the image ──────────────────────────────────────
docker pull olympusio/olympus:latest

# ── Step 2: Run with your storage and AD config ─────────────────
docker run -d \
  -p 3000:3000 \
  -e NAS_HOST=your-nas.company.com \
  -e AD_LDAP_URL=ldap://dc.company.com \
  -e OLLAMA_HOST=http://gpu-server:11434 \
  -e API_KEY=your-api-key \
  --name olympus \
  olympusio/olympus:latest

# ── Step 3: Authenticate ────────────────────────────────────────
curl -X PUT http://localhost:3000/api/v1/entrance/login \
  -d '{ "email": "you@company.com", "apiKey": "your-api-key" }'

# ── Step 4: Browse your storage ─────────────────────────────────
curl http://localhost:3000/api/v1/directory/get-root-directories \
  -H "Authorization: Bearer <token>"

# ── Step 5: Run RAG on any directory ────────────────────────────
curl -X POST http://localhost:3000/api/v1/genai/chat-on-directory \
  -d '{
    "directoryId": "<id from step 4>",
    "question": "Summarize key themes across these documents",
    "selectedModel": "claude-3-5-sonnet@anthropic"
  }'

✓ Your storage is now agent-ready. Every model. Zero egress. Full governance.

Start building
in the next five minutes.

One container. Any storage. Every model. No vendor owns your data layer but you.

Scroll to Top