Product · What you get

Everything your
agents were missing.

One Docker container. Connects to any storage you already have. Gives every agent, every model, and every person on your team a governed, persistent, RAG-enabled API to collaborate on your real data — without moving a byte.

Deploy Free ↓ Watch a Demo →

docker pull olympusio/olympus:latest

< 5m

Time to first agent API call
From docker pull to live RAG on your files

0 MB

Data moved to any cloud
Inference runs on your hardware. Full stop.

Any

LLM works with Olympus
Claude, GPT-4o, Gemini, LLaMA, Ollama models

∞

Agent sessions with persistent memory
State survives restarts, compute changes, model swaps

Works on: MacBook / macOS · Windows Server · Linux / NAS · NetApp ONTAP · Dell PowerScale · HPE Alletra · Pure Storage · SMB / NFS · S3-compatible · Amazon FSxN · Azure NetApp Files

Sound familiar?

The six things that are
killing your agent deployments

These aren't edge cases. Every developer building agentic AI hits all six. Olympus was built specifically to eliminate them.

Session continuity

Every session starts cold

You build a capable agent. It does excellent work. You close the session — and it's gone. No memory, no context, no continuity. Tomorrow you start over. This makes agents unreliable for anything real.

Olympus persists agent state to your storage on every write

Data access

Agents can't touch your actual files

You end up copy-pasting docs into prompts, writing brittle file-reading hacks, or just keeping agents away from real data entirely. There's no clean, governed path from an agent to a filesystem. So agents stay in the sandbox.

Olympus gives every agent a secure REST API to your real files

Vendor risk

One API change and you start over

Every developer building on a single LLM knows this anxiety. OpenAI changes pricing. Anthropic deprecates a model. Your entire agent's memory, prompt architecture, and tool integrations are coupled to that vendor. There's no abstraction layer underneath you.

Olympus is model-agnostic — swap any LLM in one config line

Multi-agent coordination

Multi-agent handoff is duct tape

Getting two agents to hand off work, share context, or build on each other's output requires custom orchestration code that's fragile and hard to debug. There's no shared workspace, so every team reinvents it — badly.

Olympus API is the shared workspace every agent reads and writes to

Enterprise deployment

Compliance kills every interesting deployment

You build something genuinely useful, try to deploy it inside a real company, and immediately hit the wall. Security review. Data governance. "You can't send company files to OpenAI." The use case dies — not because AI failed, but because there's no governed path to production.

Olympus runs on-prem, enforces AD permissions, logs every access

RAG in production

RAG is harder than it looks

Everyone starts with RAG thinking it's solved. Then you hit chunking problems, stale indexes, permission leakage between users, and models confidently hallucinating from the wrong document. Building RAG that actually works at scale is a serious engineering project most teams badly underestimate.

Olympus handles indexing, freshness, and per-user permission enforcement

What you get

Five things that work
the moment you install

Not a roadmap. Not a beta. Every one of these is live in the container you pull today.

Feature 01

● Live on install

Instant AI API on your existing storage

The moment you run the container and point it at your storage, every file gets a REST API. No migration. No new hardware. No new storage contracts. Your NAS didn't change — it just became agent-ready.

Works on MacBook, Windows Server, Linux NAS, or enterprise storage
Auto-indexes directories on connect — no manual setup
Every file accessible to any agent via standard REST calls
Upload, download, search, and chat on files through the same API

GET /api/v1/directory/get-root-directories

# Your NAS directories, instantly accessible
curl http://localhost:3000/api/v1/directory/get-root-directories \
  -H "Authorization: Bearer <token>"

✓ 200 OK: directories indexed
id       dir_8841929 (READY)
name     /contracts/Q3_2025
files    847 files · 12.4 GB
indexed  100% · last sync 4s ago

curl -X POST .../api/v1/file/upload \
  -F "file=@Q4_forecast.xlsx" \
  -F "directoryId=dir_8841929"
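For agents written in Python, the same directory listing can be prepared with the standard library alone. This is a minimal sketch, not an official client: the endpoint path comes from the example above, while the base URL (taken from the port mapping used on this page) and the function name are assumptions for illustration. The request is built but not sent, so it runs without a live container.

```python
import urllib.request

# Assumption: Olympus is reachable on the port published by `-p 3000:3000`.
BASE_URL = "http://localhost:3000"


def root_directories_request(token: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) the GET request that lists root directories.

    Hypothetical helper name; only the endpoint path is taken from the page.
    """
    return urllib.request.Request(
        url=f"{base_url}/api/v1/directory/get-root-directories",
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )


if __name__ == "__main__":
    req = root_directories_request("example-token")
    print(req.get_method(), req.full_url)
    # To send it against a running container:
    #   with urllib.request.urlopen(req) as resp:
    #       print(resp.read().decode("utf-8"))
```

Keeping request construction separate from sending makes the call easy to inspect and unit test before pointing it at real storage.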

Feature 02

● Permission-aware

RAG on your real documents — out of the box

Point any agent at any directory and ask questions against your actual files. Contracts, engineering specs, financials, research — all queryable by any LLM. Permission-aware, in place, always current.

Ask natural language questions across any directory of files
Works with any LLM — Claude, GPT-4o, Gemini, or local Ollama models
Agents only see files the authenticated user is authorized to access
Index stays current — new files are queryable within seconds

POST /api/v1/genai/chat-on-directory

curl -X POST .../api/v1/genai/chat-on-directory \
  -d '{
    "directoryId": "dir_8841929",
    "question": "Which contracts renew in Q1 > $500k?",
    "selectedModel": "claude-3-5-sonnet@anthropic"
  }'

# Same call, swap any model
"selectedModel": "gpt-4o@openai"
"selectedModel": "llama3.1@ollama"

✓ RAG: sources cited, permissions enforced
answer    "3 contracts match: Acme Corp ($820k)..."
sources   Acme_MSA_2024.pdf · GlobalCo_Renewal.docx
egressed  0 bytes (SECURE)
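The model swap shown above amounts to changing one field in an otherwise identical JSON body. A rough Python sketch of that idea follows. The three field names and the endpoint path are the ones from the curl example; the helper name, the JSON content type, and the bearer header on this call are assumptions, since the page does not show them for this endpoint.

```python
import json
import urllib.request


def chat_on_directory_request(base_url, token, directory_id, question, model):
    """Build (but do not send) a chat-on-directory request for one model."""
    payload = {
        "directoryId": directory_id,
        "question": question,
        "selectedModel": model,  # the only field that changes between models
    }
    return urllib.request.Request(
        url=f"{base_url}/api/v1/genai/chat-on-directory",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",   # assumption: API expects JSON
            "Authorization": f"Bearer {token}",   # assumption: same auth as other calls
        },
        method="POST",
    )


if __name__ == "__main__":
    question = "Which contracts renew in Q1 > $500k?"
    for model in ("claude-3-5-sonnet@anthropic", "gpt-4o@openai", "llama3.1@ollama"):
        req = chat_on_directory_request(
            "http://localhost:3000", "example-token", "dir_8841929", question, model
        )
        print(json.loads(req.data)["selectedModel"], "->", req.full_url)
```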

Feature 03

● Survives restarts

Persistent agent memory and state

Agents stop starting cold. Every session picks up where the last one left off. Memory, work products, and session context are written back through the API to your storage — surviving compute restarts, model swaps, and provider changes.

Agents write memories as files — readable by any subsequent session
Shut down GPU between tasks — restart anywhere with full context
Work products versioned and stored alongside source documents
Dramatically reduces re-contextualization cost across long-running tasks

Agent memory: write & resume

# Agent saves memory after every task
curl -X POST .../api/v1/file/upload \
  -F "file=@session_memory.json" \
  -F "directoryId=agent_state_dir"

# Next session: resume with full context
curl -X POST .../api/v1/genai/chat-on-directory \
  -d '{ "directoryId": "agent_state_dir",
        "question": "Resume my last task" }'

✓ Agent resumed, zero context lost
memory_files  12 files · sessions 42–53
last_task     "Contract review - Acme Q3 renewal"
status        Resumed · compute was off 14h (READY)
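An agent that keeps its memory in a Python dict could serialize it and package it for the upload call above along these lines. This is a sketch under stated assumptions: the form field names `file` and `directoryId` come from the curl example, but the memory schema, the helper name, and the hand-rolled multipart encoding are illustrative only. Nothing is sent over the network.

```python
import json
import uuid


def encode_memory_upload(memory: dict, directory_id: str,
                         filename: str = "session_memory.json"):
    """Return (body, content_type) for a multipart/form-data upload.

    The memory dict layout is hypothetical; Olympus stores it as a plain file.
    """
    boundary = uuid.uuid4().hex
    file_bytes = json.dumps(memory, indent=2).encode("utf-8")
    parts = [
        # Plain form field carrying the target directory
        (
            f"--{boundary}\r\n"
            'Content-Disposition: form-data; name="directoryId"\r\n\r\n'
            f"{directory_id}\r\n"
        ).encode("utf-8"),
        # File part header, followed by the raw JSON bytes
        (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
            "Content-Type: application/json\r\n\r\n"
        ).encode("utf-8"),
        file_bytes,
        # Closing boundary
        f"\r\n--{boundary}--\r\n".encode("utf-8"),
    ]
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


if __name__ == "__main__":
    memory = {"session": 53, "last_task": "Contract review", "notes": ["draft ready"]}
    body, content_type = encode_memory_upload(memory, "agent_state_dir")
    print(content_type)
    print(len(body), "bytes ready to POST to /api/v1/file/upload")
```

The returned body and content type can be passed to any HTTP client as the request data and `Content-Type` header.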

Feature 04

● AD / SSO integrated

One governed layer for every agent and every person

Claude, GPT-4o, Gemini, LLaMA, your custom models, and your human team all go through the same Olympus API. Every call is permission-checked against Active Directory. Every access is logged. IT finally has visibility.

Integrates with Active Directory and LDAP — your existing user directory
SSO support — agents authenticate as scoped service accounts
Per-user permission enforcement — agents only see what their user sees
Full audit log of every file access, every model call, every write

Auth & permissions

curl -X PUT .../api/v1/entrance/login \
  -d '{
    "email": "agent-svc@company.com",
    "apiKey": "svc_key_xyz"
  }'

✓ Auth: AD groups resolved
token   eyJhbGc... (scoped to AD groups)
groups  legal-team · contracts-read · finance-ro
access  14 of 38 directories (GOVERNED)

# Agents cannot access dirs outside AD groups
# Every call appended to immutable audit log
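In Python, a service account could go from login to an authenticated header roughly as follows. The endpoint, the PUT method, and the `email` / `apiKey` fields are from the example above. The response shape is an assumption: the page shows a token being returned but not the JSON layout, so the top-level `"token"` key and both helper names here are hypothetical.

```python
import json
import urllib.request


def login_request(base_url: str, email: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the login call for a scoped service account."""
    body = json.dumps({"email": email, "apiKey": api_key}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/api/v1/entrance/login",
        data=body,
        headers={"Content-Type": "application/json"},  # assumption: JSON body
        method="PUT",
    )


def bearer_header(login_response_body: bytes) -> dict:
    """Turn a login response into the header used on later calls.

    Assumption: the response is JSON with a top-level "token" field.
    """
    token = json.loads(login_response_body)["token"]
    return {"Authorization": f"Bearer {token}"}


if __name__ == "__main__":
    req = login_request("http://localhost:3000", "agent-svc@company.com", "svc_key_xyz")
    print(req.get_method(), req.full_url)
    # Simulated response body, standing in for a real server reply
    print(bearer_header(b'{"token": "example-token"}'))
```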

Feature 05

● Zero vendor lock-in

Full model freedom — swap any LLM, any time

Because Olympus sits between your agents and your data, you are never married to a model vendor. Run Claude and GPT simultaneously on the same directory. Switch models with a single parameter.

Works with Anthropic, OpenAI, Google, Mistral, Meta, and any Ollama model
Run multiple models simultaneously on the same data
Switch providers mid-project — agent memory and RAG unaffected
Local inference via Ollama for fully air-gapped deployments

Model freedom — one API, every LLM

# Cloud models — identical API call
"selectedModel": "claude-3-5-sonnet@anthropic"
"selectedModel": "gpt-4o@openai"
"selectedModel": "gemini-1.5-pro@google"

# Local / air-gapped (via Ollama)
"selectedModel": "llama3.1@ollama"
"selectedModel": "deepseek-r1@ollama"

✓ Your data. Any model. Always.
data egressed  0 bytes, inference on your hardware
vendor_lock    none, swap in a single parameter (FREE)
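The `selectedModel` values above follow a `name@provider` pattern, which makes it straightforward for an agent to validate a model string or enforce a local-only policy before calling the API. The sketch below is one way to do that. The parsing rule is inferred from the examples on this page, and treating `ollama` as the local provider follows the "Local / air-gapped (via Ollama)" note; the class and function names are hypothetical.

```python
from typing import NamedTuple


class ModelSpec(NamedTuple):
    name: str
    provider: str

    @property
    def is_local(self) -> bool:
        # Assumption: only the "ollama" provider runs on local hardware.
        return self.provider == "ollama"


def parse_model(selected_model: str) -> ModelSpec:
    """Split a 'name@provider' string, rejecting malformed values."""
    name, sep, provider = selected_model.rpartition("@")
    if not sep or not name or not provider:
        raise ValueError(f"expected 'name@provider', got {selected_model!r}")
    return ModelSpec(name, provider)


if __name__ == "__main__":
    for value in ("claude-3-5-sonnet@anthropic", "gemini-1.5-pro@google", "deepseek-r1@ollama"):
        spec = parse_model(value)
        print(spec.name, spec.provider, "local" if spec.is_local else "cloud")
```

An air-gapped deployment could, for example, refuse any request whose parsed spec is not local before it ever reaches the network.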

How it works

Three steps from
zero to agent-ready

No new storage to provision. No data to migrate. No cloud accounts to configure. Deploy Olympus as a container and point it at what you already have.

Run the container

Pull the GitHub deploy script. Self-contained: no external dependencies, no cloud calls required to start.

docker run · 60 seconds

Connect your storage

Point Olympus at your existing storage — SMB share, NFS mount, S3 bucket, or native NetApp/Dell/HPE API. Olympus auto-indexes your directories. Every file immediately RAG-ready through the API.

auto-indexed · no migration

Your agents connect

Any agent — Claude, OpenAI, Gemini, your custom model — authenticates with the Olympus API and immediately has governed, permission-aware, persistent access to all of your real data. Memory persists. Lock-in is gone.

any model · live immediately

terminal — full deployment walkthrough

# ── Step 1: Pull the image ──────────────────────────────────────
docker pull olympusio/olympus:latest

# ── Step 2: Run with your storage and AD config ─────────────────
docker run -d \
  -p 3000:3000 \
  -e NAS_HOST=your-nas.company.com \
  -e AD_LDAP_URL=ldap://dc.company.com \
  -e OLLAMA_HOST=http://gpu-server:11434 \
  -e API_KEY=your-api-key \
  --name olympus \
  olympusio/olympus:latest

# ── Step 3: Authenticate ────────────────────────────────────────
curl -X PUT http://localhost:3000/api/v1/entrance/login \
  -H "Content-Type: application/json" \
  -d '{ "email": "you@company.com", "apiKey": "your-api-key" }'

# ── Step 4: Browse your storage ─────────────────────────────────
curl http://localhost:3000/api/v1/directory/get-root-directories \
  -H "Authorization: Bearer <token>"

# ── Step 5: Run RAG on any directory ────────────────────────────
curl -X POST http://localhost:3000/api/v1/genai/chat-on-directory \
  -H "Authorization: Bearer <token>" \
  -H "Content-Type: application/json" \
  -d '{
    "directoryId": "<id from step 4>",
    "question": "Summarize key themes across these documents",
    "selectedModel": "claude-3-5-sonnet@anthropic"
  }'

✓ Your storage is now agent-ready. Every model. Zero egress. Full governance.

Start building
in the next five minutes.

One container. Any storage. Every model. No vendor owns your data layer but you.

Deploy From GitHub · Book a 30-Min Demo →

The Secure AI Stack for MSPs & ISVs.

© 2025 Olympus.io. All rights reserved.
