DocsOverviewIntroduction

Introduction

RecallOS is the memory infrastructure layer for AI agents and applications. It provides a complete system for storing, indexing, and retrieving knowledge with sub-50ms latency — giving your AI persistent memory that survives across sessions, conversations, and deployments.

Unlike traditional databases or vector stores, RecallOS was built from the ground up for the unique demands of AI workloads. It combines semantic embeddings, structured metadata, and graph-based relationships into a unified memory layer that any language model can query naturally.

Sub-50ms Retrieval

Optimized vector indices with HNSW graphs deliver results in under 50 milliseconds, even at millions of memories.

Enterprise Security

End-to-end encryption at rest and in transit, SOC 2 Type II compliant, with fine-grained access controls.

Model Agnostic

Works with OpenAI, Anthropic, Google, DeepSeek, Mistral, Llama, and any model that produces embeddings.

Semantic Understanding

Goes beyond keyword matching — understands meaning, context, and relationships between memories.

Currently in v2.4
RecallOS v2.4 introduces streaming retrieval, multi-tenant collections, and a redesigned knowledge graph engine. If you're upgrading from v1.x, see the migration guide.