You know the problem. You tell your AI assistant something on Monday and by Thursday it looks at you like you’ve never met. That’s not a memory. That’s a goldfish. Crosmos gives AI agents a real memory system. Not the sticky-note kind. The kind that actually remembers, connects the dots, and gets smarter over time.Documentation Index
Fetch the complete documentation index at: https://docs.crosmos.dev/llms.txt
Use this file to discover all available pages before exploring further.
Normal RAG: the amnesiac’s filing cabinet
Regular RAG works like this: chunk your documents, embed them, stuff them in a vector database, and when someone asks a question, find the closest chunk by cosine similarity. One signal. One dimension. It works okay for “find me the paragraph about quarterly revenue.” It fails spectacularly for things like:- “What did I tell you about my job last month?”
- “How has my opinion on remote work changed over time?”
- “Who did I say I was working with on the AI project?”
A living knowledge graph that grows with every conversation
Instead of flattening everything into vector chunks, Crosmos builds a Monotonic Temporal Knowledge Graph.Monotonic: it only grows, never rewrites
Most systems update records in place. New information overwrites old information. “User lives in Berlin” replaces “User lives in Tokyo.” Problem solved, right? Wrong. You just destroyed history. You no longer know the user ever lived in Tokyo. You can’t answer “Where did I live before Berlin?” Crosmos never deletes or overwrites. Every observation is appended. The graph evolves monotonically: new nodes and edges are added, and older ones that lose relevance are gently pruned by a smart forgetting system that considers importance, recency, and access patterns. The history you need stays, the noise fades.Temporal: every fact has a timestamp
Not just when the system recorded it, but when the event actually happened in the real world. “I started learning Rust in March 2024” has a different temporal meaning than “the system ingested this sentence in January 2025.” Crosmos tracks both. And when you ask “What was I working on last summer?” it uses the event time, not the ingestion time, to find the answer. This is why Crosmos can handle questions like “What changed since we last talked?” or “What did I say about my job in October?” The time dimension is baked into the data model, not bolted on as metadata.Knowledge graph: facts are connected, not isolated
Every piece of knowledge in Crosmos is stored as a relationship between two entities:The retrieval pipeline: four signals, one answer
When you ask Crosmos a question, it doesn’t rely on a single search method. It fires four independent signals in parallel and fuses them together.Semantic search
Embeds your query and finds memories with similar meaning using HNSW indexing. Catches the obvious matches, the things that directly relate to what you’re asking.
Keyword search
Full-text search with relevance scoring. Catches exact name matches, specific terms, and things semantic search might miss. Sometimes you just need to find “Photoshop” and cosine similarity isn’t the best way.
Graph traversal
Walks the knowledge graph following relationship edges from your query. Discovers contextually connected memories even if they share no text similarity with your question.
Temporal search
Activated when a query contains time references like “last summer” or “in October.” Extracts a date window and ranks memories by proximity to that time range. Finds things based on when they happened, not just what they’re about.
Fusion
All four signals are fused together, balancing agreement and disagreement across sources. Then a recency boost adjusts scores based on how fresh the memory is, so recent knowledge naturally surfaces first. The result: you get answers that are semantically relevant, keyword-accurate, graph-connected, temporally aware, and recency-boosted. All at once.The ingestion pipeline
Here’s what happens when you feed Crosmos a conversation or document:Extract
Facts, entities, and relationships are pulled from the raw content. Not summaries. Not keywords. Structured knowledge.
Resolve
Entity mentions are deduplicated. “Rust” and “rust-lang” and “the Rust programming language” all resolve to the same entity node. Entities that share a name but mean different things are kept separate.
Link
Every fact is connected to its source entities via graph edges with confidence scores and timestamps. The knowledge graph grows.
Built for production
Crosmos isn’t a research prototype. It’s designed for production AI agents that need memory they can trust.Multi-tenant
Every organization gets isolated memory spaces. No cross-contamination.
Soft delete
Memories are marked forgotten, never destroyed. Full audit trail.
Content-agnostic
Feed it conversations, documents, markdown, PDFs. The pipeline normalizes everything.
Deterministic retrieval
Results are consistent and predictable. Same query, same answer, every time.