MemoSift
Persistent Memory

Context intelligence
for AI agents.

Files, CSV, JSON, markdown, web results, logs — auto-classified and externalized before they bloat your LLM context. Memories and entities extracted from the content itself. One Python SDK.

RAW TOOL OUTPUT/UNBOUNDED
output.csv12.4k rows
transcript14 turns
classify · scan · extract
COMPACT SUMMARY → LLM~4% tokens
search_res
user_profi
schema
LLM CONTEXT
ARTIFACTS · MEMORIES · ENTITIESverbatim
search_results23 hits
user_profilePII ⚠
schema.sql14 tables
RECALL STORE
EVERY PIECE OF CONTENT GOES THROUGH
Intercept
any content
Classify
23 types
Scan
PII · secrets
Extract
memories · entities
Recall
4 tools
<1ms client
Python 3.12+
pip install memosift
Zero Core Dependencies
Free During Beta

CSV, JSON, code, logs —
externalized, not ingested.

Tool results are auto-classified and stored as retrievable artifacts. The LLM sees a compact summary. Memories and entities are extracted and queryable across sessions. Your token bill drops. Your agent remembers.

<1ms
Intercept + classify
23
Content types detected
~$0.001
Per-turn extraction
SHA-256
Artifact dedup
0.0%
RECALL ACCURACY
47 probes · 170-turn sessions
MEMORY CASCADE
TURN
SESSION
PROJECT

They extract after. We intercept before.

Context intelligence at the tool-execution boundary — not a post-hoc memory layer. Every tool result flows through MemoSift on its way to your LLM.

WITHOUT MEMOSIFT
128k
tokens ingested
every turn. every tool call.
$$$Token bill
Memory
slowLatency
vs
WITH MEMOSIFT
// summary · 1 of 14 turns
user wants Q3 revenue breakdown by region
→ artifact_ref: csv_42a9
→ entities: [Q3, EMEA, APAC]
~5k
tokens to LLM
96% drop. total recall preserved.
-96%Token bill
Persistent
<1msLatency

First-class adapters.
No rewrites.

Decorate your tools, wrap your client, or install hooks. MemoSift meets your agent where it already lives.

Anthropic
native
OpenAI
native
OpenAI Agents
native
Claude Agent SDK
native
LangChain
native
LangGraph
native
Vercel AI
native
Generic
ms.wrap()
1from memosift import MemoSift
2from claude_agent_sdk import Agent
3 
4ms = MemoSift(api_key="msk_...")
5agent = ms.wrap(Agent(tools=[...]))
6 
7# every tool result is now sifted.
8agent.run("summarize Q3 revenue")

Recall that doesn't miss.

RECALL ACCURACY
98.9%
vs 71% (vector only)

47 probes across 170-turn sessions. Hybrid of vector + BM25 + entity-graph overlap, rerank by cross-encoder.

CONTEXT COMPRESSION
96%
token reduction

Two-stage deterministic compaction, type-aware across 23 content types. Tool-call integrity preserved.

LATENCY
<1ms
client overhead

Intercept, classify, and route at the tool boundary. Extraction runs async; your agent never waits.

PII, secrets, injection —
caught before the LLM sees it.

Three-tier compliance pipeline: per-turn findings, session digests with risk trajectory, project-wide reports with executive summaries.

HIPAA
18 Safe Harbor IDs
PCI DSS
CHD · SAD
SOX
financial controls
GDPR
Article 9 special-cat

Ship it in five minutes.

Free during beta. No credit card.