Ø
Tensorpunk  Labs  // RELAY
Coming Soon
<< // context flow protocol // >>

RELAY

Advanced Agentic Memory
100%
Recall@5 Oracle
97%
Recall@5 S-Variant
92.2%
QA Accuracy
// #1 retrieval on LongMemEval //
Benchmark Results
↓↓
LongMemEval Benchmark
Evaluated on LongMemEval (ICLR 2025). 500 questions across 6 reasoning categories.
Independent GPT-4o judge — no self-grading.
Retrieval (Oracle)
100%
500/500 — perfect recall
Retrieval (S-Variant)
97.0%
485/500 — 50 sessions/query
Retrieval recall_all
99.4%
Oracle — ALL gold sessions
End-to-End QA
92.2%
Claude Opus 4.6 + GPT-4o judge
# System Oracle S-Variant QA
1 Relay (ours) 100.0% 97.0% 92.2%
2 neuromcp 99.9%
3 MemPalace 96.6%* 96.6%*
4 Ada Memory 96.6%
5 AgentMemory 95.2% 96.2%
6 OMEGA 95.4%
7 Chronos (PwC) 95.6%
8 Mastra 94.9%
* MemPalace score disputed — see Issue #29 for audit details
QA: Claude Opus 4.6 generates answers, GPT-4o judges correctness (LongMemEval standard)
QA Accuracy by Category (Oracle, 500 questions)
Single-Session Assistant
98.2%
Single-Session User
95.7%
Knowledge Update
93.6%
Multi-Session
93.2%
Temporal Reasoning
92.5%
Preference
63.3%