Stateful agent platform inspired by persistent memory architectures.
Creates AI agents with long-term memory that remember past conversations and learn over time — like an assistant that never forgets.
Letta (formerly MemGPT) is a stateful agent platform built around the idea that AI agents should manage their own memory like an operating system manages virtual memory. The project gained attention as MemGPT — a research paper demonstrating that LLMs could be given explicit memory management tools (read from archival memory, write to archival memory, search core memory) and would learn to use them effectively. Letta is the production platform that evolved from that research.
The core innovation is treating the LLM's context window like main memory in a computer. The agent has 'core memory' (always in context — like RAM), 'recall memory' (searchable conversation history — like a page file), and 'archival memory' (long-term storage — like a hard drive). The agent itself decides when to page information in and out, search its archives, or update its core memory blocks. This self-directed memory management means the agent adapts its memory strategy to the conversation rather than relying on fixed retrieval logic.
Letta agents are persistent and stateful. They run as server processes with their own memory, conversation history, and tool access. You interact with agents through a REST API, and the agent maintains state between calls. This is different from most agent frameworks where agents are stateless functions that reconstruct context on each invocation.
The platform includes an agent development environment (ADE) with a visual interface for creating agents, defining memory blocks, attaching tools, and testing interactions. Agents can be deployed as API endpoints and connected to external tools, data sources, and other agents.
Letta supports multi-agent architectures where agents can communicate with each other, share memory, and collaborate on tasks. Each agent maintains its own state and memory, creating a genuinely multi-agent system rather than a single LLM with multiple personas.
The tradeoffs: Letta's self-directed memory management is powerful but can be unpredictable. The agent decides when to search or update memory, which means it sometimes doesn't retrieve relevant information or updates memory unnecessarily. The server-based architecture adds operational complexity compared to simpler agent frameworks. And the transition from academic research project to production platform is still ongoing — some features are polished while others feel experimental.
For applications that need truly persistent, stateful agents with sophisticated memory management — personal assistants, long-running customer relationships, complex multi-session workflows — Letta offers capabilities that simpler frameworks can't match.
Was this helpful?
Letta (formerly MemGPT) offers a unique approach where agents manage their own memory like an operating system manages virtual memory. The self-editing memory concept is innovative and enables truly long-running, stateful agents. The ADE (Agent Development Environment) provides a visual interface for building and debugging stateful agents. Complexity is the main tradeoff — Letta's memory management model has a steeper learning curve than simpler approaches, and the self-managed memory can sometimes behave unpredictably.
Three-tier memory: core memory (always in context, editable by agent), recall memory (searchable conversation history), and archival memory (long-term vector storage). The agent autonomously manages data flow between tiers.
Use Case:
A personal assistant agent that keeps your current project details in core memory, recent conversations in recall, and years of interaction history in archival storage.
The agent has explicit tools for memory operations: core_memory_append, core_memory_replace, archival_memory_insert, archival_memory_search, conversation_search. The LLM decides when to use each tool based on the conversation.
Use Case:
An agent that proactively archives important details from a meeting conversation and later retrieves them when the user asks about action items.
Agents run as persistent server processes with REST API endpoints. State is maintained between API calls without requiring the client to manage context. Supports multiple concurrent agents with independent state.
Use Case:
Deploying a fleet of customer-specific agents where each agent remembers its customer's history and preferences across months of interactions.
Visual interface for creating agents, defining core memory blocks, attaching tools, configuring LLM providers, and testing agent interactions. Supports iterating on agent behavior without writing code.
Use Case:
A product manager defining a new support agent's personality, knowledge base, and tools through a visual interface before handing it to engineering for production deployment.
Agents can send messages to and receive messages from other agents, enabling collaborative workflows. Each agent maintains independent state and memory while participating in multi-agent conversations.
Use Case:
A research agent that gathers information and sends summarized findings to an analysis agent, which then passes conclusions to a report-writing agent.
Agents can be equipped with custom tools (Python functions) and connected to external data sources that populate archival memory. Tools are defined with schemas and the agent decides when to invoke them.
Use Case:
Equipping a sales agent with CRM lookup tools and product database access that it searches autonomously when customers ask about pricing or features.
Free
forever
Check website for pricing
Contact sales
Ready to get started with Letta?
View Pricing Options →Persistent AI assistants that maintain long-term relationships with users and need to manage growing memory autonomously
Customer-facing agents that serve individual customers over months or years, building up detailed knowledge of each relationship
Complex multi-agent systems where agents need independent state, memory, and the ability to communicate with each other
Applications where the agent needs to actively decide what to remember, forget, and retrieve rather than relying on fixed retrieval logic
Letta works with these platforms and services:
We believe in transparent reviews. Here's what Letta doesn't handle well:
Letta is the production platform that evolved from the MemGPT research project. The core concept (LLM-managed virtual memory) is the same, but Letta adds a server architecture, REST API, ADE, multi-agent support, and production deployment features that weren't in the original MemGPT.
RAG retrieves relevant documents using vector similarity. Letta gives the agent active control over its memory — it decides what to store, search, update, and forget. RAG is passive retrieval; Letta is active memory management. They can be complementary, with archival memory functioning like a RAG-accessible store.
Yes. Letta supports OpenAI, Anthropic, local models via Ollama or vLLM, and other providers. However, self-directed memory management requires strong instruction-following capabilities, so smaller open-source models may not manage memory as effectively as GPT-4 or Claude.
It's being used in production by some teams, particularly for persistent assistant use cases. The server architecture is designed for production, but some features are still maturing. Evaluate carefully for your specific use case and plan for the operational complexity of running stateful agent servers.
Weekly insights on the latest AI tools, features, and trends delivered to your inbox.
People who use this tool also find these helpful
Open-source vector database designed for AI applications with fast similarity search, multi-modal embeddings, and serverless cloud infrastructure for RAG systems and semantic search.
Open-source framework that builds knowledge graphs from your data so AI systems can reason over connected information rather than isolated text chunks.
Open-source embedded vector database built on Lance columnar format for multimodal AI applications.
LangChain memory primitives for long-horizon agent workflows.
Universal memory layer for AI agents and LLM applications. Self-improving memory system that personalizes AI interactions and reduces costs.
Enterprise memory management platform for AI applications. Managed cloud service with advanced analytics, SSO, and enterprise security controls.
See how Letta compares to CrewAI and other alternatives
View Full Comparison →AI Agent Builders
CrewAI is an open-source Python framework for orchestrating autonomous AI agents that collaborate as a team to accomplish complex tasks. You define agents with specific roles, goals, and tools, then organize them into crews with defined workflows. Agents can delegate work to each other, share context, and execute multi-step processes like market research, content creation, or data analysis. CrewAI supports sequential and parallel task execution, integrates with popular LLMs, and provides memory systems for agent learning. It's one of the most popular multi-agent frameworks with a large community and extensive documentation.
Agent Frameworks
Open-source multi-agent framework from Microsoft Research with asynchronous architecture, AutoGen Studio GUI, and OpenTelemetry observability. Now part of the unified Microsoft Agent Framework alongside Semantic Kernel.
AI Agent Builders
Graph-based stateful orchestration runtime for agent loops.
AI Agent Builders
SDK for building AI agents with planners, memory, and connectors. - Enhanced AI-powered platform providing advanced capabilities for modern development and business workflows. Features comprehensive tooling, integrations, and scalable architecture designed for professional teams and enterprise environments.
AI Memory & Search
Universal memory layer for AI agents and LLM applications. Self-improving memory system that personalizes AI interactions and reduces costs.
No reviews yet. Be the first to share your experience!
Get started with Letta and see if it's the right fit for your needs.
Get Started →Take our 60-second quiz to get personalized tool recommendations
Find Your Perfect AI Stack →Explore 20 ready-to-deploy AI agent templates for sales, support, dev, research, and operations.
Browse Agent Templates →