Alphaavatar
A real-time interactive Omni Avatar built on LiveKit, which allows you to seamlessly integrate with any open source Avatar components (real-time model, visual, voice, memory, search, etc.).
Installation
npx alphaavatarAsk AI about Alphaavatar
Powered by Claude Β· Grounded in docs
I know everything about Alphaavatar. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
AlphaAvatar Introduction
AlphaAvatar is a self-hostable Omni Personal Assistant framework designed to evolve into an intelligent personal butler β a continuous, personalized, and proactive assistant that can remember, understand, plan, and act on behalf of the user.
It is built around a plugin-based real-time Agent architecture, combining:
- π§ Memory for long-term user, assistant, and tool interaction history
- 𧬠Persona for user understanding, identity continuity, and personalization
- π‘ Reflection for self-improvement and long-term behavioral adaptation
- π Planning for task decomposition, reminders, and future-oriented actions
- βοΈ Behavior for response style, workflow policy, and proactive assistance
- π§° Tools through MCP, RAG, DeepResearch, and external integrations
- π Virtual Character for real-time voice/avatar interaction
β¨ Fully self-hostable and privacy-first β AlphaAvatar can run locally or on your own infrastructure, giving you control over your data, memory, tools, and behavior.
What AlphaAvatar Is Designed For
1οΈβ£ Personal Data & Life Metrics Management
|
2οΈβ£ Knowledge & Notes Management
|
3οΈβ£ Task & Event Management
|
4οΈβ£ Autonomous Planning & Execution
|
5οΈβ£ Personalized Companion & Context Awareness
|
6οΈβ£ External World Interaction
|
π‘ AlphaAvatar is not just a chatbot. It is a foundation for building stateful, proactive, multimodal, and self-evolving personal AI assistants.
AlphaAvatar Plugins
π§ Memory
Self-improving memory module for Omni-Avatar. |
𧬠Persona
Automatic extraction and real-time matching of multimodal user persona. |
π‘ Reflection
A self-improvement module that reflects on memory, behavior, and interaction history. |
π Planning
Long-horizon planning module for tasks, reminders, goals, and multi-step workflows. |
π€ Behavior
Controls response style, workflow policy, tool-use behavior, and proactive assistance rules. |
π Virtual Character
The real-time generated virtual character that visually represents the Avatar during interactions. |
Tools Plugins
π DeepResearch
Allow AlphaAvatar to access the network and perform single-step/multi-step inference through a separate Agent service to search for more accurate content. |
π RAG
Allow AlphaAvatar to access Documents/Skills (user-uploaded/generated by the Reflection module/URL access) to obtain document-related information. |
π§° MCP
Allows AlphaAvatar to discover and call real-world external tools such as databases, email, calendars, APIs, and productivity apps. |
π SANDBOX
Provide AlphaAvatar with a sandbox environment to interact with the external world or with other agents, thereby enabling multi-agent interaction and exploration. |
Docs and guides
Latest News π₯
-
[2026/05] Released AlphaAvatar version 0.5.4:
- Added LanceDB-backed MCP tool retrieval, enabling AlphaAvatar to semantically search relevant MCP tools based on the Agent query.
- Refactored system prompt and runtime prompt composition to improve prefix-cache efficiency and support dynamic runtime context injection.
- Improved the Persona plugin with deterministic runtime state tracking, including timezone, login time, session ID, room type, and login count.
- Added temporary-user to real-user identity merging, including user workspace migration and identity-aware plugin storage.
- Improved RAG runtime behavior with initialization waiting, current/temporary workspace querying, and LLM-friendly query output.
-
[2026/04] Released AlphaAvatar version 0.5.3:
- Added localized Markdown backup for the Memory plugin.
- Added LanceDB as the default local VDB option when Qdrant credentials are not provided.
-
[2026/03] Released AlphaAvatar version 0.5.0:
-
[2026/02] Released AlphaAvatar version 0.4.0:
- Added RAG support through RAG-Anything.
- Optimized the Memory and DeepResearch modules.
- Released AlphaAvatar version 0.4.1: Fixed Persona plugin bugs and added a new MCP plugin.
-
[2026/01] Released AlphaAvatar version 0.3.0:
- Added DeepResearch support through the Tavily API.
- Released AlphaAvatar version 0.3.1: Added tool-call memory extraction during userβassistant interactions.
2025 Release History
-
[2025/12] Released AlphaAvatar version 0.2.0:
- Added AIRI Live2D-based virtual character display.
-
[2025/11] Released AlphaAvatar version 0.1.0:
- Added automatic memory extraction.
- Added automatic user persona extraction and matching.
Installation βοΈ
Install stable AlphaAvatar version from PyPI:
uv venv .my-env --python 3.11
source .my-env/bin/activate
pip install alpha-avatar-agents
Install latest AlphaAvatar version from GitHub:
git clone --recurse-submodules https://github.com/AlphaAvatar/AlphaAvatar.git
cd AlphaAvatar
uv venv .venv --python 3.11
source .venv/bin/activate
uv sync --all-packages
Quick Start β‘οΈ
Start your agent in dev mode to connect it to LiveKit and make it available from anywhere on the internet.
π§© Step 1. Configure Environment Variables
cd AlphaAvatar
# Copy template
cp .env.template .env.dev
Edit .env.dev and set required environment variables.
π¦ Step 2. Download Required Files
alphaavatar download-files
β Step 3. Run the Agent
ENV_FILE=.env.dev alphaavatar dev examples/agent_configs/pipeline_openai_airi.yaml
# or
ENV_FILE=.env.dev alphaavatar dev examples/agent_configs/pipeline_openai_tools.yaml
To see more supported modes, please refer to the LiveKit doc.
To see more examples, please refer to the Examples README
Usage π
AlphaAvatar supports multiple Access Channels, allowing different types of users β from end users to developers β to interact with the system.
π§ Runtime Architecture
AlphaAvatar is built around a real-time multimodal Agent runtime. It separates streaming interaction, model reasoning, runtime context, plugin state, tool execution, and channel adapters.
AlphaAvatar Runtime
βββββββββββββββββββ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β LiveKit Room β
β audio / text / video / data streams β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β AgentSession β
β real-time turn detection / streaming IO β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β AvatarEngine β
β Realtime Model Pipeline β
β STT / VAD / LLM / TTS / video-ready input β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β Prompt & Runtime Context Layer β
β system prompt / runtime prompt / time / mode β
β memory / persona / plan / reflection / rules β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
ββββββββββββββββββββββββΌββββββββββββββββββββββββ
β β β
βΌ βΌ βΌ
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β Memory Plugin β β Persona Plugin β β Tools Plugins β
β conversation β β user profile β β MCP / RAG β
β tool memory β β speaker / face β β DeepResearch β
β avatar memory β β runtime state β β external APIs β
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β Reflection Pluginβ β Planning Plugin β β Behavior Plugin β
β self-analysis β β tasks / plans β β response policy β
β adaptation β β reminders β β workflow rules β
ββββββββββββββββββββ ββββββββββββββββββββ ββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββ
β UserPath Workspace β
β local data / cache / logs / memory / RAG β
β persona state / artifacts / user files β
ββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β
ββββββββββββββββββββββββ΄ββββββββββββββββββββββββ
β β
βΌ βΌ
ββββββββββββββββββββββββ ββββββββββββββββββββββββ
β Native Web / App β β Channel Adapters β
β voice / text / video β β WhatsApp / future β
β WebRTC / UI β β messaging platforms β
ββββββββββββββββββββββββ ββββββββββββββββββββββββ
π Web Access
AlphaAvatar now provides a browser-based realtime demo interface built on LiveKit.
π Try the Web Demo: https://www.alphaavatar.ai/demo
The Web Demo supports:
- ποΈ Real-time voice interaction
- π¬ Text chat with the Avatar
- π· Camera preview and video-ready interaction
- π Agent audio playback
- π Virtual character / avatar stage
- π§ Full plugin support, including Memory, Persona, RAG, MCP, and DeepResearch
- π Browser timezone metadata, enabling AlphaAvatar to understand local login time
The Web Demo is the recommended way to try AlphaAvatar with a full realtime multimodal experience.
π¬ Social & Messaging Platforms
Interact with AlphaAvatar directly inside messaging platforms.
Capabilities:
- π¬ Text-based conversation
- π€ Voice message interaction
- π§° Tool invocation via chat interface
π¦ Channel introduction: README
βΆοΈ Start WhatsApp Channel
Make sure AlphaAvatar Agent is already running (see Quick Start above).
ENV_FILE=.env.dev sh examples/channels/start_whatsapp.sh
π‘ The WhatsApp channel runs as an independent bridge process and connects to the Agent runtime.
Slack
π² Native Mobile App
A dedicated AlphaAvatar mobile application providing:
- ποΈ Real-time voice communication
- π Live2D / Virtual character visualization
- π§ Persistent memory & persona
π§ͺ Developer Playground
Developers can immediately access AlphaAvatar via the LiveKit Playground.
π https://agents-playground.livekit.io/
After starting your AlphaAvatar server:
- Connect to your LiveKit instance
- Configure the Agent name in the Playground (must match
avatar_name, default:Assistant) to enable Explicit Dispatch. - Connect to the agent room
- Start testing real-time interaction
Supported capabilities:
- ποΈ Voice interaction
- π§ Memory extraction
- π RAG retrieval
- π§° MCP tool invocation
- π Virtual character display

π‘ AlphaAvatar is currently developer-first, with a Web Demo available for realtime interaction.
More user-facing web and mobile experiences are under active development.
