Moltis
A personal AI gateway written in Rust.
One binary, no runtime, no npm.
One binary, no runtime, no npm.
Moltis compiles your entire AI gateway — web UI, LLM providers, tools, and all assets — into a single self-contained executable. There’s no Node.js to babysit, no node_modules to sync, no V8 garbage collector introducing latency spikes.
# Quick install (macOS / Linux)
curl -fsSL https://www.moltis.org/install.sh | sh
Why Moltis?
| Feature | Moltis | Other Solutions |
|---|---|---|
| Deployment | Single binary | Node.js + dependencies |
| Memory Safety | Rust ownership | Garbage collection |
| Secret Handling | Zeroed on drop | “Eventually collected” |
| Sandbox | Docker + Apple Container | Docker only |
| Startup | Milliseconds | Seconds |
Key Features
- 30+ LLM Providers — Anthropic, OpenAI, Google, Mistral, local models, and more
- Streaming-First — Responses appear as tokens arrive, not after completion
- Sandboxed Execution — Commands run in isolated containers (Docker or Apple Container)
- MCP Support — Connect to Model Context Protocol servers for extended capabilities
- Multi-Channel — Web UI, Telegram, API access with synchronized responses
- Long-Term Memory — Embeddings-powered knowledge base with hybrid search
- Hook System — Observe, modify, or block actions at any lifecycle point
- Compile-Time Safety — Misconfigurations caught by
cargo check, not runtime crashes
Quick Start
# Install
curl -fsSL https://www.moltis.org/install.sh | sh
# Run
moltis
On first launch:
- Open the URL shown in your browser (e.g.,
http://localhost:13131) - Add your LLM API key
- Start chatting!
Authentication is only required when accessing Moltis from a non-localhost address. On localhost, you can start using it immediately.
How It Works
┌─────────────┐ ┌─────────────┐ ┌─────────────┐
│ Web UI │ │ Telegram │ │ API │
└──────┬──────┘ └──────┬──────┘ └──────┬──────┘
│ │ │
└────────────────┴────────────────┘
│
▼
┌───────────────────────────────┐
│ Moltis Gateway │
│ ┌─────────┐ ┌───────────┐ │
│ │ Agent │ │ Tools │ │
│ │ Loop │◄┤ Registry │ │
│ └────┬────┘ └───────────┘ │
│ │ │
│ ┌────▼────────────────┐ │
│ │ Provider Registry │ │
│ │ Claude · GPT · Gemini │ │
│ └─────────────────────┘ │
└───────────────────────────────┘
│
┌───────▼───────┐
│ Sandbox │
│ Docker/Apple │
└───────────────┘
Documentation
Getting Started
- Quickstart — Up and running in 5 minutes
- Installation — All installation methods
- Configuration —
moltis.tomlreference
Features
- Providers — Configure LLM providers
- MCP Servers — Extend with Model Context Protocol
- Hooks — Lifecycle hooks for customization
- Local LLMs — Run models on your machine
Deployment
- Docker — Container deployment
Architecture
- Streaming — How real-time streaming works
- Metrics & Tracing — Observability
Security
Moltis applies defense in depth:
- Authentication — Password or passkey (WebAuthn) required for non-localhost access
- SSRF Protection — Blocks requests to internal networks
- Secret Handling —
secrecy::Secretzeroes memory on drop - Sandboxed Execution — Commands never run on the host
- Origin Validation — Prevents Cross-Site WebSocket Hijacking
- No Unsafe Code —
unsafeis denied workspace-wide
Community
- GitHub: github.com/moltis-org/moltis
- Issues: Report bugs
- Discussions: Ask questions
License
MIT — Free for personal and commercial use.