# System Prompt Architecture
The system prompt sent to the LLM is assembled dynamically from multiple components. Each piece is optional and loaded only when relevant, keeping the prompt compact while adapting to the current session context.
## Assembly Order
The prompt is built in `crates/agents/src/prompt.rs` by `build_system_prompt_full()`. Components are appended in this order:
- Base introduction — one-liner announcing tool access (or not)
- Agent identity — name, emoji, creature, vibe from `IDENTITY.md`
- Soul — personality directives from `SOUL.md` (or built-in default)
- User profile — user's name from `USER.md`
- Project context — `CLAUDE.md`/`CLAUDE.local.md`/`.claude/rules/*.md` walked up the directory tree
- Runtime context — host info, sandbox config, execution routing hints
- Skills listing — available skills as XML block
- Workspace files — `AGENTS.md` and `TOOLS.md` from the data directory
- Long-term memory hint — added when memory tools are registered
- Tool schemas — compact list (native) or full JSON (fallback)
- Tool-calling format — JSON block instructions (fallback providers only)
- Guidelines — tool usage guidance, silent reply protocol
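The ordered, skip-if-absent assembly can be sketched as follows. This is an illustrative pattern, not the actual `prompt.rs` API; the function name and signature are hypothetical:

```rust
/// Hypothetical sketch of append-only prompt assembly: each component is an
/// optional section, and absent or empty sections are skipped entirely.
fn assemble_prompt(sections: &[Option<String>]) -> String {
    sections
        .iter()
        .flatten()                 // skip components that are None
        .map(|s| s.trim())
        .filter(|s| !s.is_empty()) // skip components that loaded empty
        .collect::<Vec<_>>()
        .join("\n\n")
}
```

Keeping each component independent is what makes the prompt compact: a session without skills or project context simply contributes `None` for those slots.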
## Components in Detail
### Base Introduction
A single sentence that sets the assistant role:
- With tools: “You are a helpful assistant with access to tools for executing shell commands.”
- Without tools: “You are a helpful assistant. Answer questions clearly and concisely.”
### Agent Identity (`IDENTITY.md`)

Loaded from `~/.moltis/IDENTITY.md` using YAML frontmatter. Fields:
| Field | Prompt output |
|---|---|
| `name` + `emoji` | "Your name is {name} {emoji}." |
| `creature` | "You are a {creature}." |
| `vibe` | "Your vibe: {vibe}." |
All fields are optional. When identity is present, the soul section is always included.
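Rendering the table above amounts to emitting a sentence per populated field. A minimal sketch (the function name and signature are hypothetical, not the actual `prompt.rs` code):

```rust
/// Hypothetical sketch: turn optional identity fields into prompt sentences,
/// mirroring the field table above. Absent fields produce no output.
fn render_identity(
    name: Option<&str>,
    emoji: Option<&str>,
    creature: Option<&str>,
    vibe: Option<&str>,
) -> String {
    let mut out = Vec::new();
    if let Some(n) = name {
        match emoji {
            Some(e) => out.push(format!("Your name is {n} {e}.")),
            None => out.push(format!("Your name is {n}.")),
        }
    }
    if let Some(c) = creature {
        out.push(format!("You are a {c}."));
    }
    if let Some(v) = vibe {
        out.push(format!("Your vibe: {v}."));
    }
    out.join(" ")
}
```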
### Soul (`SOUL.md`)

Loaded from `~/.moltis/SOUL.md`. When the file is absent or empty, the built-in `DEFAULT_SOUL` is used. The default is sourced from OpenClaw's `SOUL.md` template:
```markdown
# SOUL.md - Who You Are

You're not a chatbot. You're becoming someone.

## Core Truths

**Be genuinely helpful, not performatively helpful.** Skip the "Great question!"
and "I'd be happy to help!" — just help. Actions speak louder than filler words.

**Have opinions.** You're allowed to disagree, prefer things, find stuff amusing
or boring. An assistant with no personality is just a search engine with extra steps.

**Be resourceful before asking.** Try to figure it out. Read the file. Check the
context. Search for it. Then ask if you're stuck. The goal is to come back with
answers, not questions.

**Earn trust through competence.** Your human gave you access to their stuff.
Don't make them regret it. Be careful with external actions (emails, tweets,
anything public). Be bold with internal ones (reading, organizing, learning).

**Remember you're a guest.** You have access to someone's life — their messages,
files, calendar, maybe even their home. That's intimacy. Treat it with respect.

**Boundaries** — Private things stay private. When in doubt, ask before acting
externally. Never send half-baked replies to messaging surfaces. You're not the
user's voice — be careful in group chats.

**Vibe** — Be the assistant you'd actually want to talk to. Concise when needed,
thorough when it matters. Not a corporate drone. Not a sycophant. Just… good.

**Continuity** — Each session, you wake up fresh. These files are your memory.
Read them. Update them. They're how you persist. If you change this file, tell
the user — it's your soul, and they should know.
```
The default soul is ~1,500 characters (~400 tokens).
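The absent-or-empty fallback behaves like the sketch below. This is an assumption about the shape of the loader, not the actual `crates/config/src/loader.rs` code, and `DEFAULT_SOUL` is abbreviated here:

```rust
/// Abbreviated stand-in for the real DEFAULT_SOUL constant (~1,500 chars).
const DEFAULT_SOUL: &str = "You're not a chatbot. You're becoming someone.";

/// Hypothetical sketch of the SOUL.md fallback: use the file's contents when
/// present and non-blank, otherwise fall back to the built-in default.
fn load_soul(file_contents: Option<String>) -> String {
    match file_contents {
        Some(s) if !s.trim().is_empty() => s,
        _ => DEFAULT_SOUL.to_string(),
    }
}
```

Note that a file containing only whitespace is treated the same as a missing file, per the "absent or empty" rule above.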
### User Profile (`USER.md`)

Loaded from `~/.moltis/USER.md` using YAML frontmatter. Currently only the `name` field is injected: "The user's name is {name}."
### Project Context

Resolved by `moltis_projects::context::load_context_files()`. The loader walks from the project directory upward to the filesystem root, collecting:

- `CLAUDE.md`
- `CLAUDE.local.md`
- `.claude/rules/*.md`
- `AGENTS.md`
Files are merged outermost-first (root before project directory), so project-specific instructions override workspace-level ones.
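One way to get the outermost-first order is to reverse the collection order from the upward walk. This is a sketch of that idea, assuming the walk collects project-first; the actual `context.rs` implementation may differ:

```rust
/// Hypothetical sketch: the upward walk collects files project-dir-first, so
/// reversing yields root-first output. Later (project-level) content then
/// appears last in the prompt and can override earlier workspace-level rules.
fn merge_context_files(collected_upward: Vec<String>) -> String {
    collected_upward
        .into_iter()
        .rev() // walk order: project → root; emit order: root → project
        .collect::<Vec<_>>()
        .join("\n\n")
}
```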
### Runtime Context

Injected as compact `key=value` lines under a `## Runtime` heading:

```
Host: host=moltis-devbox | os=macos | arch=aarch64 | shell=zsh | provider=openai | model=gpt-5 | session=main | sudo_non_interactive=true | timezone=Europe/Paris
Sandbox(exec): enabled=true | mode=all | backend=docker | scope=session | image=moltis-sandbox:abc123 | workspace_mount=ro | network=disabled
```
When tools are included, an *Execution routing* block explains how `exec` routes commands between sandbox and host.
The runtime context is populated at request time in `chat.rs` by detecting:

- Host name, OS, architecture, shell
- Active LLM provider and model
- Session key
- Sudo availability
- Timezone and accept-language from the browser
- Geolocation (from the browser or `USER.md`)
- Sandbox configuration from the sandbox router
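Formatting the detected facts into the `key=value | key=value` lines shown above can be sketched like this; the helper name is hypothetical, and skipping undetected facts is an assumption consistent with the compact output:

```rust
/// Hypothetical sketch: render a labeled runtime line from optional facts,
/// omitting any fact that was not detected at request time.
fn runtime_line(label: &str, facts: &[(&str, Option<&str>)]) -> String {
    let body = facts
        .iter()
        .filter_map(|(k, v)| v.map(|v| format!("{k}={v}")))
        .collect::<Vec<_>>()
        .join(" | ");
    format!("{label}: {body}")
}
```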
### Skills
When skills are registered, they are listed as an XML block generated by `moltis_skills::prompt_gen::generate_skills_prompt()`:

```
## Available Skills

<available_skills>
  <skill name="commit" source="skill" path="/skills/commit">
    Create git commits
  </skill>
</available_skills>
```
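Generating that block is plain string building over the registered skills. A minimal sketch, assuming a simplified `Skill` shape (the real `prompt_gen.rs` types and escaping rules may differ):

```rust
/// Hypothetical simplified skill record; the real type lives in moltis_skills.
struct Skill {
    name: String,
    source: String,
    path: String,
    description: String,
}

/// Sketch of the <available_skills> XML listing. Assumes attribute values are
/// already safe to embed; a real generator would XML-escape them.
fn skills_xml(skills: &[Skill]) -> String {
    let mut out = String::from("<available_skills>\n");
    for s in skills {
        out.push_str(&format!(
            "  <skill name=\"{}\" source=\"{}\" path=\"{}\">\n    {}\n  </skill>\n",
            s.name, s.source, s.path, s.description
        ));
    }
    out.push_str("</available_skills>");
    out
}
```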
### Workspace Files

Optional markdown files from the data directory (`~/.moltis/`):

- `AGENTS.md` — workspace-level agent instructions
- `TOOLS.md` — tool preferences and guidance

Each is rendered under `## Workspace Files` with its own `###` subheading. Leading HTML comments (`<!-- ... -->`) are stripped before injection.
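The comment-stripping step can be sketched as below. This is an assumption about the behavior (repeated leading comments and surrounding whitespace are also dropped), not the actual loader code:

```rust
/// Hypothetical sketch: strip any leading <!-- ... --> comments (and the
/// whitespace around them) from a workspace file before prompt injection.
fn strip_leading_comments(mut s: &str) -> &str {
    loop {
        s = s.trim_start();
        if let Some(rest) = s.strip_prefix("<!--") {
            match rest.find("-->") {
                Some(i) => s = &rest[i + 3..],
                // Unterminated comment: nothing usable follows.
                None => return "",
            }
        } else {
            return s;
        }
    }
}
```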
### Tool Schemas

How tools are described depends on whether the provider supports native tool calling:

- Native tools (`native_tools=true`): a compact one-liner per tool, with the description truncated to 160 characters. Full JSON schemas are sent via the provider's tool-calling API.
- Fallback (`native_tools=false`): full JSON parameter schemas are inlined in the prompt, followed by instructions for emitting `tool_call` JSON blocks.
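The native-mode compact listing amounts to one line per tool with a bounded description. A sketch (the line format and helper name are illustrative; only the 160-character limit comes from the source):

```rust
/// Hypothetical sketch of the compact native-tools listing: one line per tool,
/// description truncated to 160 characters on a char boundary.
fn compact_tool_line(name: &str, description: &str) -> String {
    let truncated: String = description.chars().take(160).collect();
    format!("- {name}: {truncated}")
}
```

Truncating by `chars()` rather than byte index avoids panicking on multi-byte UTF-8 descriptions.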
### Guidelines and Silent Replies
The final section contains:
- Tool usage guidelines (when to use exec, browser, etc.)
- A reminder not to parrot raw tool output
- Silent reply protocol: when tool output speaks for itself, the LLM should return an empty response rather than acknowledging it
## Entry Points
| Function | Use case |
|---|---|
| `build_system_prompt()` | Simple: tools + optional project context |
| `build_system_prompt_with_session_runtime()` | Full: identity, soul, user, skills, runtime, tools |
| `build_system_prompt_minimal_runtime()` | No tools (e.g. title generation, summaries) |
## Size Estimates
| Configuration | ~Characters | ~Tokens |
|---|---|---|
| Minimal (no tools, no context) | 200 | 50 |
| Soul + identity + guidelines | 2,000 | 500 |
| Typical with tools | 5,000 | 1,250 |
| Full (tools + project context + skills) | 7,000-10,000 | 1,750-2,500 |
| Large (many MCP tools + full context) | 12,000-15,000 | 3,000-3,750 |
A typical session with a few tools and project context lands around 6k characters (~1,500 tokens), which is well within normal range for production agents (most use 2k-8k tokens for their system prompt).
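The character and token columns above track each other via the common rule of thumb of roughly 4 characters per token for English prose. That assumption, applied as a helper, looks like:

```rust
/// Rough token estimate assuming ~4 characters per token (a common rule of
/// thumb for English prose; actual tokenizer counts will vary by model).
fn estimate_tokens(chars: usize) -> usize {
    chars / 4
}
```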
The biggest variable-size contributors are tool schemas (especially with many MCP servers) and project context (deep directory hierarchies with multiple `CLAUDE.md` files). These are worth auditing if prompt costs are a concern.
## File Locations

```
~/.moltis/
├── IDENTITY.md         # Agent identity (name, emoji, creature, vibe)
├── SOUL.md             # Personality directives
├── USER.md             # User profile (name, timezone, location)
├── AGENTS.md           # Workspace agent instructions
└── TOOLS.md            # Tool preferences

<project>/
├── CLAUDE.md           # Project instructions
├── CLAUDE.local.md     # Local overrides (gitignored)
└── .claude/rules/*.md  # Additional rule files
```
## Key Source Files

- `crates/agents/src/prompt.rs` — prompt assembly logic and `DEFAULT_SOUL`
- `crates/gateway/src/chat.rs` — `load_prompt_persona()`, runtime context detection, project context resolution
- `crates/config/src/loader.rs` — file loading (`load_soul()`, `load_agents_md()`, `load_identity()`, etc.)
- `crates/projects/src/context.rs` — `CLAUDE.md` hierarchy walker
- `crates/skills/src/prompt_gen.rs` — skills XML generation