Analysis: Spawn-per-Request vs. Long-running Daemon

This analysis compares the "Spawn-per-Request" (CGI Model) with the "Long-running Daemon" model for managing Zene's lifecycle in a web context.

1. How it works

Extreme Isolation: Fresh OS process per task.
Lower Complexity: No multi-threading or complex state management in the engine.
Stateless Backend: Simple management.

Task	Time Cost	Impact
Model Load	100ms - 500ms	Loading local embedding models (`fastembed`).
MCP Connect	200ms - 1s	Handshaking with sub-processes.
Context Index	500ms - 2s	Parsing `tree-sitter` symbols.

Result: A "latency tax" of ~2 seconds before the Agent starts.

Instead of "Spawn-on-Demand," the outer app can maintain a Pool of Zene Daemons:

If your goal is simplicity and safety, the "Spawn-per-Request" model is the winner. If you need instant response, Zene needs to stay "Warm."