Why forget about boxes and deterministic control and start thinking of error tolerance and recovery?
I know, that LLMs are statistical models, but can you not use patterns to enforce a deterministic outcome? (Single responsibility for each agent, retrying llm calls, rephrasing prompts, etc?)