Skip to content

Enterprise Agent System Roadmap

Languages: English · 中文

An enterprise agent system is easier to design when the team starts from layers instead of product names. Natural-language input enters at the interaction layer; structured decisions and actions happen in the behavior layer; evidence, artifacts, and checkpoints live in the data layer.

Start with Three Layers

LayerBusiness questionEngineering questionRecommended entrypoint
Interaction layerUsers express goals, clarify missing data, and watch progressThe entrypoint must turn ambiguous goals into tasks and expose process events to UI or IMAgent Auto-Orchestration, Interaction Layer and Active Tasks
System behavior layerThe system judges, plans, calls tools, waits, recoversLong prompts and if / else cannot own every path; contracts, Actions, and lifecycle are neededOutput Control, Actions, TriggerFlow
Data layerLong tasks contain reports, logs, downloads, evidence, rules, checkpointsDo not put everything into Session or prompt; store externally and recall by goal and budgetWorkspace, Long-Running State

Six Capability Boundaries

StageProblem solvedWhat the business seesAgently entrypoint
Engineering skeletonModel request, config, prompt, business code, service entry are mixedOne AI capability can be called from API, script, or UI reliablyQuickstart, Project Framework, FastAPI
Intelligent loopModel output is text and code does not know the next stepCategories, plans, risks, todos, and replies become consumable fieldsOutput Control, Ticket Triage Playbook
External actionTools grow and full exposure causes wrong or unsafe selectionCapabilities have registration, selection, call records, permissions, errorsActions, MCP, Tool Governance
Complex taskA request needs reusable behavior rules, resources, and execution strategyTeam practices can be selected and reused instead of scattered in promptsSkills Executor, Dynamic Task
Long-running stateTasks move across turns, files, and time; context window is too smallArtifacts, evidence, checkpoints, and rules are traceable and recallableWorkspace, Long-Running State
Production governanceAfter launch, the team must debug, control cost, approve risk, and track qualityRuntime facts, eval results, replay, and update strategy are manageableObservability, Production Governance

From Model Request to Agent System

Stabilize One Request First

The first milestone is not a workflow graph. It is a request whose output can be checked and reused:

python
result = (
    agent
    .input("Summarize this customer ticket and decide the next step: ...")
    .output({
        "priority": (str, "high / medium / low", True),
        "intent": (str, "problem the user wants solved", True),
        "next_step": (str, "next business action", True),
        "customer_reply": (str, "message to the customer", True),
    })
    .get_result()
)

data = result.get_data()

This gives downstream code stable fields before the system grows.

Turn Judgments into Structured Decisions

Once the request is stable, the agent can return separate fields for:

PatternSuitable problemDeliverable
ClassificationRoute, priority, risk levelenum/string with evidence
PlanNext actions, dependencies, approvalstructured steps
User replyHuman-facing messagestring field
Internal commandMachine-facing instructioncommand object consumed by deterministic code

User-facing messages and internal commands should not be the same field.

Add External Action Under Control

Actions let the model call capabilities during a request. The system still controls the visible action surface, argument schema, execution adapter, and call records. If the action depends on MCP, a browser, SQLite, Node.js, or a sandbox, ExecutionEnvironment owns the live resource lifecycle.

Add Workflow Lifecycle Only When Needed

TriggerFlow becomes useful when the task has branches, fan-out, waiting, human approval, runtime stream, save/load, or close snapshots. It should wrap multiple requests/actions/stages; it should not be used to make a single unstable prompt look structured.

Add Workspace for Evidence, Not for Everything

Workspace should hold artifacts, observations, decisions, and checkpoints that need to survive across turns or be recalled later. Execution state should keep compact summaries and refs.

Production Shape

text
Gateway
  auth / tenant / route / request validation / process channel

Agent service
  Agent definition / AgentExecution / output contract / result projection

Capability layer
  Actions / MCP / ExecutionEnvironment / business adapters

Workflow worker
  TriggerFlow / Dynamic Task / pause-resume / runtime stream

State and evidence
  Session / Workspace / checkpoint / business DB refs

Observer and eval
  RuntimeEvent / DevTools / representative cases / release evidence

Next Reading