mirror of https://github.com/bytedance/deer-flow.git synced 2026-05-21 07:26:50 +00:00

Files

T

Nan Gao dcc6f1e678 feat(loop-detection): defer warning injection (#2752 )

* fix(loop-detection): defer warn injection to wrap_model_call

The warn branch in LoopDetectionMiddleware injected a HumanMessage
into state from after_model. The tools node had not yet produced
ToolMessage responses to the previous AIMessage(tool_calls=...), so
the new HumanMessage landed *between* the assistant's tool_calls and
their responses. OpenAI/Moonshot reject the next request with
"tool_call_ids did not have response messages" because their
validators require tool_calls to be followed immediately by tool
messages.

Detection now runs in after_model as before, but only enqueues the
warning into a per-thread list. Injection happens in wrap_model_call,
where every prior ToolMessage is already present in request.messages.
The warning is appended at the end as HumanMessage(name="loop_warning")
— pairing intact, AIMessage semantics untouched, no SystemMessage
issues for Anthropic.

Closes #2029, addresses #2255 #2293 #2304 #2511.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* fix(channels): remove loop warning display filter

* feat(loop-detection): scope pending warnings by run

* docs(loop-detection): update docs

* test(loop-detection): assert deferred warnings are queued

* fix(loop-detection): cap transient warning state

* docs: update docs

* add async awrap_model_call test coverage

* docs(loop-detection): document transient warnings

---------

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>

2026-05-21 14:36:07 +08:00

11 KiB

Raw Blame History

Middleware 执行流程

Middleware 列表

create_deerflow_agent 通过 RuntimeFeatures 组装的完整 middleware 链（默认全开时）：

#	Middleware	`before_agent`	`before_model`	`after_model`	`after_agent`	`wrap_model_call`	`wrap_tool_call`	主 Agent	Subagent	来源
0	ThreadDataMiddleware	✓						✓	✓	`sandbox`
1	UploadsMiddleware	✓						✓	✗	`sandbox`
2	SandboxMiddleware	✓			✓			✓	✓	`sandbox`
3	DanglingToolCallMiddleware					✓		✓	✗	始终开启
4	GuardrailMiddleware						✓	✓	✓	Phase 2 纳入
5	ToolErrorHandlingMiddleware						✓	✓	✓	始终开启
6	SummarizationMiddleware		✓					✓	✗	`summarization`
7	TodoMiddleware		✓	✓		✓		✓	✗	`plan_mode` 参数
8	TitleMiddleware			✓				✓	✗	`auto_title`
9	MemoryMiddleware				✓			✓	✗	`memory`
10	ViewImageMiddleware		✓					✓	✗	`vision`
11	SubagentLimitMiddleware			✓				✓	✗	`subagent`
12	LoopDetectionMiddleware	✓		✓	✓	✓		✓	✗	始终开启
13	ClarificationMiddleware						✓	✓	✗	始终最后

主 agent 14 个 middleware（make_lead_agent），subagent 4 个（ThreadData、Sandbox、Guardrail、ToolErrorHandling）。create_deerflow_agent Phase 1 实现 13 个（Guardrail 仅支持自定义实例，无内置默认）。

执行流程

LangChain create_agent 的规则：

before_* 正序执行（列表位置 0 → N）
after_* 反序执行（列表位置 N → 0）

graph TB
    START(["invoke"]) --> TD

    subgraph BA ["<b>before_agent</b> 正序 0→N"]
        direction TB
        TD["[0] ThreadData<br/>创建线程目录"] --> UL["[1] Uploads<br/>扫描上传文件"] --> SB["[2] Sandbox<br/>获取沙箱"] --> LD_BA["[12] LoopDetection<br/>清理 stale warning"]
    end

    subgraph BM ["<b>before_model</b> 正序 0→N"]
        direction TB
        VI["[10] ViewImage<br/>注入图片 base64"]
    end

    subgraph WM ["<b>wrap_model_call</b>"]
        direction TB
        DTC_WM["[3] DanglingToolCall<br/>补悬空 ToolMessage"] --> LD_WM["[12] LoopDetection<br/>注入当前 run warning"]
    end

    LD_BA --> VI
    VI --> DTC_WM
    LD_WM --> M["<b>MODEL</b>"]

    subgraph AM ["<b>after_model</b> 反序 N→0"]
        direction TB
        LD["[12] LoopDetection<br/>检测循环/排队 warning"] --> SL["[11] SubagentLimit<br/>截断多余 task"] --> TI["[8] Title<br/>生成标题"]
    end

    M --> LD

    subgraph AA ["<b>after_agent</b> 反序 N→0"]
        direction TB
        LD_CLEAN["[12] LoopDetection<br/>清理 pending warning"] --> MEM["[9] Memory<br/>入队记忆"] --> SBR["[2] Sandbox<br/>释放沙箱"]
    end

    TI --> LD_CLEAN
    SBR --> END(["response"])

    classDef beforeNode fill:#a0a8b5,stroke:#636b7a,color:#2d3239
    classDef modelNode fill:#b5a8a0,stroke:#7a6b63,color:#2d3239
    classDef wrapModelNode fill:#a8a0b5,stroke:#6b637a,color:#2d3239
    classDef afterModelNode fill:#b5a0a8,stroke:#7a636b,color:#2d3239
    classDef afterAgentNode fill:#a0b5a8,stroke:#637a6b,color:#2d3239
    classDef terminalNode fill:#a8b5a0,stroke:#6b7a63,color:#2d3239

    class TD,UL,SB,LD_BA,VI beforeNode
    class DTC_WM,LD_WM wrapModelNode
    class M modelNode
    class LD,SL,TI afterModelNode
    class LD_CLEAN,SBR,MEM afterAgentNode
    class START,END terminalNode

时序图

sequenceDiagram
    participant U as User
    participant TD as ThreadDataMiddleware
    participant UL as UploadsMiddleware
    participant SB as SandboxMiddleware
    participant LD as LoopDetectionMiddleware
    participant VI as ViewImageMiddleware
    participant DTC as DanglingToolCallMiddleware
    participant M as MODEL
    participant SL as SubagentLimitMiddleware
    participant TI as TitleMiddleware
    participant MEM as MemoryMiddleware

    U ->> TD: invoke
    activate TD
    Note right of TD: before_agent 创建目录

    TD ->> UL: before_agent
    activate UL
    Note right of UL: before_agent 扫描上传文件

    UL ->> SB: before_agent
    activate SB
    Note right of SB: before_agent 获取沙箱

    SB ->> LD: before_agent
    activate LD
    Note right of LD: before_agent 清理同 thread 旧 run 的 pending warning
    LD ->> VI: before_model
    activate VI
    Note right of VI: before_model 注入图片 base64

    VI ->> DTC: wrap_model_call
    activate DTC
    Note right of DTC: wrap_model_call 补悬空 ToolMessage
    DTC ->> LD: wrap_model_call
    Note right of LD: wrap_model_call drain 当前 run warning 并追加到末尾
    LD ->> M: messages + tools
    activate M
    M -->> LD: AI response
    deactivate M

    Note right of LD: after_model 检测循环；warning 入队，hard-stop 清 tool_calls
    LD -->> SL: after_model
    deactivate LD

    activate SL
    Note right of SL: after_model 截断多余 task
    SL -->> TI: after_model
    deactivate SL

    activate TI
    Note right of TI: after_model 生成标题
    TI -->> DTC: done
    deactivate TI

    deactivate DTC

    VI -->> SB: done
    deactivate VI

    Note right of LD: after_agent 清理当前 run 未消费 warning

    Note right of MEM: after_agent 入队记忆

    Note right of SB: after_agent 释放沙箱
    SB -->> UL: done
    deactivate SB

    UL -->> TD: done
    deactivate UL

    TD -->> U: response
    deactivate TD

洋葱模型

列表位置决定在洋葱中的层级 — 位置 0 最外层，位置 N 最内层：

进入 before_*：   [0] → [1] → [2] → ... → [10] → MODEL
退出 after_*：    MODEL → [13] → [11] → ... → [6] → [3] → [2] → [0]
                          ↑ 最内层最先执行

[!important] 核心规则列表最后的 middleware，其 after_model 最先执行。 ClarificationMiddleware 在列表末尾，所以它第一个拦截 model 输出。

对比：真正的洋葱 vs DeerFlow 的实际情况

真正的洋葱（如 Koa/Express）

每个 middleware 同时负责 before 和 after，形成对称嵌套：

sequenceDiagram
    participant U as User
    participant A as AuthMiddleware
    participant L as LogMiddleware
    participant R as RateLimitMiddleware
    participant H as Handler

    U ->> A: request
    activate A
    Note right of A: before: 校验 token

    A ->> L: next()
    activate L
    Note right of L: before: 记录请求时间

    L ->> R: next()
    activate R
    Note right of R: before: 检查频率

    R ->> H: next()
    activate H
    H -->> R: result
    deactivate H

    Note right of R: after: 更新计数器
    R -->> L: result
    deactivate R

    Note right of L: after: 记录耗时
    L -->> A: result
    deactivate L

    Note right of A: after: 清理上下文
    A -->> U: response
    deactivate A

[!tip] 洋葱特征每个 middleware 都有 before/after 对称操作，activate 跨越整个内层执行，形成完美嵌套。

DeerFlow 的实际情况

不是洋葱，是管道。大部分 middleware 只用一个钩子，不存在对称嵌套。多轮对话时 before_model / after_model 循环执行：

sequenceDiagram
    participant U as User
    participant TD as ThreadData
    participant UL as Uploads
    participant SB as Sandbox
    participant LD as LoopDetection
    participant VI as ViewImage
    participant DTC as DanglingToolCall
    participant M as MODEL
    participant SL as SubagentLimit
    participant TI as Title
    participant MEM as Memory

    U ->> TD: invoke
    Note right of TD: before_agent 创建目录
    TD ->> UL: .
    Note right of UL: before_agent 扫描文件
    UL ->> SB: .
    Note right of SB: before_agent 获取沙箱
    SB ->> LD: .
    Note right of LD: before_agent 清理 stale pending warning

    loop 每轮对话（tool call 循环）
        SB ->> VI: .
        Note right of VI: before_model 注入图片
        VI ->> DTC: .
        Note right of DTC: wrap_model_call 补悬空工具结果
        DTC ->> LD: .
        Note right of LD: wrap_model_call 注入当前 run warning
        LD ->> M: messages + tools
        M -->> LD: AI response
        Note right of LD: after_model 检测循环/排队 warning
        LD -->> SL: .
        Note right of SL: after_model 截断多余 task
        SL -->> TI: .
        Note right of TI: after_model 生成标题
    end

    Note right of LD: after_agent 清理当前 run pending warning
    LD -->> MEM: .
    Note right of MEM: after_agent 入队记忆
    MEM -->> SB: .
    Note right of SB: after_agent 释放沙箱
    SB -->> U: response

[!warning] 不是洋葱大部分 middleware 只用一个阶段。SandboxMiddleware 使用 before_agent/after_agent 做资源获取/释放；LoopDetectionMiddleware 也使用这两个钩子，但用途是清理 run-scoped pending warnings，不是资源生命周期对称。before_agent / after_agent 只跑一次，before_model / after_model / wrap_model_call 每轮循环都跑。

硬依赖只有 2 处：

ThreadData 在 Sandbox 之前 — sandbox 需要线程目录
Clarification 在列表最后 — wrap_tool_call 处理 ask_clarification 时优先拦截，并通过 Command(goto=END) 中断执行

结论

	真正的洋葱	DeerFlow 实际
每个 middleware	before + after 对称	大多只用一个钩子
激活条	嵌套（外长内短）	不嵌套（串行）
反序的意义	清理与初始化配对	影响 `after_model` / `after_agent` 的执行优先级
典型例子	Auth: 校验 token / 清理上下文	ThreadData: 只创建目录，没有清理

关键设计点

ClarificationMiddleware 为什么在列表最后？

位置最后使它在工具调用包装链中优先拦截 ask_clarification。如果命中，它返回 Command(goto=END)，把格式化后的澄清问题写成 ToolMessage 并中断执行。

SandboxMiddleware 的对称性

before_agent（正序第 3 个）获取沙箱，after_agent（反序第 1 个）释放沙箱。外层进入 → 外层退出，天然的洋葱对称。

LoopDetectionMiddleware 为什么同时用多个钩子？

after_model 只做检测：重复工具调用达到 warning 阈值时，把 warning 放入 (thread_id, run_id) 作用域的 pending 队列。真正注入发生在下一次 wrap_model_call：此时上一轮 AIMessage(tool_calls) 对应的 ToolMessage 已经在请求里，warning 追加在末尾，不会破坏 OpenAI/Moonshot 的 tool-call pairing。before_agent 清理同一 thread 下旧 run 的残留 warning，after_agent 清理当前 run 没被消费的 warning。

11 KiB Raw Blame History Unescape Escape