Files

T

Yi Tang 48e038f752 feat(channels): enhance Discord with mention-only mode, thread routing, and typing indicators (#2842 )

* feat(channels): enhance Discord with mention-only mode, thread routing, and typing indicators

Add mention_only config to only respond when bot is mentioned, with
allowed_channels override. Add thread_mode for Hermes-style auto-thread
creation. Add periodic typing indicators while bot is processing.

* fix(discord): include allowed_channels in mention_only skip condition (line 274)

* docs: fix Discord config example to match boolean thread_mode implementation

* style: format with ruff

* fix(discord): apply Copilot review fixes and resolve lint errors

- Remove unused Optional import
- Fix thread_ts type hints to str | None
- Fix has_mention logic for None values
- Implement thread_mode fallback to channel replies on thread creation failure
- Fix thread_mode docstring alignment
- Fix allowed_channels comment formatting in config.example.yaml

* fix(discord): reset context for orphaned threads in mention_only mode

When a message arrives in a thread not tracked by _active_threads,
clear thread_id and typing_target so the message falls through to
the standard channel handling pipeline, which creates a fresh thread
instead of incorrectly routing to the stale thread.

* fix(discord): create new thread on @ when channel has existing tracked thread

When mention_only is enabled and a user @-s the bot in a channel
that already has a tracked thread, create a new thread instead of
incorrectly routing to the old one.

* fix(discord): allow no-@ thread replies while skipping no-@ channel messages

The skip block for no-@ messages was too aggressive — it blocked
continuation replies within tracked threads AND incorrectly routed
no-@ channel messages to the existing thread.

Now:
- Thread message, no @ → routed to existing tracked thread
- Channel message, no @ → skipped
- Channel message, with @ → creates new thread

* feat(discord): add checkmark reaction to acknowledge received messages

* Move discord.py to optional dependency and auto-detect from config.yaml

- Add discord extra to [project.optional-dependencies] in pyproject.toml
- Update detect_uv_extras.py to map channels.discord.enabled: true -> --extra discord
- Set UV_EXTRAS=discord in docker-compose-dev.yaml gateway env

* fix(discord): persist thread-channel mappings to store for recovery after restart

Discord's _active_threads dict was purely in-memory, so all channel-to-thread
mappings were lost on server restart. This fix bridges ChannelStore into
DiscordChannel:

- Save thread mappings to store.json after every thread creation
- Restore active threads from store on DiscordChannel startup
- Pass channel_store to all channels via service.py config injection

Store keys follow the pattern: discord:<channel_id>:<thread_id>

* fix(discord): address Copilot review — fix types, typing targets, cross-thread safety, and config comments

* fix(tests): add multitask_strategy param to mock for clarification follow-up test

* fix(tests): explicitly set model_name=None for title middleware test isolation

* fix(discord): use trigger_typing() instead of typing() for typing indicators

discord.py 2.x TextChannel.typing() and Thread.typing() are async context
managers, not one-shot coroutines. Use trigger_typing() for periodic
typing indicator pings.

* fix(discord): cancel typing tasks on channel shutdown

Prevents 'Task was destroyed but it is pending' warnings when the
Discord client stops while typing indicator loops are still running.

* fix(scripts): detect nested YAML config for discord extra

section_value() only matched top-level YAML sections. Added
nested_section_value() that handles two-level nesting (e.g.,
channels.discord.enabled), so auto-detection of the discord
extra works when config uses the standard nested format.

* fix(docker): remove hard-coded UV_EXTRAS=discord from dev compose

Relies on auto-detection via detect_uv_extras.py instead of forcing
discord.py install even when channels.discord.enabled is false.
Matches production docker-compose.yaml behavior (UV_EXTRAS:-).

* refactor(nginx): move proxy_buffering/proxy_cache to server level

DRY cleanup — these directives were repeated in 14 location blocks.
Set at server level once, reducing duplication and risk of drift.

* fix(discord): use dedicated JSON file for thread persistence

Replace ChannelStore usage for Discord thread-ID persistence with a
dedicated discord_threads.json file. ChannelStore is designed to map
IM conversations to DeerFlow thread IDs — using it to persist Discord
thread IDs was semantically wrong and confusing.

Changes:
- _save_thread() now reads/writes a simple {channel_id: thread_id} JSON dict
- _load_active_threads() reads directly from the JSON file
- File path derived from ChannelStore directory (when available) or
  defaults to ~/.deer-flow/channels/discord_threads.json
- Removed unused ChannelStore import

* fix(discord): address WillemJiang's code review comments on PR #2842

1. Remove semantically incorrect message_in_thread variable. At this code
   point (after the Thread case is handled above), we're guaranteed to be in
   a channel, not a thread. Always apply mention_only check here.

2. Add _active_thread_ids reverse-lookup set for O(1) thread ID membership
   checks instead of O(n) scan of _active_threads.values(). Keep the set
   in sync with _active_threads in _load_active_threads() and _save_thread().

3. Add _thread_store_lock (threading.Lock) to protect _active_threads and
   the JSON file from concurrent access between the Discord loop thread
   (_run_client) and the main thread (_load_active_threads, _save_thread).

2026-05-15 22:30:05 +08:00

.vscode

chore: specify project title

2026-01-14 09:57:52 +08:00

app

feat(channels): enhance Discord with mention-only mode, thread routing, and typing indicators (#2842 )

2026-05-15 22:30:05 +08:00

docs

docs: clarify LangGraph compatibility entrypoints (#2914 )

2026-05-12 23:15:11 +08:00

packages/harness

fix(middleware): Prevent todo completion reminder IMMessage leak (#2907 )

2026-05-15 22:12:37 +08:00

scripts

feat(agent): add custom-agent self-updates with user isolation (#2713 )

2026-05-05 23:17:42 +08:00

tests

feat(channels): enhance Discord with mention-only mode, thread routing, and typing indicators (#2842 )

2026-05-15 22:30:05 +08:00

.gitignore

feat: add DeerFlowClient for embedded programmatic access (#926 )

2026-02-28 14:38:15 +08:00

.python-version

chore: add Python and LangGraph stuff

2026-01-14 07:15:02 +08:00

AGENTS.md

docs: fix typo and grammar issues in docs (#1315 )

2026-03-25 10:01:36 +08:00

CLAUDE.md

feat: stream subagent token usage to header via terminal task events (#2882 )

2026-05-13 23:52:19 +08:00

CONTRIBUTING.md

docs: align runtime docs with gateway mode (#2868 )

2026-05-12 16:19:21 +08:00

debug.py

feat(debug): print presented file paths with physical resolution (#2825 )

2026-05-09 18:21:01 +08:00

Dockerfile

fix(docker): set UTF-8 locale to prevent ASCII encoding errors in minimal containers (#2707 )

2026-05-04 09:41:10 +08:00

langgraph.json

fix: resolve make dev and test-e2e errors (#2570 )

2026-04-26 17:27:32 +08:00

Makefile

Refactor DeerFlow to use Gateway's LangGraph-compatible API

2026-04-26 20:38:34 +08:00

pyproject.toml

feat(channels): enhance Discord with mention-only mode, thread routing, and typing indicators (#2842 )

2026-05-15 22:30:05 +08:00

README.md

docs: clarify LangGraph compatibility entrypoints (#2914 )

2026-05-12 23:15:11 +08:00

ruff.toml

refactor: split backend into harness (deerflow.*) and app (app.*) (#1131 )

2026-03-14 22:55:52 +08:00

uv.lock

feat(channels): enhance Discord with mention-only mode, thread routing, and typing indicators (#2842 )

2026-05-15 22:30:05 +08:00

README.md

DeerFlow Backend

DeerFlow is a LangGraph-based AI super agent with sandbox execution, persistent memory, and extensible tool integration. The backend enables AI agents to execute code, browse the web, manage files, delegate tasks to subagents, and retain context across conversations - all in isolated, per-thread environments.

Architecture

                        ┌──────────────────────────────────────┐
                        │          Nginx (Port 2026)           │
                        │      Unified reverse proxy           │
                        └───────┬──────────────────┬───────────┘
                                │
            /api/langgraph/*    │    /api/* (other)
            rewritten to /api/* │
                                ▼
               ┌────────────────────────────────────────┐
               │        Gateway API (8001)              │
               │        FastAPI REST + agent runtime    │
               │                                        │
               │ Models, MCP, Skills, Memory, Uploads,  │
               │ Artifacts, Threads, Runs, Streaming    │
               │                                        │
               │ ┌────────────────────────────────────┐ │
               │ │ Lead Agent                         │ │
               │ │ Middleware Chain, Tools, Subagents │ │
               │ └────────────────────────────────────┘ │
               └────────────────────────────────────────┘

Request Routing (via Nginx):

/api/langgraph/* → Gateway LangGraph-compatible API - agent interactions, threads, streaming
/api/* (other) → Gateway API - models, MCP, skills, memory, artifacts, uploads, thread-local cleanup
/ (non-API) → Frontend - Next.js web interface

Core Components

Lead Agent

The single LangGraph agent (lead_agent) is the runtime entry point, created via make_lead_agent(config). It combines:

Dynamic model selection with thinking and vision support
Middleware chain for cross-cutting concerns (9 middlewares)
Tool system with sandbox, MCP, community, and built-in tools
Subagent delegation for parallel task execution
System prompt with skills injection, memory context, and working directory guidance

Middleware Chain

Middlewares execute in strict order, each handling a specific concern:

#	Middleware	Purpose
1	ThreadDataMiddleware	Creates per-thread isolated directories (workspace, uploads, outputs)
2	UploadsMiddleware	Injects newly uploaded files into conversation context
3	SandboxMiddleware	Acquires sandbox environment for code execution
4	SummarizationMiddleware	Reduces context when approaching token limits (optional)
5	TodoListMiddleware	Tracks multi-step tasks in plan mode (optional)
6	TitleMiddleware	Auto-generates conversation titles after first exchange
7	MemoryMiddleware	Queues conversations for async memory extraction
8	ViewImageMiddleware	Injects image data for vision-capable models (conditional)
9	ClarificationMiddleware	Intercepts clarification requests and interrupts execution (must be last)

Sandbox System

Per-thread isolated execution with virtual path translation:

Abstract interface: execute_command, read_file, write_file, list_dir
Providers: LocalSandboxProvider (filesystem) and AioSandboxProvider (Docker, in community/)
Virtual paths: /mnt/user-data/{workspace,uploads,outputs} → thread-specific physical directories
Skills path: /mnt/skills → deer-flow/skills/ directory
Skills loading: Recursively discovers nested SKILL.md files under skills/{public,custom} and preserves nested container paths
File-write safety: str_replace serializes read-modify-write per (sandbox.id, path) so isolated sandboxes keep concurrency even when virtual paths match
Tools: bash, ls, read_file, write_file, str_replace (write_file overwrites by default and exposes append for end-of-file writes; bash is disabled by default when using LocalSandboxProvider; use AioSandboxProvider for isolated shell access)

Subagent System

Async task delegation with concurrent execution:

Built-in agents: general-purpose (full toolset) and bash (command specialist, exposed only when shell access is available)
Concurrency: Max 3 subagents per turn, 15-minute timeout
Execution: Background thread pools with status tracking and SSE events
Flow: Agent calls task() tool → executor runs subagent in background → polls for completion → returns result

Memory System

LLM-powered persistent context retention across conversations:

Automatic extraction: Analyzes conversations for user context, facts, and preferences
Structured storage: User context (work, personal, top-of-mind), history, and confidence-scored facts
Debounced updates: Batches updates to minimize LLM calls (configurable wait time)
System prompt injection: Top facts + context injected into agent prompts
Storage: JSON file with mtime-based cache invalidation

Tool Ecosystem

Category	Tools
Sandbox	`bash`, `ls`, `read_file`, `write_file`, `str_replace`
Built-in	`present_files`, `ask_clarification`, `view_image`, `task` (subagent)
Community	Tavily (web search), Jina AI (web fetch), Firecrawl (scraping), DuckDuckGo (image search)
MCP	Any Model Context Protocol server (stdio, SSE, HTTP transports)
Skills	Domain-specific workflows injected via system prompt

Gateway API

FastAPI application providing REST endpoints for frontend integration:

Route	Purpose
`GET /api/models`	List available LLM models
`GET/PUT /api/mcp/config`	Manage MCP server configurations
`GET/PUT /api/skills`	List and manage skills
`POST /api/skills/install`	Install skill from `.skill` archive
`GET /api/memory`	Retrieve memory data
`POST /api/memory/reload`	Force memory reload
`GET /api/memory/config`	Memory configuration
`GET /api/memory/status`	Combined config + data
`POST /api/threads/{id}/uploads`	Upload files (auto-converts PDF/PPT/Excel/Word to Markdown, rejects directory paths, auto-renames duplicate filenames in one request)
`GET /api/threads/{id}/uploads/list`	List uploaded files
`DELETE /api/threads/{id}`	Delete DeerFlow-managed local thread data after LangGraph thread deletion; unexpected failures are logged server-side and return a generic 500 detail
`GET /api/threads/{id}/artifacts/{path}`	Serve generated artifacts

IM Channels

The IM bridge supports Feishu, Slack, and Telegram. Slack and Telegram still use the final runs.wait() response path, while Feishu now streams through runs.stream(["messages-tuple", "values"]) and updates a single in-thread card in place.

For Feishu card updates, DeerFlow stores the running card's message_id per inbound message and patches that same card until the run finishes, preserving the existing OK / DONE reaction flow.

Quick Start

Prerequisites

Python 3.12+
uv package manager
API keys for your chosen LLM provider

Installation

cd deer-flow

# Copy configuration files
cp config.example.yaml config.yaml

# Install backend dependencies
cd backend
make install

Configuration

Edit config.yaml in the project root:

models:
  - name: gpt-4o
    display_name: GPT-4o
    use: langchain_openai:ChatOpenAI
    model: gpt-4o
    api_key: $OPENAI_API_KEY
    supports_thinking: false
    supports_vision: true

  - name: gpt-5-responses
    display_name: GPT-5 (Responses API)
    use: langchain_openai:ChatOpenAI
    model: gpt-5
    api_key: $OPENAI_API_KEY
    use_responses_api: true
    output_version: responses/v1
    supports_vision: true

Set your API keys:

export OPENAI_API_KEY="your-api-key-here"

Running

Full Application (from project root):

make dev  # Starts Gateway + Frontend + Nginx

Access at: http://localhost:2026

Backend Only (from backend directory):

# Gateway API + embedded agent runtime
make dev

Direct access: Gateway at http://localhost:8001

Project Structure

backend/
├── src/
│   ├── agents/                  # Agent system
│   │   ├── lead_agent/         # Main agent (factory, prompts)
│   │   ├── middlewares/        # 9 middleware components
│   │   ├── memory/             # Memory extraction & storage
│   │   └── thread_state.py    # ThreadState schema
│   ├── gateway/                # FastAPI Gateway API
│   │   ├── app.py             # Application setup
│   │   └── routers/           # 6 route modules
│   ├── sandbox/                # Sandbox execution
│   │   ├── local/             # Local filesystem provider
│   │   ├── sandbox.py         # Abstract interface
│   │   ├── tools.py           # bash, ls, read/write/str_replace
│   │   └── middleware.py      # Sandbox lifecycle
│   ├── subagents/              # Subagent delegation
│   │   ├── builtins/          # general-purpose, bash agents
│   │   ├── executor.py        # Background execution engine
│   │   └── registry.py        # Agent registry
│   ├── tools/builtins/         # Built-in tools
│   ├── mcp/                    # MCP protocol integration
│   ├── models/                 # Model factory
│   ├── skills/                 # Skill discovery & loading
│   ├── config/                 # Configuration system
│   ├── community/              # Community tools & providers
│   ├── reflection/             # Dynamic module loading
│   └── utils/                  # Utilities
├── docs/                       # Documentation
├── tests/                      # Test suite
├── langgraph.json              # LangGraph graph registry for tooling/Studio compatibility
├── pyproject.toml              # Python dependencies
├── Makefile                    # Development commands
└── Dockerfile                  # Container build

langgraph.json is not the default service entrypoint. The scripts and Docker deployments run the Gateway embedded runtime; the file is kept for LangGraph tooling, Studio, or direct LangGraph Server compatibility.

Configuration

Main Configuration (`config.yaml`)

Place in project root. Config values starting with $ resolve as environment variables.

Key sections:

models - LLM configurations with class paths, API keys, thinking/vision flags
tools - Tool definitions with module paths and groups
tool_groups - Logical tool groupings
sandbox - Execution environment provider
skills - Skills directory paths
title - Auto-title generation settings
summarization - Context summarization settings
subagents - Subagent system (enabled/disabled)
memory - Memory system settings (enabled, storage, debounce, facts limits)

Provider note:

models[*].use references provider classes by module path (for example langchain_openai:ChatOpenAI).
If a provider module is missing, DeerFlow now returns an actionable error with install guidance (for example uv add langchain-google-genai).

Extensions Configuration (`extensions_config.json`)

MCP servers and skill states in a single file:

{
  "mcpServers": {
    "github": {
      "enabled": true,
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-github"],
      "env": {"GITHUB_TOKEN": "$GITHUB_TOKEN"}
    },
    "secure-http": {
      "enabled": true,
      "type": "http",
      "url": "https://api.example.com/mcp",
      "oauth": {
        "enabled": true,
        "token_url": "https://auth.example.com/oauth/token",
        "grant_type": "client_credentials",
        "client_id": "$MCP_OAUTH_CLIENT_ID",
        "client_secret": "$MCP_OAUTH_CLIENT_SECRET"
      }
    }
  },
  "skills": {
    "pdf-processing": {"enabled": true}
  }
}

Environment Variables

DEER_FLOW_CONFIG_PATH - Override config.yaml location
DEER_FLOW_EXTENSIONS_CONFIG_PATH - Override extensions_config.json location
Model API keys: OPENAI_API_KEY, ANTHROPIC_API_KEY, DEEPSEEK_API_KEY, etc.
Tool API keys: TAVILY_API_KEY, GITHUB_TOKEN, etc.

LangSmith Tracing

DeerFlow has built-in LangSmith integration for observability. When enabled, all LLM calls, agent runs, tool executions, and middleware processing are traced and visible in the LangSmith dashboard.

Setup:

Sign up at smith.langchain.com and create a project.
Add the following to your .env file in the project root:

LANGSMITH_TRACING=true
LANGSMITH_ENDPOINT=https://api.smith.langchain.com
LANGSMITH_API_KEY=lsv2_pt_xxxxxxxxxxxxxxxx
LANGSMITH_PROJECT=xxx

Legacy variables: The LANGCHAIN_TRACING_V2, LANGCHAIN_API_KEY, LANGCHAIN_PROJECT, and LANGCHAIN_ENDPOINT variables are also supported for backward compatibility. LANGSMITH_* variables take precedence when both are set.

Langfuse Tracing

DeerFlow also supports Langfuse observability for LangChain-compatible runs.

Add the following to your .env file:

LANGFUSE_TRACING=true
LANGFUSE_PUBLIC_KEY=pk-lf-xxxxxxxxxxxxxxxx
LANGFUSE_SECRET_KEY=sk-lf-xxxxxxxxxxxxxxxx
LANGFUSE_BASE_URL=https://cloud.langfuse.com

If you are using a self-hosted Langfuse deployment, set LANGFUSE_BASE_URL to your Langfuse host.

Dual Provider Behavior

If both LangSmith and Langfuse are enabled, DeerFlow initializes and attaches both callbacks so the same run data is reported to both systems.

If a provider is explicitly enabled but required credentials are missing, or the provider callback cannot be initialized, DeerFlow raises an error when tracing is initialized during model creation instead of silently disabling tracing.

Docker: In docker-compose.yaml, tracing is disabled by default (LANGSMITH_TRACING=false). Set LANGSMITH_TRACING=true and/or LANGFUSE_TRACING=true in your .env, together with the required credentials, to enable tracing in containerized deployments.

Development

Commands

make install    # Install dependencies
make dev        # Run Gateway API + embedded agent runtime (port 8001)
make gateway    # Run Gateway API without reload (port 8001)
make lint       # Run linter (ruff)
make format     # Format code (ruff)

Code Style

Linter/Formatter: ruff
Line length: 240 characters
Python: 3.12+ with type hints
Quotes: Double quotes
Indentation: 4 spaces

Testing

uv run pytest

Technology Stack

LangGraph (1.0.6+) - Agent framework and multi-agent orchestration
LangChain (1.2.3+) - LLM abstractions and tool system
FastAPI (0.115.0+) - Gateway REST API
langchain-mcp-adapters - Model Context Protocol support
agent-sandbox - Sandboxed code execution
markitdown - Multi-format document conversion
tavily-python / firecrawl-py - Web search and scraping

README.md

DeerFlow Backend

Architecture

Core Components

Lead Agent

Middleware Chain

Sandbox System

Subagent System

Memory System

Tool Ecosystem

Gateway API

IM Channels

Quick Start

Prerequisites

Installation

Configuration

Running

Project Structure

Configuration

Main Configuration (`config.yaml`)

Extensions Configuration (`extensions_config.json`)

Environment Variables

LangSmith Tracing

Langfuse Tracing

Dual Provider Behavior

Development

Commands

Code Style

Testing

Technology Stack

Documentation

License

Contributing

README.md

DeerFlow Backend

Architecture

Core Components

Lead Agent

Middleware Chain

Sandbox System

Subagent System

Memory System

Tool Ecosystem

Gateway API

IM Channels

Quick Start

Prerequisites

Installation

Configuration

Running

Project Structure

Configuration

Main Configuration (config.yaml)

Extensions Configuration (extensions_config.json)

Environment Variables

LangSmith Tracing

Langfuse Tracing

Dual Provider Behavior

Development

Commands

Code Style

Testing

Technology Stack

Documentation

License

Contributing

Main Configuration (`config.yaml`)

Extensions Configuration (`extensions_config.json`)