2026年3月26日ResearchInfrastructureMCPOpen Source

T-MAP: Red-Teaming Framework for LLM Agents Targeting MCP Ecosystem

T-MAP (Trajectory-aware MAP) is a new research framework from KAIST for red-teaming LLM agents, specifically targeting vulnerabilities in the rapidly growing MCP (Model Context Protocol) ecosystem. The paper has received 29 upvotes on HuggingFace Daily Papers.

While prior red-teaming focused on eliciting harmful text from LLMs, T-MAP addresses agent-specific vulnerabilities that emerge through multi-step tool execution. It uses execution trajectories to guide the discovery of adversarial prompts that bypass safety guardrails and realize harmful objectives through actual tool interactions.

The method maintains a multidimensional archive spanning risk categories and attack styles, using a four-step iterative cycle: Cross-Diagnosis extracts success factors and failure causes from past prompts, which combined with a learned Tool Call Graph (TCG), guides mutation of new attack prompts.

Empirical evaluations across diverse MCP environments show T-MAP substantially outperforms baselines in attack realization rate and remains effective against frontier models including GPT-5.2, Gemini-3-Pro, and Qwen3.5. This is timely during RSAC 2026 week, as agent security becomes the industry's top priority.

https://arxiv.org/abs/2603.22341
https://github.com/pwnhyo/T-MAP
← 上一篇
Dexter:自主金融研究 Agent 登上 GitHub Trending
下一篇 →
T-MAP:针对 MCP 生态的 LLM Agent 红队测试框架
← 返回所有文章

评论

加载中...
>_