We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.
2026-05-31
Excerpt — This comprehensive survey explores how classical software design patterns can improve communication reliability and scalability in LLM-driven multi-agent systems. It focuses on the Model Context Protocol (MCP), tracing…
2026-05-29
Excerpt — NaviAgent tackles the challenge of coordinating hundreds or thousands of external tools invoked by LLM agents. By leveraging graph-driven bilevel planning, it offers a scalable orchestration method that reduces error…
2026-05-30
Excerpt — This paper exposes vulnerabilities in MCP where tool descriptors—treated as trusted metadata—can be exploited to manipulate LLM reasoning. It highlights a critical security gap in tool-augmented LLMs and proposes…
2026-05-25
Excerpt — ADR is the first large-scale, production-proven framework designed to detect and respond to security threats in agentic AI operating via MCP. It addresses limited observability and the challenge of tracing causal chains…
2026-05-25
Excerpt — LivePI presents a benchmark for evaluating AI agents’ resilience to indirect prompt injection (IPI) attacks via untrusted inputs like emails or downloaded files. It moves beyond small-scale or simulated evaluations to…
2026-05-27
Excerpt — SafeGPT introduces a two-sided guardrail system combining input-side detection/redaction with output-side moderation and reframing to prevent sensitive data leakage and unethical content generation in enterprise…
2026-05-27
Excerpt — This red-teaming study probes the security of Google’s Agent Payments Protocol (AP2), revealing how prompt injection can compromise agent-led financial transactions despite cryptographic safeguards. **Why read it?**…
2026-05-30
Excerpt — This paper identifies a structural failure in diagnosing agent misconduct: memory-layer attacks can mimic model misalignment, leading to misapplied fixes. It formalizes Semantic Norm Drift (SND) as a distinct cause of…
2026-05-26
Excerpt — Moving beyond simple content blocking, this work proposes a control-theoretic framework for AI guardrails that preempt downstream harms like financial or physical damage through dynamic intervention strategies. **Why…
2026-05-26
Excerpt — ESLD introduces a latent-space defense architecture to detect and mitigate malicious inputs from multiple sources before they influence agent reasoning, reducing prompt injection risks. **Why read it?** Engineers…
2026-05-29
Excerpt — CUGA’s policy-as-code system enables enterprises to specify allowed actions, human oversight triggers, and information exposure rules modularly—without rebuilding agents per domain. **Why read it?** This approach…
2026-05-28
Excerpt — Addressing the similarity-utility gap in Retrieval-Augmented Generation pipelines, this paper proposes a two-phase retrieval method to extract actionable evidence from complex financial documents. **Why read it?** It…