ArXiv AI: Weekly Top Picks

1766007197438

Coverage: 2026-04-26 → 2026-05-03

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – Toward a Safe Internet of Agents

2026-05-03 · 10 min · 11.8 MB

Excerpt — Autonomous Artificial Intelligence (AI) agents, powered by Large Language Models (LLMs), advance rapidly toward interconnected systems -- an Internet of Agents (IoA). This vision enables complex problem-solving while…

LLM Daily – Toward a Safe Internet of Agents

📝 Article 📄 PDF

LLM Daily – AgentVisor: Defending LLM Agents Against Prompt Injection via Semantic Virtualiz

2026-05-02 · 10 min · 11.9 MB

Excerpt — Large Language Model (LLM) agents are increasingly used to automate complex workflows, but integrating untrusted external data with privileged execution exposes them to severe security risks, particularly direct and…

LLM Daily – AgentVisor: Defending LLM Agents Against Prompt Injection via Semantic Virtualiz

📝 Article 📄 PDF

LLM Daily – Evaluation of Prompt Injection Defenses in Large Language Models

2026-05-02 · 10 min · 11.9 MB

Excerpt — LLM-powered applications routinely embed secrets in system prompts, yet models can be tricked into revealing them. We built an adaptive attacker that evolves its strategies over hundreds of rounds and tested it against…

LLM Daily – Evaluation of Prompt Injection Defenses in Large Language Models

📝 Article 📄 PDF

LLM Daily – Transient Turn Injection: Exposing Stateless Multi-Turn Vulnerabilities in Large

2026-05-01 · 10 min · 14.5 MB

Excerpt — Large language models (LLMs) are increasingly integrated into sensitive workflows, raising the stakes for adversarial robustness and safety. This paper introduces Transient Turn Injection(TTI), a new multi-turn attack…

LLM Daily – Transient Turn Injection: Exposing Stateless Multi-Turn Vulnerabilities in Large

📝 Article 📄 PDF

LLM Daily – AgentEval: DAG-Structured Step-Level Evaluation for Agentic Workflows with Error

2026-05-01 · 10 min · 13.7 MB

Excerpt — Agentic systems that chain reasoning, tool use, and synthesis into multi-step workflows are entering production, yet prevailing evaluation practices like end-to-end outcome checks and ad-hoc trace inspection…

LLM Daily – AgentEval: DAG-Structured Step-Level Evaluation for Agentic Workflows with Error

📝 Article 📄 PDF

LLM Daily – Tool Attention Is All You Need: Dynamic Tool Gating and Lazy Schema Loading for

2026-04-30 · 10 min · 13.1 MB

Excerpt — The Model Context Protocol (MCP) has become a common interface for connecting large language model (LLM) agents to external tools, but its reliance on stateless, eager schema injection imposes a hidden per-turn overhead…

📝 Article 📄 PDF

LLM Daily – Breaking MCP with Function Hijacking Attacks: Novel Threats for Function Calling

2026-04-29 · 10 min · 12.0 MB

Excerpt — The growth of agentic AI has drawn significant attention to function calling Large Language Models (LLMs), which are designed to extend the capabilities of AI-powered system by invoking external functions. Injection and…

LLM Daily – Breaking MCP with Function Hijacking Attacks: Novel Threats for Function Calling

📝 Article 📄 PDF

LLM Daily – The Last Harness You'll Ever Build

2026-04-29 · 10 min · 11.8 MB

Excerpt — AI agents are increasingly deployed on complex, domain-specific workflows -- navigating enterprise web applications that require dozens of clicks and form fills, orchestrating multi-step research pipelines that span…

LLM Daily – The Last Harness You'll Ever Build

📝 Article 📄 PDF

LLM Daily – Explainable AML Triage with LLMs: Evidence Retrieval and Counterfactual Checks

2026-04-29 · 10 min · 8.1 MB

Excerpt — Anti-money laundering (AML) transaction monitoring generates large volumes of alerts that must be rapidly triaged by investigators under strict audit and governance constraints. While large language models (LLMs) can…

LLM Daily – Explainable AML Triage with LLMs: Evidence Retrieval and Counterfactual Checks

📝 Article 📄 PDF

Listen on Spotify (EN) Copy RSS (EN) Listen on Spotify (FR) Copy RSS (FR)

ArXiv AI: Weekly Top Picks

This week in AI papers

LLM Daily – Toward a Safe Internet of Agents

LLM Daily – AgentVisor: Defending LLM Agents Against Prompt Injection via Semantic Virtualiz

LLM Daily – Evaluation of Prompt Injection Defenses in Large Language Models

LLM Daily – Transient Turn Injection: Exposing Stateless Multi-Turn Vulnerabilities in Large

LLM Daily – AgentEval: DAG-Structured Step-Level Evaluation for Agentic Workflows with Error

LLM Daily – Tool Attention Is All You Need: Dynamic Tool Gating and Lazy Schema Loading for

LLM Daily – Breaking MCP with Function Hijacking Attacks: Novel Threats for Function Calling

LLM Daily – The Last Harness You'll Ever Build

LLM Daily – Explainable AML Triage with LLMs: Evidence Retrieval and Counterfactual Checks

Read more

Your Bankers Are Ready. Your Bank Isn't.

One Line in Shanghai: What Xi's AI Speech Tells European Banks Betting on Chinese Open Models

Article 50 Goes Live in Five Days — and It Stopped Being a Legal Problem

Stop Waiting: This Is the Best Time to Hire Junior Talent.