ArXiv AI: Weekly Top Picks

ChatGPT Image May 9, 2026, 03_31_42 PM

Coverage: 2026-05-10 → 2026-05-17

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – InvThink: Premortem Reasoning for Safer Language Models

2026-05-16 · 10 min · 6.4 MB

Excerpt — We present InvThink, a training and prompting framework that requires the model to enumerate, analyze, and constrain potential failures before generating its final response. Unlike existing safety alignment methods that…

LLM Daily – InvThink: Premortem Reasoning for Safer Language Models

📝 Article 📄 PDF

LLM Daily – From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems

2026-05-16 · 10 min · 7.5 MB

Excerpt — Compound AI Systems (CAIS) are an emerging paradigm that integrates large language models (LLMs) with external components, including retrievers, agents, tools, and orchestrators, to overcome the limitations of…

LLM Daily – From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems

📝 Article 📄 PDF

LLM Daily – Towards Security-Auditable LLM Agents: A Unified Graph Representation

2026-05-15 · 10 min · 8.2 MB

Excerpt — LLM-based agentic systems are rapidly evolving to perform complex autonomous tasks through dynamic tool invocation, stateful memory management, and multi-agent collaboration. However, this semantics-driven execution…

LLM Daily – Towards Security-Auditable LLM Agents: A Unified Graph Representation

📝 Article 📄 PDF

LLM Daily – Switchcraft: AI Model Router for Agentic Tool Calling

2026-05-15 · 10 min · 7.9 MB

Excerpt — Agentic AI systems that invoke external tools are powerful but costly, leading developers to default to large models and overspend inference budgets. Model routing can mitigate this, but existing routers are designed…

LLM Daily – Switchcraft: AI Model Router for Agentic Tool Calling

📝 Article 📄 PDF

LLM Daily – When Routine Chats Turn Toxic: Unintended Long-Term State Poisoning in Personali

2026-05-14 · 10 min · 12.3 MB

Excerpt — Personalized LLM agents maintain persistent cross-session state to support long-horizon collaboration. Yet, this persistence introduces a subtle but critical security vulnerability: routine user-agent interactions can…

LLM Daily – When Routine Chats Turn Toxic: Unintended Long-Term State Poisoning in Personali

📝 Article 📄 PDF

LLM Daily – A Self-Healing Framework for Reliable LLM-Based Autonomous Agents

2026-05-14 · 10 min · 9.7 MB

Excerpt — Autonomous agents based on Large Language Models (LLMs) are increasingly being utilized in complex software systems. However, reliability remains a significant challenge due to unpredictable failures such as…

LLM Daily – A Self-Healing Framework for Reliable LLM-Based Autonomous Agents

📝 Article 📄 PDF

LLM Daily – Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Lar

2026-05-13 · 10 min · 6.5 MB

Excerpt — Large language models increasingly rely on explicit chain-of-thought reasoning to solve complex tasks, yet the safety of the reasoning process itself remains largely unaddressed. Existing work focuses predominantly on…

LLM Daily – Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Lar

📝 Article 📄 PDF

LLM Daily – AgentTrust: Runtime Safety Evaluation and Interception for AI Agent Tool Use

2026-05-13 · 10 min · 8.6 MB

Excerpt — Modern AI agents execute real-world side effects through tool calls such as file operations, shell commands, HTTP requests, and database queries. A single unsafe action, including accidental deletion, credential…

LLM Daily – AgentTrust: Runtime Safety Evaluation and Interception for AI Agent Tool Use

📝 Article 📄 PDF

LLM Daily – MAGE: Safeguarding LLM Agents against Long-Horizon Threats via Shadow Memory

2026-05-12 · 10 min · 8.2 MB

Excerpt — As large language model (LLM)-powered agents are increasingly deployed to perform complex, real-world tasks, they face a growing class of attacks that exploit extended user-agent-environment interactions to pursue…

LLM Daily – MAGE: Safeguarding LLM Agents against Long-Horizon Threats via Shadow Memory

📝 Article 📄 PDF

LLM Daily – ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection

2026-05-12 · 10 min · 7.7 MB

Excerpt — The rise of Large Language Model (LLM) agents, augmented with tool use, skills, and external knowledge, has introduced new security risks. Among them, prompt injection attacks, where adversaries embed malicious…

LLM Daily – ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection

📝 Article 📄 PDF

LLM Daily – Trojan Hippo: Weaponizing Agent Memory for Data Exfiltration

2026-05-11 · 10 min · 11.6 MB

Excerpt — Memory systems enable otherwise-stateless LLM agents to persist user information across sessions, but also introduce a new attack surface. We characterize the Trojan Hippo attack, a class of persistent memory attacks…

LLM Daily – Trojan Hippo: Weaponizing Agent Memory for Data Exfiltration

📝 Article 📄 PDF

LLM Daily – A Low-Latency Fraud Detection Layer for Detecting Adversarial Interaction Patter

2026-05-10 · 10 min · 12.5 MB

Excerpt — Large Language Model (LLM)-powered agents demonstrate strong capabilities in autonomous task execution, tool use, and multi-step reasoning. However, their increasing autonomy also introduces a new attack surface:…

📝 Article 📄 PDF

LLM Daily – Evaluating Agentic AI in the Wild: Failure Modes, Drift Patterns, and a Producti

2026-05-10 · 10 min · 12.5 MB

Excerpt — Existing evaluation frameworks for large language models -- including HELM, MT-Bench, AgentBench, and BIG-bench -- are designed for controlled, single-session, lab-scale settings. They do not address the evaluation…

📝 Article 📄 PDF

Listen on Spotify (EN) Copy RSS (EN) Listen on Spotify (FR) Copy RSS (FR)

ArXiv AI: Weekly Top Picks

This week in AI papers

LLM Daily – InvThink: Premortem Reasoning for Safer Language Models

LLM Daily – From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems

LLM Daily – Towards Security-Auditable LLM Agents: A Unified Graph Representation

LLM Daily – Switchcraft: AI Model Router for Agentic Tool Calling

LLM Daily – When Routine Chats Turn Toxic: Unintended Long-Term State Poisoning in Personali

LLM Daily – A Self-Healing Framework for Reliable LLM-Based Autonomous Agents

LLM Daily – Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Lar

LLM Daily – AgentTrust: Runtime Safety Evaluation and Interception for AI Agent Tool Use

LLM Daily – MAGE: Safeguarding LLM Agents against Long-Horizon Threats via Shadow Memory

LLM Daily – ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection

LLM Daily – Trojan Hippo: Weaponizing Agent Memory for Data Exfiltration

LLM Daily – A Low-Latency Fraud Detection Layer for Detecting Adversarial Interaction Patter

LLM Daily – Evaluating Agentic AI in the Wild: Failure Modes, Drift Patterns, and a Producti

Read more

Your Bankers Are Ready. Your Bank Isn't.

One Line in Shanghai: What Xi's AI Speech Tells European Banks Betting on Chinese Open Models

Article 50 Goes Live in Five Days — and It Stopped Being a Legal Problem

Stop Waiting: This Is the Best Time to Hire Junior Talent.