ArXiv AI: Weekly Top Picks

Coverage: 2026-03-15 → 2026-03-22

This week in AI papers

We track new AI papers on arXiv, pick the one or two that matter most each day, and explain the key ideas clearly, without hype.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-

2026-03-22 · 10 min · 11.6 MB

Excerpt — While scaling individual Large Language Models (LLMs) has delivered remarkable progress, the next frontier lies in scaling collaboration through multi-agent systems (MAS). However, purely autonomous MAS remain "closed-…

LLM Daily – XAI for Coding Agent Failures: Transforming Raw Execution Traces into Actionable

2026-03-22 · 10 min · 12.4 MB

Excerpt — Large Language Model (LLM)-based coding agents show promise in automating software development tasks, yet they frequently fail in ways that are difficult for developers to understand and debug. While general-purpose…

LLM Daily – MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ S

2026-03-21 · 10 min · 15.0 MB

Excerpt — Memory-augmented LLM agents maintain external memory banks to support long-horizon interaction, yet most existing systems treat construction, retrieval, and utilization as isolated subroutines. This creates two coupled…

LLM Daily – From Accuracy to Readiness: Metrics and Benchmarks for Human-AI Decision-Making

2026-03-21 · 10 min · 13.5 MB

Excerpt — Artificial intelligence (AI) systems are deployed as collaborators in human decision-making. Yet, evaluation practices focus primarily on model accuracy rather than whether human-AI teams are prepared to collaborate…

LLM Daily – Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review

2026-03-20 · 10 min · 12.0 MB

Excerpt — Security code reviews increasingly rely on systems integrating Large Language Models (LLMs), ranging from interactive assistants to autonomous agents in CI/CD pipelines. We study whether confirmation bias (i.e., the…

LLM Daily – Security, privacy, and agentic AI in a regulatory view: From definitions and dis

2026-03-20 · 10 min · 13.4 MB

Excerpt — The rapid proliferation of artificial intelligence (AI) technologies has led to a dynamic regulatory landscape, where legislative frameworks strive to keep pace with technical advancements. As AI paradigms shift towards…

LLM Daily – Expert Mind: A Retrieval-Augmented Architecture for Expert Knowledge Preservatio

2026-03-19 · 10 min · 13.1 MB

Excerpt — The departure of subject-matter experts from industrial organizations results in the irreversible loss of tacit knowledge that is rarely captured through conventional documentation practices. This paper proposes Expert…

LLM Daily – PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent

2026-03-19 · 10 min · 11.1 MB

Excerpt — Current Graphical User Interface (GUI) agents operate primarily under a reactive paradigm: a user must provide an explicit instruction for the agent to execute a task. However, an intelligent AI assistant should be…

LLM Daily – AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents

2026-03-18 · 10 min · 13.8 MB

Excerpt — Large language model (LLM) agents increasingly rely on external memory to support long-horizon interaction, personalized assistance, and multi-step reasoning. However, existing memory systems still face three core…

LLM Daily – AI Planning Framework for LLM-Based Web Agents

2026-03-17 · 10 min · 12.4 MB

Excerpt — Developing autonomous agents for web-based tasks is a core challenge in AI. While Large Language Model (LLM) agents can interpret complex user requests, they often operate as black boxes, making it difficult to diagnose…

LLM Daily – Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Ampli

2026-03-16 · 10 min · 13.8 MB

Excerpt — Rapid progress in generative AI has given rise to Compound AI systems - pipelines comprised of multiple large language models (LLM), software tools and database systems. Compound AI systems are constructed on a layered…

LLM Daily – Semantic Invariance in Agentic AI

2026-03-16 · 10 min · 13.0 MB

Excerpt — Large Language Models (LLMs) increasingly serve as autonomous reasoning agents in decision support, scientific problem-solving, and multi-agent coordination systems. However, deploying LLM agents in consequential…

LLM Daily – Governing Evolving Memory in LLM Agents: Risks, Mechanisms, and the Stability an

2026-03-15 · 10 min · 13.8 MB

Excerpt — Long-term memory has emerged as a foundational component of autonomous Large Language Model (LLM) agents, enabling continuous adaptation, lifelong multimodal learning, and sophisticated reasoning. However, as memory…
