ArXiv AI: Weekly Top Picks

1766007197438

Coverage: 2026-04-19 → 2026-04-26

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – Resolving the Robustness-Precision Trade-off in Financial RAG through Hybrid Doc

2026-04-25 · 10 min · 13.0 MB

Excerpt — Retrieval-Augmented Generation (RAG) systems for financial document question answering typically follow a chunk-based paradigm: documents are split into fragments, embedded into vector space, and retrieved via…

LLM Daily – Resolving the Robustness-Precision Trade-off in Financial RAG through Hybrid Doc

📝 Article 📄 PDF

LLM Daily – Whispers in the Machine: Confidentiality in Agentic Systems

2026-04-25 · 10 min · 11.0 MB

Excerpt — Large language model (LLM)-based agents combine LLMs with external tools to automate tasks such as scheduling meetings, managing documents, or booking travel. While these integrations unlock powerful capabilities, they…

LLM Daily – Whispers in the Machine: Confidentiality in Agentic Systems

📝 Article 📄 PDF

LLM Daily – An AI Agent Execution Environment to Safeguard User Data

2026-04-23 · 10 min · 13.4 MB

Excerpt — AI agents promise to serve as general-purpose personal assistants for their users, which requires them to have access to private user data (e.g., personal and financial information). This poses a serious risk to…

LLM Daily – An AI Agent Execution Environment to Safeguard User Data

📝 Article 📄 PDF

LLM Daily – ArbGraph: Conflict-Aware Evidence Arbitration for Reliable Long-Form Retrieval-A

2026-04-23 · 10 min · 12.1 MB

Excerpt — Retrieval-augmented generation (RAG) remains unreliable in long-form settings, where retrieved evidence is noisy or contradictory, making it difficult for RAG pipelines to maintain factual consistency. Existing…

LLM Daily – ArbGraph: Conflict-Aware Evidence Arbitration for Reliable Long-Form Retrieval-A

📝 Article 📄 PDF

LLM Daily – HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads

2026-04-22 · 10 min · 13.6 MB

Excerpt — When multiple LLM coding agents share a rate-limited API endpoint, they exhibit resource contention patterns analogous to unscheduled OS processes competing for CPU, memory, and I/O. In a motivating incident, 3 of 11…

LLM Daily – HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads

📝 Article 📄 PDF

LLM Daily – Parasites in the Toolchain: A Large-Scale Analysis of Attacks on the MCP Ecosyst

2026-04-22 · 10 min · 11.3 MB

Excerpt — Large language models(LLMs) are increasingly integrated with external systems through the Model Context Protocol(MCP),which standardizes tool invocation and has rapidly become a backbone for LLM-powered applications.…

LLM Daily – Parasites in the Toolchain: A Large-Scale Analysis of Attacks on the MCP Ecosyst

📝 Article 📄 PDF

LLM Daily – LLM Hypnosis: Exploiting User Feedback for Unauthorized Knowledge Injection to A

2026-04-21 · 10 min · 7.8 MB

Excerpt — We describe a vulnerability in language models (LMs) trained with user feedback, whereby a single user can persistently alter LM knowledge and behavior given only the ability to provide prompts and upvote / downvote…

LLM Daily – LLM Hypnosis: Exploiting User Feedback for Unauthorized Knowledge Injection to A

📝 Article 📄 PDF

LLM Daily – SafeAgent: A Runtime Protection Architecture for Agentic Systems

2026-04-21 · 10 min · 6.3 MB

Excerpt — Large language model (LLM) agents are vulnerable to prompt-injection attacks that propagate through multi-step workflows, tool interactions, and persistent context, making input-output filtering alone insufficient for…

LLM Daily – SafeAgent: A Runtime Protection Architecture for Agentic Systems

📝 Article 📄 PDF

Listen on Spotify (EN) Copy RSS (EN) Listen on Spotify (FR) Copy RSS (FR)

ArXiv AI: Weekly Top Picks

This week in AI papers

LLM Daily – Resolving the Robustness-Precision Trade-off in Financial RAG through Hybrid Doc

LLM Daily – Whispers in the Machine: Confidentiality in Agentic Systems

LLM Daily – An AI Agent Execution Environment to Safeguard User Data

LLM Daily – ArbGraph: Conflict-Aware Evidence Arbitration for Reliable Long-Form Retrieval-A

LLM Daily – HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads

LLM Daily – Parasites in the Toolchain: A Large-Scale Analysis of Attacks on the MCP Ecosyst

LLM Daily – LLM Hypnosis: Exploiting User Feedback for Unauthorized Knowledge Injection to A

LLM Daily – SafeAgent: A Runtime Protection Architecture for Agentic Systems

Read more

Your Bankers Are Ready. Your Bank Isn't.

One Line in Shanghai: What Xi's AI Speech Tells European Banks Betting on Chinese Open Models

Article 50 Goes Live in Five Days — and It Stopped Being a Legal Problem

Stop Waiting: This Is the Best Time to Hire Junior Talent.