ArXiv AI: Weekly Top Picks

1766007197438

Coverage: 2026-04-05 → 2026-04-12

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic

2026-04-12 · 10 min · 10.9 MB

Excerpt — Multi-agent systems powered by large language models (LLMs) are increasingly deployed in settings that shape consequential decisions, both directly and indirectly. Yet it remains unclear whether their outcomes reflect…

LLM Daily – When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic

📝 Article 📄 PDF

LLM Daily – Polaris: A Gödel Agent Framework for Small Language Models through Experience-Ab

2026-04-11 · 10 min · 13.3 MB

Excerpt — Gödel agent realize recursive self-improvement: an agent inspects its own policy and traces and then modifies that policy in a tested loop. We introduce Polaris, a Gödel agent for compact models that performs policy…

LLM Daily – Polaris: A Gödel Agent Framework for Small Language Models through Experience-Ab

📝 Article 📄 PDF

LLM Daily – An Agentic Multi-Agent Architecture for Cybersecurity Risk Management

2026-04-10 · 10 min · 13.7 MB

Excerpt — Getting a real cybersecurity risk assessment for a small organization is expensive -- a NIST CSF-aligned engagement runs $15,000 on the low end, takes weeks, and depends on practitioners who are genuinely scarce. Most…

LLM Daily – An Agentic Multi-Agent Architecture for Cybersecurity Risk Management

📝 Article 📄 PDF

LLM Daily – Counterfactual Credit Policy Optimization for Multi-Agent Collaboration

2026-04-10 · 10 min · 12.6 MB

Excerpt — Collaborative multi-agent large language models (LLMs) can solve complex reasoning tasks by decomposing roles and aggregating diverse hypotheses. Yet, reinforcement learning (RL) for such systems is often undermined by…

LLM Daily – Counterfactual Credit Policy Optimization for Multi-Agent Collaboration

📝 Article 📄 PDF

LLM Daily – Lightweight Adaptation for LLM-based Technical Service Agent: Latent Logic Augme

2026-04-09 · 10 min · 12.4 MB

Excerpt — Adapting Large Language Models in complex technical service domains is constrained by the absence of explicit cognitive chains in human demonstrations and the inherent ambiguity arising from the diversity of valid…

LLM Daily – Lightweight Adaptation for LLM-based Technical Service Agent: Latent Logic Augme

📝 Article 📄 PDF

LLM Daily – FinTradeBench: A Financial Reasoning Benchmark for LLMs

2026-04-09 · 10 min · 13.2 MB

Excerpt — Real-world financial decision-making is a challenging problem that requires reasoning over heterogeneous signals, including company fundamentals derived from regulatory filings and trading signals computed from price…

LLM Daily – FinTradeBench: A Financial Reasoning Benchmark for LLMs

📝 Article 📄 PDF

LLM Daily – FinReporting: An Agentic Workflow for Localized Reporting of Cross-Jurisdiction

2026-04-08 · 10 min · 8.8 MB

Excerpt — Financial reporting systems increasingly use large language models (LLMs) to extract and summarize corporate disclosures. However, most assume a single-market setting and do not address structural differences across…

LLM Daily – FinReporting: An Agentic Workflow for Localized Reporting of Cross-Jurisdiction

📝 Article 📄 PDF

LLM Daily – Who Governs the Machine? A Machine Identity Governance Taxonomy (MIGT) for AI Sy

2026-04-08 · 10 min · 7.6 MB

Excerpt — The governance of artificial intelligence has a blind spot: the machine identities that AI systems use to act. AI agents, service accounts, API tokens, and automated workflows now outnumber human identities in…

📝 Article 📄 PDF

LLM Daily – Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Ma

2026-04-08 · 10 min · 8.7 MB

Excerpt — Large language model (LLM) agents are increasingly acting as human delegates in multi-agent environments, where a representative agent integrates diverse peer perspectives to make a final decision. Drawing inspiration…

📝 Article 📄 PDF

LLM Daily – Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Ma

2026-04-08 · 10 min · 8.7 MB

📝 Article 📄 PDF

LLM Daily – FinReporting: An Agentic Workflow for Localized Reporting of Cross-Jurisdiction

2026-04-08 · 10 min · 7.1 MB

📝 Article 📄 PDF

LLM Daily – A Formal Security Framework for MCP-Based AI Agents: Threat Taxonomy, Verificati

2026-04-08 · 10 min · 7.0 MB

Excerpt — The Model Context Protocol (MCP), introduced by Anthropic in November 2024 and now governed by the Linux Foundation's Agentic AI Foundation, has rapidly become the de facto standard for connecting large language model…

📝 Article 📄 PDF

LLM Daily – Visual Distraction Undermines Moral Reasoning in Vision-Language Models

2026-04-08 · 10 min · 8.6 MB

Excerpt — Moral reasoning is fundamental to safe Artificial Intelligence (AI), yet ensuring its consistency across modalities becomes critical as AI systems evolve from text-based assistants to embodied agents. Current safety…

LLM Daily – Visual Distraction Undermines Moral Reasoning in Vision-Language Models

📝 Article 📄 PDF

LLM Daily – Anticipatory Planning for Multimodal AI Agents

2026-04-08 · 10 min · 7.2 MB

Excerpt — Recent advances in multimodal agents have improved computer-use interaction and tool-usage, yet most existing systems remain reactive, optimizing actions in isolation without reasoning about future states or long-term…

LLM Daily – Anticipatory Planning for Multimodal AI Agents

📝 Article 📄 PDF

LLM Daily – Reliable Control-Point Selection for Steering Reasoning in Large Language Models

2026-04-05 · 10 min · 7.0 MB

Excerpt — Steering vectors offer a training-free mechanism for controlling reasoning behaviors in large language models, but constructing effective vectors requires identifying genuine behavioral signals in the model's hidden…

📝 Article 📄 PDF

LLM Daily – RetailBench: Evaluating Long-Horizon Autonomous Decision-Making and Strategy Sta

2026-04-05 · 10 min · 7.1 MB

Excerpt — Large Language Model (LLM)-based agents have achieved notable success on short-horizon and highly structured tasks. However, their ability to maintain coherent decision-making over long horizons in realistic and dynamic…

📝 Article 📄 PDF

Listen on Spotify (EN) Copy RSS (EN) Listen on Spotify (FR) Copy RSS (FR)

ArXiv AI: Weekly Top Picks

This week in AI papers

LLM Daily – When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic

LLM Daily – Polaris: A Gödel Agent Framework for Small Language Models through Experience-Ab

LLM Daily – An Agentic Multi-Agent Architecture for Cybersecurity Risk Management

LLM Daily – Counterfactual Credit Policy Optimization for Multi-Agent Collaboration

LLM Daily – Lightweight Adaptation for LLM-based Technical Service Agent: Latent Logic Augme

LLM Daily – FinTradeBench: A Financial Reasoning Benchmark for LLMs

LLM Daily – FinReporting: An Agentic Workflow for Localized Reporting of Cross-Jurisdiction

LLM Daily – Who Governs the Machine? A Machine Identity Governance Taxonomy (MIGT) for AI Sy

LLM Daily – Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Ma

LLM Daily – Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Ma

LLM Daily – FinReporting: An Agentic Workflow for Localized Reporting of Cross-Jurisdiction

LLM Daily – A Formal Security Framework for MCP-Based AI Agents: Threat Taxonomy, Verificati

LLM Daily – Visual Distraction Undermines Moral Reasoning in Vision-Language Models

LLM Daily – Anticipatory Planning for Multimodal AI Agents

LLM Daily – Reliable Control-Point Selection for Steering Reasoning in Large Language Models

LLM Daily – RetailBench: Evaluating Long-Horizon Autonomous Decision-Making and Strategy Sta

Read more

Your Bankers Are Ready. Your Bank Isn't.

One Line in Shanghai: What Xi's AI Speech Tells European Banks Betting on Chinese Open Models

Article 50 Goes Live in Five Days — and It Stopped Being a Legal Problem

Stop Waiting: This Is the Best Time to Hire Junior Talent.