ArXiv AI: Weekly Top Picks

Coverage: 2025-11-10 → 2025-11-17

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – A Workflow for Full Traceability of AI Decisions

2025-11-17 · 10 min · 14.2 MB

Excerpt — An ever-increasing number of high-stakes decisions are made or assisted by automated systems employing brittle artificial intelligence technology. There is a substantial risk that some of these decisions induce harm to…

LLM Daily – Building the Web for Agents: A Declarative Framework for Agent-Web Interaction

2025-11-17 · 10 min · 15.5 MB

Excerpt — The increasing deployment of autonomous AI agents on the web is hampered by a fundamental misalignment: agents must infer affordances from human-oriented user interfaces, leading to brittle, inefficient, and insecure…

LLM Daily – iMAD: Intelligent Multi-Agent Debate for Efficient and Accurate LLM Inference

2025-11-17 · 10 min · 14.3 MB

Excerpt — Large Language Model (LLM) agent systems have advanced rapidly, driven by their strong generalization in zero-shot settings. To further enhance reasoning and accuracy on complex tasks, Multi-Agent Debate (MAD) has…

LLM Daily – Privacy Challenges and Solutions in Retrieval-Augmented Generation-Enhanced LLMs

2025-11-17 · 10 min · 13.5 MB

Excerpt — Retrieval-augmented generation (RAG) has rapidly emerged as a transformative approach for integrating large language models into clinical and biomedical workflows. However, privacy risks, such as protected health…

LLM Daily – Experience-Guided Adaptation of Inference-Time Reasoning Strategies

2025-11-17 · 10 min · 13.7 MB

Excerpt — Enabling agentic AI systems to adapt their problem-solving approaches based on post-training interactions remains a fundamental challenge. While systems that update and maintain a memory at inference time have been…

LLM Daily – PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reas

2025-11-17 · 10 min · 18.1 MB

Excerpt — Frontier model progress is often measured by academic benchmarks, which offer a limited view of performance in real-world professional contexts. Existing evaluations often fail to assess open-ended, economically…
