ArXiv AI: Weekly Top Picks

cover

Coverage: 2025-12-09 → 2025-12-16

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – Memory in the Age of AI Agents

2025-12-16 · 10 min · 13.1 MB

Excerpt — Memory has emerged, and will continue to remain, a core capability of foundation model-based agents. As research on agent memory rapidly expands and attracts unprecedented attention, the field has also become…

LLM Daily – Memory in the Age of AI Agents

📝 Article 📄 PDF

LLM Daily – Information-Consistent Language Model Recommendations through Group Relative Pol

2025-12-16 · 10 min · 13.8 MB

Excerpt — Large Language Models (LLMs) are increasingly deployed in business-critical domains such as finance, education, healthcare, and customer support, where users expect consistent and reliable recommendations. Yet LLMs…

📝 Article 📄 PDF

LLM Daily – Forgetful but Faithful: A Cognitive Memory Architecture and Benchmark for Privac

2025-12-16 · 10 min · 14.5 MB

Excerpt — As generative agents become increasingly sophisticated and deployed in long-term interactive scenarios, their memory management capabilities emerge as a critical bottleneck for both performance and privacy. Current…

LLM Daily – Forgetful but Faithful: A Cognitive Memory Architecture and Benchmark for Privac

📝 Article 📄 PDF

LLM Daily – AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning

2025-12-16 · 10 min · 12.9 MB

Excerpt — Agentic reinforcement learning has advanced large language models (LLMs) to reason through long chain-of-thought trajectories while interleaving external tool use. Existing approaches assume a fixed inventory of tools,…

LLM Daily – AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning

📝 Article 📄 PDF

LLM Daily – Cooperative Retrieval-Augmented Generation for Question Answering: Mutual Inform

2025-12-14 · 10 min · 15.8 MB

Excerpt — Since large language models (LLMs) have a tendency to generate factually inaccurate output, retrieval-augmented generation (RAG) has gained significant attention as a key means to mitigate this downside of harnessing…

LLM Daily – Cooperative Retrieval-Augmented Generation for Question Answering: Mutual Inform

📝 Article 📄 PDF

LLM Daily – Titans: Learning to Memorize at Test Time

2025-12-13 · 10 min · 12.7 MB

Excerpt — Over more than a decade there has been an extensive research effort on how to effectively utilize recurrent models and attention. While recurrent models aim to compress the data into a fixed-size memory (called hidden…

LLM Daily – Titans: Learning to Memorize at Test Time

📝 Article 📄 PDF

LLM Daily – It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias,

2025-12-13 · 10 min · 14.5 MB

Excerpt — Designing efficient and effective architectural backbones has been in the core of research efforts to enhance the capability of foundation models. Inspired by the human cognitive phenomenon of attentional bias-the…

LLM Daily – It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias,

📝 Article 📄 PDF

LLM Daily – NormCode: A Semi-Formal Language for Context-Isolated AI Planning

2025-12-13 · 10 min · 14.0 MB

Excerpt — Multistep workflows that chain large language model (LLM) calls suffer from context pollution: as information accumulates across steps, models hallucinate, confuse intermediate outputs, and lose track of task…

📝 Article 📄 PDF

LLM Daily – Asynchronous Reasoning: Training-Free Interactive Thinking LLMs

2025-12-13 · 10 min · 12.7 MB

Excerpt — Many state-of-the-art LLMs are trained to think before giving their answer. Reasoning can greatly improve language model capabilities and safety, but it also makes them less interactive: given a new input, a model must…

LLM Daily – Asynchronous Reasoning: Training-Free Interactive Thinking LLMs

📝 Article 📄 PDF

LLM Daily – An End-to-end Planning Framework with Agentic LLMs and PDDL

2025-12-12 · 10 min · 13.4 MB

Excerpt — We present an end-to-end framework for planning supported by verifiers. An orchestrator receives a human specification written in natural language and converts it into a PDDL (Planning Domain Definition Language) model,…

LLM Daily – An End-to-end Planning Framework with Agentic LLMs and PDDL

📝 Article 📄 PDF

LLM Daily – Architectures for Building Agentic AI

2025-12-12 · 10 min · 13.8 MB

Excerpt — This chapter argues that the reliability of agentic and generative AI is chiefly an architectural property. We define agentic systems as goal-directed, tool-using decision makers operating in closed loops, and show how…

LLM Daily – Architectures for Building Agentic AI

📝 Article 📄 PDF

LLM Daily – Systematization of Knowledge: Security and Safety in the Model Context Protocol

2025-12-10 · 10 min · 15.7 MB

Excerpt — The Model Context Protocol (MCP) has emerged as the de facto standard for connecting Large Language Models (LLMs) to external data and tools, effectively functioning as the "USB-C for Agentic AI." While this decoupling…

LLM Daily – Systematization of Knowledge: Security and Safety in the Model Context Protocol

📝 Article 📄 PDF

LLM Daily – A Practical Guide for Designing, Developing, and Deploying Production-Grade Agen

2025-12-10 · 10 min · 12.1 MB

Excerpt — Agentic AI marks a major shift in how autonomous systems reason, plan, and execute multi-step tasks. Unlike traditional single model prompting, agentic workflows integrate multiple specialized agents with different…

LLM Daily – A Practical Guide for Designing, Developing, and Deploying Production-Grade Agen

📝 Article 📄 PDF

LLM Daily – Collaborative Causal Sensemaking: Closing the Complementarity Gap in Human-AI De

2025-12-09 · 10 min · 14.6 MB

Excerpt — LLM-based agents are rapidly being plugged into expert decision-support, yet in messy, high-stakes settings they rarely make the team smarter: human-AI teams often underperform the best individual, experts oscillate…

📝 Article 📄 PDF

LLM Daily – VIGIL: A Reflective Runtime for Self-Healing Agents

2025-12-09 · 10 min · 13.9 MB

Excerpt — Agentic LLM frameworks promise autonomous behavior via task decomposition, tool use, and iterative planning, but most deployed systems remain brittle. They lack runtime introspection, cannot diagnose their own failure…

📝 Article 📄 PDF

Listen on Spotify (EN) Copy RSS (EN) Listen on Spotify (FR) Copy RSS (FR)

ArXiv AI: Weekly Top Picks

This week in AI papers

LLM Daily – Memory in the Age of AI Agents

LLM Daily – Information-Consistent Language Model Recommendations through Group Relative Pol

LLM Daily – Forgetful but Faithful: A Cognitive Memory Architecture and Benchmark for Privac

LLM Daily – AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning

LLM Daily – Cooperative Retrieval-Augmented Generation for Question Answering: Mutual Inform

LLM Daily – Titans: Learning to Memorize at Test Time

LLM Daily – It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias,

LLM Daily – NormCode: A Semi-Formal Language for Context-Isolated AI Planning

LLM Daily – Asynchronous Reasoning: Training-Free Interactive Thinking LLMs

LLM Daily – An End-to-end Planning Framework with Agentic LLMs and PDDL

LLM Daily – Architectures for Building Agentic AI

LLM Daily – Systematization of Knowledge: Security and Safety in the Model Context Protocol

LLM Daily – A Practical Guide for Designing, Developing, and Deploying Production-Grade Agen

LLM Daily – Collaborative Causal Sensemaking: Closing the Complementarity Gap in Human-AI De

LLM Daily – VIGIL: A Reflective Runtime for Self-Healing Agents

Read more

AI Signals Report — Control planes, not just models

Your Bankers Are Ready. Your Bank Isn't.

One Line in Shanghai: What Xi's AI Speech Tells European Banks Betting on Chinese Open Models

Article 50 Goes Live in Five Days — and It Stopped Being a Legal Problem