ArXiv AI: Weekly Top Picks

Coverage: 2025-11-16 → 2025-11-23

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – Distributed Agent Reasoning Across Independent Systems With Strict Data Locality

2025-11-22 · 10 min · 15.3 MB

Excerpt — This paper presents a proof-of-concept demonstration of agent-to-agent communication across distributed systems, using only natural-language messages and without shared identifiers, structured schemas, or centralised…

LLM Daily – Trustworthy AI in the Agentic Lakehouse: from Concurrency to Governance

2025-11-22 · 10 min · 21.0 MB

Excerpt — Even as AI capabilities improve, most enterprises do not consider agents trustworthy enough to work on production data. In this paper, we argue that the path to trustworthy agentic workflows begins with solving the…

LLM Daily – SOLID: a Framework of Synergizing Optimization and LLMs for Intelligent Decision

2025-11-21 · 10 min · 14.9 MB

Excerpt — This paper introduces SOLID (Synergizing Optimization and Large Language Models for Intelligent Decision-Making), a novel framework that integrates mathematical optimization with the contextual capabilities of large…

LLM Daily – As If We've Met Before: LLMs Exhibit Certainty in Recognizing Seen Files

2025-11-20 · 10 min · 13.3 MB

Excerpt — The remarkable language ability of Large Language Models (LLMs) stems from extensive training on vast datasets, often including copyrighted material, which raises serious concerns about unauthorized use. While…

LLM Daily – Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Framework

2025-11-20 · 10 min · 14.9 MB

Excerpt — Large Language Model (LLM)-based agents with function-calling capabilities are increasingly deployed, but remain vulnerable to Indirect Prompt Injection (IPI) attacks that hijack their tool calls. In response, numerous…

LLM Daily – AutoTool: Efficient Tool Selection for Large Language Model Agents

2025-11-19 · 10 min · 14.4 MB

Excerpt — Large Language Model (LLM) agents have emerged as powerful tools for automating complex tasks by leveraging the reasoning and decision-making abilities of LLMs. However, a major bottleneck in current agent frameworks…

LLM Daily – Streamlining Industrial Contract Management with Retrieval-Augmented LLMs

2025-11-19 · 10 min · 13.9 MB

Excerpt — Contract management involves reviewing and negotiating provisions, individual clauses that define rights, obligations, and terms of agreement. During this process, revisions to provisions are proposed and iteratively…

LLM Daily – Mem-PAL: Towards Memory-based Personalized Dialogue Assistants for Long-term Use

2025-11-18 · 10 min · 16.0 MB

Excerpt — With the rise of smart personal devices, service-oriented human-agent interactions have become increasingly prevalent. This trend highlights the need for personalized dialogue assistants that can understand user-…

LLM Daily – Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents

2025-11-18 · 10 min · 18.1 MB

Excerpt — Recent advancements in LLM-powered agents have demonstrated significant potential in generating human-like responses; however, they continue to face challenges in maintaining long-term interactions within complex…

LLM Daily – A Workflow for Full Traceability of AI Decisions

2025-11-17 · 10 min · 14.2 MB

Excerpt — An ever increasing number of high-stake decisions are made or assisted by automated systems employing brittle artificial intelligence technology. There is a substantial risk that some of these decisions induce harm to…

LLM Daily – Building the Web for Agents: A Declarative Framework for Agent-Web Interaction

2025-11-17 · 10 min · 15.5 MB

Excerpt — The increasing deployment of autonomous AI agents on the web is hampered by a fundamental misalignment: agents must infer affordances from human-oriented user interfaces, leading to brittle, inefficient, and insecure…

LLM Daily – iMAD: Intelligent Multi-Agent Debate for Efficient and Accurate LLM Inference

2025-11-17 · 10 min · 14.3 MB

Excerpt — Large Language Model (LLM) agent systems have advanced rapidly, driven by their strong generalization in zero-shot settings. To further enhance reasoning and accuracy on complex tasks, Multi-Agent Debate (MAD) has…

LLM Daily – Privacy Challenges and Solutions in Retrieval-Augmented Generation-Enhanced LLMs

2025-11-17 · 10 min · 13.5 MB

Excerpt — Retrieval-augmented generation (RAG) has rapidly emerged as a transformative approach for integrating large language models into clinical and biomedical workflows. However, privacy risks, such as protected health…

LLM Daily – Experience-Guided Adaptation of Inference-Time Reasoning Strategies

2025-11-17 · 10 min · 13.7 MB

Excerpt — Enabling agentic AI systems to adapt their problem-solving approaches based on post-training interactions remains a fundamental challenge. While systems that update and maintain a memory at inference time have been…

LLM Daily – PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reas

2025-11-17 · 10 min · 18.1 MB

Excerpt — Frontier model progress is often measured by academic benchmarks, which offer a limited view of performance in real-world professional contexts. Existing evaluations often fail to assess open-ended, economically…
