ArXiv AI: Weekly Top Picks

cover

Coverage: 2025-12-18 → 2025-12-25

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – Step-DeepResearch Technical Report

2025-12-25 · 10 min · 13.1 MB

Excerpt — As LLMs shift toward autonomous agents, Deep Research has emerged as a pivotal metric. However, existing academic benchmarks like BrowseComp often fail to meet real-world demands for open-ended research, which requires…

LLM Daily – Step-DeepResearch Technical Report

📝 Article 📄 PDF

LLM Daily – AprielGuard

2025-12-25 · 10 min · 15.1 MB

Excerpt — Safeguarding large language models (LLMs) against unsafe or adversarial behavior is critical as they are increasingly deployed in conversational and agentic settings. Existing moderation tools often treat safety risks…

📝 Article 📄 PDF

LLM Daily – GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simul

2025-12-24 · 10 min · 13.4 MB

Excerpt — Training capable Large Language Model (LLM) agents is critically bottlenecked by the high cost and static nature of real-world interaction data. We address this by introducing GenEnv, a framework that establishes a…

LLM Daily – GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simul

📝 Article 📄 PDF

LLM Daily – QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retri

2025-12-23 · 10 min · 13.1 MB

Excerpt — Dynamic Retrieval-Augmented Generation adaptively determines when to retrieve during generation to mitigate hallucinations in large language models (LLMs). However, existing methods rely on model-internal signals (e.g.,…

LLM Daily – QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retri

📝 Article 📄 PDF

LLM Daily – AWPO: Enhancing Tool-Use of Large Language Models through Explicit Integration o

2025-12-23 · 10 min · 14.0 MB

Excerpt — While reinforcement learning (RL) shows promise in training tool-use large language models (LLMs) using verifiable outcome rewards, existing methods largely overlook the potential of explicit reasoning rewards to…

LLM Daily – AWPO: Enhancing Tool-Use of Large Language Models through Explicit Integration o

📝 Article 📄 PDF

LLM Daily – Behavioural Effects of Agentic Messaging: A Case Study on a Financial Service Ap

2025-12-22 · 10 min · 15.6 MB

Excerpt — Marketing and product personalisation provide a prominent and visible use-case for the application of Information Retrieval methods across several business domains. Recently, agentic approaches to these problems have…

LLM Daily – Behavioural Effects of Agentic Messaging: A Case Study on a Financial Service Ap

📝 Article 📄 PDF

LLM Daily – Humanlike AI Design Increases Anthropomorphism but Yields Divergent Outcomes on

2025-12-22 · 10 min · 15.0 MB

Excerpt — Over a billion users across the globe interact with AI systems engineered with increasing sophistication to mimic human traits. This shift has triggered urgent debate regarding Anthropomorphism, the attribution of human…

LLM Daily – Humanlike AI Design Increases Anthropomorphism but Yields Divergent Outcomes on

📝 Article 📄 PDF

LLM Daily – Emergent Bias and Fairness in Multi-Agent Decision Systems

2025-12-20 · 10 min · 14.7 MB

Excerpt — Multi-agent systems have demonstrated the ability to improve performance on a variety of predictive tasks by leveraging collaborative decision making. However, the lack of effective evaluation methodologies has made it…

LLM Daily – Emergent Bias and Fairness in Multi-Agent Decision Systems

📝 Article 📄 PDF

LLM Daily – From Facts to Conclusions : Integrating Deductive Reasoning in Retrieval-Augment

2025-12-19 · 10 min · 13.2 MB

Excerpt — Retrieval-Augmented Generation (RAG) grounds large language models (LLMs) in external evidence, but fails when retrieved sources conflict or contain outdated or subjective information. Prior work address these issues…

LLM Daily – From Facts to Conclusions : Integrating Deductive Reasoning in Retrieval-Augment

📝 Article 📄 PDF

LLM Daily – From Personalization to Prejudice: Bias and Discrimination in Memory-Enhanced AI

2025-12-19 · 10 min · 14.3 MB

Excerpt — Large Language Models (LLMs) have empowered AI agents with advanced capabilities for understanding, reasoning, and interacting across diverse tasks. The addition of memory further enhances them by enabling continuity…

LLM Daily – From Personalization to Prejudice: Bias and Discrimination in Memory-Enhanced AI

📝 Article 📄 PDF

LLM Daily – DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Auto

2025-12-19 · 10 min · 11.0 MB

Excerpt — The rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable, reliable, and semantically rich data preparation pipelines. However, current practices remain…

LLM Daily – DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Auto

📝 Article 📄 PDF

LLM Daily – AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models vi

2025-12-19 · 10 min · 15.0 MB

Excerpt — Equipping large language models (LLMs) with search engines via reinforcement learning (RL) has emerged as an effective approach for building search agents. However, overreliance on search introduces unnecessary cost and…

LLM Daily – AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models vi

📝 Article 📄 PDF

Listen on Spotify (EN) Copy RSS (EN) Listen on Spotify (FR) Copy RSS (FR)

ArXiv AI: Weekly Top Picks

This week in AI papers

LLM Daily – Step-DeepResearch Technical Report

LLM Daily – AprielGuard

LLM Daily – GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simul

LLM Daily – QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retri

LLM Daily – AWPO: Enhancing Tool-Use of Large Language Models through Explicit Integration o

LLM Daily – Behavioural Effects of Agentic Messaging: A Case Study on a Financial Service Ap

LLM Daily – Humanlike AI Design Increases Anthropomorphism but Yields Divergent Outcomes on

LLM Daily – Emergent Bias and Fairness in Multi-Agent Decision Systems

LLM Daily – From Facts to Conclusions : Integrating Deductive Reasoning in Retrieval-Augment

LLM Daily – From Personalization to Prejudice: Bias and Discrimination in Memory-Enhanced AI

LLM Daily – DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Auto

LLM Daily – AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models vi

Read more

AI Signals Report — Control planes, not just models

Your Bankers Are Ready. Your Bank Isn't.

One Line in Shanghai: What Xi's AI Speech Tells European Banks Betting on Chinese Open Models

Article 50 Goes Live in Five Days — and It Stopped Being a Legal Problem