ArXiv AI: Weekly Top Picks

cover

Coverage: 2026-01-02 → 2026-01-09

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – Defense Against Indirect Prompt Injection via Tool Result Parsing

2026-01-09 · 10 min · 14.1 MB

Excerpt — As LLM agents transition from digital assistants to physical controllers in autonomous systems and robotics, they face an escalating threat from indirect prompt injection. By embedding adversarial instructions into the…

LLM Daily – Defense Against Indirect Prompt Injection via Tool Result Parsing

📝 Article 📄 PDF

LLM Daily – Internal Representations as Indicators of Hallucinations in Agent Tool Selection

2026-01-09 · 10 min · 14.1 MB

Excerpt — Large Language Models (LLMs) have shown remarkable capabilities in tool calling and tool usage, but suffer from hallucinations where they choose incorrect tools, provide malformed parameters and exhibit 'tool bypass'…

LLM Daily – Internal Representations as Indicators of Hallucinations in Agent Tool Selection

📝 Article 📄 PDF

LLM Daily – Membox: Weaving Topic Continuity into Long-Range Memory for LLM Agents

2026-01-08 · 10 min · 13.9 MB

Excerpt — Human-agent dialogues often exhibit topic continuity-a stable thematic frame that evolves through temporally adjacent exchanges-yet most large language model (LLM) agent memory systems fail to preserve it. Existing…

LLM Daily – Membox: Weaving Topic Continuity into Long-Range Memory for LLM Agents

📝 Article 📄 PDF

LLM Daily – HoneyTrap: Deceiving Large Language Model Attackers to Honeypot Traps with Resil

2026-01-08 · 10 min · 13.9 MB

Excerpt — Jailbreak attacks pose significant threats to large language models (LLMs), enabling attackers to bypass safeguards. However, existing reactive defense approaches struggle to keep up with the rapidly evolving multi-turn…

LLM Daily – HoneyTrap: Deceiving Large Language Model Attackers to Honeypot Traps with Resil

📝 Article 📄 PDF

LLM Daily – Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM

2026-01-06 · 10 min · 12.5 MB

Excerpt — As Large Language Model (LLM) agents are increasingly tasked with high-stakes autonomous decision-making, the transparency of their reasoning processes has become a critical safety concern. While Chain-of-Thought (CoT)…

LLM Daily – Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM

📝 Article 📄 PDF

LLM Daily – Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for

2026-01-06 · 10 min · 14.8 MB

Excerpt — Large language model (LLM) agents face fundamental limitations in long-horizon reasoning due to finite context windows, making effective memory management critical. Existing methods typically handle long-term memory…

LLM Daily – Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for

📝 Article 📄 PDF

LLM Daily – LLM Agents for Combinatorial Efficient Frontiers: Investment Portfolio Optimizat

2026-01-05 · 10 min · 13.8 MB

Excerpt — Investment portfolio optimization is a task conducted in all major financial institutions. The Cardinality Constrained Mean-Variance Portfolio Optimization (CCPO) problem formulation is ubiquitous for portfolio…

LLM Daily – LLM Agents for Combinatorial Efficient Frontiers: Investment Portfolio Optimizat

📝 Article 📄 PDF

LLM Daily – Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs

2026-01-05 · 10 min · 14.6 MB

Excerpt — Large language models (LLMs) frequently produce contextual hallucinations, where generated content contradicts or ignores information explicitly stated in the prompt. Such errors are particularly problematic in…

LLM Daily – Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs

📝 Article 📄 PDF

LLM Daily – Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within a

2026-01-02 · 10 min · 12.8 MB

Excerpt — Agentic crafting requires LLMs to operate in real-world environments over multiple turns by taking actions, observing outcomes, and iteratively refining artifacts. Despite its importance, the open-source community lacks…

📝 Article 📄 PDF

Listen on Spotify (EN) Copy RSS (EN) Listen on Spotify (FR) Copy RSS (FR)

ArXiv AI: Weekly Top Picks

This week in AI papers

LLM Daily – Defense Against Indirect Prompt Injection via Tool Result Parsing

LLM Daily – Internal Representations as Indicators of Hallucinations in Agent Tool Selection

LLM Daily – Membox: Weaving Topic Continuity into Long-Range Memory for LLM Agents

LLM Daily – HoneyTrap: Deceiving Large Language Model Attackers to Honeypot Traps with Resil

LLM Daily – Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM

LLM Daily – Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for

LLM Daily – LLM Agents for Combinatorial Efficient Frontiers: Investment Portfolio Optimizat

LLM Daily – Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs

LLM Daily – Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within a

Read more

AI Signals Report — Control planes, not just models

Your Bankers Are Ready. Your Bank Isn't.

One Line in Shanghai: What Xi's AI Speech Tells European Banks Betting on Chinese Open Models

Article 50 Goes Live in Five Days — and It Stopped Being a Legal Problem