ArXiv AI: Weekly Top Picks

Coverage: 2026-02-08 → 2026-02-15

This week in AI papers

We keep an eye on new AI papers on arXiv, pick one or two that really matter each day, and share the key ideas — no hype, just clear explanations.

Unpacked by our trio: Alex the plain-language host, Marc the hands-on power user, and Jamie the senior ML engineer.

LLM Daily – Unknown Attack Detection in IoT Networks using Large Language Models: A Robust,

2026-02-15 · 10 min · 11.9 MB

Excerpt — The rapid evolution of cyberattacks continues to drive the emergence of unknown (zero-day) threats, posing significant challenges for network intrusion detection systems in Internet of Things (IoT) networks. Existing…

LLM Daily – MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context

2026-02-15 · 10 min · 11.0 MB

Excerpt — The evolution of large language models (LLMs) towards applications with ultra-long contexts faces challenges posed by the high computational and memory costs of the Transformer architecture. While existing sparse and…

LLM Daily – CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step

2026-02-13 · 10 min · 23.2 MB

Excerpt — AI agents are increasingly used to solve real-world tasks by reasoning over multi-turn user interactions and invoking external tools. However, applying reinforcement learning to such settings remains difficult:…

LLM Daily – Hidden Licensing Risks in the LLMware Ecosystem

2026-02-12 · 10 min · 20.6 MB

Excerpt — Large Language Models (LLMs) are increasingly integrated into software systems, giving rise to a new class of systems referred to as LLMware. Beyond traditional source-code components, LLMware embeds or interacts with…

LLM Daily – NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

2026-02-11 · 10 min · 23.4 MB

Excerpt — NVIDIA Cosmos Reason 2: Reasoning Vision Language Model for Physical AI. Key highlights: improved spatio-temporal understanding and timestamp precision…

LLM Daily – TraceMem: Weaving Narrative Memory Schemata from User Conversational Traces

2026-02-11 · 10 min · 20.4 MB

Excerpt — Sustaining long-term interactions remains a bottleneck for Large Language Models (LLMs), as their limited context windows struggle to manage dialogue histories that extend over time. Existing memory systems often treat…

LLM Daily – Towards Poisoning Robustness Certification for Natural Language Generation

2026-02-11 · 10 min · 26.0 MB

Excerpt — Understanding the reliability of natural language generation is critical for deploying foundation models in security-sensitive domains. While certified poisoning defenses provide provable robustness bounds for…

LLM Daily – Autonomous Action Runtime Management(AARM):A System Specification for Securing A

2026-02-11 · 10 min · 21.5 MB

Excerpt — As artificial intelligence systems evolve from passive assistants into autonomous agents capable of executing consequential actions, the security boundary shifts from model outputs to tool execution. Traditional…

LLM Daily – Data Science and Technology Towards AGI Part I: Tiered Data Management

2026-02-10 · 10 min · 21.8 MB

Excerpt — The development of artificial intelligence can be viewed as an evolution of data-driven learning paradigms, with successive shifts in data organization and utilization continuously driving advances in model capability.…

LLM Daily – Permissive-Washing in the Open AI Supply Chain: A Large-Scale Audit of License I

2026-02-10 · 10 min · 21.5 MB

Excerpt — Permissive licenses like MIT, Apache-2.0, and BSD-3-Clause dominate open-source AI, signaling that artifacts like models, datasets, and code can be freely used, modified, and redistributed. However, these licenses carry…

LLM Daily – Automatic In-Domain Exemplar Construction and LLM-Based Refinement of Multi-LLM

2026-02-10 · 10 min · 22.0 MB

Excerpt — Query expansion with large language models is promising but often relies on hand-crafted prompts, manually chosen exemplars, or a single LLM, making it non-scalable and sensitive to domain shift. We present an…

LLM Daily – AMEM4Rec: Leveraging Cross-User Similarity for Memory Evolution in Agentic LLM R

2026-02-10 · 10 min · 27.7 MB

Excerpt — Agentic systems powered by Large Language Models (LLMs) have shown strong potential in recommender systems but remain hindered by several challenges. Fine-tuning LLMs is parameter-inefficient, and prompt-based agentic…

LLM Daily – Grounding Generative Planners in Verifiable Logic: A Hybrid Architecture for Tru

2026-02-10 · 10 min · 19.7 MB

Excerpt — Large Language Models (LLMs) show promise as planners for embodied AI, but their stochastic nature lacks formal reasoning, preventing strict safety guarantees for physical deployment. Current approaches often rely on…

LLM Daily – Stateless Yet Not Forgetful: Implicit Memory as a Hidden Channel in LLMs

2026-02-10 · 10 min · 22.9 MB

Excerpt — Large language models (LLMs) are commonly treated as stateless: once an interaction ends, no information is assumed to persist unless it is explicitly stored and re-supplied. We challenge this assumption by introducing…

LLM Daily – Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

2026-02-10 · 10 min · 19.6 MB

Excerpt — Multi-agent LLM systems enable advanced reasoning and tool use via role specialization, yet reliable reinforcement learning (RL) post-training for such systems remains difficult. In this work, we theoretically pinpoint…

LLM Daily – When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Compute

2026-02-10 · 10 min · 20.9 MB

Excerpt — Computer-use agents (CUAs) have made tremendous progress in the past year, yet they still frequently produce misaligned actions that deviate from the user's original intent. Such misaligned actions may arise from…

LLM Daily – Endogenous Resistance to Activation Steering in Language Models

2026-02-09 · 10 min · 21.0 MB

Excerpt — Large language models can resist task-misaligned activation steering during inference, sometimes recovering mid-generation to produce improved responses even when steering remains active. We term this Endogenous…

LLM Daily – PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling

2026-02-08 · 10 min · 19.1 MB

Excerpt — Large language model (LLM)-based multi-agent systems enable expressive agent reasoning but are expensive to scale and poorly calibrated for timestep-aligned state-transition simulation, while classical agent-based…
