We track new AI papers on arXiv, pick the one or two that matter most each day, and explain their key ideas clearly, without hype.
2026-02-15 · 10 min · 11.9 MB
Excerpt — The rapid evolution of cyberattacks continues to drive the emergence of unknown (zero-day) threats, posing significant challenges for network intrusion detection systems in Internet of Things (IoT) networks. Existing…
2026-02-15 · 10 min · 11.0 MB
Excerpt — The evolution of large language models (LLMs) towards applications with ultra-long contexts faces challenges posed by the high computational and memory costs of the Transformer architecture. While existing sparse and…
2026-02-13 · 10 min · 23.2 MB
Excerpt — AI agents are increasingly used to solve real-world tasks by reasoning over multi-turn user interactions and invoking external tools. However, applying reinforcement learning to such settings remains difficult:…
2026-02-12 · 10 min · 20.6 MB
Excerpt — Large Language Models (LLMs) are increasingly integrated into software systems, giving rise to a new class of systems referred to as LLMware. Beyond traditional source-code components, LLMware embeds or interacts with…
2026-02-11 · 10 min · 23.4 MB
Excerpt — NVIDIA Cosmos Reason 2 Brings Advanced Reasoning to Physical AI. NVIDIA Cosmos Reason 2: Reasoning Vision Language Model for Physical AI. Key highlights: improved spatio-temporal understanding and timestamp precision; …
2026-02-11 · 10 min · 20.4 MB
Excerpt — Sustaining long-term interactions remains a bottleneck for Large Language Models (LLMs), as their limited context windows struggle to manage dialogue histories that extend over time. Existing memory systems often treat…
2026-02-11 · 10 min · 26.0 MB
Excerpt — Understanding the reliability of natural language generation is critical for deploying foundation models in security-sensitive domains. While certified poisoning defenses provide provable robustness bounds for…
2026-02-11 · 10 min · 21.5 MB
Excerpt — As artificial intelligence systems evolve from passive assistants into autonomous agents capable of executing consequential actions, the security boundary shifts from model outputs to tool execution. Traditional…
2026-02-10 · 10 min · 21.8 MB
Excerpt — The development of artificial intelligence can be viewed as an evolution of data-driven learning paradigms, with successive shifts in data organization and utilization continuously driving advances in model capability.…
2026-02-10 · 10 min · 21.5 MB
Excerpt — Permissive licenses like MIT, Apache-2.0, and BSD-3-Clause dominate open-source AI, signaling that artifacts like models, datasets, and code can be freely used, modified, and redistributed. However, these licenses carry…
2026-02-10 · 10 min · 22.0 MB
Excerpt — Query expansion with large language models is promising but often relies on hand-crafted prompts, manually chosen exemplars, or a single LLM, making it non-scalable and sensitive to domain shift. We present an…
2026-02-10 · 10 min · 27.7 MB
Excerpt — Agentic systems powered by Large Language Models (LLMs) have shown strong potential in recommender systems but remain hindered by several challenges. Fine-tuning LLMs is parameter-inefficient, and prompt-based agentic…
2026-02-10 · 10 min · 19.7 MB
Excerpt — Large Language Models (LLMs) show promise as planners for embodied AI, but their stochastic nature lacks formal reasoning, preventing strict safety guarantees for physical deployment. Current approaches often rely on…
2026-02-10 · 10 min · 22.9 MB
Excerpt — Large language models (LLMs) are commonly treated as stateless: once an interaction ends, no information is assumed to persist unless it is explicitly stored and re-supplied. We challenge this assumption by introducing…
2026-02-10 · 10 min · 19.6 MB
Excerpt — Multi-agent LLM systems enable advanced reasoning and tool use via role specialization, yet reliable reinforcement learning (RL) post-training for such systems remains difficult. In this work, we theoretically pinpoint…
2026-02-10 · 10 min · 20.9 MB
Excerpt — Computer-use agents (CUAs) have made tremendous progress in the past year, yet they still frequently produce misaligned actions that deviate from the user's original intent. Such misaligned actions may arise from…
2026-02-09 · 10 min · 21.0 MB
Excerpt — Large language models can resist task-misaligned activation steering during inference, sometimes recovering mid-generation to produce improved responses even when steering remains active. We term this Endogenous…
2026-02-08 · 10 min · 19.1 MB
Excerpt — Large language model (LLM)-based multi-agent systems enable expressive agent reasoning but are expensive to scale and poorly calibrated for timestep-aligned state-transition simulation, while classical agent-based…