DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking Post date November 14, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs Post date November 14, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
From Charts to Code: A Hierarchical Benchmark for Multimodal Models Post date November 14, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
ColorAgent: Building A Robust, Personalized, and Interactive OS Agent Post date November 14, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
OmniNWM: Omniscient Driving Navigation World Models Post date November 14, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
FinSight: Towards Real-World Financial Deep Research Post date November 13, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
olmOCR 2: Unit Test Rewards for Document OCR Post date November 13, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
When Correct Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents? Post date November 13, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold Post date November 12, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Predicting the Unpredictable: Reproducible BiLSTM Forecasting of Incident Counts in the Global Terrorism Database (GTD) Post date November 12, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis Post date November 12, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Chem-R: Learning to Reason as a Chemist Post date November 11, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
On Non-interactive Evaluation of Animal Communication Translators Post date November 11, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Foundational Automatic Evaluators: Scaling Multi-Task Generative Evaluator Training for Reasoning-Centric Domains Post date November 11, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Knowledge-based Visual Question Answer with Multimodal Processing, Retrieval and Filtering Post date November 11, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Embody 3D: A Large-scale Multimodal Motion and Behavior Dataset Post date November 10, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense Post date November 10, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Constantly Improving Image Models Need Constantly Improving Benchmarks Post date November 10, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI Post date November 10, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
ConsistEdit: Highly Consistent and Precise Training-free Visual Editing Post date November 10, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling Post date November 10, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Paper2Web: Let’s Make Your Paper Alive! Post date November 9, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Rewiring Experts on the Fly: Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models Post date November 9, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
DLER: Doing Length pEnalty Right – Incentivizing More Intelligence per Token via Reinforcement Learning Post date November 9, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning Post date November 9, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning Post date November 8, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training Post date November 7, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Agentic Design of Compositional Machines Post date November 7, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild Post date November 7, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures Post date November 7, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models Post date November 7, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Attention Is All You Need for KV Cache in Diffusion LLMs Post date November 4, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Post date November 4, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
AI for Service: Proactive Assistance with AI Glasses Post date November 4, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training Post date November 3, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs Post date November 2, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
FlashWorld: High-quality 3D Scene Generation within Seconds Post date November 2, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
MultiCOIN: Multi-Modal COntrollable Video INbetweening Post date November 2, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model Post date November 2, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment Post date November 1, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing Post date November 1, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Graph Diffusion Transformers are In-Context Molecular Designers Post date November 1, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding Post date November 1, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting Post date October 31, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
GIR-Bench: Versatile Benchmark for Generating Images with Reasoning Post date October 31, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark for Evaluating LLMs Post date October 31, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Demystifying Reinforcement Learning in Agentic Reasoning Post date October 31, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Instant4D: 4D Gaussian Splatting in Minutes Post date October 30, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Formalizing Style in Personal Narratives Post date October 30, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare Post date October 29, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment Post date October 28, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks Post date October 27, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling Post date October 26, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards Post date October 26, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints Post date October 26, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution Post date October 26, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning Post date October 21, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning
MemMamba: Rethinking Memory Patterns in State Space Model Post date October 20, 2025 Post author By Paperium Post categories In ai, computerscience, deeplearning, machinelearning