Agentic systems are increasingly central to high-stakes computing platforms such as AI PCs, robotics, autonomous web interaction, and software maintenance, with their performance largely determined by how effectively they manage memory and context. Enabled by multimodal foundation models, these agents can coordinate human-like reasoning through structured agentic workflows, unlocking powerful capabilities across software development, web and mobile operations, and embodied manipulation. This progress points to the broad potential of multi-agent, multimodal systems to tackle complex real-world challenges. However, realizing this potential at scale remains difficult due to fundamental limits in algorithmic reasoning, memory-driven context understanding, the need for effective test-time training and scaling, and the challenge of deploying agents efficiently across heterogeneous AI hardware where different components must run on distinct compute fabrics.
Understanding and improving multi-modal agentic memory for reasoning capabilities.
Developing scalable and verifiable agentic AI systems across heterogeneous compute platforms with limited compute and memory budget.
Understanding and improving the test-time scaling and reasoning capabilities of multi-modal agentic systems, mixture-of-agents for task scaling.
Pushing the boundaries of real life physical reasoning and planning for agentic AI.
Principled metrics and benchmarks for reasoning, memory, robustness, and efficiency in multimodal agents.
We invite submissions exploring all aspects of multimodal reasoning in AI systems. We welcome novel algorithms, empirical studies, theoretical analyses, position papers, and work-in-progress research that advances our understanding of how AI systems reason across modalities.
Maximum limit of 8 pages, excluding references and appendix. Ideal for complete and thorough work.
Maximum limit of 2 pages with ICML 2026 style files, excluding references and appendix. Ideal for recent work-in progress research.
All deadlines are Anywhere on Earth (AoE) time.
Stanford University
Stanford University
Princeton University
National University of Singapore
University of North Carolina (UNC), Chapel Hill
KAIST
Google DeepMind
GenBio AI, MBZUAI
Intel Labs
Adobe Research
Adobe Research
Northwestern University
Nanyang Technological University
University of Texas, Austin
Rutgers University, GenBio AI
University of Maryland, College Park
We are grateful to our sponsors for supporting this workshop.
Interested in sponsoring? Contact us