Building the way to intelligence.

Advancing multimodal intelligence through open research. Models, data, and insights, shared as we discover.

Featured Research
OneVision Encoder: The First HEVC-Style Vision Transformer with Advanced Multimodal Capabilities
JAN 15, 2026

A vision transformer that resolves the frame-computation trade-off using HEVC video compression principles, achieving state-of-the-art results on video and image benchmarks.
Latest Publications
[01]
LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training
SEP 2025
[02]
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
NOV 2025
[03]
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
NOV 2025