LLaVA

Posts

Jan 30, 2024
LLaVA-NeXT: Improved reasoning, OCR, and world knowledge
Apr 30, 2024
LLaVA-NeXT: A Strong Zero-shot Video Understanding Model
May 10, 2024
LLaVA-NeXT: Stronger LLMs Supercharge Multimodal Capabilities in the Wild
May 25, 2024
LLaVA-NeXT: What Else Influences Visual Instruction Tuning Beyond Data?
June 16, 2024
LLaVA-NeXT: Tackling Multi-image, Video, and 3D in Large Multimodal Models
Aug 05, 2024
LLaVA-OneVision: Easy Visual Task Transfer
Oct 04, 2024
LLaVA-Video: Video Instruction Tuning with Synthetic Data
Oct 04, 2024
LLaVA-Critic: Learning to Evaluate Multimodal Models