LLaVA

Posts

  • Jan 30, 2024

    LLaVA-NeXT: Improved reasoning, OCR, and world knowledge

    Apr 30, 2024

    LLaVA-NeXT: A Strong Zero-shot Video Understanding Model

    May 10, 2024

    LLaVA-NeXT: Stronger LLMs Supercharge Multimodal Capabilities in the Wild

    May 25, 2024

    LLaVA-NeXT: What Else Influences Visual Instruction Tuning Beyond Data?

    June 16, 2024

    LLaVA-NeXT: Tackling Multi-image, Video, and 3D in Large Multimodal Models

    Aug 05, 2024

    LLaVA-OneVision: Easy Visual Task Transfer

    Oct 04, 2024

    LLaVA-Video: Video Instruction Tuning with Synthetic Data

    Oct 04, 2024

    LLaVA-Critic: Learning to Evaluate Multimodal Models

subscribe via RSS

LLaVA

  • LLaVA