Posts
LLaVA-NeXT: Improved reasoning, OCR, and world knowledge
LLaVA-NeXT: A Strong Zero-shot Video Understanding Model
LLaVA-NeXT: Stronger LLMs Supercharge Multimodal Capabilities in the Wild
LLaVA-NeXT: What Else Influences Visual Instruction Tuning Beyond Data?
LLaVA-NeXT: Tackling Multi-image, Video, and 3D in Large Multimodal Models
LLaVA-OneVision: Easy Visual Task Transfer
LLaVA-Video: Video Instruction Tuning with Synthetic Data
LLaVA-Critic: Learning to Evaluate Multimodal Models