Similar Tracks
EE837 (Fall 2024): AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
IVY & IVL Lab in KAIST
EE837 (Fall 2024): Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
IVY & IVL Lab in KAIST
EE837 (Fall 2024): MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
IVY & IVL Lab in KAIST
EE837 (Fall 2024): MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts
IVY & IVL Lab in KAIST
EE837 (Fall 2024): GroundingGPT: Language Enhanced Multi-modal Grounding Model
IVY & IVL Lab in KAIST