Welcome to the Language and Vision Lab (LV-Lab)

The Language and Vision Laboratory (LV-Lab), formerly the Learning and Vision Lab, is a research group focused on advancing the foundations of artificial intelligence through the integrated study of vision, language, and other modalities. In response to the paradigm shift brought by foundation models, our research agenda has expanded beyond single-modality learning toward multimodal representation, reasoning, and generation. We aim to develop general-purpose intelligent systems capable of structured perception, proactive inference, and continual adaptation across complex environments. Our work lies at the intersection of multimodal learning, large-scale model optimization, and cognitive-level decision-making, with the goal of bridging low-level sensory input and high-level intelligence. We welcome collaboration and participation from researchers and students interested in shaping the next generation of AI.

News in LV-Lab

We are looking for self-motivated (IPP) Ph.D./Master's students, research assistants, research fellows, and visiting students and scholars.