Heekyung (Anne) Lee

Hi! I am Heekyung (Anne) Lee. I am an undergraduate student in Computer Science at POSTECH, South Korea. I am also an undergraduate researcher at Berkeley AI Research (BAIR), where I am fortunate to be supervised in Prof. Trevor Darrell’s group.
I am interested in studying Vision and Language. My current research focuses on applications of vision-language models, such as image captioning, visual question answering, and multimodal reasoning.
My recent research interests center on multimodal language models:
- Hallucination in Vision Language Models
∘ What is hallucination in Vision Language Models, and what are effective training recipes to mitigate it? How can we address hallucinations caused by misalignment between image patches and text tokens?
- Real-World Evaluation of Vision Language Models
∘ What is the gap between the performance of VLMs on benchmark datasets and in real-world tasks? How do Vision Language Models respond to complex tasks, and how should we analyze their responses to guide further improvements?
Feel free to check out my publications and projects.
Also, let's be Strava friends! (Follow me here.) I love running XD.
news
May 29, 2025 | 🧩 My first paper is out! Puzzled by Puzzles: When Vision-Language Models Can’t Take a Hint. Follow this 🧵: @visual_puzzles. |
---|---|
May 19, 2025 | I finished my Fall 2024 and Spring 2025 exchange program at the University of California, Berkeley 🐻 |
Apr 18, 2025 | 🔥 Check out our new paper Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospective Resampling! Link to X: @reverse_vlm. |