Fatemeh Pesaran zadeh

Hello! I am a 2nd year Master's student at Seoul National University, Computer Science department, Vision & learning lab, advised by Gunhee Kim. My research interests lie in solving difficult problems in Large Language Models (LLMs) and Vision-Language Models (VLMs) and optimizing them using reinforcement learning. I have worked on designing challenging multimodal tasks and developing optimization algorithm to solve them. I completed my Bachelors in Computer Science from Seoul National University in 2022.

CV   /  Github   /  Twitter  /  Pledge

profile photo

Publications
Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback
Fatemeh Pesaran zadeh, Juyeon Kim, Jin-Hwa Kim, Gunhee Kim
EMNLP Main, 2024 (Oral Presentation)
Paper | Code | Project Page |

We propose a hierarchical pipeline and a new dataset for chart generation. Moreover, we introducea reinforcement learning-based instruction tuning technique for chart generation tasks without requiring human feedback.

mRedditSum: A Multimodal Abstractive Summarization Dataset of Reddit Threads with Images
Keighley Overbay, Jaewoo Ahn*, Fatemeh Pesaran zadeh*, Joonsuk Park, Gunhee Kim
EMNLP, 2023
Paper | Code |

We present mRedditSum, the first multimodal discussion summarization dataset. It consists of 3,033 discussion threads where a post solicits advice regarding an issue described with an image and text, and respective comments express diverse opinions.


Projects
caricature
Anomaly Detection with Surveillance Camera
Report (Korean) |

We propose developing an AI-assisted CCTV system that enhances surveillance with anomaly detection and face tracking, using INNODEP-provided videos.

caricature
Brand Detection
Poster | Code |

We developed a brand detection model using YOLOv5 to detect brands in South Korea. We collected a dataset of 8.4K images of brands in South Korea and trained a YOLOv5 model to detect them.


Template based on Jon Barron's website.