Fatemeh Pesaran zadeh
Hello!
I am a 2nd year Master's student at Seoul National University, Computer Science department, Vision & learning lab, advised by Gunhee Kim.
My research interests lie in solving difficult problems in Large Language Models (LLMs) and Vision-Language Models (VLMs) and optimizing them using reinforcement learning.
I have worked on designing challenging multimodal tasks and developing optimization algorithm to solve them.
I completed my Bachelors in Computer Science from Seoul National University in 2022.
CV   / 
Github   / 
Twitter  / 
Pledge
|
|
|
Text2Chart31: Instruction Tuning for Chart Generation
with Automatic Feedback
Fatemeh Pesaran zadeh, Juyeon Kim, Jin-Hwa Kim, Gunhee Kim
EMNLP, 2024
Paper |
Code |
Project Page |
We propose a hierarchical pipeline and a new dataset for chart generation. Moreover, we introducea reinforcement learning-based instruction tuning technique for chart generation tasks without
requiring human feedback.
|
|
mRedditSum: A Multimodal Abstractive Summarization Dataset of Reddit Threads with Images
Keighley Overbay, Jaewoo Ahn*, Fatemeh Pesaran zadeh*, Joonsuk Park, Gunhee Kim
EMNLP, 2023
Paper |
Code |
We present mRedditSum, the first multimodal discussion summarization dataset. It consists of 3,033 discussion threads where a post solicits advice regarding an issue described with an image and text, and respective comments express diverse opinions.
|
|
Anomaly Detection with Surveillance Camera
Report (Korean) |
We propose developing an AI-assisted CCTV system that enhances surveillance with anomaly detection and face tracking, using INNODEP-provided videos.
|
|
Brand Detection
Poster |
Code |
We developed a brand detection model using YOLOv5 to detect brands in South Korea. We collected a dataset of 8.4K images of brands in South Korea and trained a YOLOv5 model to detect them.
|
|