Ahmad Azadzadeh
Ph.D. Student, Faculty of Computer Engineering
Artificial Intelligence
Research areas and interests: Reinforcement Fine-tuning LLMs, LLM Alignment
Membership in scientific societies:
Academic honors and inventions:
Brief description (About): My name is Ahmad Azadzadeh. I’m a first-year Ph.D. student at the Human Language Technology (HLTech) Lab, advised by Dr. Jafarinejad.
My research focuses on advancing Large Reasoning Models through Reinforcement Fine-Tuning techniques and on aligning Large Language Models (LLMs) with human preferences.
During my master’s studies, I explored the intersection of Deep Reinforcement Learning and Meta-Learning, with a particular emphasis on Offline Meta-Reinforcement Learning algorithms.
Phone:
Extension: