63 likes, 3,471 views

Stanford CS234 | Guest Lecture on DPO: Rafael Rafailov, Archit Sharma, Eric Mitchell | Lecture 9

For more information about Stanford's Artificial Intelligence programs visit: stanford.io/ai

Stanford CS234 Reinforcement Learning | Offline RL 2 and Guest Lecture on Direct Preference Optimization (DPO) by authors:
Rafael Rafailov: arxiv.org/search/cs?searchtype=author&query=Rafail…
Archit Sharma: arxiv.org/search/cs?searchtype=author&query=Sharma…
Eric Mitchell: arxiv.org/search/cs?searchtype=author&query=Mitche…

To follow along with the course, visit the course website:
web.stanford.edu/class/cs234/

Emma Brunskill
Associate Professor of Computer Science at Stanford University
cs.stanford.edu/people/ebrun/

Learn more about the course and how to enroll: online.stanford.edu/courses/cs234-reinforcement-le…

To view all online courses and programs offered by Stanford, visit: online.stanford.edu/
