For more information about Stanford's Artificial Intelligence programs visit: stanford.io/ai
Stanford CS234 Reinforcement Learning I Offline RL 2 and Guest Lecture on Direct Preference Optimization (DPO) by authors:
Rafael Rafailov: arxiv.org/search/cs?searchtype=author&query=Rafail…
Archit Sharma: arxiv.org/search/cs?searchtype=author&query=Sharma…
Eric Mitchell: arxiv.org/search/cs?searchtype=author&query=Mitche…
To follow along with the course, visit the course website:
web.stanford.edu/class/cs234/
Emma Brunskill
Associate Professor of Computer Science at Stanford University
cs.stanford.edu/people/ebrun/
Learn more about the course and how to enroll: online.stanford.edu/courses/cs234-reinforcement-le…
To view all online courses and programs offered by Stanford, visit: online.stanford.edu/
コメント