• Hem
  • Topplistan

John Schulman

TalkRL: The Reinforcement Learning Podcast - En podcast av Robin Ranjit Singh Chauhan

Kategorier:

Teknik

John Schulman, OpenAI cofounder and researcher, inventor of PPO/TRPO talks RL from human feedback, tuning GPT-3 to follow instructions (InstructGPT) and answer long-form questions using the internet (WebGPT), AI alignment, AGI timelines, and more!

Visit the podcast's native language site

  • Alla poddar hos oss
  • Avsnitt
  • Om oss
  • Integritetspolicy
  • Vad Ă€r en podcast?
  • Hur lyssnar man pĂ„ en podd?

© Poddarna.se 2025