Session 13: DeepSeek
DeepSeek-R1, an open-source large language model developed by DeepSeek AI Lab, has quickly gained traction due to its innovative use of pure Reinforcement Learning (RL) instead of Supervised Fine-Tuning (SFT), enabling autonomous reasoning improvements. With strong performance benchmarks, cost-effective training, and local deployment capabilities, DeepSeek-R1 presents a competitive alternative to larger commercial models while also facing challenges like jailbreaking vulnerabilities.