AI Papers Podcast for 04/14/2024
Manage episode 412571046 series 3568650
AI Papers Podcast for 04/14/2024
Direct Nash Optimization: Teaching Language Models to Self-Improve with
General Preferences: https://arxiv.org/abs/2404.03715
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency
Determines Multimodal Model Performance: https://arxiv.org/abs/2404.04125
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web
Navigating Agent: https://arxiv.org/abs/2404.03648
Stream of Search (SoS): Learning to Search in Language: https://arxiv.org/abs/2404.03683
CantTalkAboutThis: Aligning Language Models to Stay on Topic in
Dialogues: https://arxiv.org/abs/2404.03820
25 jaksoa