Siirry offline-tilaan Player FM avulla!
Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically
Manage episode 414652797 series 3524393
The paper explores inductive bias in transformer models, showing language modeling training leads to hierarchical generalization, supported by pruning experiments and Bayesian analysis.
https://arxiv.org/abs//2404.16367
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1001 jaksoa
Manage episode 414652797 series 3524393
The paper explores inductive bias in transformer models, showing language modeling training leads to hierarchical generalization, supported by pruning experiments and Bayesian analysis.
https://arxiv.org/abs//2404.16367
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1001 jaksoa
All episodes
×Tervetuloa Player FM:n!
Player FM skannaa verkkoa löytääkseen korkealaatuisia podcasteja, joista voit nauttia juuri nyt. Se on paras podcast-sovellus ja toimii Androidilla, iPhonela, ja verkossa. Rekisteröidy sykronoidaksesi tilaukset laitteiden välillä.