A Self-Replicating GPT-4!
Arkistoidut sarjat ("Toimeton syöte" status)
When? This feed was archived on February 03, 2024 00:52 (). Last successful fetch was on March 31, 2023 18:30 ()
Why? Toimeton syöte status. Palvelimemme eivät voineet hakea voimassa olevaa podcast-syötettä tietyltä ajanjaksolta.
What now? You might be able to find a more up-to-date version using the search function. This series will no longer be checked for updates. If you believe this to be in error, please check if the publisher's feed link below is valid and contact support to request the feed be restored or if you have any other concerns about this.
Manage episode 358054925 series 3406958
In this week's MLAISU, we're covering the latest technical safety developments with GPT-4, looking at Anthropic's safety strategy, and covering the fascinating Japanese alignment conference!
- Join our Discord! https://ais.pub/discord
- Join the AI governance hackathon! https://ais.pub/aigov
- Check out the university job opportunities: https://ais.pub/opportunities
Sources
- Japanese alignment conference 2023: https://jac2023.ai/
- Recordings from JAC2023: https://vimeo.com/user196160056
- GPT-4 released: https://openai.com/product/gpt-4
- GPT-4 technical report: https://cdn.openai.com/papers/gpt-4.pdf
- Developer demo: https://youtu.be/outcGtbnMuQ
- Inverse scaling: https://www.lesswrong.com/posts/eqxqgFxymP8hXDTt5/announcing-the-inverse-scaling-prize-usd250k-prize-pool
- IQ score: https://twitter.com/DanHendrycks/status/1635706827215339520
- Why uncontrollable AI seems like a larger risk than ever: https://time.com/6258483/uncontrollable-ai-agi-risks/
- Is power-seeking AI an existential risk? https://arxiv.org/abs/2206.13353
- OpenAI evals: https://github.com/openai/evals
- Anthropic's AI safety views: https://www.anthropic.com/index/core-views-on-ai-safety
- Anthropic releasing Claude: https://www.anthropic.com/index/introducing-claude
- Constitutional AI: https://scale.com/blog/chatgpt-vs-claude#What%20is%20%E2%80%9CConstitutional%20AI%E2%80%9D?
- Palm API opened up: https://developers.googleblog.com/2023/03/announcing-palm-api-and-makersuite.html
- Attention is all you need: https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf
- JAC recordings: https://vimeo.com/user196160056
- Factored cognition: https://primer.ought.org/
25 jaksoa