Lots of stuff in the pipeline, but to start with, here's a series of 6 new talks from Evan Hubinger (author of Risks From Learned Optimization, the Mesa-Optimizers paper), going through a load of in...
2023-05-10 15:12:29 +0000 UTC
I made a new video with Computerphile, about ChatGPT!
https://www.youtube.com/watch?v=viJt_DXTfwA
L...
2023-02-01 16:44:05 +0000 UTC
New main channel video! I'm still doing these!
This one's about how it's surprisingly hard to make AI systems that consistently tell the truth, and the safety problem that poses.
2022-12-05 20:19:02 +0000 UTC
Michaël Trazzi interviewed me for his podcast The Inside View! We talked about what you'd expect, I suppose: YouTube, AI, Stampy, all that good stuff. He also gave me a t-shirt?
2022-08-19 22:25:12 +0000 UTC
You might notice I don't talk about specific AI takeover scenarios, with hacking, bribery, nanotechnology etc. Why is that?
ht...
2022-08-15 00:57:12 +0000 UTC
I got a new camera for filming low-friction videos, and took it to the zoo! It takes a while to get it set up, testing footage at
2022-07-19 18:09:25 +0000 UTC
I'M NOT DEAD! I did get COVID though. But I'm ok now.
I'm still working on the next main channel video, which turned out to be much harder than I expected. In the meantime here is a short vid...
2022-07-07 15:18:13 +0000 UTC
This is a second talk following up the one I posted here: https://www.patreon.com/posts/62888854
This is someho...
2022-02-22 23:58:55 +0000 UTC
I'm at a research retreat right now with a few independent alignment researchers! The other day John Wentworth presented a talk giving his take on the alignment problem, and we filmed it just to ha...
2022-02-22 06:06:55 +0000 UTC
https://youtu.be/HYtJdflujjc
A quick short to let people know that The Alignment Research Center is offering prizes of up to ...
2022-02-08 06:45:01 +0000 UTC
I get frustrated sometimes. A long time ago I made this video rant about it, but the production quality was far too low to publish, so I only released it to patrons. I've now (largely thanks to u...
2022-01-15 13:40:46 +0000 UTC
Some friends of mine are running a (free) bootcamp "to bring people interested in AI Alignment up-to-speed with the state of modern ML engineering", and they thought my Patreon would be a quality s...
2021-11-14 21:03:35 +0000 UTC
Who here is on TikTok? I'm experimenting with it a bit, here's an attempt at a video about misalignment and GitHub Copilot!
My impression so far is:
- 3 minutes is a little limiting, but n...
2021-11-13 17:26:43 +0000 UTC
I got a new audio recorder and microphone, which works unlike any other I've used!
Thank you for your support :)
2021-10-22 11:48:07 +0000 UTC
Some people ran real versions of the thought experiments from the 'Mesa-Optimisers' videos! What they found won't shock you (if you've been paying attention)
2021-10-08 20:00:03 +0000 UTC
I spent the last 3 weeks or so in the USA, doing various things which have given me a lot of video fuel but no actual videos, so here's a remaster of an existing second channel video:
A whil...
2021-06-22 15:36:58 +0000 UTC
The previous video explained why it's *possible* for trained models to end up with the wrong goals, even when we specify the goals perfectly. This video explains why it's *likely*.
2021-05-22 20:01:00 +0000 UTC
If you're good at video editing, especially using open source software like Kdenlive and Blender, consider applying to help me make more videos more quickly! :)
2021-03-27 18:45:57 +0000 UTC
This "Alignment" thing turns out to be even harder than we thought.
https://youtu.be/bJLcIBixGj8
Also UPCOMING ...
2021-02-13 20:03:36 +0000 UTC
How do you get an AI system that does better than a human would, without doing anything a human wouldn't?
A follow-up to "Maximizers and Satisficers":
2020-12-05 15:48:51 +0000 UTC
My camera died in the middle of shooting the new video! So I had to get a new one - this is that.
Thanks for making it possible for me to just buy new equipment when I need it!
2020-10-05 16:28:15 +0000 UTC
I'm starting a Discord Server! It's been running for a little bit for patrons at the 'Advisor', 'Hero' and 'Personal thank you in a video' support levels, and now I'm opening it up a little wider, ...
2020-09-22 16:10:01 +0000 UTC
AI might create enormous amounts of wealth, but how is it going to be distributed?
https://youtu.be/7i_f4Kbpgn4
Ther...
2020-07-03 14:56:34 +0000 UTC
Just thought I should let patrons know that we've had a go at recording Computerphile remotely, talking about OpenAI's new GPT-3 model.
I wanted to include a photo but there's really nothing ...
2020-06-17 10:40:58 +0000 UTC
Why do some ignore AI Safety? Let's look at 10 reasons people give (adapted from Stuart Russell's list).
https:...
2020-05-31 15:54:50 +0000 UTC
Answering the other half of the questions you asked: Moral philosophy, biological intelligence enhancement, nukes, regulating compute clusters, lethal autonomous weapons, etc!
2020-05-07 19:12:16 +0000 UTC
AI systems do what you say, and it's hard to say exactly what you mean.
Let's look at a list of real life examples of specification gaming!
2020-04-25 20:27:15 +0000 UTC
I shot a video, so I had to cut my hair first. For some reason I filmed it, and now you can watch that. This is what we have become. This is The New Normal.
Seriously though, this is just me c...
2020-04-25 20:25:26 +0000 UTC
You asked, I answered :)
I ended up shooting a lot of footage for this, so this video is part 1 of 2.
Let me know if you want to see more Q+A videos in future!
2020-03-10 12:19:16 +0000 UTC
In case any of you are as well, look me up on the networking app
2020-02-21 14:53:10 +0000 UTC