robertskmiles

robertskmiles posts

Early Access: 6 New AI Safety Talks

Lots of stuff in the pipeline, but to start with here's a series of 6 new talks from Evan Hubinger (author of Risks From Learned Optimization, the Mesa-Optimizers paper), going through a load of in...

2023-05-10 15:12:29 +0000 UTC

Computerphile Video: ChatGPT

I made a new video with Computerphile, about ChatGPT!

https://www.youtube.com/watch?v=viJt_DXTfwA

L...

2023-02-01 16:44:05 +0000 UTC

Video: Why AI Systems Lie, and What We Can Do About It

New main channel video! I'm still doing these!

This one's about how it's surprisingly hard to make AI systems that consistently tell the truth, and the safety problem that poses.

2022-12-05 20:19:02 +0000 UTC

Podcast Guest Appearance: The Inside View

Michael Trazzi interviewed me for his podcast The Inside View! We talked about what you'd expect I suppose, YouTube, AI, Stampy, all that good stuff. He also gave me a t-shirt?

2022-08-19 22:25:12 +0000 UTC

Second Channel Video: Why Don't I Give Specifics?

You might notice I don't talk about specific AI takeover scenarios, with hacking, bribery, nanotechnology etc. Why is that?

ht...

2022-08-15 00:57:12 +0000 UTC

Behind The Scenes: Unboxing a Gimbal Camera

I got a new camera for filming low-friction videos, and took it to the zoo! It takes a while to get it set up, testing footage at ...

2022-07-19 18:09:25 +0000 UTC

Second Channel/Short Video: Would You Help an AI to Escape?

I'M NOT DEAD! I did get COVID though. But I'm ok now.

I'm still working on the next main channel video, which turned out to be much harder than I expected. In the meantime here is a short vid...

2022-07-07 15:18:13 +0000 UTC

Video: Second Research Talk by John Wentworth

This is a second talk following up the one I posted here: https://www.patreon.com/posts/62888854

This is someho...

2022-02-22 23:58:55 +0000 UTC

Video: Research Talk by John Wentworth

I'm at a research retreat right now with a few independent alignment researchers! The other day John Wentworth presented a talk giving his take on the alignment problem, and we filmed it just to ha...

2022-02-22 06:06:55 +0000 UTC

Early Access Short: Win $50k for Solving a Single AI Problem?

https://youtu.be/HYtJdflujjc
A quick short to let people know that The Alignment Research Center is offering prizes of up to ...

2022-02-08 06:45:01 +0000 UTC

Early Access 2nd Channel Video: There's No Rule That Says We'll Make It

I get frustrated sometimes. A long time ago I made this video rant about it, but the production quality was far too low to publish it so I only released it to patrons. I've now (largely thanks to u...

2022-01-15 13:40:46 +0000 UTC

Bootcamp to learn ML for Alignment

Some friends of mine are running a (free) bootcamp "to bring people interested in AI Alignment up-to-speed with the state of modern ML engineering", and they thought my Patreon would be a quality s...

2021-11-14 21:03:35 +0000 UTC

Video: Experimental TikTok about GitHub Copilot

Who here is on TikTok? I'm experimenting with it a bit, here's an attempt at a video about misalignment and GitHub Copilot!

My impression so far is:
- 3 minutes is a little limiting, but n...

2021-11-13 17:26:43 +0000 UTC

Behind the Scenes: Unboxing a new audio recorder! Zoom F2-BT

I got a new audio recorder and microphone, which works unlike any other I've used!
Thank you for your support :)

2021-10-22 11:48:07 +0000 UTC

Early Access Video: We Were Right! Real Inner Misalignment

Some people ran real versions of the thought experiments from the 'Mesa-Optimisers' videos! What they found won't shock you (if you've been paying attention)

2021-10-08 20:00:03 +0000 UTC

New(ish) Video: Intro to AI Safety, Remastered

I spent the last 3 weeks or so in the USA, doing various things which have given me a lot of video fuel but no actual videos, so here's a remaster of an existing second channel video:

A whil...

2021-06-22 15:36:58 +0000 UTC

Early Access Video: Deceptive Misaligned Mesa-Optimisers? It's More Likely Than You Think...

The previous video explained why it's *possible* for trained models to end up with the wrong goals, even when we specify the goals perfectly. This video explains why it's *likely*.

2021-05-22 20:01:00 +0000 UTC

I'm Looking to Hire a Video Editor!

If you're good at video editing, especially using open source software like Kdenlive and Blender, consider applying, to help me make more videos more quickly! :)

2021-03-27 18:45:57 +0000 UTC

Early Access Video: The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment

This "Alignment" thing turns out to be even harder than we thought.
https://youtu.be/bJLcIBixGj8

Also UPCOMING ...

2021-02-13 20:03:36 +0000 UTC

Early Access Video: Quantilizers: AI That Doesn't Try Too Hard

How do you get an AI system that does better than a human would, without doing anything a human wouldn't?
A follow-up to "Maximizers and Satisficers":

2020-12-05 15:48:51 +0000 UTC

BTS Video: New Camera! Unboxing, Testing, and Modification

My camera died in the middle of shooting the new video! So I had to get a new one - this is that.
Thanks for making it possible for me to just buy new equipment when I need it!

2020-10-05 16:28:15 +0000 UTC

Patreon Exclusive: Discord Server

I'm starting a Discord Server! It's been running for a little bit for patrons at the 'Advisor', 'Hero' and 'Personal thank you in a video' support levels, and now I'm opening it up a little wider, ...

2020-09-22 16:10:01 +0000 UTC

Early Access Video: Sharing the Benefits of AI: The Windfall Clause

AI might create enormous amounts of wealth, but how is it going to be distributed?
https://youtu.be/7i_f4Kbpgn4
Ther...

2020-07-03 14:56:34 +0000 UTC

Announcement: Recording for Computerphile about GPT-3

Just thought I should let patrons know that we've had a go at recording Computerphile remotely, talking about OpenAI's new GPT-3 model.
I wanted to include a photo but there's really nothing ...

2020-06-17 10:40:58 +0000 UTC

Early Access Video: 10 Reasons to Ignore AI Safety

Why do some ignore AI Safety? Let's look at 10 reasons people give (adapted from Stuart Russell's list).
https:...

2020-05-31 15:54:50 +0000 UTC

Answering your AI Safety Questions (Part 2)

Answering the other half of the questions you asked: Moral philosophy, biological intelligence enhancement, nukes, regulating compute clusters, lethal autonomous weapons, etc!

2020-05-07 19:12:16 +0000 UTC

Early Access Video: 9 Examples of Specification Gaming

AI systems do what you say, and it's hard to say exactly what you mean.
Let's look at a list of real life examples of specification gaming!

2020-04-25 20:27:15 +0000 UTC

Patron Exclusive (obviously): Quarantine Haircut

I shot a video, so I had to cut my hair first. For some reason I filmed it, and now you can watch that. This is what we have become. This is The New Normal.
Seriously though this is just me c...

2020-04-25 20:25:26 +0000 UTC

Answering your AI Safety Questions (1)

You asked, I answered :)
I ended up shooting a lot of footage for this, so this video is part 1 of 2.
Let me know if you want to see more Q+A videos in future!

2020-03-10 12:19:16 +0000 UTC

I'm at VidCon London

In case any of you are as well, look me up on the networking app.

2020-02-21 14:53:10 +0000 UTC
