o1 can 'self-correct'. That's kinda significant.

Added 2024-10-03 12:42:49 +0000 UTC

Comments

2024-11-11 05:57:55 +0000 UTC

2024-10-31 01:02:02 +0000 UTC

GOLDEN AMERICAN

2024-10-30 18:05:12 +0000 UTC

2024-10-11 01:50:34 +0000 UTC

Daniel Henderson

2024-10-09 22:19:26 +0000 UTC

Eddine Maiza

2024-10-07 14:59:02 +0000 UTC

2024-10-07 12:05:56 +0000 UTC

2024-10-07 12:03:12 +0000 UTC

2024-10-07 12:02:51 +0000 UTC

2024-10-05 16:12:58 +0000 UTC

2024-10-05 06:11:00 +0000 UTC

2024-10-05 05:09:33 +0000 UTC

2024-10-05 04:28:10 +0000 UTC

Eddine Maiza

2024-10-04 23:03:50 +0000 UTC

Eddine Maiza

2024-10-04 22:56:50 +0000 UTC

Jason Tangen

2024-10-04 19:41:13 +0000 UTC

2024-10-04 02:38:34 +0000 UTC

Steve DeMoss

2024-10-03 23:49:31 +0000 UTC

2024-10-03 19:19:59 +0000 UTC

2024-10-03 17:07:14 +0000 UTC

David Shapiro

2024-10-03 16:49:44 +0000 UTC

2024-10-03 16:21:18 +0000 UTC

John Merkowsky

2024-10-03 15:55:21 +0000 UTC

Jonathan Kirk

2024-10-03 15:28:51 +0000 UTC

2024-10-03 14:55:20 +0000 UTC

Joshua Davis

2024-10-03 14:29:09 +0000 UTC

2024-10-03 14:27:28 +0000 UTC

Barnaby Golden

2024-10-03 14:13:59 +0000 UTC

2024-10-03 13:45:08 +0000 UTC

2024-10-03 13:45:00 +0000 UTC

Joshua Davis

2024-10-03 13:40:54 +0000 UTC

2024-10-03 13:13:11 +0000 UTC

More Creators

marcelinhofeet

jrenaegaming

boudoir_noir

Milena Cipriano

Low End University

billionslove

Stiletto Bella

pixeledasteroid

KyokaSuigetsu

o1 can 'self-correct'. That's kinda significant.

Comments

Thank you 🙏 Put lots of love and care an AI into it.

@enrico love your GitHub project !!!

This makes me think of a cache hierarchy of LLM calls. It feels like there's a lot of room to engineer here, and I'm curious what the limit of optimizing and deploying a system like this at scale could be.

Well said

It does, many congrats Bob, keep us updated. And thank you.

Great points

Yes, stumbled. We could've probed the search space for 2 decades and nobody would've batted an eye. How phenomenal it is that we are finding these easily implementable multipliers to scaling compute so rapidly after one another should not be understated.

The new untapped scaling demention is pretty exciting, and underhyped imo (an AI rarity lol).

Timing just before a major capital raise is probably not a coincidence. Whether there were concerns around quality or safety, and if these were justified, is hard to know. I guess it is the CEO’s job to make that final call.

Stumbled? That seems unfair.

Your content is unmatched. Keep on keeping on!

I guess we are slowly getting to see what Ilya famously saw in July 2023. At the time it seemed to scare him into alignment research so I wonder what the internal viewpoint is on just how far this can scale.

Major milestone, no doubt!

In hindsight, none of this is surprising. Good to hear that OpenAI basically stumbled on the solution "Just let it think longer" as the answer.

This decade can't get more interesting. Thanks as always Philip for what you do.

My understanding is it rewards the model for successful chain of thought so it learns to make more intelligent chains of thought enabling it to solve harder problems. Basically the former of the two situations you presented.

I echo this, definitely set apart from the AI Bro crowd. I hope you build the Patreon following you deserve Philip @AI_Explained

What's not clear to me is whether training on successful chain-of-thought examples teaches the models how to do chain-of-thought in general or if it just gives them templates for chain-of-thought.

Thanks so much prefer!

There are definitely mixed opinions on this within OpenAI, agreed. See: https://scottaaronson.blog/?p=8367

Man I love your content! I am at a point now, where every other youtuber I follow feels like gramophones, parroting AI news, but you always go deeper and have insights no one else has. Thanks Philip!

More Creators