Outside-Iron-8242

2025-03-16 17:06:45

Kevin Weil (OpenAI CPO) claims AI will surpass humans in competitive coding this year

beatomni

2025-03-03 22:41:27

So…. There’s a limit for Pro plan

Outside-Iron-8242

2025-03-02 23:15:15

Sergey Brin says AGI is within reach if Googlers work 60-hour weeks

Public-Tonight9497

2025-03-02 03:03:48

Useful diagram to consider GPT 4.5

Outside-Iron-8242

2025-02-28 19:24:57

this is what Ilya saw

Glittering-Neck-2505

2025-02-28 00:14:02

With 4.5, the question is can we continue to improve creativity without extraordinary costs - that is being currently worked on

Outside-Iron-8242

2025-02-27 23:05:39

According to LiveBench, 4.5 is the best non-thinking model

Outside-Iron-8242

2025-02-27 23:01:59

LiveBench has GPT-4.5 as the best non-thinking model

Outside-Iron-8242

2025-02-27 22:53:24

o3, which powers Deep Research, is capable of successfully handling 42% of the PR contributions made by OpenAI employees

Tasty-Ad-3753

2025-02-27 20:34:57

OpenAI announcement post seems to imply they might not even serve it in the API long term? $75/million input $150/million output tokens current pricing

Outside-Iron-8242

2025-02-27 20:19:58

the pricing is crazy...

finallyharmony

2025-02-27 02:59:14

Tomorrow will be interesting

arknightstranslate

2025-02-26 19:45:39

anonymous-test passes the common sense test.

No-Sheepherder9789

2025-02-26 19:38:22

Information: GPT-4.5 is coming this week, but its performance on certain tasks has been mixed and worse than Claude 3.7 Sonnet.

giYRW18voCJ0dYPfz21V

2025-02-25 22:25:10

Recent benchmark comparisons for different models on theoretical physics. Advanced models seem to easily solve undergraduate problems, while still struggle with research-level physics.

How long till we see global memory and realtime learning? Where a user can demonstrably prove/correct a mistake (like the 🍓 problem) and the model will integrate that knowledge and no longer make the same mistake with other users?