GPT-4.5

cyrano@lemmy.dbzer0.com · 6 months ago

cygnus@lemmy.ca · 6 months ago

Those charts are hilarious: wow, it gives the right answer 62.5% of the time and only makes up completely false answers 37.1% of the time! It’s like Russian roulette, but worse!

olympicyes@lemmy.world · 6 months ago

If you play Russian roulette with two bullets like a real man, then this model is about the same outcome!
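For anyone who wants to check that comparison, here is a minimal sketch. It assumes a standard six-chamber revolver (the thread never says how many chambers) and takes the 37.1% figure from the parent comment as given; `loss_probability` is just an illustrative helper name.

```python
from fractions import Fraction

CHAMBERS = 6  # assumption: standard six-shot revolver, not stated in the thread

def loss_probability(bullets: int, chambers: int = CHAMBERS) -> Fraction:
    """Chance that a single trigger pull lands on a loaded chamber."""
    return Fraction(bullets, chambers)

hallucination_rate = 0.371  # figure quoted in the parent comment

for bullets in (1, 2):
    p = float(loss_probability(bullets))
    print(f"{bullets} bullet(s): {p:.1%} vs. model: {hallucination_rate:.1%}")
```

Under that assumption, two bullets works out to one in three, which is indeed in the same ballpark as the quoted rate.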

BetaDoggo_@lemmy.world · 6 months ago

In their human choice benchmarks it was chosen only 59% of the time over 4o. That’s a 15-20x cost increase for a 9-point difference.
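The arithmetic behind that complaint can be sketched roughly as follows. The 59% and 15-20x figures come straight from the comment; treating 50% as "no preference" is an assumption about how the benchmark is read, and the helper names are hypothetical.

```python
preference_rate = 0.59       # share of comparisons where GPT-4.5 was chosen over 4o (from the comment)
parity = 0.50                # assumption: 50% would mean raters had no preference
cost_multipliers = (15, 20)  # cost range quoted in the comment

def edge_over_parity(rate: float, baseline: float = parity) -> float:
    """Preference edge in percentage points relative to the baseline."""
    return round((rate - baseline) * 100, 1)

def points_per_cost_multiple(rate: float, multiplier: float) -> float:
    """Percentage points of preference gained per 1x of extra cost."""
    return edge_over_parity(rate) / multiplier

print(f"Edge over parity: {edge_over_parity(preference_rate)} points")
for m in cost_multipliers:
    print(f"At {m}x cost: {points_per_cost_multiple(preference_rate, m):.2f} points per cost multiple")
```

Read this way, the "9%" is a gap of nine percentage points above an even split, not a 9% relative improvement.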
