Lemmy.one
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
David Gerard@awful.systemsM to TechTakes@awful.systemsEnglish · 4 days ago

Oxford pretends AI benchmarks are science, not marketing

pivot-to-ai.com

external-link
message-square
5
fedilink
17
external-link

Oxford pretends AI benchmarks are science, not marketing

pivot-to-ai.com

David Gerard@awful.systemsM to TechTakes@awful.systemsEnglish · 4 days ago
message-square
5
fedilink
Chatbot vendors routinely make up a new benchmark, then brag how well their hot new chatbot does on it. Like that time OpenAI’s o3 model trounced the FrontierMath benchmark, and it’s just a coincid…

How could all these benchmarks be fake, it’s a mystery

https://www.youtube.com/watch?v=KcYZN6sTZjQ&list=UU9rJrMVgcXTfa8xuMnbhAEA - video
https://pivottoai.libsyn.com/20251106-oxford-pretends-ai-benchmarks-are-science-not-marketing - podcast

time: 6 min 16 sec

  • o7___o7@awful.systems
    link
    fedilink
    English
    arrow-up
    1
    ·
    3 days ago

    unending scream

TechTakes@awful.systems

techtakes@awful.systems

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !techtakes@awful.systems

Big brain tech dude got yet another clueless take over at HackerNews etc? Here’s the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 111 users / day
  • 413 users / week
  • 1.41K users / month
  • 4.77K users / 6 months
  • 10 local subscribers
  • 2.29K subscribers
  • 1.12K Posts
  • 31.4K Comments
  • Modlog
  • mods:
  • David Gerard@awful.systems
  • BE: 0.19.7
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org