Just putting that out there. While we might have struggle sessions over bullshit, the larger internet zeitgeist is putrid and rancid.
Just putting that out there. While we might have struggle sessions over bullshit, the larger internet zeitgeist is putrid and rancid.
I feel like eventually we’ll see an “evolution” of LLMs where the big innovation will be cutting 90% of the Internet out of the training data without breaking the whole thing. Imagine if LLM output was as dry, neutral, and reliable as the average encyclopedia (yes I know those aren’t perfect either but it’s an improvement over reddit threads at least).
I don’t know if it’ll be framed as an innovation, per se, but that’s going to be the main utility for this technology. Small, focused models that can help you turn a large amount of pre-qualified data into something usable. That would be pretty cool. Wasn’t ever going to be anything more than that, but we’ll have to watch a trillion dollar market bubble pop before people start to narrow their ambitions and actually make something useful out of these things.
It’s such a depressingly stupid time to waste a bunch of information and digital tech so you can have Racist Google instead of what it was 20 years ago. We’re on the cusp of environmental changes brought about by wasting resources. That they built a giant wasteful bubble is the least surprising part of their behavior.
reddit was actually good for a bunch of how-to kinda shit that would never be in an encyclopedia. the trick is sifting the “hey you might have a carbon monoxide leak” from the “it’s cool to throw car batteries into the sea”