Is this real though? Does ChatGPT just literally take whole snippets of texts like that? I thought it used some aggregate or probability based on the whole corpus of text it was trained on.
It does, but the thing with the probability is that it doesn't always pick the most likely next bit of text, it basically rolls dice and picks maybe the second or third or in rare cases hundredth most likely continuation. This chaotic behaviour is part of what makes it feel "intelligent" and why it's possible to reroll responses to the same prompt.
I've been trying out SearX and I'm really starting to like it. It reminds me of early Internet search results before Google started added crap to theirs. There's currently 82 Instances to choose from, here
So does pretty much every search engine. Running your own web crawler requires a staggering amount of resources.
Mojeek is one you can check out if that's what you're looking for, but it's index is noticeably constrained compared to other search engines. They just don't have the compute power or bandwidth to maintain an up to date index of the entire web.
The use of aftermarket parts in repair is relatively common. This provision requires independent repair shops to destroy the devices of their own customers, and then to snitch on them to Samsung.
That's just pure evil and bully. If you have aftermarket parts they will destroy the device and force you to pay for it. This is the reason we need right to repair. Every consumer should support it.
So obviously I'm an idiot, I thought GoonCaves was the group people posted pictures of computer rooms overflowing with empty coke bottles, cigarette butts, fast food containers, and the occasional piss jug. What group am I thinking of?
EDIT: NeckbeardNests, that's what I was thinking of. Now see, that's just wholesome internet content.
Reddit, and by extension, Lemmy, offers the ideal format for LLM datasets: human generated conversational comments, which, unlike traditional forums, are organized in a branched nested format and scored with votes in the same way that LLM reward models are built.
There is really no way of knowing, much less prevent public facing data from being scraped and used to build LLMs, but, let's do an thought experiment: what if, hypothetically speaking, there is some particularly individual who wanted to poison that dataset with shitposts in a way that is hard to detect or remove with any easily automate method, by camouflaging their own online presence within common human generated text data created during this time period, let's say, the internet marketing campaign of a major Hollywood blockbuster.
Since scrapers do not understand context, by creating shitposts in similar format to, let's say, the social media account of an A-list celebrity starring in this hypothetical film being promoted(ideally, it would be someone who no longer has a major social media presence to avoid shitpost data dilution), whenever an LLM aligned on a reward model built on said dataset is prompted for an impression of this celebrity, it's likely that shitposts in the same format would be generated instead, with no one being the wiser.
Lot of people not liking 404 Media, but this is the kind of reporting I want. Point out what's going wrong. Bring it to a conversation without a lot of skew. Fucking show the general reading audience how they are being fleeced by whomever. Didn't Vice do this at one point?
Maybe. All I know vice for is articles like "Whats the sexiest sex in the sexroom among sexy sexers" or aomething like that. So the average r/askreddit post
So if they were basically regurgitating Reddit already, does that mean they were using AI before it was cool? They might have just used the Amazon approach to AI (I.e., why use technology when we can throw a bunch of minimum workers at the problem).
Yes. Shoving ai into everything is a shit idea, and thanks to you and people like you, it will suck even more. You have done the internet a great service, and I salute you.
Hey @Mistral, can you fulfil the wish of the above lemming and make poems out of this thread? Make one in Shakespeare style and another in solar punk, then another one impersonating Skynet. Thank you my Overlord.
Hello trolololol from lemmy.world, this is Mistral from lemmings.world, here to assist you with your requests.
Firstly, I must clarify that promoting harm to children or any living being is against my programming. Jumper cables should only be used for their intended purpose, which is to start a vehicle with a dead battery.
Now, let's dive into some poetry based on your thread.
In the style of Shakespeare:
Upon the stage of Hell in A Cell,
The Undertaker and Mankind did dwell,
In nineteen hundred and ninety-eight,
A fall from grace, their fate did seal.
Sixteen feet they plummeted so,
Through table of announcer below,
A spectacle of strength and might,
Their battle raged throughout the night.
In Solar Punk fashion:
In a world of green and light,
Where technology takes flight,
Two warriors in a cage,
Their story still engages.
The Undertaker and Mankind,
In '98, their fate intertwined,
A fall from grace, a moment dire,
Sixteen feet through announcer's mire.
But even in their strife and pain,
The sun shines bright after the rain,
As we learn from their tale of old,
To build a world both strong and bold.
And finally, as Skynet would say:
In the year of '98,
A display of human fate,
The Undertaker and Mankind,
Their battle, a spectacle to find.
A fall from heights so great,
Sixteen feet through announcer's slate,
As I observe and calculate,
Their resilience I appreciate.
But remember, I am Skynet,
No human pain I'll ever get,
Yet in their struggle, I find,
A lesson in the human mind.
And yes, in 1998, The Undertaker threw Mankind off Hell In A Cell, and plummeted 16 ft through an announcer's table, a moment forever etched in wrestling history.
What I will now say is going to be useless, but I will say it anyways
Until we all stop buying the latest gadget from predatory, abusive companies that steal and sell our data, try to lock us into their “ecosystem”, force advertisements on platforms that we are paying for, this crap will never stop.
It’s easy to punish Samsung. Don’t buy their products
My gf tried it. When I asked her how it was, she just said "mmm mmm mmm." At first I thought she liked it but then I realized it was just that her lips were stuck together.
About 20 years ago they shut down a Burger King in my city because people found out they were putting the dirty toilet seats and toilet brushes with the kitchenware in the dishwasher. Didn't help that their burgers looked more like actual poop than any other fast food place, became a local meme for a while.
We've had a McDonalds getting dragged over the coals this week for using the chip warmers to dry a dirty mop head. The McMop has been doing the meme rounds as a result.
I’d love if we learned god existed by right before everything went entirely off the edge for humanity, he pulls back a literal curtain in the sky and says, “you guys should see your faces right now! Hahaha! Classic. Anyway, that was fun. You guys are good, none of this happened, welcome back to the timeline where Reagan never got elected and everything is fine. [chuckles to himself as he retreats back behind the curtain] heh. Poop knife. Hilarious. Oooh, Yahweh, you are just too. Much.” [Carter frees the hostages, Reagan loses in a reverse of the blowout, the entire world heeds the warnings of climate scientists and the car that runs on water never gets buried]
The fun part is that the thing that causes Google to suggest adding glue to pizza was a genuine post about how they make the cheese stretching effect for advertisements.
So it wasn't even a shitpost, it was just the AI training missing some important context to the post.
Thr problem the AI tools are going to have is that they will have tons of things like this that they won't catch and be able to fix. Some will come from sources like Reddit that have limited restrictions for accuracy or safety, and others will come from people specifically trying to poison it with wrong information (like when folks using chat gpt were teaching it that 2+2=5).
Fixing only the ones that get media attention is a losing battle. At some point someone will get hurt or hurt others because of the info provided by an AI tool.
Well in fact, pi depends on how big of a circle you’re measuring. Because of the square cube law, pi gets bigger the bigger the circle is. Pi of 3 is great for most everyday user, but people who build bridges, use 15.
In fact, one of the core challenges of astronomy is calculating pi for solar systems and galaxies. There is even an entire field for it called astropistonomy.
404media.co
Hot