Using the lemmyverse user generated content to train AI

EffortlessOps@sh.itjust.works · 9 months ago

Using the lemmyverse user generated content to train AI

AlligatorBlizzard@sh.itjust.works · 9 months ago

Judging by the kind of content we have on the fedi, I can’t wait to see AI sying stuff eat the rich, Blahaj is so cuuuuuuuuttte ewewewew, There is no OS but GNU and Stallman is the prophet, Capitalism is the problem, we need to re-establish the proletariate dictatorship would at least be fun.

If someone did create an LLM using fedi content and let it loose in the comments, I wonder how long it would take for people to realize it’s a bot? I’m sure not flagging it as a bot is a violation of most instances rules, and it existing would probably upset some people, but it’s still a fun question.

[email protected]@sh.itjust.works · 9 months ago

No one would notice. At worst, people would accuse it of trolling as it doubles down on factual inaccuracies. It may, and I say this without any irony, already be here and blending in. Paper books are the future.

Draconic NEO@sh.itjust.works · 9 months ago

Paper books are the future.

As if paper books can’t contain garbage and misinformation oh wait (article has link to amazon page which contains listing that has option for paperback).

[email protected]@sh.itjust.works · 9 months ago

Cool. Not remotely what I meant, but I do sincerely enjoy a good nitpick.

Draconic NEO@sh.itjust.works · 9 months ago

Oh, I didn’t exactly understand what you meant 😅

cosmic_skillet@lemmy.ml · 9 months ago

We’re going to get a weird feedback loop soon where future AI is going to be trained on posts created by current AI, eventually poisoning the well of trainable content