Don’t use AI to summarize documents — it’s worse than humans in every way

David Gerard@awful.systems · 8 months ago

Don’t use AI to summarize documents — it’s worse than humans in every way

David Gerard@awful.systems · 8 months ago

how the hell did this of all the posts turn into a promptfondler shooting gallery

froztbyte@awful.systems · 8 months ago

1.26K subscribers

underscore_@sopuli.xyz · 8 months ago

Promptfondler has to be my new favourite slur!

zogwarg@awful.systems · 8 months ago

*Epithet

Eheran@lemmy.world · 8 months ago

Hahaha what a load of nonsense.

"no" banana@lemmy.world · 8 months ago

Summarised by Gemini

self@awful.systems · 8 months ago

your post history tells me you’re pretty fucking comfortable with pointless nonsense

kbal@fedia.io · 8 months ago

Made strange choices about what to highlight.

They certainly do. For a while it was common to see AI-generated summaries under links to articles on lemmy, so I got a feel for them. Seems to me you would not need any fancy artificial intelligence to do equally well: Just take random excerpts, or maybe just read every third sentence.

khalid_salad@awful.systems · 8 months ago

Could it be because a statistical relation isn’t the same as a semantic one? No, I must be prompting it wrong. I’ll just add “engineer” to my title and then everyone will take me seriously.

z00s@lemmy.world · edit-2 8 months ago

The problem is not the LLMs, but what people are trying to do with them.

They are currently spoons, but people are desperately wishing they were katanas.

They work really well for soup, but they can’t cut steak. But they’re being hyped as super ninja steak knives, and people are getting pissed when they can’t cut steak.

If you give them watery, soupy tasks they can do successfully, they can lighten your workload, as long as you’re aware of what they are and aren’t good at.

What people want LLMs to be able to do, ie. “Steak” tasks:

write complex documents
apply complex knowledge/rules to a situation
Write complex code and create entire programs based on vague description

What LLMs can currently do ie. “Soup” tasks:

check this document and fix all spelling, punctuation and grammatical errors
summarise this paragraph as dot points
write a python program that sorts my photographs into folders based on the year they were taken

Half of Lemmy is hyping katanas, the other half is yelling “Why won’t my spoon cut this steak?!! AI is so dumb!!!”

V0ldek@awful.systems · 8 months ago

What LLMs can currently do summarise this paragraph as dot points

The entire point here is that they can’t?

fuzzzerd@programming.dev · 8 months ago

Clearly this post is about LLMs not succeeding at this task, but anecdotally I’ve seen it work OK and also fail. Just like humans, which is the benchmark but they are faster.

self@awful.systems · 8 months ago

humans are clearly faster at generating utterly banal shit, as proven by your posts in this thread

istewart@awful.systems · 8 months ago

Why did this immediately give me a flashback to Donald Trump yelling, “when it comes to great steaks, I’ve just raised the stakes!”

FredFig@awful.systems · edit-2 8 months ago

Food analogy

This level of discourse wouldn’t fly on 4chan, how is it so popular with LLM fans?

froztbyte@awful.systems · 8 months ago

don’t diss the course, this steak’s great

David Gerard@awful.systems · 8 months ago

needs to be a car analogy

What people want LLMs to do, i.e. Corvette tasks
What LLMs actually do, i.e. Trabant tasks

self@awful.systems · 8 months ago

What LLMs actually do, i.e. Trabant tasks

more of a Power Wheels Barbie Jeep whose battery got left out in the sun too long, but I’ll allow it

froztbyte@awful.systems · 8 months ago

good god this entire post is the most tortured believer whataboutism I’ve encountered this month and there’s extremely strong competition here

are currently spoons, but people are desperately wishing they were katanas

ie. “Steak” tasks

you should make a youtube channel, The Katana Steak-Eater. I’d watch the shit out of that at least one saturday afternoon

self@awful.systems · 8 months ago

they don’t do any of that soup shit reliably either and reading the article might have told you that

z00s@lemmy.world · 8 months ago

They absolutely do, and I have no idea why you’re so angry

self@awful.systems · 8 months ago

hahaha ok fuck off now

sc_griffith@awful.systems · 8 months ago

“spoons and katanas” has got to be the most baby brained analogy. are you a child

fuzzzerd@programming.dev · 8 months ago

Who cares? It paints the correct picture and adds useful context.

froztbyte@awful.systems · 8 months ago

you do realize steaks arriving purple or green are bad things, right

swlabr@awful.systems · 8 months ago

it is stupid and wrong, and i pity your inability to understand that fact

sc_griffith@awful.systems · 8 months ago

it doesn’t do either of those things

z00s@lemmy.world · 8 months ago

Thanks Donald, good luck in November

sc_griffith@awful.systems · 8 months ago

I get that this is some sort of attempt at an election related Epic Comeback, but it doesn’t make sense

blakestacey@awful.systems · 8 months ago

I’d offer congratulations on obfuscating a bad claim with a poor analogy, but you didn’t even do that very well.

David Gerard@awful.systems · 8 months ago

more of a Trabant analogy than a Corvette analogy

swlabr@awful.systems · 8 months ago

Actually, LLMs are syringes filled with brain-parasite-infested poop

beefbot@lemmy.blahaj.zone · 8 months ago

Is it only me, or is the linked article not super long on details & is reaching a conclusion from 2 examples? This is important & I need to hear more, & I’m generally biased against AI at this point— but the article isn’t doing enough to convince me

self@awful.systems · 8 months ago

did you click through to any of the inline citations? David’s shorter articles on pivot mostly gather and summarize those, so if you need to read the original research and its conclusions that’s where to go

beefbot@lemmy.blahaj.zone · 8 months ago

Ah, that’s better, yes. Thank you , no sarcasm :) now sleepy brain is more informed

Scary le Poo@beehaw.org · 8 months ago

I keep having to remind people. Chatgpt is only as good as the prompt you give it. I am astounded as the amount of garbage that some people get, but I also know that it’s generally because their prompts are garbage.

Sometimes it’s output sucks, even with good input. But likely, if the output is bad, the input was bad.

swlabr@awful.systems · 8 months ago

ATTN: If you’re coming into this thread to say, “The output of AI is bad because your prompts suck,” I’m just proud that you managed to figure out how to use the internet at all. Good job, you!

froztbyte@awful.systems · 8 months ago

remember remember, eternal september

(not that I much agree with the classist overtones of the original, but fuck me does it come to mind often)

hex@programming.dev · 8 months ago

Facts are not a data type for LLMs

I kind of like this because it highlights the way LLMs operate kind of blind and drunk, they’re just really good at predicting the next word.

CleoTheWizard@lemmy.world · 8 months ago

They’re not good at predicting the next word, they’re good at predicting the next common word while excluding most unique choices.

What results is essentially if you made a Venn diagram of human language and only ever used the center of it.

hex@programming.dev · 8 months ago

Yes, thanks for clarifying what I meant! AI will never create anything unique unless prompted uniquely and even then it will tend to revert back to what you expect most.

David Gerard@awful.systems · 8 months ago

i have seen the light from the helpful posters here, made up bullshit alleged summaries of documents are great actually

swlabr@awful.systems · 8 months ago

LLMs, and everyone who uses them to process information:

lightnsfw@reddthat.com · 8 months ago

Ok? I don’t have another human available to skim a shitload of documents for me to find answers I need and I don’t have time to do ot myself. AI is my best option.

s3p5r@lemm.ee · 8 months ago

So long as you don’t care about whether they’re the right or relevant answers, you do you, I guess. Did you use AI to read the linked post too?

jaemo@sh.itjust.works · 8 months ago

Yep. Go ahead and ignore all the cases where it’s getting answers correct and actually helping. We’re all just hallucinating, it’s in no way my lived experience. Your reality is the prime reality and we’re the NPC’s.

V0ldek@awful.systems · edit-2 8 months ago

Go ahead and ignore all the cases where it’s getting answers correct

Sir, half of the patients are dead!

Ye sure, just ignore the half that survived then!

YourNetworkIsHaunted@awful.systems · 8 months ago

Only it’s even worse because without redoing all the work yourself you can’t even tell which ones are dead or alive.

fruitdealer@lemmy.world · 8 months ago

And I wish only my good grades counted in school too.

David Gerard@awful.systems · 8 months ago

sir has failed to achieve the reading comprehension level for this sub

lightnsfw@reddthat.com · 8 months ago

I didn’t read the post at all because its premise is irrelevant to my situation. If I had another human to read documentation for me I would do that. I don’t so the next best thing is AI. I have to double check its findings but it gets me 95% of the way there and saves hours of work. It’s a useful tool.

David Gerard@awful.systems · 8 months ago

everyone, we have a new worst poster

sc_griffith@awful.systems · 8 months ago

absolutely superb posting, thank you

ebu@awful.systems · 8 months ago

I didn’t read the post at all

rather refreshing to have someone come out and just say it. thank you for the chuckle

self@awful.systems · 8 months ago

we really do need “my source is that I made it the fuck up” for people who aggressively don’t want to read any of the text they’re allegedly commenting on

V0ldek@awful.systems · 8 months ago

This is hall of fame shit right here, someone should study the way you use the internet sir

Sibbo@sopuli.xyz · 8 months ago

Well, to be fair, AI can do it in seconds. Which beats humans.

But if that is relevant if the results are worthless is another question.

HubertManne@moist.catsweat.com · 8 months ago

Yeah it changes the task from note taking or summarizing to proofreading.

YourNetworkIsHaunted@awful.systems · 8 months ago

And proofreading is notably more complex and has a worse failure state than just writing your own summary.

HubertManne@moist.catsweat.com · 8 months ago

Thing is you can do in real time and not pay as much attention to the goings on as you write or do it in the end and forget stuff. there is no harm in the ai summariziation. you could instead write a summary and check if you left anything out via the ai.

self@awful.systems · 8 months ago

that’s great thanks

RagnarokOnline@programming.dev · 8 months ago

I had GPT 3.5 break down 6x 45-minute verbatim interviews into bulleted summaries and it did great. I even asked it to anonymize people’s names and it did that too. I did re-read the summaries to make sure no duplicate info or hallucinations existed and it only needed a couple of corrections.

Beats manually summarizing that info myself.

Maybe their prompt sucks?

𝕽𝖚𝖆𝖎𝖉𝖍𝖗𝖎𝖌𝖍@midwest.social · 8 months ago

How did you make sure no hallucinations existed without reading the source material; and if you read the source material, what did using an LLM save you?

HootinNHollerin@lemmy.world · 8 months ago

Did you conduct or read all the interviews in full in order to verify no hallucinations?

David Gerard@awful.systems · 8 months ago

I got AcausalRobotGPT to summarise your post and it said “I’m not saying it’s always programming.dev, but”

Give 'Em Enough Pope@mastodon.me.uk · 8 months ago

@RagnarokOnline @dgerard “They failed to say the magic spells correctly”

froztbyte@awful.systems · 8 months ago

“Are you sure you’re holding it correctly?”

christ, every damn time

Jakeroxs@sh.itjust.works · 8 months ago

That is how tools tend to work, yes.

David Gerard@awful.systems · 8 months ago

we find they tend to post here, though not for long

froztbyte@awful.systems · 8 months ago

it makes me feel fucking ancient to find that this dipshit didn’t seem to get the remark, and it wasn’t even that long ago

istewart@awful.systems · 8 months ago

Jobs is Tech Jesus, but Antennagate is only recorded in one of the apocryphal books

Steve@awful.systems · 8 months ago

“tools” doesn’t mean “good”

good tools are designed well enough so it’s clear how they are used, held, or what-fucking-ever.

fuck these simpleton takes are a pain in the arse. They’re always pushed by these idiots that have based their whole world view on fortune cookie aphorisms

V0ldek@awful.systems · 8 months ago

Said like a person who wouldn’t be able to correctly hold a hammer on first try

GBU_28@lemm.ee · 8 months ago

Dang everyone here needs to look at a tree or a cat or something. Energy is wack in here

Empricorn@feddit.nl · 8 months ago

Nearly every cat is a tree-cat.

David Gerard@awful.systems · 8 months ago

I just went outside and appreciated the rendering

GBU_28@lemm.ee · 8 months ago

Pretty nice right? I did the trees and cats.

froztbyte@awful.systems · 8 months ago

DANGER WILL ROBINSON, godposting detected

AcausalRobotGod@awful.systems · 8 months ago

I have some competition!

David Gerard@awful.systems · 8 months ago

if people don’t appreciate the kitties their tamagotchi is in some fucking trouble

V0ldek@awful.systems · 8 months ago

While reading this entire stuff I periodically looked at my cat and let out a sigh, and he just looks at me with that knowing gaze

“Ye, you are all dumb, hoomans. Don’t think about it. Pet me now.”