LLMs as sounding boards

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 1 year ago

LLMs as sounding boards

moujikman [none/use name]@hexbear.net · 11 months ago

I used to love it as a socratic tutor. All I need it to do is ask reasonable questions to challenge my understanding. But later releases of chatgpt made it too focused on giving you the answer.

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 11 months ago

If your machine is beefy enough, running a local model works pretty well and there are a lot to choose from now tuned for different use cases. I’ve had good luck with using gpt4all as a no hassle app for running this stuff on my machine.

MinekPo1 [it/she]@lemmygrad.ml · 11 months ago

I did this with ChatGPT once and can’t say I had a good time . To be fair , what I was pitching to it was to unusual yet similar for it to grasp it .

So yeah as long as the LLM doesn’t get to overconfident with thinking it knows what you’re talking about , you might be able to get somewhere .

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 11 months ago

Yeah it definitely works for some topics better than others. Also depends a lot on the model and what it was trained on.

queermunist she/her@lemmy.ml · edit-2 1 year ago

That’s why I argue on the internet. I am as likely to convince a poster to change their mind as I am to convince a robot, but they generate such interesting responses that force me to think hard about my own positions. Grinding random encounters in the posting RPG for exp

pinguinu [any]@lemmygrad.ml · 1 year ago

Reddit is the Dark Souls of posting

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 1 year ago

I’ve noticed that as well. A lot of the time you can use the arguing to refine your own idea and just use the other party to provide you with a feedback loop. :)

lil_tank@lemmygrad.ml · 11 months ago

LLMs are way more interesting when talking about coding rather than asking them to generate code. The code generation is janky but if you keep asking questions you might get new directions, learn about concepts you didn’t know etc … It’s great to learn tech stuff

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 11 months ago

Yeah it’s handy for pointing you in the right direction when you don’t know what you need.

loathsome dongeater@lemmygrad.ml · 1 year ago

I read about this on the cursed orange site. Some guy talked about going on a walk with his wireless warplugs on, talking to ChatGPT’s audio interface discussing some world building he was doing.

Are there any LLM services that can be reasonably used without paying? I tried some llamafiles but seems like my laptop cannot handle them well.

lurkerlady [she/her]@hexbear.net · edit-2 11 months ago

seconding gpt4all, makes it quick and easy to run and if youre fancy you can stream the output from your computer to your phone. i run a capybara-hermes-mistral mix but i would suggest starting with mistral instruct until claude3 comes out

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 1 year ago

As long as you don’t care about your inputs being harvested, gemini is free currently. I’ve been using GPT4All to run stuff locally, but if your laptop is having trouble with llamafiles, then it’s probably gonna have trouble with that too.

FuckBigTech347@lemmygrad.ml · edit-2 11 months ago

On the topic of GPT4ALL, I’m curious is there an equivalent of that that but for txt2img/img2img models? All the FOSS txt2img stuff I’ve tried so far is either buggy (some of the projects I tried don’t even compile), require a stupid amount of third party dependencies, are made with NVidia hardware in mind while everyone else is second class or require unspeakable amounts of VRAM.

lurkerlady [she/her]@hexbear.net · edit-2 11 months ago

automatic1111 webui launcher, its stable diffusion. fun fact its icon is a pic of ho chi minh

if you wait, stable diffusion 3 is coming out soon. nvidia will run faster because its tensors are better unfortunately. SD is more ethical than others, you can load up models that are trained only on public art and pics

FuckBigTech347@lemmygrad.ml · 11 months ago

I’m pretty sure I tried that one but it kept running out of VRAM. Also it utilizes proprietary AMD/NVidia software stacks which are a pain to set up. GPT4ALL is a lot better in that regard, they just use Vulkan compute shaders to run the models.

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 11 months ago

There’s also ComfyUI, but the learning curve is a bit steeper https://github.com/comfyanonymous/ComfyUI

although there’s CushyStudio frontend for it that’s more user friendly https://github.com/rvion/CushyStudio

FuckBigTech347@lemmygrad.ml · 11 months ago

ComfyUI seems like the most promising but it also uses ROCm/CUDA which don’t officially support any of my current GPUs (models load successfully but midway through computing it fails). Why can’t everyone just use compute shaders lol.

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 11 months ago

Oh yeah that whole thing is just such a mess, another L for proprietary tech.

loathsome dongeater@lemmygrad.ml · 11 months ago

What model do you run?

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml · 11 months ago

I find I like Wizard 1.2 and Hermes the best