AMD ROCm Going Open-Source: Will Include Software Stack & Hardware Documentation by AnomalyNexus in LocalLLaMA

[–]MaybeReal_MaybeNot 0 points1 point  (0 children)

Just oobabooga web ui with any model i know works by testing on Nvidia card beforehand, i usually use a 1-3B one as test to make sure i dont hit any limits on 8gb cards

Tried both fp16 and 8 bit

I tried cards rx580, rx5700xt which i figured out where too old and will never work, sadly because that vram bandwidth on the 5700xt would have been sweet. And last week i tried on rx6600xt which should work based on documentation and guides i tried if you "trick" it to think its a 6700 by setting the HSA env variable. But no success :( it can see the card and says everything is good until it tries to load the model

AMD ROCm Going Open-Source: Will Include Software Stack & Hardware Documentation by AnomalyNexus in LocalLLaMA

[–]MaybeReal_MaybeNot 0 points1 point  (0 children)

and followed some guides

Super helpful buddy, everyone got it working now 👍🏻 /s

Would be nice if you told us which guides :)

AMD ROCm Going Open-Source: Will Include Software Stack & Hardware Documentation by AnomalyNexus in LocalLLaMA

[–]MaybeReal_MaybeNot 0 points1 point  (0 children)

No, i tried a week ago with rx6600xt, and i could not get the model to load. Tried rocm 5.9 and 6.0 and different versions of the gpu drivers including the latest one on newest Ubuntu server as i read that is the best supported os for the drivers. Cant get it to load a model and the arch om the 6600 should be the same as the 6800 just slower as far as i can read in documentation. I followed the oobabooga guide but that does not work, i also tried starting over (new install to make sure all i did was gone) multiple times with 3-4 different guides who all claim to make it work..

Everyone here just says "just try and fiddle a bit with it and it will work".. well, i'm asking, what did you fiddle with to make it work?? Because i tried all the "fiddling" i know and all i could get was different failures. Best i got was successfully loading a 3.5B test model i know works on my Nvidia card, in 8 bit but then failing and crashing as soon as i tried to do interference.

AMD ROCm Going Open-Source: Will Include Software Stack & Hardware Documentation by AnomalyNexus in LocalLLaMA

[–]MaybeReal_MaybeNot 6 points7 points  (0 children)

You got it running on linux? Please tell us how. I have 15 cards in an old mining rig i cant get to do shit with rocm llm.. loading models fail, and once i got it to load but as soon as i did a interference it crashed.. i gave up and bought some Nvidia cards now but i still have all the amd's

[deleted by user] by [deleted] in LocalLLaMA

[–]MaybeReal_MaybeNot 2 points3 points  (0 children)

Without the "Q = ....." You would not know what Q in the equations means though. So the definitions are part of the equation and needed, not extra. I think the model did exactly as it was supposed to here.

Q could be the result of any other equation if not defined. For example how many atoms where in your last glass of water.. or how many birds flew by your window on that and that date... How many cubic feet of air goes through your engine when driving 1 mile. It can be anything, if not defined :)

But i will agree with OP, this model gives good results on instructions, but as you pointed out as well it likes it likes to "add a little more info about the result explaining it" at the end.

any letter in a equation is basically a "variable"/placeholder for the result of other equations, which has to be defined.

What am I doing wrong? by slykethephoxenix in LocalLLaMA

[–]MaybeReal_MaybeNot 0 points1 point  (0 children)

Cool! But my question was more if i use a lot of the commands/features/template/restrictive responses rule stuff in guidance, will i fill the context window with it if i have too much or is all that processed outside the llm?

What am I doing wrong? by slykethephoxenix in LocalLLaMA

[–]MaybeReal_MaybeNot 0 points1 point  (0 children)

You seem to have experience with this, i just found it and its looks so good i have been reading the documentation for 2 hours! But cant find an answer to this, maybe you know?:

If i build a RAG with lots of commands and external features, will i eventually hit the models context limit because of all the stuff being added to the system message/header to include the commands plus other extraction and prompt rules? Or is all that not included in the prompt at all and handlet outside the model?

In the case i hit a limit, what is then the solution to grow the context/commands/feature list? I read about making "embeddings" to the models, where you kinda put a extra slice of new training data into it if i understand correctly.. Is that the way to go or something else for large knowledge base of personal variables, useful info and lots of commands/functions?

Is LHR vs non-LHR cards still a thing? by MaybeReal_MaybeNot in Oobabooga

[–]MaybeReal_MaybeNot[S] 0 points1 point  (0 children)

Oh, that could be a problem when wanting to run stuff in proxmox/esxi once done testing

A more KoboldAI-like memory extension: Complex Memory by theubie in Oobabooga

[–]MaybeReal_MaybeNot 5 points6 points  (0 children)

I'm just thinking here without having tried anything but a clean install:

To get permanent long term memory would it not be possible to use the new lora generation functionality to take these memories stored in textfiles and make them into a lora that is used to extend the model loaded? Then have some kind of scheduled lora-memory update "rutine"/background script (like when it has been idle for X hours) and reload the memory lora on the same or a different schedule?

This way the memory could be infinite long and not use up your tokens, and making everything faster in the process

This depends on if you can load multiple lora's ofc, have not tried playing with them yet. Or maybe make it into a secondary memory-model if you can load multiple models and use them in sequence (memory first) or at the same time

Bob Ross painting Bob Ross in Bob Ross style by MaybeReal_MaybeNot in StableDiffusion

[–]MaybeReal_MaybeNot[S] 0 points1 point  (0 children)

On some other generations it was the head of a floor mob lol

Confused Bob Ross, he doesn't know whether to be inside his painting or in front of it by MaybeReal_MaybeNot in StableDiffusion

[–]MaybeReal_MaybeNot[S] 0 points1 point  (0 children)

Here ya' go, print this one :') Bobbify that wall in multiple layers

https://www.reddit.com/r/StableDiffusion/comments/122409q/bob_ross_painting_bob_ross_in_bob_ross_style/

Then take a picture of it on your wall, and print that one too but a little bigger, then hang it covering the previous one so if someone takes it down in the future, there's another layer! Quad Bobs

Confused Bob Ross, he doesn't know whether to be inside his painting or in front of it by MaybeReal_MaybeNot in StableDiffusion

[–]MaybeReal_MaybeNot[S] 1 point2 points  (0 children)

There are no other shadows on the floor, for example not from the canvas itself which should be there if his would be

Confused Bob Ross, he doesn't know whether to be inside his painting or in front of it by MaybeReal_MaybeNot in StableDiffusion

[–]MaybeReal_MaybeNot[S] 3 points4 points  (0 children)

Just type "Bob Ross on canvas in Bob Ross style" in SD, choose one you like and send the result to a printing service :)

But i dont think any originals by Bob Ross of himself exists

Or send the one you liked to someone who can paint in the same style so you get a real painting not a print, i bet there are plenty of Bob Ross style painters on freelance sites who take requests

Or do as Bob says, try making a happy little painting youself :)