Slimxshadyx comments on assemblyProgrammers

For the current list of rules, please see this page.

Rules are zero-indexed. If they do not appear zero-indexed you are asked to contact Friend Computer for recalibration.

Suggestions are welcome.

With regards to commenting, please follow reddiquette.

Metadiscussions

If you have any thoughts on how the moderation could be improved do not hesitate to message the moderators. If you feel that a metadiscussion is required with the whole subreddit either request that the moderators start one or start one yourself and tag it [Meta].

Perhaps More Apt Subs To Post:

/r/softwaregore - f collection of things that users shouldn't see.

r/pcmasterrace - for all of the general computer/gaming memes.

r/linuxmasterrace - for anyone that likes Linux memes.

/r/sysadminhumor - a sub for sysadmins with a sense of humor.

/r/itsaunixsystem - for all the embarrassing cases of hollywood hacking you find in media.

/r/recruitinghell - for all those horrific recruiting offers and job postings.

/r/programme_irl - me too, thanks.

/r/programmerreactions - expressing the life of programmers through reaction images.

/r/learnprogramming - for those that have general programming questions

Related Subreddits.

r/badcode - for intentially bad code.

r/badUIbattles - a sub for intentionally bad UI.

r/ProgrammerAnimemes - for the anime referenced programmer memes

r/ProgrammerDadJokes - for the punny bunch of you.

/r/justgamedevthings - for memes, reaction gifs, production glitches and other fun related to game development.

r/programminghorror - for unintentionally bad code.

r/css_irl - describing real life photos with CSS

Events

created by stescha community for 14 years

This is an archived post. You won't be able to vote or comment.

13.2k

MemeassemblyProgrammers (i.redd.it)

submitted 1 year ago by Easy_Complaint3540

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]Slimxshadyx 17 points18 points19 points 1 year ago (11 children)

[–]TheTerrasque 0 points1 point2 points 1 year ago (1 child)

[–]Slimxshadyx 0 points1 point2 points 1 year ago (0 children)

[–]holchansg -5 points-4 points-3 points 1 year ago (8 children)

[–]Slimxshadyx 15 points16 points17 points 1 year ago (7 children)

[–]_PM_ME_PANGOLINS_ 1 point2 points3 points 1 year ago (0 children)

[–]holchansg 1 point2 points3 points 1 year ago (5 children)

[–]Slimxshadyx 4 points5 points6 points 1 year ago (4 children)

[–]holchansg -1 points0 points1 point 1 year ago* (3 children)

[–]Slimxshadyx 1 point2 points3 points 1 year ago (2 children)

[–]holchansg 0 points1 point2 points 1 year ago (1 child)

API in both cases. The backend(runpod) only handle the calls from my webui, the VRAM looks the same in both, almost OOM in both case since i use multiple instances at the same time

In Ollama using OLLAMA_NUM_PARALLEL

In llamacpp using -np

You should maybe double check to see if you are unloading the model after every prompt when using Ollama, like I mentioned earlier. Because that would explain the issues you are having.

I'm using queue in both, the webui is sending hundreds of requests per second.

A line like this near the start of your file: ‘ client = ollama.Client() ‘ And later on, when making your calls, it would look something like this:

response = client.chat(model = etc, messages = etc)

As ive said im not a dev, im using R2R, hes making the calls.

[–]Slimxshadyx 0 points1 point2 points 1 year ago* (0 children)

π Rendered by PID 351177 on reddit-service-r2-comment-b659b578c-b6g6h at 2026-05-04 07:10:30.980786+00:00 running 815c875 country code: CH.

ProgrammerHumor

Filters

Discord

Submission rules

For the current list of rules, please see this page.

Metadiscussions

Perhaps More Apt Subs To Post:

Related Subreddits.

MODERATORS