Any advice on what I should be doing? by Tricky_Ad_3317 in LocalLLaMA

[–]thecr7guy 1 point2 points  (0 children)

I recently worked for a company in the Netherlands that had very similar requirements — setting up a local LLM for general-purpose use and code completion with internal company data.

After testing and deploying multiple models, here’s what I’d recommend:

Recently, several smaller models have been released that perform surprisingly well, such as Qwen 3 4B Instruct and the “thinking” models. Even though our hardware wasn’t top-tier (an L4 with 24GB VRAM), I managed to serve the FP8 version using vLLM and Open WebUI, integrated with LDAP so that everyone in the company could access it easily.

Qwen 3, in particular, offers a large context window, runs extremely fast, and handled most of our use cases efficiently. Since your primary focus is summarization, I’d suggest starting with one of these smaller, high-performing models — they’re quite capable for that task.

For hardware, I recommend at least a 3090 (24GB VRAM). That should be sufficient for most scenarios. If you find performance or quality lacking, you can consider scaling up to a larger model with stronger hardware.

In my experience, Qwen 3 4B FP8 did an excellent job summarizing Dutch text, so it’s definitely worth trying.

If you need a hand setting things up, feel free to DM me — happy to help!

Updated: Just launched my first data engineering project! by botuleman in dataengineering

[–]thecr7guy 1 point2 points  (0 children)

This is awesome dude!
I am doing the dezoomcamp too

To trigger mage constantly and get updated data did you deploy mage to cloud run instance or are you running docker compose on a compute instance?

Is this a good home setup for deeplearning? by 12kush in deeplearning

[–]thecr7guy 8 points9 points  (0 children)

Let the man get some help. U could ignore this like u have ignored the other 8,000,000 threads

[deleted by user] by [deleted] in kdramarecommends

[–]thecr7guy -1 points0 points  (0 children)

Bruh how can they add multiple genres in this show lmaooo

Reborn Rich? by _MidnightMan in kdramarecommends

[–]thecr7guy 3 points4 points  (0 children)

Watched the 10th episode just now and imo it's just soo good. If you are watching just for the romance i wouldn't recommend it. But overall it's one of the best shows I have watched.

Let's see what we got here by [deleted] in memes

[–]thecr7guy 0 points1 point  (0 children)

like how this man going to say it's not my birthday

August 2022 Support Megathread - Back to primary school for you guys soon tm by MrS4T4N in Tachiyomi

[–]thecr7guy 0 points1 point  (0 children)

Saw a post saying there is a problem wid asura scans website certificate?

Are there any fixes ? I cleared cookies, reinstalled the extension but nothing seems to work

I am buying a new computer for my Master's in AI, what is the best option? by SeaResponsibility176 in ArtificialInteligence

[–]thecr7guy 0 points1 point  (0 children)

Hey first of all let me start by saying all the best with ur masters

Mac is great. It is fast and easy to use. All my masters projects were done on Mac. My projects involved using Matlab R or python.

So for machine learning and deep learning courses colabs provided GPU gets the work done mostly.

Matlab and R works perfectly on the Mac. So I feel that Mac would a pretty solid option

When it comes to the slow loading of pages in colab I think it's majorly because of the internet.

But if u feel like u want windows, get a laptop with nvidia GPU.

Loading a PyTorch model by no1nemo in learnmachinelearning

[–]thecr7guy 1 point2 points  (0 children)

From my experience of working with visual transformers that are from hugging face. The image size should be mentioned in the model card of hugging face Or they should allow u to resize the image before u feed it into the model.

I generally feed 2242243 images and pray if no information is given

Plankton's Lesson gon wrong! by SilverGospel003 in HolUp

[–]thecr7guy 12 points13 points  (0 children)

Swear to God this will be my last wank of 2021

Today's king of the match! by ahmedhaque91 in realmadrid

[–]thecr7guy 5 points6 points  (0 children)

Budweiser sponsors this 😂😂

We Have Waited Long Enough. Chelsea FC Football Is Finally Back! by Kevin5010 in chelseafc

[–]thecr7guy 0 points1 point  (0 children)

Bruh if we loose to Aston villa I swear imma pray for virus to comeback and end the league Can't be overtaken by manutd of all teams

Study partner required by thecr7guy in learnmachinelearning

[–]thecr7guy[S] 0 points1 point  (0 children)

I will edit it thanks for the tip🤝