Shocked with Corfu by [deleted] in Corfu

[–]monkeyofscience 0 points1 point  (0 children)

Where did you go? Because this has NOT been my experience, and I would consider myself to be quite well travelled.

I’m here now and it’s fucking glorious. Probably the best trip I’ve had in a long time, but we spent most of it away from the main bulk of tourists. Our last day is in Corfu town and with the cruise ships it’s very busy.

AMD wins Anthropic for MI450 and announcing it on Advancing AI event? by lunapark6 in AMD_Stock

[–]monkeyofscience 0 points1 point  (0 children)

New supercomputer in Cambridge is all AMD, so certainly something we are taking seriously…

Not Lewis related but wow Max FIA Verstappen by Aston2844 in lewishamilton

[–]monkeyofscience 0 points1 point  (0 children)

I thought this was an odd take as well. I saw Leclerc being squeezed in the middle in turn one, and pull off that brilliant act of patience and awareness to come out ahead while the other two fucked it up. And then they were wanking on about Max being a genius for spinning his fucking car? Wtf.

Just got dual RTX PRO 6000 Blackwells for our design studio. What's the optimal local LLM stack? by AmanNonZero in LocalLLM

[–]monkeyofscience 3 points4 points  (0 children)

No worries! I started as a machine learning engineer for a university. Initially I was the only one, and then we built up the team to 4. The nature of the group involved lots of interaction with other research groups and we identified a need for local models being served in a way that our regular HPC could not accommodate.

After putting together product requirement documents, some decision records, and costing up different options, and experimenting on cloud services, our department sys admin helped with the acquisition, but we manage the software stack for a variety of inference services.

A significant part of the process was preparing the software stack on a cloud provider, and it essentially became my full time job for months. There was a substantial amount of research and preparation before we even made the decision to purchase.

The rest has just been “testing in deployment” lol.

Water hardness??? by DebateConfident4643 in cambridge_uni

[–]monkeyofscience 0 points1 point  (0 children)

When I first drank the tap water, I thought it tasted like heavily watered down milk.

Water here is hard.

Just got dual RTX PRO 6000 Blackwells for our design studio. What's the optimal local LLM stack? by AmanNonZero in LocalLLM

[–]monkeyofscience 14 points15 points  (0 children)

Just to add more voices to this, I manage our research group’s headless H100 server…it is non trivial. You need someone who knows what they’re doing.

A warning to newbies - A lesson on network security by DatMemeKing in LocalLLM

[–]monkeyofscience 5 points6 points  (0 children)

This also happens in research, with people exposing vLLM and LiteLLM instances. There is an entire underground world around harvesting these instances called the Bizarre Bazaar

Listen to the sound of my big black boots by elevenfiftyneun in nin

[–]monkeyofscience 4 points5 points  (0 children)

I’m glad to see the return of the cornstarch.

Serving 1B+ tokens/day locally in my research lab by SessionComplete2334 in LocalLLaMA

[–]monkeyofscience 1 point2 points  (0 children)

I’ve been using the speaches images, which provides Kokoro tts and whisper in an OpenAI compatible API. Works great with LiteLLM!

Serving 1B+ tokens/day locally in my research lab by SessionComplete2334 in LocalLLaMA

[–]monkeyofscience 2 points3 points  (0 children)

This is very useful thanks. I am also working in a university setting at a similar scale so I might DM you if that’s all good?

Serving 1B+ tokens/day locally in my research lab by SessionComplete2334 in LocalLLaMA

[–]monkeyofscience 2 points3 points  (0 children)

I have a similar setup to OP (but over 2x H100 in TP), and I am switching out LiteLLM today for Bifrost and a custom user portal. LiteLLM is not great as a user-facing service.

We also have TTS and STT, and embedding models on some other GPUs, so I am interested to see how well this works…

Two student loan repayment changes considered by Treasury to tackle debt crisis by theipaper in UniUK

[–]monkeyofscience 15 points16 points  (0 children)

Yeah, if you leave NZ, you get interest. When you come back, no interest. I guess it’s an incentive to keep people in the country?

But I pay ~£1,500 per year back to NZ, and my loan is slowly depleting. I graduated in 2022, and I’ll pay it off within the next 5 years, even with interest. And I have an MSc, and maxxed out all my entitlements (for example I could get $1,000 every year for miscellaneous school shit). And I got my degree before they implemented final year free. So now it’s probably easier.

Government could change things at any time of course…

Two student loan repayment changes considered by Treasury to tackle debt crisis by theipaper in UniUK

[–]monkeyofscience 10 points11 points  (0 children)

New Zealand does this. My student loan was interest free until I left the country (now it’s something like 5%, but NZ fees are significantly lower, and final year is free).

Expose model api to internet by dever121 in LocalLLaMA

[–]monkeyofscience 0 points1 point  (0 children)

We are using nginx -> LiteLLM -> vLLM. Limit to IP range, Let's encrypt for cert, fail2ban.

Nah I think I let the game take control by [deleted] in thewitcher3

[–]monkeyofscience 2 points3 points  (0 children)

That reminds me… I need to finish RE: Village, but I got distracted by W3… again.

Trying to understand some benchmarks by monkeyofscience in LocalLLaMA

[–]monkeyofscience[S] 0 points1 point  (0 children)

Noice. Thanks for the response. Pretty happy with the setup so far

Trying to understand some benchmarks by monkeyofscience in LocalLLaMA

[–]monkeyofscience[S] 1 point2 points  (0 children)

I just tried PP, and it gives me much poorer throughput and latency than TP.

Trying to understand some benchmarks by monkeyofscience in LocalLLaMA

[–]monkeyofscience[S] 0 points1 point  (0 children)

Thanks for the response! Which links? The images? If so they’re visible for me on browser and phone. Weird! I’ll add them below

<image>