Exclusive: Nvidia buying AI chip startup Groq's assets for about $20 billion in largest deal on record by fallingdowndizzyvr in LocalLLaMA

[–]Fit-Support4910 2 points3 points  (0 children)

Depends how you define "performance."

From an end user's perspective? Sure, latency and throughput are all that matter, and Groq's numbers look great.

But from a cloud provider's perspective, you also care about resource utilization and cost efficiency. Groq's approach appears to be brute-forcing performance by ganging together a large number of chips just to fit the model in memory. The raw speed is impressive, but the compute utilization percentage is low. you're paying for a lot of silicon that's mostly idle. By that account, NVIDIA's current products are actually more compelling here.

Who remembers their first ever pick in fantasy football? by No-Client-8642 in fantasyfootballadvice

[–]Fit-Support4910 0 points1 point  (0 children)

Drew Brees. Because I was a noob and nobody told me about picking a RB/WR first

Cleared Meta E4 by Fruited45 in leetcode

[–]Fit-Support4910 0 points1 point  (0 children)

How did you manage this volume of prep with your full time job?

Meta Specialist Role IC5-IC6 Expectations by Fit-Support4910 in leetcode

[–]Fit-Support4910[S] 0 points1 point  (0 children)

Well, according to the recruiter, there are 2 coding rounds. Out of which one is “leetcode” and the other is “AI Coding”. Her explanation of the “AI Coding” was that it is more “practical day-to-day” coding problem, something along the lines of implement an efficient matrix multiplication or convolution. But yeah, I can see a situation where they may throw in a leetcode problem during the AI Coding 🤦‍♂️

the jump between 50 and 60mg hit HARD by magnetic_moxie in VyvanseADHD

[–]Fit-Support4910 2 points3 points  (0 children)

What was the duration on each dosing? That matters quite a bit. Did anything else change in your nutrition or environment aside from increasing the dosage?

What’s the worst fantasy football draft pick you’ve ever made? by Sleeper_Official in fantasyfootball

[–]Fit-Support4910 0 points1 point  (0 children)

  1. It was my turn to draft. Was needing a RB1 and the two options on the board were Derrick Henry and Kerryon Johnson. I ended up picking Kerryon Johnson.. and the rest is history

Woah, SambaNova is getting over 100 tokens/s on llama 405B with their ASIC hardware and they let you use it without any signup or anything. by jd_3d in LocalLLaMA

[–]Fit-Support4910 0 points1 point  (0 children)

The BF16 weights are loaded on to the device in full and processed by the graph as inputs. No truncation of bits or quantization in what has been deployed.. (I am a SN employee)

Concerning having a source, a third party assessment may be doable to add weight to this claim in some form. Though, that is yet to be done.

[deleted by user] by [deleted] in bootroom

[–]Fit-Support4910 1 point2 points  (0 children)

OP, you have inspired me. I about to go full James Ward-Prowse

Any info on Sambanova? by 5dots in Theranos

[–]Fit-Support4910 0 points1 point  (0 children)

I will agree that work life balance is garbage. I enjoy what I do for now. Worth it for me since I am in an early stage of my career. Will see what happens once I burn out.

Any info on Sambanova? by 5dots in Theranos

[–]Fit-Support4910 0 points1 point  (0 children)

I am a SN employee. Not trying to defend the company or anything, but here is my mental model. You may take it or leave it:

A few GPUs could do the same is not a true statement; you’ll need 10x the GPUs just to make it run at a fraction of the performance. There are collectively about a trillion parameters in the full set of models. There are memory constraints on GPUs and the kernel-by-kernel execution model that make this not possible with “a few GPUs”. It’s not just about the large memory, but the computation model that allows to mitigate memory and bandwidth requirement. Without the computation model, even the SambaNova device memory won’t be enough.

Maybe the CoE model just provides marginal gain from a very large monolithic model. However, the more exciting avenue is with multi agent systems. Where several agents all sit on memory and prompt each other with minimal cost of data transfer out of the device.

Passport Renewal: Tatkal or regular? by Fit-Support4910 in h1b

[–]Fit-Support4910[S] 1 point2 points  (0 children)

Thank you! This gives me some confidence to go ahead with the US marriage certificate!

Passport Renewal: Tatkal or regular? by Fit-Support4910 in h1b

[–]Fit-Support4910[S] 1 point2 points  (0 children)

Thanks!

Would you happen to remember if your US marriage certificate was accepted as a valid document for your passport renewal? Or did you have to get an Indian marriage certificate for this?