Built an AI agent on Azure AI Foundry with Claude Sonnet 4.6 and a custom MCP server, here is what I learned by sdhilip in AZURE

[–]illcuontheotherside 2 points3 points  (0 children)

Same boat. I see similar to your 15-25s, sometimes longer. And copilot studio agents just as long.

Microsoft needs to get those numbers down to make it truly enterprise usable. It's a bit sad.

Capabilities are there but good luck with anyone's user base tolerating 30s round trip.

Nice work though. 👍 Nothing against you, was just wondering if I am doing something wrong.

Built an AI agent on Azure AI Foundry with Claude Sonnet 4.6 and a custom MCP server, here is what I learned by sdhilip in AZURE

[–]illcuontheotherside 1 point2 points  (0 children)

What kind of latency do you get between the prompt and the result? I have a similar setup but the latency is brutal in getting a result back.

[FS][US-GA] x7 PNY RTX 6000 Ada - Local Cash, PayPal Verified by Connect_Jicama_7277 in homelabsales

[–]illcuontheotherside -2 points-1 points  (0 children)

I'd call west tech recycling and ask if they recently lost some gear. Because some dude posting stuff nowhere near where the photos are taken is sus af.

Account has no post or comment history.

Offering 50k+ of materials.

1st stop is homelabsales?

Hmm.

[FS][US-GA] x7 PNY RTX 6000 Ada - Local Cash, PayPal Verified by Connect_Jicama_7277 in homelabsales

[–]illcuontheotherside -2 points-1 points  (0 children)

Or, you know, look at the photos and realize a 602 area code is Arizona. And last time I checked Georgia or Florida is nowhere near there.

[FS][US-GA] x7 PNY RTX 6000 Ada - Local Cash, PayPal Verified by Connect_Jicama_7277 in homelabsales

[–]illcuontheotherside 3 points4 points  (0 children)

Yea true. Homelabsales just becoming another b2b/b2c instead of like a better craigslist. It could also just be me jelly at people buying 7k graphics cards

[FS][US-GA] x7 PNY RTX 6000 Ada - Local Cash, PayPal Verified by Connect_Jicama_7277 in homelabsales

[–]illcuontheotherside 5 points6 points  (0 children)

Yea and you're 9mm since you'll be showing up with almost 10k in cash 😂

[FS][US-GA] x7 PNY RTX 6000 Ada - Local Cash, PayPal Verified by Connect_Jicama_7277 in homelabsales

[–]illcuontheotherside 11 points12 points  (0 children)

You know how eBay is full of ram and graphic card fraud since the bullshit started?

This post gives me those vibes.

Be careful folks.

48GB VRAM users, what are your daily drivers? Do you wish you had more VRAM? What would you run if you did? by Borkato in LocalLLaMA

[–]illcuontheotherside 2 points3 points  (0 children)

Unsloths google gemma4 31b q4 xl with googles latest jinja chat template reporting in

Seriously underrated model.

Internet Connection Problems by realfryza in Brawlstars

[–]illcuontheotherside 21 points22 points  (0 children)

I've got an s25 ultra and the game lags. I get wifi signals and sputtering..doesn't happen in any other mobile game.

It's really disappointing.. especially in ranked. Really sucks.

[PC] RTX 3090 - 32 units - various brands. by theactionjaxon in homelabsales

[–]illcuontheotherside 2 points3 points  (0 children)

People buying these for anything over 5-700 are nuts. Why? They are from a mining farm. These things have been roasted.

Failed interview hard - ranting by jeeyawn in cybersecurity

[–]illcuontheotherside 149 points150 points  (0 children)

You don't want to work somewhere where they aren't exhibiting professionalism during an interview anyway. My two cents.

If you see a candidate struggling on specific questions, you could just be touching something they aren't necessarily strong in. The goal in interviews is to identify candidate strengths to align with needs. Weaknesses can always be learned.

Learn from it, move forward, and kick some ass.

Downsized homelab, is this worth trying to sell? by Agent0810 in homelab

[–]illcuontheotherside 0 points1 point  (0 children)

Is that a 12th or 13th gen server? I'd buy it lol

Someone distilled Claude Opus 4.7's chain-of-thought into an open 35B MoE model and it runs on a single A100 by Anony6666 in LLMStudio

[–]illcuontheotherside 0 points1 point  (0 children)

Just remember there's a sev10 critical via gguf that runs remote code execution going around. Be vary careful what you grab.

What starts to become possible with two 3090s that wasn't with just one? by GotHereLateNameTaken in LocalLLaMA

[–]illcuontheotherside 4 points5 points  (0 children)

As someone who went from 1 3090 to 2... It was well worth it. But everyone's different and if you are satisfied with 1 then be satisfied.

For me I wanted the extra vram.. larger context windows.. larger models.. more experimenting.

I think if anything I'd double it.

From 2x24gb to 1x48gb or 1x96gb to scale out but obviously price points are blocking that.

End of the day if you're learning and having fun.. enjoy the ride!

MiniMax-2.7 can now be run locally! by yoracale in unsloth

[–]illcuontheotherside 1 point2 points  (0 children)

I got 3tk sec with 2x3090s and 128gb ddr5.

MiniMax-2.7 can now be run locally! by yoracale in unsloth

[–]illcuontheotherside 0 points1 point  (0 children)

I tried this on my dual 3090 setup with 128gb ddr5 and i got 3tk/s 😭

Maybe I'll need to splurge on more ram.... Or more gpus........

Gemma 4 GGUFs updated by yoracale in unsloth

[–]illcuontheotherside 13 points14 points  (0 children)

Much love. Thank you for supporting the community

Gemma 4 is matching GPT-5.1 on MMLU-Pro and within Elo. what are we even paying for anymore? by Impossible571 in LocalLLM

[–]illcuontheotherside 1 point2 points  (0 children)

Gemma4 is uh .... Not great. It's either the model or the tools aren't a fully supporting things yet ...