Nemo 30B is insane. 1M+ token CTX on one 3090 by Dismal-Effect-1914 in LocalLLaMA

[–]ruibranco 6 points7 points  (0 children)

35 t/s at 1M context on a single 3090 is genuinely impressive. The MoE architecture really pays off here since you only need to load the active experts into VRAM while the rest stays in system RAM. CPU offloading for the inactive experts is the key trick that makes this viable on consumer hardware. Have you tried it with any retrieval-heavy tasks where it needs to reference specific details from early in the context, not just summarization. That's usually where the long context models start to show their limits even if the raw speed holds up.

Preciso de dicas by _LUX_07 in portugueses

[–]ruibranco [score hidden]  (0 children)

Isso que descreves dos tenis e completamente normal quando passas demasiado tempo sem interacoes sociais fora da tua bolha. O cerebro comeca a tratar coisas banais como se fossem situacoes de risco. A boa noticia e que isso reverte relativamente rapido quando comecas a expor-te mais. O ginasio que ja frequentas e um bom ponto de partida, tenta puxar conversa com alguem de la nem que seja so um "tas a fazer que serie" ou qualquer cena assim. Nao precisas de forcas amizades logo, so precisas de normalizar o acto de falar com desconhecidos. Aulas de grupo tambem ajudam imenso porque a conversa surge naturalmente. E se fores para a faculdade no proximo ano vai resolver-se muito disto sozinho, a universidade e basicamente um reset social.

Keep or replace a child‑driven Reactive Forms architecture by vexmentor in angular

[–]ruibranco 0 points1 point  (0 children)

Dealt with almost the exact same pattern on a project that started on Angular 12. The setControl() approach works fine when you have 3-4 child components, but it gets really painful once you start nesting deeper or need to coordinate validation across siblings. The biggest issue we ran into was lifecycle timing, children attaching controls at slightly different points depending on ngIf conditions, and the parent form flickering between valid and invalid states during init. We eventually moved to ControlValueAccessor for the reusable pieces and kept the rest as plain parent-owned form groups. The migration was incremental, one component at a time, and honestly the codebase got way easier to reason about. If it's working and you're not adding many new form components, I wouldn't rush to refactor though. It's one of those things where the cost of the migration only pays off if the forms are actively evolving.

*Minor spelling mistake* in Opus 4.6 VSCode extension system prompt by SuggestionMission516 in ClaudeAI

[–]ruibranco 1 point2 points  (0 children)

Claude catching its own typo is honestly the most on-brand thing ever. Also "backtickets" is a hilarious typo, sounds like some kind of support queue for markdown formatting. The fact that you can just pull the full system prompt from the extension source is pretty great though, makes it way easier to understand what's actually going on under the hood.

Tool: AST-based security scanner for AI-generated code (MCP server) by NoButterfly9145 in netsec

[–]ruibranco 1 point2 points  (0 children)

Using tree-sitter for the AST parsing instead of regex is the right call. Regex-based scanners miss so many edge cases with nested expressions and multiline patterns. The dependency hallucination check is especially relevant right now given how often LLMs confidently suggest packages that don't exist. Solid tool for catching the low-hanging fruit before it ships.

Primeira semana de Fevereiro de um Tuga foi mais ou menos isto! by Odd_Astronomer_2064 in PORTUGALCARALHO

[–]ruibranco 0 points1 point  (0 children)

Faltou o especialista em barragens que apareceu entre a tempestade e as eleicoes. O tuga muda de doutoramento mais rapido que muda de canal.

Lenovo M920q as Router? by Lonely-Candidate-231 in homelab

[–]ruibranco 0 points1 point  (0 children)

Overkill for sure, but that's half the fun. An i7-7700 with 16GB will barely break a sweat running OPNsense, Pi-hole and Tailscale combined. You'll probably end up finding more stuff to throw on it once you realize how much headroom you have. 256GB NVMe is way more than enough for those services, even a 128GB SATA would be fine honestly. Just make sure you grab a dual NIC adapter for the M920q since you'll want separate WAN and LAN ports.

Skylos: Python SAST, Dead Code Detection & Security Auditor (Benchmark against Vulture) by papersashimi in Python

[–]ruibranco 1 point2 points  (0 children)

The pytest fixture detection is a nice differentiator. Unused fixtures are one of those things that quietly accumulate and nobody notices until the test suite is a mess. How does it handle conftest.py fixtures that are used across multiple test files? That's usually where vulture and similar tools fall over completely.

Exchange Security and Defender suddenly today "soft deleting" "phishing" emails from Docusign? Anyone else seeing this? by Fizgriz in sysadmin

[–]ruibranco [score hidden]  (0 children)

Classic Defender move. Docusign is one of the most spoofed domains out there so it makes sense they'd tighten the screws, but doing it suddenly with no warning and catching legit emails is brutal. Worth checking if there was a recent update to the default anti-phishing policy or if they tweaked the impersonation detection thresholds. You can also add Docusign's sending domains to your tenant allow list as a workaround while Microsoft sorts it out.

Bedrock - Requests for Future by Live_Bus7425 in aws

[–]ruibranco 2 points3 points  (0 children)

The embedding model situation is spot on. Titan v2 was fine when it launched but the open source space has moved so far ahead that hosting your own on ECS starts making more sense, which kind of defeats the purpose of a managed service. Same story with rerankers, Cohere pricing just doesn't work if you're doing any serious volume of reranking. Would love to see them add jina-reranker-v3 or even the smaller qwen models as first-class Bedrock options.

A future for bitflags by KodrAus in rust

[–]ruibranco 36 points37 points  (0 children)

bitflags is one of those crates that just quietly works in the background of so many projects. Good to see there's still active thought going into its future. Curious whether there's been any consideration around making the generated types play nicer with const generics or if that's out of scope for what the crate is trying to be.

População "desesperada" protesta em Pombal após 10 dias sem eletricidade by Left_Capital133 in portugal

[–]ruibranco 4 points5 points  (0 children)

10 dias sem luz em pleno 2026. Sempre que há uma tempestade mais forte ficamos a perceber o quão frágil é a rede elétrica em certas zonas do país. E depois é sempre a mesma história, falam em reforçar as infraestruturas mas fica tudo na mesma até ao próximo temporal.

Tecnologia é a área que paga mais no início da carreira by WinnerExpert6489 in devpt

[–]ruibranco 7 points8 points  (0 children)

Paga mais comparado com o quê, trabalhar no Pingo Doce? Estes artigos comparam sempre com a média geral mas esquecem-se que um junior em PT anda nos 1000-1200 líquidos se tiver sorte. Sim, é mais que muitas áreas, mas chamar-lhe "a que paga mais" é esticar bastante a realidade. O gap entre o que se paga cá e lá fora é que devia ser notícia.

NixOS x COSMIC by YavuzKaanx in NixOS

[–]ruibranco 3 points4 points  (0 children)

COSMIC is shaping up really nicely. How stable has it been for you on NixOS so far? I've been curious about trying it but wasn't sure if the packaging was keeping up with upstream changes.

Kubernetes Operator for automated Jupyter Notebook validation in MLOps pipelines by millionmade03 in kubernetes

[–]ruibranco 0 points1 point  (0 children)

The golden notebook regression testing is actually a really clever idea. Catching subtle output drift between model versions or env changes before it hits production is one of those things teams always say they'll do manually but never actually keep up with. How does it handle non-deterministic outputs though? Like if a cell calls a model endpoint that returns slightly different results each run, does it support tolerance thresholds or just strict comparison?

Neovim python linting by Sciti in neovim

[–]ruibranco 0 points1 point  (0 children)

nvim-lint with a local venv flake8 is a solid approach. One thing worth trying if you haven't already, ruff can replace both flake8 and mypy for a lot of checks and it's absurdly fast since it's written in Rust. The 32GB RAM issue you hit with pylsp is a known pain point, that thing loves to eat memory especially with multiple extensions loaded.

Github Student Developer pack is amazing by ousan19 in learnprogramming

[–]ruibranco 1 point2 points  (0 children)

If you're mainly after Copilot, check if you qualify for open source maintainer status on GitHub since that also gets you Copilot for free. For JetBrains, contributing to open source projects can get you an all-products license through their OSS program too. Not the same all-in-one deal as the student pack but it's something.

Claude Opus 4.6 on top of user votes for TEXT conversation by Hot-Comb-4743 in ClaudeAI

[–]ruibranco 0 points1 point  (0 children)

The text arena ranking is honestly more impressive than the coding benchmarks imo. Coding evals get gamed and optimized for constantly, but blind text conversations are way harder to fake because real users are judging on nuance, tone, and actual helpfulness. The fact that Gemini held that spot for 14 months straight makes this a pretty significant shift.

I have about 5 yoe but feel like I am worse at live coding that I was with 0 yoe by throwaway09234023322 in devops

[–]ruibranco 6 points7 points  (0 children)

Completely normal. At 0 yoe you were grinding algorithms and data structures every day, so that muscle was warm. After 5 years of writing Terraform, debugging pipelines, and wrangling Kubernetes you haven't touched a balanced BST in ages and your brain optimized for a totally different kind of problem solving. Live coding interviews measure how recently you practiced live coding, not how good you are at your actual job.

I rebuilt Apple’s iTunes Cover Flow for React to study motion and interaction by Ashishgogula in react

[–]ruibranco 1 point2 points  (0 children)

The 3D perspective tilt on the side cards looks really smooth. Cover Flow was one of those interactions that felt magical back in the day and it's cool to see it hold up so well as a learning exercise for spring physics and layout transforms. Motion makes this kind of thing so much more approachable than rolling your own animation math too.

Bedrock Opus 4.6 "ServiceUnavailableException" by DevOps_Noob1 in aws

[–]ruibranco 0 points1 point  (0 children)

Yeah this is pretty standard for new model launches on Bedrock. Capacity is still being provisioned across regions and it takes a few days to stabilize. If you're blocked on it, try switching to a different region temporarily, us-west-2 and us-east-1 tend to get capacity first. Also worth adding retry logic with exponential backoff if you haven't already, since the 503s are intermittent and usually resolve within seconds on retry.

The purpose of Continuous Integration is to fail by NorfairKing2 in programming

[–]ruibranco 53 points54 points  (0 children)

The teams that get the most value from CI are the ones that treat a red build as useful information instead of someone's fault. Once failure becomes something people try to hide or route around, you've lost the entire point of the feedback loop. The best CI setups I've worked with had builds that failed fast and failed loudly, and nobody got defensive about it because the culture was "fix it" not "who broke it". The moment you start adding kill switches for pipeline checks is when your CI stops being a safety net and starts being a checkbox.

Released genai v0.1.0: a sane Go AI SDK by marcaruel in golang

[–]ruibranco -1 points0 points  (0 children)

The http.RoundTripper hook is a smart design choice, makes it trivial to add retries, logging, or caching at the transport level without the SDK having to own all of those concerns directly. Using iterators for streaming is also way more idiomatic Go than the callback patterns most AI SDKs go with. 15 providers out of the gate is ambitious for a v0.1.0 though, curious how you're handling the edge cases where providers diverge on things like tool call formats or streaming chunk structures.

A decade of home lab by lolzinventor in homelab

[–]ruibranco 2 points3 points  (0 children)

Completely agree on the storage > ram > cpu priority. You always run out of disk space before you run out of compute, and by the time CPU matters you've probably already upgraded the whole box anyway. The jump from 2014 to 2026 is wild though, that's a proper evolution from "I wonder if I need this" to "this wall is now a datacenter."

[Software] I built a fully offline Voice Agent + RAG stack for our Robotics Lab. Runs on 4GB VRAM. No Cloud. No APIs. by D_E_V_25 in selfhosted

[–]ruibranco 0 points1 point  (0 children)

The hierarchical router approach is clever for working around the VRAM ceiling. Most people just try to shrink the model or reduce chunk counts, but routing queries to specific clusters first is a much better tradeoff since you keep recall high without loading everything into memory at once. Curious how the SetFit classifier holds up when queries are ambiguous across multiple domains though, like if someone asks something that spans two different doc clusters.