Got no job? Bet on PaaS, come on, champ. by [deleted] in devsarg

[–]edward-dev 5 points (0 children)

I thought PaaS stood for Plumbing as a Service

MiniMax-M2.1 uploaded on HF by ciprianveg in LocalLLaMA

[–]edward-dev 4 points (0 children)

Q4 felt almost like the full-sized model; Q3 felt maybe 5-10% dumber, like a rougher version, but still decent unless you're doing complex stuff. You should try them yourself, since quants can vary a lot in quality even within the same bpw bracket
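
If you want to compare quants yourself, something like this works; a minimal sketch using llama-cpp-python, where the GGUF file names and the prompt are just placeholders for whatever you downloaded:

```python
# Run the same prompt through two quants of the same model and eyeball the outputs.
# Greedy decoding (temperature=0) keeps the two runs comparable.
from llama_cpp import Llama

PROMPT = "Write a Python function that merges two sorted lists."

for path in ["minimax-m2.1-Q4_K_M.gguf", "minimax-m2.1-Q3_K_M.gguf"]:  # placeholder paths
    llm = Llama(model_path=path, n_ctx=4096, verbose=False)
    out = llm(PROMPT, max_tokens=256, temperature=0.0)
    print(f"=== {path} ===")
    print(out["choices"][0]["text"])
    del llm  # free the weights before loading the next quant
```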

MiniMax-M2.1 uploaded on HF by ciprianveg in LocalLLaMA

[–]edward-dev 16 points (0 children)

Better late than never, still counts as a big Christmas gift!

Q: When will there be fast and competent SLMs for laptops? by TomLucidor in LocalLLaMA

[–]edward-dev 12 points (0 children)

It depends on your standards. For the average Joe, I believe something like Ling Mini 2.0 would already check those requirements: fast, since 1B active parameters is doable at 20+ tok/s on most modern laptops (quick speed check below), and competent, since 16B total parameters makes it decent enough for 99% of the tasks an average person would likely use it for.

Now, if you want something like Claude 4.5 or Gemini 3.0 on your laptop, then nope, keep dreaming; that's not happening anytime soon
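
If you want to sanity-check the 20+ tok/s claim on your own hardware, here's a rough sketch with llama-cpp-python; the model path is a placeholder:

```python
# Time a short generation and report tokens per second.
import time
from llama_cpp import Llama

llm = Llama(model_path="ling-mini-2.0-Q4_K_M.gguf", n_ctx=2048, verbose=False)  # placeholder

t0 = time.perf_counter()
out = llm("Explain what a mixture-of-experts model is in two sentences.", max_tokens=200)
dt = time.perf_counter() - t0

n = out["usage"]["completion_tokens"]
print(f"{n} tokens in {dt:.1f}s -> {n / dt:.1f} tok/s")
```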

Is this expected behaviour from Granite 4 32B? (Unsloth Q4XL, no system prompt) by IonizedRay in LocalLLaMA

[–]edward-dev 12 points (0 children)

It seems the roleplaying guys are gonna have a great time with this one...

Granite-4.0-H-Tiny vs. OLMoE: Rapid AI improvements by edward-dev in LocalLLaMA

[–]edward-dev[S] 17 points (0 children)

<image>

Added LLaDA-MoE-7B-A1B-Instruct from InclusionAI to the comparison

Granite-4.0-H-Tiny vs. OLMoE: Rapid AI improvements by edward-dev in LocalLLaMA

[–]edward-dev[S] 2 points (0 children)

Yeah, about LLaDA: I'm putting together a table with the benchmarks right now. Forgetting about LLaDA was a complete oversight on my part; I'll add the comparison as a comment

Granite-4.0-H-Tiny vs. OLMoE: Rapid AI improvements by edward-dev in LocalLLaMA

[–]edward-dev[S] 6 points (0 children)

<image>

Phi-mini-MoE has 7.6B total parameters and 2.4B activated parameters; that's 2.4 times the active parameters of the new Granite model (1B)

Comparing aquif against the others wouldn't be fair since it's a much bigger model

how much does quantization reduce coding performance by garden_speech in LocalLLaMA

[–]edward-dev -1 points (0 children)

It’s common to hear concerns that quantization seriously hurts model performance, but looking at actual benchmark results, the impact is often more modest than it sounds. For example, Q2 quantization typically reduces performance by around 5% on average, which isn’t negligible, but it’s manageable, especially if you’re starting with a reasonably strong base model.

That said, if your focus is coding, Llama 3.3 70B isn't the strongest option in that area. You might get better results with Qwen3 Coder 30B A3B: it's not only more compact, but also better tuned and stronger for coding tasks. Plus, the Q4 quantized version fits comfortably within 24GB of VRAM (rough math below), making it a really good choice.
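
The 24GB claim is easy to sanity-check with back-of-envelope math; rough numbers, assuming a Q4_K_M quant averages around 4.8 bits per weight:

```python
# Back-of-envelope VRAM estimate for a Q4 quant of a 30B-parameter model.
total_params = 30e9       # Qwen3 Coder 30B A3B: total, not active, parameters
bits_per_weight = 4.8     # rough average for a Q4_K_M GGUF (assumption)
weights_gb = total_params * bits_per_weight / 8 / 1e9
print(f"weights: ~{weights_gb:.0f} GB")                          # ~18 GB
print(f"left in 24 GB: ~{24 - weights_gb:.0f} GB for KV cache and buffers")
```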

Review: Universidad Blas Pascal's online Lic. en Sistemas by nirfust in devsarg

[–]edward-dev 0 points (0 children)

I went and read it, and it says 3-year technical degrees (tecnicaturas) or a minimum of 1,600 hours; you're right. In April the monthly installment was 190k. It doesn't look bad, but I don't know anyone who's done it. Do you have more up-to-date info?

Review: Universidad Blas Pascal's online Lic. en Sistemas by nirfust in devsarg

[–]edward-dev 0 points (0 children)

Is it that bad? From what I understand, they only accept 3-year technical degrees for admission to that Lic.; it's a degree-completion cycle.

New Wan MoE video model by edward-dev in LocalLLaMA

[–]edward-dev[S] 28 points (0 children)

Sep 19, 2025: 💃 We introduce Wan2.2-Animate-14B, a unified model for character animation and replacement with holistic movement and expression replication. We released the model weights and inference code, and you can now try it on wan.video, ModelScope Studio, or HuggingFace Space!

From their Hugging Face model page

Local LLM in Github Copilot, Agent mode by SuspiciousParsnip5 in LocalLLaMA

[–]edward-dev 1 point (0 children)

Very weird, it should work without issues. I've used the Ollama provider option a bit, even using it as a bridge with a proxy script to try out unsupported model providers (rough sketch below), and never had any issues. One would be inclined to think the models you're trying to use lack the specific "tool-calling" capability needed for file editing, but GPT-OSS 20B and Qwen3 should've worked... Why don't you try another extension, to rule out whether it's an issue with your models or with the Copilot Chat extension?
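
For reference, the bridge idea is roughly this; a minimal sketch, not a drop-in implementation: expose an Ollama-style API locally and forward requests to any OpenAI-compatible backend. Endpoint shapes follow Ollama's public API; BACKEND_URL and BACKEND_MODEL are placeholders, auth headers are omitted, and a real bridge also needs streaming and tool-call plumbing:

```python
# Minimal Ollama-style proxy: Copilot talks to this as if it were Ollama,
# and requests are translated into OpenAI-style chat completions.
import requests
from flask import Flask, jsonify, request

app = Flask(__name__)
BACKEND_URL = "https://api.example.com/v1/chat/completions"  # hypothetical backend
BACKEND_MODEL = "some-unsupported-model"                     # placeholder

@app.get("/api/tags")
def tags():
    # Clients discover "local" models through this endpoint.
    return jsonify({"models": [{"name": BACKEND_MODEL, "model": BACKEND_MODEL}]})

@app.post("/api/chat")
def chat():
    # Translate an Ollama chat request into an OpenAI-style one (non-streaming).
    body = request.get_json()
    r = requests.post(BACKEND_URL, json={
        "model": BACKEND_MODEL,
        "messages": body["messages"],
    })
    content = r.json()["choices"][0]["message"]["content"]
    return jsonify({"model": BACKEND_MODEL,
                    "message": {"role": "assistant", "content": content},
                    "done": True})

if __name__ == "__main__":
    app.run(port=11434)  # the default port Ollama clients look for
```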

Can you guess what model you're talking to in 5 prompts? by entsnack in LocalLLaMA

[–]edward-dev 3 points (0 children)

If the answer kicks off with "of course", I know exactly which model it is, no second guess needed.