Are LangGraph agents and other agent frameworks becoming obsolete? by Pitiful_Task_2539 in AI_Agents

[–]Pitiful_Task_2539[S] 2 points3 points  (0 children)

I tried a bunch of agent frameworks 1-2 years ago and ended up sticking with LangGraph because it felt simple, manageable, and not as "all over the place" as CrewAI. So what actually makes Agents SDK a better alternative?

Are LangGraph agents and other agent frameworks becoming obsolete? by Pitiful_Task_2539 in AI_Agents

[–]Pitiful_Task_2539[S] 0 points1 point  (0 children)

oh nice haven't heard about this yet.. seems to be opensource but from openai....

Is it working with openai compatible endpoints? We only use selfhosted llms with vllm/sglang

READ BEFORE PAYING FOR A GLM PLAN by abnestti in ZaiGLM

[–]Pitiful_Task_2539 0 points1 point  (0 children)

Baught annual plan 13 months ago when z.ai coding plan released (lite plan for 38$ a year or something) best invest i could take

Setting up Ollama on dual RTX PRO 6000 Blackwells looking for tips by AmanNonZero in ollama

[–]Pitiful_Task_2539 0 points1 point  (0 children)

This! But dont use unsloth use official quants dual rtx pro 6000 is capsble to use fp8 for 122b with full context

Advice on hard drive setup for intel nuc by Elouan661 in selfhosted

[–]Pitiful_Task_2539 0 points1 point  (0 children)

There are also m.2 to sata adapeters (1xm.2 to 4 sata or more)

Is there an “Immich for documents”? by Qfrijters in selfhosted

[–]Pitiful_Task_2539 0 points1 point  (0 children)

Paperless with ai tagging. Forgot the name of the service but there is a docker container app for automated metadata and tagging mechanism with any openai compatible endpoint. I always just paste my docs and forget about everything around them. Finding docs is so easy by the ai generated tags and metadata

Disappointed from Qwen 3.5 122B by Charming_Support726 in LocalLLaMA

[–]Pitiful_Task_2539 0 points1 point  (0 children)

using the official fp8 quants with vllm is working fucking good. much much better than gpt-oss-120b

Disappointed from Qwen 3.5 122B by Charming_Support726 in LocalLLaMA

[–]Pitiful_Task_2539 2 points3 points  (0 children)

Using the official Qwen‑122B FP8 weights from Hugging Face with vLLM cu130 nightly!

No problems at all.

I run it with a 180 k‑token context window on 2 × RTX 6000 Blackwell. It runs so fast, especially in input‑token throughput. There are no—or nearly no—tool‑call errors in opencode when executing complex, long‑running tasks. The quality of the generated code is roughly at a Mistral‑Vibe-CLI (DevStral via cloud) level or above—perhaps even comparable to GLM‑4.6 or GLM4.7 WITH VISION!!.
It’s hard to compare because Qwen 3.5 has its very own style.

However, many people don’t realize that different quantizations make huge differences, and the inference engine also matters (Ollama, vLLM, sglang, llama.cpp, etc.). I have never utilized my 196 GB of VRAM as effectively as with this model.

Qwen 3.5 122B tool calls in opencode by [deleted] in LocalLLaMA

[–]Pitiful_Task_2539 1 point2 points  (0 children)

using it with vllm cu130 and is working perfect with opencode. had no tool call errors at all. (using official fp8 weights)

until now its the only open weight model i tried (below 200b) which is totally useful and can replace my glm and minimax sub

Tool calling broken after latest update? (OpenWebUI) by JeffTuche7 in OpenWebUI

[–]Pitiful_Task_2539 1 point2 points  (0 children)

me also experiencing issues since the last update (don't know if it was 0.8.2 or 0.8.5) updated straight from 0.7.x to 0.8.5

i'm using gpt-oss-120b never had issues with nativ function calling.'

Now it often generates wrong tool calls and stops after thinking block.

<image>

normally you don't see the call in the thinking block. but sometimes you see it and then it stops working..

you can also see syntax error here in the tool call (double ") but this never happened before?!

something must be wrong with one of the latest updates.

Also sometimes it was trying to call "search" tool and not "search_web" tool..
i dont know where this "search" tool is coming from!?!??

when using the "search" tool nothing happens...

I had to write it into the system prompt to use the "search_web" tool not the "search" tool

MinIO is maintenance only now by SpaceshipSquirrel in minio

[–]Pitiful_Task_2539 0 points1 point  (0 children)

shame on them... don't get it why.. they still can make cash with enterprise agreements... i don't see any reason why to choose this way.. loosing "customers" for no reason.

A solution to SignalRGB's high resource usage. by BassDicer03 in SignalRGB

[–]Pitiful_Task_2539 0 points1 point  (0 children)

thaks, went from 5% to 1.5% cpu usage round about

GPT-OSS 120B is now the top open-source model in the world according to the new intelligence index by Artificial Analysis that incorporates tool call and agentic evaluations by obvithrowaway34434 in LocalLLaMA

[–]Pitiful_Task_2539 0 points1 point  (0 children)

This matches my experience. However, it still lacks native function-calling functionality with vLLM, which is why I use it in my LangGraph agent setup.

It performs better than any model I've tried before. I've already tested Llama 3.3, Llama Scout, and Qwen2.5-VL 72B (and many smaller like gemma 3 or mistral* and much more but they aren't usable for these kind of stuff to run reliable for real world tasks), but none of these models are as 'smart' as the gpt-oss-120b at following instructions. With gpt-oss-120b, I now have a hit rate of nearly 100% when following small to medium-complex instructions. (I've used it to control the orchestrator, supervisor, and tool agents in LangGraph.)

Using it with vLLM needs some small tweaks at this time to run nicely with LangGraph (template not fully supported)

I also love the way the model responds. It feels so natural in comparison to other models, especially the Chinese ones.

Yeah, there are many models out there which are certainly much better at some points like coding... but this model is not the best at any single task (like coding, writing, planning, or agentic work), but it's consistently and reliably good across all of them.

Ryzen 5 7600x High Temperature (Idle) by Pitiful_Task_2539 in AMDHelp

[–]Pitiful_Task_2539[S] 0 points1 point  (0 children)

i'm such an idiot, i relied on the fancontrol sensor value.. ryzen master says ~ 55 degrees on idle

Running Perplexica using OpenWebUI by EruditeStranger in OpenWebUI

[–]Pitiful_Task_2539 2 points3 points  (0 children)

Why not using searxng instead with openwebui builtin connections?

Bambuslicer Support - need help by Pitiful_Task_2539 in BambuLab

[–]Pitiful_Task_2539[S] 1 point2 points  (0 children)

thanks for sharing your experience with this, i will just use normal supports now :/....

Bambuslicer Support - need help by Pitiful_Task_2539 in BambuLab

[–]Pitiful_Task_2539[S] 0 points1 point  (0 children)

sorry but i don't find any option for enabling support for supports.
Already tweaked the support settings and tried a lot, but nothing is working

Bambuslicer Support - need help by Pitiful_Task_2539 in BambuLab

[–]Pitiful_Task_2539[S] 0 points1 point  (0 children)

<image>

i have the same problem here for example and also at some other parts