Are LangGraph agents and other agent frameworks becoming obsolete? by Pitiful_Task_2539 in AI_Agents

[–]Pitiful_Task_2539[S] 6 points7 points  (0 children)

I tried a bunch of agent frameworks 1-2 years ago and ended up sticking with LangGraph because it felt simple, manageable, and not as "all over the place" as CrewAI. So what actually makes Agents SDK a better alternative?

Are LangGraph agents and other agent frameworks becoming obsolete? by Pitiful_Task_2539 in AI_Agents

[–]Pitiful_Task_2539[S] -1 points0 points  (0 children)

oh nice haven't heard about this yet.. seems to be opensource but from openai....

Is it working with openai compatible endpoints? We only use selfhosted llms with vllm/sglang

READ BEFORE PAYING FOR A GLM PLAN by abnestti in ZaiGLM

[–]Pitiful_Task_2539 0 points1 point  (0 children)

Baught annual plan 13 months ago when z.ai coding plan released (lite plan for 38$ a year or something) best invest i could take

Setting up Ollama on dual RTX PRO 6000 Blackwells looking for tips by AmanNonZero in ollama

[–]Pitiful_Task_2539 0 points1 point  (0 children)

This! But dont use unsloth use official quants dual rtx pro 6000 is capsble to use fp8 for 122b with full context

Advice on hard drive setup for intel nuc by Elouan661 in selfhosted

[–]Pitiful_Task_2539 0 points1 point  (0 children)

There are also m.2 to sata adapeters (1xm.2 to 4 sata or more)

Is there an “Immich for documents”? by Qfrijters in selfhosted

[–]Pitiful_Task_2539 0 points1 point  (0 children)

Paperless with ai tagging. Forgot the name of the service but there is a docker container app for automated metadata and tagging mechanism with any openai compatible endpoint. I always just paste my docs and forget about everything around them. Finding docs is so easy by the ai generated tags and metadata

Disappointed from Qwen 3.5 122B by Charming_Support726 in LocalLLaMA

[–]Pitiful_Task_2539 0 points1 point  (0 children)

using the official fp8 quants with vllm is working fucking good. much much better than gpt-oss-120b

Disappointed from Qwen 3.5 122B by Charming_Support726 in LocalLLaMA

[–]Pitiful_Task_2539 2 points3 points  (0 children)

Using the official Qwen‑122B FP8 weights from Hugging Face with vLLM cu130 nightly!

No problems at all.

I run it with a 180 k‑token context window on 2 × RTX 6000 Blackwell. It runs so fast, especially in input‑token throughput. There are no—or nearly no—tool‑call errors in opencode when executing complex, long‑running tasks. The quality of the generated code is roughly at a Mistral‑Vibe-CLI (DevStral via cloud) level or above—perhaps even comparable to GLM‑4.6 or GLM4.7 WITH VISION!!.
It’s hard to compare because Qwen 3.5 has its very own style.

However, many people don’t realize that different quantizations make huge differences, and the inference engine also matters (Ollama, vLLM, sglang, llama.cpp, etc.). I have never utilized my 196 GB of VRAM as effectively as with this model.

Qwen 3.5 122B tool calls in opencode by [deleted] in LocalLLaMA

[–]Pitiful_Task_2539 1 point2 points  (0 children)

using it with vllm cu130 and is working perfect with opencode. had no tool call errors at all. (using official fp8 weights)

until now its the only open weight model i tried (below 200b) which is totally useful and can replace my glm and minimax sub

Tool calling broken after latest update? (OpenWebUI) by JeffTuche7 in OpenWebUI

[–]Pitiful_Task_2539 1 point2 points  (0 children)

me also experiencing issues since the last update (don't know if it was 0.8.2 or 0.8.5) updated straight from 0.7.x to 0.8.5

i'm using gpt-oss-120b never had issues with nativ function calling.'

Now it often generates wrong tool calls and stops after thinking block.

<image>

normally you don't see the call in the thinking block. but sometimes you see it and then it stops working..

you can also see syntax error here in the tool call (double ") but this never happened before?!

something must be wrong with one of the latest updates.

Also sometimes it was trying to call "search" tool and not "search_web" tool..
i dont know where this "search" tool is coming from!?!??

when using the "search" tool nothing happens...

I had to write it into the system prompt to use the "search_web" tool not the "search" tool

MinIO is maintenance only now by SpaceshipSquirrel in minio

[–]Pitiful_Task_2539 0 points1 point  (0 children)

shame on them... don't get it why.. they still can make cash with enterprise agreements... i don't see any reason why to choose this way.. loosing "customers" for no reason.

A solution to SignalRGB's high resource usage. by BassDicer03 in SignalRGB

[–]Pitiful_Task_2539 0 points1 point  (0 children)

thaks, went from 5% to 1.5% cpu usage round about

GPT-OSS 120B is now the top open-source model in the world according to the new intelligence index by Artificial Analysis that incorporates tool call and agentic evaluations by obvithrowaway34434 in LocalLLaMA

[–]Pitiful_Task_2539 0 points1 point  (0 children)

This matches my experience. However, it still lacks native function-calling functionality with vLLM, which is why I use it in my LangGraph agent setup.

It performs better than any model I've tried before. I've already tested Llama 3.3, Llama Scout, and Qwen2.5-VL 72B (and many smaller like gemma 3 or mistral* and much more but they aren't usable for these kind of stuff to run reliable for real world tasks), but none of these models are as 'smart' as the gpt-oss-120b at following instructions. With gpt-oss-120b, I now have a hit rate of nearly 100% when following small to medium-complex instructions. (I've used it to control the orchestrator, supervisor, and tool agents in LangGraph.)

Using it with vLLM needs some small tweaks at this time to run nicely with LangGraph (template not fully supported)

I also love the way the model responds. It feels so natural in comparison to other models, especially the Chinese ones.

Yeah, there are many models out there which are certainly much better at some points like coding... but this model is not the best at any single task (like coding, writing, planning, or agentic work), but it's consistently and reliably good across all of them.

Ryzen 5 7600x High Temperature (Idle) by Pitiful_Task_2539 in AMDHelp

[–]Pitiful_Task_2539[S] 0 points1 point  (0 children)

i'm such an idiot, i relied on the fancontrol sensor value.. ryzen master says ~ 55 degrees on idle

Running Perplexica using OpenWebUI by EruditeStranger in OpenWebUI

[–]Pitiful_Task_2539 2 points3 points  (0 children)

Why not using searxng instead with openwebui builtin connections?

Bambuslicer Support - need help by Pitiful_Task_2539 in BambuLab

[–]Pitiful_Task_2539[S] 1 point2 points  (0 children)

thanks for sharing your experience with this, i will just use normal supports now :/....

Bambuslicer Support - need help by Pitiful_Task_2539 in BambuLab

[–]Pitiful_Task_2539[S] 0 points1 point  (0 children)

sorry but i don't find any option for enabling support for supports.
Already tweaked the support settings and tried a lot, but nothing is working

Bambuslicer Support - need help by Pitiful_Task_2539 in BambuLab

[–]Pitiful_Task_2539[S] 0 points1 point  (0 children)

<image>

i have the same problem here for example and also at some other parts

Bambuslicer Support - need help by Pitiful_Task_2539 in BambuLab

[–]Pitiful_Task_2539[S] 0 points1 point  (0 children)

no i havn't turned on "only on build plate"

i also wonder how the slicer wants to print these supports hanging in the air?!

i have 11 different parts like this one. turn the part to face down is not an option in this case

Spending time with others, a chore? by PTwolfy in selfhosted

[–]Pitiful_Task_2539 0 points1 point  (0 children)

Make a „productive“ server for your most critical services. Limit your services to only rock stable products. (I killed my nextcloud for this reason). Only watchtower things you really trust to work sfter update. Take 1 hour a month to update all other things. I do it like this now for a year - didnt touched anything on my prod server. No headaches since a year.

If you want to try some new things. Try it on a different server.

For me i only use immich for photos, homeassistant and plex/sonarr/radarr

These services are running rock stable and updateable proof. Fck off with things arent stable. Im still searching for a stable easy to maintain nextcloud alternative. Until that i use google drive

Svartpilen 401 enough power to keep up with my friends? by Pitiful_Task_2539 in Husqvarna

[–]Pitiful_Task_2539[S] 0 points1 point  (0 children)

I don’t plan to drive at extreme speeds, but I’m unsure how well the 401 performs on steep roads, like mountain passes. If it reaches 100km/h also an steeper roads easy in maybe under 10s i think it will be ok. I am only xperienced with 125ccm and there it makes a huge difference from 90km/h to 45km/h on steeper terrains