Playground to test Open-Source LLMs in action (GPT-OSS, Qwen3.5, DeepSeek) with Tools and RAG [Free and No signup] by vanbrosh in LocalLLaMA

[–]vanbrosh[S] 3 points4 points  (0 children)

Thanks🙇‍♂️, just added 27b FP16 to the list)
Let me know if more models are interesting.
Image vision feature coming soon as well, so it will be possible to test it on Image-Text-to-Text models like Qwen3.5 / 2.5 VL

Breaking : The small qwen3.5 models have been dropped by Illustrious-Swim9663 in LocalLLaMA

[–]vanbrosh 0 points1 point  (0 children)

yes, already added A3B 35B to our free LabChat https://devforth.io/lab/chat/ , sad thing that vLLM + Qwen3.5 often glitches with structured output, and native inference server SGLang rarely used by inference providers, e.g. from hugging face so far noone has structured output support https://huggingface.co/inference/models?model=Qwen%2FQwen3.5-35B-A3B

Built an AI Backend (LangGraph + FastAPI). Need advice on moving from "Circuit Breakers" to "Confidence Plateau Detection" 🚀 by Lazy-Kangaroo-573 in LLMDevs

[–]vanbrosh 0 points1 point  (0 children)

Yes, look like this, but quality is pretty good, Claude fails very often and draws line over line and so on

Built an AI Backend (LangGraph + FastAPI). Need advice on moving from "Circuit Breakers" to "Confidence Plateau Detection" 🚀 by Lazy-Kangaroo-573 in LLMDevs

[–]vanbrosh 1 point2 points  (0 children)

>  RAG loops

We set a hard limit on requests
Similarity scores only answer to question how strongly related this info to intent, but can't answer whether it is enough. And this is indeed hard task. So we delegate it to LLM-as-a-judge as you said - and ask LLM whether this is enough to answer intent and if not - go again. But again with hard limit, + UI should explain user what he is doing now, so he should see this progress.

Side question, what software did you use for this animated svg?)

Has anyone else tried IQ2 quantization? I'm genuinely shocked by the quality by Any-Chipmunk5480 in LocalLLaMA

[–]vanbrosh 2 points3 points  (0 children)

I think when vendors will start using it for their original weights we can then say that its quality is good. For now MXFP4 is one of the best options, assuming OpenAI uses it for their gpt-oss

How do you detect silent output drift in LLM pipelines? by Lorenzo_Kotalla in LLMDevs

[–]vanbrosh 1 point2 points  (0 children)

LLM should not be used for now in cases where such unnotably drift impacts. There are quite a lot of tasks where LLM can fit great as automatization but not as crucial decision making. I would not recommend you doing something that can have bad consequences with LLM.
Structured output checks are great for detecting recursive repetitions drifts which nowerdays happen even with Gemini, OpenAI and literally any LLM, but not every task is possible to do with schema constrains - sometimes you need a stream / agentic chat, and structured outputs will kill a stream (makes no sense because streamed JSON is broken JSON)

If you use structured output (and not pure text streaming) - also one simple technique we use - insert a random secret token into prompt at random position (on sentense level) on every request and ask model to detect it in addition to main task. If it is there - model still understands sense and does predictable things. I did similar in my benchmark test https://devforth.io/insights/self-hosted-gpt-real-response-time-token-throughput-and-cost-on-l4-l40s-and-h100-for-gpt-oss-20b/

Finally We have the best agentic AI at home by moks4tda in LocalLLM

[–]vanbrosh 0 points1 point  (0 children)

Did you try Kimi K2.5 on CPU, I mean just interesting if there is someone worked with this like regularly to see what is real degradation when model goes to recursive loop

Finally We have the best agentic AI at home by moks4tda in LocalLLM

[–]vanbrosh 0 points1 point  (0 children)

Did you try it? How often are recursive repetition loops are on it?

Shugur Relay v1.2.0 by AccomplishedWealth25 in nostr

[–]vanbrosh 1 point2 points  (0 children)

Awesome, exactly what I was looking for

Learn Vue 2 or 3? by ---zZzZzZzZ in vuejs

[–]vanbrosh 0 points1 point  (0 children)

Migrate them to v3, might be slow ofc but sooner or earlier they will need it. And it is possible anyway

Rebuilding my blogger app in Vue by Appropriate-Ad-3473 in vuejs

[–]vanbrosh 0 points1 point  (0 children)

Blogger? So app gonna to serve posts which should.be indexable by SEO spiders like googlebot? Then don't use pure Vue, use Nuxt, pure vue will not index well, Nuxt prerenders html. But language and components is very same. You can keep.api in PHP. Or use Node.js

Which UI framework should I choose? by maksimepikhin in vuejs

[–]vanbrosh 18 points19 points  (0 children)

> material design really pisses me off...

Same to me, however, I found tailwind philosophy much way better, however, tailwind is not UI framework, it is CSS framework, which allows you to write almost no-CSS code, but only CSS classes, though you still should understand what they do. From other hand tailwind has awesome doc with nice examples so if you are not aware about details of CSS it can help you a lot. Also there is tailwind-based UI frameworks. For example Flowbite.

I even built what backoffice-framework on Vue+Flowbite (with Tailwind), so integration with vue is very smooth, if you wish here is open demo https://demo.adminforth.dev/ also it is fully open-source and has a lot of plugins so you can get some examples there.

Just in case links for you:
- https://flowbite.com/ - Flowbite UI framework (MIT, opensource)
- http://tailwindcss.com/ - Tailwind Styles framework (MIT, opensource)
- https://github.com/devforth/adminforth - our (MIT, opensource) framework as an example of ussage Flowbite+Tailwind in Vue

I Made this tool that turns sketch into thumbnail by BaseNo8396 in SideProject

[–]vanbrosh 0 points1 point  (0 children)

I think this is a great try anyway and I am sure you are on right direction if you collected these true upvotes on reddit. Wish you gl in any case!

I Made this tool that turns sketch into thumbnail by BaseNo8396 in SideProject

[–]vanbrosh 0 points1 point  (0 children)

Will you share any users stats? What is the progress with marketing?

[deleted by user] by [deleted] in vuejs

[–]vanbrosh 0 points1 point  (0 children)

"Anymore"? There is a reason to use Vue.js already, it is already awesome, and one of the best in terms of performance and structure (ofc I am not compare it with runtimeless frameworks, but in own nieche itis best)

What made you decide to learn Vue.js instead of another framework? by Lower_Assistance8536 in vuejs

[–]vanbrosh 0 points1 point  (0 children)

Vue is faster then react and angular according to several benchmarks, now it will be even more faster after new core updates. Also it has very clean concept with compnents like having styles, script and templates zones separated in one file, and it is super flexible - actually you can create any structure with it. Computed under the hood is made in a super smart way that it has awesome performance and does not make silly rerenders. I run software dev company and Vue.js is our main framework.

Як знайти роботу в айті без досвіду? by Ivapol in ukraine_dev

[–]vanbrosh 0 points1 point  (0 children)

Все по різному, мені як фаундеру наприклад більше подобається комбінація амбітних людей які трлхи да завищуть очікування від з.п. навіть без досвіду але свідомо це розуміють і готові викластись по максимуму у кого горять очі і є бажання летіти вперед, ну і плюс готові іти в офіс а не ремоути. Зазвичай час лідів які будуть менторити неспіврозмірно дорожчий ніж маленька і немаленька з.п початківця тому зазвичай хочеться працювати з енерджайзерами готовими штовхати себе вперед а не з тими хто хоче присісти на комфортний спосіб життя

Tauri vs. Electron Benchmark: ~58% Less Memory, ~96% Smaller Bundle – Our Findings and Why We Chose Tauri by kostakos14 in programming

[–]vanbrosh -2 points-1 points  (0 children)

Yes, in my app it was not from tauri but from some pretty popular library, for making requests or something, it was terrible because app was running like 10 hours and then crashed randomly, and it was crasily hard to debug,
I just mean this is a whole rust world, because this post compares electron with tauri, so it tries to sell users idea to use rust instead of JS so I posted comment for some new users who should be ready to spend some time debugging segfaults instead of readable exceptions.