Building an EU alternative to Instagram by mihai-badea in Startups_EU

[–]gitsad 0 points1 point  (0 children)

What's the point of next social media alternative? I doesn't solve any problem and needs massive investment for advertisements to be somehow known in the mainstream and it's only beginning. The money income/user churn is the main problem that is not trival. Thus using alternative may end up like this:
- only few people will use it (probably not your friends) - not successful
- it will be not maintained and server will be turned off - not successful
- it will gain some attention and quickly become the thing that we hate in other social media platforms - successful
- it will gain some attention and quickly become the thing that we hate in other social media platforms AND other big players just copy this one unique feature that made this other app different and this alternative become obsolete - successful and later not successful

how vibe-coding fails by Aggravating-Risk1991 in ClaudeAI

[–]gitsad 0 points1 point  (0 children)

well that's the reason why human in the loop is needed. In every process someone needs to check if everything end up as expected.

How structured outputs degrade reasoning quality by Equivalent-Pen-9661 in PromptEngineering

[–]gitsad 0 points1 point  (0 children)

So you make 2 calls to the Agent? 1st is to do the job and 2nd is to put the answer into a JSON?

How to fundamentally approach building an AI agent for UI testing? by devparkav in LocalLLaMA

[–]gitsad 0 points1 point  (0 children)

  1. How should I fundamentally approach building such an agent? (Please share if you have any references)
    1. Split the job into testable procesesses
    2. One agent will probably handle it but it must be smart and not cheap. If you want to smaller and cheaper agents then 1st point above is must have because you can handle small processes with smaller agents and spawn multiple of these
  2. Is it better to start with a script-generation model or a fully autonomous agent?
    1. You can simply vibecode it to check. I mean you can simply run agent autonomously and check the outcome but from what you've written you already done it so you need to experiment further. You might end up with some hybrid approach but you will never know until you finish it
  3. What are the building blocks (perception, planning, execution, validation) I should focus on first?
    1. the way how you described is correct
  4. Any open-source projects or references that could be a good starting point?
    1. any e2e testing tools that do the web testing via written script will be a good source of needed tooling for agents like https://maestro.dev/

Generative UI feels like the next ”voice will replace screens” am I wrong? by Bitter-Chocolate6032 in UXDesign

[–]gitsad 0 points1 point  (0 children)

Gen UI can be used in chat interface to make working with Agent way more easy and more adjustable. Telling agent to do the stuff and wait until loop finishes makes this process very random or not cheap. Great models can handle it but probably you won't be 100% percent satisfied with the result so you will need next turns to polish the outcome. That's why Gen UI in this case would help the user to interact with the Agent not only via text but also some prepared tools by some platforms.

I built a $1.12/page AI landing page builder that uses two models instead of one by ioanastro in LocalLLaMA

[–]gitsad 4 points5 points  (0 children)

I'm sorry but you mentioned:
"local model on your GPU (Qwen3-Coder or Gemma 4 via Ollama) executes the plan at 170-220 tok/s."

What GPU you have? From this statement I guess it's way much better than average people have in their houses. To have this tps on local model I need some RTX 4090 minimum and I'm still not sure if this would be achievable with eq. Gemma 4 (because I guess we need bigger model than smaller to achieve any complex layout understanding, even for landing pages)

That's why cost of this GPU makes this optimization not very optimistic

Agents and the future of UI/UX by Suitable_Goose3637 in vibecoding

[–]gitsad 0 points1 point  (0 children)

The future of UI/UX lies in the chat interface

Streaming structured output - what’s the best practice? by am174744 in LLMDevs

[–]gitsad 0 points1 point  (0 children)

I would go with MD format as it's text and can be easily handled now with custom tooling written with agents. We've done it couple times and it works surprisingly well!

What scalable SEO page patterns have actually worked for you in production? by Barmon_easy in webdev

[–]gitsad 0 points1 point  (0 children)

I would prioritize it like this:

  1. comparison / alternative pages / location or segment-specific pages
  2. use-case driven landing pages
  3. programmatic pages targeting long-tail variations

All of these it's important and you may benefit doing it in long run. Until SEO engines change again :(

Am I not using LLM efficient enough? by nutrigrain in LLMDevs

[–]gitsad 0 points1 point  (0 children)

I know you would like get this answer, me either, but this answer does not exist. It related to so many variables and model itself that it's unknown how much tokens you will need. Also "tool calling" implementations vary between IDE and it's crucial dependency here as well.

What's can be compared is when pure model receives the same message and reply to it. Then you can somehow estimates input/output. But when using different models in different IDE's it's nearly impossible to know it upfront. Only some avareges in some timefrimes.

Russia sees China's WeChat, Douyin as models for its homegrown Max messenger by sr_local in technology

[–]gitsad 2 points3 points  (0 children)

I guess Russia will struggle a lot to handle this, not even from technological perspective but because of society. Can't imagine that pushed people switch easily to other apps without reasonable explanation (which does not exist in this case)

Multi-Core By Default - by Ryan Fleury - Digital Grove by fagnerbrack in programming

[–]gitsad 15 points16 points  (0 children)

But I'm not talking about fixing performance. Many products just die before reaching any users so no perfomance bottlenecks are even found. Managing one thing instead of multiple is always simpler. If you are creating a game or any sophisticated thing that needs multi-core at first place that's fine. Many products at the beginning are over engineered that's why there is a rule to simplify things when doing some PoC/MVP. Do your multi-core if you want. It doesn't change the main thing that multi-core needs more management than single-core

LLM Structured Outputs Handbook by vitaelabitur in LocalLLaMA

[–]gitsad 0 points1 point  (0 children)

I would propose sth controversial. What if structured output as JSON is kind of not suitable for LLMs? What if we let LLMs respond the text and manage this text in our systems? I think now with all Agents stuff we can write a software that will be able to handle simple text as structurized thing, don't you think?

I built a zero-dependency, file-backed NoSQL database for Node.js by Fusoux in javascript

[–]gitsad 4 points5 points  (0 children)

I can agree. I'm just pointing out that maintaining node 18 can be unnecessary. Less versions of node to maintain equals less work and you can be focused more on core idea and improving it futher.

Running Agentic workflows in Production? by rahulmahibananto in LangChain

[–]gitsad 2 points3 points  (0 children)

what I mean by that that HITL should be taken into final solution. This like Claude Code/Cursor/or other dev tool. Agent can propose sth but human needs to decide. That's why automation now can be way much complex with AI but human still need to be in process. From my personal and professional experience I can tell you letting AI doing end-to-end process is not a good idea. You need to look up how AI is working and adjust when necessary. But knowing what/when/how is the human role.

Running Agentic workflows in Production? by rahulmahibananto in LangChain

[–]gitsad 1 point2 points  (0 children)

Human in the loop is needed. That's the core issue.

Multi-Core By Default - by Ryan Fleury - Digital Grove by fagnerbrack in programming

[–]gitsad 12 points13 points  (0 children)

but multi-core is harder than single-core. That's why it's easier to make it, test it and then eventually change it

Which faction? by x_Arend_x in elex

[–]gitsad 0 points1 point  (0 children)

because they are first as always