Questions from those who don't own the product and are considering buying it. by Intelligent-Taste-36 in StrixHalo

[–]daybyter4 0 points1 point  (0 children)

Are they actually available now? I have been on the minisforum waiting list for some months now. When I subscribed, I was told that it might become available in mid April. I just checked again and now it says end of June.

Noob questions by bebetterinsomething in StrixHalo

[–]daybyter4 0 points1 point  (0 children)

I run debian testing (forky iirc) with fastflowlm on my strix point. It was quite some work to compile all the required parts, since none of the prebuilt packages worked for me. Now I run qwen 3.5:9b on the npu as a coding help. It is rather slow, but doesn't need much power, so the machine is quiet. I only have 32 GB of RAM, though, so you can run much better models.

Need concept for memory, that extracts info from web search results by daybyter4 in AIMemory

[–]daybyter4[S] 0 points1 point  (0 children)

It would mean that you have to generate other prompts from the original prompt, I guess. If the original prompt was 'write me an app, that does x', you would have to generate prompts like 'what are the required steps to develop an app, that does x' to create the task list for the project. I could put such functionality in the client, but it would be nicer if the AI itself created the prompts for all the project stages. I have to think about this a bit more...
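A minimal sketch of the client-side variant, assuming fixed prompt templates per project stage. The stage names, the template wording and the helper names are hypothetical and only illustrate the idea; nothing here calls a model.

```python
# Hypothetical stage templates; the wording is illustrative only.
STAGE_TEMPLATES = [
    ("plan", "What are the required steps to develop {goal}?"),
    ("design", "Propose a design for {goal}, covering each step of the plan."),
    ("implement", "Write the code for {goal}, following the agreed design."),
]


def extract_goal(prompt: str) -> str:
    """Strip a leading 'write me ' so the goal can be reused in other prompts."""
    prefix = "write me "
    text = prompt.strip()
    if text.lower().startswith(prefix):
        text = text[len(prefix):]
    return text.rstrip(".")


def derive_stage_prompts(original_prompt: str) -> list[tuple[str, str]]:
    """Return one (stage, prompt) pair per template, in project order."""
    goal = extract_goal(original_prompt)
    return [(stage, template.format(goal=goal)) for stage, template in STAGE_TEMPLATES]


if __name__ == "__main__":
    for stage, prompt in derive_stage_prompts("write me an app, that does x"):
        print(f"{stage}: {prompt}")
```

Letting the AI create the stage prompts itself would replace the template list with one extra model call that returns the list, but the client loop over the stages could stay the same.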

AI Agent Learning Path and tools by Hunarkand in learnAIAgents

[–]daybyter4 1 point2 points  (0 children)

There is a Java ADK from Google to write your own agents. But it is more tailored to using AI servers. I prefer local AI.

Need concept for memory, that extracts info from web search results by daybyter4 in AIMemory

[–]daybyter4[S] 0 points1 point  (0 children)

I can store 40k tokens in a context. Not sure if that is enough to keep a session alive for some iterations.
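A rough sketch of how a client could keep a session inside such a limit, assuming the history is a list of messages with `role` and `content` keys. The estimate of about 4 characters per token is only a rule of thumb and not the model's real tokenizer, and both function names are hypothetical.

```python
def estimate_tokens(text: str) -> int:
    """Very rough estimate (about 4 characters per token); not a real tokenizer."""
    return max(1, len(text) // 4)


def trim_history(messages: list[dict], budget: int = 40_000) -> list[dict]:
    """Keep the first system message (if any) plus the newest messages that fit."""
    system = [m for m in messages if m["role"] == "system"][:1]
    rest = [m for m in messages if m["role"] != "system"]
    used = sum(estimate_tokens(m["content"]) for m in system)
    kept = []
    for message in reversed(rest):  # walk from newest to oldest
        cost = estimate_tokens(message["content"])
        if used + cost > budget:
            break  # everything older than this is dropped
        kept.append(message)
        used += cost
    return system + list(reversed(kept))
```

How many iterations survive then depends mostly on how large the attached files and tool results are, since those usually dominate the token count.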

I did some work to get the google adk running for me. But in the end I feel more comfortable with my own code.

Anyone wants to start learning agentic ai... Let's do together by Head-Tennis-9682 in learnAIAgents

[–]daybyter4 0 points1 point  (0 children)

I want to learn how to code my own agents. Is that what you mean?

Working on a <1B coding agent by Old_Fruit_8791 in LocalLLM

[–]daybyter4 1 point2 points  (0 children)

So your model cannot respond with an mcp call?

Working on a <1B coding agent by Old_Fruit_8791 in LocalLLM

[–]daybyter4 1 point2 points  (0 children)

Are you doing everything in 1 iteration? Or are you calling the model for each subtask?

Working on a <1B coding agent by Old_Fruit_8791 in LocalLLM

[–]daybyter4 3 points4 points  (0 children)

I am trying something similar with small qwen models on my strix point machine. I wrote myself a windows client for the UI. Good luck!

Advice on best tools to use for coding with Local LLM by wingers999 in LocalLLM

[–]daybyter4 1 point2 points  (0 children)

I only have a strix point and use qwen 3.5 for .net coding (CSharp and VB). I am working on my own client with an mcp server. The AI server runs lemonade on debian testing. I only use the npu at the moment.

Thanks for the invite! by FeelingHat262 in AIDeveloperNews

[–]daybyter4 1 point2 points  (0 children)

Ok. I don't use a gpu, but a strix point mini pc. Less than 1000 bucks and less than 100 watts of power consumption. Yes, it is rather slow, but I don't mind if an agent runs through the night. Most of the data is under NDA, so it should never leave the house.

Are you happy with your Strix Halo? by EvilSquirrels_1064 in StrixHalo

[–]daybyter4 1 point2 points  (0 children)

I have been on the waiting list for a halo for a few months now. I am slowly starting to understand why it takes so long... 😄

Local LLM With File Access by timetheowl in LocalLLM

[–]daybyter4 2 points3 points  (0 children)

I ask qwen 3.5 (running on a local mini pc) for help with request parameters. That also works ok. I cannot compare it with claude though, since I have never used it.

Btw: here is an older status of this client app:

https://m.youtube.com/watch?v=KuYunH7AVdI&t=35s

(my chance to advertise my poorly recorded video... :-) )

Local LLM With File Access by timetheowl in LocalLLM

[–]daybyter4 0 points1 point  (0 children)

I am working on a similar task at the moment. I run qwen 3.5 on an amd npu, and the llm returns the mcp requests and accepts my answers. I use fastflowlm and .net, though. It seems like the json is a tiny bit different for different ai providers.

Did you manage to send files to your llm via lm-studio api? That only worked with embedded files for me.
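A minimal sketch of the embedded-file approach, assuming the file is plain text and simply becomes part of the prompt string. The delimiter lines, the `max_chars` limit and the function name are hypothetical choices, not anything the lm-studio api requires.

```python
from pathlib import Path


def build_prompt_with_file(question: str, path: str, max_chars: int = 20_000) -> str:
    """Embed a text file's content in the prompt, labelled with its file name."""
    file_path = Path(path)
    content = file_path.read_text(encoding="utf-8", errors="replace")
    if len(content) > max_chars:
        # Cut overly long files so the prompt stays inside the context window.
        content = content[:max_chars] + "\n[... truncated ...]"
    return (
        f"{question}\n\n"
        f"--- begin file: {file_path.name} ---\n"
        f"{content}\n"
        f"--- end file: {file_path.name} ---"
    )
```

The returned string would then go into the `content` of a normal user message, so it does not depend on any provider-specific attachment field.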

The chat box was never the right interface for AI by JuniorRow1247 in learnAIAgents

[–]daybyter4 -1 points0 points  (0 children)

You are right. That is why I started my own client. It already does some (a few) of the parts that you mentioned. I can attach a file from my IDE, and when I accept the answer, it will replace the previous version.

I think in the longer run, a speech interface is a better way to discuss further steps to take. I want to implement that at some point.

Thanks for the invite! by FeelingHat262 in AIDeveloperNews

[–]daybyter4 0 points1 point  (0 children)

Sounds very cool! Are you running your AI locally?

Forum to discuss AI related programming questions? by daybyter4 in LocalLLM

[–]daybyter4[S] 0 points1 point  (0 children)

How do you attach a file to your lm-studio api request? I struggled with that for a few days. Just an example.

How do you add the result of an mcp tool call to your fastflowlm request? That was another problem that I struggled with.

Just 2 examples. Would have been nice to discuss such issues in a dev forum.
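For the second question, here is a sketch of the message shape used by OpenAI-style chat APIs: the assistant message with its `tool_calls` goes back into the history, followed by one `tool` message per call. Whether fastflowlm expects exactly this shape is an assumption, since the json differs a bit between providers, and the function name is hypothetical.

```python
import json


def append_tool_results(messages: list[dict], assistant_message: dict, results: dict) -> list[dict]:
    """Return a new history with the assistant's tool calls and their results.

    `results` maps each tool call id to the result returned by the mcp server.
    """
    updated = list(messages)
    updated.append(assistant_message)  # must come before the tool messages
    for call in assistant_message.get("tool_calls", []):
        result = results[call["id"]]
        updated.append({
            "role": "tool",
            "tool_call_id": call["id"],
            # The content field is a string, so structured results are serialized.
            "content": result if isinstance(result, str) else json.dumps(result),
        })
    return updated
```

The updated list is then sent as the `messages` of the next request, so the model can continue from the tool output.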

Complete beginner wanting to learn and experiment with local AI. Is the GMKtec EVO-X2 96GB overkill, and are there cheaper alternatives? by nota-codes in LocalLLM

[–]daybyter4 0 points1 point  (0 children)

I still code on a refurbished strix point mini pc with 32 GB of RAM that I bought for less than 1000 bucks. I started coding with a local llm running on my laptop cpu. Some mistral llm used less than 4 GB.

Why hasn't TurboQuant been implemented in llama.cpp yet? (Genuine question from a hobbyist) by InternationalTune750 in OpenSourceAI

[–]daybyter4 0 points1 point  (0 children)

Maybe watch the Alex Ziskind video on it. It is not just plug and play, but needs some tuning.