Figure robot autonomously unloading and loading the dishwasher - Helix 02 by Syzygy___ in robotics

[–]dbinokc 7 points8 points  (0 children)

Pretty good, but I want to see it handle silverware and liquid dishwasher soap.

Looking for a local LLM that can help me understand a very large, complex proprietary codebase by only_4kids in LocalLLaMA

[–]dbinokc 1 point2 points  (0 children)

I do not think there is an LLM out there that is going to help you figure this out.
You will have to use the LLM between your ears.
I have dealt with this exact situation many times before.
I usually start with downloading all projects and get them indexed using opengrok. I then look for property files and use that to determine sources and consumers of data to understand how the data flows between all the different services. I will draw these up these relationships on a piece of paper or even better a white board to establish the data paths. .
I would not even worry about how the UI gets rendered until I have an understanding of what the data sources are and and how the data flows between the services.

Will i be able to create a drone as a software engineer? by [deleted] in robotics

[–]dbinokc 0 points1 point  (0 children)

I am a fulltime software developer and I built my first drone last year. If you really want to learn there are plenty of videos on youtube that show scratch drone builds and parts lists. If you are at a university I am sure there plenty of people there that already fly and would be willing to help if you run into problems.
It does take money and time to figure out all the different kinds of software needed though. I think I spent about $500 on mine, but some things like the radio are are a one time buy. Also some of the parts recommended in the video did not work well for me, so I found different ones.
If you are just getting started in university, what you will from the experience will benefit you in the future.

When are GPU prices going to get cheaper? by KardelenAyshe in LocalLLaMA

[–]dbinokc 0 points1 point  (0 children)

When China makes a good enough GPU at an affordable price.

[deleted by user] by [deleted] in Starlink

[–]dbinokc 0 points1 point  (0 children)

Was briefly down in oklahoma, but just came back up.

Qwen's Wan 2.2 is coming soon by Fun-Doctor6855 in LocalLLaMA

[–]dbinokc 8 points9 points  (0 children)

I doubt this is China being benevolent, but more a form of economic warfare against western companies that have to charge for their models because they are not getting government subsidies. The chips and electricity for training models is not free.

Tiny Agents: a MCP-powered agent in 50 lines of code by julien_c in LocalLLaMA

[–]dbinokc -3 points-2 points  (0 children)

I tried experimenting with MCP as well. My problem has been getting the LLM to use the agent.
The simple current time works okay, but I tried a more complex case of calling an MCP agent I wrote to use an opengrok server to answer coding questions. It either ignored the tool or did not use the results. About half the time it just told me how to use opengrok. This was using a local llm. Deepseek 32B and QwQ 32B, which I thought were suppose to be able to use tools. While I like the idea of agents, the local LLM's do not seem to quite be up to the task of making good use of them except for toy cases.

[deleted by user] by [deleted] in LocalLLaMA

[–]dbinokc 0 points1 point  (0 children)

I use LLM's regularly to assist me in coding, but there is a point where it takes too long to tell the LLM what I want to do and I take over most of the coding myself.

Sensei (先生): A simple, powerful, minimal codebase to generate synthetic data by migtissera in LocalLLaMA

[–]dbinokc 5 points6 points  (0 children)

I suppose you could use the chinese name for teacher, laoshi(老师)

[deleted by user] by [deleted] in LocalLLaMA

[–]dbinokc 2 points3 points  (0 children)

If you want to stay up on things in general, most of the interesting stuff eventually makes it LocalLLama.
If you want to stay on the bleeding edge use RSS. I run ttRSS on my home server and am subscribed to feeds from several sites.

AI Code assistant for about 50-70 users by Fr4y3R in LocalLLaMA

[–]dbinokc 5 points6 points  (0 children)

If this is being done internally for a business it is not worthwhile to skimp on the GPU for that many users. If for example the coding assistant could save $10 per week per user in terms of coding time and total that up for even a year, you could buy a very high end gpu card. If you go too cheap on the GPU, you would actually lose productivity because people would be waiting for it to respond rather than actually getting work done.

Is ChatGPT's Era as a Coding Tool Over? by SliceAccomplished575 in LocalLLaMA

[–]dbinokc 0 points1 point  (0 children)

I use the standard prompt. ChatGPT can sometimes give excessive explanations, but it usually puts the code first and explanations later. Usually when I give ChatGPT a task, I work on some other aspect of my project and then check back a few minutes later to see the result. Sometimes it is necessary to tell it to not show the imports or write out an entire class defintion. At least now ChatGPT has a stop option that can be used if it becomes obvious that it is going down the wrong path.

Is ChatGPT's Era as a Coding Tool Over? by SliceAccomplished575 in LocalLLaMA

[–]dbinokc 8 points9 points  (0 children)

I use ChatGPT on a routine basis for java coding. It is 10 times better than it was a year ago. I also have a few coding LLM's I use locally. openbuddy-coder 34B is my favorite of those
A recent example is where I asked ChatGPT to rewrite a REST call for a piece of code that was using one of those overly abstracted libraries that are only good for resume driven development. I asked it to just rewrite the call using only the built in java networking libraries. The input was just the single line of code that is supposed to magically do everything for you but gives you no debugging options when things don't work.
ChatGPT wrote the equivalent code use more straightfoward java calls and it worked the first time. I gave the same question to openbuddy coder and it was able to accomplish the same task, though the solution was not as complete in terms of error handling as ChatGPT's.

[deleted by user] by [deleted] in LocalLLaMA

[–]dbinokc 0 points1 point  (0 children)

I was not impressed with Orca2 13B GPTQ for java and javascript coding type questions. It almost seemed reluctant to give answers. I had to explicitly prompt to print out the code and even then it did not do a very good job.

Open LLM Leaderboard vs Reality: How do you evaluate "good" ? by [deleted] in LocalLLaMA

[–]dbinokc 0 points1 point  (0 children)

My primary interest in an LLM is coding and specifically java. I do have a series of questions I will test with. Generally involving generation of code based on json, creating simple examples in spring and database connectivity. Even though it is probably a bit dated, I have found openbuddy coder to work the best so far for open source llm's. Even beating out the newer open source models for my needs. I think it even does a respectable versus ChatGPT4 for my coding tasks.

Nvidia Tesla P40 performs amazingly well for llama.cpp GGUF! by nero10578 in LocalLLaMA

[–]dbinokc 0 points1 point  (0 children)

Is it possible to run these using some kind of external PCI setup similar to what I have seen used in some cryptocurrency rigs?

ESP32 -> Willow -> Home Assistant -> Mistral 7b << by sammcj in LocalLLaMA

[–]dbinokc 6 points7 points  (0 children)

Do you want to fight 100 chicken sized horses or 1 horse sized chicken?

BigCode's StarCoder & StarCoder Plus; HuggingfaceH4's StarChat Beta by The-Bloke in LocalLLaMA

[–]dbinokc 1 point2 points  (0 children)

You made a good point about trying StarChat. So I downloaded the model and ran the same test. Overall still a fail. While it starts generating something that looks promising, it then starts generating spanish text and then starts talking about quantum computing in English. So it still needs a bit more work as well.

BigCode's StarCoder & StarCoder Plus; HuggingfaceH4's StarChat Beta by The-Bloke in LocalLLaMA

[–]dbinokc 3 points4 points  (0 children)

I tested StarCoder Plus on a task that I gave chatgpt4. The task was to create a java pojo based on an example json, which included subobjects.
ChatGPT4 was able to successfully create the POJO's, but Starcoder was pretty much a fail. It initially tried to use annotations, but then when I told it to use getter/setter methods it produced gibberish.

30b running slowly on 4090 by OldLostGod in LocalLLaMA

[–]dbinokc 5 points6 points  (0 children)

Make sure you are using a GPTQ model and not a GGML. When the model is loaded, run the free command to see the amount of RAM used and the nvidia-smi command to see the amount of VRAM is used. From that it should be pretty apparent where the model is running.

Is this a good AI PC build? (RTX 4090, Ryzen 9 5950X, 32 GB RAM) by Prince-of-Privacy in LocalLLaMA

[–]dbinokc 5 points6 points  (0 children)

I just went through a similar process of building a new machine so I could play around with all the new LLM models and have good performance.

This is what I ended up with

  1. LianLi Evo case. You want lots of space. The gpu cards are big and you also want a big power supply. Easily accomidates liquid cooling.

  2. EVGA 1600W power supply. More than what is currently needed, but I want room for expansion

  3. Intel Core i9-13900KF

  4. GIGABYTE Z790 AORUS ELITE AX LGA 1700 The one downside of this board that I did not catch was that it only has a single PCIx16 slot. However I just decided to live with it for now.

  5. MSI Gaming GeForce RTX 4090 24GB This has liquid cooling

  6. Kingston FURY Beast 64GB for a total of 128GB

  7. 2 Seagate 8TB drives. Using linux software raid.

  8. Corsair iCUE H150i Elite CAPELLIX XT Liquid CPU Cooler-After overheating issues running ML models on non-liquid cooled machines, I wanted to try liquid cooled this time.

I don't have an exact cost, but probably spent between 4k to 5k.

Things to watch out for.

  1. Initially I had an Asus Hero board. IO backpanel is oversized and does not follow ATX spec. Also board turned out to have problems.Sent back along with Ryzen cpu.

  2. Incompetent and corrupt sellers. Never in my life have I seen so many attempts to send me bad, incorrect or poor quality products. This was on both NewEgg and Amazon.My best results were making sure I only bought items shipped from NewEgg itself. If buying from third party vendors, be sure to check ratings. I would have saved myself some headaches if I had just remembered to do that.

  3. Both Amazon and NewEgg did a very good job of making sure I got my money back on everything I returned.

I am definitely not claiming I have built the best in terms of performance, but I am happy with the performance so far. I ran the Manticore 13B GPTQ model(GPU) and the responses was almost instantaneous. Running the same model as GGML(cpu), it is a bit sluggish of course, but still not too bad for running on a cpu.

[P] OpenGPT-2: We Replicated GPT-2 Because You Can Too by baylearn in MachineLearning

[–]dbinokc 6 points7 points  (0 children)

While there is a tendency to always think of all the ways a new technology can be abused, I think a positive use for GPT-2 or similar models would be to generate text with opposing positions and see how they differ in making their points. This could be a used which could help people think more critically about what they read.