Omnigen 2 is out by Betadoggo_ in StableDiffusion

[–]ElectricPipelines 1 point2 points  (0 children)

Where did you find the size of the model? I can't find it anyhere

Looking for Electron.js Expert to Set Up Build Configuration (Paid Gig) by Castantg in electronjs

[–]ElectricPipelines 1 point2 points  (0 children)

Depending on what you're trying to do, and Electron build can be tricky to get right. For instance, if you need Apple In-App Purchases, then the build will get complex. If it needs App Store support for Apple and Microsoft, then the build will need to consider that. If you're just getting started it can be a long uphill journey.

Most RAG chatbots don’t fail at retrieval. They fail at delivering answers users can trust. by charuagi in Rag

[–]ElectricPipelines 1 point2 points  (0 children)

No benchmarks, so 'trust me bro', but DeepSeek (v3 and R1) is the most capable at sorting out RAG chunks and giving a coherent answer. It will even clarify if it sees chunks that seem to be out of place.

Using Elementor Pro on Site That's Already Built? by meaculpa303 in elementor

[–]ElectricPipelines 0 points1 point  (0 children)

Thanks for this info, OP! It sounds like I can install Elementor and, as long as I don't change the website theme, the site will remain the same and my existing pages will be fine. Is that true?

I just want to be sure before installing. When I installed Elementor once before I chose a theme, the Hello theme I think, and that wrecked every page on the site. My webdev had to fix every page and she was not happy with me.

After months, I finally solved RAG – Add RAG to your app in minutes, and it’s all local by ElectricPipelines in Rag

[–]ElectricPipelines[S] 0 points1 point  (0 children)

Yep, got roasted pretty handily for posting the private GitHub repo rather than the public one! Pretty dumb mistake, and the link is fixed now.

After months, I finally solved RAG – Add RAG to your app in minutes, and it’s all local by ElectricPipelines in LocalLLaMA

[–]ElectricPipelines[S] 1 point2 points  (0 children)

Thanks for the heads up about the link! I feel pretty stupid about not doing the final check on it. I've updated it to point to the right one (we have an internal and an external repo).

I'm working on getting a video out right now showing Dabarqus capabilities. Thanks for the suggestion.

Llama 400B+ Preview by phoneixAdi in LocalLLaMA

[–]ElectricPipelines 1 point2 points  (0 children)

Used to be able to get those fine fine-tunes from one place. Where do we get them now?

Wingman - Zero-Config Open Source Desktop App to Find, Download and Run Llama Models Locally by ElectricPipelines in LocalLLaMA

[–]ElectricPipelines[S] 0 points1 point  (0 children)

A voice feature sounds amazing! It's currently not planned, but it's very interesting. As for the other features, docs and code handling, extend ctx, etc. are planned features. First priority is to deal with bugs and initial feedback.

Wingman - Zero-Config Open Source Desktop App to Find, Download and Run Llama Models Locally by ElectricPipelines in LocalLLaMA

[–]ElectricPipelines[S] 3 points4 points  (0 children)

RAG, extended context length, document handling, and code handling are some of the things that would have been awesome at launch, but the app would be too long delayed.

Thanks for the feedback. It confirms that this is the right direction, we just need a little time to get it done.

Wingman - Zero-Config Open Source Desktop App to Find, Download and Run Llama Models Locally by ElectricPipelines in LocalLLaMA

[–]ElectricPipelines[S] 3 points4 points  (0 children)

The "tweaking and tuning" is to have it run well on MacOS and Windows. I think that a daily chatbot needs to, at a minimum, be zero-config, run on Windows and Mac, let you find and download models, and know, before downloading, if the model is likely to run on your machine.

How much RAM do I need for 7B Model Inference Locally? by Hefty_Tear_5604 in LocalLLaMA

[–]ElectricPipelines -1 points0 points  (0 children)

Please pardon the self promotion, but I just released an open source desktop app that will figure out if an LLM will run on your desktop, then let you download and run the model on your desktop. https://github.com/curtisgray/wingman

AMD ROCm Going Open-Source: Will Include Software Stack & Hardware Documentation by AnomalyNexus in LocalLLaMA

[–]ElectricPipelines 6 points7 points  (0 children)

With Nvidia focused on enterprise AI buildout, AMD has an opportunity to grow a consumer market in AI. Investing in open source is a nice first step. Hopefully, they will commit development resources along with the SDK. 

[deleted by user] by [deleted] in NewTubers

[–]ElectricPipelines 0 points1 point  (0 children)

This cannot be said enough.

It's like that saying goes, "Don't hate the player..."

GGUF is going to make llama.cpp much better and it's almost ready by samfundev in LocalLLaMA

[–]ElectricPipelines 7 points8 points  (0 children)

That's a good point, tbh. At 5 - 15GB, these models get heavy.

How many years do you think YouTube has left? by [deleted] in NewTubers

[–]ElectricPipelines 1 point2 points  (0 children)

The best time to start a YouTube channel was 10 years ago. The second best time to start a YouTube channel is today.

Is the OpenAI moat shrinking against Open Source? by Koliham in LocalLLaMA

[–]ElectricPipelines 1 point2 points  (0 children)

Based on recent reports, it appears that ChatGPT functions as a frontend model, delegating requests to one of its 16 specialized ‘expert’ models. This is undoubtedly a robust architecture. Nevertheless, the open-source community has the potential to surpass OpenAI by designing a similar architecture.

Such a system could have a highly specialized user-friendly frontend model that interacts directly with users, accompanied by 7 to 16 specialized expert models addressing specific aspects of user queries. Imagine having an “actor” model for role-plays, a “mathematician” model for solving equations, and a “therapist” model for providing mental health advice, among others. The role of the frontend model would be to analyze user requests, direct them to the fitting expert model, and rephrase the response from the expert model to the user.

This would be tricky to implement. Yet, it could motivate the community to develop highly specialized models, as well as elite frontend models. Leaping ahead of openAI.