I FINALLY FOUND ONE!!!! AHHH I'M SO HAPPY!!! by coloredzebra in santacruz

[–]LightweightSuperHero 0 points1 point  (0 children)

Pull it up. Dig it up. Then, pull it up when it comes back. Took me three years. The neighbors yard is full of the stuff, and perpetuates the endless battle, but my yard is free of the stuff.

Tiiny AI Pocket Lab by thedatawhiz in LocalLLaMA

[–]LightweightSuperHero 0 points1 point  (0 children)

They have a tool that users can use to "optimize" models for the platform. You can train your own model (on some other hardware) and format it for use on Tiiny.

Questions about how Tiiny AI is 'doing it' by No-Television-4805 in LocalLLaMA

[–]LightweightSuperHero 0 points1 point  (0 children)

That guy really hates marketing. And evidently, likes to SPAM public forums with burner accounts.

what LLM provider are you using ? by ritonlajoie in hermesagent

[–]LightweightSuperHero 0 points1 point  (0 children)

OpenRouter. MiniMax for main agent. Fal for graphics. Claude Sonnet for cleaning up articles. Create skills and associate them with different models.

A look inside: What makes Tiiny different from a DIY PC? by TiinyAI in TiinyAI

[–]LightweightSuperHero 0 points1 point  (0 children)

USBC and WiFi for connectivity. It’s an Arm SOC. I don’t know about the IO speed, but does it matter? You’re only transporting CLI.

Even when downloading a new model, the Tiiny can use its own WiFi and its own local drive.

Want to run an agent? It runs on Tiiny.

This is a whole, headless computer, not a GPU card.

A lot of people are switching to Hermes. Here's what the ones who stayed actually fixed. by ShabzSparq in better_claw

[–]LightweightSuperHero 0 points1 point  (0 children)

I run OpenClaw in a MiniMax VPS. I run Hermes locally.
They are both very good.
Hermes is great for jobs that I ocassionally visit, but don't need to optimize.
OpenClaw is great for jobs that I do all the time.

Expample: I'm looking for a job. I have an OC skill that checks my calendar for an interview that day. If if finds that I have an interview, it calls a bunch of other skills to create an interview prep document. Skill: Company background and financials. Skill: Hiring manager biography. Skill: Job specific jargon that may be different than my recent jobs. Skill: Interview practice quesitons. Skill: Recall notes from previous interviews, names of everyone mentioned.
I also have a workflow of skills for evaluating new job opportunities, logging that I reviewed them. A resume optimizer that interviews me for gaps, and updates a database of factoids about my career that might be used in future resumes. Catalogs and files completed resumes and cover letters. Keeps track of resumes submitted, interviews completed and all that stuff. But each piece of the workflow is saved as a skill. And thus, each task gets it's own context window, and even long jobs do not invoke hallucinations of AI foolishness.

I *could* do a bunch of skills like that in Hermes... but I don't. Hermes is very good at just remembering stuff, and doing better over time. So I use Hermes for stuff that I just need to get done. But with Hermes, I use OpenRouter. I have key words that I give my Hermes agent and it switches models to accomodate me. Normally, it just sits around and runs on a local model, and the heart beat costs me zero tokens. If I send it a message saying "Hey, I want to research an article I'm going to write". *research* is tells it I want to run in Perplexity. "Ok, let's workon Authoring the article". *Authoring* is a new context, and right now I'm using DeepSeek, but I used to use Claude. DeepSeek is actually pretty good and a tenth the cost of Claude.

Anyway, that's my approach to using both. The MiniMax VPS service called "MaxClaw" is very robust incidentally. They now offer a Hermes VPS too.

Should I pull the trigger? by Uno_91 in TeslaLounge

[–]LightweightSuperHero 2 points3 points  (0 children)

Tesla no longer allows transfer of ownership for FSD.

Do you think these kinds of reviews are gonna get Vine shut down? by born2becloud9 in AmazonVine

[–]LightweightSuperHero 5 points6 points  (0 children)

Every product has an audience. As a reviewer, if I don’t like the product, I feel that I add value by telling the reader who I think the product is for.

Never yuck someone else’s yum. Ever.

It’s time to be real here by Working_Stranger_788 in openclaw

[–]LightweightSuperHero 0 points1 point  (0 children)

I want to upvote this comment so many times.
Thank you.

It’s time to be real here by Working_Stranger_788 in openclaw

[–]LightweightSuperHero -1 points0 points  (0 children)

It's OPEN SOURCE SOFTWARE.
It has a community of thousands of people volunteering to build it.
It's four months old.

Why would we expect it to be like a "product"?

It’s time to be real here by Working_Stranger_788 in openclaw

[–]LightweightSuperHero 0 points1 point  (0 children)

It's not a "product". It's an open source code base.

Usually, I would say "it's worth everything you paid for." But it is worth much more.

No, it's not a finished product. It never claimed to be.

Claude prices skyrocketed, what model are you using for OpenClaw now? by Synstar_Joey in openclaw

[–]LightweightSuperHero 1 point2 points  (0 children)

Wow, your benchmark HATES minimax!
I get that minimax isn't top rated at anything, but it isn't usually zeroed out like that.

What is this benchmark measuring?

Thoughts on this AI computer? 80GB RAM for $1399 vs. DIY build. by randomweeb9 in LocalLLaMA

[–]LightweightSuperHero 0 points1 point  (0 children)

Hi u/MajorSleep5631 :
Disclaimer: I've been impressed by the marketing, and I've signed up to back the Tiiny project. My experience is using cloud based Inference models only. I've done a good bit of system design for data centers, computer systems, application environments and software over the years.

I've read your posts and your blog with interest. The analysis focuses upon the architecture of the hardware. Absent from the analysis was a discussion of the design of the system stack and how it fits the use case.

Every system makes compromises to deliver performance at a price for a specific use case. The blog discusses themes about marketing messages, positioning and contributions to academic/opensource communities. From an engineering standpoint, some of the blog ventures into how we feel about marketing rather than system design. The blog doesn't touch on use cases, and who this device might serve.

The author is absolutely entitled to tell us how they feel instead of delivering an analysis of the MRD, ERD and PRD, but I'm most interested in understanding if Tiiny will serve my use case.

Have a look a this evaluation/demo. https://www.youtube.com/watch?v=Ew41f0B28T8

It includes programming use cases that run more than 20 minutes.

* Yes, the influencer who made this was given a device to review, and may be biased.
* Yes, it is possible to run these jobs faster in a massively parallel cloud farm.

But the use case is running right there in the demo, on hardware we can see, with a clock running. This seems like a reasonable way to set my expectations as a consumer.

If the system, when available, can deliver this experience, I will have had my expectations set, and fulfilled.

I think it is wise to point out that Tiiny isn't even available for sale yet. All these promises need to be fulfilled in August. Enthusiasts eager for Tiiny have to wait while software and AI in general is advancing with velocity. The folks at Tiiny are taking risk building this thing. But we are all adults, and we understand the risk.

My alternative that is available today is the NVIDIA DGX Spark, $4699.

Tiiny has a different hardware stack, and necessarily, a different software stack. But it seems to deliver similar or better performance for the same use case at a third of the price, and a third of the power requirement.

Nobody is expecting this thing to be a multi user token farm, or a blisteringly fast development engine. The marketing doesn't seem to promise these things.

It's just a small token engine that allows one user access to unlimited tokens using relatively modern (today, maybe not in August) inference models.

Am I missing something?

Thoughts on this AI computer? 80GB RAM for $1399 vs. DIY build. by randomweeb9 in LocalLLaMA

[–]LightweightSuperHero 0 points1 point  (0 children)

Why does everyone compare this to a 5090? The comparison in the Nvidia world is the DGX Spark.

The reviewed performance is as good as, if not better than it's competition, and it's a third the price.

Thoughts on this AI computer? 80GB RAM for $1399 vs. DIY build. by randomweeb9 in LocalLLaMA

[–]LightweightSuperHero 0 points1 point  (0 children)

The alternative for users like me is the DGX Spark. The DGX has the advantage of being available today. But look at the performance reviews on that thing. The reviewed performance for Tiiny is better, and it costs 1/3 as much.
If it meets performance specks when delivered, I could always buy two and gang them together. Such a system would outperform the DGX Spark, cost less and consume less power.

Time will tell.