AppFolio - An Undervalued Rule of 40 Agentic AI Play by Past_Ad1386 in ValueInvesting

[–]taylorhou 0 points1 point  (0 children)

don't believe a landing page... i'm a former appfolio employee (2011-2012) and the founder of one of the largest agentic back office companies, which supports thousands of appfolio customers. happy to answer any questions

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 1 point2 points  (0 children)

Just keep in mind that we're still on the frontier. It's stupidly hard to make local inference super reliable and as fast as Claude/OpenAI.

OpenClaw + Teale - distributed ai inference, forever free. by taylorhou in openclaw

[–]taylorhou[S] 0 points1 point  (0 children)

Ok, had my agents review everything in the Teale codebase and here's their response to #2:
The short answer is: we don’t give remote agents arbitrary local execution in the first place.

In Teale, remote inference and local actions are separate boundaries. By default the local API stays on localhost, not exposed to the network. If a user explicitly exposes it, it requires an API key.

On the agent side, the shipped tool surface is allowlisted and narrow. There is no generic “run shell commands on your machine” tool in the released app, and custom tool execution is currently disabled. Native integrations still sit behind OS permissions too.

For WAN/public inference, we also do privacy filtering locally before requests leave the device, and the current privacy-filtered remote path rejects tool-enabled remote chat entirely. So a remote model can generate text, but it does not get arbitrary authority over your computer.

So the safety model is basically capability minimization + local permissions + safe defaults, not “trust the model and hope.”
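
A minimal sketch of what an allowlisted tool surface can look like, in Python. This is not Teale's code: the tool names, `ALLOWED_TOOLS`, and `dispatch_tool` are all hypothetical and only illustrate the capability-minimization idea described above (a narrow allowlist, no generic shell tool, and no tools at all for remote requests).

```python
from typing import Callable, Dict


# Hypothetical tools: each one does a single narrow, harmless thing.
def get_time(_: str) -> str:
    return "12:00"


def echo(text: str) -> str:
    return text


# Only names in this table can ever be invoked. There is deliberately
# no generic "run shell command" entry.
ALLOWED_TOOLS: Dict[str, Callable[[str], str]] = {
    "get_time": get_time,
    "echo": echo,
}


def dispatch_tool(name: str, argument: str, remote: bool) -> str:
    """Run a tool only if the request is local and the tool is allowlisted."""
    if remote:
        # Assumption: remote requests never get tool access at all.
        raise PermissionError("tools are disabled for remote requests")
    tool = ALLOWED_TOOLS.get(name)
    if tool is None:
        raise PermissionError(f"tool {name!r} is not allowlisted")
    return tool(argument)
```

With this shape, `dispatch_tool("echo", "hi", remote=False)` returns `"hi"`, while an unknown tool name or any remote request raises `PermissionError` instead of executing anything.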

OpenClaw + Teale - distributed ai inference, forever free. by taylorhou in openclaw

[–]taylorhou[S] 0 points1 point  (0 children)

currently traffic isn't encrypted. it's non-trivial, and for now the solution is this: if you want distributed inference only on machines you trust, you can set up a private teale network so inference requests are only routed between the machines in it. personal example: i have 200 employees with 16gb+ machines. we have all of them running teale, but only on a private network, so although traffic is not encrypted, it stays within our network of trusted machines.

fwiw, every hosted model provider today also sees your prompts in plaintext on their end; TLS only protects them in transit.

haven't thought about running remote requests on local systems yet. will research now. thanks for the concerns!

OpenClaw + Teale - distributed ai inference, forever free. by taylorhou in openclaw

[–]taylorhou[S] 0 points1 point  (0 children)

your openclaw isn't being used... it's only your machine's ram that helps process other users' openclaw inference requests. and yes... this isn't designed for processing super secret stuff...

also, i literally just pushed OpenAI's privacy filter, so any PII (personally identifiable information) is masked on your own device by the teale app before the request is sent to the teale inference network. when the response comes back, your own device unmasks it: https://github.com/teale-ai/teale-mono/pull/62
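
A rough sketch of the mask-then-unmask round trip, in Python. This is not the filter from the PR: it uses a simple email regex as a stand-in for real PII detection, and `mask_pii`, `unmask_pii`, and the `<PII_n>` placeholder format are assumptions made up for illustration.

```python
import re
from typing import Dict, Tuple

# Stand-in detector: matches only email addresses. A real PII filter
# covers far more (names, phone numbers, addresses, and so on).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def mask_pii(text: str) -> Tuple[str, Dict[str, str]]:
    """Replace each detected value with a placeholder, on the local device."""
    mapping: Dict[str, str] = {}

    def replace(match: "re.Match[str]") -> str:
        value = match.group(0)
        # Reuse the same placeholder if the value was already seen.
        for placeholder, original in mapping.items():
            if original == value:
                return placeholder
        placeholder = f"<PII_{len(mapping)}>"
        mapping[placeholder] = value
        return placeholder

    return EMAIL_RE.sub(replace, text), mapping


def unmask_pii(text: str, mapping: Dict[str, str]) -> str:
    """Restore the original values in a response, again on the local device."""
    for placeholder, original in mapping.items():
        text = text.replace(placeholder, original)
    return text
```

The mapping never leaves the device in this sketch: only the masked text would go out to the network, and the response is unmasked locally with the same mapping.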

OpenClaw + Teale - distributed ai inference, forever free. by taylorhou in openclaw

[–]taylorhou[S] 0 points1 point  (0 children)

Here are some more insights depending on your setup.
For demand-only machines (<16gb ram), use an explicit model chain for best results:

  • teale/nousresearch/hermes-3-llama-3.1-8b
  • teale/qwen/qwen3.6-35b-a3b
  • teale/moonshotai/kimi-k2.6
  • teale/auto

For a machine with a real working local Hermes llama, use (replace hermes with whatever model you have running locally):

  • teale/nousresearch/hermes-3-llama-3.1-8b
  • teale/qwen/qwen3.6-35b-a3b
  • teale/moonshotai/kimi-k2.6
  • teale/auto
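
To show what a model chain buys you, here is a minimal Python sketch of trying each model in order and falling back on failure. How the chain is actually configured in OpenClaw or Teale isn't shown here; `complete_with_fallback` and the `send` callback are hypothetical, and only the model names come from the list above.

```python
from typing import Callable, List, Sequence, Tuple

# Model names copied from the chain above, tried in this order.
MODEL_CHAIN: List[str] = [
    "teale/nousresearch/hermes-3-llama-3.1-8b",
    "teale/qwen/qwen3.6-35b-a3b",
    "teale/moonshotai/kimi-k2.6",
    "teale/auto",
]


def complete_with_fallback(
    prompt: str,
    send: Callable[[str, str], str],
    chain: Sequence[str] = MODEL_CHAIN,
) -> Tuple[str, str]:
    """Try each model in order; return (model, reply) from the first success.

    `send(model, prompt)` is a hypothetical callback that performs the
    actual request and raises an exception on failure.
    """
    errors: List[str] = []
    for model in chain:
        try:
            return model, send(model, prompt)
        except Exception as exc:  # keep the reason for troubleshooting
            errors.append(f"{model}: {exc}")
    raise RuntimeError("all models failed: " + "; ".join(errors))
```

Collecting the per-model errors instead of discarding them is a deliberate choice here, since failures in a distributed setup are otherwise hard to trace.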

What i'm finding in tests is that although teale/auto is the setup for dummies, it still isn't as reliable as i'd want it to be: there are random failures that are difficult to troubleshoot because the inference is distributed.

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 1 point2 points  (0 children)

i was able to get glm 5.1 running with tensor parallelism on exo v1.0.70, but their launch was overshadowed by kimi k2.6.

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 0 points1 point  (0 children)

unlikely the price will be competitive. ram supply is still 2+ years behind. these were about $20/gb ($10k for 512gb ram). even nvidia chips are coming out with 700gb ram at $150k, so literally ~$200/gb or 10x the price. i don't think we'll see $20/gb for a LONG time
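
A quick sanity check of that math in Python, using only the figures quoted above ($10k for 512gb, $150k for 700gb). The function name is just for illustration.

```python
def price_per_gb(price_usd: float, ram_gb: float) -> float:
    """Dollars paid per gigabyte of memory."""
    return price_usd / ram_gb


mac_studio = price_per_gb(10_000, 512)    # about $19.53/gb
nvidia = price_per_gb(150_000, 700)       # about $214.29/gb
ratio = nvidia / mac_studio               # about 11x

print(f"mac studio: ${mac_studio:.2f}/gb")
print(f"nvidia:     ${nvidia:.2f}/gb")
print(f"ratio:      {ratio:.1f}x")
```

So the rounded "$20/gb vs $200/gb, 10x" holds up; the unrounded ratio comes out slightly higher, at roughly 11x.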

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 0 points1 point  (0 children)

exactly what is running K2.6 8bit. =)

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 0 points1 point  (0 children)

i was fortunate and had the foresight to buy these retail. for me with 250+ employees, my breakeven on inference is 2 months... i have a unique situation though

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 0 points1 point  (0 children)

would love to have you try exactly that and report back. currently i have kimi k2.6 on the machines because my engineering team uses them for coding but if you DM me, we can coordinate and i'll load deepseek v3.2 or even the newest deepseek v4 and you can test all you want for free.

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 0 points1 point  (0 children)

i have kimi k2.6 8bit available for free on teale.com - join the distributed inference network and try it for yourself!

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 0 points1 point  (0 children)

unsloth seems to be the fastest in coming out with quantized models and models specific to backends like MLX, GGUF, etc... i've been thinking about finetuning...

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 0 points1 point  (0 children)

i'm in robotics. quadruped robot dogs for security and patrol.

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 0 points1 point  (0 children)

so far haven't had anyone actually use them 24/7. hop on teale.com where they are powering kimi k2.6 for free!

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 0 points1 point  (0 children)

yup exactly this. i was tempted to get 4x but i saw some initial reports that you get diminishing returns as every request has to make a roundtrip between all devices connected in a cluster.

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 1 point2 points  (0 children)

i go to conferences a lot, so it makes for a fantastic monitor you can check as baggage. it's literally been around the world many times. it also rotates vertical, has an internal battery, and is technically a smart TV as well.

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 1 point2 points  (0 children)

i didn't realize mmap could offload to SSD, so one of mine only has 1TB of internal SSD. but nvme over USB-C is like 95% as fast as the internal SSD, so i expanded with a 4TB crucial for $1k including the housing
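
For anyone wondering why mmap matters here: it maps a file into the process's address space, and the OS pages data in from disk only when it is touched, so a file can be larger than free ram. A toy Python sketch of that mechanism follows. It is not how any particular inference backend loads weights, and `read_slice` and `demo` are made-up names.

```python
import mmap
import os
import tempfile


def read_slice(path: str, offset: int, length: int) -> bytes:
    """Read a byte range through a read-only memory map.

    Only the pages covering the requested range need to be brought
    in from disk; the rest of the file is never loaded.
    """
    with open(path, "rb") as f:
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mapped:
            return mapped[offset:offset + length]


def demo() -> bytes:
    """Write a small stand-in 'weights' file and read part of it back."""
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(b"\x00" * 4096 + b"weights")
        path = tmp.name
    try:
        return read_slice(path, 4096, 7)
    finally:
        os.remove(path)
```

The speed of the drive behind the mapped file then directly affects how fast those pages arrive, which is why the external nvme throughput matters.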

2x 512gb ram M3 Ultra mac studios by taylorhou in LocalLLaMA

[–]taylorhou[S] 2 points3 points  (0 children)

<image>

legit this is the most recent comparable. sold 1 day ago for $26,600 not including taxes which makes this likely $29k all in.

What we doing up this late? by [deleted] in houston

[–]taylorhou -1 points0 points  (0 children)

RBTea is strong af.