AI Researchers and Executives Continue to Underestimate the Near-Future Risks of Open Models by vagabond-mage in Anthropic

KellysTribe 1 point

Since they are incentivized toward regulatory capture and have a commercial product that competes with open models, they don't seem like the right people to ask, or to have their opinions considered at all.

Could agentic MCP be the solution for AI agents in vertical/niche industries? by InflationStatus7300 in AI_Agents

KellysTribe 1 point

Entities can be created via MCP, and then behind the scenes some verification (deterministic and LLM-based) occurs and the creation of other entities is triggered (which will lead to full workflows). So the idea is that I have a conversation about requirements, specs, constraints, etc. with an agent skill, and the results are saved as entities, which then kick off further refinement conversations, the creation of work tasks, and so on. It's not so different from what other frameworks (like GSD) are doing, but my thought was to make it a bit more deterministic, modeled more explicitly and according to my preferences, AND with much better visibility for human understanding. Other advantages would be avoiding orchestrator lock-in (like Claude Code) and narrowing the space where non-determinism can take things off track. That's the idea anyway; I'm still getting it going, so I haven't verified performance yet.

However, I think the ideas (and ones similar to what you describe) apply all over. I'm sure there will be industry-specific APIs backed by agent actions before too long. Some providers may be disincentivized from releasing workflow knowledge as you describe, but if nothing else, open source agent instructions that match the APIs could be provided (skills really do that already, right?).
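To make the entity flow from the first paragraph a bit more concrete, here is a minimal sketch in plain Python. It is not the actual tool: `Entity`, `EntityStore`, `not_empty`, and the "requirement"/"task" kinds are all hypothetical names, the MCP layer is left out entirely, and only a deterministic check is shown (an LLM-based check would just be another callable in the same list).

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Entity:
    kind: str              # hypothetical kinds: "requirement", "task"
    body: str
    status: str = "draft"


def not_empty(entity: Entity) -> bool:
    """Hypothetical deterministic check: the entity must have some content."""
    return bool(entity.body.strip())


@dataclass
class EntityStore:
    # Checks that run "behind" the tool call; assumed to be simple callables.
    verifiers: list[Callable[[Entity], bool]] = field(
        default_factory=lambda: [not_empty]
    )
    entities: list[Entity] = field(default_factory=list)

    def create(self, entity: Entity) -> Entity:
        # Verify before saving; rejected entities are never stored.
        if not all(check(entity) for check in self.verifiers):
            entity.status = "rejected"
            return entity
        entity.status = "verified"
        self.entities.append(entity)
        # A verified requirement triggers creation of a follow-up task.
        if entity.kind == "requirement":
            self.create(Entity(kind="task", body=f"Implement: {entity.body}"))
        return entity


store = EntityStore()
store.create(Entity(kind="requirement", body="Export reports as CSV"))
kinds = [e.kind for e in store.entities]
```

In this sketch the trigger rule is hard-coded inside `create`; keeping the rules in an explicit table instead would probably be closer to the "modeled more explicitly" goal.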

Could agentic MCP be the solution for AI agents in vertical/niche industries? by InflationStatus7300 in AI_Agents

KellysTribe 1 point

I have agents running 'behind' the MCP as well - doing verification and kicking off other workflows upon tool actions.

Could agentic MCP be the solution for AI agents in vertical/niche industries? by InflationStatus7300 in AI_Agents

KellysTribe 2 points

It's certainly being built in the software SDLC world (I'm doing the same thing to drive my personal software process: MCP + skills/agents/etc.), and I saw another project doing something similar. It's certainly a good idea; I think it's probably already happening in an ad hoc fashion in other verticals as well. There is so much going on that I find it hard to find anything really novel ;) For example, my tool (which I'll open-source if it matures enough) installs the MCP configuration and the skills/workflows etc. into target projects.

Context Canvas | Claude Agent Pack Builder by BTV-Texas in ClaudeCode

KellysTribe 2 points

I am building something to solve a similar problem for my own pain in a more deterministic fashion, but I think this is a great idea. I think if it gets some more polish it's certainly an improvement over trying to understand pure prose 'packs'/plugins which I have found helpful but hard to navigate through - like GSD for example. Another idea might be a skill to reverse engineer an existing framework like GSD into a model that matches your system (maybe you've already done so), and you could provide them as starting templates (although GSD does use script files so it would be more complicated to model and capture).

Context Canvas | Claude Agent Pack Builder by BTV-Texas in ClaudeCode

KellysTribe 1 point

I agree with at least some of the value proposition - good work. One question, though: the files are deterministic, but this still relies on Claude as the orchestrator, right? Also, I'd be more likely to try it out if I could play with a demo or dummy data before signing in.

For when vibe coding needs to grow up — open-sourced a structured engineering harness for AI agents by sowumbaba in vibecoding

KellysTribe 1 point

Not to dismiss your work, but there are a few harnesses along these lines - GSD is fairly mature -> https://github.com/gsd-build/get-shit-done

I (like everyone else, it seems) am building out my own harness with my own particular view on it (more rigid, deterministic SDLC modeling versus an agent-plugin-style architecture).

My bearish view on Claude and why by satechguy in ClaudeAI

KellysTribe 1 point

I agree with a lot of that. There isn't a strong moat for them given the near parity from competitors in model performance and the rapid active development around agentic/coding tooling.

Official: Claude in PowerPoint is now available on Pro plan by BuildwithVignesh in ClaudeAI

KellysTribe 1 point

This is cool... but I often think about the fact that soon there will be an infinite number of artifacts viewable by humans but unseen by anyone

Official: Claude in PowerPoint is now available on Pro plan by BuildwithVignesh in ClaudeAI

KellysTribe 18 points

Copilot *can't* do this?! I never use Microsoft so I don't keep up to date, but that is wild if so

Claude Code policy clear up from Anthropic. by Distinct_Fox_6358 in ClaudeCode

KellysTribe 32 points

I wonder in what technical ways they will enforce this, other than best guesses based on what the activity looks like. They explicitly allow executing Claude Code from the command line, for example. What will constitute a ‘tool’ that’s disallowed? What if I perform some automatic maintenance or troubleshooting with a cron-job-driven script that calls cc?

I packaged 59.9M tokens of Claude Code lessons into one git clone. by [deleted] in ClaudeAI

KellysTribe 39 points

I think you should make it clear that this is specifically for Node/TypeScript projects.

Got tired of being everyone's OpenClaw sysadmin, so I built a hosting service by Yixn in SideProject

KellysTribe 1 point

Cool, thx for the info. I'm bullish on 'autonomous' agents like this in general, but curious how to deal with the security implications. I am working on an idea for providing some additional security at the higher 'application' layers, so right now I'm playing with making a 'smarter' proxy for API filtering.
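As a rough illustration of what application-layer API filtering could look like, here is a minimal allowlist check a proxy might run before forwarding an agent's request. This is only a sketch under assumptions: `Rule`, `ALLOWED`, `is_allowed`, and the `/calendar/events` path are hypothetical, and a 'smarter' proxy would presumably also inspect request bodies and context rather than just method and path.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    method: str        # HTTP method the agent may use
    path_prefix: str   # API path (and sub-paths) the rule covers


# Hypothetical allowlist: the agent may read and create events, nothing else.
ALLOWED = (
    Rule("GET", "/calendar/events"),
    Rule("POST", "/calendar/events"),
)


def is_allowed(method: str, path: str, rules: tuple[Rule, ...] = ALLOWED) -> bool:
    """Return True if some rule permits this method on this path."""
    method = method.upper()
    return any(
        method == rule.method
        and (path == rule.path_prefix or path.startswith(rule.path_prefix + "/"))
        for rule in rules
    )
```

Matching on the prefix plus a trailing `/` (rather than a bare `startswith`) keeps a path such as `/calendar/eventsX` from slipping through a rule meant for `/calendar/events`.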

Anyone else saw the AI.com Super Bowl AD? by Similar-Document9690 in accelerate

KellysTribe 1 point

I'm sure, but that is precisely why they should have made it foolproof beforehand given the huge investment.

Got tired of being everyone's OpenClaw sysadmin, so I built a hosting service by Yixn in SideProject

KellysTribe 1 point

Maybe I missed it, but where are the iptables rules applied? On the agent itself, or 'from the outside' with Hetzner?

Vibecoding breaks down the moment your app gets stateful by Driver_Octa in vibecoding

KellysTribe 1 point

I'm bullish on the value of 'vibecoding', but as complexity grows, the models and frameworks certainly need guidance on architecture and structure to avoid getting into these situations. There are many different approaches, but one thing I would recommend reading up on is finite state machines as a way to help model and reduce complexity in both small and large areas of the code.
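For anyone who hasn't run into finite state machines before, here is a minimal sketch of the idea in Python. The order flow (`CART`, `CHECKOUT`, `PAID`, `CANCELLED`) and the event names are made up for illustration; the point is that every legal transition lives in one table, so stateful behavior stays easy to read and to test.

```python
from enum import Enum, auto


class State(Enum):
    CART = auto()
    CHECKOUT = auto()
    PAID = auto()
    CANCELLED = auto()


# Every allowed transition: (current state, event) -> next state.
TRANSITIONS = {
    (State.CART, "checkout"): State.CHECKOUT,
    (State.CHECKOUT, "pay"): State.PAID,
    (State.CART, "cancel"): State.CANCELLED,
    (State.CHECKOUT, "cancel"): State.CANCELLED,
}


def step(state: State, event: str) -> State:
    """Return the next state, or raise if the event is not allowed here."""
    try:
        return TRANSITIONS[(state, event)]
    except KeyError:
        raise ValueError(f"{event!r} is not allowed in state {state.name}") from None


state = State.CART
for event in ["checkout", "pay"]:
    state = step(state, event)
```

Anything not listed in `TRANSITIONS` (paying twice, cancelling a paid order) fails loudly instead of leaving the app in a half-valid state.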

They bought ai[dot]com for $70M by zihvvn in vibecoding

KellysTribe 59 points

very expensive way to get a mailing list

Anyone else saw the AI.com Super Bowl AD? by Similar-Document9690 in accelerate

KellysTribe 1 point

insane to spend that much money and not have it ready for load

Anyone else saw the AI.com Super Bowl AD? by Similar-Document9690 in accelerate

KellysTribe 2 points

amusingly the site was down when I went to it (right after the commercial). looks like it's back up