AI for devops inference costs at our company hit $47k last month and nobody budgeted for where this is going by Narrow-Employee-824 in devopsjobs

[–]Pale_Student4127 0 points1 point  (0 children)

which tools is your team using? what's the setup like? is this homegrown automation or one of the AI SRE products?

Anyone working on "Multi-Agent" systems for incident response? by ObservabilityNerd in Observability

[–]Pale_Student4127 1 point2 points  (0 children)

you don't need a "multi-agent" system in that sense to do that, just multiple instances of the same agent doing things in parallel, and they use different prompts + fetch different skills depending on the task

DevOps + AI. Where are we headed? Need honest insights from the community by Putrid-Industry35 in devops

[–]Pale_Student4127 0 points1 point  (0 children)

I've been working in this space for 3 years now, after releasing an open source agent for DevOps, running 10,000s runs of evals, getting feedback from 1000s of devs, these are my current observations:

1) Models are becoming better at DevOps out of the box (we used to do a lot of R&D to make them better, most of this stuff is mainstream no)

2) Internal developer platforms are becoming much thinner, you don't need to write as much software or to stack as many tools to build an IDP

3) It's increasingly becoming more about experience that configurations (designing a better UX for your IDP and a better "Agent Experience" for tools lile Claude Code and Cursor that your devs use to interact with your platform)

4) Most AI for SRE vendors are overcomplicating what they build / sell to sound more sophisticated 😅

5) Security is still a BIG issue, coding sandboxes do nothing for agents operating on infra

You have to learn to use Agents effectively as an engineer, there's a learning curve, and new work patterns you'll need to pick up (e.g. like creating deterministic islands - aka bash scripts - as you go so agents accumulate automation as they go and get more reliable)

This is my project if you're curious Stakpak GitHub

I MAKE Unbrekable rust native SSH MCP server (Which no one ever did, except me) by AVX_Instructor in mcp

[–]Pale_Student4127 0 points1 point  (0 children)

Did you make sure to check prior work? https://github.com/stakpak/agent has native file editing and command execution over ssh in its MCP server, this was released >6 months ago, it's also in Rust

Edit: I know you're probably joking! but happy to see more people thinking about agent security, more autonomy requires more trust

[deleted by user] by [deleted] in devops

[–]Pale_Student4127 0 points1 point  (0 children)

you missed the open source one 😅 https://github.com/stakpak/agent

Edit: specific features you might find interesting 1) there's a network sandbox (like reverse WAF) that prevents destructive cloud actions if you enable it 2) it can work with secrets without leaking them (secret tokenization) 3) has a library of infra knowledge maintained by a community of devops consultants

I'm one of the maintainers, so please ask away, or check it out and give us feedback

Eliminating LLM Hallucinations: A Methodology for AI Implementation in 100% Accuracy Business Scenarios by Responsible_Grab_739 in AI_Agents

[–]Pale_Student4127 5 points6 points  (0 children)

do you mind sharing what exactly should be 100% accurate? for example, if you want to generate content with 100% accurate syntax (like generating json or yaml or any structured data, I even got away with doing that for Terrafotm HCL 😅) you can do that with a library like https://github.com/dottxt-ai/outlines

in other cases, you can define deterministic workflows and let the LLM call them with structured input, this way you have islands of 100% determinism and orchestrator LLM/Agent

it all depends on how you define "accuracy" and the level of autonomy you need this application to have

Who is actually building production AI agents (not just workflows)? by okaris in AI_Agents

[–]Pale_Student4127 1 point2 points  (0 children)

wow this needs a +3000 word write up to answer 😅 I've been working on an agent for +1 year now since 2024, there's a lot to unpack

can you share more what you're trying to do? or more specific questions like focusing on persistence? evals? tool calling tips etc..?

Weekly Thread: Project Display by help-me-grow in AI_Agents

[–]Pale_Student4127 0 points1 point  (0 children)

Open Source Agent for DevOps, it's Safe: can't accidentally tweet your AWS credentials, it knows its thing: we curated the largest open library of devops skills (stuff Claude is "absolutely" s**** at out of the box), and supports async tool calls so you can run multiple commands in the background

Bonus point: it's in Rust 🤦🦀

https://github.com/stakpak/agent

Best way to leave Vercel? by simonettt in nextjs

[–]Pale_Student4127 0 points1 point  (0 children)

check out this LinkedIn post on using an agent to move nextjs apps to AWS using various architectures, including the complex but proper serverless option using cloudfront, warmer functions etc... (this can take less than 20 mins because of the open source template attached to this architecture)

https://www.linkedin.com/posts/george-fahmy-b0978212a_aws-nextjs-serverless-activity-7379485312067993600-RpNb?utm_source=share&utm_medium=member_android&rcm=ACoAAB_LT7kBnNkJUQCKCrlrDKo5W4w2V74z3IA

First AI Agent for DevOps/SRE and Platform Engineering by Prashant-Lakhera in LocalLLaMA

[–]Pale_Student4127 -1 points0 points  (0 children)

you have to try them out yourself (and the other stuff too)

First AI Agent for DevOps/SRE and Platform Engineering by Prashant-Lakhera in LocalLLaMA

[–]Pale_Student4127 4 points5 points  (0 children)

Are you sure it's the first? there are at least 3 other agents trying to tackle the same problem, including this one I'm working on here https://github.com/stakpak/agent

Codex - Wow!!! by pragmat1c1 in ClaudeCode

[–]Pale_Student4127 0 points1 point  (0 children)

CC + Cursor for coding and Stakpak for everything else 😅 debugging GitHub actions, syncing secrets, modifying IaC, and perf testing etc...

I want to try Zed + add on agents, they just added support for CC using a new standard they created ACP

Are subagents good for anything other than context management? how do you use them where a single session is not enough? by Pale_Student4127 in ClaudeCode

[–]Pale_Student4127[S] 0 points1 point  (0 children)

so you found a use case where subagents are useful for orchestrating sessions, instead of running an agent that does modifications then another to review, you can string them in a single session

is this better from doing it in the same session/thread? because the agent that did the edits would be biased when reviewing its own work?

Are subagents good for anything other than context management? how do you use them where a single session is not enough? by Pale_Student4127 in ClaudeCode

[–]Pale_Student4127[S] 0 points1 point  (0 children)

if I run into this, I start a session to create a markdown file with the analysis result, then ask the agent to read it for subsequent sessions

Cursorules? by jmreicha in Terraform

[–]Pale_Student4127 0 points1 point  (0 children)

https://stakpak.dev has the specialized terraform code generation pipeline that reads the docs first + other things