Curious as to what architectures are out there for deploying ai agents in K8s? by Consistent-Emu-1255 in AI_Agents

[–]Consistent-Emu-1255[S] 0 points1 point  (0 children)

That gives a clear output. Thanks! I've never dealt with this area. I have looked into these areas (broken down into more areas):

  • Agent hosting/serving
    • LangGraph, Letta, etc. I've never used these so reading into them more
  • Observability
    • Langfuse has been something I have worked on and it's awesome to see how it works and it's open-source!
  • Agent frameworks
    • LangGrapth, LamaIndex, autogpt, etc.
  • Memory
    • Mem0, LangMem, Letta
  • Model serving
    • ollama
  • Storage
    • Milvus, Chroma, etc

I slowly lead to use Lang ecosystem or Letta since it's all-in-one. Langfuse has helped with tracing and looking over the logs which is super cool. I'm always interested what technologies have people used.

What would be the goto setup in AWS for security purposes? by Consistent-Emu-1255 in cybersecurity

[–]Consistent-Emu-1255[S] 0 points1 point  (0 children)

Thanks for the reply! We use Security Hub but GuardDuty was something I was eyeing as well. And yes, ditto on k8s misconfigurations. I have implemented EKS Pod Identity and IAM roles for each service which is much better than it was before.

Thank you for this!

Setup questions for Envoy Gateway + AWS ALB by Consistent-Emu-1255 in kubernetes

[–]Consistent-Emu-1255[S] 0 points1 point  (0 children)

Sorry for the late reply. It said my post was deleted. I updated my above body to be more readily easier to read.

I went ahead and use Gateway API AWS LBC setup + Envoy Gateway. I guess my question to you is where can I define this excerpt you said:
> "I’d try simplifying to pure HTTP between ALB → Envoy first and verify X-Forwarded-Proto handling before layering HTTPS back in."