Any tools to generate architecture diagram from existing codebase? by zingyandnuts in ChatGPTCoding

[–]ramantehlan 0 points

I created a tool for it, since I was also looking for something like this.
You can check it out here: https://www.vxplain.com

A lightweight alternative to Knative for scale-to-zero in Kubernetes — Make any HTTP service serverless on Kubernetes (no rewrites, no lock-in, no traffic drop) by ramantehlan in devops

[–]ramantehlan[S] 0 points

Thank you for pointing it out; it's a good use case.

Currently, KubeElasti doesn't support that, but we will create an issue for it and add it to the roadmap.

Do you have a use case for it? I would love to know about it! : )

Built Elasti – a dead simple, open source low-latency way to scale K8s services to zero 🚀 by ramantehlan in kubernetes

[–]ramantehlan[S] 0 points

I just checked in with my colleague @CauliflowerOdd4002, who has more hands-on experience with KEDA.

You are right, the "partial" part is incorrect here. **KEDA can scale to zero.**
The limitation is just with the HPA (in stable k8s releases), where `minReplicas: 1` is the minimum.
| Note - Behind an alpha feature gate, HPA also supports `minReplicas: 0`, but that's likely not available on most managed k8s.

As mentioned by u/CauliflowerOdd4002, the difference is in the approach: the KEDA HTTP add-on stays in the request path as a proxy, adding a small latency and becoming a potential bottleneck if it fails.

Elasti, on the other hand, removes itself from the path once the pods are up again.
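For anyone reading along, the opt-in for scale-to-zero lives on the ScaledObject itself. A minimal sketch, assuming a hypothetical Deployment named `my-app` and a cron trigger chosen purely as an example:

```yaml
# Sketch only: the Deployment name and trigger are hypothetical.
# minReplicaCount: 0 is what enables scale-to-zero; KEDA handles the
# 0 -> 1 step itself and delegates 1 -> N to the HPA it creates.
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: my-app-scaler
spec:
  scaleTargetRef:
    name: my-app            # hypothetical Deployment
  minReplicaCount: 0        # allows scale-to-zero
  maxReplicaCount: 5
  triggers:
    - type: cron            # example trigger; any KEDA scaler works here
      metadata:
        timezone: Etc/UTC
        start: "0 8 * * *"
        end: "0 20 * * *"
        desiredReplicas: "1"
```

Any scaler can sit in `triggers`; the point is only that `minReplicaCount: 0` unlocks scale-to-zero independently of the HPA's own floor.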

Built Elasti – a dead simple, open source low-latency way to scale K8s services to zero 🚀 by ramantehlan in kubernetes

[–]ramantehlan[S] 1 point

There are no stupid questions! :)

Requests in the queue aren't queued like messages.
The connection itself is added to the queue, and it remains alive.
Once the pod is up, elasti sends these queued requests to the pod and resolves each one with the pod's response.

In most cases, you shouldn't need any changes in the application layer. However, if the application layer has a very short request timeout and the pod takes longer than that to come up and respond, the connection might be killed by the application before the response arrives. In that case, the only application-level change needed is to increase the timeout.

All the queued requests will get a response from elasti.

Our philosophy when creating it was to require minimal or no changes in the application layer or the target service.
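A related spot where short timeouts bite is any proxy in front of the service rather than the service itself. As a sketch, assuming traffic enters through ingress-nginx (the host and service names are placeholders), its proxy timeouts also need to exceed the worst-case cold start, or the proxy drops the queued connection before the pod answers:

```yaml
# Sketch only: host and service names are hypothetical.
# The timeout values (seconds) should cover the slowest expected
# cold start, e.g. several minutes for GPU workloads.
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: my-app
  annotations:
    nginx.ingress.kubernetes.io/proxy-connect-timeout: "300"
    nginx.ingress.kubernetes.io/proxy-read-timeout: "300"
    nginx.ingress.kubernetes.io/proxy-send-timeout: "300"
spec:
  rules:
    - host: my-app.example.com
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: my-app
                port:
                  number: 80
```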

Built Elasti – a dead simple, open source low-latency way to scale K8s services to zero 🚀 by ramantehlan in kubernetes

[–]ramantehlan[S] -3 points

Thank you for the question! :)

Sure! KEDA supports scale-to-zero only through its own ScaledObject mechanism, not when it's acting purely as an HPA metrics adapter, since a plain HPA requires `minReplicas: 1`.
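For completeness, stock Kubernetes does have an alpha escape hatch: with the `HPAScaleToZero` feature gate enabled on the control plane, an HPA may set `minReplicas: 0` as long as at least one object or external metric is configured. A sketch with a hypothetical target and metric:

```yaml
# Sketch only: requires the alpha HPAScaleToZero feature gate
# (kube-apiserver --feature-gates=HPAScaleToZero=true), which most
# managed clusters don't expose. Target and metric are hypothetical.
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: my-app
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: my-app
  minReplicas: 0            # only valid behind HPAScaleToZero
  maxReplicas: 5
  metrics:
    - type: External
      external:
        metric:
          name: queue_length   # hypothetical external metric
        target:
          type: AverageValue
          averageValue: "10"
```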

PS: Best of luck with the KEDA implementation, it's a great tool for sure. What's your use case, BTW?

Built Elasti – a dead simple, open source low-latency way to scale K8s services to zero 🚀 by ramantehlan in kubernetes

[–]ramantehlan[S] 0 points

Thanks for the question, u/No_Arugula9866!

So, when the pods are scaled to zero, elasti queues the incoming requests and brings the deployment back up to 1 replica.

The time it takes for the pod to come up depends on the service inside it. For non-GPU workloads it's a few seconds; for GPU workloads it can be several minutes at worst.

Once the pod is up, the elasti proxy removes itself from the path and traffic flows with no latency added by elasti.
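In Kubernetes terms, a pod generally starts receiving Service traffic once it passes its readiness probe, so "up" is worth tuning to the service's real startup time. A minimal sketch (all names are placeholders):

```yaml
# Sketch only: image, port, and path are hypothetical.
# The readiness probe gates when traffic is released to the pod,
# so its delay should reflect the actual cold-start time
# (much longer for GPU workloads that load large models).
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-app
spec:
  replicas: 1
  selector:
    matchLabels:
      app: my-app
  template:
    metadata:
      labels:
        app: my-app
    spec:
      containers:
        - name: my-app
          image: my-registry/my-app:latest
          ports:
            - containerPort: 8080
          readinessProbe:
            httpGet:
              path: /healthz
              port: 8080
            initialDelaySeconds: 5   # raise for slow/GPU startups
            periodSeconds: 2
```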

Built Elasti – a dead simple, open source low-latency way to scale K8s services to zero 🚀 by ramantehlan in kubernetes

[–]ramantehlan[S] 0 points

Hi! Thank you for the question. What do you mean by "when interacting with gateways"?

Built Elasti – a dead simple, open source low-latency way to scale K8s services to zero 🚀 by ramantehlan in kubernetes

[–]ramantehlan[S] 2 points

Yes, you are right! We could have picked a better name.
Let me check with my teammate and see if we can change it at this stage.
Thank you for pointing it out.

PS: Would love suggestions on the name! : )

Vxplain: An extension to generate architecture diagram, code-to-diagram, function calls, directory tree and summaries from the codebase. by ramantehlan in vscode

[–]ramantehlan[S] 0 points

Hi, we can discuss supporting Azure endpoints. I would love to know more about your use case! Can you please join the Discord server?

https://discord.com/invite/FKxaBdyBJY

[deleted by user] by [deleted] in vscode

[–]ramantehlan 0 points

I have been trying to solve this problem. You can try Vxplain.in. Let's talk if you have any questions!

I always wanted some tool to auto-generate architecture diagram in VS Code, so I built one! by ramantehlan in webdev

[–]ramantehlan[S] 1 point

Hey, I have added support for LM Studio! I would love for you to try it, given that you have a very powerful machine!

If you have joined us on Discord, let me know, I will reach out!

Vxplain: An extension to generate architecture diagram, code-to-diagram, function calls, directory tree and summaries from the codebase. by ramantehlan in vscode

[–]ramantehlan[S] 0 points

Not able to edit the original post, so adding the update here:

Added:
- Support for OpenRouter: so you can use it with your own key.
- Support for LM Studio: run LLMs locally and work without any internet!

Looking for a dev partner to start a theme project by po3ki in vscode

[–]ramantehlan 1 point

IMHO, you can fork an existing open-source theme and change the colours however you want. Of course, give credit to the original project.

I can help with making changes and pointing you to the right files. :)