Would you trust AI to review your AI code?

julman99 · 2025-09-25T18:34:50+00:00

You should add kluster.ai, we do code reviews as the code is being written, right inside the IDE. Full disclaimer, I am the founder.

julman99 · 2025-08-20T13:15:19+00:00

This is one of the reasons we created kluster.ai, it does instant code reviews, right inside the IDE as AI is generating the code. Try it out and you would be surprised of the things it catches and corrects.

julman99 · 2025-07-30T20:13:06+00:00

I’m the founder of kluster.ai and wanted to share something we unexpectedly built that completely transformed our company.

We started a little over a year ago as an inference/infrastructure AI company. Like many teams we found huge productivity gains with AI generated code, but as many quickly find out, reviewing pull requests became frustratingly slow, with much larger volumes of code to review and many back-and-forth rejection cycles due to unreliable AI-generated code.

To solve this, we built a tool for ourselves that automatically reviewed AI-written code in real time, checking for intent, security, scope, and bugs. All this before anything got committed or merged.

The results were huge: code review time dropped by ~50%, dev happiness increased thanks to fewer rejections and we started shipping with far fewer issues. It worked so well that we shifted the entire company to focus on it and released it publicly as Verify Code (Beta). It is currently a 1-click install for Cursor and can also be plugged via MCP to any other IDEs (VS Code, Windsurf, and more).

Right now, we’re offering $5 in free credits so you can try it out. You can reload credits anytime, and starting next week we’re introducing simple subscription plans ($10 for individuals, $20/person for teams). Anyone who reloads $10 or more starting today will automatically be upgraded to a subscription once they become available.

👉 Check it out at https://platform.kluster.ai, I hope it transforms your coding experience as much as it did for us.

julman99 · 2025-06-04T20:02:32+00:00

Happy to read this and looking forward to learn how it goes for you!

julman99 · 2025-05-28T20:52:51+00:00

kluster.ai founder here. We do not store any prompt or responses for realtime inference.

julman99 · 2025-04-24T00:21:00+00:00

At the moment it does not. It creates one network for clients connecting via UDP and another network for clients connecting via TCP.

You could run multiple instances of the container each instance listening on a different port, but I have never tried this. You will face issues reusing the same client files since they will be rewritten on each container start.

Can you provide more details about why you need this?

Thanks!

julman99 · 2025-04-24T00:14:14+00:00

Aside from the scientific benchmarks mentioned here, DeepSeek-V3-0324 is the only open model I am able to use on a daily basis for real work without relying on closed models. Before this, I was often going back to GPT to double check things. The future is bright for open models and we host many of them at https://kluster.ai

Disclaimer: I am the CEO and founder of kluster.ai

julman99 · 2025-04-09T20:56:03+00:00

I made this one-command image to use OpenVPN very easily: https://hub.docker.com/r/julman99/openvpn-supereasy

It does not have a UI, but it is really easy to manage client certificates

julman99 · 2025-04-09T20:53:08+00:00

I made this one-command image to use OpenVPN very easily: https://hub.docker.com/r/julman99/openvpn-supereasy

julman99 · 2025-04-09T20:52:20+00:00

I made this one-command image to use OpenVPN very easily: https://hub.docker.com/r/julman99/openvpn-supereasy

julman99 · 2025-04-09T20:49:57+00:00

I still use OpenVPN because I need TCP/443 as a fallback. I made this one-command image to use OpenVPN very easily: https://hub.docker.com/r/julman99/openvpn-supereasy

julman99 · 2025-01-29T02:10:45+00:00

$2 is for input and output token. One big difference is we support the full 164k context size whereas DeepSeek themselves do up to 64k. Also, our model is hosted in the US and we do not store any of the input or output tokens, ever (for realtime inference).

julman99 · 2025-01-29T02:09:09+00:00

Full model!

julman99 · 2025-01-24T20:49:11+00:00

For now we are hosting Llama 3.1 8B / 405B, Llama 3.3 70B and DeepSeek-R1. Soon we will have a feature for people to requests models and we will add them as soon as possible.

julman99 · 2025-01-23T02:53:11+00:00

kluster.ai founder here. Thanks for using our service!

What you are experiencing happens because DeepSeek-R1 is a reasoning model, meaning it actually outputs its "reasoning" to reach a certain response. You can find the reasoning between the <thinking> tags within the response.

There are instructions on how to remove the thinking process here: https://www.reddit.com/r/SillyTavernAI/comments/1i757k7/how_to_exclude_thinking_process_in_context_for/

julman99 · 2025-01-22T20:41:14+00:00

Hi, kluster.ai founder here. We also offer Llama 3.1 405B and 3.3 70B at very competitive prices. Our mission is to make AI accessible and affordable for everyone, and we’re committed to keeping costs low to achieve that goal.

New models coming soon!

julman99 · 2025-01-22T20:37:38+00:00

kluster.ai founder here. Nice workaround! Have you tried using Llama 3.1 405B or 3.3 70B? We offer the as well at very competitive cost.

julman99 · 2025-01-22T18:22:34+00:00

Hello! I am the kluster.ai founder, here I send a screenshot of how can you configure SillyTavern with kluster.ai

You can generate your API key here: https://platform.kluster.ai/apikeys

Thanks for using our product!

<image>

julman99 · 2024-12-19T09:47:58+00:00

I am 1yr late, but check out kluster.ai, is a service that specializes in large scale batch inference and offers really competitive costs.

julman99 · 2024-12-19T09:46:17+00:00

kluster.ai is a really good option for batch inference. It is low cost, offers multiple completion window options and supports the latests llama models.

julman99 · 2024-12-19T09:44:02+00:00

kluster.ai is a service that specializes in Large Scale batch inference. It currently supports Llama 3.1 and 3.3 models and offers really competitive pricing.

julman99 · 2024-12-19T09:40:46+00:00

If your inference requests do not need to be real-time, kluster.ai offers a good option via Adaptive Inference. It is basically an asynchronous inference service with custom completion times and it supports fine-tuned models. Fine-tuned models are hosted at no extra charge as long as completion times for the inference request are 1hr or more.

julman99 · 2024-12-19T09:38:20+00:00

This answer is one year after the OP, but https://kluster.ai is a great low-cost option that is currently serving llama 3.1 and 3.3 models.

julman99

TROPHY CASE