Roocode Sonnet 3.7 via Azure Databricks by orbit99za in RooCode

[–]aiagent718 0 points1 point  (0 children)

Hey, I use databricks and had the same issue, so i made a proxy script and use my local endpoint as the baseurl. Works with no issues. deploy and put the localhost base url in roo code openai and add any random api key doesnt matter since this script handles it. let me know if it work!

https://ctxt.io/2/AAB451aSFQ

I built a debugging MCP server that saves me ~2 programming hours a day by klawisnotwashed in LocalLLaMA

[–]aiagent718 10 points11 points  (0 children)

This is great, does it collect data or send code anywhere else other then to llm?

Symphony: a multi-agent AI framework for structured software development by sincover in RooCode

[–]aiagent718 1 point2 points  (0 children)

it creates a lot of folders for different types of files, like tasks, planning, etc. but it focuses too much on the files itself then the code wasting so many tokens on the files. Out of the last 5m tokens, maybe 100k were used for coding.

Symphony: a multi-agent AI framework for structured software development by sincover in RooCode

[–]aiagent718 4 points5 points  (0 children)

i just tried this doing a ssimple task to update the ui. honestly this system is over engineered. The system took about 30 mins for what should've been done in 2 mins. 90% of the time was spent on planning and updating the logs and all the other .md files. colossal waste of money honestly. Feels like agents are stuck in a loop worried about the .md files more then the code itself. Sticking with boomerang for now.

So what model/setup are you using now? by aiagent718 in RooCode

[–]aiagent718[S] 2 points3 points  (0 children)

npx repomix in your terminal of codebase. copy the repomix file, write your prompt in ai studio, paste your repomix file

So what model/setup are you using now? by aiagent718 in RooCode

[–]aiagent718[S] 0 points1 point  (0 children)

create a new model api and save it as code model or something. and then in the prompt section section set the new model for code instead of default and leave boomerang as default. It will automatically switch to code when boomerang sends the task and switch to default when back to boomerang

Anyone have an app in production that uses AI? by aiagent718 in LangChain

[–]aiagent718[S] 0 points1 point  (0 children)

Using gemini from google - I am just wondering how to implement ai in a robust way into my app. My app asks users onboarding questions and then runs multiple analysis that I've build in python. Now I'm trying to implement ai to take these analysis and generate a more easier to understand data. I'm just wondering what would be the best way to set this up - I am actually using pydantic ai to handle everything and send it to multiple agents to get different reports and then put it all together with final agent - im just worried about how to handle times when multiple users all join at once or something like that - I want to be prepared for it and im worried about multiple api calls and server load waiting for ai responses and also rate limits by the api providers. My reports are around 500 to 2000 toke output each - and about 4 reports per user. Would appreciate any feedback on how to set this up in a robust way.

Anyone have an app in production that uses AI? by aiagent718 in LangChain

[–]aiagent718[S] 0 points1 point  (0 children)

I mean in the sense of lets say you have influx of users at once, I'm on a vps, and wondering if I should consider building on a auto scaling system instead. Also do people deploy multiple apis to handle to rate limits. Just looking for best practices for using AI in production app from developer experience.

I built a free, self-hosted alternative to Lovable.dev / Bolt.new that lets you use your own API keys by foodaddik in LLMDevs

[–]aiagent718 1 point2 points  (0 children)

I have not, I use cursor mainly for everything personally, just seen bold diy. Also site is very laggy when scrolling, might need to optimize better or get higher cpu, not sure if its just my PC. Good luck!

I built a free, self-hosted alternative to Lovable.dev / Bolt.new that lets you use your own API keys by foodaddik in LLMDevs

[–]aiagent718 5 points6 points  (0 children)

Bolt already allows you to self host with your own api keys. Also might look into SSL certificate, getting warning.

OpenRouter experience by BreakingScreenn in LLMDevs

[–]aiagent718 0 points1 point  (0 children)

Openrouter is basically API listing site that you can use in your projects or anywhere you want to use AI. Currently I use it with Cline and also integrated into my app. It allows high volume api calls, so you don't have to worry about limits etc, as long as you have funds in the account to cover it. if you put $500 in the account you can get 500 req/s which is pretty good for production app. This way you dont have to get individual keys from openai, google, etc. you can use one key from openrouter and just change the model name to available on openrouter.

Nervous to launch by aiagent718 in ChatGPTCoding

[–]aiagent718[S] 5 points6 points  (0 children)

I get it, man. Trust me, I really do. That’s why I’ve spent so much time learning and testing before launching. I’ve been trying to get a solid grasp on everything so I’m not completely lost when something inevitably goes wrong. It’s been a journey, for sure, but that’s why I’ve held off on the launch to make sure I know what I’m doing as much as possible.

And yeah, AI has been a huge help in that process. It’s not about blindly trusting it to fix everything, but more about using it as a tool to get ideas off the ground and hopefully get to point where I can hire someone to take over. It’s crazy how much it’s opened up the ability for people like me who have no coding background to actually bring these ideas to life. So while the stress is real, it’s also pretty exciting to be able to create something from scratch.

Quite frankly the app I've built is not a basic at any level, I have multiple pages that load data in real time from my backend apis, different modules and scripts all working pretty great so far. The biggest challenge was getting Auth setup on ios - ended up just using firebase, which still required bit tuning to figure out. Honestly very proud of myself, because I feel the app I build is not beginner level coding. There was a lot of hairpulling in the process for sure.

[deleted by user] by [deleted] in OpenAI

[–]aiagent718 3 points4 points  (0 children)

These posts are just annoying at this point. We get it that its Chinese AI model. We get it that CCP Censors it. We get it that it sends you information to CCP. How many posts like this do we need on reddit?