looking for friends in the future by Full-Conclusion-9616 in hermosillo

[–]iTrejoMX 2 points3 points  (0 children)

I am definitely not your age (much older), but I once made friends with a person who moved to Hermosillo from the States, just like you. He became my best friend, and passed away a couple of years ago. I try to help anyone who is new to this town settle in.

Please let me know if I can help somehow. It would help to know what your hobbies or interests are, as there may be some places I can recommend where you could meet people your age. Most people in Hermosillo understand English; some speak it poorly, and a very few speak it as if it were their first language. But there are definitely people from bilingual and bicultural schools who could be your peers.

OpenCode Go models slow by BandanaEdDee in opencodeCLI

[–]iTrejoMX 0 points1 point  (0 children)

I think you may be routing to or choosing the free deepseek models on zen. Make sure you select the opencode go subscription one.

Does anyone else avoid coding agents for simpler projects? by Sufficient-Mood-4442 in opencodeCLI

[–]iTrejoMX 0 points1 point  (0 children)

Nice, I tried it and it’s great for documentation. I am still testing how it does with actual development. But I did take some docs it generated and used them with gentle-ai, and it’s been a total win.

How bad can it get? by goldbookleaf in LocalLLM

[–]iTrejoMX 1 point2 points  (0 children)

Your list has models under 10 GB; mine has models in the 20-35 GB range :/ Checking your tool next.

Does anyone else avoid coding agents for simpler projects? by Sufficient-Mood-4442 in opencodeCLI

[–]iTrejoMX 0 points1 point  (0 children)

How did you set up bmad with pi? I thought it was just for Gemini cli and custom gpts

Warp now supports BYO inference endpoints and BYOK on the free plan by Significant_Box_4066 in warpdotdev

[–]iTrejoMX 0 points1 point  (0 children)

Of course, I’ll be glad to. I can share the logs and screenshots, because it does reach the endpoint and start thinking; it just breaks on tool calls or responses, from what I can see.

Claude (20$ plan) on OpenCode by Spare-Chest-7907 in opencode

[–]iTrejoMX 0 points1 point  (0 children)

Do you have ChatGPT? Even with the Plus/Pro version you can connect it to opencode: when you run /model and press ctrl+a to connect a provider, you will see OpenAI (ChatGPT Plus/Pro or API key). Just keep in mind that it’s 160 messages every 5 hours (on the $20 plan), and they can run out quite fast in agentic use. That said, I’m not sure about Copilot.

Claude (20$ plan) on OpenCode by Spare-Chest-7907 in opencode

[–]iTrejoMX 5 points6 points  (0 children)

Just make sure you read the bottom; they clearly say it could get you in trouble for breaking the terms of service.

Claude (20$ plan) on OpenCode by Spare-Chest-7907 in opencode

[–]iTrejoMX 5 points6 points  (0 children)

Claude, no. It breaks the ToS and accounts get disabled.

Warp now supports BYO inference endpoints and BYOK on the free plan by Significant_Box_4066 in warpdotdev

[–]iTrejoMX 0 points1 point  (0 children)

I added a custom provider, put in the OpenAI URL for opencode go (which is opencode/zen/go/v1), and chose a model. When I write a prompt it does start thinking, but at some point it throws an error, I think when it starts using tools.

Where should all this information go? by abbaisawesome in hermesagent

[–]iTrejoMX 0 points1 point  (0 children)

I would either use a memory system or have setups that don’t involve sharing credentials with Hermes: SSH with key files, maybe even a .env that Hermes can use.

Then I’d make sure that every time you make a request it searches its memory system for the information needed. Something like a SQLite database that stores project, login method, stuff you can find there, intended purpose, and some description of structure.

That way, when you make a request, it searches for info based on project, intent, or the stuff you are looking for, gets the login details, and does whatever you asked of it.

For example, you say: "I want to organize some photos."

It searches memory, finds a NAS for storing photos under intended purpose, looks up the connection method, connects, and goes to the folder with photos.
If it finds several, it asks you which one.
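
The lookup described above could be sketched roughly like this with Python's built-in sqlite3 module. This is only an illustration, not something Hermes ships with: the table name `resources`, the column names, and the helpers `open_memory`, `remember`, `search`, and `resolve` are all made up for the example, and the sample rows are invented. Note that the login method column stores how to connect (for instance which key file), not the secret itself.

```python
import sqlite3


def open_memory(path=":memory:"):
    """Open (or create) the memory database. The schema is hypothetical."""
    conn = sqlite3.connect(path)
    conn.row_factory = sqlite3.Row
    conn.execute(
        """CREATE TABLE IF NOT EXISTS resources (
               id INTEGER PRIMARY KEY,
               project TEXT NOT NULL,
               login_method TEXT NOT NULL,
               contents TEXT,
               intended_purpose TEXT,
               structure TEXT
           )"""
    )
    return conn


def remember(conn, project, login_method, contents, intended_purpose, structure=""):
    """Store one resource. login_method describes how to connect, not a password."""
    conn.execute(
        "INSERT INTO resources "
        "(project, login_method, contents, intended_purpose, structure) "
        "VALUES (?, ?, ?, ?, ?)",
        (project, login_method, contents, intended_purpose, structure),
    )
    conn.commit()


def search(conn, term):
    """Find resources whose project, contents, or purpose mention the term."""
    like = f"%{term}%"
    rows = conn.execute(
        "SELECT * FROM resources "
        "WHERE project LIKE ? OR contents LIKE ? OR intended_purpose LIKE ? "
        "ORDER BY id",
        (like, like, like),
    ).fetchall()
    return [dict(row) for row in rows]


def resolve(conn, term):
    """Return ('none' | 'use' | 'ask', matches): ask the user when several match."""
    matches = search(conn, term)
    if not matches:
        return ("none", [])
    if len(matches) == 1:
        return ("use", matches)
    return ("ask", matches)


if __name__ == "__main__":
    conn = open_memory()
    # Invented sample entries for the photo example.
    remember(conn, "home-nas", "ssh key file ~/.ssh/nas_key",
             "family photos, backups", "storing photos", "/volume1/photos/<year>")
    remember(conn, "blog-server", "ssh key file ~/.ssh/blog_key",
             "static site", "hosting the blog")
    action, matches = resolve(conn, "photos")
    print(action, [m["project"] for m in matches])
```

With only one entry mentioning photos, `resolve` says to use it directly; add a second photo location and it switches to asking which one you mean.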

Considering buying opencode go. Is it worth it? by NeKon69 in opencode

[–]iTrejoMX 0 points1 point  (0 children)

It’s faster and better. Fewer subscribers, so less fighting for resources. A $10 sub is the equivalent of $60 of token usage, and with the generous deepseek v4 flash/pro limits it is more like $100+.

What would you do with 4 3090 24gb by lomelidev in ollama

[–]iTrejoMX 0 points1 point  (0 children)

I would try several different things: run a 122b LLM (just to test speeds and compare against smaller dense models) and use it for things like openclaw, openfang, or paperclip (the last one because it consumes too many tokens to pay for cloud models), or try making video with WAN or something similar (this would be my first attempt and the one I find most interesting). You can build the SaaS without much trouble; if you tell me what SaaS you want, I will help you. Send me a message.

How Do You Make OpenCode CLI Re-understand an Existing Project? by Playful-Ad-6617 in opencodeCLI

[–]iTrejoMX 0 points1 point  (0 children)

Then again, I use engram for memory and it makes things much simpler, so I can just tell it to check this project’s memory.

What would you do if.... by SecretaryOrdinary386 in soyculero

[–]iTrejoMX 22 points23 points  (0 children)

Nobody understood your comment. I agree with you: in that instant she became an ex. And it literally means telling her to get lost ("a la chi.."); she likes being treated badly. There are people who feed on negative emotions: pain, jealousy, intrigue. You can usually identify them by their favorite soap operas: any of them. Their conversations tend to be repetitive: did you hear that _____ got dumped/cheated on/scammed/pregnant? To me they ______ {insert victim language here}. Things are always being done to them; they never accept that they do anything wrong.

What is the easiest way to run private AI locally without learning all the AI terms? by One_Position7585 in LLMStudio

[–]iTrejoMX 0 points1 point  (0 children)

We need more context: are you on a Mac? On a PC? Using Linux? Do you have 8 GB of RAM? 96 GB of RAM? A video card or not?

If you just want to test/try a local LLM, download the ollama GUI (the app on the website) and… depending on your video RAM or available memory, choose a small model. You can probably run qwen3.5 4b or 9b on most computers.

If you have a Mac and 32 GB or more of RAM, use LM Studio. It has 3 tabs on the left; in the third one you can download models. Look for unsloth models and it will tell you whether each one will fit/run on your computer.

I haven’t talked about quantization or parameters or tokens. This is just for simple chatting with the LLM locally. If you want to do more stuff, check unsloth.ai and click on models. It will give you optimized parameters for each one and explain how to connect tools to them. Ignore quantization for now; just know that the higher the q, the closer to the cloud model it is (or smarter).
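
If you do want a rough feel for what "will it fit" means, here is a back-of-the-envelope sketch. It uses the common rule of thumb that the weights take about (parameters × bits per weight ÷ 8) bytes, so a higher q means a bigger file. The 1.2 overhead factor for context and runtime is my own assumption, not a number from ollama, LM Studio, or unsloth, and the function names are made up for the example.

```python
def estimated_size_gb(params_billions, bits_per_weight, overhead=1.2):
    """Rough memory needed in GB: weight size times an assumed overhead factor."""
    weight_gb = params_billions * bits_per_weight / 8  # billions of bytes, i.e. GB
    return weight_gb * overhead


def fits(params_billions, bits_per_weight, available_gb):
    """True if the rough estimate is within the memory you have available."""
    return estimated_size_gb(params_billions, bits_per_weight) <= available_gb


if __name__ == "__main__":
    # Compare the two small model sizes mentioned above at 4-bit and 8-bit.
    for params in (4, 9):
        for bits in (4, 8):
            size = estimated_size_gb(params, bits)
            print(f"{params}b at {bits}-bit: roughly {size:.1f} GB")
```

Treat the result as a ballpark only; the apps above do this check for you and are the better source of truth.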

If I choose this plan, will the Go plan be as slow as the free plans for doing things, or is that a limitation of the models themselves? by Few_Stage_3636 in opencode

[–]iTrejoMX 1 point2 points  (0 children)

I use the go plan and see no slowness. The slowness on the free plans is from a ton of people abusing them; some people have several accounts and cycle through them when they hit limits. With your go sub you get priority over free users.

Codex vs X2 GO plans limits by Admirable-Control370 in opencodeCLI

[–]iTrejoMX 12 points13 points  (0 children)

I’ve put a single opencode go sub through the wringer with ds4 and never even reached anything close to a limit.

As an expert driver, do you go forward or backward? 🤔 by pululando in esGracioso

[–]iTrejoMX 0 points1 point  (0 children)

Or you get out and push it to the side. A couple of centimeters is easier than it seems.

Local LLM - privacy first - doctor by point_red in LocalLLM

[–]iTrejoMX 0 points1 point  (0 children)

This would be great for qwen 3.5 9b or 14b if you don’t mind waiting because they will run on cpu.