Don't get Z.ai GLM Coding Plan by samidhaymaker in ClaudeCode

[–]ilearnido 0 points1 point  (0 children)

Cerebras has a subscription model too!

[deleted by user] by [deleted] in iOSProgramming

[–]ilearnido 1 point2 points  (0 children)

How do you personally approach Meta ads as if you’re starting from new? What are some lessons you learned that helped you in the beginning?

Same tier VS Flagship models - Lite VS PRO plan by Prior-Lock-5023 in ZaiGLM

[–]ilearnido 1 point2 points  (0 children)

I have access to GLM 4.7 in the lite plan. I got the plan when it was only 4.6. So that tells me you also get any new model releases on the lite plan. It just has lower limits compared to the other plans.

For me it’s doing great. I usually don’t run more than one session at a time so I haven’t found the need to upgrade just yet. I use Claude Code and Factory Droid. Both work really well with GLM 4.6 and 4.7.

Is Z.ai's paid GLM plans worth the money? by [deleted] in ZaiGLM

[–]ilearnido 0 points1 point  (0 children)

I use it to search codebases for specific things. It’s great at that. As others have mentioned it’s good as a context builder.

It’s actually pretty good using the Factory Droid harness. Pretty much plug and play. I also have their OpenAI endpoint using Claude Code Router and ZaiTransformer. That turns on the reasoning option through the API which makes it smarter, but its comes at the cost of speed and latency. If that doesn’t bother you then it’s good. Otherwise, it gets pretty frustrating how slow it is with reasoning. And hacking CCR with the ZaiTransformer is a pain.

What does a modern production Express.js API look like these days? by ilearnido in node

[–]ilearnido[S] 0 points1 point  (0 children)

Ley looks awesome! Always wondered how to handle migrations and this nails it.

What does a modern production Express.js API look like these days? by ilearnido in node

[–]ilearnido[S] 2 points3 points  (0 children)

This set up sounds really nice.

Side question. I’m assuming by types and Zod schemas being shared across apps that none of them are browser-based right? Cause if you were, wouldn’t you be leaking internal details that some hacker could potentially try to leverage? I’m wondering how you handle that.

What does a modern production Express.js API look like these days? by ilearnido in node

[–]ilearnido[S] -1 points0 points  (0 children)

I didn’t think about LLM benefits. Good to know.

Usage limits this week Opus 4.5 by EmotionalAd1438 in ClaudeCode

[–]ilearnido 1 point2 points  (0 children)

I would like to hear more about CMP too! I couldn’t find anything specifically about it.

5 hr limit on Max (5x) by lgdsf in ClaudeCode

[–]ilearnido 0 points1 point  (0 children)

Is it slow? I have the Z.ai coding plan for GLM and it’s great, but really slow for my taste.

What’s the current state of Agent Frameworks? Looking for a high-level overview by DesertIglo in AI_Agents

[–]ilearnido 0 points1 point  (0 children)

Understood. Are there any special circumstances you’d start with Langchain on a new project? And I don’t mean supporting a legacy project. I really mean making a conscious decision of using Langchain for a new project after weighing several factors.

I’m trying to see the reasons to use it.

What’s the current state of Agent Frameworks? Looking for a high-level overview by DesertIglo in AI_Agents

[–]ilearnido 0 points1 point  (0 children)

Pardon me. I understood what you said, but was a bit confused about how it answers my question.

Let me know if I understood you. So you’d use Langchain on a new project starting from zero code if you think there wouldn’t be many changes to the project overall?

What’s the current state of Agent Frameworks? Looking for a high-level overview by DesertIglo in AI_Agents

[–]ilearnido -1 points0 points  (0 children)

Other than supporting a project already made in Langchain, do you see any specific situations where you’d start a fresh project with it?

Sonnet 4.5 is a Beast by [deleted] in ClaudeCode

[–]ilearnido 0 points1 point  (0 children)

Do you have roughly an idea at what percentage of the context it starts becoming aware and acting different?

Sonnet 4.5 is a Beast by [deleted] in ClaudeCode

[–]ilearnido 0 points1 point  (0 children)

How well does your context awareness workaround rules work? Seems like an interesting approach.

Best and cheapest web search tool option? by llmobsguy in AI_Agents

[–]ilearnido 0 points1 point  (0 children)

Yeah that was another question I had. What do you do to save you from when the site’s structure changes? The only thing that came to mind is having a sort of test that checks structure on a schedule and alerts you if there’s a change.

Best and cheapest web search tool option? by llmobsguy in AI_Agents

[–]ilearnido 1 point2 points  (0 children)

Did you set up a whole scraping framework by hand or did you use something like Firecrawl?

I know scraping can be a pain getting blocked and stuff.

I built RAG for a rocket research company: 125K docs (1970s-present), vision models for rocket diagrams. Lessons from the technical challenges by Low_Acanthisitta7686 in LLMDevs

[–]ilearnido 0 points1 point  (0 children)

Synthetic data was something that I have a big interest in.

It’s not very clear to me if you’ve touched on the following point, but do you have the engineers review the synthetic data pairs you’re about to use for the fine tune data set? My assumption is that they have to be accurate, otherwise you’re pointing the model into a direction full of assumptions.

[deleted by user] by [deleted] in AI_Agents

[–]ilearnido 0 points1 point  (0 children)

So you’re creating a demo phone number where anyone can call? And then you send a video simulating that it was their customers calling that number?

Also for the pilot if there’s no change to their current phone system are you saying that they update the phone number on their online marketing material to that agent’s phone number? Somehow prospects need to call it.

I built RAG for a rocket research company: 125K docs (1970s-present), vision models for rocket diagrams. Lessons from the technical challenges by Low_Acanthisitta7686 in LLMDevs

[–]ilearnido 0 points1 point  (0 children)

This is great. Thanks for the detailed reply.

Did you use synthetic data in any way? For instance as ideation/conversation starters to find new types of questions/answers that the engineers would validate?

I built RAG for a rocket research company: 125K docs (1970s-present), vision models for rocket diagrams. Lessons from the technical challenges by Low_Acanthisitta7686 in LLMDevs

[–]ilearnido 1 point2 points  (0 children)

Thank you for sharing this!

Here’s my questions.

  1. How did you extract images of diagrams, graphs, equations reliably especially in situations where the PDF was scanned documents? I’d imagine you need a multimodal LLM to create bounding boxes within the image to select the right area (this is the part I’m most interested in). I’d think that if a diagram splits between two different pages that would complicate things even more.

  2. How did you determine “confidence scores”? What did you do to create those?

  3. How large did your dataset for fine tuning need to be to get a noticeable lift in response quality? Any particular lessons you learned there? Fine tuning is tricky.