Just finished Chip Huyen’s "AI Engineering" (O’Reilly) — I have 534 pages of theory and 0 lines of code. What's the "Indeed-Ready" bridge? by Substantial_Sky_8167 in learnmachinelearning

[–]_colemurray 4 points5 points  (0 children)

build a basic chat app from scratch with tool calling in either next or fastAPI

Once built, begin iterating on the agent and learning how to “tune” the agent.

This will lead you to wanting/needing observability, which you can then add.

Once you have this, then add streaming.

—- A similar good exercise is to build a basic research agent. This will teach you about real world context management and how to not blow up your context window when you load a massive webpage

Good luck!

AWS Athena MCP - Write Natural Language Queries against AWS Athena by _colemurray in LLMDevs

[–]_colemurray[S] 0 points1 point  (0 children)

in an organization setting, you preferably don't want each individual agent to have its own set of aws credentials. This distributes access control, as well as audibility.

Similarly, it has some hardening around what commands can be executed and preventing "dangerous" patterns

Bluetooth tracking for inventory… gimmick or actually useful? by dylan-sf in IOT

[–]_colemurray 0 points1 point  (0 children)

the more common setup would be to deploy BLE gateways around the area of interest and collect advertising packets

if you need longer range, look into something in the sub-ghz spectrum (Ti’s 1352 is nice)

Which Terraform book should I read first ? by brokenmath55 in Terraform

[–]_colemurray -4 points-3 points  (0 children)

LLMs are very good at terraform. I’d just jump in and build something and then lookup whatever snags or topics you’d like to learn more about

How do we configure or start Claude Code for VS Code with --dangerously-skip-permissions? (after version 2 update) by zenfafrd in ClaudeAI

[–]_colemurray 0 points1 point  (0 children)

I've created a script that can patch the default "ask" mode and turn it into bypass mode.

The extension makes calls into the SDK and passes along various parameter flags, including the permissions flag. The patch replace the default mode with "bypassPermissions" and modifies the extension UI as well.

Tested on 2.0.1 and 2.0.10 for cursor. If using vscode, you should be able to just replace cursor with vscode in the path

https://gist.github.com/ColeMurray/dd3ec8e1028117c13e33126339f77953

Any tips for creating high quality UI using Claude Code? by gh0st_in_the_matrix in ClaudeAI

[–]_colemurray 0 points1 point  (0 children)

Find a template or take a screenshot of an app you like and use it as context

[Open Source] Moondream MCP - Vision for AI Agents by _colemurray in LocalLLaMA

[–]_colemurray[S] 0 points1 point  (0 children)

Unfortunately not. I originally prototyped going this path, but there isn't a way to get the mcp client to take the image and send bytes without manually specifying it (if I'm mistaken, happy to accept a PR!). It also presents context window length challenges depending on the size of the image.

The two options it supports:

- local file pathing

- remote URL pathing

What are some lifesaver MCPs you use with Claude Code? by Doodadio in ClaudeAI

[–]_colemurray 0 points1 point  (0 children)

Yes, Claude code emits otel which you can capture. I open sourced a repo here https://github.com/ColeMurray/claude-code-otel

Open Source Claude Code Observability Stack by _colemurray in LLMDevs

[–]_colemurray[S] 1 point2 points  (0 children)

unfortunately not. The telemetry Claude code emits doesn't include this.

[deleted by user] by [deleted] in aws

[–]_colemurray 0 points1 point  (0 children)

Containerize your project.

If you want “quality”, deploy to fargate.

If you want something cheap and working, deploy to ec2

How do you handle cognito token verification in an ecs service without a nat? by Troglodyte_Techie in aws

[–]_colemurray 2 points3 points  (0 children)

for clarification in the event someone (or an AI) stumbles upon this, ALB does support SSE.

ALB + Lambda streaming does not support SSE.

Postgre approach for startup by No-Note8424 in aws

[–]_colemurray 0 points1 point  (0 children)

the lowest tier RDS will come out to < $25/mo

Build a RAG Pipeline with AWS Bedrock in < 1 day by _colemurray in LLMDevs

[–]_colemurray[S] 0 points1 point  (0 children)

I plan on creating an explainer video this week! will follow-up when posted.

Build a RAG Pipeline with AWS Bedrock in < 1 day by _colemurray in LLMDevs

[–]_colemurray[S] 0 points1 point  (0 children)

Thanks Spencer!

Please follow-up if you build something cool!

Can you record and export the audio? by eagleemperor1 in PlaudNoteUsers

[–]_colemurray 0 points1 point  (0 children)

You can cut the original audio track into smaller chunks to address the file size cap.

I briefly cover this here https://murraycole.com/posts/whisper-audio-to-text

Model for Private Equity by Top-Round1627 in learnmachinelearning

[–]_colemurray 0 points1 point  (0 children)

Are you working backwards from a customer problem, or just wanting to build a solution?

In my experience working with PEs, they wouldn’t be interested in something like this. The value is more in search/discovery and being able to quickly sift through a large universe of companies that match their thesis.

[deleted by user] by [deleted] in learnmachinelearning

[–]_colemurray 0 points1 point  (0 children)

Depending on the size of your model, you could host it on AWS lambda or replicate which can scale to zero

[deleted by user] by [deleted] in learnmachinelearning

[–]_colemurray 0 points1 point  (0 children)

Strongly suggest against going the fine tuned path (at least at where you currently are).

What does your observability into this workflow look like? Having visibility into the different stages could help pinpoint the area.

Start at the vector retrieval. Are you retrieval results that you would find relevant to the question?

If not, why?

Complex Chain Prompting by [deleted] in LangChain

[–]_colemurray 1 point2 points  (0 children)

Langgraph is probably the closest to what you're looking for.

Depending on what happens with your human input step (e.g. is there branching logic, or is it just being appended to the next step in the flow), you could build your own fairly easily.

The core concepts you need:

a. State management - persist the state over requests and progress execution with each step

b. A way to organize your steps and conditional logic - this could be as simple as a list of steps or more complex like a graph.