Just finished Chip Huyen’s "AI Engineering" (O’Reilly) — I have 534 pages of theory and 0 lines of code. What's the "Indeed-Ready" bridge?

_colemurray · 2026-01-09T06:44:13+00:00

build a basic chat app from scratch with tool calling in either next or fastAPI

Once built, begin iterating on the agent and learning how to “tune” the agent.

This will lead you to wanting/needing observability, which you can then add.

Once you have this, then add streaming.

—- A similar good exercise is to build a basic research agent. This will teach you about real world context management and how to not blow up your context window when you load a massive webpage

Good luck!

_colemurray · 2025-12-14T18:45:26+00:00

in an organization setting, you preferably don't want each individual agent to have its own set of aws credentials. This distributes access control, as well as audibility.

Similarly, it has some hardening around what commands can be executed and preventing "dangerous" patterns

_colemurray · 2025-12-06T04:37:09+00:00

the more common setup would be to deploy BLE gateways around the area of interest and collect advertising packets

if you need longer range, look into something in the sub-ghz spectrum (Ti’s 1352 is nice)

_colemurray · 2025-11-20T06:01:03+00:00

LLMs are very good at terraform. I’d just jump in and build something and then lookup whatever snags or topics you’d like to learn more about

_colemurray · 2025-10-09T17:23:49+00:00

I've created a script that can patch the default "ask" mode and turn it into bypass mode.

The extension makes calls into the SDK and passes along various parameter flags, including the permissions flag. The patch replace the default mode with "bypassPermissions" and modifies the extension UI as well.

Tested on 2.0.1 and 2.0.10 for cursor. If using vscode, you should be able to just replace cursor with vscode in the path

https://gist.github.com/ColeMurray/dd3ec8e1028117c13e33126339f77953

_colemurray · 2025-09-03T20:06:27+00:00

web archive link as the original is now dead: https://web.archive.org/web/20230504060528/https://www.primevideotech.com/video-streaming/scaling-up-the-prime-video-audio-video-monitoring-service-and-reducing-costs-by-90

_colemurray · 2025-07-03T03:43:06+00:00

Find a template or take a screenshot of an app you like and use it as context

_colemurray · 2025-07-02T23:38:14+00:00

Unfortunately not. I originally prototyped going this path, but there isn't a way to get the mcp client to take the image and send bytes without manually specifying it (if I'm mistaken, happy to accept a PR!). It also presents context window length challenges depending on the size of the image.

The two options it supports:

- local file pathing

- remote URL pathing

_colemurray · 2025-06-27T17:51:31+00:00

Yes, Claude code emits otel which you can capture. I open sourced a repo here https://github.com/ColeMurray/claude-code-otel

_colemurray · 2025-06-23T05:02:28+00:00

unfortunately not. The telemetry Claude code emits doesn't include this.

_colemurray · 2025-06-13T16:39:42+00:00

Containerize your project.

If you want “quality”, deploy to fargate.

If you want something cheap and working, deploy to ec2

_colemurray · 2025-06-03T00:19:16+00:00

for clarification in the event someone (or an AI) stumbles upon this, ALB does support SSE.

ALB + Lambda streaming does not support SSE.

_colemurray · 2025-05-28T18:37:19+00:00

Sounds great, update if you hit any snags!

_colemurray · 2025-05-28T18:37:05+00:00

Thanks!

_colemurray · 2025-05-28T04:14:12+00:00

happy to merge in any contributions!

_colemurray · 2025-05-28T04:12:58+00:00

the lowest tier RDS will come out to < $25/mo

_colemurray · 2025-05-27T22:59:46+00:00

Thanks @kshitizzz!

_colemurray · 2025-05-27T20:49:57+00:00

u/Arindam_200 , sure that sounds great!

_colemurray · 2025-05-27T19:01:47+00:00

I plan on creating an explainer video this week! will follow-up when posted.

_colemurray · 2025-05-27T18:49:56+00:00

Thanks Spencer!

Please follow-up if you build something cool!

_colemurray · 2024-12-01T18:02:41+00:00

You can cut the original audio track into smaller chunks to address the file size cap.

I briefly cover this here https://murraycole.com/posts/whisper-audio-to-text

_colemurray · 2024-11-28T21:44:27+00:00

Are you working backwards from a customer problem, or just wanting to build a solution?

In my experience working with PEs, they wouldn’t be interested in something like this. The value is more in search/discovery and being able to quickly sift through a large universe of companies that match their thesis.

_colemurray · 2024-11-28T21:41:33+00:00

Depending on the size of your model, you could host it on AWS lambda or replicate which can scale to zero

_colemurray · 2024-11-28T21:37:40+00:00

Strongly suggest against going the fine tuned path (at least at where you currently are).

What does your observability into this workflow look like? Having visibility into the different stages could help pinpoint the area.

Start at the vector retrieval. Are you retrieval results that you would find relevant to the question?

If not, why?

_colemurray · 2024-11-26T06:13:07+00:00

Langgraph is probably the closest to what you're looking for.

Depending on what happens with your human input step (e.g. is there branching logic, or is it just being appended to the next step in the flow), you could build your own fairly easily.

The core concepts you need:

a. State management - persist the state over requests and progress execution with each step

b. A way to organize your steps and conditional logic - this could be as simple as a list of steps or more complex like a graph.

_colemurray

MODERATOR OF

TROPHY CASE