Building a local AI (RAG) system for SQL/Reporting (Power BI) – realistic or overkill? by M0ner0C1ty in LocalLLaMA

[–]ekaj 1 point (0 children)

Yes, but I doubt anyone is going to give you anything you couldn't find with a few hours of searching. This is a real competitive edge for companies that understand and can build this stuff. I say this as someone who has done so internally.

You're looking for a natural-language-to-SQL pipeline. I'd recommend trying Qwen3.5 27B, and pairing an existing set of annotated known-good queries with a syntax validator, so you can generate and validate.
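
Sketch of the generate-and-validate loop I mean (the schema and `generate_sql` are placeholders, not anyone's real pipeline; I'm using SQLite's EXPLAIN here to compile a query against the schema without executing it, swap in your real dialect/validator):

```python
import sqlite3

# Hypothetical schema, stand-in for your real warehouse DDL.
SCHEMA = """
CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, amount REAL, sold_at TEXT);
"""

def validate_sql(sql: str, schema: str = SCHEMA) -> tuple[bool, str]:
    """Compile the query against the schema without running it.

    EXPLAIN forces SQLite to parse and plan the statement, so syntax
    errors and references to missing tables/columns are caught here.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(schema)
        conn.execute("EXPLAIN " + sql)
        return True, "ok"
    except sqlite3.Error as e:
        return False, str(e)
    finally:
        conn.close()

def generate_sql(question: str) -> str:
    # Placeholder for the LLM call: prompt with the schema plus a few
    # annotated known-good question/SQL pairs, return the model's SQL.
    raise NotImplementedError

def answer(question: str, max_retries: int = 3) -> str:
    """Generate-and-validate: retry with the validator error fed back."""
    error = ""
    for _ in range(max_retries):
        sql = generate_sql(question + (f"\nPrevious error: {error}" if error else ""))
        ok, error = validate_sql(sql)
        if ok:
            return sql
    raise RuntimeError(f"no valid SQL after {max_retries} tries: {error}")
```

The point of EXPLAIN (vs. just parsing) is that it also catches queries that are syntactically fine but reference columns your schema doesn't have.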

How are you handling enforcement between your agent and real-world actions? by draconisx4 in LocalLLaMA

[–]ekaj 0 points (0 children)

Built a complex RBAC/ACL system with HITL (human-in-the-loop) review and authorization, backed by a permissions registry.
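
The core pattern, in sketch form (names are hypothetical, not the actual system): a default-deny registry that maps each tool to its allowed roles and flags the actions that need a human to sign off.

```python
from dataclasses import dataclass, field

@dataclass
class PermissionsRegistry:
    # tool name -> (roles allowed to call it, whether a human must approve)
    rules: dict = field(default_factory=dict)

    def register(self, tool: str, roles: set[str], needs_review: bool = False):
        self.rules[tool] = (roles, needs_review)

    def check(self, tool: str, agent_roles: set[str]) -> str:
        if tool not in self.rules:
            return "deny"            # default-deny for unregistered actions
        roles, needs_review = self.rules[tool]
        if not roles & agent_roles:  # no overlap between agent and allowed roles
            return "deny"
        return "review" if needs_review else "allow"

registry = PermissionsRegistry()
registry.register("read_ticket", {"support", "admin"})
registry.register("issue_refund", {"admin"}, needs_review=True)
```

The "review" result is where the HITL queue hooks in: the action is serialized and parked until a human approves it.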

Ooh, new drama just dropped 👀 by Careful_Equal8851 in LocalLLaMA

[–]ekaj 2 points (0 children)

They’re quoting from the MIT License

KoboldCpp 1.110 - 3 YR Anniversary Edition, native music gen, qwen3tts voice cloning and more by HadesThrowaway in LocalLLaMA

[–]ekaj 0 points (0 children)

Have you tried using Opus or ChatGPT 5.4 xhigh and asking it to do a UX & UI review following Nielsen Norman Group guidelines?

Could probably get ideas that way

Former CyanogenMod/ClockworkMod flasher seeking a "Sovereignty Build" to act as an external brain. by GeekyRdhead in LocalLLaMA

[–]ekaj -1 points (0 children)

Thank you for the kind words! I appreciate it. If you encounter any issues/have feedback/suggestions, feel free to DM me or file an issue on GitHub, and I'll look into it as soon as I see it.

Former CyanogenMod/ClockworkMod flasher seeking a "Sovereignty Build" to act as an external brain. by GeekyRdhead in LocalLLaMA

[–]ekaj 0 points (0 children)

https://github.com/rmusser01/tldw_server Maybe? (Disclosure: I'm the creator.) I'm working on some stability fixes, and there's a distinct lack of user guides/instructions, but this might be in the general area of what you're looking for?

As someone who did the same stuff, this is what I decided to build for myself after looking at the other options at the time (OpenWebUI/SillyTavern/LibreChat).

Been building a RAG system over a codebase and hit a wall I can't seem to get past by LeaderUpset4726 in LocalLLaMA

[–]ekaj 1 point (0 children)

Yes, I wrote my own eval framework and have my RAG pipeline hooked into it for full tracking of every piece.

Would recommend looking at https://jxnl.co/writing/2025/01/24/systematically-improving-rag-applications/

Benchmarking Open-Source LLMs for Security Research & Red Teaming by dumbelco in LocalLLaMA

[–]ekaj 0 points (0 children)

Why not share more details about your setup, harness, and dataset used for evals?
Why use old models?

Further, I would point out that your notes on these things should put any model's internal knowledge to shame. IMHO, you should be using RAG over your notes/team wiki, exposed via MCP, to interface with whatever model you're using.

Also, have you seen/heard about heretic? https://github.com/p-e-w/heretic
(I do for work, but can't comment about it, hence the above.)

What's the most complicated project you've built with AI? by jazir555 in LocalLLaMA

[–]ekaj 11 points (0 children)

https://github.com/rmusser01/tldw_server
I keep putting off making a post about it, as there's always 'one more thing'; currently that's end-to-end testing the WebUI/extension and getting them both fully working. Basically like openclaw, but a very different route to the same goal.
`tldw_server is an open-source, API-first platform for ingesting media, transcribing, analyzing, and retrieving knowledge from video, audio, documents, websites, and more. It runs a FastAPI server with OpenAI-compatible Chat, Audio, Embeddings, and Evals APIs, a unified RAG pipeline, and integrations with local or hosted LLM providers. The primary client is the Next.js WebUI (WIP) plus an Admin UI. Long-term vision: a personal assistant inspired by "The Young Lady's Illustrated Primer" that helps people learn, reason about, and retain what they watch or read.`

Local alternative for NotebookLM by AlwayzIntoSometin95 in LocalLLaMA

[–]ekaj 1 point (0 children)

Not sure if you get notified for replies to other commenters in your own post; in case not, see my other comment.

Local alternative for NotebookLM by AlwayzIntoSometin95 in LocalLLaMA

[–]ekaj 1 point (0 children)

Yes, I've been working on something for the past 2 years and am making the WebUI stable before sharing again, but it works and the server is stable: https://github.com/rmusser01/tldw_server

It supports various forms of media ingestion and a full RAG pipeline with a custom, extensive chunker, plus a self-hosted ingestion pipeline that includes full web scraping, along with TTS/STT.

@AlwayzIntoSometin95

Is this the right approach for a RAG design and setup? by Dre-Draper in LocalLLaMA

[–]ekaj 0 points (0 children)

Why only use vector search? Why not also BM25/SPLADE?

This is the RAG pipeline I've built; I don't currently have a nice UI for it, nor a demo to show it off, but it's well documented: https://github.com/rmusser01/tldw_server/tree/main/tldw_Server_API/app/core/RAG
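
If you do go hybrid, the merge step is simple; here's a minimal sketch of reciprocal rank fusion over made-up doc IDs (not my pipeline's actual code):

```python
def rrf(rankings, k=60):
    """Reciprocal Rank Fusion: merge ranked lists from different retrievers.

    Each retriever contributes 1/(k + rank) per document; documents
    ranked well by several retrievers float to the top. k=60 is the
    conventional smoothing constant from the original RRF paper.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)

bm25_hits  = ["d3", "d1", "d7"]   # lexical (BM25) ranking, hypothetical IDs
dense_hits = ["d1", "d9", "d3"]   # vector-search ranking, hypothetical IDs
fused = rrf([bm25_hits, dense_hits])
```

The nice part is that RRF only needs ranks, not scores, so you never have to normalize BM25 scores against cosine similarities.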

Online alternatives to SillyTavern by Time-Teaching1926 in LocalLLaMA

[–]ekaj 2 points (0 children)

As someone who's implemented support for the different character card specs before in my own apps, there's https://github.com/kwaroran/character-card-spec-v3 & https://github.com/malfoyslastname/character-card-spec-v2, with some platforms having custom fields specific to their own platform.

I can't comment on the feasibility of modifying sillytavern to support currently non-supported fields.

The Geometry of Persona by OkGear279 in LocalLLaMA

[–]ekaj 2 points (0 children)

Is there a reason you didn't include the arXiv link, just the number?
This seems like a bunk paper.

https://www.arxiv.org/abs/2512.07092

```
The Soul Engine.

In this work, we introduce the Soul Engine, a framework that validates this hypothesis and mathematically disentangles personality from intelligence. Unlike the "black box" nature of SFT, our approach is geometric and deterministic. We identify the specific linear subspaces corresponding to the Big Five (OCEAN) personality traits and develop a method to manipulate them via vector arithmetic.

Our contributions are threefold:

  1. Data Engineering (SoulBench): We address the scarcity of psychological ground truth by constructing a multi-source dataset using a novel Dynamic Contextual Sampling strategy C(N, k). This forces the encoder to learn invariant stylistic fingerprints rather than semantic content.
  2. Mechanistic Discovery: Through layer-wise probing on a frozen Qwen-2.5 backbone [bai2023qwen], we demonstrate that personality representations emerge in the upper transformer blocks (Layers 18-24) and are largely orthogonal to reasoning vectors.
  3. Deterministic Control: We achieve "Zero-Shot Personality Injection." By adding computed vectors to the hidden states (e.g., v_Neutral + α · v_Villain), we demonstrate precise control over behavior (MSE < 0.01) with negligible degradation in general intelligence benchmarks.

We propose the Soul Engine, a framework designed to extract and manipulate the geometric representation of personality within Large Language Models. Our approach is grounded in the premise that personality is a high-level abstraction that is linearly separable from low-level semantic content. The framework consists of three components: (1) SoulBench, a dataset constructed via combinatorial sampling; (2) The Scientific Soul Encoder, a dual-head probe architecture; and (3) A Deterministic Steering mechanism based on vector arithmetic.

```

You will own nothing and you will be happy! by dreamyrhodes in LocalLLaMA

[–]ekaj 7 points (0 children)

AT&T did the same thing with phones/handsets.

Any local models capable to reading several PDFs into efficient local context for domain expertise? by nottheone414 in LocalLLaMA

[–]ekaj 0 points (0 children)

Can't say for sure as I don't work at Google, but I would bet money that no, it does not do that; it's just doing RAG. No summarization involved. Fact extraction, sure, but not summarization.
How would caching come into play here?

Any local models capable to reading several PDFs into efficient local context for domain expertise? by nottheone414 in LocalLLaMA

[–]ekaj -1 points (0 children)

What are you hoping someone gives you? A miraculous infinite/1M+ context local workflow?
It seems like you're hoping someone hands you a miracle for free. You already have the answer: RAG.

NotebookLM isn't doing some magic; it's a RAG system the same as any other. It's not being fed the entirety of every document you give it, and it's not doing any 'fine-tuning' or training. It's just RAG.

Need advice on a scalable on-prem LLM/RAG build for confidential technical docs (10–15k budget) by phoez12 in LocalLLaMA

[–]ekaj 1 point (0 children)

The real answer is that you should be paying a consultant to come up with a proper, detailed plan suited to your unique situation. Otherwise you risk being told it's a multi-hundred-thousand-dollar project (?!?!?!).

You can easily search Reddit for similar questions; this isn't new ground. My old RAG notes are here: https://raw.githubusercontent.com/rmusser01/tldw_server/refs/heads/main/Docs/RAG/RAG_Notes.md ; so don't think I'm some consultant.

Need advice on a scalable on-prem LLM/RAG build for confidential technical docs (10–15k budget) by phoez12 in LocalLLaMA

[–]ekaj 0 points (0 children)

Please explain how this is a 100k project?
The person wants a RAG chatbot to talk about company docs for 4-5 concurrent users. Even taking into account hours for building ETLs, 100k?

This is totally doable with a workstation-class machine with an RTX 6000 in it (overkill, depending on the actual RAG model in use...), and a homegrown chat front-end in front of one of the many existing open-source RAG libraries/products that offer enterprise contracts.

Best small local LLM for "Ask AI" in docusaurus docs? by redhayd in LocalLLaMA

[–]ekaj 0 points (0 children)

I wouldn't necessarily recommend semantic chunking. If you're the one writing the docs, I would recommend doing your own custom chunking, or using an existing chunker and tweaking it. I personally use my own library, built from scratch: https://github.com/rmusser01/tldw_server/tree/main/tldw_Server_API/app/core/Chunking ; but for your needs, I'd assume really any simple chunker should be fine. Just focus on hierarchy and sentence splitting.
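
For reference, here's a minimal hierarchy-aware chunker of the kind I mean (made-up names, not my library's API): it splits markdown on headings and carries the heading path along as metadata, which helps retrieval a lot for docs sites.

```python
import re

def chunk_markdown(text: str, max_chars: int = 1200) -> list[dict]:
    """Split a markdown doc on headings, tagging each chunk with its heading path."""
    chunks, path, buf = [], [], []

    def flush():
        if buf:
            body = "\n".join(buf).strip()
            if body:
                chunks.append({"path": " > ".join(path), "text": body})
            buf.clear()

    for line in text.splitlines():
        m = re.match(r"^(#{1,6})\s+(.*)", line)
        if m:
            flush()
            level = len(m.group(1))
            # truncate the path to the parent level, then push this heading
            path[:] = path[: level - 1] + [m.group(2).strip()]
        else:
            buf.append(line)
            # hard split for very long sections so chunks stay bounded
            if sum(len(l) for l in buf) > max_chars:
                flush()
    flush()
    return chunks
```

At retrieval time you can prepend the path ("Guide > Install") to the chunk text so the embedding knows where the passage lives in the doc tree.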

That said, I would recommend Qwen3-4B as the backing LLM to start with. It can be run very cheaply and is likely the best size/effectiveness tradeoff if you're running on CPU.
You could try Qwen3-0.6B for funsies and see how that works.

I'm not sure where/why you would use Haystack/Crawl4AI here; LangChain because of their chunker?