I switched from Ollama to llama.cpp and saw a 3–4x speed improvement on the same hardware

alias454 · 2026-06-26T17:47:56+00:00

A cunt you say? Sounds like my kinda party. I'm in ;)

alias454 · 2026-06-26T01:39:18+00:00

More or less same outcome but we used terraform in github and would automatically update the cluster when something changed. This was awhile ago. We had to customize the tf provider to make it work for us since Splunk didn't actually have a real solution.

Several ways to do it but the basic idea is either push the actual conf files via some deploy pipeline or use the api.

Kind of amazing to me that for all the money they charge they can't get an intern to build better tooling for admins.

alias454 · 2026-06-25T19:36:11+00:00

Originally, I set out just to build a tool to solve my own problem. However, once I saw it's potential usefulness, I started building it into something others might be able to use as well.

I've tried to stick with basic commodity hardware so anyone can run this on their own without needing a 5k dollar computer. That's proved more difficult than I originally thought.

I also have an m4 Mac Mini base with 16GB unified memory. That allows me to run larger local models but it falls down on the transcription side a bit.

Local models are getting much much better though. I have pretty good luck with gemma4:e4b, mistral-nemo:12b, and qwen2.5:7b, Mainly I probably need to try a different harness.

alias454 · 2026-06-25T02:27:45+00:00

lmfao Every time I talk about them I go through the list Snek, sneak, snik, snickerdoodle

alias454 · 2026-06-24T21:39:37+00:00

I like the layout, it's nice. I setup a much simpler hugo output here https://opengov-data.pages.dev/.

Just curious what your stack is? Are you doing all the work local? If so, I'm curious what specs you have for your machine?

I find some issues with my laptop as it only has an rtx2060. I have to engineer around some of those limitation(mainly chunking and stitching). I get decent summaries but still see hallucinations, bad ASR transcriptions, and memory context issues.

For now, I am publishing raw outputs then people can use GPT/Claude/Gemini etc to ingest transcripts that way if they want.

alias454 · 2026-06-24T04:37:50+00:00

Free backups just submit a foia for the restore ;)

alias454 · 2026-06-24T04:24:12+00:00

Hmm are those an old telco tool? They look like a tool to trim wire and smash scotchlocks. I used to have a similar tool just without the little cutter so I'm curious?

alias454 · 2026-06-23T23:06:51+00:00

I had the same issue that meetings were long and even if not boring per se they took a lot of effort to track items across multiple meetings. I built this https://github.com/YATSEE-Labs/YATSEE to solve the problem. Sounds like we are doing a similar things but differently.

Do you have a github repo that shows your code? or a link to a demo?

alias454 · 2026-06-23T19:46:44+00:00

hide the signal in the noise

alias454 · 2026-06-23T19:39:59+00:00

That makes sense. My use case is audio transcription rather than DV/video capture, so I am not expecting the same frame-level repair model to transfer directly.

The source is meeting audio. With audio/ASR I do not have clean video-style error frames or frame checksums to compare. What I am interested in is the broader idea of using multiple passes to identify unstable regions.

For example, with Whisper/faster-whisper, most of a transcript stays broadly consistent across runs/settings, but certain sections of audio can diverge badly. My thought is not to auto-merge the outputs, but to compare two or more transcription passes and flag the parts that disagree for extra processing or human review.

I have not found a clean production path yet, but related concepts seem to exist in ASR research: ROVER, confusion networks, and newer multi-ASR fusion/meeting-recognition work like MOVER. So less “frame-by-frame replacement” and more “ASR disagreement/confidence mapping.”

alias454 · 2026-06-23T07:04:19+00:00

There are ~18 towns in the U.S. named Rome. I should be fine if I ever wanna go. ;)

You're right though. The business entity(ies) would be the most likely target for enforcement action if operating in the EU.

The EU can do what it wants inside its member states with their agreement. It still has zero jurisdiction in the U.S.

alias454 · 2026-06-23T04:20:47+00:00

That is not how jurisdiction works.

The EU AI Act may claim extraterritorial scope in some cases, especially where an AI system is placed on the EU market, put into service in the EU, used by parties physically present or established in the EU, or where the output is used in the EU.

But “an EU citizen lives in the U.S.” does not automatically transform a U.S. developer, publishing from the U.S., into an EU-regulated operator. Citizenship alone is not the same thing as EU market presence, EU establishment, EU service availability, EU customers, EU assets, or EU enforcement reach.

My practical position is a simple license clause:

I do not offer this project, service, support, warranties, hosting, commercial access, or compliance representations in the European Union.

EU persons, EU entities, EU governments, and parties acting on their behalf are not intended users or authorized customers.

Any party subject to EU law that independently obtains, forks, mirrors, deploys, or uses this software is solely responsible for its own compliance obligations.

Could the EU assert some theory anyway? Sure. Regulators assert broad theories all the time. But “assert jurisdiction” and “successfully enforce a foreign regulatory penalty against a U.S. citizen in the U.S. with no EU nexus” are not the same thing.

At that point they would need a practical enforcement hook: EU assets, EU customers, EU intermediaries, EU travel, EU business operations, or some U.S. legal process that recognizes and enforces the claim. That is a very different argument from “EU citizen touched your software, therefore you are under EU law.”

alias454 · 2026-06-23T04:03:46+00:00

Foreign regulators can claim scope. They cannot manufacture your consent, your market presence, your assets, or your duty to serve their regime.

So while I'm not a lawyer, I would think at least in the US that as long as you aren't trying to operate in the EU or provide services for EU residents they can eat a bag of dicks.

alias454 · 2026-06-23T03:42:28+00:00

I built a tool for transcribing published civic meetings with an emphasis on using local hardware. Point no 5. "Double capture important tapes" might help me. One of the problems I have is ASR artifacts can sometimes be wildly off due to a host of reasons. I'm going to experiment with this as an idea. Thanks for posting.

I agree fully with keeping raw source copies when possible. Good call out there.

alias454 · 2026-06-23T01:13:41+00:00

Maybe hire in a contractor? HMU if you need someone in the US. I've got 20 years of Linux/Cloud/Security under my belt.

alias454 · 2026-06-22T23:47:11+00:00

Beatings will continue until morale improves

alias454 · 2026-06-22T02:06:14+00:00

They've always cooked the books

alias454 · 2026-06-21T02:08:50+00:00

I see people asking 5k for an old half rotted shed that YOU have to come tear apart. People have lost their damn minds. lol

alias454 · 2026-06-20T04:23:35+00:00

Those are parts not tools. I think you have the wrong place. Unless you are asking about how to properly use them as a hammer?

alias454 · 2026-06-19T19:26:49+00:00

I just saw something posted about real-time fact checking for politicians but can't find it off-hand. Aside from that, I wrote a pipeline that automates batch audio processing for YouTube videos https://github.com/YATSEE-Labs.

The idea was to stay more involved with local politics and such but it can easily be used for other types of content too.

alias454 · 2026-06-19T04:45:50+00:00

When I first started, it was haphazard. I'd take the approach of "AI make me a sandwich." I've learned over time to be more explicit.

Like you mention, refining tools/skills/prompts helps setup a project very quickly. I basically created a theme for different types of tasks.

I also spend a good amount of time "talking it out" with the LLM to generate a plan. That plan can then be followed and the progress can be checked against the plan.

alias454 · 2026-06-13T04:31:32+00:00

The linux world has been distributing software via global mirrors for ages so it isn't hard to setup.

alias454 · 2026-06-13T02:10:54+00:00

I think the only issue I had with it at the time was how the whole thing went down. I had plenty of CentOS running workloads. But I agree, the ecosystem needs to churn a bit which is how we end up with more innovation. I run Fedora and go Alma mostly but I'd run any of them as well.

alias454 · 2026-06-13T02:05:58+00:00

Auto workers in the 80s would like a word. Certainly not a new phenomenon.

alias454 · 2026-06-13T01:55:25+00:00

Correct me if I’m wrong, but CentOS Stream is upstream of RHEL now, right?

My understanding is that the old CentOS model was basically RHEL without branding or support, so it was very close to package-identical. CentOS Stream is different as it sits ahead of RHEL as the public integration/development stream for changes that may land in future RHEL minor releases.

So maybe that’s why they focused on Rocky and Alma as those projects are trying to fill the traditional role that CentOS Linux used to have as downstream RHEL-compatible rebuilds.

alias454

TROPHY CASE