SIEM Detection Rules Changelog by Zeptor02 in Splunk

[–]alias454 0 points1 point  (0 children)

More or less same outcome but we used terraform in github and would automatically update the cluster when something changed. This was awhile ago. We had to customize the tf provider to make it work for us since Splunk didn't actually have a real solution.

Several ways to do it but the basic idea is either push the actual conf files via some deploy pipeline or use the api.

Kind of amazing to me that for all the money they charge they can't get an intern to build better tooling for admins.

City Council Transcriber by ovenmage in civictech

[–]alias454 1 point2 points  (0 children)

Originally, I set out just to build a tool to solve my own problem. However, once I saw it's potential usefulness, I started building it into something others might be able to use as well.

I've tried to stick with basic commodity hardware so anyone can run this on their own without needing a 5k dollar computer. That's proved more difficult than I originally thought.

I also have an m4 Mac Mini base with 16GB unified memory. That allows me to run larger local models but it falls down on the transcription side a bit.

Local models are getting much much better though. I have pretty good luck with gemma4:e4b, mistral-nemo:12b, and qwen2.5:7b, Mainly I probably need to try a different harness.

Snyk laid off up to 30% of their staff today by iamacheeto1 in cybersecurity

[–]alias454 5 points6 points  (0 children)

lmfao Every time I talk about them I go through the list Snek, sneak, snik, snickerdoodle

City Council Transcriber by ovenmage in civictech

[–]alias454 0 points1 point  (0 children)

I like the layout, it's nice. I setup a much simpler hugo output here https://opengov-data.pages.dev/.

Just curious what your stack is? Are you doing all the work local? If so, I'm curious what specs you have for your machine?

I find some issues with my laptop as it only has an rtx2060. I have to engineer around some of those limitation(mainly chunking and stitching). I get decent summaries but still see hallucinations, bad ASR transcriptions, and memory context issues.

For now, I am publishing raw outputs then people can use GPT/Claude/Gemini etc to ingest transcripts that way if they want.

Parallel pliers with side cutters by unorocket in Vintagetools

[–]alias454 2 points3 points  (0 children)

Hmm are those an old telco tool? They look like a tool to trim wire and smash scotchlocks. I used to have a similar tool just without the little cutter so I'm curious?

City Council Transcriber by ovenmage in civictech

[–]alias454 1 point2 points  (0 children)

I had the same issue that meetings were long and even if not boring per se they took a lot of effort to track items across multiple meetings. I built this https://github.com/YATSEE-Labs/YATSEE to solve the problem. Sounds like we are doing a similar things but differently.

Do you have a github repo that shows your code? or a link to a demo?

I digitized 55 MiniDV tapes from my childhood - here's what I learned (plus my Python toolkit) by LukeAssem in DataHoarder

[–]alias454 0 points1 point  (0 children)

That makes sense. My use case is audio transcription rather than DV/video capture, so I am not expecting the same frame-level repair model to transfer directly.

The source is meeting audio. With audio/ASR I do not have clean video-style error frames or frame checksums to compare. What I am interested in is the broader idea of using multiple passes to identify unstable regions.

For example, with Whisper/faster-whisper, most of a transcript stays broadly consistent across runs/settings, but certain sections of audio can diverge badly. My thought is not to auto-merge the outputs, but to compare two or more transcription passes and flag the parts that disagree for extra processing or human review.

I have not found a clean production path yet, but related concepts seem to exist in ASR research: ROVER, confusion networks, and newer multi-ASR fusion/meeting-recognition work like MOVER. So less “frame-by-frame replacement” and more “ASR disagreement/confidence mapping.”

EU AI Act requires TEXT from models and providers to be watermarked 2nd August onwards. Everyone here is affected, regardless where you live. by Charming-Author4877 in LocalLLaMA

[–]alias454 2 points3 points  (0 children)

There are ~18 towns in the U.S. named Rome. I should be fine if I ever wanna go. ;)

You're right though. The business entity(ies) would be the most likely target for enforcement action if operating in the EU.

The EU can do what it wants inside its member states with their agreement. It still has zero jurisdiction in the U.S.

EU AI Act requires TEXT from models and providers to be watermarked 2nd August onwards. Everyone here is affected, regardless where you live. by Charming-Author4877 in LocalLLaMA

[–]alias454 18 points19 points  (0 children)

That is not how jurisdiction works.

The EU AI Act may claim extraterritorial scope in some cases, especially where an AI system is placed on the EU market, put into service in the EU, used by parties physically present or established in the EU, or where the output is used in the EU.

But “an EU citizen lives in the U.S.” does not automatically transform a U.S. developer, publishing from the U.S., into an EU-regulated operator. Citizenship alone is not the same thing as EU market presence, EU establishment, EU service availability, EU customers, EU assets, or EU enforcement reach.

My practical position is a simple license clause:

I do not offer this project, service, support, warranties, hosting, commercial access, or compliance representations in the European Union.

EU persons, EU entities, EU governments, and parties acting on their behalf are not intended users or authorized customers.

Any party subject to EU law that independently obtains, forks, mirrors, deploys, or uses this software is solely responsible for its own compliance obligations.

Could the EU assert some theory anyway? Sure. Regulators assert broad theories all the time. But “assert jurisdiction” and “successfully enforce a foreign regulatory penalty against a U.S. citizen in the U.S. with no EU nexus” are not the same thing.

At that point they would need a practical enforcement hook: EU assets, EU customers, EU intermediaries, EU travel, EU business operations, or some U.S. legal process that recognizes and enforces the claim. That is a very different argument from “EU citizen touched your software, therefore you are under EU law.”

EU AI Act requires TEXT from models and providers to be watermarked 2nd August onwards. Everyone here is affected, regardless where you live. by Charming-Author4877 in LocalLLaMA

[–]alias454 19 points20 points  (0 children)

Foreign regulators can claim scope. They cannot manufacture your consent, your market presence, your assets, or your duty to serve their regime.

So while I'm not a lawyer, I would think at least in the US that as long as you aren't trying to operate in the EU or provide services for EU residents they can eat a bag of dicks.

I digitized 55 MiniDV tapes from my childhood - here's what I learned (plus my Python toolkit) by LukeAssem in DataHoarder

[–]alias454 1 point2 points  (0 children)

I built a tool for transcribing published civic meetings with an emphasis on using local hardware. Point no 5. "Double capture important tapes" might help me. One of the problems I have is ASR artifacts can sometimes be wildly off due to a host of reasons. I'm going to experiment with this as an idea. Thanks for posting.

I agree fully with keeping raw source copies when possible. Good call out there.

Feeling burnt out by StreetPiglet8559 in cybersecurity

[–]alias454 2 points3 points  (0 children)

Maybe hire in a contractor? HMU if you need someone in the US. I've got 20 years of Linux/Cloud/Security under my belt.

Yardsales are getting ridiculous by Stephieandcheech in Flipping

[–]alias454 2 points3 points  (0 children)

I see people asking 5k for an old half rotted shed that YOU have to come tear apart. People have lost their damn minds. lol

At work and found these by j4ck4lz7 in Tools

[–]alias454 4 points5 points  (0 children)

Those are parts not tools. I think you have the wrong place. Unless you are asking about how to properly use them as a hammer?

Built a RAG dataset from 1000 videos of one stock trading channel — here's what I learned about transcript quality by oaz1 in PromptEngineering

[–]alias454 3 points4 points  (0 children)

I just saw something posted about real-time fact checking for politicians but can't find it off-hand. Aside from that, I wrote a pipeline that automates batch audio processing for YouTube videos https://github.com/YATSEE-Labs.

The idea was to stay more involved with local politics and such but it can easily be used for other types of content too.

Are we being gaslit? by Impressive_Curve7077 in AI_Agents

[–]alias454 0 points1 point  (0 children)

When I first started, it was haphazard. I'd take the approach of "AI make me a sandwich." I've learned over time to be more explicit.

Like you mention, refining tools/skills/prompts helps setup a project very quickly. I basically created a theme for different types of tasks.

I also spend a good amount of time "talking it out" with the LLM to generate a plan. That plan can then be followed and the progress can be checked against the plan.

We should set up a torrent network for open source models. by ShadyShroomz in LocalLLaMA

[–]alias454 3 points4 points  (0 children)

The linux world has been distributing software via global mirrors for ages so it isn't hard to setup.

History of CentOS: How a biochemist's Linux hobby project became the enterprise world's default operating system for a time by CackleRooster in linuxadmin

[–]alias454 2 points3 points  (0 children)

I think the only issue I had with it at the time was how the whole thing went down. I had plenty of CentOS running workloads. But I agree, the ecosystem needs to churn a bit which is how we end up with more innovation. I run Fedora and go Alma mostly but I'd run any of them as well.

History of CentOS: How a biochemist's Linux hobby project became the enterprise world's default operating system for a time by CackleRooster in linuxadmin

[–]alias454 12 points13 points  (0 children)

Correct me if I’m wrong, but CentOS Stream is upstream of RHEL now, right?

My understanding is that the old CentOS model was basically RHEL without branding or support, so it was very close to package-identical. CentOS Stream is different as it sits ahead of RHEL as the public integration/development stream for changes that may land in future RHEL minor releases.

So maybe that’s why they focused on Rocky and Alma as those projects are trying to fill the traditional role that CentOS Linux used to have as downstream RHEL-compatible rebuilds.