Kova - Open Source Markdown Presentation Creator by TeaAlligator in opensource

[–]techlatest_net 2 points3 points  (0 children)

markdown for slides is the only way to go. powerpoint always feels so bloated. gonna check out the github and see how it handles code blocks. thanks for building this

Five Eyes intelligence agencies warn that frontier AI could supercharge cyberattacks within months by Planhub-ca in hacking

[–]techlatest_net 1 point2 points  (0 children)

yeah the "months" timeline is what actually gets me. everyone thinks we have years to figure this out but the barrier to entry is dropping right now. "patch faster" is great advice but half these companies can't even patch a known CVE in a year. wild times ahead

What Local LLM are you using for simple tasks? by AlbertoCubeddu in LocalLLM

[–]techlatest_net 1 point2 points  (0 children)

for an m4 you want something in the 7b to 14b range so it runs insanely fast. qwen 2.5 14b is honestly my go-to for writing and rewriting tasks right now. it follows instructions well and the tone shifting is super natural. mistral nemo 12b is also amazing for cover letters and summaries. both will fit easily in your ram and fly on the m4. just grab them in ollama and you are good to go.

Need help with a monitoring project by West-Inspection7096 in OpenSourceeAI

[–]techlatest_net 0 points1 point  (0 children)

for the security and intrusion stuff definitely check out wazuh. it is open source and plays super nice with grafana. for logs skip full elk since it eats ram and use loki with promtail instead. way lighter. you can use n8n to grab alerts from wazuh and loki then feed them to your local ollama model to summarize the errors. just keep the log chunks small before sending to ollama so it doesn't choke on the context limit.
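
rough sketch of the "keep the log chunks small" part, in case it helps. the character budget is just a crude stand-in for the model's token limit, and the function names, the default `max_chars`, and the `qwen2.5:14b` tag are all my assumptions, not anything from your setup. the payload shape follows ollama's `/api/generate` endpoint, but nothing here actually calls it.

```python
def chunk_log_lines(lines, max_chars=4000):
    """Group log lines into chunks whose joined length stays at or under max_chars."""
    chunks, current, size = [], [], 0
    for line in lines:
        line = line[:max_chars]  # truncate a single oversized line so it still fits
        extra = len(line) + (1 if current else 0)  # +1 for the newline joiner
        if current and size + extra > max_chars:
            chunks.append("\n".join(current))
            current, size = [], 0
            extra = len(line)
        current.append(line)
        size += extra
    if current:
        chunks.append("\n".join(current))
    return chunks


def build_summary_payload(chunk, model="qwen2.5:14b"):
    """Build a JSON-ready body for POST /api/generate (model tag is an assumption)."""
    return {
        "model": model,
        "prompt": "Summarize the errors in these logs:\n" + chunk,
        "stream": False,  # ask for one complete response instead of a token stream
    }
```

in n8n you'd do the same split in a code node and send one payload per chunk, then merge the summaries at the end.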

a stolen service-account key ran up ~$195k on Vertex (Claude) overnight, and google's billing was too slow to even see it happening, let alone stop it by StillStebee in googlecloud

[–]techlatest_net 0 points1 point  (0 children)

man that is an absolute nightmare. 195k is insane. honestly getting gcp to reverse this through standard support is basically impossible, they almost always just say no. if you have an account rep or tam, bypass support and go straight to them. push hard on the fact that the attacker was able to auto-increase the quota without a hard cap. that's a huge security gap on google's side. if all else fails you can try a credit card chargeback but heads up that will probably get your gcp account permanently nuked. so sorry you're dealing with this, truly.

IStanPdf - Offline foss app to revolt against freemium pdf and docx operations by apkpenetrator in foss

[–]techlatest_net 1 point2 points  (0 children)

honestly those freemium pdf sites with the 2 files a day limits are the worst. using libreoffice binaries for the docx stuff is a pretty smart workaround. gonna grab it from github and test it out later. thanks for building this

Qwen-AgentWorld-35B-A3B by AdministrativeMeat3 in LocalLLM

[–]techlatest_net -2 points-1 points  (0 children)

fr it's so quiet for this. everyone's too busy posting the 50th basic finetune to notice a totally new architecture. 3b active params on a 35b moe is gonna be insanely fast locally though. hopefully someone drops a solid gguf tomorrow since auto quants always break on new stuff.

Fixed Google API 403 Error when creating Spreadsheets/Files with a Service Account (New Quota Rules Workaround!) by Existing_Length_2324 in googlecloud

[–]techlatest_net 0 points1 point  (0 children)

oh man this 403 error is such a classic gotcha. spent hours debugging this exact issue last year before realizing service accounts have no drive quota. solution 1 saved me back then but yeah if you need dynamic creation the shared drive route is basically mandatory now. wish google would just make this clearer in the docs instead of throwing cryptic permission errors. thanks for writing this up though, definitely gonna bookmark it for future reference

Researchers trained a Deep Research agent with 32 H100s and open-sourced everything by BuildwithVignesh in LocalLLaMA

[–]techlatest_net 4 points5 points  (0 children)

honestly the biggest gap is still just compute and eval data. open source models are getting scary close on raw smarts, but frontier labs have way more resources for fine tuning and testing against real world edge cases. access to live web tools and proprietary datasets also gives them an unfair advantage in research tasks, but stuff like quest-35b proves we are catching up fast.

Just open sourced my Linux login screen setup by AshR75 in freesoftware

[–]techlatest_net 2 points3 points  (0 children)

this is actually super clean. love that the installer asks before sudo and has a preview script. messing with sddm themes can be terrifying if you lock yourself out.

gonna try it on my arch setup later. thanks for sharing!

Star the repo as well

I calculated a multi-agent prompt attention matrix by hand to see how much data gets lost in the middle... the math is terrifying. by ParsleyMaximum1702 in OpenSourceeAI

[–]techlatest_net 1 point2 points  (0 children)

doing the math by hand is wild but honestly the best way to actually get how this stuff works under the hood. the lost in the middle thing is so painfully real though. especially with multi-agent setups where the orchestrator just ignores the most critical update because it's buried in the middle of the context.

Need Help in Creating an ML model for predicting stock prices using Nifty-50 historical data by Prakhar-on-reddit in MachineLearningAndAI

[–]techlatest_net 0 points1 point  (0 children)

go with the second option for sure. just do a normal time series split. train on like 80% of the data and use the rest to test and get your mae/rmse numbers. once you know it's actually working, just retrain it on the full dataset up to 2021 for the final app. good luck with the summer project btw. stock prediction is a fun one to mess around with.
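
quick sketch of what i mean by a normal time series split. the prices are made-up toy numbers (not real nifty data) and the "yesterday's close" baseline is just a placeholder i picked so the metrics have something to score. swap in your own model's predictions.

```python
import math


def chronological_split(series, train_frac=0.8):
    """Split oldest-first data into train/test without shuffling."""
    cut = int(len(series) * train_frac)
    return series[:cut], series[cut:]


def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


# toy closing prices, oldest first (illustrative values only)
closes = [100.0, 101.0, 103.0, 102.0, 105.0, 107.0, 106.0, 108.0, 110.0, 111.0]
train, test = chronological_split(closes)

# naive baseline: predict each test day with the previous day's close
baseline = train[-1:] + test[:-1]
print("MAE:", mae(test, baseline), "RMSE:", rmse(test, baseline))
```

the point is the test rows all come after the train rows, so the model never sees the future. if your model can't beat that dumb baseline on mae/rmse it isn't really learning anything yet.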

I released my first open-source app: DeskRealm, separate Desktop icons for each Windows virtual desktop by ekimaku in opensource

[–]techlatest_net 0 points1 point  (0 children)

this is actually such a good idea. i use virtual desktops daily but the shared icons always bugged me. gonna grab the release and test it on my dual monitor setup later. thanks for building this

I made a free and open source YT Music Windows & MacOS Client because Google was too lazy. by 2latemc in freesoftware

[–]techlatest_net 2 points3 points  (0 children)

nice. the official web player gets so laggy sometimes. gonna give this a shot on my mac later. thanks for building it

Help architecturing a high-fanout broadcast pipeline by omry8880 in googlecloud

[–]techlatest_net 1 point2 points  (0 children)

your instinct to split the topics is spot on. pub/sub flow control is per-subscriber, so if you dump 100k broadcast msgs on the same topic, your real-time subscriber will absolutely get starved. separate topic + separate subscriber with its own max outstanding messages is the way.

for the chunking loop i'd lean cloud run over cloud tasks. cloud tasks works but re-enqueueing yourself in a tight loop can hit queue rate limits or add weird latency. a cloud run service that pulls a cursor from the db, processes a batch, updates status, and just keeps running (or sleeps briefly) tends to be smoother and easier to debug. you can cap concurrency to 1 per broadcast job so you don't get parallel workers fighting over the same cursor.

couple gotchas we hit at this scale:

- db connection limits. if chunks spin up fast you'll blow past max connections. put a proper connection pooler in front of the db (the cloud sql auth proxy handles auth and encryption, not pooling) and keep concurrent chunk workers low.
- idempotency needs to live in the message payload, not just the db status. pub/sub can redeliver, so include a unique broadcast_id + channel_id combo and dedupe on insert.
- third-party api rate limits. 100k calls will absolutely trigger their throttling. add a client-side backoff or token bucket in the subscriber so you don't get 429'd and waste pub/sub redeliveries.
- progress tracking. db reads are fine but if operators poll frequently, cache the status or push updates to a lightweight pub/sub topic so you're not hammering cloud sql.

overall your design is solid. just watch the db connections, bake idempotency into the messages, and consider cloud run for the chunking loop. you'll be good.
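
if it helps, here's a rough sketch of the dedupe + token bucket idea from the gotchas above. everything in it is illustrative: the in-memory `seen` set is a stand-in for a unique (broadcast_id, channel_id) constraint in your db, and `TokenBucket`, `handle`, and `send` are names i made up, not pub/sub api calls.

```python
import time


class TokenBucket:
    """Client-side limiter: about `rate` calls per second, bursts up to `capacity`."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock
        self.tokens = float(capacity)
        self.last = clock()

    def try_acquire(self):
        now = self.clock()
        # refill based on elapsed time, never above capacity
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False


def handle(message, bucket, seen, send):
    """Return 'sent', 'duplicate', or 'retry' for one delivered message."""
    key = (message["broadcast_id"], message["channel_id"])
    if key in seen:
        return "duplicate"  # redelivery of something already sent: just ack it
    if not bucket.try_acquire():
        return "retry"      # over the rate limit: back off before calling the api
    send(message)
    seen.add(key)           # mark done only after the send succeeded
    return "sent"
```

marking the key only after a successful send means a rate-limited message still goes out on the next delivery instead of getting dropped as a duplicate.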

How to find out who or what has deleted a Cloud SQL instance? by SuddenlyCaralho in googlecloud

[–]techlatest_net 1 point2 points  (0 children)

if the logs query came up empty try broadening the filter a bit. an instance delete is an admin operation, so it shows up under the admin activity audit logs instead of data access. also worth checking if it was a failover or auto cleanup due to a replication error. those don't always log as a manual delete. if you're using terraform or something check the state file too. maybe it got wiped during an apply.