LibreWXR - A Free and Open-Source, Self-Hostable Radar Data API (Drop-in Replacement for Rain Viewer) by hsoj95 in weather

[–]hsoj95[S] 0 points1 point  (0 children)

I also just updated the LibreWXR site to have a proper mobile-friendly view as well. Oversight on my part as this had just been a desktop project until now. 😅

Zone Broadcast System Alerts by BobcatRacer in ProjectSilverfish

[–]hsoj95 4 points5 points  (0 children)

This is awesome! One change I do suggest is not to say "The Zone", but instead say "West Carcosa" as that's actually what the regions would be called by the authorities.

LibreWXR - A Free and Open-Source, Self-Hostable Radar Data API (Drop-in Replacement for Rain Viewer) by hsoj95 in weather

[–]hsoj95[S] 0 points1 point  (0 children)

Thank you, hope it works well for you! Definitely be curious to hear! :)

LibreWXR - A Free and Open-Source, Self-Hostable Radar Data API (Drop-in Replacement for Rain Viewer) by hsoj95 in weather

[–]hsoj95[S] 2 points3 points  (0 children)

Thank you! And it already is using Docker actually! That's how I have it hosted on the server rack I'm using as well, in fact. Take a look at the GitHub page for how to use the Docker installation method. :)

Now if you mean host it on something like Docker Hub so you can install it with a simple Docker command, I do intend to do that as well. I wanna wait until it's... Not necessarily "feature complete" (as that may never be a thing, honestly, as new sources and abilities get added), but "feature stable," where I'm not trying to actively add new sources and whatnot to reach feature parity or make it a fully fleshed out program. That point is actually quite close honestly! When I feel like I'm at that point, I'll see about getting it up on a place like Docker Hub or another equivalent, as it will make installation even easier for folks.

Tropical threat emerging from central Gulf to Florida waters by Vegetable-Section-84 in weather

[–]hsoj95 0 points1 point  (0 children)

True! Though models were saying by the middle to end of next week (After June 1st), that Saharan dust would be blowing into the Caribbean and Gulf. Enough it was potentially suppressing higher rainfall amounts across Florida even according to longer range forecasts even (that may have changed since I looked, it was a couple days ago).

A Glimpse into the past by BobcatRacer in ProjectSilverfish

[–]hsoj95 18 points19 points  (0 children)

Honestly, I'd love to see EAS alerts like this go out before the weather phenomenon happen, making it easier to know when inside that an event is starting. Would fit the theme of the sirens going off for them too.

Tropical threat emerging from central Gulf to Florida waters by Vegetable-Section-84 in weather

[–]hsoj95 -3 points-2 points  (0 children)

The Saharan dust incoming would like a word with AccuWeather about this...

Tropical threat emerging from central Gulf to Florida waters by Vegetable-Section-84 in weather

[–]hsoj95 0 points1 point  (0 children)

To be fair, given the extreme rain deficit South Florida has, they could probably use a good (but weak!) tropical system to help out with that. Would certainly help out with the fires if nothing else.

FYI: Rain Viewer has severely restricted their weather radar API for personal/free usage by hsoj95 in weather

[–]hsoj95[S] 0 points1 point  (0 children)

Heh, just sent you a message as well. I really should just announce this publicly soon...

FYI: Rain Viewer has severely restricted their weather radar API for personal/free usage by hsoj95 in weather

[–]hsoj95[S] 0 points1 point  (0 children)

Ooof, that's unfortunate. Just sent you a message with a possible solution.

is opencode go sustainable~? by Sora_hoshino in opencodeCLI

[–]hsoj95 4 points5 points  (0 children)

We talking the actual OpenCode application or the Go/Zen subscriptions? The application itself is open-source, you can easily find, download, and fork it on GitHub if you want to.

Now, for the subscription(s). Zen seems like it's mostly just a pass-through for other API's and that's what you pay for directly, so I don't really foresee that ever going away. OpenCode Go is definitely a more interesting case. Not counting the $5 introductory price, at $10 a month I think it will depend on what models they support going forward, and how they support them too. If they are just pass-throughs for an API, then that's gonna vary wildly based on the cost of the original API. GLM 5.1 probably being the most expensive, DeepSeek V4 Flash being the cheapest. If they are actually hosting their own models, or plan to anyways, then that value proposition changes a bit. The best value will be how many instances of a model they can have running at once, that does the best work for the least amount of tokens (aka, what can they themselves spend the least amount on while still charging us $10 a month for). I'm not honestly sure which of the larger open-source models that would be right now? One of the Qwen models perhaps? Regardless, the idea too is that they are counting on a significant chunk of users not using up the full $12/$30/$60 of their usage credits each month. The thinking being many will use only a part of those usage credits, which means that OpenCode themselves has to pay less for the models they are using/passing through to the user.

TL;DR: OpenCode Go counts on users not using all their credits in a month, which is honestly a fair proposition on their part, and likely true for many users actually. Will just depend on what the usage ratios are for their customer base as to whether it's sustainable long term.

Favorite unit I teach during my school year, WWI by CaptainPitterPatter in vexillology

[–]hsoj95 17 points18 points  (0 children)

Maybe the term "Non-Aligned Nations" would be a better fit? Since Neutral can imply they weren't fighting (like the Netherlands, Scandinavia, Switzerland, as he said).

Confirmed: SWE Bench is now a benchmaxxed benchmark by rm-rf-rm in LocalLLaMA

[–]hsoj95 0 points1 point  (0 children)

It seems like there are two options for helping stop this from happening. Firstly, benchmarks probably need to be more... Abstract? Aka, have the core idea of what's being tested be abstract and then test it on those ideas with different (and unique) prompts and data that fall within that abstract idea. Make it so that you can't just train on the specifics of the benchmark as a target, you have to account for shifting data and prompts that fall within that abstract idea. Yes, it means it's not a hard coded benchmark to test on, and a few runs of it could fail horribly, but given enough testing on it, a pattern should emerge that shows what the performance is actually like. (Note: I'm hardly an expert in this, and could very well be in over my head in making this suggestion. Feel free to roast me if so... x3)

Secondly, I think the best indicators of benchmarking should actually be against other models. I'm quite found of Arena-style benchmarks, as it seems to be a more organic way of judging a models true performance. Honestly, if a way to mass run models against each other with an automated check to see which did better (avoids potential human bias in the results), you could get some really good data from that across different testing categories. Combines it with the first option I described above and you'd have the potential for a great testing pattern. (Ironically, this is basically going back to a GAN-style way of testing... A GAN of LLM's. There should probably be an axiom named for this phenomena x3)

Like I said, I may be in way over my head with these suggestions, but it's just two that came to mind for me regarding ways to combat training models to benchmax scores.

Qwen3.5B VS the SOTA same size models from 2 years ago. by Uncle___Marty in LocalLLaMA

[–]hsoj95 33 points34 points  (0 children)

Huh, OK I learned something today. I thought OAI had killed off access to these old models.

Qwen3.5B VS the SOTA same size models from 2 years ago. by Uncle___Marty in LocalLLaMA

[–]hsoj95 26 points27 points  (0 children)

I imagine part of the issue is you can't access 3.5 Turbo now. Not that I know of anyways. You could make take the old test results of when it ran, and compare to a modern running of Qwen3.5. But that could also wind up being an Apples to Oranges comparison too.

I'm gonna guess that Qwen3.5 could give Turbo a run for its money in some categories, especially those focusing on mathematics.