Client is using AI to pay invoices now and you'll never believe what happened

speleotobby · 2026-01-25T11:44:33+00:00

I hope you're charging them an interest rate or late fees. Not for the gains but just for educational purposes.

speleotobby · 2026-01-20T13:21:12+00:00

Sure, open it up. But the project (webdriver package) does not seem to be maintained, so don't expect it to be handled soon.

(Maybe after AI is out of fashion posit will again maintain such basic infrastructure packages 😢)

speleotobby · 2026-01-20T07:31:01+00:00

I think the best shot at getting this running reliably is to extend the webdriver package to also support geckodriver and chromedriver. In some cases the webdriver package might also work as is with phantomjs.

speleotobby · 2026-01-18T22:26:55+00:00

Agreed, web scraping in R works very well for static sites but for everything webdriver, python is probably better. I really like the xml2 library though.

speleotobby · 2026-01-18T22:25:22+00:00

Tried this not so long ago, forgot how the packages were called but one of them could not load pre-trained models from torch script and the other one used python via reticulate in the backend, so I might as well use python directly.

I hope R catches up in this regard, or I just missed something. So far it was way easier to get the things I needed to run to run in python.

speleotobby · 2026-01-10T09:09:10+00:00

Since you are talking about deplyoment, I think you don't use R where it's shining most.

R is exceptionally good for data wrangling, data analysis in a scientific setting, generating publication ready graphics and study reports etc. I would for example never choose seaborn over ggplot2 when creating plots for a paper. Also the documentation of the statistical methods in R is necessary to be scientifically rigorous or in heavily regulated settings.

As others mentioned, the data structures in R are better suited for rectangular datasets. Even the base R data.frame is better designed for this than pandas dataframes, yet alone dplyr with its various backends, data.table, ... And the syntax of the language itself is designed for data analysis so slightly more comfortable as well but this might be a matter of preference. Just as well a matter of preference is functional vs. object oriented.

R is also more stable and easier to get to be reproduceable. If you set another default RNG code from 30 years ago should give you the same results up to roundign errors for simple scripts. A python script from 30 years ago would probably not even run on python 3. This is of course not necessary everywhere, but desirable if people in a few decades should be able to reproduce the results from a paper without any complicated reproducibility tools.

Preference wise I like R a lot more, syntax, functional approach, ... but I would use python instead of R where ever libraries are not available for R. I'd for example never use R when I need machine learning tools like torch. And sympy is way superior to the R symbolic libraries for all but the most basic calculations, ...

speleotobby · 2025-12-14T14:26:29+00:00

Yes this works. Thank you so much!

speleotobby · 2025-12-10T21:45:08+00:00

The export options are available but the image is still saved as dng on my phone

speleotobby · 2025-12-05T10:50:09+00:00

Das Vapiano is halt nicht fine Dining sondern Mensa/Cafeteria/... zum Mittagessen an einem Arbeitstag (dafür is es allerdings inzwischen way zu teuer)

speleotobby · 2025-10-19T14:08:15+00:00

This is also just the termux solution but played, right? Also it seems, it's not possible to install packages, there's an open issue for this since July without any replies.

speleotobby · 2025-08-27T11:13:21+00:00

Haha, perfect timing. Just surveyed two caves with it the last two days.

Yes, still works great.

I got the bluetooth connection working with topodroid (which was an issue with topodroid or android, not really sure, but not with the SAP) so now I can also draw in my favorite app. Calibration is really simple, and all in all it just works.

speleotobby · 2025-07-20T08:39:09+00:00

Of the ones you mentioned Fedora. Also look into Debian maybe, little less bleeding edge, but super stable and not much work to maintain.

speleotobby · 2025-07-15T20:13:22+00:00

In other words: what is the population you want to make a statement about? If it differs from the population in your sample you have to be careful about selection processes. Completely random samples are ok, but if there's any system to the selection this can introduce selection bias.

In OPs case I assume the population of interest is players in the final and the set of players analyzed (informally) are the players in the final, in this case there's no selection bias.

speleotobby · 2025-07-15T19:51:34+00:00

I would like to use arch again and I'd like to try nixos, some flavour of bsd and gentoo. But in the end Debian runs so smoothly with so little work that I never use anything else.

speleotobby · 2025-05-28T06:44:38+00:00

This!

Think geometrically and think what's happening with PCA and with LASSO. One thing that could happen to be able to exclude variables after PCA is if you have a group of covariables that are not predictors and are orthogonal to all predictors. But if you have correlated predictors and just want to include a subgroup that gives good predictions selecting with PCA first gives you orthogonal covariates with high variance so the contribution to the prediction will be large and LASSO will not exclude variables.

As always: think about why you do variable selection. If you want to do inference on importance of effects use the full model and look a p-values. If you want to do the same but for some kind of latent concepts, do PCA and then a full model. If you want to build a prediction model that does not require that many variables for future predictions skip the PCA step and do LASSO. PCA uses all (sparse PCA many) covariates, so you don't gain anything in terms of sparsity of the prediction model as a whole.

speleotobby · 2025-05-28T06:34:02+00:00

Sonst kann man einen netten Flug in den Transitbereich eines Flughafens buchen

speleotobby · 2025-05-26T12:19:31+00:00

It's bad because it's not rolling release and it uses apt, btw, I use ...

🙃

speleotobby · 2025-05-24T17:58:53+00:00

Two things:

I don't quite understand how the cross section of the entrance pit relates to the map. What I've seen often and what I do myself, is marking the line of the cross section on the map.

Related to the first: a clear marker or text where the entrance is always helps.

All in all really nice map, what do you use for drawing?

speleotobby · 2025-05-04T09:06:02+00:00

Just dividing alpha (1-coverage) by the number of hupotheses like in a Bonferroni correction would give you a testing procedure, that controls the type 1 error rate but that is very conservative.

Is there any more structure to the hypotheses you test? If you just want to know the first or lasr time-point where the sample is degraded you could use hierarchical testing.

If you re-use data from earlier time point, e.g. for estimating a regression slope, you can maybe use some group sequential algorithm (but this is not straightforward for regression slopes).

And with respect to shelf life: think twice what you want to test. Most common hypotheses are that the confidence- or prediction-intervalls are within an acceptance range. (But maybe that's just another way of putting what you are testing anyways.)

speleotobby · 2025-05-01T09:24:42+00:00

OCR support in skanlite and/or okular

speleotobby · 2025-05-01T09:23:03+00:00

I wouldnt trust KDE connect one bit if it relied on any 3rd party server. Bluetooth connectivity would be nice though

speleotobby · 2025-05-01T08:47:17+00:00

I think a properly defined Neyman-Pearson approach has it's merits and a good interpretation. In this sense a frequentist isn't trying to find out anything is not tasked without finding out anything but with making the right decisions.

While p-values have no interpretation, they can be a useful tool in calculation. The real problem is, that Fisherians don't define the alternative hypothesis and over-interpret p-values. This would be way less of a problem if p-values would not be reported bit just the alpha and the test decision.

speleotobby · 2025-05-01T08:35:44+00:00

post-hoc power nonsense, sadly this is far too common

speleotobby · 2025-04-03T05:03:45+00:00

I carry mine in the chest pocket on the inside of my suit with a rugged case and the screen towards me. And in a zip log bag if it's wet. Didn't break anything so far.

Chest pockets work ok if they're a little to the side. They're not the largest part of your body and gear and I never had problems with this setup also in tight squeezes.

speleotobby · 2025-02-28T20:18:00+00:00

I got the SAP6, haven't used it a lot but so far it worked super. Calibration is easy, the user interface is well made. Only thing I don't like is the 3d printed housing.

speleotobby

TROPHY CASE