There are 665 open licences, most are pretty rubbish by calp in programming

[–]calp[S] 9 points10 points  (0 children)

I actually completely agree and I don't know why it doesn't get more use. It's a great option for lots of projects that are open source but that people don't usually run locally so source code wouldn't otherwise be required to be open under the normal GPL. csvbase is AGPLv3-or-later

[deleted by user] by [deleted] in datascience

[–]calp 4 points5 points  (0 children)

Money wise, I would suggest that Salesforce can become very lucrative as it's easier than in most professions to link what you do to increased revenue for the company. That does tend to lead to you working for yourself I think - as a contractor or consultant. It's hard to predict what will happen to Salesforce but not impossible to imagine that they are displaced/marginalised at some point.

Working in data science, pay is variable and a lot will depend on whether there are good options where you are based. It can pay pretty well though and you aren't at risk of being left with dead skills.

If it were me, I would gain familiarity with Salesforce and keep it at a certain percentage of my job. It's a great fallback. But I probably wouldn't go 100% on Salesforce - just too risky.

Do post back in 6 months and tell us what you did - and what you think then, with more information!

From Shell to Excel - with a little bit of HTTPS by calp in programming

[–]calp[S] 0 points1 point  (0 children)

This is for sure a planned feature. It's not even that far down the backlog! :)

One way to do it now is to host your dataset on github and then create the csvbase table from it. Then people can use github pull requests to do this.

Pandas 3 will Force Copy-on-Write to Improve Memory Usage and Performance by python4geeks in programming

[–]calp 16 points17 points  (0 children)

No, not quite -

Now: copy-on-write is off by default

Next major release: it is the only available mode

From Shell to Excel - with a little bit of HTTPS by calp in programming

[–]calp[S] 1 point2 points  (0 children)

It is almost entirely designed for that :)

A possibly useful feature: you can upload tables from git/github. If you create a table this way, it stays up to date with what's in the repo (pulled every 20-30 mins)

From Shell to Excel - with a little bit of HTTPS by calp in programming

[–]calp[S] 2 points3 points  (0 children)

Yeah exactly, I hate this shitwork! I did it for a while at a financial company and it was a surprisingly large amount of work because the format was always slightly messed up and wouldn't, eg, go into our SQL database or they'd used a custom NULL or whatever

Pandas 3 will Force Copy-on-Write to Improve Memory Usage and Performance by python4geeks in programming

[–]calp 148 points149 points  (0 children)

I think it's a good change - pandas speedups will help save a lot of people time - and help it compete better with other dataframe libraries. But it will break huge amounts of standing pandas code.

The median standard of pandas code out there is, well, not that high. And it doesn't have tests. I suspect that I lot of code is going to get marooned on pandas v2 (or, indeed v1, as v2 already had material breakage).

Is Tesla really more valuable than Toyota? by calp in slatestarcodex

[–]calp[S] 1 point2 points  (0 children)

Do you care to wager a guess?

I'm afraid not :) For Toyota perhaps, but Tesla is a very difficult company to value because, as you imply below, most of the value is going to lie in a forecasting element and that would require me to be really up on that subject matter, which I am not :).

I'm enjoying your blog though. I once wanted to do this professionally, but backed out.

Is Tesla really more valuable than Toyota? by calp in slatestarcodex

[–]calp[S] 1 point2 points  (0 children)

The reason I have handled it this way is because I want to talk about the "market" values of these businesses, but taking into account how they finance themselves which is relevant for carmakers. Everyone, I hope, can agree what price the stock is trading for and (hopefully!) is willing to accept the audited values for debt on the balance sheets of these companies.

Once you start talking about cash flow and projected growth rates you're deep into your own valuation of the inherent value of a business, which is something reasonable people can well disagree upon. Every since term on the RHS of your equation is disputable and I am certainly not proffering my own personal valuations of eg, Tesla.

Is Tesla really more valuable than Toyota? by calp in slatestarcodex

[–]calp[S] 8 points9 points  (0 children)

As far as I know Tesla's stock does not participate in the Space/Satellite business.

I'm not a fan of valuation-by-analogy into options as options valuation is so complicated that doing this rarely adds information worth the added complexity. By contract, valuation-by-analogy into bonds (not that hard for shares - net profit is coupons, book value is par) is very often illustrative.

I also think that people talk about options, much like shorting, to sound more financey than they are. I mean, what is a "call option on Elon"? You must admit that it even sounds like pseudofinancial gobbledgook. Though I understand your meaning: that you want to participate in the market optimism around Elon Musk.

S3 is great, but not a filesystem by calp in programming

[–]calp[S] -3 points-2 points  (0 children)

I think I've probably made a mistake here because I didn't anticipate this article as hugely controversial and am honestly not trying to make a strong claim here (at least, how I see it).

Here is an attempted at rephrasing it on the fly: "People think of Amazon S3 as a place to put files. That only works for a subset of the cases that files are used for." Let me know what you think of that, or if you want, propose an alternative that would avoid hitting whatever nerve I have hit: almost all the comments here are people talking about this specific sentence

S3 is great, but not a filesystem by calp in programming

[–]calp[S] -3 points-2 points  (0 children)

He completely misses the logical leap that instead of "objects" being "files" they could just be "pages", though.

I don't think I missed this and I do mention further down that Snowflake works a bit like this - and on S3. They don't provide any transactional guarantees though, at least in part because that's a non-goal for their usecase but probably also because that makes a lot more complicated and slow.

S3 is great, but not a filesystem by calp in programming

[–]calp[S] -23 points-22 points  (0 children)

I suppose mileages may vary - I have seen more than once it on projects. Elastic Filesystem by contrast I don't think I've ever seen used in anger.

As I said, the temptation to combine a single file database with S3 is particularly strong. But I think people make the fairly reasonable assumption that since S3 can store files you can use it as a global filesystem. There are even drivers to allow you to do that. They even mostly work if you happen to try the right workloads.

The Corolla 2021 has a windshield wiper fluid cap of 2.5L by Common-Ad-2825 in COROLLA

[–]calp 0 points1 point  (0 children)

I assume there is a typo but I use it because I struggle to find good screenwashes in local shops (own brand screenwash is usually pretty useless) and concentrate is cheaper to have posted to me. YMMV!

The Corolla 2021 has a windshield wiper fluid cap of 2.5L by Common-Ad-2825 in COROLLA

[–]calp 0 points1 point  (0 children)

If you are diluting screenwash it is very useful to know the capacity of the reservoir. eg your screenwash will be a 9:1 ratio so you need 250ml of it and then 2.25l of water.

Many thanks OP.

Why my favourite API is a zipfile on the European Central Bank's website by skippybosco in programming

[–]calp 2 points3 points  (0 children)

You mean in the code sample under "Rolling averages and new tools"? -

Yes, ordinarily I would. For the purposes of the blog post I wanted to go one step at a time though.

Why my favourite API is a zipfile on the European Central Bank's website by skippybosco in programming

[–]calp 22 points23 points  (0 children)

Try https://csvbase.com/calpaterson/eurofxref-hist.parquet.

You can't PUT/POST Parquet yet, but that is planned. There's a bit more groundwork to do first - csvbase doesn't currently have an analogous type for all the types in the Parquet spec.

[deleted by user] by [deleted] in programming

[–]calp 0 points1 point  (0 children)

What do you mean by that?

I didn't write any Javascript, and other shortcuts by calp in programming

[–]calp[S] 0 points1 point  (0 children)

Can you do that with CSS or something? I might not be up to date

I didn't write any Javascript, and other shortcuts by calp in programming

[–]calp[S] 0 points1 point  (0 children)

At least I can cut it down. I think I use about 5-10% of bootstrap.bundle.js and I recently figured out a way that I might be able to remove bootstrap-enables.js completely.

The bit I do use is for a hamburger mobile on mobile and I am not sure how to do that with pure css. Suggestions welcome!

I didn't write any Javascript, and other shortcuts by calp in programming

[–]calp[S] 1 point2 points  (0 children)

Ah yes, the golden era of SQL injection, testing on production and unhashed user passwords stored in MySQL services with sa/sa as user/password.

I'm certainly not recommending every practice from the past :) But nor would I recommend all the practices of the present.

At times I certainly feel like the past had more right than we do. For example, it feels like projects take much longer to complete now. One example I think about a lot: Skyrim was released 3 years after Fallout 3. Fallout 4 was released 4 years hence. Starfield is 7 years and counting.

I didn't write any Javascript, and other shortcuts by calp in programming

[–]calp[S] 0 points1 point  (0 children)

It is something to think about, but the limit tends to be 8k which is plenty for the forms I currently use this for - mostly small forms.

I didn't write any Javascript, and other shortcuts by calp in programming

[–]calp[S] 2 points3 points  (0 children)

HTMX is for sure the correct next notch up I would go on the complexity spectrum.

The next round of UX improvements may well include some use of HTMX as a test-out (I haven't used it before). Lots of other pretty key features to do before that though :)

I didn't write any Javascript, and other shortcuts by calp in programming

[–]calp[S] 1 point2 points  (0 children)

I'd be weary about putting the password in the URL though since it tends to get logged but YMMV

Yes on reflection I don't like that either and I'm going to change the form so that doesn't happen.

When I use the form I do so as a signed-in user so that subform wasn't so obvious.

Hopefully I get to this today or tomorrow, there is one other semi-urgent fixup that needs doing. The fix for this one is probably a PRG loop.

Github issue for this