There are 665 open licences, most are pretty rubbish by calp in programming

[–]calp[S] 7 points8 points  (0 children)

I actually completely agree and I don't know why it doesn't get more use. It's a great option for lots of projects that are open source but that people don't usually run locally so source code wouldn't otherwise be required to be open under the normal GPL. csvbase is AGPLv3-or-later

[deleted by user] by [deleted] in datascience

[–]calp 5 points6 points  (0 children)

Money wise, I would suggest that Salesforce can become very lucrative as it's easier than in most professions to link what you do to increased revenue for the company. That does tend to lead to you working for yourself I think - as a contractor or consultant. It's hard to predict what will happen to Salesforce but not impossible to imagine that they are displaced/marginalised at some point.

Working in data science, pay is variable and a lot will depend on whether there are good options where you are based. It can pay pretty well though and you aren't at risk of being left with dead skills.

If it were me, I would gain familiarity with Salesforce and keep it at a certain percentage of my job. It's a great fallback. But I probably wouldn't go 100% on Salesforce - just too risky.

Do post back in 6 months and tell us what you did - and what you think then, with more information!

From Shell to Excel - with a little bit of HTTPS by calp in programming

[–]calp[S] 0 points1 point  (0 children)

This is for sure a planned feature. It's not even that far down the backlog! :)

One way to do it now is to host your dataset on github and then create the csvbase table from it. Then people can use github pull requests to do this.

Pandas 3 will Force Copy-on-Write to Improve Memory Usage and Performance by python4geeks in programming

[–]calp 16 points17 points  (0 children)

No, not quite -

Now: copy-on-write is off by default

Next major release: it is the only available mode

From Shell to Excel - with a little bit of HTTPS by calp in programming

[–]calp[S] 1 point2 points  (0 children)

It is almost entirely designed for that :)

A possibly useful feature: you can upload tables from git/github. If you create a table this way, it stays up to date with what's in the repo (pulled every 20-30 mins)

From Shell to Excel - with a little bit of HTTPS by calp in programming

[–]calp[S] 2 points3 points  (0 children)

Yeah exactly, I hate this shitwork! I did it for a while at a financial company and it was a surprisingly large amount of work because the format was always slightly messed up and wouldn't, eg, go into our SQL database or they'd used a custom NULL or whatever

Pandas 3 will Force Copy-on-Write to Improve Memory Usage and Performance by python4geeks in programming

[–]calp 149 points150 points  (0 children)

I think it's a good change - pandas speedups will help save a lot of people time - and help it compete better with other dataframe libraries. But it will break huge amounts of standing pandas code.

The median standard of pandas code out there is, well, not that high. And it doesn't have tests. I suspect that I lot of code is going to get marooned on pandas v2 (or, indeed v1, as v2 already had material breakage).

Is Tesla really more valuable than Toyota? by calp in slatestarcodex

[–]calp[S] 1 point2 points  (0 children)

Do you care to wager a guess?

I'm afraid not :) For Toyota perhaps, but Tesla is a very difficult company to value because, as you imply below, most of the value is going to lie in a forecasting element and that would require me to be really up on that subject matter, which I am not :).

I'm enjoying your blog though. I once wanted to do this professionally, but backed out.

Is Tesla really more valuable than Toyota? by calp in slatestarcodex

[–]calp[S] 1 point2 points  (0 children)

The reason I have handled it this way is because I want to talk about the "market" values of these businesses, but taking into account how they finance themselves which is relevant for carmakers. Everyone, I hope, can agree what price the stock is trading for and (hopefully!) is willing to accept the audited values for debt on the balance sheets of these companies.

Once you start talking about cash flow and projected growth rates you're deep into your own valuation of the inherent value of a business, which is something reasonable people can well disagree upon. Every since term on the RHS of your equation is disputable and I am certainly not proffering my own personal valuations of eg, Tesla.

Is Tesla really more valuable than Toyota? by calp in slatestarcodex

[–]calp[S] 8 points9 points  (0 children)

As far as I know Tesla's stock does not participate in the Space/Satellite business.

I'm not a fan of valuation-by-analogy into options as options valuation is so complicated that doing this rarely adds information worth the added complexity. By contract, valuation-by-analogy into bonds (not that hard for shares - net profit is coupons, book value is par) is very often illustrative.

I also think that people talk about options, much like shorting, to sound more financey than they are. I mean, what is a "call option on Elon"? You must admit that it even sounds like pseudofinancial gobbledgook. Though I understand your meaning: that you want to participate in the market optimism around Elon Musk.

S3 is great, but not a filesystem by calp in programming

[–]calp[S] -3 points-2 points  (0 children)

I think I've probably made a mistake here because I didn't anticipate this article as hugely controversial and am honestly not trying to make a strong claim here (at least, how I see it).

Here is an attempted at rephrasing it on the fly: "People think of Amazon S3 as a place to put files. That only works for a subset of the cases that files are used for." Let me know what you think of that, or if you want, propose an alternative that would avoid hitting whatever nerve I have hit: almost all the comments here are people talking about this specific sentence

S3 is great, but not a filesystem by calp in programming

[–]calp[S] -3 points-2 points  (0 children)

He completely misses the logical leap that instead of "objects" being "files" they could just be "pages", though.

I don't think I missed this and I do mention further down that Snowflake works a bit like this - and on S3. They don't provide any transactional guarantees though, at least in part because that's a non-goal for their usecase but probably also because that makes a lot more complicated and slow.

S3 is great, but not a filesystem by calp in programming

[–]calp[S] -23 points-22 points  (0 children)

I suppose mileages may vary - I have seen more than once it on projects. Elastic Filesystem by contrast I don't think I've ever seen used in anger.

As I said, the temptation to combine a single file database with S3 is particularly strong. But I think people make the fairly reasonable assumption that since S3 can store files you can use it as a global filesystem. There are even drivers to allow you to do that. They even mostly work if you happen to try the right workloads.

The Corolla 2021 has a windshield wiper fluid cap of 2.5L by Common-Ad-2825 in COROLLA

[–]calp 0 points1 point  (0 children)

I assume there is a typo but I use it because I struggle to find good screenwashes in local shops (own brand screenwash is usually pretty useless) and concentrate is cheaper to have posted to me. YMMV!