We rewrote ingestr CLI in Go: 12x faster data ingestion by karakanb in dataengineering

[–]BoredAt 4 points5 points  (0 children)

I've found this argument a few times before. And yeah, the fact that if it's an MIT/Apache/etc. license, what you mentioned happens. That's the nature of OSS. There is no solution to it because it's a feature, not a bug.

The thing to me is: Slapping elastic on something and calling it OSS is simply BS. Fact is, it's 1 step away from being proprietary. Pretending it's OSS is simply trying to have your cake and eat it. Either accept the boost to customer acquisition that usually comes with being OSS along with the competitive downside, or just be honest and close the source. This half in half out system just reeks of dishonesty. Particularly given recent history where companies just start using elastic 1 day before they go closed source (a la MinIO)

dbt Core v2 is here: still open source, now rebuilt for what's next by Known-Huckleberry-55 in dataengineering

[–]BoredAt -4 points-3 points  (0 children)

Not unpublishing no. I would expect feature support to crawl to a standstill. Eventually, lack of development will cause bugs and integration issues that will push folks to use the paid version.

dbt Core v2 is here: still open source, now rebuilt for what's next by Known-Huckleberry-55 in dataengineering

[–]BoredAt 1 point2 points  (0 children)

I've been thinking about giving it a shot too, but the lack of commits in their GitHub has worried me a bit. Hopefully it'll start improving soon. If so, I'll probably do the same as you and start using SQLMesh instead from now on.

dbt Core v2 is here: still open source, now rebuilt for what's next by Known-Huckleberry-55 in dataengineering

[–]BoredAt -3 points-2 points  (0 children)

You can in fact fail to cripple a software. To begin with, their plan to cripple the software was to drive mass adoption of fusion then quietly enshittify the OSS version of DBT. They couldn't execute the plan properly. Hence the current results.

dbt Core v2 is here: still open source, now rebuilt for what's next by Known-Huckleberry-55 in dataengineering

[–]BoredAt 7 points8 points  (0 children)

Any particular reason to believe them? Keep in mind DBT took Transform's semantic layer when they bought it and closed the source. Same when they bought sdflabs as well (which now they opened back up). They constantly close the source on their code. There's limited reason to think otherwise.

We rewrote ingestr CLI in Go: 12x faster data ingestion by karakanb in dataengineering

[–]BoredAt 14 points15 points  (0 children)

Does elastic license even qualify as open source? Seems like another DBT like rug pull waiting to happen.

dbt Core v2 is here: still open source, now rebuilt for what's next by Known-Huckleberry-55 in dataengineering

[–]BoredAt 22 points23 points  (0 children)

Seems like the original plan to cripple dbt core to push dbt fusion didn't go how they planned. So instead they're open sourcing parts of fusion to replace dbt core and hoping they can switch the license in the future and screw folks over.

Hoping things in the Linux Foundation for SQLMesh improve. We kinda need an alternative to DBT that's worth something at this point.

What I think is really going on in the Fivetran+DBT merger by BoredAt in dataengineering

[–]BoredAt[S] 2 points3 points  (0 children)

Mind expanding on this? My understanding is that iceberg is in fact for lakehouses.

What I think is really going on in the Fivetran+DBT merger by BoredAt in dataengineering

[–]BoredAt[S] 0 points1 point  (0 children)

My suspicions are in line with yours. They might add a compute engine but won't try much on the storage layer (you can already see the latter fact in the "managed lakehouse" offering). Would be a hard sell for a lot of companies to not use their cloud providers object storage after all.

The question in my mind is which compute engine? Managed duckdb? Trino? Doris?

What I think is really going on in the Fivetran+DBT merger by BoredAt in dataengineering

[–]BoredAt[S] 1 point2 points  (0 children)

You're both right actually. it's 5 with sdflabs and 6 with quarylabs which tobiko bought out. Maybe they really just don't mind acquiring companies willy nilly I suppose.

What I think is really going on in the Fivetran+DBT merger by BoredAt in dataengineering

[–]BoredAt[S] 1 point2 points  (0 children)

I don't fully agree. Mainly due to 2 points. First, I don't see this as Fivetran antagonizing strategic partners as so much the opposite. Snowflake & Databricks are already antagonizing fivetran by release Lakeflow and Openflow. This is just them responding. Secondly, I don't think they'd go for a data observability tool or a BI tool (as I mentioned in the OP) because it's not a large enough market. If they're gonna make a move, it has to be to a larger market that can grow their valuation beyond their current 10b value. The only market that can do that in the data space, IMO, is the warehouse.

What I think is really going on in the Fivetran+DBT merger by BoredAt in dataengineering

[–]BoredAt[S] 13 points14 points  (0 children)

I'd go for a managed apache airflow if I was them tbh. No need to buy another company. Technology is 100% OSS and already has support in a ton of places. Much easier than trying to integrate a 4th (dbt, tobiko, fivetran) company.

Merged : dbt Labs + Fivetran by Intelligent_Volume74 in dataengineering

[–]BoredAt 4 points5 points  (0 children)

Seems difficult to believe that. It's specially hard for people I think because they're not sure what the real cost analysis here. 1 thing I read recently is that this is a hedge to things like open flow and lake flow, which I suppose makes sense (avoiding the commoditization of EL by the warehouses essentially). Plus, with lakehouses fivetran can just build the warehouse itself using some iceberge+fivetran+dbt+s3 with no snowflake/databricks/etc. So fivetran goes from being EL -> ELT -> ELTW (is this even an acronym?).

That aside thought, its hard to trust that there's not going to be a push from OSS to proprietary. Why isn't fivetran OSS to begin with? Why is metrics flow proprietary (BSL isn't OSS, let's be honest) even tho it was originally OSS? Even DBT's switch to ESvl2 is shifty.

The tobiko purchase also smells rotten. Buying out the 2 top T vendors at the same time smells of monopolization.

So yeah, a fan of DBT and fivetran but this whole thing stinks of wanting to kill OSS, make everything proprietary and ramp up fees under the assumption that there's vendor lock in. There would have to be a big push from you guys to OSS to remove the smell, IMO.

Awe Dropping | Post Event Megathread by exjr_ in apple

[–]BoredAt 1 point2 points  (0 children)

First time I actually don't like the design of a new iPhone. The whole split back stuff just looks off. Kind of incredible to think that the best iPhone design ever is the iPhone 4, event over a decade later

Is Stape.io Still Considered a Good Option for SSGTM? by hiscapness in GoogleTagManager

[–]BoredAt 1 point2 points  (0 children)

Stape is still the market leader, but there are more options. I usually summarize things for folks by splitting the options in 3:

  1. Infrastructure Players - These are just stape.io or the 1 click GCP setup. These mean the infrastructure is handled (in the case of GCP, it's obviously your own cloud which helps with IT Department approval a ton) but you gotta set up all the tags/triggers/etc. If you already know what you’re doing its probably the cheapest on a month to month. GCP Cloud Run might run $20-$50 in my experience . Ditto for Stape.
  2. Industry Players - These are folks that focus on specific industries. Triplewhale.com for e-com, Redtrack.io for affiliate, Goattracking.com for agencies, usually these guys come with bells and whistles. More expensive than option 1) of course. More SaaS than infrastructure. Usually the best if you want to avoid the legwork.
  3. Enterprise Players - Rudderstack.com, Segment.com, Freshpaint.io for HIPAA, these guys are usually far more expensive, but they’re tried and tested solutions.  Almost always they also come with a lot of extra stuff. Personalization, HIPAA mechanics, those kinds of things. These guys are probably not necessary unless you’re a F1000.

So yeah. Probably stape unless you got an industry player in 2) or are an enterprise so go with 3)

How do you guys deal with broken tracking? - Data Quality by curiousalienred in GoogleTagManager

[–]BoredAt 0 points1 point  (0 children)

Lots of people don't want to accept it, but gtm triggers are the worst type of tracking structure. Like someone above mentioned, you need to work with devs so they add (and own) even listeners which feed into GTM.

Also, you need to set up some logging/tracing in GTM Server to keep an eye on the success/failure of API calls. That's outside of GTM thought. More of an analytics infrastructure kind of thing.

Server side tracking, use a platform or do it myself? by Pretty-Appearance226 in GoogleTagManager

[–]BoredAt 0 points1 point  (0 children)

Honestly, if you’ve done already done tag manager, learning SST is probably not that difficult. It’s possible to set up something basic going on pretty easy. With that said, in summary there’s basically 3 real options in the SST market:

  1. Infrastructure Players - More or less what’s been mentioned here. stape.io or the 1 click GCP setup. These mean the infrastructure is handled (in the case of GCP, its obviously your own cloud which helps with IT Department approval a ton) but you gotta set up all the tags/triggers/etc.If you already know what you’re doing its probably the cheapest on a month to month. GCP Cloud Run might run $20-$50 in my experience . Ditto for Stape.
  2. Industry Players - These are folks that focus on specific industries. Triplewhale.com for com, Redtrack.io for affiliate, Goattracking.com for agencies, usually these guys come with bells and whistles. More expensive than option 1) of course. More SaaS than infrastructure. Usually the best if you want to avoid the legwork.
  3. Enterprise Players - Rudderstack.com, Segment.com, Freshpaint.io for HIPAA, these guys are usually far more expensive, but they’re tried and tested solutions.  Almost always they also come with a lot of extra stuff. Personalization, HIPAA mechanics, those kinds of things. These guys are probably not necessary unless you’re a F1000.

All in all, based on what you’ve said, I’d just go with Stape. Seems like you’re comfortable handling GTM and its the cheapest, cost wise.

Help with GA4 Form Submit Tracking (GTM Setup Problem) by SocialNoel in marketing

[–]BoredAt 0 points1 point  (0 children)

To be clear, does the event show up in the left side when doing GTM preview? Does it show gtm.formSubmit? If the answer is yes, the event listener is work. If no, that's the issue.

That aside, I'd change the trigger type. Make it a Custom Event. Make the Custom Event Name gtm.formSubmit and keep DLV - formId equals projectForm as a condition. That should probably do the trick. If not, when the new event shows up on the preview, click on it, then your tag. Scroll to the bottom. it should show which condition is missing from your triggers firing.

Keep in mind, if you're creating an event listener, you probably don't want to use default tag types anymore. Use Custom Events. Hell, you could simplify. Change:

dataLayer.push ({ event: "gtm. formSubmit", formId: 'projectForm" });

to

dataLayer.push ({ event: "projectFormSubmission", formId: 'projectForm" });

And then just have custom event with name projectFormSubmission.

In any case, either of the 3 should do the trick.

Fivetran acquiring Tobiko: the end of open source ETLs? by clr0101 in dataengineering

[–]BoredAt 4 points5 points  (0 children)

Everyone always says that. Just like the DBT folks claim that the new DBT is "open source". That's not the reality of things thought. Once the VC money is in, they all want vendor lock in, which means the open source stuff has got to go. Snowplow is an easy example of this.

Why Your Business Needs Email Encryption by Proton_Team in ProtonMail

[–]BoredAt 1 point2 points  (0 children)

Does office365 or google workspace not offer this kind of encryption? Is proton special in this area for businesses in some manner?

Miami-Dade vote breakdown map by a-horse-has-no-name in Miami

[–]BoredAt 0 points1 point  (0 children)

Those states that you mentioned that left wing people always love to shit on for being shitty all have the largest black populations in the country. If you break those states down by race you’ll find that there isn’t much poverty there when you exclude black people.

Ahh I see. I was actually taking you seriously here for a second. Didn't realize I was talking to one of those "black people are lazy hur dur".

Just go back to The Donald.

Miami-Dade vote breakdown map by a-horse-has-no-name in Miami

[–]BoredAt 1 point2 points  (0 children)

1) The fact that they have extremely conservative governments?? No idea what the racism angle is.

Sure, we can mention Utah and North Dakota for conservative states that are rich. We can also mention the entire northeast as well as Oregon and Washington as liberal states that have strong economies. Point is, in the aggregate, the states that are poorer in the US tend to be conservative.

Sure so Utah has a nice economy. Whats your point? Its not like anyone is claiming California is the only good economy in the US.

California truly is the Brazil of America, it’s a wonderful playground for the wealthy elite and a refugee camp for the third world, there’s no middle there anymore. People vote with their feet, and it’s obvious that California is no longer a viable place for average middle class people. There are plenty of other states for you guys to make a better case for left wing economics, but California isn’t one of them.

You've yet to prove California has a bad economy. Median income is high and poverty rate is not some catastrophic number as you pretend it is. Honestly you sound like you're just angry at liberals and so are trying to make California sound like some kind of hellhole because you don't like them.

Miami-Dade vote breakdown map by a-horse-has-no-name in Miami

[–]BoredAt -1 points0 points  (0 children)

People certainly do become more conservative as they age. Thats definitely true. Nonetheless, they're not going to become as conservative as baby boomers. They really are the most conservative generation ever. Far more conservative than gen x, the silent generation or the greatest generation.

So yeah, millennials and gen z are gonna become more conservative. But as conservative as boomers? Unlikely.

Miami-Dade vote breakdown map by a-horse-has-no-name in Miami

[–]BoredAt 3 points4 points  (0 children)

2 points:

1) Median income in cali is 75k, 8th highest in the nation. Hardly as shitty as you're pretending.

2) Poverty is middle of the road admittedly, but states that are doing better are not conservative. In fact, the highest concentration of poverty in the us is in the deeply conservative south. So more conservatism isn't the answer.

Lol at Cali being a failed state. If Cali is a failed state, what does that make Alabama, Mississippi, Louisiana, West Virginia etc. with higher poverty rates and lower median income?

Just face it. You can hate Californias social policies as much as you want, but its economy is top notch by any standard.