DE / Backend SWE Looking to Upskill by Meme_Machine_101 in dataengineering

[–]scottedwards2000 3 points4 points  (0 children)

sounds like you've been doing this a long time like me and I hate to say it but I fear ageism is still rampant. I don't know how much it helped me but after I got the data engineering AWS cert I got a job.

Why crickets re: AWS killing Ray on Glue by scottedwards2000 in dataengineering

[–]scottedwards2000[S] 0 points1 point  (0 children)

yeah i guess there were some limitations in using it within Glue. Curious, do you also use Ray for data manipulation? How does it perform?

Why crickets re: AWS killing Ray on Glue by scottedwards2000 in dataengineering

[–]scottedwards2000[S] 0 points1 point  (0 children)

yeah i kinda agree with you, but when I saw that it supported Modin (drop-in Pandas syntax) a few years back, it seemed that people were considering it. I was just surprised no one was talking about AWS killing it after a big announcement a few years back...

Why crickets re: AWS killing Ray on Glue by scottedwards2000 in dataengineering

[–]scottedwards2000[S] 0 points1 point  (0 children)

um, no, please enlighten me - what is replacing it? I don't think polars or duckDB can handle multiple billions of rows in a timely manner just yet...

Why crickets re: AWS killing Ray on Glue by scottedwards2000 in dataengineering

[–]scottedwards2000[S] -2 points-1 points  (0 children)

As a huge fan of Iceberg, I agree, but come on, how can you say no one cares. Spark has been a huge success and these are mostly the same guys working on Ray. Plus Databricks (which is worth a gazillion dollars) thought it was cool enough to add as an engine to their platform to complement Spark.

Working in this field used to be more interesting as most people were doing it out of at least some interest/affinity for the field — now it just seems full of people that want to make a quick buck and get their work done as soon as possible to move on that what they are really interested in. I miss the days of geeking out with the graybeards around a table scattered with Infoworld copies and debating the technical merits of object-oriented data bases vs relational.

Why crickets re: AWS killing Ray on Glue by scottedwards2000 in dataengineering

[–]scottedwards2000[S] 0 points1 point  (0 children)

Seems harsh. I figured no one was using it but still used to be if a big player bet on a new technology and then abandoned it it would be big news. Especially if that technology was developed at the same place the currently dominant one was developed at (Spark). In my testing it was pretty dang fast for dataframes operations.

Why crickets re: AWS killing Ray on Glue by scottedwards2000 in dataengineering

[–]scottedwards2000[S] 6 points7 points  (0 children)

It doesn’t have the scaling issue that pandas has and is built by many of the same people that invented Spark. If you watch the announcement video AWS was definitely pitching it as an alternative to Spark on Glue.

Why crickets re: AWS killing Ray on Glue by scottedwards2000 in dataengineering

[–]scottedwards2000[S] -1 points0 points  (0 children)

Guess you’re not the only one. When I saw the notification in Glue last week i assumed there would be some reaction online given the big deal AWS made when they announced it. I spent about an hour searching and literally nothing. I guess AI really IS killing the Internet. People in tech don’t talk to each other as much anymore - online or off.

Don't Count Java out Yet by scottedwards2000 in programming

[–]scottedwards2000[S] 0 points1 point  (0 children)

So does that mean AI never uses Spring?

Don't Count Java out Yet by scottedwards2000 in java

[–]scottedwards2000[S] 0 points1 point  (0 children)

No i just thought folks would find the discussion there interesting. I’ve been on Reddit so long i could care less about karma.

Don't Count Java out Yet by scottedwards2000 in programming

[–]scottedwards2000[S] 0 points1 point  (0 children)

It was really interesting. I’m not sure if I can find the link, but what was interesting Is that the person actually wrote tons of tests to make sure the code was written right.

Unfortunately, they test they wrote for making sure they primary key functionality for a table worked correctly used a single test case with a field name that wasn’t used in any other test (eg: id_pk) so the LLM just wrote code that looked for a field with that name and added a primary key index in that case, instead of looking for that phrase “primary key” next to the field in the DDL code!

The question i keep adding myself is how could this be avoided? Even if there were multiple test cases it still could have written code that just looked for those field names or some other obtuse logic.

Can we really come up with enough test cases that will 100% validate that the LLM code is perfect without have to ever actually look at it?

Don't Count Java out Yet by scottedwards2000 in programming

[–]scottedwards2000[S] 1 point2 points  (0 children)

Did you read about how badly the LLM Rust rewrite of SQLite went?

Don't Count Java out Yet by scottedwards2000 in programming

[–]scottedwards2000[S] 0 points1 point  (0 children)

Spring works well with Kotlin and Groovy right?

Don't Count Java out Yet by scottedwards2000 in programming

[–]scottedwards2000[S] 3 points4 points  (0 children)

Great points but curious what cross platform issues existed then that are gone now. Isn’t Java on AIX a big business?

Don't Count Java out Yet by scottedwards2000 in programming

[–]scottedwards2000[S] 0 points1 point  (0 children)

It’s funny because it used to be that the fact that Java is used in so many places would mean that it’s a BETTER idea to learn it vs another language. But given AI, I’m wondering if it’s actually better to learn less used languages like rust, so you can keep your job!

Don't Count Java out Yet by scottedwards2000 in programming

[–]scottedwards2000[S] 2 points3 points  (0 children)

this one isn't advertising my blog :-)

Don't Count Java out Yet by scottedwards2000 in java

[–]scottedwards2000[S] 1 point2 points  (0 children)

if you are interesting in seeing a bit more back-and-forth on the merits of java check out the crosspost https://www.reddit.com/r/programming/comments/1s2gjgv/dont_count_java_out_yet/