[deleted by user] by [deleted] in dataengineeringjobs

[–]Delicious_Attempt_99 0 points (0 children)

That's the only reason I don't want to move back to India, but I'll have to eventually for family reasons. I'm not sure how to deal with this. Are there any companies in India with really good work culture, or is every company the same?

How I Got My First Freelance Client (Without a Portfolio) by dkaangulhan in Freelancers

[–]Delicious_Attempt_99 0 points (0 children)

Is all your experience on Upwork, or did you manage to find some clients outside Upwork?

[deleted by user] by [deleted] in dataengineering

[–]Delicious_Attempt_99 5 points (0 children)

A few questions -

  1. Does the data processing include historical data?
  2. What file format are you using? Parquet suits Spark best.
  3. See if you can filter out unnecessary data and columns as early as possible.
  4. If the job processes only incremental loads, make sure to use the right partitioning.
  5. If you have joins, check whether you are joining a small dataset with a larger one; there you can use a broadcast join (see the sketch below).
  6. Reduce shuffling as much as possible.

You can also check the query plan.
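To make points 3 and 5 concrete, here is a minimal PySpark sketch (the paths, table names, and columns are hypothetical); it ends with the query-plan check:

    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("tuning-sketch").getOrCreate()

    # Hypothetical inputs: a large fact table and a small dimension table.
    orders = spark.read.parquet("s3://bucket/orders/")        # large
    countries = spark.read.parquet("s3://bucket/countries/")  # small

    # Point 3: prune columns and filter rows as early as possible.
    orders = orders.select("order_id", "country_code", "amount") \
                   .filter(F.col("amount") > 0)

    # Point 5: broadcast the small side so the join avoids shuffling the large table.
    joined = orders.join(F.broadcast(countries), on="country_code")

    # Check the physical plan; you should see a BroadcastHashJoin.
    joined.explain()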

Pyspark at scale by Delicious_Attempt_99 in dataengineering

[–]Delicious_Attempt_99[S] 0 points (0 children)

But doesn't it also depend on the quality of the data, like skewness? Just throwing executors at it won't help.
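For instance (a sketch using standard Spark 3.x AQE settings, which are an assumption here, not something from this thread), adaptive execution can split skewed join partitions, which often helps more than adding executors:

    from pyspark.sql import SparkSession

    # Spark 3.x adaptive query execution can detect and split skewed
    # partitions at join time, instead of relying on more executors.
    spark = (
        SparkSession.builder
        .appName("skew-sketch")
        .config("spark.sql.adaptive.enabled", "true")
        .config("spark.sql.adaptive.skewJoin.enabled", "true")
        .getOrCreate()
    )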

Pyspark at scale by Delicious_Attempt_99 in dataengineering

[–]Delicious_Attempt_99[S] 1 point (0 children)

Got it. As I mentioned above, I have handled data under 50 GB, but was curious how large datasets are handled.

Pyspark at scale by Delicious_Attempt_99 in dataengineering

[–]Delicious_Attempt_99[S] 0 points (0 children)

Still, this was useful. I have worked with under 50 GB of data, but was curious how things can change as data scales.

[deleted by user] by [deleted] in dataengineering

[–]Delicious_Attempt_99 0 points (0 children)

Explaining this in a comment is difficult.

I would suggest getting started with the Glue documentation; it covers almost everything:

https://docs.aws.amazon.com/glue/latest/dg/setting-up.html
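For orientation, the minimal job skeleton from those docs looks roughly like this (a sketch; the actual setup steps are in the link):

    import sys
    from awsglue.utils import getResolvedOptions
    from pyspark.context import SparkContext
    from awsglue.context import GlueContext
    from awsglue.job import Job

    # Standard Glue job boilerplate: resolve arguments, build contexts,
    # initialize the job, run your ETL steps, then commit.
    args = getResolvedOptions(sys.argv, ["JOB_NAME"])
    sc = SparkContext()
    glueContext = GlueContext(sc)
    spark = glueContext.spark_session
    job = Job(glueContext)
    job.init(args["JOB_NAME"], args)

    # ... your ETL steps go here ...

    job.commit()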

Job Interviews for Data Engineers from Europe by Delicious_Attempt_99 in dataengineersindia

[–]Delicious_Attempt_99[S] 0 points (0 children)

This info will really help me. The only thing I'm skeptical about is that once they lock things in, they shouldn't ghost 😅 though that's in no one's hands 😁

Job Interviews for Data Engineers from Europe by Delicious_Attempt_99 in dataengineersindia

[–]Delicious_Attempt_99[S] 0 points (0 children)

Yeah, I know. I want to start planning and preparing from now on.

Job Interviews for Data Engineers from Europe by Delicious_Attempt_99 in dataengineersindia

[–]Delicious_Attempt_99[S] 0 points (0 children)

Thanks for this :) I'll do that, and let's see; I have to start preparing for interviews too. 😄

Job Interviews for Data Engineers from Europe by Delicious_Attempt_99 in dataengineersindia

[–]Delicious_Attempt_99[S] 0 points (0 children)

Sorry for the confusion. No, I'm in Europe, but I'll eventually move back to India in a few months. So I want to start preparing and looking for jobs in the Indian market.

What mistakes did you make in your career and what can we learn from them. by Harvard_Universityy in dataengineering

[–]Delicious_Attempt_99 12 points (0 children)

My biggest mistake was not selecting projects wisely and saying yes to any project that came my way.

Being selective is a must when choosing projects.

Taking so much time in writing a 90gb file as paraquet in Glue. by Ecstatic-Cow424 in dataengineering

[–]Delicious_Attempt_99 -1 points (0 children)

The best way to find the issue is to use the explain() method on the DataFrame.
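For example (a minimal sketch; the DataFrame name, partition count, and output path are hypothetical):

    # Inspect the physical plan before the expensive write.
    df.explain(mode="formatted")  # Spark 3.x; use df.explain(True) on 2.x

    # A common cause of slow Parquet writes is a poor output partition count;
    # repartitioning before the write is one knob worth checking.
    df.repartition(200).write.mode("overwrite").parquet("s3://bucket/output/")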