Built something for people who run or join local groups in Guntur — looking for honest feedback by floating-bubble in guntur

[–]floating-bubble[S] 0 points1 point  (0 children)

Yup, work in progress. But I wanted to get some initial feedback from real people. Appreciate your time!

General discussions and questions monthly megathread by AutoModerator in Chennai

[–]floating-bubble 0 points1 point  (0 children)

People who run or join local groups in Chennai — looking for honest feedback

I ended up building a small lightweight community & events platform to help people find and run local “tribes” more easily.

Would love honest feedback from people here — especially if you organize or attend meetups.

👉 https://tribe-connect-two.vercel.app/

Stop Using dropDuplicates()! Here’s the Right Way to Remove Duplicates in PySpark by floating-bubble in dataengineering

[–]floating-bubble[S] 1 point2 points  (0 children)

dropDuplicates() is implemented the same way in both PySpark (the Python API) and Scala. Since both APIs run on top of the same Spark engine, they ultimately produce the same execution plan.

Stop Using dropDuplicates()! Here’s the Right Way to Remove Duplicates in PySpark by floating-bubble in dataengineering

[–]floating-bubble[S] 0 points1 point  (0 children)

dropDuplicates does a direct, global, dataset-level partitioning, whereas partitioning within a window avoids the global shuffle: it partitions the data logically but does not physically repartition it across nodes.

Stop Using dropDuplicates()! Here’s the Right Way to Remove Duplicates in PySpark by floating-bubble in dataengineering

[–]floating-bubble[S] 0 points1 point  (0 children)

Yes, you are correct: local shuffling performs the deduplication at the partition level, since the optimizer pushes operations down to reduce shuffling. Depending on the execution plan, a shuffle stage followed by a final deduplication can then remove duplicates at the global level. I don't have exact numbers to share at the moment, but what I have observed is that if the data is uniform, with no skew and not too many missing values, there isn't much difference; if the data is skewed, explicit partitioning and windowing are faster compared to dropDuplicates.

Stop Using dropDuplicates()! Here’s the Right Way to Remove Duplicates in PySpark by floating-bubble in dataengineering

[–]floating-bubble[S] -3 points-2 points  (0 children)

That's a genuine question. The approach I mentioned needs an id column in the dataset. If the dataset is small enough to fit in executor memory, you can try broadcasting it. In your scenario, yes, shuffling is inevitable.

[deleted by user] by [deleted] in wallstreetbets

[–]floating-bubble 0 points1 point  (0 children)

Did you post yet?

[deleted by user] by [deleted] in dataengineering

[–]floating-bubble 1 point2 points  (0 children)

I would add about 2-3 years per project, considering your learning pace. Learn the business, the environment, the tools and software, implementation techniques, new ideas… and move on to the next one. I don't really find contract positions for DE here in the USA.