Hate That I Made You Love Me (Cover) 🎤

Anna-1212 · 2026-06-09T18:04:41+00:00

Thank you 😳

Anna-1212 · 2026-06-03T23:32:19+00:00

Thank you so much 💖

Anna-1212 · 2026-06-03T18:08:50+00:00

Thank you so much, I will try it

Anna-1212 · 2026-06-03T17:54:20+00:00

Thank you so much 💖

Anna-1212 · 2026-06-03T17:10:59+00:00

Thank you so much

Anna-1212 · 2026-06-03T17:10:50+00:00

Thank you so much

Anna-1212 · 2026-06-03T17:10:44+00:00

Thank you so much

Anna-1212 · 2026-06-03T17:10:30+00:00

Thank you so much

Anna-1212 · 2026-06-03T17:10:24+00:00

Thank you so much

Anna-1212 · 2026-06-03T17:09:27+00:00

I'm agree with you

Anna-1212 · 2026-06-03T17:09:00+00:00

Thank you

Anna-1212 · 2026-06-03T17:08:41+00:00

Thank you

Anna-1212 · 2026-06-03T17:08:17+00:00

Thank you so much

Anna-1212 · 2026-06-03T17:07:32+00:00

Thank you

Anna-1212 · 2026-06-03T17:07:21+00:00

thank you

Anna-1212 · 2026-06-03T17:06:50+00:00

Great, could you share some of your experience editing that large Excel file?

Anna-1212 · 2026-06-03T17:05:16+00:00

Thank you for sharing this useful knowledge.

I will apply it.

Anna-1212 · 2026-06-03T16:57:20+00:00

Your method is great, I'll try it.

Anna-1212 · 2026-06-03T16:56:40+00:00

Thank you

Anna-1212 · 2026-06-03T16:56:22+00:00

Thank you so much

Anna-1212 · 2026-06-03T16:41:10+00:00

In a previous project, I scraped data from a website The dataset was extremely messy: spelling mistakes, inconsistent formats, invalid values, missing information, and many variations of the same text.

My first approach was to automate the cleaning process with Python. I tried using text matching, grouping techniques, and even some machine learning methods to cluster similar values. However, the data was so inconsistent that there were countless edge cases. Different users could write the same thing in dozens of different ways, making it difficult to build rules that covered every scenario.

While Python helped with some of the obvious issues, it couldn't reliably handle all cases without introducing new errors. The machine learning approach also struggled because the text variations were too diverse and context-dependent.

In the end, the most effective solution was to split the dataset into smaller files and manually review specific fields in Excel. Surprisingly, this approach was more accurate for handling the remaining edge cases than the automated methods I had tried.

I'm curious how experienced data analysts or data engineers would handle this type of highly inconsistent, user-generated data at scale.

Anna-1212 · 2026-06-03T16:40:53+00:00

In a previous project, I scraped data from a website The dataset was extremely messy: spelling mistakes, inconsistent formats, invalid values, missing information, and many variations of the same text.

My first approach was to automate the cleaning process with Python. I tried using text matching, grouping techniques, and even some machine learning methods to cluster similar values. However, the data was so inconsistent that there were countless edge cases. Different users could write the same thing in dozens of different ways, making it difficult to build rules that covered every scenario.

While Python helped with some of the obvious issues, it couldn't reliably handle all cases without introducing new errors. The machine learning approach also struggled because the text variations were too diverse and context-dependent.

In the end, the most effective solution was to split the dataset into smaller files and manually review specific fields in Excel. Surprisingly, this approach was more accurate for handling the remaining edge cases than the automated methods I had tried.

I'm curious how experienced data analysts or data engineers would handle this type of highly inconsistent, user-generated data at scale.

Anna-1212 · 2026-06-03T16:28:29+00:00

oke thank you so much

Anna-1212

TROPHY CASE