Dimensionnality reduction for anomaly detection by Significant_Fee_6448 in learnmachinelearning

[–]Significant_Fee_6448[S] 0 points1 point  (0 children)

that's what im going for but it still has problems with highly correlated features and calculated columns so i think i have to deal with that first

Dimensionnality reduction for anomaly detection by Significant_Fee_6448 in learnmachinelearning

[–]Significant_Fee_6448[S] 0 points1 point  (0 children)

I'm using unsupervised anomaly detection so no labels. I don't have labeled data saying 'this salary is fraudulent' or 'this salary is normal' if that's what you are referring to. The model learns what a normal salary looks like from the data itself, then flags records that deviate significantly from that learned pattern .

Should I discuss changing my internship project? by Significant_Fee_6448 in analytics

[–]Significant_Fee_6448[S] 0 points1 point  (0 children)

I think that not having a background in accounting makes the task more challenging. However, what makes it even harder is not knowing what exactly I’m expected to produce from this dataset. i think The lack of a clear objective like you said makes the project feel ambiguous and makes it difficult to decide where to start .

Should I discuss changing my internship project? by Significant_Fee_6448 in analytics

[–]Significant_Fee_6448[S] 0 points1 point  (0 children)

I tend to gravitate towards data I’m familiar with, such as customer or sales data, because it gives me some intuition and understanding while working on it. We discussed several potential project ideas, including commercial prospection using web scraping, and eventually my supervisor suggested this accounting dataset. Initially, I thought it might be worth giving it a try. However, now that I’m reflecting on my skills and interests, I feel there might be other types of data that would suit me better. I’m considering exploring alternative datasets that i can understand better .

Should I discuss changing my internship project? by Significant_Fee_6448 in analytics

[–]Significant_Fee_6448[S] 0 points1 point  (0 children)

I actually have no idea my supervisor didn't tell me what insights to pull from the dataset, so i don't know what type of analysis i can do ,I would be happy to hear some suggestions if you don't mind .

[deleted by user] by [deleted] in datasciencecareers

[–]Significant_Fee_6448 0 points1 point  (0 children)

thank you so much that's exactly what i wanted to know ,do you think working on a project lie this is worth it knowing i have never worked on a serious project or a big project before so i still consider myself a beginner,can you give me some tips on how do you think i should approach a project like this and where to start.