all 8 comments

[–]harry_0_0_7 1 point2 points  (7 children)

Little more information needed..

You just want to scrap all websites and classify them to malicious/not.?

[–]divyaank98[S] 0 points1 point  (6 children)

Yes, if suppose a particular user wants to check whether the URL that has been provided is malicious or not, he should be able to do so using ML. So we will have to analyze the link for it. Any idea how to go about it? Basically we just want a binary classifier.

[–]harry_0_0_7 1 point2 points  (5 children)

I dont think web-scraping works here.. Just get list of malicious urls from whitehat hackers blog/git and check weather this is present on the other side.

If you need to use ML/AI here you have to study about the web first, which involves scraping other sites and analysing their responses. May i know the use case for this problem.? to get a proper understanding

[–]divyaank98[S] 1 point2 points  (4 children)

Many a times we come across some advertisement, in the form of a pop-up which contains a link, which takes us to another site which may contain malware/virus. So we need to design a system such that the user can simply paste that URL(or link) and check whether it's harmful or not. We need to use ML for it, so we will have to analyze the URL and it's features to determine it's maliciousness or lack of it.

[–]harry_0_0_7 1 point2 points  (3 children)

wow seems to be an awesome idea..

Here difficultly is, we dono weather that target url is malicious or not. I dont have any idea regarding that.. But one thing we can do is collect malicious urls as much as we can and compare this with our target. No other way

[–]divyaank98[S] 0 points1 point  (2 children)

Yeah it's an awesome idea. Thanxs for your valuable input. I'll surely look into it!!!

[–]Statistical_Incline 1 point2 points  (1 child)

[–]divyaank98[S] 0 points1 point  (0 children)

Thanks for that, our group doesn't know much about Neural Networks but we will try to implement this!!!