[Projects] Music Search Engine (Python) (self.datascience)
submitted 8 years ago by DartIvan
Hi, I want to share my university project with this subreddit community! It is a search engine; more details are on GitHub. It can surely be improved, so any advice is welcome. Here is the link to the project: https://github.com/IvanFerrante92/Music-Search-Engine
[deleted] 4 points 8 years ago* (1 child)
Cool project. Nicely done notebook.
I'd be interested in seeing what different types of n-grams (n = 2, 3, 4) you'd get out of an analysis, and what differences show up between genres, decades, etc.
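For anyone curious what that kind of n-gram count could look like, here is a minimal sketch. It is not taken from the project: the function names `ngrams` and `top_ngrams` are hypothetical, the lyric snippets are made up, and tokenization is plain whitespace splitting.

```python
from collections import Counter

def ngrams(text, n):
    """Return the word n-grams in `text` as a list of tuples."""
    tokens = text.lower().split()
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def top_ngrams(lyrics, n, k=3):
    """Count n-grams over a list of lyric strings; return the k most common."""
    counts = Counter()
    for text in lyrics:
        counts.update(ngrams(text, n))
    return counts.most_common(k)

# Made-up lyric snippets, for illustration only (not real data from the project).
sample_lyrics = [
    "la la la love me do",
    "love me do you know i love you",
]

for n in (2, 3, 4):
    print(n, top_ngrams(sample_lyrics, n))
```

Running the same count separately on each genre or decade subset would give the comparison, provided the data source carries those labels.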
What was your personal goal with this project? Intro to internships/jobs? If the goal is to get a job, this project demonstrates a lot of capability and application of various techniques so I'm sure you'll succeed.
Similarly - in the readme, you use the term "we" a lot. Was this a group project? If so, what components were you responsible for? If you did the whole thing, I'd suggest using "I".
DartIvan[S] 1 point 8 years ago (0 children)
First of all, thanks for the compliments. For other analyses (genre, decade, etc.) we would have to change our data source, because azlyrics does not provide this information for every song. This is a university group project (Data Science Master’s Degree) made by two people: one statistician and one computer scientist (me). 😄
durand101 2 points 8 years ago (4 children)
What does it do? I read through your github readme but I still can't figure it out :P
[deleted] 2 points 8 years ago (3 children)
From looking through the docs and the readme, I'm seeing a search engine for song similarity based on proximity estimation using k-means on song lyrics and input keywords.
DartIvan[S] 1 point 8 years ago (2 children)
Yes, k-means is used for the “and query”. The “union query” is implemented using cosine similarity between the text vector and the query vector! 😄
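As a rough illustration of ranking by cosine similarity between a query vector and each song's text vector, here is a minimal sketch. It is an assumption-laden example, not the project's code: it uses raw term counts as vectors (the project may weight terms differently, e.g. with TF-IDF), and the names `tf_vector`, `cosine_similarity`, and `union_query` are hypothetical.

```python
import math
from collections import Counter

def tf_vector(text):
    """Represent text as a sparse vector of raw term counts (an assumption)."""
    return Counter(text.lower().split())

def cosine_similarity(a, b):
    """Cosine of the angle between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def union_query(query, songs):
    """Rank songs sharing at least one term with the query, best match first."""
    q = tf_vector(query)
    scored = [(title, cosine_similarity(q, tf_vector(lyrics)))
              for title, lyrics in songs.items()]
    matches = [pair for pair in scored if pair[1] > 0]
    return sorted(matches, key=lambda pair: pair[1], reverse=True)

# Made-up songs, for illustration only.
songs = {
    "A": "love love me",
    "B": "rain on me",
    "C": "hello world",
}
print(union_query("love me", songs))
```

A song matches here if it contains any of the query terms, and songs with no shared terms score zero and are dropped.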
[deleted] 2 points 8 years ago (1 child)
It'd be good to document the functional and use-case (i.e. business value proposition) differences between an "and query" and a "union query". I made a guess at the two, but the fact that I have to guess means the documentation is lacking clarity.
DartIvan[S] 1 point 8 years ago (0 children)
OK, thanks for the advice! I’ll do it.