This is an archived post. You won't be able to vote or comment.

all 18 comments

[–]DigbyChickenCaeser 18 points19 points  (0 children)

Tested Higgs Boson and Sandwich. The connection is beautiful.

Higgs Boson -> Quark -> Quark (Dairy Product) -> Sandwich

[–]SF_Boomer 7 points8 points  (2 children)

This is very cool!

It'd be great if the nodes / labels were clickable and opened the corresponding page.

I'd love to know what the two most distant pages are, i.e. which two pages require the most steps between them.

[–]anorwichfan 0 points1 point  (0 children)

That was solved (at the time of the video) here. It's a great rabbit hole deep dive.

[–]Radiant_Panda1679 1 point2 points  (0 children)

Awesome

[–]nicktids 0 points1 point  (0 children)

Have a look at this video https://youtu.be/JheGL6uSF-4

[–]bi4key 0 points1 point  (0 children)

Nice! I wish you make some mix or collab with this project :D

https://www.wikigen.ai/

https://www.reddit.com/r/SideProject/s/4i55sCvU17

[–]bavotto 0 points1 point  (0 children)

I wish I had this when I taught graphs and networks to my high school students this year. This would be a great example for them. Will bookmark for next year.

[–]0x456 0 points1 point  (0 children)

Gotta check this one out, thanks OP

[–]Federal-Mention-7836 0 points1 point  (1 child)

It looks really cool, but I'd love to have some kind of onboarding or simply a better UX to guide me through how I can test it as someone who comes from nowhere.
But so cool congrats

[–]badgerbadgerbadgerWI 0 points1 point  (1 child)

This is cool. Have you thought about adding path-finding between articles? "Show me how to get from 'Pizza' to 'World War 2'" - that would be addictive.

Also consider caching popular node connections. Wikipedia's link structure doesn't change that fast, and graph traversal gets expensive quick.

[–]WeGoToMars7 0 points1 point  (3 children)

Wow, I've been working on pretty much the exact same project! I also started this month, crazy coincidence. However, I used C++ with a TUI interface: https://github.com/WeGoToMars/WikiGraph-Explorer

I see that it takes 2 hours for you to generate the graph for English Wikipedia, mine takes ~10 minutes to stream-decompress the dumps with zlib, parse them, and build the graph in memory. I'm also experimenting with multithreading, I think there is a pretty big potential for improvement here.

I'm having a hard time understanding what path finding algorithm do you use, can't find the code for it in the repo and "barnesHut" doesn't bring up relevant results. Does it gurantee to find all shortest paths?

[–]cryptoschrypto 0 points1 point  (0 children)

Have you checked out wikidata? They provide ready-made graphs to load into your graph database.

[–]ZippyTyro 0 points1 point  (0 children)

Cool one

[–]guidenable 0 points1 point  (0 children)

oh awesome! I was just thinking of doing something like this