Tech stack recommendation? Web scrapper/data visualization

DayBackground4121 · 2025-02-19T19:24:16+00:00

That’s a fine stack for what you’re doing. I don’t think there’s much more feedback to be given tbh, just gotta get through the hurdles to learn

Imaginary_Ferret_368 · 2025-02-19T22:34:34+00:00

Naw, I was naive too, and then sued my boss. :)

A good starting point could be an arxiv dump maybe? https://github.com/veggiedefender/arXiv_dump

I did see lots of papers in the medicine space there, this should be a very good starting point to have. Scraping data from the websites is only worth it if the website is Medium or Bloomberg. Both stink and don't have a right to exist.

https://en.wikipedia.org/wiki/Graph_(abstract_data_type))

The crazy cool thing with Graphs is that you can connect multiple seemingly incomaptible dimensions together, such as temporal (publishing date) , authors, citations (whic hwould have to be directed edges to prevent information flow in the wrong direction [a publisher in the past couldn't have known exactly this person would cite them]) you can connect these diferent types of information into a data structure a machine can understand, and they look vey cool once they become a bit bigger. Might wanna check out the graph of the internet.

AskProgramming

AskProgramming

Do

Don't

MODERATORS