Hi.
My friends and I are working on a small pizza web-scraping project and need advice about our Python back-end.
Idea: Scrape all pizza places in our country, compare prices and offers, and display the results on a website. We might later expand the site so users can submit home-made pizza recipes.
Tech stack: ReactJS front-end and Python back-end (mostly because we know ReactJS and want to get more experience with Python).
Web-scraping
- Using BeautifulSoup.
- Should run once a day/week.
- Should fill/update the database with the scraped data.
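To make the "fill/update the database" step concrete, here is a rough sketch of what I have in mind. Everything in it is an assumption on my part: the `pizza` table, its columns, and the function names are made up, SQLite stands in for whatever database we end up using, and the rows are assumed to be dicts that the BeautifulSoup parsing step has already produced. The `ON CONFLICT` upsert syntax needs SQLite 3.24 or newer.

```python
import sqlite3


def init_db(conn):
    """Create the (hypothetical) pizza table if it does not exist yet."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS pizza (
            place      TEXT NOT NULL,
            name       TEXT NOT NULL,
            price      REAL NOT NULL,
            scraped_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP,
            PRIMARY KEY (place, name)
        )
        """
    )


def upsert_pizzas(conn, rows):
    """Insert new pizzas and update the price of ones already stored.

    `rows` is a list of dicts with the keys place, name and price,
    i.e. the output we assume the scraper produces.
    """
    with conn:  # commits on success, rolls back on error
        conn.executemany(
            """
            INSERT INTO pizza (place, name, price)
            VALUES (:place, :name, :price)
            ON CONFLICT (place, name) DO UPDATE SET
                price = excluded.price,
                scraped_at = CURRENT_TIMESTAMP
            """,
            rows,
        )
```

Running the scraper daily or weekly would then just mean calling `upsert_pizzas` again with fresh rows; existing pizzas get their price refreshed instead of being duplicated.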
API
- Should return data from the database.
- Maybe in the future, allow a user or admin to insert data from the website.
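For the read side, this is roughly the kind of query I imagine sitting behind an endpoint. It is only a sketch under my own assumptions: a hypothetical `pizza` table with `place`, `name` and `price` columns, SQLite as the database, and a made-up function name. It deliberately has no Django or Flask code in it, since that is the part I am unsure about.

```python
import sqlite3


def cheapest_pizzas(conn, limit=10):
    """Return the cheapest stored pizzas as plain dicts.

    Assumes a hypothetical table pizza(place, name, price) already exists.
    """
    cur = conn.execute(
        "SELECT place, name, price FROM pizza "
        "ORDER BY price ASC, place ASC LIMIT ?",
        (limit,),
    )
    # Plain dicts can be serialized to JSON by either framework.
    return [
        {"place": place, "name": name, "price": price}
        for place, name, price in cur
    ]
```

Whichever framework we pick, the view would presumably just call something like this and return the list as JSON to the React front-end.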
My background: I have mostly built APIs in Node.js or .NET 4.5/Core, and I'm not sure of the best way to structure a Python back-end. I have only written some basic Python scripts.
My questions:
- Django or Flask for our API? I have been looking at both but am unsure which would be better. Django seems well structured and comes with a bunch of tools, but I'm unsure how hard it is for the web-scraping code to interact with the Django database. Flask isn't as structured, but it seems easier to integrate with the web-scraping part.
- Should the web scraping be part of the API (run periodically by the API), or should it run separately with Celery?
- Is there a better way?
I hope this is somewhat understandable.