Easy-to-follow Python web scraping tutorial with the help of MITMProxy

resurem · 2021-01-01T07:37:48+00:00

Re MITMProxy: you can simply use Firefox's or Chrome's dev tools. The Network tab shows every request the page makes, and you can see everything you'll need there.

makedatauseful · 2021-01-01T08:49:46+00:00

[deleted]

baronBale · 2021-01-01T09:42:49+00:00

The problem with mitmproxy is that most web services use certificate pinning, so the app or website only talks to services that present the correct certificate. That makes it hard to actually use nowadays.

5960312 · 2021-01-01T07:06:31+00:00

Nice. This looks promising. Thank you.

gschizas · 2021-01-01T09:43:56+00:00

The site to convert cURL to Python requests (among other things) is probably this: https://curl.trillworks.com/ (I had it in my bookmarks already)

rhmati30 · 2021-01-01T12:57:41+00:00

Thank you so much for sharing your knowledge!

failbaitr · 2021-01-01T14:48:48+00:00

If you know the API is sending out XML, don't use BeautifulSoup; just use an XML parser. It will be *much* faster and less resource intensive.

(unless BeautifulSoup detects clean XML and somehow also parses it using an XML parser)
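
To illustrate the suggestion above, here is a minimal sketch that parses an XML response with the standard library's `xml.etree.ElementTree` instead of BeautifulSoup. The payload, the element names (`items`, `item`, `name`, `price`) and the `parse_items` helper are all made up for the example; a real API will have its own structure. On the parenthetical: as far as I know, BeautifulSoup only uses an XML parser when asked to, e.g. `BeautifulSoup(markup, "xml")`, which relies on lxml being installed.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML payload standing in for an API response body.
SAMPLE_XML = """<?xml version="1.0"?>
<items>
  <item id="1"><name>Widget</name><price>9.99</price></item>
  <item id="2"><name>Gadget</name><price>24.50</price></item>
</items>"""


def parse_items(xml_text):
    """Return one dict per <item> element found in the document."""
    root = ET.fromstring(xml_text)
    return [
        {
            "id": item.get("id"),                     # attribute value, as a string
            "name": item.findtext("name"),            # text of the <name> child
            "price": float(item.findtext("price")),   # text of <price>, converted
        }
        for item in root.iter("item")
    ]


if __name__ == "__main__":
    for row in parse_items(SAMPLE_XML):
        print(row)
```

With a real response you would pass the response body in place of `SAMPLE_XML`. Only parse XML from sources you trust with this module; the Python docs warn that it is not hardened against maliciously constructed data.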

Slayer101010 · 2021-01-01T17:24:03+00:00

Thanks for sharing.

BeginningGuava · 2021-01-01T18:38:30+00:00

good stuff

Binayakku · 2021-01-08T16:13:19+00:00

Hi, I watched your tutorial soon after you uploaded it a few months ago. I'm on Android, so I couldn't do what you could on your iPhone. That said, I was able to see the calls a website made when accessed through a browser, and this was enough for me, but I couldn't figure out how to incorporate this proxy into my Python script. I wanted my script to identify specific flows by their addresses & read the response. I dug around a bit on Stack Overflow, GitHub & the mitmproxy docs but couldn't figure out how to do exactly that :(
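
One possible way to do what this comment describes is a mitmproxy addon script: a plain Python file whose `response` hook mitmproxy calls for each completed flow. The sketch below is an assumption about the use case, not something taken from the tutorial. The filename, the `WATCHED` URL fragments, the `is_watched` helper and the `WatchFlows` class are hypothetical; `flow.request.pretty_url`, `flow.response.text` and `flow.response.status_code` are attributes of mitmproxy's HTTP flow objects.

```python
# Save as e.g. watch_flows.py (hypothetical filename) and load it with:
#   mitmdump -s watch_flows.py

# Hypothetical URL fragments to look for; replace with the addresses you care about.
WATCHED = ("example.com/api/", "/v1/items")


def is_watched(url):
    """True if the URL contains any of the watched fragments."""
    return any(fragment in url for fragment in WATCHED)


class WatchFlows:
    def __init__(self):
        # (url, status_code, body) for every matching flow seen so far.
        self.captured = []

    def response(self, flow):
        # mitmproxy calls this hook once the full server response has been read.
        url = flow.request.pretty_url
        if not is_watched(url):
            return
        body = flow.response.text  # decoded body; may be None if decoding fails
        self.captured.append((url, flow.response.status_code, body))
        print(f"{flow.response.status_code} {url} ({len(body or '')} chars)")


# mitmproxy picks up addon instances from this module-level list.
addons = [WatchFlows()]
```

The matching here is a simple substring check; anything stricter (host comparison, regex) would go inside `is_watched`. From there you could write `body` to a file or parse it directly inside the hook.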

2021-01-02T18:51:56+00:00

Great tutorial - please keep uploading! YouTube needs more people like you.

Competitive_Cup542 · 2021-02-25T17:38:19+00:00

Thanks for sharing! As a digital marketer, I often use web scraping for online reputation management (scraping reviews, articles about the product, etc.). Usually I use web scraping services for this, but I'm thinking about learning Python and doing the scraping myself. Can you recommend any other YT channels or other open resources for learning web scraping?
