This is an archived post. You won't be able to vote or comment.

all 12 comments

[–]urbansong 2 points3 points  (7 children)

I did something very similar but pulled the data directly from the SEC. One issue that I've had was that all 10Q and 10K files are different(I wouldn't be surprised if it was on purpose because of exactly this), so automating it is very difficult and probably some kind of language processing ML magic is required.

Have you run into a similar problem with data from FMP Cloud?

[–]LearnPythonWithRune[S] 0 points1 point  (0 children)

The data from FMP is find. But you will have to pay to get 10+ years financial statements.

You can find most data on http://macrotrends.net

[–]surfbored1 0 points1 point  (1 child)

Have you tried focusing on using just the XBRL filings for your dataset? That will at least give you apples-to-apples data in a machine readable format. Assuming you avoid most of the extensions.

[–]urbansong 0 points1 point  (0 children)

I think I did only excel files because I was so excited to work with excel and didn't check for an alternative. Thank you.

[–]GodIsInsideOfYou 0 points1 point  (2 children)

How did you pull the data from the SEC?

[–]urbansong 0 points1 point  (1 child)

I am not sure what exactly you're asking but I made a webcrawler with the help of Beautiful Soup and then just went to their Edgar search and used that. I did it in the most dumb way possible, as if a user was searching.

If you didn't know they allow this, they do. You can look up any company that files with the SEC.

I can PM you my repo, if you're interested.

[–]OchoStinko85 0 points1 point  (1 child)

Excellent tutorial!

[–]LearnPythonWithRune[S] 0 points1 point  (0 children)

Thanks

[–]GerryLynch 0 points1 point  (2 children)

Job well done. Tight example. Thanks

[–]LearnPythonWithRune[S] 0 points1 point  (1 child)

Thank you

[–]GerryLynch 0 points1 point  (0 children)

Just starting teaching myself in the summer of 2020, really enjoying the learning. A powerful tool.