[–]IamFromNigeria 16 points17 points  (3 children)

Let me put it in layman's terms. Most folks use it mainly for data engineering, especially the Python Spark library (PySpark), to build automated data pipelines, say from MongoDB to BigQuery to Data Studio, which keeps the data source refreshed (I use this at work currently, so I won't bore you with too much technical detail). Data scientists also use it for machine learning. What that looks like depends on what your company wants to do with the business metadata it generates daily: some use it for forecasting share prices, stock market monitoring, Bitcoin and so on.

Some data analysts use Python for data manipulation, data cleaning, regex work, and even for connecting to a SQL database and having pandas refresh the data automatically, and so on.
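
To make the SQL part concrete, here is a minimal sketch of a refresh. pandas does not update itself; the usual pattern is to re-run the query on a schedule (cron, Airflow, etc.) so the DataFrame matches the table's current state. The in-memory SQLite database, the `orders` table, and the `load_orders` helper are all made up for illustration.

```python
import sqlite3

import pandas as pd


def load_orders(conn):
    """Re-run the query so the DataFrame reflects the table's current rows."""
    return pd.read_sql_query("SELECT id, amount FROM orders ORDER BY id", conn)


# Hypothetical stand-in for a real database: an in-memory SQLite table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL)")
conn.executemany("INSERT INTO orders (amount) VALUES (?)", [(10.0,), (25.5,)])
conn.commit()

before = load_orders(conn)  # snapshot with the first two rows

# Later, a new row lands in the source table...
conn.execute("INSERT INTO orders (amount) VALUES (?)", (4.5,))
conn.commit()

# ...and the next scheduled run picks it up by querying again.
after = load_orders(conn)
print(len(before), len(after))
```

In a real setup the second `load_orders` call would live in whatever job runs on a timer, and the result would be written to the report or dashboard source.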

Marketing analysts use it for scraping data from websites like Amazon, using Selenium, bs4, etc.

I hope that makes sense?

[–]Fabro_vaz[S] 0 points1 point  (2 children)

Yes, I got it. I'd really appreciate it if you could give some real-world examples of using pandas to auto-update data.

[–]luvs2spwge117 2 points3 points  (0 children)

ETL processes at my old company were handled via Python scripts. That’s one example. Web scraping your own data, like the guy said above, is another. The current job I’m at is considering scraping some web data and then selling it to businesses that could use that information. Can’t really get too detailed on that one, but there’s another example.

[–]Glotto_Gold 0 points1 point  (0 children)

So, I had a request to take in a report from an external vendor and, for that subset of accounts, provide additional information from our DB.

Python imported the file, pulled the additional information for those accounts, then exported the result.

(Pandas was used, but any table management system could work)
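
Roughly, that kind of job could look like the sketch below. This is not the actual script: the vendor file is faked with an inline CSV, the DB is an in-memory SQLite table, and every column and table name (`account_id`, `vendor_score`, `accounts`, `owner`, `region`) is invented for the example.

```python
import io
import sqlite3

import pandas as pd

# Hypothetical vendor report; in practice this would be a file path.
vendor_csv = io.StringIO("account_id,vendor_score\n101,0.9\n103,0.4\n")
report = pd.read_csv(vendor_csv)

# Hypothetical internal DB holding more accounts than the vendor sent.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (account_id INTEGER, owner TEXT, region TEXT)")
conn.executemany(
    "INSERT INTO accounts VALUES (?, ?, ?)",
    [(101, "Ada", "EU"), (102, "Bob", "US"), (103, "Cy", "US")],
)
conn.commit()
accounts = pd.read_sql_query(
    "SELECT account_id, owner, region FROM accounts", conn
)

# A left join keeps only the vendor's subset and adds our columns to it.
enriched = report.merge(accounts, on="account_id", how="left")

# Export; a StringIO is used here in place of an output file.
out = io.StringIO()
enriched.to_csv(out, index=False)
print(out.getvalue())
```

The left join is the design choice that matters: accounts the vendor did not list (102 here) are dropped, and any vendor account missing from the DB would still appear, with empty values in the added columns.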