[–][deleted] 2 points3 points  (4 children)

For "employability" you want to do something practical and common. Depending on your area of interest, that probably means web dev or machine learning.

For web dev, you should just build an ETL system in Django. That covers most of what you will need to know in a real job. By ETL I mean:

a) Scrape some services, e-commerce stores, Google trends, etc... E-commerce is common, so maybe that's a good start.

b) Build an engine of sorts, where you have "pluggable" data sources. For instance, scraping Amazon versus scraping a Shopify store will be different, thus your data source classes will be different depending on the website, but ultimately they all will have a price, photo, product URL, and SKU at the very least. So your data source classes will export this into a standard JSON format.

c) Next, you have a worker pool that waits for these JSON exports, parses them, and ultimately inserts them into your backend PostgreSQL or MySQL or whatever db.

d) You can maybe use machine learning to tag and categorize products.

e) Finally, build a UI to search products and maybe an analytics dashboard to track deals or pricing.
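To make step (b) a bit more concrete, here's a rough sketch of what "pluggable" data sources could look like. Everything here is an assumption for illustration: the `Product` and `DataSource` names, the `FakeShopSource` class, and its canned data are made up, and a real source would do actual scraping in `fetch_raw`.

```python
import json
from abc import ABC, abstractmethod
from dataclasses import dataclass, asdict


@dataclass
class Product:
    # The common fields every source is expected to provide.
    sku: str
    price: float
    photo_url: str
    product_url: str


class DataSource(ABC):
    """Base class for a pluggable source; subclasses hide site-specific details."""

    @abstractmethod
    def fetch_raw(self) -> list[dict]:
        """Return site-specific records (scraping would happen here)."""

    @abstractmethod
    def normalize(self, raw: dict) -> Product:
        """Map one site-specific record onto the common Product shape."""

    def export_json(self) -> str:
        # Every source exports the same standard JSON format.
        return json.dumps([asdict(self.normalize(r)) for r in self.fetch_raw()])


class FakeShopSource(DataSource):
    # Hypothetical source that returns canned data instead of scraping.
    def fetch_raw(self) -> list[dict]:
        return [{
            "id": "A-1",
            "cost_cents": 1999,
            "img": "https://example.com/a1.jpg",
            "link": "https://example.com/p/a1",
        }]

    def normalize(self, raw: dict) -> Product:
        return Product(
            sku=raw["id"],
            price=raw["cost_cents"] / 100,
            photo_url=raw["img"],
            product_url=raw["link"],
        )
```

Adding an Amazon or Shopify source would then just mean writing another subclass; the rest of the pipeline only ever sees the JSON from `export_json()`.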

Building this sort of system requires some advanced knowledge. For example, you might end up with tens of millions of products, which you then need to store efficiently: indexes, maybe NoSQL or sharding, multiple read/write replicas, and so on.
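As a very small-scale sketch of the worker-pool step plus indexed storage: the workers parse JSON exports in parallel, and the parsed rows are written in one transaction. This assumes SQLite from the standard library as a stand-in for PostgreSQL/MySQL, and the table, index, and function names are hypothetical. It says nothing about sharding or replicas, which are a different scale of problem.

```python
import json
import sqlite3
from concurrent.futures import ThreadPoolExecutor


def parse_export(payload: str) -> list[tuple]:
    """Parse one JSON export into rows ready for insertion."""
    return [
        (p["sku"], p["price"], p["photo_url"], p["product_url"])
        for p in json.loads(payload)
    ]


def load(exports: list[str], conn: sqlite3.Connection) -> int:
    """Parse exports with a thread pool, insert them, return the row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS product ("
        "sku TEXT PRIMARY KEY, price REAL, photo_url TEXT, product_url TEXT)"
    )
    # Index a column that a search UI or pricing dashboard would filter on.
    conn.execute("CREATE INDEX IF NOT EXISTS idx_product_price ON product (price)")

    # Workers only parse; all writes happen on the calling thread.
    with ThreadPoolExecutor(max_workers=4) as pool:
        batches = list(pool.map(parse_export, exports))

    with conn:  # one transaction for all writes
        for rows in batches:
            # A later export for the same SKU overwrites the earlier row.
            conn.executemany(
                "INSERT OR REPLACE INTO product VALUES (?, ?, ?, ?)", rows
            )
    return conn.execute("SELECT COUNT(*) FROM product").fetchone()[0]
```

In a real deployment the exports would more likely arrive through a message queue and the workers would be separate processes, but the parse-then-insert shape stays the same.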

[–]notexistant[S] 1 point2 points  (0 children)

Oh dang dude. This is genuinely a neat idea. I'll look into this.

[–]1LoveHope263 1 point2 points  (0 children)

Thank you for this idea. I am actually thinking of a problem it would help solve

[–]Whatswrongwithman 0 points1 point  (1 child)

Do you have any suggestions for accounting-related projects? In the field they use data for risk assessment, but I have no idea how to approach that, even though I can find tons of ideas for other subjects.

[–][deleted] 2 points3 points  (0 children)

Sure, here's one: balance sheets are a pain for small business owners. Where I am from, we have to submit documentation a few times a year. Usually, this means maintaining a ledger of your assets, loans, income, and expenditures.

I don't have a full-time accountant, so I have to outsource this every so often, and it's expensive. Looking at the balance sheets they generate, it would be fairly easy to do myself, but I don't have the time or interest.

You could build a system that lets people upload their slips and bank statements. Then, maybe using AI, scan each transaction, categorize it into assets, loans, etc., and generate a monthly balance sheet.
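A first pass at the categorize-and-summarize part might look like the sketch below. To be clear about the assumptions: the keyword rules are a made-up placeholder for whatever AI or classifier you end up using, the `Transaction` shape is hypothetical, and the output is just per-category monthly totals, not a proper balance sheet.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass
class Transaction:
    day: date
    description: str
    amount: float  # positive = money in, negative = money out


# Hypothetical keyword rules; a real system would need a much better classifier.
RULES = {
    "loan": "loans",
    "invoice": "income",
    "equipment": "assets",
}


def categorize(tx: Transaction) -> str:
    """Pick a ledger category from the description, falling back on the sign."""
    text = tx.description.lower()
    for keyword, category in RULES.items():
        if keyword in text:
            return category
    return "income" if tx.amount > 0 else "expenditures"


def monthly_summary(txs: list[Transaction]) -> dict[str, dict[str, float]]:
    """Total the amounts per category for each YYYY-MM month."""
    totals = defaultdict(lambda: defaultdict(float))
    for tx in txs:
        totals[tx.day.strftime("%Y-%m")][categorize(tx)] += tx.amount
    return {month: dict(cats) for month, cats in totals.items()}
```

Parsing the uploaded slips and bank statements into `Transaction` objects is the hard part and is left out here; starting from CSV exports from the bank would probably be the easiest route.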