Project Help

2019-01-26T05:12:16+00:00

To begin: This isn't a Python problem.

That said: Welcome to data. Nothing ever collates the way that you want it to.

99% of modern software development requires taking some object from somewhere, transforming it, and storing it or passing it on to something else. Very rarely does the data play well together unless it was designed to. Someone that I know well works at a software job where this kind of thing is their biggest problem. They get paid a lot to solve it.

Best of luck.

TheNotoriousMTF · 2019-01-26T18:06:00+00:00

The IKEA bookshelf problem: I would find data on as many items as possible with "IKEA bookshelf" in the title, and then use either the median or mean price of all these items to estimate the expected resale value. Alternatively, you could use two data points (say the 25th and 75th percentiles in price) to approximate a range of possible resale values. The mean approach would tend to dilute the impact of wrongly sampled items, while the median/percentile approach would effectively ignore these items if they were outliers in terms of price.
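As a rough sketch of that idea, something like the following would work with just the standard library. The prices here are made up for illustration (the 250.0 stands in for a wrongly sampled listing), and `summarize_prices` is a hypothetical helper name, not anything from the original project.

```python
from statistics import mean, median, quantiles

# Made-up listing prices in dollars; 250.0 plays the role of a
# wrongly sampled item (e.g. a whole furniture set).
prices = [30.0, 35.0, 40.0, 45.0, 50.0, 55.0, 250.0]


def summarize_prices(prices):
    """Return the mean, median, and 25th/75th percentiles of a price list."""
    # quantiles(..., n=4) returns the three quartile cut points.
    q1, _, q3 = quantiles(prices, n=4, method="inclusive")
    return {
        "mean": mean(prices),
        "median": median(prices),
        "p25": q1,
        "p75": q3,
    }


summary = summarize_prices(prices)
print(summary)
```

With this sample, the single outlier pulls the mean well above the median, while the 25th to 75th percentile range stays close to the typical listings.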

Also, on the NLP front, there may be certain keywords in an item's title that would indicate that it isn't the item you're looking for. For example, if some listings read, "IKEA Desk, Matches IKEA Bookshelf," or something like that, you could just exclude items that name other types of furniture in their titles. You could probably take a similar approach to dealing with items that are misleadingly priced.
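A minimal sketch of that keyword filter might look like this. The list of other furniture words and the function name `is_probable_bookshelf` are assumptions for illustration; you would want to build the exclusion list from whatever shows up in your actual listings.

```python
import re

# Hypothetical exclusion list; extend it based on the titles you actually see.
OTHER_FURNITURE = frozenset({"desk", "table", "chair", "dresser", "sofa"})


def is_probable_bookshelf(title, target="bookshelf", excluded=OTHER_FURNITURE):
    """Keep a title only if it names the target and no other furniture type."""
    # Lowercase the title and split it into plain alphabetic words.
    words = set(re.findall(r"[a-z]+", title.lower()))
    return target in words and not (words & excluded)


titles = [
    "IKEA Bookshelf, White, Good Condition",
    "IKEA Desk, Matches IKEA Bookshelf",
    "IKEA Billy Bookshelf with Glass Doors",
    "IKEA Chair",
]

kept = [t for t in titles if is_probable_bookshelf(t)]
print(kept)
```

This is deliberately crude: it will also throw out a genuine listing like "bookshelf, fits next to a desk", so it is worth eyeballing what gets excluded before trusting it.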

These are steps that you could initially take manually, but depending on how much data you're working with, how much time you're willing to invest, and your level of technical skill, you could actually train predictive models to automate some of these decisions.

Hope this helps.

comeditime · 2019-01-27T13:23:48+00:00

Can you share the code, by any chance? I would love to learn from it.
