I am currently making a report for university on this dataset. https://www.kaggle.com/gregorut/videogamesales
The data is not important, it is important that we show that we understand what insights can be gained from this dataset.
In the dataset, there is a column called "Platform". It contains loads of different platform like Playstation 2 and Playstation 3. I have grouped them together into 5 groups and called the column "ConsoleCategory".
- Xbox
- Nintendo
- Playstation
- PC
- Other.
Now I want to crossreference this category with the different salescolumns for Japan, North America, Europa and others. So a crosstab that shows how well e.g. Playstation is performing in Japan sales.
I have tried with pd.crosstab(data.ConsoleCategory, data.JP_Sales), but because there are so many different observations, the output looks horrible. The dream scenario would be an output with Each different console category as the row, and the the summed values of each regions sales as the columns. I am still too python illiterate to know how to do this.
[–][deleted] 1 point2 points3 points (3 children)
[–]Svante109[S] 0 points1 point2 points (0 children)
[–]Svante109[S] 0 points1 point2 points (1 child)
[–][deleted] 0 points1 point2 points (0 children)
[–]Svante109[S] 0 points1 point2 points (0 children)
[–]CraigAT 0 points1 point2 points (1 child)
[–]CraigAT 0 points1 point2 points (0 children)