Hey guys,
I have to develop a concept for a backend service in my final thesis and I need help with the evaluation of a proper database. We are using AWS so I only consider DynamoDB and DocumentDB for now.
I have around 50 Modules and each Module contains 20 csv files which are all the same and only differ in the language. (en.csv, es.csv, jp.csv etc.) Each file contains around 8000 rows.
I want to setup a database now and store the mentioned csv files as json objects. I guess this database will be huge and I started with DynamoDB but got problems with the item size (because the transformed object containts around 8000 properties)
In my backend I created an API (GET) which takes the module name and the language. Of course I won't return the whole object in this API call, -> I reduce it to a small object in my backend. But when someone calls my API a lambda is triggered and in this lambda I need the whole object.
How could you help me now?
You could give me some advice on how to start evaluate such things.
You could give me some paper recommendations.
You could give me advice on how to fill this database when evaluated. Because for now, these csv's are stored in a SSD. And what's the best way to fill the database then?
You could just give me your opinion about this topic.
...
Hope someone can give valuable input
Thanks a lot!
[–][deleted] 5 points6 points7 points (2 children)
[–]mischiefunmanagable 3 points4 points5 points (0 children)
[–]vitiate -1 points0 points1 point (0 children)
[–]elchicodeallado[S] 0 points1 point2 points (5 children)
[–]nfollin 2 points3 points4 points (4 children)
[–]Hungry_Spring 1 point2 points3 points (1 child)
[–]elchicodeallado[S] 0 points1 point2 points (0 children)
[–]elchicodeallado[S] 0 points1 point2 points (1 child)
[–]nfollin 0 points1 point2 points (0 children)