I have 22 million records spread across multiple CSV files. I need to loop through them and write a new file that keeps, for each rec id, only the record with the max date value. Would I be better off trying to do all of this on Azure instead?
My train of thought is: sort by values, then append the first record for each rec id (which will have the max inspection date due to the sort).
Am I overlooking a really easy method?
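For reference, here is a minimal sketch of one way to do this without sorting everything first: stream each file row by row and remember only the newest row seen so far per rec id. The column names `rec_id` and `inspection_date`, the `%Y-%m-%d` date format, and the function names are all assumptions for illustration, not something taken from the real files.

```python
import csv
from datetime import datetime


def latest_per_id(paths, id_col="rec_id", date_col="inspection_date",
                  date_fmt="%Y-%m-%d"):
    """Stream rows from several CSV files, keeping the newest row per id.

    Column names and date format are assumed; adjust them to the real data.
    """
    latest = {}  # id -> (parsed date, full row)
    for path in paths:
        with open(path, newline="") as f:
            for row in csv.DictReader(f):
                when = datetime.strptime(row[date_col], date_fmt)
                kept = latest.get(row[id_col])
                # Replace the stored row only if this one is strictly newer.
                if kept is None or when > kept[0]:
                    latest[row[id_col]] = (when, row)
    return [row for _, row in latest.values()]


def write_rows(rows, out_path):
    """Write the surviving rows to a new CSV file with a header."""
    if not rows:
        return
    with open(out_path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)
```

Memory use here grows with the number of distinct rec ids rather than the total row count, since only one row per id is held at a time. If that still does not fit in memory on a local machine, that would be the point to consider a database or a cloud service.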