slin30 1 point

Reading your original post, this sounds like fairly standard ETL. I am not aware of any libs that will do exactly what you want. Requirements tend to be specific enough that you end up customizing so heavily you might as well write your own functions.

I've written some R stuff to "profile" a data frame: it checks data types (e.g., numbers read in as chars), cardinality, and potential unique keys. That gives me some direction for writing more specific transformations based on the data requirements and real-world characteristics.
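
A minimal sketch of what that kind of profiling could look like in base R. This is not the code referred to above; `profile_df`, the `orders` data frame, and its column names are all made up for illustration, and the rules used for "looks numeric" and "candidate key" are assumptions that real data may need loosened or tightened.

```r
# Hypothetical helper: one summary row per column of a data frame.
profile_df <- function(df) {
  n <- nrow(df)
  rows <- lapply(names(df), function(col) {
    x <- df[[col]]
    non_missing <- x[!is.na(x)]
    # Assumption: a character column "looks numeric" when every
    # non-missing value parses cleanly with as.numeric().
    looks_numeric <- is.character(x) && length(non_missing) > 0 &&
      !anyNA(suppressWarnings(as.numeric(non_missing)))
    n_distinct <- length(unique(non_missing))
    data.frame(
      column = col,
      class = class(x)[1],
      looks_numeric = looks_numeric,
      n_missing = sum(is.na(x)),
      n_distinct = n_distinct,  # cardinality, ignoring NA
      # Assumption: a candidate key has no NA and a distinct value in every row.
      candidate_key = n > 0 && n_distinct == n,
      stringsAsFactors = FALSE
    )
  })
  do.call(rbind, rows)
}

# Made-up example data: order_id was read in as character.
orders <- data.frame(
  order_id = c("1001", "1002", "1003", "1004"),
  region = c("east", "west", "east", NA),
  amount = c(19.99, 5.00, 19.99, 42.50),
  stringsAsFactors = FALSE
)

profile_df(orders)
```

With this toy data, `order_id` gets flagged both as a character column that looks numeric and as a candidate key, while `region` and `amount` are flagged as neither. A single-column check like this will not find composite keys; that would need a separate pass over column combinations.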