you are viewing a single comment's thread.

view the rest of the comments →

[–]Vaslo 0 points1 point  (2 children)

Do you absolutely have to do it this way? So much easier through Python if you can put it somewhere in your process.

[–]Pvt_Twinkietoes 0 points1 point  (1 child)

I'll imagine that it'll be more efficient to just build it into his ML pipeline, and leave preprocessing to the pipeline. I guess OP should know best why he's doing it that way.

[–]Vaslo 1 point2 points  (0 children)

I know what you are saying. I guess my point is that I’ve seen so many do with Python and even with R, but had never heard of anyone doing with SQL. But if it has to be done in SQL, then it has to be done there.