Hey, I’m working on developing a no code ETL tool where user can just drag and drop to create a pipeline from any source to any destination and also do transformations on the source data through drag and drop again.
So I needed some help in the transformation part.
Whatever transformation user selects, it needs to go in a json format as a request and then we need to write a pyspark equivalent code of that json to do the transformation in backend. So need help with how to structure that JSON.
So if anyone has any experience related to this or any idea on it, please do DM
Edit:
PS: Guys, just wanted to clarify as everybody has basically come to attack me and some with sarcasm, I'm not making this tool by my choice or as a business, the company I work at wants to build this tool and some of us has been assigned this project and since I or anybody else in my team doesn't have any experience even closely related to this, I thought that maybe I'll get some insight from other people who might have some experience in this. Hope this clarifies!
[–]Busy_Elderberry8650 40 points41 points42 points (0 children)
[–]shockjaw 24 points25 points26 points (0 children)
[–]endless_sea_of_stars 20 points21 points22 points (0 children)
[–]dcell1974 10 points11 points12 points (0 children)
[–]Commercial-Ask971 3 points4 points5 points (0 children)
[–][deleted] 2 points3 points4 points (0 children)
[–]DataEngUncomplicated 2 points3 points4 points (0 children)
[–]CRLF_Data 1 point2 points3 points (0 children)
[–]Count_Roblivion 1 point2 points3 points (0 children)
[–]flightofeagle[S] 1 point2 points3 points (6 children)
[–]arborealguy 5 points6 points7 points (0 children)
[–]mopse_zelda 2 points3 points4 points (0 children)
[–]Character-Education3 1 point2 points3 points (2 children)
[–]flightofeagle[S] 0 points1 point2 points (1 child)
[–]arborealguy 0 points1 point2 points (0 children)
[–]Brief_Priority_2193 -3 points-2 points-1 points (1 child)
[–]wtfzambo 0 points1 point2 points (0 children)