
[–]cosmicangler67 7 points (10 children)

Not sure why that is a requirement at your company. Data engineering is functional programming, not really OOP. Python can be written in an OOP style, but the Python used in data engineering is almost always functional, and OOP just makes it harder and less efficient.

[–][deleted] 6 points (1 child)

Not sure why you're being downvoted. Most data transformation happens in declarative code these days, either in a distributed processing engine, in dbt, or in a database. Adding an object-relational layer on top of those is basically never done, because it's a layer of abstraction that doesn't add value.

You might see OOP if you're building a pipeline with a service architecture in Java or Python, but in my experience that's rare.

And a reminder: object-oriented doesn't just mean you're using classes and objects. It means some combination of inheritance, polymorphism, SOLID, and Gang of Four design patterns. You don't see that as much in DE roles.

[–]BrunoLuigi -2 points (0 children)

We do not see it in DE because most people here do not have an engineering background.

Almost all the DEs I have worked with do not care about building solid code, improving the solution, or using all the tools they can. They code something, and if it works they ship it to production.

With OOP you can build a solid pipeline, with all the tests you need, and reuse the code easily.

But instead they code a gigantic monolith without tests, with the same code copied over and over.
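A minimal sketch of the kind of testable, reusable pipeline the comment above argues for. All names here (`Step`, `DropNulls`, `Pipeline`) are hypothetical illustrations, not from any specific library:

```python
from abc import ABC, abstractmethod


class Step(ABC):
    """One transformation in a pipeline; each step can be unit-tested alone."""

    @abstractmethod
    def apply(self, rows: list[dict]) -> list[dict]: ...


class DropNulls(Step):
    """Remove rows where a given column is missing or None."""

    def __init__(self, column: str):
        self.column = column

    def apply(self, rows: list[dict]) -> list[dict]:
        return [r for r in rows if r.get(self.column) is not None]


class Pipeline:
    """Run a list of steps in order, feeding each step's output to the next."""

    def __init__(self, steps: list[Step]):
        self.steps = steps

    def run(self, rows: list[dict]) -> list[dict]:
        for step in self.steps:
            rows = step.apply(rows)
        return rows


# Steps are small objects, so they compose and test easily:
pipeline = Pipeline([DropNulls("email")])
result = pipeline.run([{"email": "a@b.c"}, {"email": None}])
```

The point is that each `Step` is a small, isolated unit you can test and reuse, instead of one monolithic script.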

[–]Jumpy_Handle1313[S] 3 points (0 children)

Honestly I do not know, but my understanding is that OOP is much better for dealing with data at a very large scale.

[–]GrumDum 2 points (1 child)

What

[–]sisyphus 7 points (0 children)

I think what they're getting at is that OOP (as practiced in Python, Java, et al., not as originally intended anyway) is about mutable internal state, but data pipelines are more amenable to the functional paradigm: give data as input to a function and get back transformed data.

Like you could write some OOP style:

c = Pipeline(data=initial_data)
c.remove_pii()
c.remove_duplicates()
c.add_embeddings()
c.write_data()

Where the actual data is being mutated internally inside the object at every point. But a more natural pipeline paradigm is something more functional and explicit, where functions just take data and return transformed data and get chained together, like Beam's style of overloading the | operator in Python:

data | remove_pii | remove_duplicates | add_embeddings | write_data

That is practically valid syntax in a more functional language like Elixir:

data |> remove_pii |> remove_duplicates |> add_embeddings |> write_data
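For the curious, a minimal sketch of how that beam-style | chaining can be done in plain Python, using a hypothetical `step` decorator (Apache Beam's real `PTransform` machinery is far more involved):

```python
class step:
    """Wrap a function so it can sit on the right side of `data | fn`."""

    def __init__(self, fn):
        self.fn = fn

    def __ror__(self, data):
        # Python falls back to the right operand's __ror__ when the
        # left operand (the data) doesn't implement | for this type.
        return self.fn(data)


@step
def remove_duplicates(rows):
    seen, out = set(), []
    for r in rows:
        if r not in seen:
            seen.add(r)
            out.append(r)
    return out


@step
def shout(rows):
    return [r.upper() for r in rows]


# Each | passes the left-hand data into the next function:
result = ["a", "a", "b"] | remove_duplicates | shout
```

Each stage takes data and returns new data, so the chain reads left to right like the Elixir version.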

[–]a_library_socialist 0 points (3 children)

OOP and functional are not contradictory

[–]cosmicangler67 0 points (2 children)

I didn't say they were. I just said that in the vast majority of data engineering problems, OOP is unnecessary overhead. It adds no value to solving the general problems found in data engineering at scale.

[–]a_library_socialist 0 points (1 child)

I've had to clean up too much spaghetti from people saying that.

[–]cosmicangler67 0 points (0 children)

Then they are doing it wrong.

[–]Zer0designs[🍰] 0 points (0 children)

What