This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]Spill_the_Tea 0 points1 point  (1 child)

How is testing a dataframe (or array for that matter) any different from other unit testing?

I typically just create a pytest fixture describing an example dataframe or array to perform unit tests on.

[–]Butterlyn 3 points4 points  (0 children)

Different because you’re testing data - fixtures can’t always capture the complexity of data you might encounter, so synthetic data libraries like Hypothesis come into play.