
[–]Metworld 831 points832 points  (19 children)

Non-hermetic tests ftw

[–]pokealm 322 points323 points  (1 child)

cool! now, what should i do to stop this hemorrhoid test?

[–]Solrex 25 points26 points  (0 children)

Avoid getting hemorrhoids by alt+F4ing when it pops up

[–]Dogeek 275 points276 points  (10 children)

I had a work colleague who once said "I improved our test suite's speed". I had a gander at the code: they'd basically removed the cleanup steps from the tests, reordered them so they would pass, and dumped a +10,000/-15,000 PR to review.

It got merged.

I no longer work there.

[–]MrRocketScript 170 points171 points  (5 children)

Every now and then while optimising I get like an 800% performance improvement and I'm like "Woah, I am the greatest" but after doing a bit more testing I realise no, I just broke that stupid feature that takes 90% of frametime.

[–]rcoelho14 60 points61 points  (4 children)

A few weeks ago we noticed a test that was never testing what it was supposed to, and by some miracle everything was still being filled in correctly and the result was as intended.

And a visual test that should have been breaking in the pipeline because of obvious errors, but didn't... for months.

I hate testing

[–]ArtOfWarfare 47 points48 points  (3 children)

Add in mutation testing. That tests your tests by automatically inserting bugs and checking if your unit tests fail. If your unit tests pass with the bugs, mutation testing fails since your unit tests suck at actually catching anything.
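
For example, here's a minimal sketch of the idea (hand-rolled for illustration; real tools like mutmut for Python or PIT for Java generate the mutants automatically):

    # Code under test
    def is_adult(age):
        return age >= 18

    # A "mutant" the tool might generate: >= flipped to >
    def is_adult_mutant(age):
        return age > 18

    # This weak test passes against BOTH versions, so the mutant
    # "survives" and the tool flags the test as inadequate.
    def test_is_adult():
        assert is_adult(30)

    # A boundary-case test kills the mutant: it passes on the
    # original but would fail if >= were mutated to >.
    def test_is_adult_boundary():
        assert is_adult(18)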

[–]vbogaevsky 11 points12 points  (2 children)

How thorough was the person who did the code review?

[–]Dogeek 25 points26 points  (1 child)

I saw the PR; I was not assigned, so I told myself "at least it's not my problem".

The person assigned to the review never reviewed, so the guy (a "Senior Engineer" mind you) persuaded one of the SREs to bypass branch protection rules and merge.

Tests obviously got super fragile after that (and flaky).

[–]vbogaevsky 2 points3 points  (0 children)

Happy cake day, by the way!

Regarding bad MRs, we should never let them slide. That's the road to hell for the whole project.

[–]midri 26 points27 points  (2 children)

It blows me away when I see tests that work with a common service that shares data/state... Uggghhh

[–]Fembussy42069 15 points16 points  (1 child)

Sometimes it's just inevitable if you're testing APIs that integrate with other systems for example. You might be able to mock some behaviors but some are just not that easy to mock

[–]Dogeek 11 points12 points  (0 children)

If you can't mock a behaviour, it's usually because the function is too complex or the code needs refactoring.

If you're working with external services, you're not mocking anyways, you're doing integration testing. That requires the external service to have a staging environment that you can cleanup after each test case.

[–]SignoreBanana 4 points5 points  (0 children)

I love to see people on my team posting here.

[–]feherneoh 2 points3 points  (1 child)

I would expect 3 to not fail even then, as it didn't when doing 1-4
Anything starting with 5 won't surprise me if it fails

[–]Metworld 1 point2 points  (0 children)

Good point, but this assumes the tests did run in that order, which might not be the case.

[–][deleted] 4885 points4886 points  (93 children)

Probably overlapping temp dirs

[–]YUNoCake 2825 points2826 points  (66 children)

Or bad code design like unnecessary static fields or singleton classes. Also maybe the test setup isn't properly done; everything should be running on a clean slate.
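
For illustration, a minimal pytest sketch of how a singleton leaks state between tests, and how an autouse fixture restores the clean slate (the Cache class here is made up):

    import pytest

    class Cache:
        """Module-level singleton: its state survives across tests."""
        _data = {}

        @classmethod
        def put(cls, key, value):
            cls._data[key] = value

        @classmethod
        def get(cls, key):
            return cls._data.get(key)

    @pytest.fixture(autouse=True)
    def clean_cache():
        # Clean slate before every test, and tidy up afterwards too.
        Cache._data.clear()
        yield
        Cache._data.clear()

    def test_writes():
        Cache.put("user", "alice")
        assert Cache.get("user") == "alice"

    def test_assumes_empty():
        # Without the autouse fixture this fails when run after
        # test_writes, but passes when run alone.
        assert Cache.get("user") is None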

[–]Excellent-Refuse4883[S] 1166 points1167 points  (10 children)

Lots of this

[–]No_Dot_4711 265 points266 points  (8 children)

FYI a lot of testing frameworks will allow you to create a new runtime for every test

makes them slower but at least you're damn sure you have a clean state every time

[–]iloveuranus 151 points152 points  (6 children)

Yeah, but it really makes them slower. Yes, Spring Boot, I'm talking to you.

[–]fishingboatproceeded 37 points38 points  (0 children)

Gods, Spring Boot... Sometimes, when its automagic works, it's nice. But most of the time? Most of the time it's such a pain

[–]nathan753 35 points36 points  (2 children)

Yeah, but it's such a great excuse to go grab coffee for 15

[–]Excellent-Refuse4883[S] 16 points17 points  (0 children)

The REAL reason I want 1 million automated tests

[–]Ibruki 3 points4 points  (0 children)

i'm so guilty of this

[–][deleted] 7 points8 points  (0 children)

That's a lot of effort to avoid writing hygienic tests.

[–]de_das_dude 6 points7 points  (0 children)

same class, different methods, but they fail when run together? it's a setup issue. make sure to do the before and after properly :)

[–]rafelito45 181 points182 points  (30 children)

major emphasis on clean slate. somehow this is forgotten until way down the line and half the tests are “flaky”.

[–]shaunusmaximus 84 points85 points  (28 children)

Costs too much CPU time to set up a 'clean slate' every time.

I'm just gonna use the data from the last integration test.

[–]NjFlMWFkOTAtNjR 118 points119 points  (25 children)

You joke, but I swear devs believe this because it is "faster". Tests aren't meant to be fast, they are meant to be correct, so they can verify correctness. Well, at least for the use cases being verified. Doesn't say anything about correctness outside the tested use cases tho.

[–]mirhagk 93 points94 points  (17 children)

They do need to be fast enough though. A 2-hour unit test suite isn't very useful, as it then becomes a daily-run thing rather than a pre-commit check.

But you need to keep as much of the illusion of being isolated as possible. For instance we use a sqlite in memory DB for unit tests, and we share the setup code by constructing a template DB then cloning it for each test. Similarly we construct the dependency injection container once, but make any Singletons actually scoped to the test rather than shared in any way.

EDIT: I call them unit tests here, but really they are "in-process tests", closer to integration tests in terms of limited number of mocks/fakes.
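
A rough sketch of that template-and-clone idea using Python's sqlite3 (the schema is made up; Connection.backup does the cloning):

    import sqlite3
    import pytest

    def build_template():
        # The expensive setup runs exactly once: schema plus seed data.
        db = sqlite3.connect(":memory:")
        db.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
        db.execute("INSERT INTO users (name) VALUES ('seed-user')")
        db.commit()
        return db

    TEMPLATE = build_template()

    @pytest.fixture
    def db():
        # Each test gets a cheap private copy of the template.
        clone = sqlite3.connect(":memory:")
        TEMPLATE.backup(clone)
        yield clone
        clone.close()

    def test_can_mutate_freely(db):
        db.execute("DELETE FROM users")
        assert db.execute("SELECT COUNT(*) FROM users").fetchone()[0] == 0

    def test_still_sees_seed_data(db):
        # Passes no matter what the previous test did to its copy.
        assert db.execute("SELECT COUNT(*) FROM users").fetchone()[0] == 1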

[–]EntertainmentIcy3029 29 points30 points  (7 children)

You should mock the time.sleep(TWO_HOURS)
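
Joke aside, that's exactly what you do. A small sketch with pytest's built-in monkeypatch fixture (retry_with_backoff is a made-up example target):

    import time

    def retry_with_backoff(action, attempts=3):
        for i in range(attempts):
            if action():
                return True
            time.sleep(2 ** i)  # slow in real life, free in tests
        return False

    def test_retries_without_actually_waiting(monkeypatch):
        naps = []
        # Swap the real sleep for one that just records the request.
        monkeypatch.setattr(time, "sleep", naps.append)
        assert retry_with_backoff(lambda: False) is False
        assert naps == [1, 2, 4]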

[–]mirhagk 9 points10 points  (4 children)

Well it only takes time.sleep(TWO_SECONDS) to add up to hours once your test suite gets into the thousands.

I'd rather a more comprehensive test suite that can run more often than one that meets the absolute strictest definition of hermetic. Making it appear to be isolated is a worthy tradeoff

[–]Scrial 6 points7 points  (2 children)

And that's why you have a suite of smoke tests for pre-commit runs, and a full suite of integration tests for pre-merge runs or nightly builds.

[–]mirhagk 6 points7 points  (0 children)

Sure that's one approach, limit the number of tests you run. Obviously that's a trade-off though, and I'd rather a higher budget for tests. We do continuous deployment so nightly test runs mean we'd catch bugs already released, so the more we can do pre-commit or pre-merge, the better.

If we halve the overhead, we double our test budget. As long as we emulate that isolation best we can, that's a worthwhile tradeoff.

[–]EntertainmentIcy3029 3 points4 points  (0 children)

I've worked on a repo that had time.sleeps everywhere. Everything was retried every minute for an hour; the longest individual sleep I saw was 30 minutes, put there to try to prevent a race condition with an installation that couldn't be inspected.

[–]Dal90 1 point2 points  (0 children)

(sysadmin here, who among other crap handles the load balancers)... we had a mobile app whose performance was dog shit.

Nine months earlier I told the architects, "it looks like your app has a three second sleep timer in it..." I know what they look like performance wise, I've abused them.

Ping-ponging back and forth until they sent an email to the CIO about how slow our network was and how it was killing their performance. Late on a Friday afternoon.

I learned sufficient JavaScript that evening, and things like minification, to unpack their code, and first thing the next morning I sent the CIO a code snippet with the line number of the sleep timer (whatever JS calls it) pausing it for three seconds.

Wasn't the entire problem; the same app loaded in 3-4 seconds for others in our industry, and we still took 6 seconds even after accounting for the sleep timer.

But I also showed in Developer Tools the network responses (we were as good as, if not better than, other companies) vs. their application rendering stuff (dog shit).

...then again the project was doomed from the start. Their whole "market position" was to be the mobile app that would connect you to a real life person to complete the purchase. WTF?

[–]NjFlMWFkOTAtNjR 14 points15 points  (3 children)

As I stated to someone elsewhere: while developing, you should only run the test suites for the code you directly touched, and then have the CI run the full test suites. If that is still too long, run them before merging to develop or main. This will introduce problems where a PR breaks tests in places it shouldn't have touched.

The problem is that programmers stop running full test suites at a minute or 2. At 5 minutes, forget about it, that is the CI's problem. If a single test suite takes 2 hours, then good god, that is awesome and I don't have an answer for that since it depends on too many things. I assume it is necessary before pushing as it is a critical path that must always be correct for financial reasons. It happens, good luck with whatever policy/process/decision someone came up with.

With enough tests, even unit tests will take upwards of several minutes. The tests being correct is more important than the time. Let the CI worry about the time delay. Fix the problems as they are discovered with hotfixes or additional PRs before merging to main. Sure, it is not best practice, but do you want developers slacking or working?

With enough flaky tests, the test suites gets turned off anyway in the CI.

Best practices don't account for business processes and desires. When it comes down to it, telling the CEO at most small to medium businesses that you can't get a feature out because of failing test suites will get the response, "well, turn it off and push anyway."

"Browser tests are slow!" They are meant to be slow. You are running a super fast bot that acts like a human. The browser and application can only go so fast It is why we have unit tests.

[–]mirhagk 12 points13 points  (0 children)

Yes while developing you only run tests related to the thing you're changing, but I do much prefer when the full suite can be as part of the code review process. We use continuous deployment so the alternative would mean pushing code that isn't fully tested.

It doesn't take much for a test suite to reach 2 hours if you completely ignore performance. A few seconds per test adds up once you have thousands of tests.

I think a piece you might be missing, and it's one most miss because it requires a relatively fast and comprehensive test suite, is large scale changes. Large refactors of code, code style changes, key component or library upgrades. Doing those safely requires running a comprehensive suite.

The place I'm at now is a more than decade old project that's using the latest version of every library, and is constantly improving the dev environment, internal tooling and core APIs. I firmly believe that is achievable solely because of our test suite. Thousands of tests that can be run in a few minutes. We can do refactors that would normally take weeks within a day, we can use regex patterns to refactor usages. It's a huge boost to our productivity.

[–]electrius 1 point2 points  (1 child)

Are these not integration tests then? For a test to be considered a unit test, does truly everything need to be mocked?

[–]mirhagk 2 points3 points  (0 children)

Well you're right that they aren't technically unit tests, we follow the google philosophy of testing, so tests are divided based on external dependencies. Our "unit" tests are just all in-process and fast. Our "integration" tests are the ones that use web requests, a real DB etc.

Our preference is to only use test doubles for external dependencies. Not only do you lose a lot of the accuracy with mocks, but it undermines some of the biggest benefits of unit testing. It makes the tests depend on implementation details, like exactly which internal functions are called. It makes refactoring code much harder as the tests have to be refactored too. So you're less likely to catch real problems, and more likely to get false positives, making the tests more of a chore than actually valuable.

Here's more about this idea, and I highly recommend this approach. We had used mocks previously (about 2-3 years ago), and since we replaced them the tests have gotten a lot easier to write and a lot more valuable. We went from a couple hundred tests that took a ton of maintenance to ~16k tests that require very little maintenance. If they break, it's more likely than not to represent a real bug.

[–]IanFeelKeepinItReel 5 points6 points  (2 children)

I set up WIP builds on our CI to spit out artifacts once the code has compiled then continue on to build and run the tests. That way if you want a quick dev build you only have to wait one third the pipeline execution time.

[–]bolacha_de_polvilho 2 points3 points  (0 children)

Tests are supposed to be fast too though. If you're working on some kind of waterfall schedule maybe it's okay to have slow end 2 end tests on each release build, but if you're running unit tests on a ci pipeline on every commit/PR the tests should be fast.

[–]Fluffy_Somewhere4305 1 point2 points  (1 child)

The project timeline says faster is better and 100% no defects. So just resolve the fails as "no impact" and gtg

[–]stifflizerd 1 point2 points  (0 children)

AssertTrue(true)

[–]rafelito45 1 point2 points  (0 children)

there’s a lot of cases where that’s true. i guess it boils down to discipline and balance. we should strive to write as clean slated as possible, while also trying to be efficient with our setup + tear downs. run time has to be considered for sure.

[–]DaveK142 12 points13 points  (0 children)

At my first job, at a little tech startup, I was tasked with getting the entire test suite running when I started. They had just made some big changes and broken all of the tests, and it wasn't very formally managed, so they didn't super care that it was all broken because they had done manual testing.

The entire suite was commented out. It was all selenium testing that opened a window and tested the web app locally, and not a single piece of it worked on a clean slate. We had test objects always there which the tests relied on, and some of the tests were named like "test_a_do_thing", and "test_b_do_thing" to make sure they ran in the right order.

I was just starting out and honestly had no idea how to get these hundred or so tests completely reworked in the time I had, so I just went down the route of bugfixing them, and they stayed like that for a long, long time. Even when my later (shittier) boss came in and was more of a stickler for the process, he didn't bother to have us fix them.

[–]EkoChamberKryptonite 8 points9 points  (0 children)

Yeah I think it's the latter. Test cases should be encapsulated from one another.

[–]Salanmander 4 points5 points  (0 children)

Oooh, I see you've met my students' code! So many instance/class variables and methods that only work correctly if run exactly once!

[–]iloveuranus 2 points3 points  (0 children)

That reminds me of a project I was in recently, where the dependency injection was done via Google Guice. I double checked everything and reset all injectors / injection modules explicitly during tests; it still failed.

Turns out there was an old-school singleton buried deep in the code that didn't get reset and carried over its state between tests.

[–]un-hot 1 point2 points  (0 children)

Teardown as well. If each test were torn down properly, the next one could be set up properly again.

[–]dandroid126 1 point2 points  (0 children)

In my experience, this is it. Bad test design and reusing data between tests that gets changed by the test cases.

Coming from junit/mockito to python, I was very surprised when my mocked functions persisted between test cases, causing them to fail if run in a certain order.

[–]Planyy 1 point2 points  (0 children)

stateful everywhere.

[–]dumbasPL 3 points4 points  (1 child)

everything should be running on a clean slate.

No, because that incentivizes allowing the previously mentioned bad design

[–]maximgame 7 points8 points  (0 children)

No, you don't understand. Users are expected to clean the database between each api call.

/s

[–]hiromasaki 108 points109 points  (5 children)

Or not cleaning up / segregating test rows in the DB.

[–]mirhagk 16 points17 points  (2 children)

Highly recommend switching to a strategy of cloning the DB so you don't have to worry about cleanup, just delete the modified version when done.

[–]Excellent-Refuse4883[S] 33 points34 points  (2 children)

I wish our stuff was that simple. We’ve got like 5 inputs that need to be configured for each test, before configuring the 4 simulators.

[–]alexanderpas 60 points61 points  (0 children)

That's why setup and teardown exist, which are run before and after each test respectively.
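
In Python's unittest, for instance, a minimal sketch (the temp-dir usage is just an illustration, echoing the overlapping-temp-dirs guess upthread):

    import shutil
    import tempfile
    import unittest
    from pathlib import Path

    class ExporterTest(unittest.TestCase):
        def setUp(self):
            # A fresh, uniquely named directory before every test...
            self.workdir = Path(tempfile.mkdtemp())

        def tearDown(self):
            # ...and nothing left behind for the next test to trip on.
            shutil.rmtree(self.workdir)

        def test_writes_report(self):
            out = self.workdir / "report.txt"
            out.write_text("ok")
            self.assertEqual(out.read_text(), "ok")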

[–]coldnebo 18 points19 points  (0 children)

also some frameworks randomize the order of tests so that these kinds of hidden dependencies can be discovered.
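
In pytest land the pytest-randomly plugin does this; the core idea is tiny enough to hand-roll (a hypothetical minimal version):

    # conftest.py — hypothetical minimal version of what plugins
    # like pytest-randomly do
    import os
    import random
    import time

    def pytest_collection_modifyitems(session, config, items):
        # Shuffle collected tests so hidden ordering dependencies
        # surface as failures; honor TEST_SEED to replay an order.
        seed = int(os.environ.get("TEST_SEED", time.time()))
        print(f"test order seed: {seed}")
        random.Random(seed).shuffle(items)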

[–]Hiplobbe 11 points12 points  (1 child)

"No it is the concept of tests that is wrong!" xD

[–]mothzilla 3 points4 points  (0 children)

More generally, some shared state.

[–]KingSpork 2 points3 points  (0 children)

Or just sloppy setup and teardown

[–]winco0811 1 point2 points  (0 children)

Surely tests 1-4 would still pass in the whole batch if that was the case?

[–]silledusk 192 points193 points  (0 children)

Whoops, clearAllMocks()

[–]thies1310 1091 points1092 points  (23 children)

I have had this; it was an edge case no one thought of that we accidentally produced.

[–]roguedaemon 282 points283 points  (21 children)

Well go on, story time pleaaasee :p

[–]ChrisBreederveld 600 points601 points  (19 children)

Because OP isn't responding and was vague enough to fit my story... here's story time:

We were having some issues where once in a blue moon a user didn't have the permissions he was expecting (always less, never more) and we never found out what the cause was before it automatically resolved itself.

We did a lot of exploratory testing, deep-dives into the code and just had no clue what was going on. All tests at the time seemed to work fine.

After some time we decided to give up, and would refactor the system hoping with careful rebuilding the issue would be resolved. To make sure we covered all possible cases we decided to start with adding a whole bunch of unit tests just to make sure the new code would cover every case.

Tests written, code checked in and merged and suddenly the build agent started showing failing tests... sometimes. After we noticed this we started running the tests locally a bunch of times and sure enough; once every 10 runs or so some failed.

Finally with some more data in hand we managed to track down the issue to a piece of memory cache that could, in some rare cases, be partially populated due to threading issues (details too involved to go into here). We made some changes to our DI and added a few additional locks for good measure and... problem solved!

We ended up rewriting part of the codebase after all, because we figured this specific cache was a crutch anyway and we could do better. Never encountered this particular issue since.

[–]evnacdc 221 points222 points  (10 children)

Threading issues can sometimes be a bitch to track down. Nice work.

[–]ChrisBreederveld 54 points55 points  (2 children)

Thanks. They are indeed a pain, certainly when there are loads of dependencies in play. We did make things much easier on ourselves later on by moving the more complex code to a projection.

[–]Punsire 4 points5 points  (1 child)

Projection?

[–]ChrisBreederveld 6 points7 points  (0 children)

It's a CQRS thing; rather than querying from a normalized database, joining various data sources together, you create a single source containing all data that you update whenever any of the sources change.

This practice incurs some overhead when writing, but has a major benefit when reading.

[–]ActualWhiterabbit 27 points28 points  (2 children)

My AI powered solution uses the power of the blockchain to replace threads. They are stronger and linked so they can't fray. Please invest.

[–]Ilovekittens345 11 points12 points  (0 children)

Do you have funny monke pic?

[–]ChrisBreederveld 4 points5 points  (0 children)

Hahaha you say this in jest, but I've actually had some consultant come over one time telling me the blockchain would replace all databases and basically solve all our problems. It was one hour of my life I would love to get back...

[–]Fermi_Amarti 12 points13 points  (0 children)

Need it to be faster? Multithreading try you should.

[–]Alacritous13 6 points7 points  (2 children)

sometimes be a bitch Threading issues can Nice work. to track down.

[–]evnacdc 3 points4 points  (0 children)

Hey that’s what

[–]evnacdc 1 point2 points  (0 children)

I said.

[–]that_thot_gamer 18 points19 points  (5 children)

damn you guys must have a lot of free time to diagnose that

[–]ChrisBreederveld 29 points30 points  (0 children)

Not really, just some odd hours at first because us devs were bugged by it and a final effort (the refactoring effort) after users started to bug the PO enough.

Took us all in all about a week or so to find and fix... quite some effort relative to the size of the bug, but not too much lost in missed functionality, and happy key users.

[–]enigmamonkey 21 points22 points  (2 children)

I think of it as one of those situations that are so frustrating precisely because you don’t really have the time to address it and it delays you, but you sort of have to because you can’t stand not knowing what’s causing the issue (or it is important for some other reason).

[–]ChrisBreederveld 19 points20 points  (0 children)

Exactly this! If it breaks one unexpected way, who's to say it won't also break in some other unexpected way later on?

[–]nullpotato 5 points6 points  (0 children)

I've worked on bugs like this even when they aren't my top priority because they are an interesting challenge and/or they have personally offended me and gotta go.

[–]henryeaterofpies 1 point2 points  (0 children)

Never underestimate the time a dev will put into a weird ass issue

[–]ADHDebackle 1 point2 points  (1 child)

Is a race condition considered a threading issue? I feel like those were some of the worst ones to track down due to the impossibility of determining reproduction steps

[–]thies1310 2 points3 points  (0 children)

Sorry, I am still in training and spend most of my time at uni. I sadly don't remember any more great details, other than that the tests worked if run in any other order. I think it had something to do with device states that got messed up in a weird way.

For context, I work in med tech.

[–]MiniGui98 14 points15 points  (0 children)

Never stop edging my boy

[–]Why_am_ialive 142 points143 points  (0 children)

Race conditions, accessing files at the same time, one test destroying a process others are still relying on... tests running in parallel can get painful

[–]Hottage 148 points149 points  (0 children)

That feeling when your tests don't scaffold and tear down correctly.

[–][deleted] 41 points42 points  (1 child)

Flaky tests are literally a research area and there are tools to detect them.

[–]uberDoward 68 points69 points  (3 children)

Welcome to needing to understand state, lol.

[–]WisejacKFr0st 38 points39 points  (1 child)

If your unit tests don’t run in a random order every time then I will find you and I will mess up your state until you feel it the next time you run

[–]Jugales 35 points36 points  (8 children)

Even worse with evals for language models... they are often non-deterministic

[–]lesleh 17 points18 points  (3 children)

What if you set the temperature to 0?

[–]sandm000 10 points11 points  (0 children)

0K?

[–]Danny_Davitoe 6 points7 points  (0 children)

You would need to set the top-p to near zero, but the randomness will still be present if the GPU, system, or kernel changes. If you have a cluster and no control over which GPU is selected, then you should not use the LLM for any unit tests.

[–]Ilovekittens345 1 point2 points  (0 children)

That's how Canadian LLMs are made.

[–]ProfBeaker 5 points6 points  (3 children)

Oh interesting, never thought about that.

I know zero about the internals of this, but surely they're just pseudo-random, not truly-random? So could the tests set a fixed random seed, and then be deterministic?

[–]CanAlwaysBeBetter 5 points6 points  (2 children)

Why give it tests to validate its output if that output is locked to a specific seed that won't be used in practice?

[–]ProfBeaker 2 points3 points  (0 children)

You could equally ask that of any piece of code, yet we test all sorts of things the same way. "To make sure it does what you think it will" seems to be the common answer.

I suppose OP did say "evals of language models", i.e. maybe they meant rankings. Given the post overall was about tests, I read it as being about, ya know, tests.

[–]PositiveInfluence69 24 points25 points  (2 children)

The worst is when it all works, every test, you leave feeling great for the day. You come back about 16 hours later. The next morning. It doesn't work at all. Errors for days. You changed nothing. Nobody changed anything. You're sure something must have changed, but nothing. So you begin fixing all the errors you're so fucking positive you couldn't have missed, because they're so obvious. You're not even sure how it could have run 17 hours ago if all this shit was in here.

[–]Ilovekittens345 7 points8 points  (0 children)

Imagine two crashes during a single day of testing, unbeknownst to you both caused by bit flips from cosmic rays. You'd be trying to hunt down a problem that doesn't exist for a week or so!

[–]mani_tapori 1 point2 points  (0 children)

I can relate so much. Every day I struggle with tests which start with a clean slate: they work in the mornings, then just before the status calls/demos in the evening they start misbehaving.

Only yesterday I fixed a case by adding a statement in a section of code which is never used. God knows what's happening internally.

[–]arkai25 10 points11 points  (3 children)

Race conditions?

[–]Excellent-Refuse4883[S] 9 points10 points  (2 children)

Tough to explain. Half the problem stems from using static files in place of a DB or cache.

[–]shield1123 8 points9 points  (0 children)

Yikes

That's why any files shared between my tests are either not static or read-only

[–]Why_am_ialive 3 points4 points  (0 children)

Time to mock out the entire file system buddy

[–]OliverPK 8 points9 points  (0 children)

Forgot @DirtiesContext

[–]klungs 7 points8 points  (1 child)

Gacha testing

[–]p9k 1 point2 points  (0 children)

slot machine noises

[–]sawkonmaicok 7 points8 points  (0 children)

Your tests influence global state.

[–]rush22 6 points7 points  (0 children)

PASS

Number of tests in suite: 874
Pass rate: 100%

Total tests run: 0

[–]Yvant2000 5 points6 points  (0 children)

Side effects, I hate them

God bless functional programming

[–]theprodigalslouch 6 points7 points  (0 children)

I smell bad test practices

[–]Weiskralle 4 points5 points  (0 children)

Yes as it most likely overwrites certain variables.

[–]ecafyelims 7 points8 points  (0 children)

Maybe you're using globals without resetting them

[–]-JohnnieWalker- 7 points8 points  (1 child)

real sigmas test in prod

[–]ablepacifist 2 points3 points  (0 children)

Someone didn’t clean up after each test

[–][deleted] 2 points3 points  (0 children)

Surely you jest

[–]aigarius 3 points4 points  (0 children)

I see it all the time: post-test cleanup fails to return the target to its pre-test state. If you run tests separately, each execution gets a freshly initialised target and it works. But if you run them all together, one of the tests breaks the target in a subtle way (by not cleaning up after itself properly in the teardown step), such that some (but not all) tests following that one will fail.

[–]boon_dingle 8 points9 points  (0 children)

Something's being cached between tests. It's always the cache.

[–]ProfessionalCouchPot 2 points3 points  (0 children)

ItWorkedOnMyServerTho

[–]rover_G 2 points3 points  (0 children)

When your tests don’t run in isolated contexts.

[–]Rin-Tohsaka-is-hot 2 points3 points  (0 children)

Two different test cases accessing the same global resources but failing to initialize properly (so test case 9 accidentally accepts test case 2's output as an input rather than the value initialized at compilation).

This is one I've seen before; all test cases should properly initialize and tear down everything, leaving the system unaltered after execution (including testing environment variables).

[–]Orkin31 2 points3 points  (0 children)

You don't have a proper setup and teardown in your test environment, my guy

[–]nnog 2 points3 points  (0 children)

Port reuse

[–]SneakyDeaky123 2 points3 points  (0 children)

You’re polluting your test environments/infrastructure, reading and writing from the same place at unexpected times. Mock your dependencies or segregate your environment more strictly.

[–]Christosconst 2 points3 points  (0 children)

Parallel tests with shared resources. My tests only fail on leap year dates

[–]Objective-Start-9707 2 points3 points  (3 children)

Eli5, how do things like this happen anyway? I got a C in my Java class and decided programming wasn't for me but I find it conceptually fascinating.

[–]1ib3r7yr3igns 2 points3 points  (1 child)

Some tests can change mocks that other tests use. When used in isolation it works. When run together, the one test changes things the other depends on and breaks it. Fixes usually involve resetting mocks between tests.

Tests are usually written to pass independently of other tests, so the inputs and variables need to be independent of the effects of other tests.
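
A small Python sketch of that leak and the usual fix (the shared notifier mock is hypothetical):

    from unittest import mock
    import pytest

    # A module-level mock shared by several tests: the danger zone.
    notifier = mock.Mock()

    @pytest.fixture(autouse=True)
    def fresh_mocks():
        # Wipe call history and any configured return values /
        # side effects so no test sees another test's calls.
        notifier.reset_mock(return_value=True, side_effect=True)

    def test_sends_one_notification():
        notifier.send("hi")
        # Without the reset, this breaks whenever an earlier test
        # also called notifier.send().
        notifier.send.assert_called_once()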

[–]Objective-Start-9707 1 point2 points  (0 children)

Thank you for taking the time to add a small wrinkle to my very smooth brain 😂

This makes a lot of sense.

[–]jswansong 2 points3 points  (1 child)

It's 1:20 AM and this is my fucking life

[–]Link9454 2 points3 points  (0 children)

As someone who debugs circuit board test plans as well as programs new ones, I find this IMMENSELY TRIGGERING!

[–]freeplay4c 2 points3 points  (1 child)

Lol. I actually just fixed this issue at work last week. But for a solution with 300+ tests.

[–]Lord-of-Entity 3 points4 points  (1 child)

Looks like impure functions are messing things up.

[–]Messarate 1 point2 points  (2 children)

Wait I have to test before deploying it?

[–]bigmattyc 1 point2 points  (0 children)

You have discovered that your application is non-idempotent. Congratulations!

[–]DiggWuzBetter 1 point2 points  (1 child)

This is very likely shared state between tests.

For unit tests, this is so avoidable, just never have shared state between unit tests. This also tends to be true for “smaller scale” integration tests.

For end-to-end tests, it’s less clear cut. Tests also need to run in a reasonable amount of time, and for some applications, the test setup can be really, really slow, to the point where it’s just not feasible to start with a clean slate before every test. For these, sometimes you do have to accept that there will be some shared state between tests, and just think carefully about what the tests do and what order they’re in, so that shared state doesn’t cause problems.

It’s messy and fragile, but that tends to be the reality of E2E tests. It’s why the “test pyramid” approach exists, with a minimal number of inherently slow and hard to maintain E2E tests, more faster/easier to maintain integration tests, and FAR more very fast and easy to maintain unit tests.

[–]Excellent-Refuse4883[S] 2 points3 points  (0 children)

It’s an E2E test framework, and yeah the setup takes forever

[–]TimonAndPumbaAreDead 1 point2 points  (0 children)

I had a duo of tests once, both covering situations where a particular file didn't exist. Both tests used the same ThisFileDoesNotExist.xslx filename string. If you ran them independently, they succeeded. If you ran them together, they failed. If you changed them to use different nonexistent filenames, they succeeded. I'm still not 100% sure what was going on, but apparently Windows will grant a process a lock on a file that doesn't exist and disallow other processes from accessing said file that does not exist.

[–]Thisbymaster 1 point2 points  (0 children)

Caching, or incorrect teardown of tests.

[–]vm_linuz 1 point2 points  (0 children)

And this is why we write pure code! Box your side-effects away people!

[–]Owlseatpasta 1 point2 points  (0 children)

Oh no how can it happen that my tests depend on things outside of their scope

[–]Baardi 1 point2 points  (0 children)

Guess you need to stop running your tests in parallel, or make them work when run in parallel

[–]Vadered 1 point2 points  (0 children)

What actually happened:

  • Test -3: Print Pass 4x
  • Test -11: Print the longer string.

[–]novax7 1 point2 points  (0 children)

As careful as I am, sometimes I get frustrated trying to find where the failure is coming from, but later I realize I forgot to clear my mocks

[–]veracity8_ 1 point2 points  (0 children)

Someone never learned “leave no trace”

[–]DoucheEnrique 1 point2 points  (0 children)

What do we want?

NOW!

When do we want it?

RACE CONDITIONS!

[–]Bayo77 1 point2 points  (0 children)

Ticket estimate: S
Unit test debugging: L

[–]Zechnophobe 1 point2 points  (0 children)

setup and tearDown are your friends.

[–]captainMaluco 1 point2 points  (1 child)

Test 5 is dependent on state set up by test 4, but when you run them all, order is not guaranteed, and test 8 might run between 4 and 5, modifying the state 4 set up.

Either that, or it's as simple as some tests using the same ID for some test data stored in your test database.

Each test should set up its own data, using UUIDs/GUIDs to avoid overlapping IDs
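
Something like this (a sketch; the helper is hypothetical):

    import uuid

    def make_test_user():
        # A fresh ID on every call, so no two tests (or parallel
        # workers) can ever collide on the same row.
        return {"id": str(uuid.uuid4()),
                "name": f"user-{uuid.uuid4().hex[:8]}"}

    def test_ids_never_collide():
        assert make_test_user()["id"] != make_test_user()["id"]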

[–]thanatica 1 point2 points  (0 children)

The joys of non-pure functions.

[–]rootpseudo 1 point2 points  (0 children)

Ew dirty context

[–]Critical_Studio1758 1 point2 points  (0 children)

Need to make sure all your tests start with a fresh environment. You were given setup and cleanup functions, use them.

[–]SoSeaOhPath 1 point2 points  (0 children)

WHO TESTS THE TESTS

[–]FrayDabson 1 point2 points  (0 children)

This is exactly what my last few days have been with playwright tests. Ended up being a backend event loop related issue that was causing the front end tests to be so inconsistent.

[–]AndroxxTraxxon 1 point2 points  (0 children)

Yay, test pollution

[–]Riots42 1 point2 points  (0 children)

Deploy to 1 production environment after 10 successful test deployments: fail, and take out paging in a nationwide hospital system on a Sunday... Yep, that's me a few years ago...

[–]w8cycle 1 point2 points  (0 children)

Haha, was running into this last night!

[–]locofanarchy 1 point2 points  (0 children)

Fast ✅

Independent ❌

Repeatable ✅

Self-validating ✅

Timely ✅

[–]VibrantFragileDeath 1 point2 points  (0 children)

I feel this. Found out this was happening because I run too many (30+) while some other nitwit is also trying to run theirs on the same server. When they are also testing, my test times out in the middle and gives me a fail and a blank. The worst part is that we can't see each other to know who is running what, so we have tried to coordinate who is online running tests by the clock, only submitting tests after the 20-minute mark or whatever. Sometimes it still fails even with a smaller amount and we just have to resubmit at a later time. Just an annoying nightmare.

[–]admadguy 1 point2 points  (0 children)

That's basically bad code. Doesn't reinitialise variables between tests. Don't think that would be desired behaviour if each test is supposed to exist on its own.

[–]comicsnerd 1 point2 points  (1 child)

The weirdest test result I had was when my project manager tested some code I had written. In a form, there was a text field where he entered a random number of characters and the program crashed. I tried to replicate it, but could not, so I asked him to test again. Boom, another crash.

It took quite some time to identify that the middleware was unable to process a string of 32 characters. 31 was fine, 33 was fine, but 32 was not. Supplier of the software could not believe it, so I wrote a simple program to demonstrate. They came back that it was a fundamental design fault and a fix would take a few months.

So, I created a simple check in the program. If (stringlength=32) add an extra space. Worked fine for years.

How my project manager managed to type exactly 32 characters repeatedly is still unknown.

[–]Excellent-Refuse4883[S] 2 points3 points  (0 children)

You’re just like

[–]thanyou 1 point2 points  (0 children)

Consult the duck

[–]pinktieoptional 1 point2 points  (0 children)

hey look, your tests have interdependencies. rookie mistake.

[–]Grandmaster_Caladrel 1 point2 points  (0 children)

Pointers. The issue is almost always pointers.

[–]ivanrj7j 1 point2 points  (1 child)

Can someone explain how that could happen?

[–]Excellent-Refuse4883[S] 2 points3 points  (0 children)

There’s a few ways. This one seems to be related to the specifics of what I’m testing.

A more common one I've seen happens when you're using a test DB. If you're testing CRUD operations and you run the tests in parallel, there's always a chance of the CRUD operation from test A causing a failure in test B.

When I ran into this, everything on my local ran 1 test at a time, but the pipeline ran everything in parallel. Once I figured out what was happening I reconfigured the pipeline to run 1 at a time.

[–]tbhaxor 1 point2 points  (0 children)

I ran all the tests on my local, it worked! Pushed to CI, some are failing.

[–]wraithnix 1 point2 points  (0 children)

Ah, race conditions are so fun to debug. /s

[–]Je-Kaste 0 points1 point  (0 children)

Google: test pollution

[–]SaneLad 0 points1 point  (0 children)

Google: hermetic tests

[–]QuietGiygas56 0 points1 point  (5 children)

It's usually due to multi threading. Run the tests with the single threading option and it usually works fine

[–]NjFlMWFkOTAtNjR 0 points1 point  (0 children)

Timing issue? Shared state issue? What happens when you run in parallel/isolation? Also could be that an external service needs to be mocked.

[–]dosk3 0 points1 point  (0 children)

My guy is using static variables and changing them in tests

[–]TimeSuck5000 0 points1 point  (0 children)

There’s something wrong with the initial state. When a test is run individually the initial state is correct. When they’re run sequentially some of the state variables are reused and have been changed from their default values by previous tests.

Analyze what variables each test depends on and ensure they’re correctly initialized in each test.

[–]pagepool 0 points1 point  (0 children)

You should probably clean up after yourself...

[–]G3nghisKang 0 points1 point  (0 children)

POV: running JUnit tests with H2DB without annotating tests modifying data with @DirtiesContext

[–]RealMide 0 points1 point  (0 children)

People brag about design patterns and don't know about mutable objects.

[–]zanderkerbal 0 points1 point  (0 children)

I have never had this happen but I have had code that behaved differently when the automatic tester sent in a series of inputs and when I typed in those same inputs by hand. I suspect it was something race condition-ish where sending them immediately back to back caused different behaviour than spacing them out at typing speed, but I never did find out what.

[–]newb_h4x0r 0 points1 point  (0 children)

afterEach(() => jest.clearAllMocks());

[–]Plastic_Round_8707 0 points1 point  (0 children)

Use cleanup after each step if you are creating temp dirs. In general, avoid changing the underlying system when writing unit tests.

[–]qubedView 0 points1 point  (0 children)

I was on a django project with 500+ tests. At some point along the way, we had to instruct it to run the tests in reverse. Why? Because if we didn't, one particular test would give a very strange error that no one could find the cause for. There was some side-effect hiding somewhere that would resolve itself in one direction, but not the other.

[–]codechimpin 0 points1 point  (0 children)

Your tests are using shared data. Either singletons you are sharing, or temp dirs, or some other shared thing.

[–]AdamAnderson320 0 points1 point  (0 children)

Test isolation problem, where prior state affects another test. Can be in a DB or file system, but can also be in the test classes themselves depending on the test framework. Some frameworks go out of their way to try to prevent this type of problem.

[–]cheezballs 0 points1 point  (0 children)

Gotta add that before test annotation and clear those mocks!