Comment by whynotmaybe

1 day ago

Because it should be `today + 1 year + randomInt(1,42) days`.

Always include some randomness in test values.

34 comments

whynotmaybe

Not a good idea for CI tests. It will just make things flaky and gum up your PR/release process. Randomness or any form of nondeterminism should be in a different set of fuzzing tests (if you must use an RNG, a deterministic one is fine for CI).

dathinab 20 hours ago
if it makes thing flaky
then it actually is a huge success
because it found a bug you overlooked in both impl. and tests
at least iff we speak about unit tests
- jstanley 19 hours ago
  
  Only if it becomes obvious why it is flaky. If it's just sometimes broken but really hard to reproduce then it just gets piled on to the background level of flakiness and never gets fixed.
  
  2 replies →
- jamesfinlayson 8 hours ago
  
  I remember having a flaky test with random number generation a few years ago - it failed very rarely (like once every few weeks) and when I finally got to fixing it, it was an actual issue (an off by one error).
- yxhuvud 17 hours ago
  
  This will often break on stuff like daylight saving changes, while almost as often you don't give a rats ass about the boundary behaviour.
- tremon 18 hours ago
  
  Burma-shave
whynotmaybe 21 hours ago

That's why it's "randomInt(1,42)", not "randomLong()".

zelos 20 hours ago

Generate fuzz tests using random values with a fixed seed, sure, but using random values in tests that run on CI seems like a recipe for hard-to-reproduce flaky builds unless you have really good logging.

CoastalCoder 21 hours ago

> Always include some randomness in test values.

If this isn't a joke, I'd be very interested in the reasoning behind that statement, and whether or not there are some qualifications on when it applies.

j1mr10rd4n 12 hours ago

There's another good reason that hasn't been detailed in the comments so far: expressing intent.
A test should communicate its reason for testing the subject, and when an input is generated or random, it clearly communicates that this test doesn't care about the specific _value_ of that input, it's focussed on something else.
This has other beneficial effects on test suites, especially as they change over the lifetime of their subjects:
* keeping test data isolated, avoiding coupling across tests * avoiding magic strings * and as mentioned in this thread, any "flakiness" is probably a signal of an edge-case that should be handled deterministically and * it's more fun [1]
[1] https://arxiv.org/pdf/2312.01680
dathinab 21 hours ago
humans are very good at overlooking edge cases, off by one errors etc.
so if you generate test data randomly you have a higher chance of "accidentally" running into overlooked edge cases
you could say there is a "adding more random -> cost" ladder like
- no randomness, no cost, nothing gained
- a bit of randomness, very small cost, very rarely beneficial (<- doable in unit tests)
- (limited) prop testing, high cost (test runs multiple times with many random values), decent chance to find incorrect edge cases (<- can be barely doable in unit tests, if limited enough, often feature gates as too expensive)
- (full) prop testing/fuzzing, very very high cost, very high chance incorrect edge cases are found IFF the domain isn't too large (<- a full test run might need days to complete)
- ssdspoimdsjvv 21 hours ago
  
  I've learnt that if a test only fails sometimes, it can take a long time for somebody to actually investigate the cause,in the meantime it's written off as just another flaky test. If there really is a bug, it will probably surface sooner in production than it gets fixed.
  
  3 replies →
- SkyBelow 19 hours ago
  
  Can't one get randomness and determinism at the same time? Randomly generate the data, but do so when building the test, not when running the test. This way something that fails will consistently fail, but you also have better chances of finding the missed edge cases that humans would overlook. Seeded randomness might also be great, as it is far cleaner to generate and expand/update/redo, but still deterministic when it comes time to debug an issue.
  
  1 reply →
whynotmaybe 21 hours ago
Must be some Mandela effect about some TDD documentation I read a long time ago.
If you test math_add(1,2) and it returns 3, you don't know if the code does `return 3` or `return x+y`.
It seems I might need to revise my view.
- Izkata 21 hours ago
  
  I vaguely remember the same advice, it's pretty old. How you use the randomness is test specific, for example in math_add() it'd be something like:
  jitter = random(5) assertEqual(3 + jitter, math_add(1, 2 + jitter))
  If it was math_multiply(), then adding the jitter would fail - that would have to be multiplied in.
  Nowadays I think this would be done with fuzzing/constraint tests, where you define "this relation must hold true" in a more structured way so the framework can choose random values, test more at once, and give better failure messages.
  
  2 replies →
- ajs1998 21 hours ago
  
  Randomness is useful if you expect your code to do the correct thing with some probability. You test lots of different samples and if they fail more than you expect then you should review the code. You wouldn't test dynamic random samples of add(x, y) because you wouldn't expect it to always return 3, but in this case it wouldn't hurt.
- brewmarche 18 hours ago
  
  This sounds like the idea behind mutation testing

andai 1 day ago

Interesting, haven't heard this before (I don't know much about testing). Is this kind of like fuzzing?

whynotmaybe 1 day ago

I recently had race condition that made tests randomly fail because one test created "data_1" and another test also created "data_1".
- Test 1 -> set data_1 with value 1
- Test 1 -> `do some magic`
- Test 1 -> assert value 1 + magic = expected value
- Test 2 -> set data_1 with value 2
But this can fail if `do some magic` is slow and Test 2 starts before Test 1 asserts.
So I can either stop parallelism, but in real life parallelism exists, or ensure that each test as random id, just like it would happen in real life.

devin 1 day ago

Are you joking? This is the kind of thing that leads to flaky tests. I was always counseled against the use of randomness in my tests, unless we're talking generative testing like quickcheck.

dathinab 20 hours ago
or, maybe, there is something hugely wrong with your code, review pipeline or tests if adding randomness to unit test values makes your tests flaky and this is a good way to find it
- devin 19 hours ago
  
  or, maybe, it signals insufficient thought about the boundary conditions that should or shouldn't trigger test failures.
  doing random things to hopefully get a failure is fine if there's an actual purpose to it, but putting random values all over the place in the hopes it reveals a problem in your CI pipeline or something seems like a real weak reason to do it.
  
  1 reply →
whynotmaybe 1 day ago
`today` is random.
- InsideOutSanta 19 hours ago
  
  If "today" were random, our universe would be pretty fricken weird.
- Eldt 21 hours ago
  
  It's dynamic, but it certainly isn't random, considering it follows a consistent sequence