Comment by hedgehog

1 month ago

Once you start looking at the world through the lens of frequency domain a lot of neat tricks become simple. I have some demo code that runs a Fourier transform on webcam video to read a heart rate off a person's face, basically by looking for the frequency that holds the peak energy.
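For anyone curious what "looking for the frequency that holds the peak energy" might look like, here is a minimal sketch. It is not the demo code mentioned above. It assumes the webcam frames have already been reduced to one brightness value per frame, and it substitutes a synthetic signal for real video. The function name, the 0.7 to 4.0 Hz search band, and the signal parameters are all made up for illustration.

```python
import cmath
import math


def dominant_frequency_hz(samples, sample_rate_hz, min_hz=0.7, max_hz=4.0):
    """Return the frequency (Hz) holding the most energy inside [min_hz, max_hz]."""
    n = len(samples)
    mean = sum(samples) / n
    centered = [s - mean for s in samples]  # remove the constant (DC) offset

    best_hz, best_power = 0.0, -1.0
    for k in range(1, n // 2 + 1):
        freq_hz = k * sample_rate_hz / n
        if not (min_hz <= freq_hz <= max_hz):
            continue  # skip bins outside the plausible heart-rate band
        # k-th DFT coefficient computed directly; an FFT gives the same values faster
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(centered))
        power = abs(coeff) ** 2
        if power > best_power:
            best_hz, best_power = freq_hz, power
    return best_hz


# Synthetic stand-in for per-frame face brightness (assumed, not real data):
# a small 1.2 Hz pulse on top of a larger, slow 0.1 Hz lighting drift,
# sampled for 10 seconds at 30 frames per second.
fps = 30.0
signal = [
    100.0
    + 0.5 * math.sin(2 * math.pi * 1.2 * (i / fps))
    + 2.0 * math.sin(2 * math.pi * 0.1 * (i / fps))
    for i in range(300)
]
bpm = 60.0 * dominant_frequency_hz(signal, fps)
print(round(bpm, 1))
```

Restricting the search to a band is what keeps the slow drift from winning, even though it is four times larger than the pulse. Real footage would also need face tracking and noise handling, which this leaves out.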

It's effectively the underpinning of most modern lossy compression algorithms. The DCT, which underlies codecs like JPEG, H.264, and MP3, is a close relative of the DFT (the transform that the FFT computes).
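One way to see how the DCT and the DFT are related: the DCT-II of a block equals, up to a phase factor and a scale of 1/2, the DFT of that block followed by its mirror image. The sketch below checks this numerically with a plain O(n^2) DFT rather than a real FFT. The function names and sample values are made up for illustration.

```python
import cmath
import math


def dct2_direct(x):
    """Unnormalized DCT-II straight from the definition."""
    n = len(x)
    return [
        sum(x[i] * math.cos(math.pi / n * (i + 0.5) * k) for i in range(n))
        for k in range(n)
    ]


def dft(x):
    """Plain DFT; an FFT computes exactly these values, only faster."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def dct2_via_dft(x):
    """DCT-II obtained from the DFT of the signal extended by its mirror image."""
    n = len(x)
    mirrored = list(x) + list(reversed(x))  # length 2n, even-symmetric extension
    spectrum = dft(mirrored)
    # Undo the half-sample phase shift and the factor of 2 from the mirroring.
    return [
        0.5 * (cmath.exp(-1j * math.pi * k / (2 * n)) * spectrum[k]).real
        for k in range(n)
    ]


samples = [52.0, 55.0, 61.0, 66.0, 70.0, 61.0, 64.0, 73.0]  # arbitrary 8-sample block
direct = dct2_direct(samples)
via_dft = dct2_via_dft(samples)
max_diff = max(abs(a - b) for a, b in zip(direct, via_dft))
print(max_diff)  # should be at floating-point rounding level
```

The mirroring is the useful part for compression: it removes the jump at the block edge that a plain DFT would treat as high-frequency content.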

  • Inter/intra-prediction is more important than the DCT. H.264 and later codecs use simpler integer approximations of it, because that's good enough and it lets them specify the transform bit-exactly.
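To illustrate the bit-exactness point, here is a rough sketch of the commonly cited 4x4 forward core transform from H.264. It uses only small integers, so every implementation gets identical results. The scaling and quantization steps that normally follow are left out, and the helper names are made up for illustration.

```python
# Commonly cited 4x4 forward core transform matrix for H.264.
# Scaling and quantization are deliberately omitted in this sketch.
CF = [
    [1,  1,  1,  1],
    [2,  1, -1, -2],
    [1, -1, -1,  1],
    [1, -2,  2, -1],
]


def matmul(a, b):
    """Multiply two 4x4 integer matrices."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
        for i in range(4)
    ]


def transpose(m):
    return [list(row) for row in zip(*m)]


def core_transform_4x4(block):
    """Compute CF * block * CF^T using integer arithmetic only."""
    return matmul(matmul(CF, block), transpose(CF))


# A flat block has no detail, so all of its energy lands in the top-left (DC) term.
flat = [[10] * 4 for _ in range(4)]
coeffs = core_transform_4x4(flat)
for row in coeffs:
    print(row)
```

Since nothing here touches floating point, there is no rounding drift between encoder and decoder, which is the property a floating-point DCT cannot guarantee across implementations.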

>Once you start looking at the world through the lens of frequency domain a lot of neat tricks become simple.

Not the first time I've heard this on HN. I remember a user commenting once that it was one of the few perspective shifts in his life that completely turned things upside down professionally.

There is also a loose analogy with finance: act (trade) when prices cross a certain threshold, not after a specific time.

I don't think pulsing skin (due to blood flow) is visible from a webcam though.