It’s a bit more complicated than that though. First, the GPUs in the recent years experienced a massive power inflation. We went from a 200 watts for the ultimate high end GPU to 400-450 watts. Second, the peak FLOPs also got inflated. For example, new AMD GPUs have doubled the amount of compute units, doubling the theoretical performance. But achieving that performance in practice became much more difficult as there are limitations which operations can execute simultaneously.
Having 10 TFLOps GPU in a compact laptop (M1 Max) is not too bad. So no, I can’t agree that the situation got worse over time. Desktop, yes, to a certain degree, but not mobile.
P.S. And no, you are not getting 200 TFLOPs of compute in a sub 10k machine.