What’s funny is that when we are designing processors, we don’t use *any* of these benchmarks when making our design decisions. I always find it amusing how people obsess about these things.
I would imagine you especially don’t run them in a vm.
Out of curiosity, how did you judge the capabilities of a processor during design? Do you run SPEC? Or not even that - purely in house stuff?