You keep describing architecture, which is exactly what I am saying.I really don’t follow you here. If I compile the same code on two machine using the same toolkit and one machine does it twice as fast, I say that this machine has twice the performance. If I run a machine learning algorithm to categorize images on two machines and one does it twice as fast, I say this machine has twice the performance. The list continues. Which machine consumes more power is secondary dir this question, I care about how fast can it run things.
These benchmarks take some common tasks (like compiling code, categorizing images, compression etc.) and see which CPU can run them faster. The higher the score, the faster the CPU.
Incidentally, Apple CPUs can do the same amount as fast as faster while consuming less energy than Intel CPUs. This is what makes them more powerful according to all definitions of “powerful” I am aware of (unless you mean to say that Intel CPU is more powerful because it wastes more power).
Clock speed and wattage is irrelevant. Apple CPUs have more execution units than Intel CPUs. They can do more work with lower clock speed. Which in turn gives them lower power consumption.
The misinterpretation of Geekbench is when people look at a 5W CPU running at 1.9Ghz and think it's as fast as a 105W CPU running at 5.3GHz with similar architecture efficiency because they get similar Geekbench scores. Anyone who understands physics knows why that is literally impossible, and why your iPhone isn't faster than a 105W Desktop grade CPU or similar.