I find it depends on the exercise I did.
Score from running was 7 points higher than outdoor walk. Although perhaps it was incorrect or still calibrating. I think this is still learning, perhaps. And now that apple felt comfortable enough to roll it out officially enough to drive notifications, it may be more accurate.
We’ll see what happens as I keep feeding it data.