Easy. Sending audio over WiFi to separate speakers is not as easy as it sounds. You need to make sure that both (if you have a stereo pair) get their audio at exactly the same time. Even the slightest delay in unacceptable.
Note that by delay I’m not talking about large obvious delays like the sound not syncing up with a movie where their lips are moving and their voice is completely out of whack. I’m talking about tiny millisecond delays. Our ears localize sound based on the volume and phase (time delay) that sound reaches our ears. If the signal is out of phase slightly (because of a slight time delay) it can have a significant affect on the sound.
I’m really very curious how Apple is going to deal with this.
I get what you're saying. Doing this correctly is very technical.
The point is, they shouldn't have announced it and they certainly shouldn't have started selling a $350 speaker that is reliant on Airplay 2 for a number of its features.
The length of time it takes to complete this is dependent on the complexity of the project, the number of people working on it, the number of people testing it, the amount of time committed to the project, and I'm sure some other variables that I'm not thinking of. The point is, the consumer doesn't know the answer to any of these variables, so when Apple tells us something is coming and starts selling a product reliant on that feature coming, we need to trust that they've taken these variables into account and are giving us reasonable information. As someone looking to integrate my HomePod into my Sonos system, its disappointing to see how this looks to be shaking out.