I think the software integration is very cool and Apple nailed it but I don't like the hardware aspect! I just hate the fact you have to put it on to your head. Keep in mind this headset is also heavy. So, don’t expect to be using it at all times.
VisionOS is the product, the VisionPro is just the hardware needed to make the product work at the moment. Which is pretty normal for Apple, right? The original Mac was not sold on how amazing 128k of RAM or a 9" monochrome display was, but how it was just enough hardware to get the software working. Same for the iPhone, the software/interface was the selling point, the hardware just made it possible.
iPhone sold barely 1mm units in its first 6 months, and barely 10mm in its first full year (including the much improved 3G model for much of that time), but after 5 years was selling over 100mm units per year. OVER 99% of 2012 iPhone buyers did not find the 2007 iPhone compelling enough to buy. But a huge part of why they did find the 2012 iPhone compelling was developments in hardware, software, and awareness of the product's uses made possible by selling that first clunky little iPhone. It's easy to forget now that for everyday people the smartphone revolution started in 2009/2010, not 2007. It took almost 3 years for real momentum to build behind the iPhone/Andriod duopoly. This puts the rumored ~2027 timeline for a more glasses-like true AR product right in line with the smartphone adoption hockey stick curve moment.
Personally, my three biggest concerns are: trying to make myself wait for a better model before buying (losing battle, I'm ordering day one, ugh); if it's a whole new computing paradigm, will it be locked down and restricted hardware and/or software wise like an iPhone/iPad or be allowed to breathe on its own like a Mac; and how are they going to solve the FaceTime problem.
First one is purely a personal problem, I can't resist buying the new cool thing, even when I know it's not ready to become a daily driver and I'd be happier waiting for version 3 or 4 after the novelty wears off.
Second one is purely up to apple, to really let this thing take off to full potential it should be much more like a Mac than iPad in terms of user/developer openness. But Tim Cook's apple is almost certain to shoot themsleves in the foot by locking the thing down rather than trusting their products to sell on merit, even if other 3rd software or features exist. This also goes for hardware -- if VisionOS is going to really take off for things like productiuvty, it needs a lot more than just Bluetooth and Wifi for I/O. My Macs regularly have devices plugged into every port, many of whuch are hubs and adapters to reach even more devices. I'd love to use a VisionPro or successor hardware to edit video or photos, but if the only way to get data to the device is the cloud, content creation is DOA. And even using it as an extension of your mac is only good enough for visual work if they can connect via TB. Wireless display streaming to work with RAW photos or 4k60+ ProRes or RAW video is just not going to be good enough. Personally, I feel the battery pack is too small and limited. Not by the two hour runtime (which is short, but not egregious for a heavy headset imo), but by the lack of IO. The connector should be handling at least an 8x PCIe/2 thunderbolt port datalink, and the battery pack should have at least 2 TB ports, ideally with an additional USB-C or magsafe just for power. And we should be able to buy more capable batteries as well, some with 10G ethernet and HDMI/Displayport in, or card readers, or built in expanded storage, or longer runtimes.
Third one seems like a physics problem. Even when some sort of "glasses frame" rather than "ski goggles" version launches, how do you record an actual video of yourself to use for facetime? A floating virtual bust is not it, and never will be, for FaceTime to be truly better in VisionOS than iOS, we need actual footage of the person we're talking to, ideally with the abilty to capture their full body, to place them next to you on the couch, or across from you at the table while you share a meal or film with someone you can't actaully be with in that moment. I don't know how to solve this-- ideas such as a super wide angle cameras and very smart 3d reconstruction or a camera drone/pylon you can place in front of you are far from ideal, but it's a problem someone must solve if Facetime in VisionOS is going to take off.