I think it's likely that that camera bump will have the same triangular camera configuration, but the camera bump will be larger and the camera lenses will be flush against the surface of the camera bump. We could see the camera configuration akin to the Pixel 9 Pro if Apple figures out how to use Apple Intelligence to do spatial image recording with that type of camera bump.
The triangle positions all 3 cameras closer than average eye width. So if improved spatial is important, getting them spread out a bit more seems key to that. One way would be to go the Pixel Pro way, abandon the triangle and spread them out across the back in a row. Anyone who wears glasses can best "see" this by laying the existing phone face down and placing the center of one glasses lens on one of the camera lenses. Now look at where the center of the other glasses lens is vs. the other camera lenses.
Imagine the pixel "straight line" distribution across the entire back. Apply the same concept and it's easy to see that there would be room to get the 2 cameras for spatial spread out just fine like that.
Why didn't Apple already go this way? Because the triangle predates shooting spatial with iDevices? So why would they switch now? Because they may be looking for other configurations to improve spatial capture. That obviously needs the lenses spread out more and a bigger triangle does not seem like the best option.
But, but, but... how would we shoot landscape video? My guess is by holding the camera as most do now when they shoot video: in PORTRAIT orientation. Those of us accustomed to rotating our iDevices for landscape would have to adjust to holding it like seemingly most other people in PORTRAIT. Software could then ask someone wanting to record video how they want it to appear on the screen where they want to watch it. If they want portrait, it would advise the user to rotate the phone to capture the skinny strip portrait view. And if they want landscape, they just hold it like most do now. In other words, landscape capture would become the default instead of portrait.
For on iDevice playback, playing back portrait captures would fill the device screen held in portrait and landscape captures would play sideways, so user would obviously know to rotate the phone to watch a landscape video.
Bonus, the Pixel layout would finally kill the wobble.
Else, a straight line DOWN the side of the iDevice would preserve the wobble and get a third lens for spatial near the middle of an iPhone length. So IMO, across the top makes sense, kills the wobble and those of us who know how to capture landscape video would need to adapt
our approach vs. hoping that the bulk of the world finally catches on instead of shooting everything portrait and then griping about the huge black blocks left & right when played back on TVs or monitors... but never seeming to learn from that the next time they shoot new video.