Here's one of my recent experiences, and also a theory.
I was listening very carefully to Puccini's opera Turandot - specifically, the London/Decca CD with Pavarotti, Sutherland, Caballe et al. I was comparing my standalone Classe CDP-.5 connected via balanced analog interconnects to the same unit used as a transport feeding a Classe SSP-75's internal DACs. The power amp is a Classe CA-200 with ML Odyssey speakers.
I wrote in my notes that the SSP in particular seemed to get the image wrong; in fact it seemed to be backward, with the orchestra in front of the singers. (The CDP's internal DACs yielded a much "flatter" image front-to-back.) The vertical image seemed goofed up too, since the singers appeared to be higher than the instruments traditionally found at the back of the orchestra (e.g., double bass, percussion).
The penny dropped on the famous "Non piangere, Liu"*, when I noticed that Liu's (Caballe, I think) sonic image was distinctly lower than that of Calaf (Pavarotti). Of course! Liu is begging. I've never seen the opera myself, but it makes sense that she would have a lower vertical image position on this track, because she would be on her knees. Then it hit me that this is an opera, dummy, not a choral performance - so the orchestra is supposed to be in the pit, DOWN in the front, and the singers are UP on stage, behind the orchestra.
The opera regulars are probably screaming "DOH!!!" but I'm new to opera. I had fixed in my mind a picture of a performance of an oratorio such as Handel's Messiah, in which the chorus is usually arrayed behind the orchestra with the soloists in front.
In the end, it was the vertical image information that gave me the clues necessary to understand the recording despite an almost certainly erroneous initial mental picture. Moreover, a pretty decent CDP wasn't sufficient to reveal it - it took the better DACs in the SSP-75 to hear it at all.
A week later I was able to listen to the same performance on LP, using a vastly more expensive system [SME 20 / Graham 2.2 / Koetsu Rosewood? / Aesthetix IO / Aesthetix Callisto / Wavestream Kinetics / Avalon Eidolon Diamonds]. It was very easy to identify the image, although I wonder how much of that is due to the clearly superior capabilities of that system, how much is due to the LP vs CD encoding, and how much is due simply to the fact that now I had recognized what the reality almost certainly was. My guess is that all three are in play.
I'm pretty certain that the imaging is both correct and reproducible in all dimensions in this case, although I can't prove it unless I see a video or photograph of that performance.
-----
After thinking about it some more, I think I have a theory. Although we're hearing "only" two channels (the above is definitely a stereo recording), they carry more than two point sources' worth of information. In this case, I think what is happening is that the various reflections from the hall were accurately captured by the mikes. Perhaps there is some way in which we correlate reflections arriving at different times at each ear with vertical (and other) position information. However we psychoacoustically reassemble them, the better systems were in this case clearly able to convey the image more precisely as it was recorded.
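To put a rough number on the reflection idea, here is a back-of-envelope sketch in Python. It is only an illustration under assumptions I made up: a single flat, perfectly reflecting stage floor, one microphone, and invented heights and distances (nothing here is measured from the actual recording or hall). It uses the standard "mirror image" trick: the floor bounce travels the same distance as a straight line from a mirrored source below the floor. The function name reflection_delay_ms and all parameter names are hypothetical.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # approximate speed of sound in air at about 20 C


def reflection_delay_ms(source_height_m, mic_height_m, distance_m,
                        speed=SPEED_OF_SOUND_M_S):
    """Extra arrival time (ms) of a single floor reflection vs. the direct sound.

    Assumes a flat, perfectly reflecting floor (mirror-image model).
    All geometry is hypothetical and for illustration only.
    """
    # Direct path: straight line from source to microphone.
    direct = math.hypot(distance_m, source_height_m - mic_height_m)
    # Reflected path: straight line from the mirrored source (below the floor).
    reflected = math.hypot(distance_m, source_height_m + mic_height_m)
    return (reflected - direct) / speed * 1000.0


if __name__ == "__main__":
    # Made-up numbers: mic 3 m above the stage floor, 10 m from the singers.
    for label, mouth_height in [("kneeling singer", 0.8), ("standing singer", 1.7)]:
        delay = reflection_delay_ms(mouth_height, 3.0, 10.0)
        print(f"{label}: floor reflection lags direct sound by {delay:.2f} ms")
```

In this toy model the floor reflection lags the direct sound by a different amount depending on how high the source is, so the height information is at least present in the signal the mikes pick up. Whether our hearing actually uses it this way is the part I'm guessing at.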
* apologies for the massacre of Italian spelling!