AVS › AVS Forum › Blu-ray & HD DVD › HDTV Software Media Discussion › Advanced topics in HD audio
New Posts  All Forums:Forum Nav:

Advanced topics in HD audio - Page 4

post #91 of 461
Thread Starter 
Quote:
Originally Posted by DJ79 View Post

Amir, could you please explain what sibilance is, what causes it, and what can be done about it? Thanks.

Sibilane is when the "S" sound is overemphasized. Indeed, the device/algorithm/cirucuit that fixes it after the fact is called "De-essing." Unfortunately, causes are many. It can the microphone (type, distance from signer, etc.), compression (as in loudness processing), very high ratio audio compression (as in codec), etc. It can also be caused by someone putting too much emphasize over that letter when talking. It is a pesky problem to solve at times.

I have only had to deal with it in low bit rate codec processing. I am sure the others having to deal with it in the mixing studio can add more insight than I can.
post #92 of 461
Thread Starter 
Quote:
Originally Posted by Milt99 View Post

Finally my question: Is there a learning curve involved for studio sound people with the advent of higher resolution soundtracks?
Are there subtleties in sound that formerly weren't captured with Dolby Digital\\DTS that are now being revealed with the higher quality codecs and consumer equipment?
I hope my question is not too OT from the technical bent of the thread.

To the extent movie sound is created in uncompressed domain and in a studio with expensive audio systems, then I don't think there is anything special required for delivery in higher fidelity codecs. I guess you could say that people could be a bit more demanding now that they can examine such audio in more detail at home than they could in theater but I am not sure that is a signficant enough factor.

Quote:


Thanks Amir for starting this great thread and to others for contributing.

My pleasure. .
post #93 of 461
Amir,

You have talked before about jitter, in respect to real-time optical audio playback. I went back and read through some of those posts including all the posts by Shore.. that guy is WAY over my head.

I think I have a grasp of optical disc correlated jitter vs seek error, and why sending compressed audio (i.e. not LPCM) over HDMI to be decoded in the AVR may be the best solution for audio fidelity.

This got me thinking about something you said wrt to buffering. Someone said "Why not just buffer a lot of the optical disc data, like a full second or two, so that any errors or timing problems are handled by the buffering" and your response was that in theory it sounds like a good idea but in practice people don't like their remote control actions delayed.

So my question, why would the RC control data be delayed? A large amount of buffering only really makes sense in the optical disc player, and it knows the milisecond you press stop to do whatever is necessary to stop playback, regardless of what is in the buffers, no? Now one could argue that a second or two of buffering might cause a delay with smooth ffwd/rwd or skip ffwd/rwd, but those actions wouldn't need buffering since you aren't concerned about fidelity at 4x playback speed.
post #94 of 461
Quote:
Originally Posted by efxmaster View Post

Amir,

Besides I am hoping DXD will be more prevalent and used more in place of DSD.

You can download some sample DXD originated files from:
http://www.2l.no/hires/index.html
davidcw8
post #95 of 461
Interestning thread Amir, thanks for the information.

My question and thought about this:

In your opionion, is the new lossless audio format not longer the "limitation" of sound quality, if one could think that the legacy ones (DD and DTS) where sort of a bottle neck and the hardware was capable of more is certain ways?
I know there is MANY limitations before the "holy grail" of surround sound is reached (room acoustics, powerful amps, speakers) but if you are free to speak a bit around it?

An exampel, would you rather listend to DD with a Meridian preamp instead of a TrueHD version with a Average Joe $1500 receiver, with everything else equal? Imagine the other equipment and room to be just an inch from perfect.

The thing with D/A-converters beeing not true 20 or 24 bits in "cheap" modern electronics gets me thinking a bit. Do you have any general tips, if getting a ML D/A for $10000 is out of the question...?

Thanks

/Chuck
post #96 of 461
Thread Starter 
Quote:
Originally Posted by hellokeith View Post

Amir,

You have talked before about jitter, in respect to real-time optical audio playback. I went back and read through some of those posts including all the posts by Shore.. that guy is WAY over my head.

Ok, let me explain it one more time as to get others caught up with what is buried in other thread.

We all know that the goal of our systems is to covert the analog signal to digital, process it, transport it, and then convert it back to analog. The first stage is accomplished by analog to digital converted (A/D) and the latter with digital to analog converter (D/A).

The A/D samples the analog input at regular intervals as dictated by the sampling rate (which is usually 48Khz in video). The math says that we can 100% replicate the analog signal as long as we try to only preserve half the bandwidth as represented by the sampling rate (which is 48/2 = 24 Khz in our example). The math however, assumes perfect timing at both end. If the intervals were exactly 0.00001112 seconds at input (I am making up this number), then the D/A converter better convert that digital sample at exactly 0.00001112 not 0.00001113 or 0.00001111.

What happens unfortunately in real life is that there is no way to achieve 100% accuracy. Let's take the S/PDIF or Toslink digital connection. Both of these are serial interfaces meaning there is only a single wire carrying the pulses representing the source digital stream. Buried in the same signal is a clock which determines the timing mentioned above. A circuit called PLL (phased locked loop) is used to extract this clock. Which tells us when we should look at each sample on the wire. Yes, the samples are digital on this wire. But their timing is not! The time is captured by the receiver based on the PLL accuracy which itself is dependent on quality of the PLL and signal quality on the wire. If you use a cheap and long capable, the digital pulses lose fidelity and the timing detected on the receiver will then change.

Now if the timing just changes by a fixed delay, that is OK. The signal will only be delayed but will not become distorted. Unfortunately, what we get is not this, but a signal that changes in timing all the time and sometimes at very high frequency. That is, the arrival time of the input pulses jumps forward and back within a range and at certain speed (frequency). We call this variation jitter' and the characterize it by its frequency and amplitude.

Obviously if the timing variation (amplitude) is 1 billionth of a second and frequency is 1Hz, (i.e. once a second), you are not going to hear any distortion. But at certain frequencies and certain magnitude it becomes audible.

Where is the distortion coming from? The variation in timing means that the input source is getting warped back and forth at every one of those points in time. Imagine taking points on a graph and tugging them left and right with certain pattern (frequency) and amount (amplitude). The curve starts to look different, right? The simplest case maybe is easier to see. Imagine the source being a straight line at 45 degrees. If you take points along it and move them up and down a bit, the line is no longer ruler flat.

In the example above, you can see how we go from a source that was linear (i.e. it simple increased in value over time) to a curve which was no longer flat, or non-linear. Non-linearity, means harmonic distortion. Just like the quantization distortion we talked about. Putting it all together, jitter causes harmonic distortion with the amount dictated by the characteristics of jitter.

Now, let's bring the topic down to optical formats. Here, we are writing a sequence just like the digital interfaces above, except that we are putting them on an optical disc. Same schemes must be used to recover the clock from it, etc. However, there is one big difference. We use these optical discs as data. The input has error correction, and is buffered. Buffered means we read things into memory and then use data out of it. By definition the, what comes out of the buffer is reliable since it is not subject to variations of a serial device. And the error correction makes sure that if some sample is read incorrectly due to jitter, that it is able to correct that mistake. So jitter does not play a role here with respect to fidelity (but is an important characteristic with respect to how reliable your drive is in recovering from borderline signal conditions).

We have a secondary line of defense here. As soon as you compress audio or video, they also become immune to jitter. Reason is that once you compress the signal, it loses any sense of timing it had. Sending Dolby Digital stream for example, over even the worst S/PDIF connection does not suffer one bit from jitter. If you hear it, it is perfect! Why? Because once the source becomes compressed, it becomes like computer data. The destination only needs to extract their digital values. It does not matter when they came because we are not trying to play them right then and then. Instead, we just need the correct sequence of bytes to feed the decoder for Dolby Digital so that it can recreate the PCM values its encoder means to be reproduced by the DD encoder.

Here is another way to look at it. Take TrueHD. We know that on the average it compresses the source by a factor near 3:1. So for every three PCM samples, you get one TrueHD sample. Therefore, it goes without saying that if I transmit such a signal from one side or the other, I am in essence sending three bytes for every one byte of the source. In other words, it is working faster than real-time. Because of this, the receiver must have a buffer (memory) where it stores the decoded data as often, it may get ahead of the time when we need to play it. And as soon as we buffer things, then jitter goes out the window as a consideration.

Same is of course true of video compression. All codecs require some amount of buffering to operate.

So let's put what we just learned to use.

If we take the PCM samples stored on CD, we are not dealing with compressed signals so no buffering and no advantage from data transmission. Therefore, everything in the chain becomes a source of (accumulative) jitter. The source could add it, the connection to AVR may add it. And the paths through the AVR may add it. Issues here have made some people think the same applies to HD optical but such is not the case as I explained above and more below.

If you decode these codecs in your player, and convert to analog in your player, then the only place jitter can interfere is from the output of the audio codec to input of D/A converter. This distance being short and proprietary, it can be of extremely high quality and subject to little jitter. Note that it is not a given that this short path is high quality. With video running around plus a bunch of processors inside a single box, there is liable to be jitter and a lot of it. But in theory, this is a rather optimal chain. Unfortunately, this means that you now have to find an analog only path for amplification which not many people have. As soon as you digitize this signal for say, base management in the AVR, all bets are off.

If you decode audio in the player and send it as PCM over HDMI to the receiver, those samples are real-time and therefore, subject to jitter. This jitter is additive to whatever jitter also exists between the HDMI receiver and the DAC. In other words, you have all the problems of above scenario plus jitter induced by HDMI.

If you send out the bitstream of the codec (i.e. prior to decoding in the player) over HDMI then you eliminate jitter on the link since we go back to data transmission, not time sensitive PCM. If jitter is well under control in the AVR and much lower than what HDMI capable contributes, then you get better sound. But the same caution applies as decoding in the player with respect to having a lot of noisy parts in the AVR causing jitter.

Quote:


This got me thinking about something you said wrt to buffering. Someone said "Why not just buffer a lot of the optical disc data, like a full second or two, so that any errors or timing problems are handled by the buffering" and your response was that in theory it sounds like a good idea but in practice people don't like their remote control actions delayed. So my question, why would the RC control data be delayed? A large amount of buffering only really makes sense in the optical disc player, and it knows the milisecond you press stop to do whatever is necessary to stop playback, regardless of what is in the buffers, no? Now one could argue that a second or two of buffering might cause a delay with smooth ffwd/rwd or skip ffwd/rwd, but those actions wouldn't need buffering since you aren't concerned about fidelity at 4x playback speed.

Let me expand on what this kind of buffering is. If you have some memory on the receiver, you can use it to store the incoming samples. Yes, the samples will still come with timing errors but remember, their digital value (i.e. what the PCM number is) is still correct. So the data that is in the memory, is correct. When you decide to output, you use a new, high quality clock that runs at the same sampling rate as the source (e.g. 48Khz). Since you are not relying on input pulse train for your clock, then variations/jitter is removed and you get better sound.

But with buffering, you run into a new problem. There is no way to have a clock that has the same accuracy as the source. Even world's best oscillators (clocks) have accuracy differences and drift with time and temperature among other factors. This means that your receiver clock may run faster or slower than the original clock that was used to encode the analog sound. If you go slower, then your buffer starts to overflow. If the clock in the receiver goes faster than the source, then you soon run out of samples. Then what do you do? The solutions are not good. You wind up using a form of sample rate conversion to eliminate to manufacture new samples! And you better hope a good solution is used to do this as otherwise, you occasionally get anomalies as samples are created or destroyed.

Problem becomes more complicated for video where you now have to store both audio and video. And video takes even more space than audio and unfortunately, it cannot be sample rate converted easily. So you wind up making audio slave to video requiring even more resampling.

OK, assuming you are OK with all the complexity, and cost in the player, you now have a new problem stated in your question. If you buffer too little, then you wind up running dry too many times. If you buffer more, then anytime the user wants to play something, you have to wait to fill the buffer before doing so. If you buffer just quarter of a second, this could cause noticeable delay to user remote control actions to jump forward, etc. Could you tolerated it? Sure. But given the cost and complexity above, I doubt that it will be implemented anytime soon.

Believe it or not, the best solution here is a Home Theater in a Box! It is ironic that such components are budget systems. A high-end solution could run circles around having separate pieces and having to move things through long cables. I just noticed a $3,500 box like this from Arcam with built-in amps and such. If this had HD DVD in it, it would have the potential to outperform any separate device in audio!

Probably a lot more than you ever wanted to know about the topic but here it is .
post #97 of 461
So basically, analog output from the player is still the way to go if you don't require any of the extensive features that new AVRs or pre/pros provide.

Even though I have an HDMI AVR that handles multichannel PCM signals, I am strongly considering buying a quality Blu-ray player and using its analog section. The main reasons are better DACs and minimal jitter.
post #98 of 461
Thread Starter 
Quote:
Originally Posted by MSmith83 View Post

So basically, analog output from the player is the way to go if you don't require any of the extensive features that new AVRs or pre/pros provide. Even though I have an HDMI AVR, I am strongly considering buying a quality Blu-ray player and using its analog section.

Well, maybe . Unless you see professional audio reviews of said player as compared to an AVR over HDMI, you don't really know if it is any better.

Also, if you go this way, I assume you will use base management in the player. If so, better make sure all the features are there and some way of verifying its performance. If the player doesn't have proper signal processing with dither and such for base management, it could screw up the quality there.

Finally, let's keep in mind that not everyone is able to hear jitter. I would characterize it as making 1 to 2% quality difference -- far, far less than differences between speakers. Indeed, I think at most, it makes 2-3X the difference of audio interconnect cables! So use the "wrong" cable above and you may introduce distortion similar to jitter.

And for movies, it is not even a consideration in my book. There is a reason we call this thread "advanced" topics in audio .

I have three BD and a few HD DVD players. Maybe next week I can post some results on their audio quality. Unfortunately, the BD players are first gen so probably not representative of what you might buy but may shed some light here.
post #99 of 461
Quote:
Originally Posted by amirm View Post

Well, maybe . Unless you see professional audio reviews of said player as compared to an AVR over HDMI, you don't really know if it is any better.

Also, if you go this way, I assume you will use base management in the player. If so, better make sure all the features are there and some way of verifying its performance. If the player doesn't have proper signal processing with dither and such for base management, it could screw up the quality there.

If the player is the same brand and has the same DSP chipset as the AVR, as well as the required speaker settings for bass management, time delays, and channel adjustments, then I suppose it would be reasonable to assume the player would provide the better option.

My move would be from sending multichannel PCM to a Denon AVR-4306 to using the analog section of the upcoming Denon DVD-3800BDCI. Assuming the implementation is correct and it has every speaker adjustment I need, it would on the outset be a better solution, correct?
post #100 of 461
Quote:
Originally Posted by amirm View Post

Finally, let's keep in mind that not everyone is able to hear jitter. I would characterize it as making 1 to 2% quality difference -- far, far less than differences between speakers. Indeed, I think at most, it makes 2-3X the difference of audio interconnect cables! So use the "wrong" cable above and you may introduce distortion similar to jitter.

And for movies, it is not even a consideration in my book. There is a reason we call this thread "advanced" topics in audio .

I hear you there. Even though I am someone who does notice some difference in sonic signatures between using two different digital interfaces with my universal disc player (from audio CDs), it may not be attributed to jitter and could be a result of improper testing.

Quote:
Originally Posted by amirm View Post

I have three BD and a few HD DVD players. Maybe next week I can post some results on their audio quality. Unfortunately, the BD players are first gen so probably not representative of what you might buy but may shed some light here.

That would be great.
post #101 of 461
Quote:
Originally Posted by amirm View Post

...We have a secondary line of defense here. As soon as you compress audio or video, they also become immune to jitter. Reason is that once you compress the signal, it loses any sense of timing it had...

I always understood that DD, DTS, TrueHD,... was immune to jitter because the data was in data packets, not a serial stream like LPCM and not because it was compressed. Is this wrong are you just using compressed because all the formats are?
post #102 of 461
Thread Starter 
Quote:
Originally Posted by William View Post

I always understood that DD, DTS, TrueHD,... was immune to jitter because the data was in data packets and not a serial stream and not because it was compressed.

It is one in the same. The compressed bitstream is "data" so it is transmitted as such. The core advantage remains the fact that we are no longer operating in time domain so variations of the same are immaterial.

Note that the data is still transmitted as a "serial stream." The channel is still the channel.
post #103 of 461
Quote:
Originally Posted by William View Post

I always understood that DD, DTS, TrueHD,... was immune to jitter because the data was in data packets, not a serial stream like LPCM and not because it was compressed. Is this wrong are you just using compressed because all the formats are?

Unfortunately what comes after the decoding can induce jitter. Once you've gone from packets to PCM data it needs to come out with high precision to either the DSP section or the D/A convertor depending on the architecture.

Without precision here there's still a fair level of jitter possible.
post #104 of 461
Quote:
Originally Posted by amirm View Post

It is one in the same. The compressed bitstream is "data" so it is transmitted as such. The core advantage remains the fact that we are no longer operating in time domain so variations of the same are immaterial.

Note that the data is still transmitted as a "serial stream." The channel is still the channel.

The input is immune to jitter but the output is not.

Cheers,
post #105 of 461
Quote:
Originally Posted by amirm View Post

OK, assuming you are OK with all the complexity, and cost in the player, you now have a new problem stated in your question. If you buffer too little, then you wind up running dry too many times. If you buffer more, then anytime the user wants to play something, you have to wait to fill the buffer before doing so. If you buffer just quarter of a second, this could cause noticeable delay to user remote control actions to jump forward, etc. Could you tolerated it? Sure. But given the cost and complexity above, I doubt that it will be implemented anytime soon.

I must be missing something here. When I turn the key off in my car, the engine stops. It's not really relevant to me if/how much fuel the fuel injectors already have, because the engine stopped.

So why would the RC control data be tied to / delayed by the amount of video+audio in the buffers?
post #106 of 461
Thread Starter 
Quote:
Originally Posted by hellokeith View Post

I must be missing something here. When I turn the key off in my car, the engine stops. It's not really relevant to me if/how much fuel the fuel injectors already have, because the engine stopped.

So why would the RC control data be tied to / delayed by the amount of video+audio in the buffers?

Well, you need to fill the buffer before playing. If it takes quarter of a second before the buffer fills, that is how much your response time is slowed by that.

By the way, if you have a turbo engine in your car, you need to idle for a while there too .
post #107 of 461
Quote:
Originally Posted by amirm View Post

It is one in the same. The compressed bitstream is "data" so it is transmitted as such. The core advantage remains the fact that we are no longer operating in time domain so variations of the same are immaterial.

Note that the data is still transmitted as a "serial stream." The channel is still the channel.

Then that leads to my next question: How is LPCM handled in the studio? It must be passed and sent through multiple times during a mix. So how is jitter delt with if it is a serial LPCM stream?
post #108 of 461
Quote:
Originally Posted by John Kotches View Post

Unfortunately what comes after the decoding can induce jitter. Once you've gone from packets to PCM data it needs to come out with high precision to either the DSP section or the D/A convertor depending on the architecture.

Without precision here there's still a fair level of jitter possible.

If decoding is done in the processor/receiver can jitter still be introduced in the internal DSP's stages?
post #109 of 461
Thread Starter 
Quote:
Originally Posted by William View Post

Then that leads to my next question: How is LPCM handled in the studio? It must be passed and sent through multiple times during a mix. So how is jitter delt with if it is a serial LPCM stream?

I am not a studio guy so perhaps others with more experience there can chime in.

For now though, in today's world we are talking a digital audio workstation. Content is digitized and then stays in a computer the whole time, immune to any of the issues we are talking about. Also, studio equipment is driven by common reference clock which also heps avoid issues with synchornization and extraction of clock.

To the extent digital serial connection used, it is the higher quality, balanced cables which gets rid of ground loops and things like that.
post #110 of 461
Quote:
Originally Posted by William View Post

If decoding is done in the processor/receiver can jitter still be introduced in the internal DSP's stages?

It doesn't matter where the decoding is done. It still has to come out at a very accurate rate ... or ... you have to reclock going into the DSP and/or DACs.

Best,
post #111 of 461
Thread Starter 
Quote:
Originally Posted by William View Post

If decoding is done in the processor/receiver can jitter still be introduced in the internal DSP's stages?

The DSP is operating in "data" domain so no jitter or timing error can enter in that part of the pipeline. But, when the decoding is done in the DSP, the PCM samples are then fed to the DAC. Fortunately, since we are inside one box, we can use a seperate clock and guarantee better accuracy.

All is not well though. There is no such thing as an ideal clock. The oscilator generating the pulses can still move back and forth in time. We call this "phase error." Things that can cause this are spikes in the power supply (caused by activity in other parts of the processor), poor oscilator design, noise bleeding into clock oscilator, even vibration (don't put your processor on the washing machine ).
post #112 of 461
Quote:
Originally Posted by amirm View Post

Ok, let me explain it one more time as to get others caught up with what is buried in other thread.
...............
Probably a lot more than you ever wanted to know about the topic but here it is .

Amir, great explanation of jitter. Thanks for the effort, it was what I've been looking forward to hearing for a long time. Jitter has been a labour of love of mine for some time, and what you describe seems consistent with my understanding, which for say CD, can be summed up quite simply:
  • Digital audio essentially contains two data streams - amplitude information (data), and timing information (clock).
  • Both information streams must remain intact to reproduce the original signal accurately.
  • Reproducing digital audio is like plotting a graph - both the vertical (amplitude) and horizontal (timing) co-ordinates have to be accurate.
  • Amplitude info is encoded with 1s & 0s, and is entirely digital, BUT timing info is not encoded digitally - and is analogue!
  • Jitter is an undesirable attribute of TIMING information, and has nothing to do with AMPLITUDE info.
  • Jitter depends on the digital audio replay architecture - where the timing info comes from (the master clock), where it goes to (the DAC), and how it gets there (connections and buffers, etc).
  • Compressed audio cannot carry timing info; only amplitude (data), so the timing info has to be generated separately (but it still has to be generated.)
  • Finally, jitter has a significant impact on digital audio, and maintaining the integrity of the timing info may be MORE important than that of the amplitude info, which is generally reliable in the digital domain.
  • Therefore, we need to worry more about the architecture of the timing info more than that of the data, which may take different paths.
From your logic, the conclusion of this is that compressed audio does not carry jitter from the player to the amp. I want to pick up on this and take it forwards, because I don't think an audio-only approach will still work with high definition media.

New HDM-based digital audio reproduction is generally dependant on HDMI interconnection, which I think have a significant bearing on how jitter plays out. HDMI carries timing info not with the data (like SPDIF) but in a physically separate clock channel. This is a video clock, and the audio is derived from this in the amp. This means that even PCM audio data carried over HDMI does not carry jitter - it comes in the clock.

I'm less clear about how timing is handled with compressed data, but I presume it will be the same as for PCM - the amp takes the video clock and decimates it by a dynamic rational factor that is controlled by the player. the data that is carried by the data channels remains just that - data, and is unaffected by the timing info architecture.

If it happens to be compressed for transmission, then so what? It is decompressed by the appropriate Dolby or DTS decoder in the amp, and is sent to the DAC as PCM, where it re-joins the timing info. Whether it is compressed or not during the transmission may be immaterial, as the data is the same when it gets to the DAC either way. And the important thing is that the timing info gets there the same way, too, and that's what affects the jitter.

At least that is my presumption, and I would be really grateful if anyone could confirm this.

regards, Nick
post #113 of 461
Thread Starter 
Quote:
Originally Posted by welwynnick View Post

...you describe seems consistent with my understanding, which for say CD, can be summed up quite simply:
  • Digital audio essentially contains two data streams - amplitude information (data), and timing information (clock).
  • Both information streams must remain intact to reproduce the original signal accurately.
  • Reproducing digital audio is like plotting a graph - both the vertical (amplitude) and horizontal (timing) co-ordinates have to be accurate.
  • Amplitude info is encoded with 1s & 0s, and is entirely digital, BUT timing info is not encoded digitally - and is analogue!
  • Jitter is an undesirable attribute of TIMING information, and has nothing to do with AMPLITUDE info.
  • Jitter depends on the digital audio replay architecture - where the timing info comes from (the master clock), where it goes to (the DAC), and how it gets there (connections and buffers, etc).
  • Compressed audio cannot carry timing info; only amplitude (data), so the timing info has to be generated separately (but it still has to be generated.)
  • Finally, jitter has a significant impact on digital audio, and maintaining the integrity of the timing info may be MORE important than that of the amplitude info, which is generally reliable in the digital domain.
  • Therefore, we need to worry more about the architecture of the timing info more than that of the data, which may take different paths.
From your logic, the conclusion of this is that compressed audio does not carry jitter from the player to the amp. I want to pick up on this and take it forwards, because I don't think an audio-only approach will still work with high definition media.

Wow! Most excellent summary of what I described. Sometime I feel like I need to go back to school and learn to summarize better .

Quote:


New HDM-based digital audio reproduction is generally dependant on HDMI interconnection, which I think have a significant bearing on how jitter plays out. HDMI carries timing info not with the data (like SPDIF) but in a physically separate clock channel. This is a video clock, and the audio is derived from this in the amp. This means that even PCM audio data carried over HDMI does not carry jitter - it comes in the clock.

That's correct. Audio is slave to video in HDMI.

Quote:


I'm less clear about how timing is handled with compressed data, but I presume it will be the same as for PCM - the amp takes the video clock and decimates it by a dynamic rational factor that is controlled by the player. the data that is carried by the data channels remains just that - data, and is unaffected by the timing info architecture.

You know I have not looked at the detailed spec either. But I assume that if you send compressed data, then it is assumed that each packet goes with the individual video frame. The processor then, is responsible to sync'ing these two together.

If so, the above pushes the responsibility for audio sync from encoding/player to decoder. Given the "freewheelig" clock in the processor driving audio, I would think that it could run too fast or two slow, requiring interpolation to stay sync'ed to video.

I just got the new Onkyo and will do some tests of AVR based decoding. But has anyone else played with this and notice different sync issues than player based decoding?


Quote:


If it happens to be compressed for transmission, then so what? It is decompressed by the appropriate Dolby or DTS decoder in the amp, and is sent to the DAC as PCM, where it re-joins the timing info. Whether it is compressed or not during the transmission may be immaterial, as the data is the same when it gets to the DAC either way. And the important thing is that the timing info gets there the same way, too, and that's what affects the jitter.

Not quite. The HDMI clock is used to recover the source data for the codec, but not the timing for PCM playback. The packet of data is sent to the decoder which then produces PCM samples. At that point, it is the job of the AVR to marry the two together. The original clock on HDMI is long long and onto the next frame of audio as this is going on.

If so, then jitter is eliminated from HDMI but we have the interpolation/sample rate conversion with buffered audio that I talked about before. If care is not taken, every so often, there could be an audible degradation if the buffer runs dry, or overruns. Alternatively, the audio could drift and be ahead or behind and hence my question above.

Quote:


At least that is my presumption, and I would be really grateful if anyone could confirm this.

regards, Nick

The above are my assumptions too . Let me play around with the Onkyo to see if I can uncover anything. But I suspect it is hard to reverse engineer how this works....
post #114 of 461
Ok Amir, so let's get to the next problem child..

You've said before that perfectly syncing audio to video is simply impossible. This seems to fly in the face of what you have said about the data being immune to jitter in the case of compressed audio codecs, particularly if the AVR is the waiter delivering the meal.
post #115 of 461
Thread Starter 
Quote:
Originally Posted by hellokeith View Post

Ok Amir, so let's get to the next problem child..

You've said before that perfectly syncing audio to video is simply impossible. This seems to fly in the face of what you have said about the data being immune to jitter in the case of compressed audio codecs, particularly if the AVR is the waiter delivering the meal.

Oh, I didn't say it was impossible. I said you have to perform sample rate conversion for the buffer overflow/underruns. Am I missing your point?
post #116 of 461
Quote:
Originally Posted by amirm View Post

Oh, I didn't say it was impossible.



http://www.avsforum.com/avs-vb/showt...6#post11621416

Quote:
Originally Posted by amirm View Post

Quick summary, sync is an exceptionally difficult problem to solve. I will go so far as saying that outside of a production studio, getting percise sync is impossible. Yes, impossible. Unless audio and video use the same clock, which they never do in your home, you are going to see sync issues including dynamic ones you mention. And such sync issues will vary based on your equipment.
post #117 of 461
Thread Starter 
Oh, I see where confusion came from. What I talked about there is the fact that true, 100% sync as we enjoy in a studio, is essentially impossible in a consumer scenario. In a studio we have one reference clock. In HD optical, we have a multiplexed stream which the player is then told to try to keep in sync. It does the best it can, but it can still get ahead or behind and attempt to get close to each other.

To the extent you seperate audio and video, you add to that problem and I showed one way how you can compensate for that. But the resampling does not get you back to studio level sync.
post #118 of 461
Quote:
Originally Posted by amirm View Post

Oh, I see where confusion came from. What I talked about there is the fact that true, 100% sync as we enjoy in a studio, is essentially impossible in a consumer scenario. In a studio we have one reference clock. In HD optical, we have a multiplexed stream which the player is then told to try to keep in sync. It does the best it can, but it can still get ahead or behind and attempt to get close to each other.

To the extent you seperate audio and video, you add to that problem and I showed one way how you can compensate for that. But the resampling does not get you back to studio level sync.

OK, so now for a slightly off-topic question - why hasn't anyone implemented SMPTE-style time sync systems for home theater gear? This would seem to be a relatively easy way to get rid of the sync problems we're seeing with split chains (i.e, audio going one place and video going another).
post #119 of 461
Amir:

Is there anything illegal in the HDMI spec that prevents a manufacturer from sending two discrete outputs somewhere in the HDMI chain?

Output 1: Merged audio + video

Output 2: Blank frames of video + audio

I don't have access to specifications for HDMI to know what's legal and what isn't. It stands to reason that sending blank/empty frames of video (at say 480p or 720p) still gives you a video blanking interval to send the audio data in IMO it's no different from when there is a merged A/V stream with actual video content.

Cheers,
post #120 of 461
First of all can say how much I appreciate hearing from an audiophile in a video world, who can comfortably straddle both the subjectivists and obectivists camps.
Quote:
Originally Posted by amirm View Post

Wow! Most excellent summary of what I described. Sometime I feel like I need to go back to school and learn to summarize better .

My words actually; this is a synopsis of what I've been saying for a year or so in my journey of jitter discovery.
http://www.avforums.com/forums/blog....=recent&page=2
I'm pleased that there is such a good correlation with what you were saying, especially about the nature of the data and timing streams. To begin with, I used to make long posts that few people actually read, and even fewer understood, so now it's bare bones or nothing.
Quote:
Originally Posted by amirm View Post

You know I have not looked at the detailed spec either. But I assume that if you send compressed data, then it is assumed that each packet goes with the individual video frame. The processor then, is responsible to sync'ing these two together.

If so, the above pushes the responsibility for audio sync from encoding/player to decoder. Given the "freewheelig" clock in the processor driving audio, I would think that it could run too fast or two slow, requiring interpolation to stay sync'ed to video.

I just got the new Onkyo and will do some tests of AVR based decoding. But has anyone else played with this and notice different sync issues than player based decoding?

Not quite. The HDMI clock is used to recover the source data for the codec, but not the timing for PCM playback. The packet of data is sent to the decoder which then produces PCM samples. At that point, it is the job of the AVR to marry the two together. The original clock on HDMI is long long and onto the next frame of audio as this is going on.

If so, then jitter is eliminated from HDMI but we have the interpolation/sample rate conversion with buffered audio that I talked about before. If care is not taken, every so often, there could be an audible degradation if the buffer runs dry, or overruns. Alternatively, the audio could drift and be ahead or behind and hence my question above.

Rather than respond to each point, I wonder if we are slightly talking at cross-purposes. By "timing info" I don't mean information relating to say, bit-rate, or synchronisation (where each audio sample should be recreated, relative to some specific slot / pixel / line / frame).

Although timing or clock could refer to either bits or words, what I specifically mean is the precise point in time where the DAC must recreate each successive new sample. More of a metronome than a clock. For now, I don't care whether audio and video are unsynchronised, just as long as the recurring period between each audio sample is as close to constant as possible.

My understanding of the HDMI implementation comes from the HDMI spec, which describes Audio Sample Clock Capture and Audio Clock Regeneration. ACR is the process of regenerating the audio clock from the TMDS clock. The implementation is not defined, though, (as loose as ever) a couple of possible examples are given - simple TMDS clock decimation; and a PLL clock recovery circuit.

What is described there suggests the audio clock is tied to the video clock, and that the clock recovery does not depend on the audio data coding method. If this is not the case, and compressed audio doesn't take it's clock from the TMDS clock, then does that mean it's not the case for PCM data as well? I'm curious rather than worried about how this process actually works, but of far greater interest is any differences between linear and compressed audio.

I thought I had a good handle on digital audio, but HDMI seems to change everything. Dolby Digital is one thing, but how can I explain that CD sounds much better to me over toslink than it does over HDMI? That's counter-intuitive.

Ever willing to learn, Nick
New Posts  All Forums:Forum Nav:
  Return Home
  Back to Forum: HDTV Software Media Discussion
AVS › AVS Forum › Blu-ray & HD DVD › HDTV Software Media Discussion › Advanced topics in HD audio