To calculate the description length of the data given the Bayesian network, we utilize the fact that the description length is inversely proportional to the probability of the observed data inferred by the model.
where
The chain rule allows rewriting this equation with:
is the n-dimensional observation described in row , and
is the joint probability of this observation returned by the Bayesian network .