Calculating Complexity: DL(B)
Description Length of the Bayesian Network — DL(B)
is the number of bits required to represent a Bayesian network. We can break down this value into the sum of two components:
- , which stands for the number of bits required to represent the graph of the Bayesian network,
- represents the number of bits required to represent the set of probability tables .
Calculating DL(G)
To calculate , we need to determine the number of nodes and the number of their parent nodes.
DL(G) = \sum\limits_i^n {\left( {{{\log }_2}(n) + {{\log }_2}\left( {\begin{array}{*{20}{c}} n\\ {\left\| {P{a_i}} \right\|} \end{array}} \right)} \right)}
where
- n is the number of random variables (nodes):
- is the set of the random variables that are parents of in graph
- and is the number of parents of the random variable .
Calculating DL(P|G)
Computing is straightforward as it is proportional to the number of cells in all probability tables.
where
- is the number of states of the random variable
- is the probability associated with the cell.
As the probability p cannot be known prior to learning the network, we use the following classical heuristic in BayesiaLab: