# Why does KL divergence not satisfy the triangle inequality?

$$D_{KL}(p \,\|\, q)=\sum_i p(x_i)\log\left(\frac{p(x_i)}{q(x_i)}\right)$$

Also, can’t you make it satisfy the triangle inequality by taking the absolute value of the information at every point?
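To make the question concrete, here is a small numerical check on three two-outcome distributions. It is only a sketch: the distributions `P`, `Q`, `R` are arbitrary values picked for illustration, and `abs_kl` is one possible reading of "taking the absolute value at every point", namely $\sum_i p(x_i)\,\lvert\log(p(x_i)/q(x_i))\rvert$. Both function names are made up for this example.

```python
import math

def kl(p, q):
    """D_KL(p || q) = sum_i p_i * log(p_i / q_i), using the natural log."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def abs_kl(p, q):
    """Hypothetical variant: absolute value of the log-ratio at every point."""
    return sum(pi * abs(math.log(pi / qi)) for pi, qi in zip(p, q) if pi > 0)

# Arbitrary example distributions over two outcomes (assumed for illustration).
P = [0.5, 0.5]
Q = [0.25, 0.75]
R = [0.1, 0.9]

for name, d in (("KL", kl), ("abs-KL", abs_kl)):
    direct = d(P, R)            # going straight from P to R
    detour = d(P, Q) + d(Q, R)  # going from P to R via Q
    print(f"{name}: d(P,R)={direct:.4f}  d(P,Q)+d(Q,R)={detour:.4f}  "
          f"triangle holds: {direct <= detour}")
```

For these particular values the direct divergence from `P` to `R` comes out larger than the detour through `Q` in both cases, so the triangle inequality fails for plain KL and, under this reading, for the absolute-value variant as well. A single counterexample like this only shows that the inequality can fail; it does not explain why.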