! • Models with multiple layers of hidden variables allow for efficient inference in a much larger class of distributions. • However, deep architectures leads are difficult to learn and require approximate methods (MCMC, etc.) • Complex Networks have intractable Partition functions: