Fits on a hard disk but in RAM” (500GB – 5TB?) • Non-linear – because real-world problems are not. • Single CPU – Because parallelization is hard (and often unnecessary)
core learning • The scikit-learn way • Hashing trick • Kernel approximation • Random neural nets • Supervised Feature Extraction • Neural nets • What else is out there