Euclidean/Angular). • Didn't rewrite the code, utilized the optimized C++ code provided by Annoy. • Easy parallelized by Scala. queries.par.map(q => annoy.query(q)) • JVM with C++ native code is fast and type-safe. • Annoy itself is fast (10x faster than lsh4s).
(for now) if you're not using linux-x86-64. > compileNative > publish • May not be as simple as lsh4s when broadcasting the index onto each worker in Spark. • My C++ skill is poor, as well as my JN* knowledge.