• Examples : ◦ map(): one value → another value ◦ mapToPair(): one value → a tuple ◦ filter(): filters values/tuples given a condition ◦ groupByKey(): groups values by key ◦ reduceByKey(): aggregates values by key ◦ join(), cogroup()...: joins two RDDs
count(): counts values/tuples ◦ saveAsHadoopFile(): saves results in Hadoop’s format ◦ foreach(): applies a function on each item ◦ collect(): retrieves values in a list (List<T>)
of trees by specie Spark - Example geom_x_y;circonfere;adresse;hauteurenm;espece;varieteouc;dateplanta 48.8648454814, 2.3094155344;140.0;COURS ALBERT 1ER;10.0;Aesculus hippocastanum;; 48.8782668139, 2.29806967519;100.0;PLACE DES TERNES;15.0;Tilia platyphyllos;; 48.889306184, 2.30400164126;38.0;BOULEVARD MALESHERBES;0.0;Platanus x hispanica;; 48.8599934405, 2.29504883623;65.0;QUAI BRANLY;10.0;Paulownia tomentosa;;1996-02-29 ...