This talk explores the MapSet API and the implementation of protocols and streaming, through the construction of NaturalSet, a full-featured but simpler set type designed for dense sets of small integers.
, s t r e a m a b l e , u n d e r s t a n d a b l e NATURAL SET Learning about protocols and streams by implementing a new data type from scratch Luciano Ramalho @ramalhoorg
= ["FACE", "CAT", "EYES"] ... 1F637;FACE WITH MEDICAL MASK 1F638;GRINNING CAT FACE WITH SMILING EYES 1F639;CAT FACE WITH TEARS OF JOY 1F63A;SMILING CAT FACE WITH OPEN MOUTH 1F63B;SMILING CAT FACE WITH HEART-SHAPED EYES 1F63C;CAT FACE WITH WRY SMILE 1F63D;KISSING CAT FACE WITH CLOSED EYES 1F63E;POUTING CAT FACE 1F63F;CRYING CAT FACE 1F640;WEARY CAT FACE 1F641;SLIGHTLY FROWNING FACE 1F642;SLIGHTLY SMILING FACE 1F643;UPSIDE-DOWN FACE 1F644;FACE WITH ROLLING EYES ... That's a subset test! Q ⊆ N Show character if all words in the query Q appear in name field N.
from each word (eg. "FACE") to a set of code points with that word in their names (eg. 9860, 128516, etc.)... 10 source: https://github.com/standupdev/gimel
16 new() Creates new empty MapSet. new(enum) Creates new MapSet with elements from enumerable. new(enum, transform) Same as above, applying transform to each element. member?(set, element) Is element included MapSet? Same as: e ∈ M put(set, element) Inserts element. No-op if element already in set. delete(set, element) Removes element from set. size(set) Returns number of elements in set. to_list(set) Builds new list from elements in set.
17 new() Creates new empty MapSet. new(enum) Creates new MapSet with elements from enumerable. new(enum, transform) Same as above, applying transform to each element. member?(set, element) Is element included MapSet? Same as: e ∈ M put(set, element) Inserts element. No-op if element already in set. delete(set, element) Removes element from set. size(set) Returns number of elements in set. to_list(set) Builds new list from elements in set. These are all the operations JS ES6 gives you...
error-prone looping. 20 intersection(set1, set2) Intersection between sets. A ∩ B union(set1, set2) Union of two sets. A ∪ B difference(set1, set2) Difference of set1 ➖ set2. A ∖ B subset?(set1, set2) Are all elements of set1 in set2? A ⊆ B equal?(set1, set2) Do set1 and set2 have all equal elements? A = B disjoint?(set1, set2) Do set1 and set2 have only distinct elements? A ∩ B = ∅
error-prone looping. 21 intersection(set1, set2) Intersection between sets. A ∩ B union(set1, set2) Union of two sets. A ∪ B difference(set1, set2) Difference of set1 ➖ set2. A ∖ B subset?(set1, set2) Are all elements of set1 in set2? A ⊆ B equal?(set1, set2) Do set1 and set2 have all equal elements? A = B disjoint?(set1, set2) Do set1 and set2 have only distinct elements? A ∩ B = ∅ Beyond the von Neumann bottleneck!
elements [0, 4, 5], then ns.bits is 49, a.k.a. 0b110001. To put element 2: •shift 1 left by 2: result is 4, a.k.a. 0b100 •bitwise OR 0b100 with ns.bits: result is 53, a.k.a. 0b110101 •build new set with those bits: result is #NaturalSet<[0, 2, 4, 5]>
elements [0, 4, 5], then ns.bits is 49, a.k.a. 0b110001. To put element 2: •shift 1 left by 2: result is 4, a.k.a. 0b100 •bitwise OR 0b100 with ns.bits: result is 53, a.k.a. 0b110101 •build new set with those bits: result is #NaturalSet<[0, 2, 4, 5]>
elements [0, 4, 5], then ns.bits is 49, a.k.a. 0b110001. To put element 2: •shift 1 left by 2: result is 4, a.k.a. 0b100 •bitwise OR 0b100 with ns.bits: result is 53, a.k.a. 0b110101 •build new set with those bits: result is #NaturalSet<[0, 2, 4, 5]>
elements [0, 4, 5], then ns.bits is 49, a.k.a. 0b110001. To put element 2: •shift 1 left by 2: result is 4, a.k.a. 0b100 •bitwise OR 0b100 with ns.bits: result is 53, a.k.a. 0b110101 •build new set with those bits: result is #NaturalSet<[0, 2, 4, 5]>
count/1, member?/2, slice/1 and reduce/3. slice/1 would require size/1, so this implementation returns an error. This is a convention. ENUMERABLE PROTOCOL IMPLEMENTATION 61
and function/1. In this example: •initial accumulator is {0, 1}: first pair of the sequence, passed to function/1. •function/1 must return: {number_to_emit, next_accumulator} •next_accumulator is {next_a, next_b} 64
numbers from the Fibonacci series until the next number a is larger than the max argument. Stream.unfold/2 stops when the inner function yields nil. 65
takes accumulator and next_one/1: •accumulator is {bits, index}, where index is the value of a (possible) element. •next_one/1 returns: nil or {element_to_emit, {next_bits, next_index}} 67
written a lot of redundant code. •MapSet has a rich API, including powerful operations with whole sets. •Implementing protocols allow custom types to interoperate with core parts of the Elixir standard library: Kernel, Enums, Streams... 71
written a lot of redundant code. •MapSet has a rich API, including powerful operations with whole sets. •Implementing protocols allow custom types to interoperate with core parts of the Elixir standard library: Kernel, Enums, Streams... •Streaming can be implemented with the help of Streams.unfold/2 (and other helpers in the Streams module). 72
written a lot of redundant code. •MapSet has a rich API, including powerful operations with whole sets. •Implementing protocols allow custom types to interoperate with core parts of the Elixir standard library: Kernel, Enums, Streams... •Streaming can be implemented with the help of Streams.unfold/2 (and other helpers in the Streams module). •To learn more, study the code for MapSet and NaturalSet. 73 https://hex.pm/packages/natural_set