Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Emojiの正規表現
Search
ima1zumi
June 24, 2022
0
15
Emojiの正規表現
ima1zumi
June 24, 2022
Tweet
Share
More Decks by ima1zumi
See All by ima1zumi
Ruby Taught Me About Under the Hood
ima1zumi
3
7k
Exploring Reline: Enhancing Command Line Usability
ima1zumi
0
71
10年物のRailsアプリにキャッチアップ!〜コードを読まずに理解したかった〜
ima1zumi
0
79
RubyKaigiの登壇者一覧ページを作った
ima1zumi
0
340
Relineのその後の生活
ima1zumi
0
210
IRB and Reline Kaigi 2024
ima1zumi
0
11
Exploring Reline: Enhancing Command Line Usability
ima1zumi
3
14k
Reline 1分 Cooking
ima1zumi
0
35
続・mruby/cにUTF-8 を実装する
ima1zumi
1
27
Featured
See All Featured
Typedesign – Prime Four
hannesfritz
41
2.6k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
23
2.7k
We Have a Design System, Now What?
morganepeng
52
7.5k
Code Review Best Practice
trishagee
67
18k
Being A Developer After 40
akosma
91
590k
Building Applications with DynamoDB
mza
94
6.3k
Statistics for Hackers
jakevdp
798
220k
Optimising Largest Contentful Paint
csswizardry
37
3.2k
Adopting Sorbet at Scale
ufuk
76
9.3k
Music & Morning Musume
bryan
47
6.5k
BBQ
matthewcrist
88
9.6k
Building Adaptive Systems
keathley
41
2.5k
Transcript
Emojiͷਖ਼نදݱ🤯 2022-06-24 ima1zumi
ࣗݾհ • ima1zumi • จࣈͱจࣈίʔυ͕͖
ڥ • Unicode 14.0 (2021-09-14) • UTF-8 • Ruby 3.1.0
• ೖྗϑΥʔϜͰֆจࣈΛೖྗෆՄʹ͍ͨ͠ • e.g. ❤ 🐝 👪 0⃣ •
ͦͷଞͷจࣈೖྗՄͱ͢Δ • e.g. 𩸽 (΄͚ͬ) • UTF-8
4όΠτҎ্ͷจࣈྻΛ͘
Unicode Character Properties
Unicode จࣈϓϩύςΟ • จࣈͷ࣋ͭଐੑੑ࣭ΛUnicodeίϯιʔγΞϜ͕ఆΊͨͷ • Unicode Technical Report #23 THE
UNICODE CHARACTER PROPERTY MODEL • Unicode® Standard Annex #44 UNICODE CHARACTER DATABASE • https://www.unicode.org/reports/
Hiragana
Unicode Utilities https://util.unicode.org/UnicodeJsps/character.jsp
Unicodeͷ֤จࣈͷϓϩύςΟΛਖ਼Ҿ͖ɾٯҾ͖Ͱ͖Δ
Hiraganaʹ⾕ͳͲؚ·ΕΔ🧐
Emojiؔ࿈ͷϓϩύςΟ • Emoji • Emoji_Presentation • Emoji_Modi fi er •
Emoji_Modi fi er_Base • Emoji_Component • Extended_Pictographic • Basic_Emoji • RGI_Emoji • RGI_Emoji_Flag_Sequence • RGI_Emoji_Keycap_Sequence • RGI_Emoji_Modi fi er_Sequence • RGI_Emoji_Tag_Sequence • RGI_Emoji_Zwj_Sequence 🆕
p{Emoji}
👀
p{Emoji}ʹؚ·ΕΔͷɺؚ·Εͳ͍ͷ • 0⃣ `U+30, U+FE0F, U+20E3` ͷ3ͭͷจࣈ͔ΒΓཱͭ • keycap
sequence • U+30 == 0 • 0EmojiͷҰ෦ʹͳͬͯ͠·͍ͬͯΔ • #,*,0-9͖ͨ͘ͳ͍ • U+20E3ؚ·Εͳ͍
Basic_Emoji • ྑͦ͞͏ • Modi fi erܥؚ·Εͳ͍ͷͰଞͷϓϩύςΟͱΈ߹ΘͤΔ • Ruby(Onigmo)ະରԠͷϓϩύςΟͷͨΊಈ࡞֬ೝͰ͖ͣ😢
Unicodeਪͷਖ਼نදݱ http://www.unicode.org/reports/tr51/ 🤷
·ͱΊ • EmojiΛਖ਼نදݱͰఆ͢Δͷਂ͍প͕͍ͬͯΔ • ϥΠϒϥϦΛ͓͏ • Basic_Emoji ͕͑ΔΑ͏ʹͳΕͬͱεϚʔτʹॻ͚Δ͔ • 🌟
Unicode Utilities 🌟 • https://util.unicode.org/UnicodeJsps/character.jsp