Unicode Emoji. Unicode Consortium: Enable everybody, speaking every language on the Earth, to be able...


Page 2:

Unicode Emoji

M. Davis

Page 3:

Unicode Consortium

Enable everybody, speaking every language on the Earth, to be able to use their language on computers and smartphones.

Page 4:

Unicode Consortium Projects

Internationalization Code (ICU)

Locale Data (CLDR)

Characters

Char Props & Algorithms (UTC)

Page 5:

Unicode Characters

Page 6:

1999: Japan

Page 7:

2007: Unicode expands scope

Page 8:

2008: Gmail / iPhone / …

Page 9:

2010: Unicode emoji

Page 10:

from “Instagram Emojineering”

Page 12:

Properties

General: Name, Name_Alias, Block, Age, General_Category, Script, Script_Extensions, White_Space, Alphabetic, Hangul_Syllable_Type, Noncharacter_Code_Point, Default_Ignorable_Code_Point, Deprecated, Logical_Order_Exception, Variation_Selector

Identifiers: ID_Continue, ID_Start, XID_Continue, XID_Start, Pattern_Syntax, Pattern_White_Space, ...

Case: Uppercase, Lowercase, Lowercase_Mapping, Titlecase_Mapping, Uppercase_Mapping, Case_Folding, Simple_Lowercase_Mapping, Simple_Titlecase_Mapping, Simple_Uppercase_Mapping, Simple_Case_Folding, Soft_Dotted, Cased, Case_Ignorable, Changes_When_Lowercased, Changes_When_Uppercased, Changes_When_Titlecased, Changes_When_Casefolded, Changes_When_Casemapped

Numeric: Numeric_Value, Numeric_Type, Hex_Digit, ASCII_Hex_Digit

Normalization: Canonical_Combining_Class, Decomposition_Mapping, Composition_Exclusion, Full_Composition_Exclusion, Decomposition_Type, NFC_Quick_Check, NFKC_Quick_Check, NFD_Quick_Check, NFKD_Quick_Check, NFKC_Casefold, Changes_When_NFKC_Casefolded

Miscellaneous: Math, Quotation_Mark, Dash, Sentence_Terminal, Terminal_Punctuation, Diacritic, Extender, Grapheme_Base, Grapheme_Extend, Indic_Positional_Category, Indic_Syllabic_Category

Shaping and Rendering: Join_Control, Joining_Group, Joining_Type, Line_Break, Grapheme_Cluster_Break, Sentence_Break, Word_Break, East_Asian_Width, Prepended_Concatenation_Mark

Bidirectional: Bidi_Class, Bidi_Control, Bidi_Mirrored, Bidi_Mirroring_Glyph, Bidi_Paired_Bracket, Bidi_Paired_Bracket_Type

CJK: Ideographic, Unified_Ideograph, Radical, IDS_Binary_Operator, IDS_Trinary_Operator, Unicode_Radical_Stroke
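To make the property list concrete, here is a minimal sketch that reads a handful of these properties for single characters. It assumes Python's standard `unicodedata` module, which exposes only a small subset of the properties on the slide; the dictionary keys are my own labels mapping each call to the closest property name, and `describe` is a hypothetical helper.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Collect a few Unicode Character Database property values for one character."""
    return {
        "Name": unicodedata.name(ch, "<unnamed>"),
        "General_Category": unicodedata.category(ch),
        "Canonical_Combining_Class": unicodedata.combining(ch),
        "Bidi_Class": unicodedata.bidirectional(ch),
        "Bidi_Mirrored": bool(unicodedata.mirrored(ch)),
        "East_Asian_Width": unicodedata.east_asian_width(ch),
        # Decomposition mapping as a string of hex code points (may carry a type tag).
        "Decomposition": unicodedata.decomposition(ch),
        # None when the character has no numeric value.
        "Numeric_Value": unicodedata.numeric(ch, None),
    }


for ch in ("A", "\u00e9", "(", "\u0663"):
    print(repr(ch), describe(ch))
```

Properties such as Script, Line_Break, or the emoji properties are not available through this module and would need the UCD data files or a third-party library.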

Page 13:

Emoji Properties

Property               Characters that …
Emoji                  are emoji
Emoji_Presentation     have emoji presentation by default
Emoji_Modifier         are emoji modifiers (skin tones)
Emoji_Modifier_Base    are bases for emoji modifiers
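As a rough illustration of how these four properties can be queried, the sketch below parses property assignments written in the `code point(s) ; property # comment` layout used by the Unicode data files. The `SAMPLE` text is a tiny hand-written excerpt for demonstration, not the real data file, and `parse` and `has` are hypothetical helper names.

```python
from collections import defaultdict

# Illustrative excerpt only (an assumption for this sketch), not the full data file.
SAMPLE = """\
0023          ; Emoji                # number sign
1F600         ; Emoji                # grinning face
1F600         ; Emoji_Presentation
1F44D         ; Emoji                # thumbs up
1F44D         ; Emoji_Presentation
1F44D         ; Emoji_Modifier_Base
1F3FB..1F3FF  ; Emoji                # skin tone modifiers
1F3FB..1F3FF  ; Emoji_Modifier
"""


def parse(text: str) -> dict:
    """Map each property name to the set of code points that carry it."""
    props = defaultdict(set)
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        cps, prop = (part.strip() for part in line.split(";"))
        first, _, last = cps.partition("..")
        lo = int(first, 16)
        hi = int(last or first, 16)  # a single code point is a range of one
        props[prop].update(range(lo, hi + 1))
    return props


PROPS = parse(SAMPLE)


def has(ch: str, prop: str) -> bool:
    """True if the character carries the given property in the parsed data."""
    return ord(ch) in PROPS[prop]


print(has("#", "Emoji"), has("#", "Emoji_Presentation"))
```

The number sign shows why Emoji and Emoji_Presentation are separate properties: it is an emoji, but it is displayed as text by default.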

Page 14:

Keycaps

#
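A keycap emoji is a sequence rather than a single character. The sketch below assumes the usual form, base character + U+FE0F (variation selector-16) + U+20E3 (combining enclosing keycap), over the bases `#`, `*`, and `0` to `9`; `keycap` is a hypothetical helper name.

```python
VS16 = "\uFE0F"    # VARIATION SELECTOR-16: request emoji presentation
KEYCAP = "\u20E3"  # COMBINING ENCLOSING KEYCAP

KEYCAP_BASES = "#*0123456789"


def keycap(base: str) -> str:
    """Build the keycap sequence for one base character."""
    if len(base) != 1 or base not in KEYCAP_BASES:
        raise ValueError(f"not a keycap base: {base!r}")
    return base + VS16 + KEYCAP


ALL_KEYCAPS = [keycap(c) for c in KEYCAP_BASES]
print(len(ALL_KEYCAPS), [hex(ord(c)) for c in keycap("#")])
```

The twelve bases line up with the 12 keycaps counted in the deck's "How many?" table.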

Page 15:

Flags

AC
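Flags are encoded as a pair of regional indicator symbols, one per letter of the region code. A minimal sketch of that mapping, assuming two-letter ASCII region codes such as the slide's `AC`; `flag` and `region_of` are hypothetical helper names.

```python
RI_A = 0x1F1E6  # REGIONAL INDICATOR SYMBOL LETTER A


def flag(region: str) -> str:
    """Turn a two-letter region code into a regional indicator pair."""
    region = region.upper()
    if len(region) != 2 or not all("A" <= c <= "Z" for c in region):
        raise ValueError(f"expected two ASCII letters, got {region!r}")
    return "".join(chr(RI_A + ord(c) - ord("A")) for c in region)


def region_of(flag_seq: str) -> str:
    """Inverse mapping: regional indicator pair back to the region code."""
    return "".join(chr(ord(c) - RI_A + ord("A")) for c in flag_seq)


ac = flag("AC")
print([f"U+{ord(c):X}" for c in ac], region_of(ac))
```

The function accepts any letter pair, but only pairs that correspond to valid region codes are expected to render as flags; other pairs typically fall back to two letter symbols.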

Page 16:

Skin Tones
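Skin tones are expressed by appending one of five emoji modifiers (U+1F3FB to U+1F3FF) to a modifier base. The sketch below only concatenates; it assumes the caller passes a character that really has the Emoji_Modifier_Base property, and `with_skin_tones` is a hypothetical helper name.

```python
# The five emoji modifiers, U+1F3FB through U+1F3FF.
MODIFIERS = [chr(cp) for cp in range(0x1F3FB, 0x1F400)]

THUMBS_UP = "\U0001F44D"  # an Emoji_Modifier_Base character, chosen as an example


def with_skin_tones(base: str) -> list:
    """Return the five modifier sequences for one modifier base."""
    return [base + modifier for modifier in MODIFIERS]


for seq in with_skin_tones(THUMBS_UP):
    print(" ".join(f"U+{ord(c):X}" for c in seq))
```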

Page 17:

How many? (Emoji 5.0β)

Type                                       Count   Subtotal
Singletons                                 1,126   1,126
Incomplete* Singletons                       -38   1,088
Emoji_Combining_Sequence: Keycaps             12   1,100
Emoji_Flag_Sequence: Valid RI sequences      258   1,358
Emoji_Modifier_Sequence: Skin tones          465   1,823

Page 18:

Emoji ZWJ Sequence

ZWJ

ZWJ
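An emoji ZWJ sequence joins several emoji with U+200D (zero width joiner) so that a supporting font can draw them as one glyph. The slide's images did not survive extraction, so the sketch below assumes a three-element family (man, woman, girl) as a stand-in with two joiners; `zwj_join` is a hypothetical helper name.

```python
ZWJ = "\u200D"  # ZERO WIDTH JOINER

MAN = "\U0001F468"
WOMAN = "\U0001F469"
GIRL = "\U0001F467"


def zwj_join(*parts: str) -> str:
    """Join emoji into a single ZWJ sequence."""
    return ZWJ.join(parts)


family = zwj_join(MAN, WOMAN, GIRL)
print(" ".join(f"U+{ord(c):X}" for c in family))
```

On a platform without a glyph for the sequence, the fallback is the individual emoji shown side by side, which is why the joiner is invisible.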

Page 20:

Gendered w/ Object

ZWJ

Page 21:

Gendered w/ Sign

ZWJ
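The two gendered patterns differ in what follows the joiner: "with object" pairs a person with an object emoji, while "with sign" pairs a base emoji with a gender sign. A minimal sketch, with example characters chosen by me because the slide images are lost; the helper names are hypothetical.

```python
ZWJ = "\u200D"   # ZERO WIDTH JOINER
VS16 = "\uFE0F"  # VARIATION SELECTOR-16, gives the gender sign emoji presentation

WOMAN = "\U0001F469"
ROCKET = "\U0001F680"
POLICE_OFFICER = "\U0001F46E"
FEMALE_SIGN = "\u2640"
MALE_SIGN = "\u2642"


def role_with_object(person: str, obj: str) -> str:
    """Gendered role, with object: person + ZWJ + object (e.g. woman astronaut)."""
    return person + ZWJ + obj


def with_gender_sign(base: str, sign: str) -> str:
    """Gendered form, with sign: base + ZWJ + gender sign + VS16."""
    return base + ZWJ + sign + VS16


print([f"U+{ord(c):X}" for c in role_with_object(WOMAN, ROCKET)])
print([f"U+{ord(c):X}" for c in with_gender_sign(POLICE_OFFICER, FEMALE_SIGN)])
```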

Page 22:

UI Actions: Backspace Example

ZWJ

Page 23:

Segmentation

ZWJ

Don’t break!
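"Don't break!" means that cursor movement, selection, and the backspace example from the previous slide should treat a whole emoji sequence as one unit. The sketch below is a deliberately simplified segmenter, not the full Unicode text segmentation algorithm: it assumes only the emoji cases from this deck (ZWJ, variation selector, keycap mark, skin tone modifiers, regional indicator pairs) and glues any character that follows a ZWJ. `clusters` and `backspace` are hypothetical names.

```python
ZWJ = "\u200D"     # ZERO WIDTH JOINER
VS16 = "\uFE0F"    # VARIATION SELECTOR-16
KEYCAP = "\u20E3"  # COMBINING ENCLOSING KEYCAP


def _is_modifier(ch: str) -> bool:
    return 0x1F3FB <= ord(ch) <= 0x1F3FF


def _is_regional_indicator(ch: str) -> bool:
    return 0x1F1E6 <= ord(ch) <= 0x1F1FF


def clusters(text: str) -> list:
    """Split text into user-perceived units (simplified, emoji cases only)."""
    out = []
    i, n = 0, len(text)
    while i < n:
        j = i + 1
        # Two regional indicators form one flag.
        if _is_regional_indicator(text[i]) and j < n and _is_regional_indicator(text[j]):
            j += 1
        while j < n:
            ch = text[j]
            if ch in (VS16, KEYCAP) or _is_modifier(ch):
                j += 1  # extends the current unit
            elif ch == ZWJ:
                j = min(j + 2, n)  # the joiner and the character it glues on
            else:
                break
        out.append(text[i:j])
        i = j
    return out


def backspace(text: str) -> str:
    """Delete the last whole unit instead of the last code point."""
    return "".join(clusters(text)[:-1])


family = "\U0001F468\u200D\U0001F469\u200D\U0001F467"
print(len(family), len(clusters(family)), repr(backspace("hi" + family)))
```

Production code would normally rely on a complete grapheme cluster implementation (for example ICU's break iterators) rather than a hand-rolled scan like this one.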

Page 24:

ZWJ Sequences (Emoji 5.0β)

Type                             Count   Subtotal
Family                              31   1,854
Gendered Role, with object         192   2,046
Gendered Role, with sign            72   2,118
Gendered Activity, with sign       216   2,334
Gendered Gestures, with sign       108   2,442
Other                                2   2,444
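The Subtotal column is a running total that continues from the 1,823 reached on the earlier "How many?" slide. A short sketch that recomputes it from the Count columns of both tables; the row labels are abbreviated by me.

```python
from itertools import accumulate

# (type, count) rows copied from the two Emoji 5.0 beta tables in this deck.
ROWS = [
    ("Singletons", 1126),
    ("Incomplete singletons", -38),
    ("Keycaps", 12),
    ("Flags (valid RI sequences)", 258),
    ("Skin tones", 465),
    ("Family", 31),
    ("Gendered role, with object", 192),
    ("Gendered role, with sign", 72),
    ("Gendered activity, with sign", 216),
    ("Gendered gestures, with sign", 108),
    ("Other", 2),
]

SUBTOTALS = list(accumulate(count for _, count in ROWS))

for (label, count), subtotal in zip(ROWS, SUBTOTALS):
    print(f"{label:30} {count:>6,} {subtotal:>7,}")
```

The recomputed values agree with every subtotal printed on the slides, ending at 2,444.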

Page 26:

CLDR Names, Keywords

unicode.org/cldr/charts/30/annotations

Name                Keywords
nerd face           face | geek | nerd
nørdansigt          ansigt | nørd
nerderig gezicht    geek | gezicht | nerd
Nerd-Smiley         Gesicht | Nerd
nördaandlit         andlit | lúði | nörd
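Names and keywords like these are what let an emoji picker search in the user's own language. The sketch below builds a keyword index from the slide's five entries; the locale codes (`en`, `da`, `nl`, `de`, `is`) are my assumption since the slide does not label the languages, the dictionary layout is not the CLDR file format, and `build_index` and `search` are hypothetical names.

```python
from collections import defaultdict

NERD_FACE = "\U0001F913"

# locale -> emoji -> (name, keywords); values copied from the slide.
ANNOTATIONS = {
    "en": {NERD_FACE: ("nerd face", ["face", "geek", "nerd"])},
    "da": {NERD_FACE: ("nørdansigt", ["ansigt", "nørd"])},
    "nl": {NERD_FACE: ("nerderig gezicht", ["geek", "gezicht", "nerd"])},
    "de": {NERD_FACE: ("Nerd-Smiley", ["Gesicht", "Nerd"])},
    "is": {NERD_FACE: ("nördaandlit", ["andlit", "lúði", "nörd"])},
}


def build_index(locale: str) -> dict:
    """Map each case-folded keyword to the set of emoji annotated with it."""
    index = defaultdict(set)
    for emoji, (_name, keywords) in ANNOTATIONS[locale].items():
        for keyword in keywords:
            index[keyword.casefold()].add(emoji)
    return index


def search(locale: str, query: str) -> list:
    """Return the emoji whose keywords match the query exactly (case-insensitive)."""
    return sorted(build_index(locale).get(query.casefold(), set()))


print(search("de", "nerd"), search("da", "ansigt"), search("en", "cat"))
```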

Page 28:

Birth of an emoji character

[Timeline figure; surviving labels: < Oct., Nov., Mar., June, > July, β]

http://unicode.org/emoji/selection.html

Page 30:

benefiting Digitally Disadvantaged Languages

1. Emoji pushed products to improve their Unicode handling!

2. And led us to launch a fundraising program for DDLs.

Page 31:

AMA

http://goo.gl/XklA87

Page 32:

Old slides

Page 33:

Future: Flags?

F g b s c t ✦

Tag Key

Region = United Kingdom (Country, …)

Tag Value

Tag Base

Subdivision = Scotland (Province, State, Canton, ...)

Tag Term
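The slide sketches a flag for Scotland as a base, the letters `g b s c t`, and a terminator. The following sketch assumes the encoding later published for emoji tag sequences: U+1F3F4 (black flag) as the tag base, tag characters at U+E0000 plus the ASCII value, and U+E007F (cancel tag) as the terminator. Mapping the slide's schematic "F" and "✦" onto those code points is my assumption, and `subdivision_flag` is a hypothetical name.

```python
BLACK_FLAG = "\U0001F3F4"  # assumed tag base (the slide's "F")
CANCEL_TAG = "\U000E007F"  # assumed tag terminator (the slide's "✦")
TAG_OFFSET = 0xE0000       # tag character = TAG_OFFSET + ASCII code


def subdivision_flag(code: str) -> str:
    """Encode a subdivision code such as 'gbsct' as an emoji tag sequence."""
    code = code.lower().replace("-", "")
    if not code or not (code.isascii() and code.isalnum()):
        raise ValueError(f"expected ASCII letters and digits, got {code!r}")
    tags = "".join(chr(TAG_OFFSET + ord(c)) for c in code)
    return BLACK_FLAG + tags + CANCEL_TAG


scotland = subdivision_flag("gbsct")
print(" ".join(f"U+{ord(c):X}" for c in scotland))
```

The region part (`gb`, United Kingdom) and the subdivision part (`sct`, Scotland) are simply concatenated, which matches the seven elements drawn on the slide.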

Page 34:

Gendered Activity/Gesture, with sign

ZWJ