1 Alphabets of Languages with Bidirectional Scripts and their Support Israel Ervin Gidali IBM...

Post on 27-Mar-2015

216 views 0 download

Tags:

Transcript of 1 Alphabets of Languages with Bidirectional Scripts and their Support Israel Ervin Gidali IBM...

1

Alphabets of Languages

with Bidirectional Scripts

and their Support

Israel Ervin Gidali

IBM Globalization Centre of

Competency-

Complex Text Languages

2

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Agenda The predecessors of the first true alphabets The first alphabets Direction of writing The “modern” RTL scripts Bidirectionality Bidi – some of the challenges Implementation aspects

3

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The Alphabet Predecessors

The predecessors of the first true alphabets:

The Egyptian hieroglyphics (since 3000 BCE)

The Mesopotamian cuneiforms (since 3100 BCE)

4

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The Egyptian Hieroglyphs

PictogramsLogogramsPhonograms

5

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The Cuneiform Writing Systems

logo-syllabic syllabic words

6

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The First Alphabets

7

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The First Semitic Alphabets

Proto-Sinaic and Proto-Canaanite. Originated around the 18th or 17th

centuries BCE, under the influence of Egyptian hieroglyphs.

8

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The First Semitic Alphabets

The revolution: purely phonetic (only consonants without vowels).

Influenced originally by the polyphony practice in hieroglyphic and cuneiform scripts.

9

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Proto-Canaanite

Limit the set of sounds to 22 consonants only, still without vowels.

Acrophonic. Letters easy to distinguish and

remember (their shapes resemble familiar objects).

10

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Proto-Canaanite Descendents

11

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The alphabet success

Proto-Canaanite, Phoenician and Greek

12

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Ancient Hebrew and Samaritan

13

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

South Arabian

14

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Aramaic

15

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The Square Hebrew Script

16

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Nabatean

17

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Direction of Writing Hieroglyphs were written in both

directions.

Starting from the 11th century BCE, the writing direction of all Semitic

scripts (except Ethiopic) is from right to left.

18

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The “Modern” RTL scripts

אין כל-חדש תחת השמש

)קהלת פרק א פסוק ט'(

There is nothing new under the sun. (Qohelet/Ecclesiastes 1/9)

19

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Arabic

20

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Arabic Script – the Script of Islam

The Arabic script, the script of Quran, used for: Arabic Persian (Farsi) Urdu Ottoman Turkish (until 1929)

Uighur, Kazakh,Uzbek, Tajik, Kirghiz, Old Malay, Swahili, Hausa, Baluchi, Kashmiri, Sindhi,

Pashto, Lahnda,Dargwa,Morrocan Arabic, Adighe, Ingush, Berber,Kurdish, Jawi/Javanese……

21

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Arabic Vowels

22

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Hebrew Script

Used for:•Hebrew•Yiddish•Ladino (Judezmo)•Arabic•Karaite/Karaim•Turkish

עברית שפה יפה

23

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Hebrew Script

24

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Hebrew script and diacritics

Hebrew text:

Vocalized with “points”

and cantillation marks:

בראשית ברא אלהים את השמים ואת הארץ

25

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Syriac

26

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Thaana

27

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Decimal digits forms

European digits (Arabic digits):(Used in Hebrew script and in some Arabic countries)

1 2 3 4 5 6 7 8 9 0

Arabic-Indic digits:

Numbers are written from left to right regardless of their formand regardless of regional variety

(Used in Arabic)

28

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Bidirectionality National language (Arabic, Hebrew, etc...)

text is written from Right to Left

TXET CIBARA Numbers and English (or French,Russian,

etc.) text is written from Left to Right

english text 123 TXET CIBARA

29

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Bidi Appearance Aspects- Directionality

Mixed direction of text segments:

Page alignment on the right Book binding on the right Mirroring of GUI elements

(only when translated)

means Hebrew is beautiful

30

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Bidi Data Processing Aspects –

some of the challenges Bidirectional text data entry Visual versus Logical text type The Paragraph Orientation Arabic script cursiveness:

shaping and ligatures Variety of text layouts in use

31

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The Bidi Layout Challenges

Bidirectional text in different systems and applications has multiple possible layouts

In heterogeneous environments proper layout transformations should be performed

Higher order protocols integration

32

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The Challenge of GUI Mirroring

When translating the interface of an application to a language with Bidirectional script, provisions must be made to ensure that the GUI is properly mirrored.

33

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Implementation Aspects Almost all platforms and Operating

Systems provide support for Bidirectional text entry and processing

New platforms should react to this challenge too

Except for adequately engraved keyboards, there is no need for special hardware for Bidi text support.

34

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

Last Word

RTL scripts are not a novelty. As a matter of fact they have preceded the current Western world scripts

Their support is different but not necessarily much more complex, as long as one is prepared for it.

35

18th International Unicode Conference Hong Kong, April 2001

Alphabets of languages with Bi-directional scripts and their support

The End

Thank You