BBN--ANG--183 Typography Lecture 5A: Breaking textseas3.elte.hu/typo/05A-breaking.pdf ·...

Post on 19-Jul-2020

3 views 0 download

Transcript of BBN--ANG--183 Typography Lecture 5A: Breaking textseas3.elte.hu/typo/05A-breaking.pdf ·...

BBN–ANG–183 Typography

Lecture 5A: Breaking text

Zoltan Kiss & Peter Szigetvari

Dept of English Linguistics, Eotvos Lorand University

kz & szp (delg) typo/breaking (5A) 1 / 23

outline

probelms with WYSIWYGchanging appearancesWYSIWYG is dangerousthe importance of document structure

breaking textinto linesinto pagesfootnotes

conclusions

sample exam questions

kz & szp (delg) typo/breaking (5A) 2 / 23

breaking text into lines and pages is the central issue in typesetting(cf. Hungarian tordeles, tordelo) — today, we’re going to see

kz & szp (delg) typo/breaking (5A) 3 / 23

breaking text into lines and pages is the central issue in typesetting(cf. Hungarian tordeles, tordelo) — today, we’re going to see

◮ why this is so

kz & szp (delg) typo/breaking (5A) 3 / 23

breaking text into lines and pages is the central issue in typesetting(cf. Hungarian tordeles, tordelo) — today, we’re going to see

◮ why this is so

◮ how to do it smartly

kz & szp (delg) typo/breaking (5A) 3 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with size

kz & szp (delg) typo/breaking (5A) 4 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with size

◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)

kz & szp (delg) typo/breaking (5A) 4 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with size

◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)

◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document

kz & szp (delg) typo/breaking (5A) 4 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with size

◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)

◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document

◮ if the paper size changes (e.g., A4 vs. US Letter) the width of the linealso changes

kz & szp (delg) typo/breaking (5A) 4 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with size

◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)

◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document

◮ if the paper size changes (e.g., A4 vs. US Letter) the width of the linealso changes

the solution

kz & szp (delg) typo/breaking (5A) 4 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with size

◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)

◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document

◮ if the paper size changes (e.g., A4 vs. US Letter) the width of the linealso changes

the solution

◮ the size of the textblock (width and height of the area where text willbe set) should be specified

kz & szp (delg) typo/breaking (5A) 4 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with size

◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)

◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document

◮ if the paper size changes (e.g., A4 vs. US Letter) the width of the linealso changes

the solution

◮ the size of the textblock (width and height of the area where text willbe set) should be specified

◮ but this is less trivial, since one must calculate the necessary marginsizes from the given paper size beforehand

kz & szp (delg) typo/breaking (5A) 4 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with other text properties

kz & szp (delg) typo/breaking (5A) 5 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with other text properties

◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:

kz & szp (delg) typo/breaking (5A) 5 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with other text properties

◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:

◮ e.g., font face, size is part of the document, even when this is not necessary

kz & szp (delg) typo/breaking (5A) 5 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with other text properties

◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:

◮ e.g., font face, size is part of the document, even when this is not necessary

a solution

do not use .doc/.odt when plain text (.txt) will do

kz & szp (delg) typo/breaking (5A) 5 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with other text properties

◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:

◮ e.g., font face, size is part of the document, even when this is not necessary

a solution

do not use .doc/.odt when plain text (.txt) will do

a problem with this solution

some necessary properties of the text (e.g., italicization, special characters) willalso be lost in plain text

kz & szp (delg) typo/breaking (5A) 5 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with other text properties

◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:

◮ e.g., font face, size is part of the document, even when this is not necessary

a solution

do not use .doc/.odt when plain text (.txt) will do

a problem with this solution

some necessary properties of the text (e.g., italicization, special characters) willalso be lost in plain text

the ultimate solution

do not use word processors (i.e., WYSIWYG technology)

kz & szp (delg) typo/breaking (5A) 5 / 23

probelms with WYSIWYG changing appearances

why does my document’s appearance change all the time?

a problem with other text properties

◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:

◮ e.g., font face, size is part of the document, even when this is not necessary

a solution

do not use .doc/.odt when plain text (.txt) will do

a problem with this solution

some necessary properties of the text (e.g., italicization, special characters) willalso be lost in plain text

the ultimate solution

do not use word processors (i.e., WYSIWYG technology): use mark-up languages

kz & szp (delg) typo/breaking (5A) 5 / 23

probelms with WYSIWYG WYSIWYG is dangerous

the perils of WYSIWYG

WYSIWYG word processing is dangerous for two reasons

kz & szp (delg) typo/breaking (5A) 6 / 23

probelms with WYSIWYG WYSIWYG is dangerous

the perils of WYSIWYG

WYSIWYG word processing is dangerous for two reasons

1. the structure of the document will not be specified

a section title will not be specified as a section title, it will simply beset in a larger/emboldened/whatever font, with extra space aboveand below

kz & szp (delg) typo/breaking (5A) 6 / 23

probelms with WYSIWYG WYSIWYG is dangerous

the perils of WYSIWYG

WYSIWYG word processing is dangerous for two reasons

1. the structure of the document will not be specified

a section title will not be specified as a section title, it will simply beset in a larger/emboldened/whatever font, with extra space aboveand below

2. line and page breaks will be chosen during writing the text

as a consequence, any subsequent change (or simply reading the fileby another word processor) may result in undesired line and pagebreaks

kz & szp (delg) typo/breaking (5A) 6 / 23

probelms with WYSIWYG WYSIWYG is dangerous

the perils of WYSIWYG

WYSIWYG word processing is dangerous for two reasons

1. the structure of the document will not be specified

a section title will not be specified as a section title, it will simply beset in a larger/emboldened/whatever font, with extra space aboveand below

2. line and page breaks will be chosen during writing the text

as a consequence, any subsequent change (or simply reading the fileby another word processor) may result in undesired line and pagebreaks

both dangers can be avoided, but generally users of word processors fallinto these traps: WYSIWYG technology notoriously guides users into them

kz & szp (delg) typo/breaking (5A) 6 / 23

probelms with WYSIWYG the importance of document structure

the importance of document structure

why is it a problem if the structure of the document is not specified?

the main points of producing electronic documents are

kz & szp (delg) typo/breaking (5A) 7 / 23

probelms with WYSIWYG the importance of document structure

the importance of document structure

why is it a problem if the structure of the document is not specified?

the main points of producing electronic documents are

◮ reusability: (parts of) a document should be used on severaloccasions, it is rarely useful to create documents for the sake of asingle printing

kz & szp (delg) typo/breaking (5A) 7 / 23

probelms with WYSIWYG the importance of document structure

the importance of document structure

why is it a problem if the structure of the document is not specified?

the main points of producing electronic documents are

◮ reusability: (parts of) a document should be used on severaloccasions, it is rarely useful to create documents for the sake of asingle printing

◮ machine-parsability: the document should be correctly interpretableby artificial intelligence, too (e.g., blind people may want to have itread, we may want to convert it to another format — e.g., HTML forweb publishing)

kz & szp (delg) typo/breaking (5A) 7 / 23

probelms with WYSIWYG the importance of document structure

the importance of document structure

why is it a problem if the structure of the document is not specified?

the main points of producing electronic documents are

◮ reusability: (parts of) a document should be used on severaloccasions, it is rarely useful to create documents for the sake of asingle printing

◮ machine-parsability: the document should be correctly interpretableby artificial intelligence, too (e.g., blind people may want to have itread, we may want to convert it to another format — e.g., HTML forweb publishing)

both aims are easy/possible to achieve only if the structure of thedocument is specified

kz & szp (delg) typo/breaking (5A) 7 / 23

probelms with WYSIWYG the importance of document structure

breaks and reusability

where a line is broken depends on

kz & szp (delg) typo/breaking (5A) 8 / 23

probelms with WYSIWYG the importance of document structure

breaks and reusability

where a line is broken depends on

◮ the width of the column (line, textblock)

kz & szp (delg) typo/breaking (5A) 8 / 23

probelms with WYSIWYG the importance of document structure

breaks and reusability

where a line is broken depends on

◮ the width of the column (line, textblock)

◮ the font’s properties: face, weight, size

kz & szp (delg) typo/breaking (5A) 8 / 23

probelms with WYSIWYG the importance of document structure

breaks and reusability

where a line is broken depends on

◮ the width of the column (line, textblock)

◮ the font’s properties: face, weight, size

◮ the shape of the paragraph (e.g., justified or not)

kz & szp (delg) typo/breaking (5A) 8 / 23

probelms with WYSIWYG the importance of document structure

breaks and reusability

where a line is broken depends on

◮ the width of the column (line, textblock)

◮ the font’s properties: face, weight, size

◮ the shape of the paragraph (e.g., justified or not)

options

kz & szp (delg) typo/breaking (5A) 8 / 23

probelms with WYSIWYG the importance of document structure

breaks and reusability

where a line is broken depends on

◮ the width of the column (line, textblock)

◮ the font’s properties: face, weight, size

◮ the shape of the paragraph (e.g., justified or not)

options

◮ normally these are not inherent properties of the text, when any ofthem changes, the places of line breaks change

kz & szp (delg) typo/breaking (5A) 8 / 23

probelms with WYSIWYG the importance of document structure

breaks and reusability

where a line is broken depends on

◮ the width of the column (line, textblock)

◮ the font’s properties: face, weight, size

◮ the shape of the paragraph (e.g., justified or not)

options

◮ normally these are not inherent properties of the text, when any ofthem changes, the places of line breaks change

◮ if these properties are inherent in the text, create an image file ofyour text (e.g., a JPEG file) and store your text in that (you will notbe able to modify the text though!)

kz & szp (delg) typo/breaking (5A) 8 / 23

probelms with WYSIWYG the importance of document structure

breaks and reusability

where a line is broken depends on

◮ the width of the column (line, textblock)

◮ the font’s properties: face, weight, size

◮ the shape of the paragraph (e.g., justified or not)

options

◮ normally these are not inherent properties of the text, when any ofthem changes, the places of line breaks change

◮ if these properties are inherent in the text, create an image file ofyour text (e.g., a JPEG file) and store your text in that (you will notbe able to modify the text though!) — this is like printed text:lacking the versatility of electronic text

kz & szp (delg) typo/breaking (5A) 8 / 23

breaking text

the rule of breaking text

text

here means the flowing text of the document, not including illustrations(pictures, charts, tables, etc.), which are usually less rigidly positioned indocuments

kz & szp (delg) typo/breaking (5A) 9 / 23

breaking text

the rule of breaking text

text

here means the flowing text of the document, not including illustrations(pictures, charts, tables, etc.), which are usually less rigidly positioned indocuments

breaking text

must be done “blindly”, i.e., without knowing where the breaks willactually be in a specific version of the text, i.e., we must specify placeswhere text may and where it may not be broken

kz & szp (delg) typo/breaking (5A) 9 / 23

breaking text

the rule of breaking text

text

here means the flowing text of the document, not including illustrations(pictures, charts, tables, etc.), which are usually less rigidly positioned indocuments

breaking text

must be done “blindly”, i.e., without knowing where the breaks willactually be in a specific version of the text, i.e., we must specify placeswhere text may and where it may not be broken

while sentences and paragraphs are primarily logical units, lines and pagesare rigid typographical units, which text must fit

kz & szp (delg) typo/breaking (5A) 9 / 23

breaking text into lines

points of breaking text into lines

a line of text

is a string of two types of element:

kz & szp (delg) typo/breaking (5A) 10 / 23

breaking text into lines

points of breaking text into lines

a line of text

is a string of two types of element:

◮ characters

kz & szp (delg) typo/breaking (5A) 10 / 23

breaking text into lines

points of breaking text into lines

a line of text

is a string of two types of element:

◮ characters

◮ whitespace ( SPACE , TAB , NEWLINE )

kz & szp (delg) typo/breaking (5A) 10 / 23

breaking text into lines

points of breaking text into lines

a line of text

is a string of two types of element:

◮ characters

◮ whitespace ( SPACE , TAB , NEWLINE )

SPACE

is a universal breaking point: lines may be broken at a SPACE

kz & szp (delg) typo/breaking (5A) 10 / 23

breaking text into lines

points of breaking text into lines

a line of text

is a string of two types of element:

◮ characters

◮ whitespace ( SPACE , TAB , NEWLINE )

SPACE

is a universal breaking point: lines may be broken at a SPACE

TAB

is not uniform in its behaviour wrt line breaking, but it is not used in text

kz & szp (delg) typo/breaking (5A) 10 / 23

breaking text into lines

points of breaking text into lines

a line of text

is a string of two types of element:

◮ characters

◮ whitespace ( SPACE , TAB , NEWLINE )

SPACE

is a universal breaking point: lines may be broken at a SPACE

TAB

is not uniform in its behaviour wrt line breaking, but it is not used in text

NEWLINE

breaks the line obligatorily

kz & szp (delg) typo/breaking (5A) 10 / 23

breaking text into lines

points of breaking text into lines

a line of text

is a string of two types of element:

◮ characters

◮ whitespace ( SPACE , TAB , NEWLINE )

SPACE

is a universal breaking point: lines may be broken at a SPACE

TAB

is not uniform in its behaviour wrt line breaking, but it is not used in text

NEWLINE

breaks the line obligatorily; in word processors, it ends the paragraph, intext formatters two or more NEWLINE s represent a paragraph division

kz & szp (delg) typo/breaking (5A) 10 / 23

breaking text into lines

human intervention

explicit specification of breaking points

is necessary when automatic breaking points are not appropriate, i.e.,

kz & szp (delg) typo/breaking (5A) 11 / 23

breaking text into lines

human intervention

explicit specification of breaking points

is necessary when automatic breaking points are not appropriate, i.e.,

◮ a line should not be broken at an automatic breaking point

kz & szp (delg) typo/breaking (5A) 11 / 23

breaking text into lines

human intervention

explicit specification of breaking points

is necessary when automatic breaking points are not appropriate, i.e.,

◮ a line should not be broken at an automatic breaking point

◮ a line should be broken, but there is no automatic breaking point

kz & szp (delg) typo/breaking (5A) 11 / 23

breaking text into lines

human intervention

explicit specification of breaking points

is necessary when automatic breaking points are not appropriate, i.e.,

◮ a line should not be broken at an automatic breaking point

◮ a line should be broken, but there is no automatic breaking point

do not break at breaking point: nonbreaking space

a.k.a. hard space, is like SPACE except that it is not a breaking point

kz & szp (delg) typo/breaking (5A) 11 / 23

breaking text into lines

human intervention

explicit specification of breaking points

is necessary when automatic breaking points are not appropriate, i.e.,

◮ a line should not be broken at an automatic breaking point

◮ a line should be broken, but there is no automatic breaking point

do not break at breaking point: nonbreaking space

a.k.a. hard space, is like SPACE except that it is not a breaking point

◮ symbols:   (HTML), ~ (TEX, LATEX), in word processors usually

shown as a grey space: ·

kz & szp (delg) typo/breaking (5A) 11 / 23

breaking text into lines

human intervention

explicit specification of breaking points

is necessary when automatic breaking points are not appropriate, i.e.,

◮ a line should not be broken at an automatic breaking point

◮ a line should be broken, but there is no automatic breaking point

do not break at breaking point: nonbreaking space

a.k.a. hard space, is like SPACE except that it is not a breaking point

◮ symbols:   (HTML), ~ (TEX, LATEX), in word processors usually

shown as a grey space: ·

◮ examples of use: Henry ·VIII, VIII. ·Henrik, G. ·B. ·Shaw,

2 ·+ ·2 ·= ·4

kz & szp (delg) typo/breaking (5A) 11 / 23

breaking text into lines

human intervention

explicit specification of breaking points

is necessary when automatic breaking points are not appropriate, i.e.,

◮ a line should not be broken at an automatic breaking point

◮ a line should be broken, but there is no automatic breaking point

do not break at breaking point: nonbreaking space

a.k.a. hard space, is like SPACE except that it is not a breaking point

◮ symbols:   (HTML), ~ (TEX, LATEX), in word processors usually

shown as a grey space: ·

◮ examples of use: Henry ·VIII, VIII. ·Henrik, G. ·B. ·Shaw,

2 ·+ ·2 ·= ·4

◮ longer stretches: <nobr>text not to be broken</nobr>

kz & szp (delg) typo/breaking (5A) 11 / 23

breaking text into lines

other breaking points in lines

other points where lines may automatically be broken

are software specific; they may include the following

kz & szp (delg) typo/breaking (5A) 12 / 23

breaking text into lines

other breaking points in lines

other points where lines may automatically be broken

are software specific; they may include the following

◮ hyphen (“-”)

kz & szp (delg) typo/breaking (5A) 12 / 23

breaking text into lines

other breaking points in lines

other points where lines may automatically be broken

are software specific; they may include the following

◮ hyphen (“-”)

◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)

kz & szp (delg) typo/breaking (5A) 12 / 23

breaking text into lines

other breaking points in lines

other points where lines may automatically be broken

are software specific; they may include the following

◮ hyphen (“-”)

◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)

it was not a very bright idea to let lines break automatically at a slash orclosing parens (e.g., they say that Michael Moore’s film, Fahrenheit 9/11, is rather biassed)

kz & szp (delg) typo/breaking (5A) 12 / 23

breaking text into lines

other breaking points in lines

other points where lines may automatically be broken

are software specific; they may include the following

◮ hyphen (“-”)

◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)

it was not a very bright idea to let lines break automatically at a slash orclosing parens (e.g., they say that Michael Moore’s film, Fahrenheit 9/11, is rather biassed)

inhibiting breaking

if the line is automatically broken at a hyphen, this can be inhibited by anonbreaking (a.k.a. hard) hyphen: - , e.g.,

kz & szp (delg) typo/breaking (5A) 12 / 23

breaking text into lines

other breaking points in lines

other points where lines may automatically be broken

are software specific; they may include the following

◮ hyphen (“-”)

◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)

it was not a very bright idea to let lines break automatically at a slash orclosing parens (e.g., they say that Michael Moore’s film, Fahrenheit 9/11, is rather biassed)

inhibiting breaking

if the line is automatically broken at a hyphen, this can be inhibited by anonbreaking (a.k.a. hard) hyphen: - , e.g.,

◮ talking about the suffix -ing, 1969-07-20

kz & szp (delg) typo/breaking (5A) 12 / 23

breaking text into lines

other breaking points in lines

other points where lines may automatically be broken

are software specific; they may include the following

◮ hyphen (“-”)

◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)

it was not a very bright idea to let lines break automatically at a slash orclosing parens (e.g., they say that Michael Moore’s film, Fahrenheit 9/11, is rather biassed)

inhibiting breaking

if the line is automatically broken at a hyphen, this can be inhibited by anonbreaking (a.k.a. hard) hyphen: - , e.g.,

◮ talking about the suffix -ing, 1969-07-20

◮ rajzolhatunk-e?

kz & szp (delg) typo/breaking (5A) 12 / 23

breaking text into lines

breaking lines elsewhere

if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is

kz & szp (delg) typo/breaking (5A) 13 / 23

breaking text into lines

breaking lines elsewhere

if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is

hyphenation

kz & szp (delg) typo/breaking (5A) 13 / 23

breaking text into lines

breaking lines elsewhere

if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is

hyphenation

◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish

kz & szp (delg) typo/breaking (5A) 13 / 23

breaking text into lines

breaking lines elsewhere

if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is

hyphenation

◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish

◮ some technically correct hyphen positions are

kz & szp (delg) typo/breaking (5A) 13 / 23

breaking text into lines

breaking lines elsewhere

if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is

hyphenation

◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish

◮ some technically correct hyphen positions are◮ esthetically displeasing: e.g., a-roma, Indi-a, (small bits of words are

not hyphenated off)

kz & szp (delg) typo/breaking (5A) 13 / 23

breaking text into lines

breaking lines elsewhere

if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is

hyphenation

◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish

◮ some technically correct hyphen positions are◮ esthetically displeasing: e.g., a-roma, Indi-a, (small bits of words are

not hyphenated off)◮ morphologically misleading: e.g., elo-dong vs. el-odong ‘saunter about’

(it is better to hyphenate between morphemes)

kz & szp (delg) typo/breaking (5A) 13 / 23

breaking text into lines

breaking lines elsewhere

if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is

hyphenation

◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish

◮ some technically correct hyphen positions are◮ esthetically displeasing: e.g., a-roma, Indi-a, (small bits of words are

not hyphenated off)◮ morphologically misleading: e.g., elo-dong vs. el-odong ‘saunter about’

(it is better to hyphenate between morphemes)

◮ even the best hyphenation algorithms occasionally fail

kz & szp (delg) typo/breaking (5A) 13 / 23

breaking text into lines

soft hyphen

a soft/optional/discretionary hyphen

is manually inserted at positions where the hyphenation algorithm fails, itonly appears if the position occurs at the end of the line —&shy; (in HTML), \- (in TEX, LATEX), - -appendix (in word processors)

kz & szp (delg) typo/breaking (5A) 14 / 23

breaking text into lines

tricky hyphenation conventions

◮ Polish kung-fu → kung- -fu

kz & szp (delg) typo/breaking (5A) 15 / 23

breaking text into lines

tricky hyphenation conventions

◮ Polish kung-fu → kung- -fu

◮ Hungarian vissza → visz- sza

kz & szp (delg) typo/breaking (5A) 15 / 23

breaking text into lines

tricky hyphenation conventions

◮ Polish kung-fu → kung- -fu

◮ Hungarian vissza → visz- sza

◮ Norwegian bussjafør → buss- sjafør

kz & szp (delg) typo/breaking (5A) 15 / 23

breaking text into lines

tricky hyphenation conventions

◮ Polish kung-fu → kung- -fu

◮ Hungarian vissza → visz- sza

◮ Norwegian bussjafør → buss- sjafør

◮ Dutch omaatje → oma- tje

kz & szp (delg) typo/breaking (5A) 15 / 23

breaking text into lines

tricky hyphenation conventions

◮ Polish kung-fu → kung- -fu

◮ Hungarian vissza → visz- sza

◮ Norwegian bussjafør → buss- sjafør

◮ Dutch omaatje → oma- tje

◮ German Becker → Bek- ker (the new orthography dictates Be- cker)

kz & szp (delg) typo/breaking (5A) 15 / 23

breaking text into lines

tricky hyphenation conventions

◮ Polish kung-fu → kung- -fu

◮ Hungarian vissza → visz- sza

◮ Norwegian bussjafør → buss- sjafør

◮ Dutch omaatje → oma- tje

◮ German Becker → Bek- ker (the new orthography dictates Be- cker)

◮ Greek Maòou → Ma- �ou (cf. ma�a → mai- a)

kz & szp (delg) typo/breaking (5A) 15 / 23

breaking text into lines

tricky hyphenation conventions

◮ Polish kung-fu → kung- -fu

◮ Hungarian vissza → visz- sza

◮ Norwegian bussjafør → buss- sjafør

◮ Dutch omaatje → oma- tje

◮ German Becker → Bek- ker (the new orthography dictates Be- cker)

◮ Greek Maòou → Ma- �ou (cf. ma�a → mai- a)

discretionary

such patterns are difficult to deal with, but TEX/LATEX provide a way, e.g.,vi\discretionary{sz-}{sz}{ssz}a,oma\discretionary{-}{}{a}tje give the correct output, but still allsuch patterns must be marked

kz & szp (delg) typo/breaking (5A) 15 / 23

breaking text into lines

widow/orphan word

widow/orphan word

many consider it ugly to leave (part of) a single (especially short) word inthe last line of a paragraph (it is esthetically displeasing)

kz & szp (delg) typo/breaking (5A) 16 / 23

breaking text into lines

widow/orphan word

widow/orphan word

many consider it ugly to leave (part of) a single (especially short) word inthe last line of a paragraph (it is esthetically displeasing)

examples

Lorem ipsum dolor sit amet, consectetuer adipiscing elit. Quisque sed nunc eu-

ismod tortor ornare rutrum. Suspendisse sagittis odio non metus. Praesent sem

magna, pretium vel, elementum at, consectetuer et, mi. Donec aliquet tortor ut

ipsum. Vivamus et arcu sed lacus rhoncus vulputate. Ut condimentum, erat vel

lobortis faucibus, neque quam faucibus sapien, eget dapibus magna metus ve-

lit.

Phasellus non odio. Ut sollicitudin. Vestibulum blandit, massa at vehicula

varius, dui enim porttitor augue, id auctor est metus a nisi. Quisque ligula mi,

aliquam ac, consectetuer nonummy, fermentum eu, urna. Sed aliquam felis quis

quam. Pellentesque non metus. Nisi vestibulum nisi elementum amoris tutae

sit.

kz & szp (delg) typo/breaking (5A) 16 / 23

breaking text into pages

breaking text into pages

◮ text is broken into lines that form paragraphs

kz & szp (delg) typo/breaking (5A) 17 / 23

breaking text into pages

breaking text into pages

◮ text is broken into lines that form paragraphs

◮ paragraphs are broken into pages

kz & szp (delg) typo/breaking (5A) 17 / 23

breaking text into pages

breaking text into pages

◮ text is broken into lines that form paragraphs

◮ paragraphs are broken into pages

◮ page breaking is the vertical analogue of line breaking

kz & szp (delg) typo/breaking (5A) 17 / 23

breaking text into pages

breaking text into pages

◮ text is broken into lines that form paragraphs

◮ paragraphs are broken into pages

◮ page breaking is the vertical analogue of line breaking

default page breaks

the best place to break text into pages is between two paragraphs (theanalogue of SPACE )

kz & szp (delg) typo/breaking (5A) 17 / 23

breaking text into pages

breaking text into pages

◮ text is broken into lines that form paragraphs

◮ paragraphs are broken into pages

◮ page breaking is the vertical analogue of line breaking

default page breaks

the best place to break text into pages is between two paragraphs (theanalogue of SPACE )

except

between a secion title and the first paragraph of the section!

kz & szp (delg) typo/breaking (5A) 17 / 23

breaking text into pages

breaking text into pages

◮ text is broken into lines that form paragraphs

◮ paragraphs are broken into pages

◮ page breaking is the vertical analogue of line breaking

default page breaks

the best place to break text into pages is between two paragraphs (theanalogue of SPACE )

except

between a secion title and the first paragraph of the section!(most word processors and text formatters take care of this after paragraphsmarked as a heading: another advantage of marking document structure)

kz & szp (delg) typo/breaking (5A) 17 / 23

breaking text into pages

breaks within a paragraph

usually page breaks fall within a paragraph (the analogue of hyphenation)

paragraphs

in the default case, can be broken between any two lines

kz & szp (delg) typo/breaking (5A) 18 / 23

breaking text into pages

breaks within a paragraph

usually page breaks fall within a paragraph (the analogue of hyphenation)

paragraphs

in the default case, can be broken between any two lines

restrictions

avoid breaking a paragraph

kz & szp (delg) typo/breaking (5A) 18 / 23

breaking text into pages

breaks within a paragraph

usually page breaks fall within a paragraph (the analogue of hyphenation)

paragraphs

in the default case, can be broken between any two lines

restrictions

avoid breaking a paragraph

◮ after a line that ends in a hyphenated word, esp. on a recto page

kz & szp (delg) typo/breaking (5A) 18 / 23

breaking text into pages

breaks within a paragraph

usually page breaks fall within a paragraph (the analogue of hyphenation)

paragraphs

in the default case, can be broken between any two lines

restrictions

avoid breaking a paragraph

◮ after a line that ends in a hyphenated word, esp. on a recto page

◮ before the last line (creating a widow or orphan line)

kz & szp (delg) typo/breaking (5A) 18 / 23

breaking text into pages

breaks within a paragraph

usually page breaks fall within a paragraph (the analogue of hyphenation)

paragraphs

in the default case, can be broken between any two lines

restrictions

avoid breaking a paragraph

◮ after a line that ends in a hyphenated word, esp. on a recto page

◮ before the last line (creating a widow or orphan line)

◮ after the first line (creating a club line)

kz & szp (delg) typo/breaking (5A) 18 / 23

breaking text into pages

breaks within a paragraph

usually page breaks fall within a paragraph (the analogue of hyphenation)

paragraphs

in the default case, can be broken between any two lines

restrictions

avoid breaking a paragraph

◮ after a line that ends in a hyphenated word, esp. on a recto page

◮ before the last line (creating a widow or orphan line)

◮ after the first line (creating a club line)

(cf. the inhibition of hyphenating off small bits of words)

kz & szp (delg) typo/breaking (5A) 18 / 23

breaking text into pages

widow and club lines

a widow line

Lorem ipsum dolor sit amet, con-sectetuer adipiscing elit. Aliquamjusto quam, porta et, lacinia vel,molestie ac, arcu. Curabitur inurna. Donec eu sapien. Vestibu-lum suscipit. Etiam lorem. Donecpurus. Etiam odio nisi, sem-per nec, feugiat eget, faucibusvel, turpis. Nam tincidunt plac-erat felis. Phasellus ac ante necnunc lobortis malesuada. Donecac massa. Morbi id tellus. Crasvel felis. Pellentesque lacus pede,pretium at, elementum in, ul-lamcorper in, pede. Cras nequenunc, dapibus ut, sodales sit amet,cursus at, velit. In vitæ velitsit amet metus vulputate con-sectetuer. Fusce eu nibh. Nam id

tortor.Sed diam. Curabitur sapien

enim, rutrum id, iaculis sed, mat-tis vel, nisi. In fermentum pellen-tesque turpis. Mauris sed massaat mauris suscipit suscipit. Sedvel lectus in neque porttitor mat-tis. Suspendisse condimentum dic-tum arcu. Phasellus eget do-lor quis enim interdum tempor.Nam lobortis. Nam augue est,lacinia non, tempor in, placeratut, neque. Phasellus convallis do-lor vel velit. Nam nec enim eu felismalesuada feugiat. Donec turpis.Nulla vel tortor id felis aliquetpretium. Nullam sit amet erat infelis eleifend egestas.

Morbi vel tortor sed tortor con-

sectetuer mollis. Aliquam maurisnunc, fermentum sit amet, fermen-tum vitæ, venenatis et, tellus.

Cras et ipsum. Etiam so-dales tempus mauris. Ut fermen-tum fermentum nisi. Cras nonmassa. Sed nunc. Donec facili-sis diam non nunc. Aliquam fer-mentum bibendum dolor. Duismalesuada ultrices ipsum. Fuscediam. Suspendisse pulvinar do-lor iaculis magna. Vestibulum so-dales. Vestibulum erat. Sus-pendisse potenti. Sed imperdietpretium erat. Morbi molestie,leo nec pharetra facilisis, sapiensapien vulputate erat, id aliquamvelit nulla non eros. Nulla facilisi.Ænean nec diam. Proin sapien dui.

a club line

kz & szp (delg) typo/breaking (5A) 19 / 23

breaking text footnotes

breaking footnotes

◮ a footnote must appear on the page it is referred to

kz & szp (delg) typo/breaking (5A) 20 / 23

breaking text footnotes

breaking footnotes

◮ a footnote must appear on the page it is referred to

◮ if this is impossible, it must begin on the page it is referred to

kz & szp (delg) typo/breaking (5A) 20 / 23

breaking text footnotes

breaking footnotes

◮ a footnote must appear on the page it is referred to

◮ if this is impossible, it must begin on the page it is referred to

◮ in the latter case, the footnote has to be broken between pages

kz & szp (delg) typo/breaking (5A) 20 / 23

breaking text footnotes

breaking footnotes

◮ a footnote must appear on the page it is referred to

◮ if this is impossible, it must begin on the page it is referred to

◮ in the latter case, the footnote has to be broken between pages

◮ it is better to break a footnote on a verso page than on a recto page

kz & szp (delg) typo/breaking (5A) 20 / 23

breaking text footnotes

breaking footnotes

◮ a footnote must appear on the page it is referred to

◮ if this is impossible, it must begin on the page it is referred to

◮ in the latter case, the footnote has to be broken between pages

◮ it is better to break a footnote on a verso page than on a recto page;why?

kz & szp (delg) typo/breaking (5A) 20 / 23

breaking text footnotes

breaking footnotes

◮ a footnote must appear on the page it is referred to

◮ if this is impossible, it must begin on the page it is referred to

◮ in the latter case, the footnote has to be broken between pages

◮ it is better to break a footnote on a verso page than on a recto page;why? because the reader does not have to turn the page this way

kz & szp (delg) typo/breaking (5A) 20 / 23

breaking text footnotes

breaking footnotes

◮ a footnote must appear on the page it is referred to

◮ if this is impossible, it must begin on the page it is referred to

◮ in the latter case, the footnote has to be broken between pages

◮ it is better to break a footnote on a verso page than on a recto page;why? because the reader does not have to turn the page this way

◮ before the break in a footnote the text should end abruptly (by ahyphenated word, not by the end of a sentence), so that the reader iswarned that the footnote did not end

kz & szp (delg) typo/breaking (5A) 20 / 23

breaking text footnotes

breaking footnotes

◮ a footnote must appear on the page it is referred to

◮ if this is impossible, it must begin on the page it is referred to

◮ in the latter case, the footnote has to be broken between pages

◮ it is better to break a footnote on a verso page than on a recto page;why? because the reader does not have to turn the page this way

◮ before the break in a footnote the text should end abruptly (by ahyphenated word, not by the end of a sentence), so that the reader iswarned that the footnote did not end

◮ a footnote continued on a following page is sometimes marked by aline above it

kz & szp (delg) typo/breaking (5A) 20 / 23

breaking text footnotes

breaking footnotes

kz & szp (delg) typo/breaking (5A) 21 / 23

conclusions

conclusions

you must always think twice before

◮ breaking a line with an explicit line break

kz & szp (delg) typo/breaking (5A) 22 / 23

conclusions

conclusions

you must always think twice before

◮ breaking a line with an explicit line break

◮ hyphenating a word with an explicit hyphen

kz & szp (delg) typo/breaking (5A) 22 / 23

conclusions

conclusions

you must always think twice before

◮ breaking a line with an explicit line break

◮ hyphenating a word with an explicit hyphen

◮ breaking a paragraph with an explicit page break

kz & szp (delg) typo/breaking (5A) 22 / 23

sample exam questions

sample exam questions

Which of the following is an inherent property of a text?

1. the size of the letters

2. the width of the text block

3. the height of the text block

4. the italicization of words

kz & szp (delg) typo/breaking (5A) 23 / 23

sample exam questions

sample exam questions

Which of the following is an inherent property of a text?

1. the size of the letters

2. the width of the text block

3. the height of the text block

4. the italicization of words

kz & szp (delg) typo/breaking (5A) 23 / 23

sample exam questions

sample exam questions

Which of the following is an inherent property of a text?

1. the size of the letters

2. the width of the text block

3. the height of the text block

4. the italicization of words

What is a club line?

1. the last line of a paragraph with the rest on the preceding page

2. the first line of a paragraph with the rest on the following page

3. a thick line separating the text body and the footnotes

4. a line under the header of a page.

kz & szp (delg) typo/breaking (5A) 23 / 23

sample exam questions

sample exam questions

Which of the following is an inherent property of a text?

1. the size of the letters

2. the width of the text block

3. the height of the text block

4. the italicization of words

What is a club line?

1. the last line of a paragraph with the rest on the preceding page

2. the first line of a paragraph with the rest on the following page

3. a thick line separating the text body and the footnotes

4. a line under the header of a page.

kz & szp (delg) typo/breaking (5A) 23 / 23