Quantitative summary (Version 10.0.2)

Quantitative summary of present corpus

COMPARA version 10.0.2, 8 Apr 2008

Aligned texts


Alignment units


Number of words


Words per alignment unit


Types


Tokens


Type/token ratio


Textual tags

Semantic tags

Distribution of texts per language variety

Language variety               Total of different texts  Source texts (different)  Translations
PT Angola                      2                         2                         0
PT Brazil                      28                        25 (24)                   3
PT Mozambique                  2                         2                         0
PT Portugal                    43                        12                        31
EN South Africa                4                         4                         0
EN United States               32                        7                         25
EN United Kingdom and Ireland  39                        23 (21)                   16

Distribution of words per language variety

Language variety               Total of different words  Words in source texts (different)  Words in translations
PT Angola                      5454                      5454                               0
PT Brazil                      557711                    466676 (463483)                    91035
PT Mozambique                  20115                     20115                              0
PT Portugal                    1134134                   281578                             852556
EN South Africa                101697                    101697                             0
EN United States               673046                    170328                             502718
EN United Kingdom and Ireland  1021150                   677863 (615308)                    343287
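
In the word table above, each total appears to be the sum of the words in source texts and the words in translations. A minimal Python sketch of that consistency check follows, with the figures copied from the table. The names `WORD_COUNTS`, `check_totals` and `words_by_language` are illustrative only and are not part of COMPARA. The parenthesised "different" figures are left out of the check.

```python
# Word counts per language variety, copied from the table above:
# (total words, words in source texts, words in translations).
# The parenthesised "different" figures are intentionally omitted.
WORD_COUNTS = {
    "PT Angola": (5454, 5454, 0),
    "PT Brazil": (557711, 466676, 91035),
    "PT Mozambique": (20115, 20115, 0),
    "PT Portugal": (1134134, 281578, 852556),
    "EN South Africa": (101697, 101697, 0),
    "EN United States": (673046, 170328, 502718),
    "EN United Kingdom and Ireland": (1021150, 677863, 343287),
}


def check_totals(counts):
    """Return the varieties whose total differs from source + translations."""
    return [
        name
        for name, (total, source, translated) in counts.items()
        if total != source + translated
    ]


def words_by_language(counts):
    """Sum the total column per language prefix ("PT" or "EN")."""
    sums = {}
    for name, (total, _source, _translated) in counts.items():
        language = name.split()[0]  # first token of the variety name
        sums[language] = sums.get(language, 0) + total
    return sums


if __name__ == "__main__":
    # An empty list means every row satisfies total = source + translations.
    print(check_totals(WORD_COUNTS))
    print(words_by_language(WORD_COUNTS))
```

The same pattern could be applied to the table of texts per language variety by swapping in its figures.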

Distribution of parallel texts per language variety

Source texts \ Translations         PA  PB  PM  PP  ES  EU  EB
PT Angola (PA)                      -   -   -   -   0   2   0
PT Brazil (PB)                      -   -   -   -   0   2   4
PT Mozambique (PM)                  -   -   -   -   0   0   2
PT Portugal (PP)                    -   -   -   -   0   3   9
EN South Africa (ES)                0   1   0   3   -   -   -
EN United States (EU)               0   0   0   7   -   -   -
EN United Kingdom and Ireland (EB)  0   2   0   21  -   -   -

Type of alignment in COMPARA
