COMPARA corpus contents
version 6.3

7-Aug-2005

Quantitative summary of currently available texts
Text-pairs available
Previous versions


Quantitative summary of present corpus

Aligned texts    Source texts    Translations
Portuguese       34              27
English          25              35
Total            59              62

Alignment units
Portuguese to English direction 39217
English to Portuguese direction 45369
Total 84586

Words                   Source texts    Translations    Source texts & translations
Portuguese              542862          684807          1227669
English                 702277          620545          1322822
Portuguese & English    1245139         1305352         2550491

Words/alignment unit    Source texts    Translations    Source texts & translations
Portuguese              13.84           15.09           14.51
English                 15.48           15.82           15.64
Portuguese & English    14.72           15.43           30.15
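The figures in this table can be reproduced from the word counts and alignment-unit totals listed above. A minimal Python sketch follows; the names are illustrative and not part of COMPARA. It assumes that Portuguese source texts and English translations share the Portuguese-to-English units, and that English source texts and Portuguese translations share the English-to-Portuguese units.

```python
# Counts copied from the summary tables; all names here are illustrative.
UNITS_PT_EN = 39217   # Portuguese to English direction
UNITS_EN_PT = 45369   # English to Portuguese direction
UNITS_TOTAL = UNITS_PT_EN + UNITS_EN_PT

WORDS = {
    ("Portuguese", "source"): 542862,
    ("Portuguese", "translation"): 684807,
    ("English", "source"): 702277,
    ("English", "translation"): 620545,
}


def words_per_unit(words, units):
    """Average number of words per alignment unit, to two decimals."""
    return round(words / units, 2)


# Each cell pairs a word count with the units of the direction it belongs to.
ratios = {
    ("Portuguese", "source"): words_per_unit(
        WORDS[("Portuguese", "source")], UNITS_PT_EN),
    ("Portuguese", "translation"): words_per_unit(
        WORDS[("Portuguese", "translation")], UNITS_EN_PT),
    ("English", "source"): words_per_unit(
        WORDS[("English", "source")], UNITS_EN_PT),
    ("English", "translation"): words_per_unit(
        WORDS[("English", "translation")], UNITS_PT_EN),
}

# Bottom-right cell: words of both languages over all alignment units.
overall = words_per_unit(sum(WORDS.values()), UNITS_TOTAL)

for key, value in ratios.items():
    print(key, value)
print("Portuguese & English, source texts & translations:", overall)
```

The last figure divides the words of both languages by the number of alignment units, so it counts both sides of each aligned pair.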

Types         Source texts    Translations    Source texts & translations
Portuguese    66012           57403           73929
English       32753           44051           46557

Tokens                  Source texts    Translations    Source texts & translations
Portuguese              658955          812299          1471254
English                 816992          724499          1541295
Portuguese & English    1475947         1536798         3012549

Type/token ratio    Source texts    Translations    Source texts & translations
Portuguese          0.1216          0.0838          0.0602
English             0.0466          0.0710          0.0352
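The ratios in this table match the type counts divided by the word counts given above, rather than by the token counts. That is an observation from the arithmetic, not a documented definition. A short Python check, with illustrative names:

```python
# Columns: source texts, translations, source texts & translations.
TYPES = {
    "Portuguese": (66012, 57403, 73929),
    "English": (32753, 44051, 46557),
}
WORDS = {
    "Portuguese": (542862, 684807, 1227669),
    "English": (702277, 620545, 1322822),
}
# Ratios as listed in the table above.
PUBLISHED = {
    "Portuguese": (0.1216, 0.0838, 0.0602),
    "English": (0.0466, 0.0710, 0.0352),
}


def type_ratio(types, running_words):
    """Distinct forms per running word, to four decimals."""
    return round(types / running_words, 4)


computed = {
    language: tuple(
        type_ratio(t, w) for t, w in zip(TYPES[language], WORDS[language])
    )
    for language in TYPES
}

for language in computed:
    print(language, computed[language], "listed:", PUBLISHED[language])
```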

              Translation notes    Foreign words or expressions    Named entities    Titles    Emphasis
Portuguese    92                   1915                            375               635       685
English       131                  953                             126               617       994
Total         223                  2868                            501               1252      1679

Language variety    Words in Portuguese    Words in English    Alignment units    Texts
United States       591969                 547775              1383               27
Angola              2085                   2145                91                 1
Brazil              506824                 547171              3111               25
United Kingdom      876310                 847537              1124               32
Portugal            941894                 967976              1124               34
Mozambique          20128                  24397               1229               2
South Africa        73410                  75619               1038               3

Type of alignment in COMPARA

Type 1-1 1-1/2 1-2 1-0 1-3 1-1/3 1-1/4 1-4 1-5 1-1/5 1-6 1-8 1-7 1-9
Total 76203 3773 3282 403 370 364 69 66 23 15 10 4 3 1
% 90.09 4.46 3.88 0.48 0.44 0.43 0.08 0.08 0.03 0.02 0.01 0.00 0.00 0.00
EBDL1T1 2014 50 60 17 2 6
EBDL1T2 2064 33 33 9 1 4 5
EBDL2 1438 16 11 6
EBDL3T1 1453 10 12 5 1
EBDL3T2 1423 30 27 1
EBDL4 1345 7 18 2 2 3
EBDL5 1799 63 126 16 8 1
EBDL6 2419 270 71 2 3 18 12 1
EBIM1 940 24 10 1 3
EBJB1 1174 12 9 4
EBJB2 1379 29 48 4 4 4
EBJC1 737 12 30 1
EBJT1 1886 122 30 23 2 11 4
EBJT2 2225 94 36 15 2 3
EBJT3 1627 161 40 15 19
EBKI1 4401 188 30 20 3 15 1
EBLC1 581 10 29 1
EBOW1 1645 50 26 9 6
ESNG1 627 12 33 2
ESNG2 1864 44 32 4 4 9
ESNG3 1010 4 19 4 1
EUEP1 725 73 41 1 9 3 1 2
EUHJ1 994 58 24 2
EUHJ2 446 75 98 5 9 8 3
EUHJ3 443 41 114 1 5 1 3
EURZ1 3226 236 44 9 1 9 8 2 5
EURZ2 2094 128 39 31 2 9
PAJA1 71 9 5 5 1
PBAA1 218 12 51 2 6 2 1
PBAA2 1616 105 245 5 62 14 4 7 2 1
PBAD1 1242 12 17 1 1
PBAD2 1407 17 9 1
PBCB1 433 2 7 1
PBJA1T1 209 4 4
PBJA1T2 188 6 20 3
PBMA1 673 6 128 1 30 3 1
PBMA2 361 7 106 7 24 1 2
PBMA3 654 3 21 2
PBMA4 771 9 111 10 1
PBMA5 1122 57 133 2 5 2 1
PBMA6 1002 28 78 1 8 1
PBMAA1 342 29 105 23 2 1 1
PBMR1 1921 26 26 6
PBOL1 1258 16 3
PBPC1 707 33 17 15
PBPC2 1106 66 46 12 2 6
PBPM1 876 24 93 3 19 6 2 1 1
PBPM2 1434 22 209 3 40 16 4 6 3 1
PBRF1 2669 59 64 8 2 3 1
PBRF2 2801 123 123 32 15 12 1 3 1
PMMC1 715 30 9 4
PMMC2 1198 15 7 9
PPCC1 1143 30 28 6 2
PPCP1 469 132 147 42 16 64 14 7 3 5
PPEQ1 220 22 32 10 13 2 1
PPEQ2 1226 87 76 3 3 9 4 1
PPEQ3 297 31 23 4 1 3
PPJS1 2379 644 166 5 11 90 8 5
PPJSA1 724 19 28 3 1
PPMC1 1278 65 29 5 5 1
PPSC1 541 55 11 4 1 3
PPSC2 953 116 15 10 2 24 4
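The percentage row can be recomputed from the totals row, and a per-text row can be checked against the alignment-unit count listed for that text under "Text-pairs available". A minimal Python sketch, with illustrative names; the per-text rows are only summed, not assigned to columns, because empty cells are not marked in this listing.

```python
TYPE_LABELS = ["1-1", "1-1/2", "1-2", "1-0", "1-3", "1-1/3", "1-1/4",
               "1-4", "1-5", "1-1/5", "1-6", "1-8", "1-7", "1-9"]
TOTALS = [76203, 3773, 3282, 403, 370, 364, 69, 66, 23, 15, 10, 4, 3, 1]

# Share of each alignment type in the whole corpus, to two decimals.
grand_total = sum(TOTALS)
percentages = {
    label: round(100 * count / grand_total, 2)
    for label, count in zip(TYPE_LABELS, TOTALS)
}

# Two per-text rows copied from the table, with the alignment-unit
# counts given for the same texts in the text-pair listing.
ROW_COUNTS = {
    "PAJA1": [71, 9, 5, 5, 1],
    "EBDL1T1": [2014, 50, 60, 17, 2, 6],
}
LISTED_UNITS = {"PAJA1": 91, "EBDL1T1": 2149}

row_sums = {text_id: sum(counts) for text_id, counts in ROW_COUNTS.items()}

print(grand_total)
print(percentages)
print(row_sums)
```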


Text-pairs available

Publication dates in bold refer to first editions
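
Each text in the listing is followed by a SOURCE: or TRANSLATION: line giving language, variety and counts. A minimal Python sketch for reading those lines is given here. The pattern is an assumption inferred from the lines in this listing, not a published format, and the function and field names are illustrative.

```python
import re

# Inferred layout: ROLE: Language, Variety; N tokens, N words, N types
# optionally followed by ", N alignment units" or ". N alignment units".
STATS_LINE = re.compile(
    r"^(?P<role>SOURCE|TRANSLATION): (?P<language>[^,]+), (?P<variety>[^;]+); "
    r"(?P<tokens>\d+) tokens, (?P<words>\d+) words, (?P<types>\d+) types"
    r"(?:[.,] (?P<units>\d+) alignment units)?\.$"
)


def parse_stats(line):
    """Return the fields of one statistics line as a dict, or None."""
    match = STATS_LINE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    for key in ("tokens", "words", "types", "units"):
        if record[key] is not None:
            record[key] = int(record[key])
    return record


source = parse_stats(
    "SOURCE: Portuguese, Angola; 2086 tokens, 1812 words, 924 types, "
    "91 alignment units."
)
translation = parse_stats(
    "TRANSLATION: English, US; 2136 tokens, 1860 words, 845 types."
)
print(source)
print(translation)
```

Translation lines carry no alignment-unit count, so that field is left as None for them.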

PAJA1 Agualusa, José Eduardo

1992 A Feira dos Assombrados. Lisbon: Vega 1992, pp 11-17. Copyright Vega © 1992. Used by permission of José Eduardo Agualusa and Vega.
SOURCE: Portuguese, Angola; 2086 tokens, 1812 words, 924 types, 91 alignment units.

1994 Shadowtown translated by Richard Zenith. Prague: Trafika. Digital text by Richard Zenith. Used by permission of Trafika and Richard Zenith.
TRANSLATION: English, US; 2136 tokens, 1860 words, 845 types.

PBJA1T1 Alencar, José de

1865 Iracema http://www.bibvirt.futuro.usp.br/textos/autores/josedealencar/iracema/iracema.html , pp 1-12 [06/12/1999]. Corpus text extracted from digital text prepared by the Biblioteca Virtual do Estudante Brasileiro, based on 24th ed., São Paulo: Ática, 1991.
SOURCE: Portuguese, Brazil; 3744 tokens, 3199 words, 1248 types, 217 alignment units.

2000 Iracema translated by Clifford Landers. Corpus text based on New York: Oxford University Press, 2000, pp 1-17. Copyright © 2000 by Oxford University Press. Used by permission of Oxford University Press.
TRANSLATION: English, US; 4197 tokens, 3623 words, 1176 types.

PBJA1T2 Alencar, José de

1865 Iracema http://www.bibvirt.futuro.usp.br/textos/autores/josedealencar/iracema/iracema.html , pp 1-12 [06/12/1999]. Corpus text extracted from digital text prepared by the Biblioteca Virtual do Estudante Brasileiro, based on 24th ed., São Paulo: Ática, 1991.
SOURCE: Portuguese, Brazil; 3744 tokens, 3199 words, 1248 types, 217 alignment units.

1886 Iracema, the honey lips: a legend of Brazil translated by Lady Isabel Burton. Corpus text based on London: Bickers, 1886, pp 1-17.
TRANSLATION: English, UK; 4291 tokens, 3707 words, 1235 types.

PBMAA1 Almeida, Manuel Antônio de

1852 Memórias de um sargento de milícias http://www.bibvirt.futuro.usp.br/textos/autores/manuelantonio/sargento/sargento.html [08/08/2003], pp 76-101. Electronic version by Artur Araújo for the Biblioteca Virtual do Estudante Brasileiro, based on the 25th edition, São Paulo: Ática, 1996.
SOURCE: Portuguese, Brazil; 13170 tokens, 11275 words, 2923 types, 503 alignment units.

1999 Memoirs of a Militia Sergeant, edited by Thomas Holloway and translated by Ronald W. Sousa, pp 100-130. Copyright © 1999 Oxford University Press Inc. Used by permission of Oxford University Press.
TRANSLATION: English, USA; 15097 tokens, 12939 words, 2585 types.

PBAA1 Azevedo, Aluísio

1890 O Cortiço, http://www.bibvirt.futuro.usp.br/textos/autores/aluisioazevedo/cortico/cortico.html [06/12/1999].
SOURCE: Portuguese, Brazil; 8328 tokens, 7146 words, 2376 types, 292 alignment units.

2000 The slum: a novel translated by David Rosenthal. New York: Oxford University Press, pp 1-17. Copyright © 2000 by Oxford University Press. Used by permission of Oxford University Press, Inc.
TRANSLATION: English, US; 7445 tokens, 6416 words, 2025 types.

PBAA2 Azevedo, Aluísio

1881 O Mulato http://vbookstore.uol.com.br/nacional/aluisioazevedo/mulato.shtml [03/01/2004].
SOURCE: Portuguese, Brazil; 40011 tokens, 32654 words, 7621 types, 2061 alignment units.

1990 Mulatto translated by Graeme MacNicoll. Cranbury: Associated University Presses, pp 31-119. Copyright © 1990 AUP. Used by permission of Associated University Presses, Inc.
TRANSLATION: English, US; 43779 tokens, 37896 words, 6585 types.

EBJB1 Barnes, Julian

1984 Flaubert's parrot Corpus text based on London: Picador, 1985, pp 11-65. Copyright © 1984 by Julian Barnes. Used by permission of Julian Barnes.
SOURCE: English, UK; 21443 tokens, 18525 words, 4612 types. 1199 alignment units.

1988 O papagaio de Flaubert translated by Ana Maria Amador. Corpus text based on Lisboa: Quetzal Editores, 1988, pp 11-74. Copyright © 1988 by Quetzal Editores. Used by permission of Quetzal Editores.
TRANSLATION: Portuguese, Portugal; 21043 tokens, 17837 words, 5017 types.

EBJB2 Barnes, Julian

1989 A History of the World in 10 ½ Chapters Corpus extract based on 1990 edition, London: Picador, pp. 3-80. Copyright © 1989 Julian Barnes. Used by permission of Julian Barnes.
SOURCE: English, UK; 31587 tokens, 28146 words, 5149 types. 1468 alignment units.

1990 A História do Mundo em 10 Capítulos e ½ translated by José Vieira de Lima. Lisboa: Quetzal, pp. 9-93. Copyright © 1989 Julian Barnes. Used by permission of Julian Barnes and Quetzal.
TRANSLATION: Portuguese, Portugal; 34607 tokens, 29940 words, 6721 types.

PBCB1 Buarque, Chico

1995 Benjamim São Paulo: Companhia das Letras, pp 9-52. Copyright © Chico Buarque 1995. Used by permission of Chico Buarque.
SOURCE: Portuguese, Brazil; 12124 tokens, 10599 words, 3513 types, 443 alignment units.

1997 Benjamin translated by Cliff Landers. London: Bloomsbury, pp 3-45. Copyright © Cliff Landers 1997. Used by permission of Cliff Landers.
TRANSLATION: English, UK; 13282 tokens, 11806 words, 3083 types.

PPCP1 Cardoso Pires, José

1983 Balada da Praia dos Cães Lisboa: Edições "O Jornal", pp 13-60. First published by Publicações Dom Quixote, 1982. Copyright © 1982 by José Cardoso Pires. Used by permission of Maria Edite Cardoso Pires.
SOURCE: Portuguese, Portugal; 17563 tokens, 14903 words, 4428 types, 899 alignment units.

1986 Ballad of Dog's Beach translated by Mary Fitton. London: John M. Dent, pp 5-38. Copyright © 1986 by John M. Dent. Used by permission of The Orion Publishing Group Ltd.
TRANSLATION: English, UK; 15425 tokens, 12840 words, 3873 types.

EBLC1 Carroll, Lewis

1871 Through the Looking Glass Corpus extract based on WWW Project Gutenberg 1994 lglass18.txt edition pp 7-18 & 24-33.
SOURCE: English, UK; 12972 tokens, 11072 words, 1630 types. 621 alignment units.

1977 Alice do Outro Lado do Espelho translated by Yolanda Artiaga, Nina Videira and Luís Lobo. Lisboa: Estampa, pp 25-47 & 63-79. Copyright © 1996 Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 12162 tokens, 9366 words, 2188 types.

PPMC1 Carvalho, Mário de

1994 Um deus passeando pela brisa da tarde Corpus text based on Lisboa: Caminho, 1995, pp 13-87. Copyright © 1994 Editorial Caminho. Used by permission of Editorial Caminho.
SOURCE: Portuguese, Portugal; 25019 tokens, 20839 words, 6463 types, 1383 alignment units.

1997 A god strolling in the cool of the evening translated by Gregory Rabassa. Corpus text based on London: Phoenix, 1999, pp 1-79. Copyright © 1997 by Louisiana State University Press. Used by permission of Louisiana State University Press.
TRANSLATION: English, US; 27097 tokens, 23532 words, 4892 types.

PPCC1 Castelo Branco, Camilo

1862 Amor de Perdição corpus extract taken from Obras integrais de Camilo Castelo Branco em CD-ROM. Projecto Vercial 2000, pp 60-92.
SOURCE: Portuguese, Portugal; 19072 tokens, 15684 words, 3962 types, 1209 alignment units.

2000 Doomed Love translated by Alice Clemente. Providence: Gávea Brown, pp 96-144. Copyright © 2000 Alice Clemente and Gávea-Brown Publications. Used by permission of Alice Clemente and Gávea-Brown Publications.
TRANSLATION: English, USA; 20410 tokens, 17132 words, 3009 types.

PBPC1 Coelho, Paulo

1988 O alquimista Corpus text based on Rio de Janeiro: Rocco, 2000, pp 21-79. Copyright © 1988 by Paulo Coelho. Used by permission of Paulo Coelho.
SOURCE: Portuguese, Brazil; 11742 tokens, 9933 words, 2241 types, 772 alignment units.

1993 The alchemist translated by Alan Clarke. Corpus text based on London: Thorsons, 1997, pp 3-50. Copyright © by Paulo Coelho & Alan Clarke. Used by permission of Paulo Coelho and Alan Clarke.
TRANSLATION: English, US; 13065 tokens, 11124 words, 1780 types.

PBPC2 Coelho, Paulo

1987 O Diário de um Mago Corpus text based on 2000 edition, Rio de Janeiro: Rocco, pp 163-233. Copyright © Paulo Coelho 1987. Used by permission of Paulo Coelho.
SOURCE: Portuguese, Brazil; 21236 tokens, 18342 words, 3865 types, 1238 alignment units.

1992 The Pilgrimage: a contemporary quest for ancient wisdom translated by Alan Clarke. Corpus text based on 1995 edition, New York: HarperCollins, pp 151-214. Copyright © Paulo Coelho and Alan Clarke 1992. Used by permission of Paulo Coelho and Alan Clarke.
TRANSLATION: English, USA; 23263 tokens, 20311 words, 3105 types.

EBJC1 Conrad, Joseph

1902 Heart of Darkness. http://www.gutenberg.net/etext96/hdark12a.txt Project Gutenberg, 1996, chapter 3.
SOURCE: English, UK; 14226 tokens, 12162 words, 2658 types. 780 alignment units.

1983 O Coração das Trevas. Translated by Aníbal Fernandes. Lisboa: Estampa, pp 95-131. Copyright © Editorial Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 13508 tokens, 11191 words, 3418 types.

PMMC1 Couto, Mia

1987 Vozes Anoitecidas. Lisboa: Editorial Caminho, pp 23-64. Copyright © Editorial Caminho 1987. Used by permission of Editorial Caminho
SOURCE: Portuguese, Mozambique; 7622 tokens, 6077 words, 2189 types, 758 alignment units.

1990 Voices Made Night translated by David Brookshaw. Oxford: Heinemann, pp 1-28. Copyright © 1990 by David Brookshaw. Used by permission of David Brookshaw.
TRANSLATION: English, UK; 9349 tokens, 8032 words, 1977 types.

PMMC2 Couto, Mia

1990 Cada Homem é uma Raça. Lisboa: Editorial Caminho, pp 11-68. Copyright © Editorial Caminho 1990. Used by permission of Editorial Caminho
SOURCE: Portuguese, Mozambique; 12508 tokens, 9933 words, 3365 types, 1229 alignment units.

1993 Every Man is a Race translated by David Brookshaw. Oxford: Heinemann, pp 1-37. Copyright © 1993 by David Brookshaw. Used by permission of David Brookshaw.
TRANSLATION: English, UK; 15026 tokens, 12789 words, 2784 types.

PBAD1 Dourado, Autran

1973 O Risco do Bordado Rio de Janeiro: Editora Expressão e Cultura, pp 13-68. Copyright © Autran Dourado 1970. Used by permission of Autran Dourado.
SOURCE: Portuguese, Brazil; 20246 tokens, 17131 words, 3784 types, 1273 alignment units.

1984 Pattern for a Tapestry translated by John Parker. London: Peter Owen, pp 13-54. Copyright © John M. Parker 1984. Used by permission of John Parker.
TRANSLATION: English, UK; 22346 tokens, 19288 words, 3370 types.

PBAD2 Dourado, Autran

1975 Os Sinos da Agonia Rio de Janeiro: Exped, pp.15-64. Copyright © Autran Dourado 1975. Used by permission of Autran Dourado.
SOURCE: Portuguese, Brazil; 28088 tokens, 23775 words, 5186 types, 1434 alignment units.

1988 The Bells of Agony translated by John Parker. London: Peter Owen, pp 13-65. Copyright © John M. Parker 1988. Used by permission of John Parker.
TRANSLATION: English, UK; 30795 tokens, 26494 words, 4516 types.

PPEQ1 Eça de Queirós, J. M.

1880 O mandarim http://www.ipn.pt/opsis/litera/queiros.htm pp 2-10 [10/12/1999]. Corpus text extracted from digital text prepared by Deolinda Cabrera for Projecto Vercial.
SOURCE: Portuguese, Portugal; 7280 tokens, 5996 words, 2497 types, 300 alignment units.

1993 The mandarin in The mandarin and other stories translated by Margaret Jull Costa. Corpus text based on Sawtry: Dedalus Ltd, 1993, pp 5-25. Used by permission of Margaret Jull Costa and Dedalus Press.
TRANSLATION: English, UK; 8213 tokens, 7262 words, 2416 types.

PPEQ2 Eça de Queirós, J.M.

1887 A Relíquia corpus extract taken from 1887 edition prepared by Deolinda Rodrigues Cabrera for the Projecto Vercial. The orthography has been updated. Obras integrais de Eça de Queirós em CD-ROM. Projecto Vercial 1996 - 2003, pp 1-42.
SOURCE: Portuguese, Portugal; 30279 tokens, 24783 words, 6615 types, 1409 alignment units.

1994 The Relic translated by Margaret Jull Costa. Sawtry: Dedalus, pp 7-89. Copyright © 1994 Margaret Jull Costa. Used by permission of Margaret Jull Costa.
TRANSLATION: English, UK; 34053 tokens, 29856 words, 5495 types.

PPEQ3 Eça de Queirós, J.M.

1925 Alves e Companhia WWW: Projecto Vercial, http://www.ipn.pt/literatura/obrqueiros.htm, pp 3-13.
SOURCE: Portuguese, Portugal; 7291 tokens, 5951 words, 1987 types, 359 alignment units.

1993 The Yellow Sofa translated by John Vetch. Manchester: Carcanet, pp 7-31. Copyright © 1993 John Vetch. Used by permission of Carcanet Press in the name of the translator (deceased).
TRANSLATION: English, UK; 7811 tokens, 6741 words, 1831 types.

PBRF1 Fonseca, Rubem

1988 Vastas emoções e pensamentos imperfeitos São Paulo: Companhia das Letras, pp 7-102. Copyright © Rubem Fonseca. Used by permission of Rubem Fonseca.
SOURCE: Portuguese, Brazil; 34619 tokens, 27446 words, 5882 types, 2806 alignment units.

1997 The lost manuscript translated by Clifford Landers. London: Bloomsbury, pp 3-107. Copyright © 1997 by Clifford Landers. Used by permission of Clifford Landers.
TRANSLATION: English, US; 36085 tokens, 31099 words, 4769 types.

PBRF2 Fonseca, Rubem

1983 A Grande Arte. Corpus text based on 1987 edition. Rio de Janeiro: Livraria Francisco Alves Editora, pp 7-100. Copyright Rubem Fonseca. Used by permission of Rubem Fonseca.
SOURCE: Portuguese, Brazil; 39126 tokens, 31059 words, 7156 types, 3111 alignment units.

1987 High Art. Translated by Ellen Watson. London: Collins pp 3-111. Copyright Ellen Watson. Used by permission of Ellen Watson.
TRANSLATION: English, US; 41759 tokens, 33609 words, 6109 types.

ESNG1 Gordimer, Nadine

1990 My son's story Corpus text extracted from digital text prepared by the ENPC based on London: Penguin, 1991, pp 3-46. Copyright © 1990 Nadine Gordimer. Used by permission of Nadine Gordimer.
SOURCE: English, South Africa; 15933 tokens, 14027 words, 3047 types. 674 alignment units.

1992 A história do meu filho translated by Geraldo Galvão Ferraz. Corpus text extracted from digital text prepared by the ENPC based on São Paulo: Editora Siciliano, 1992, pp 11-49. Copyright © 1992 Siciliano. Used by permission of Editora Siciliano.
TRANSLATION: Portuguese, Brazil; 15037 tokens, 12999 words, 3594 types.

ESNG2 Gordimer, Nadine

1979 Burger's Daughter Corpus text based on Penguin 1981 edition, pp 14-108.
SOURCE: English, South Africa; 40323 tokens, 35212 words, 6227 types. 1957 alignment units.

1992 A filha de Burger translated by J.Teixeira de Aguilar. Porto: Asa, pp 17-155. Copyright © 1992 Edições Asa. Used by permission of Edições Asa.
TRANSLATION: Portuguese, Portugal; 42580 tokens, 37209 words, 8111 types.

ESNG3 Gordimer, Nadine

1981 July's People. Corpus extract based on 1992 Curley large print edition, pp 103-162. Copyright © Nadine Gordimer 1981. Used by permission of Nadine Gordimer.
SOURCE: English, South Africa; 17154 tokens, 14517 words, 3179 types. 1038 alignment units.

1986 A Gente de July. Translated by Paula Reis. Lisboa: Teorema, pp 112-176. Copyright © Teorema. Used by permission of Teorema.
TRANSLATION: Portuguese, Portugal; 18047 tokens, 15059 words, 4171 types.

EBKI1 Ishiguro, Kazuo

1995 The Unconsoled. London: Faber & Faber (1996), pp. 377-535. Copyright © Kazuo Ishiguro 1995. Used by permission of Kazuo Ishiguro.
SOURCE: English, UK; 71910 tokens, 62736 words, 5285 types. 4658 alignment units.

1995 Os Inconsolados. Translated by Fernanda Pinto Rodrigues. Lisboa: Gradiva, pp. 304-425. Portuguese rights by Gradiva. Used by permission of Gradiva.
TRANSLATION: Portuguese, Portugal; 68234 tokens, 56627 words, 8127 types.

EUHJ1 James, Henry

1880 Washington Square, http://www.ibiblio.org/gutenberg/etext01/wassq10.txt, Project Gutenberg, 2001, pp 6-43.
SOURCE: English, USA; 23431 tokens, 19339 words, 3010 types. 1078 alignment units.

1990 A Herdeira translated by M. F. Gonçalves de Azevedo. Lisboa: Editorial Estampa, pp 7-77. Copyright © 1990 Editorial Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 21267 tokens, 17833 words, 3882 types.

EUHJ2 James, Henry

1895 The Altar of the Dead WWW: Project Gutenberg, etext96/altdd10.txt, 2002, pp 1-25.
SOURCE: English, USA; 16702 tokens, 14969 words, 2609 types. 644 alignment units.

1978 O Altar dos Mortos translated by Manuel João Gomes. Lisboa: Editorial Estampa, pp 225-276. Copyright © 1978 Editorial Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 16597 tokens, 14372 words, 3573 types.

EUHJ3 James, Henry

1893 Nona Vincent WWW: Project Gutenberg, etext01/nonav10.txt, 2001.
SOURCE: English, USA; 14533 tokens, 12718 words, 2340 types. 608 alignment units.

1978 Nona Vincent translated by Manuel João Gomes. Lisboa: Editorial Estampa, pp 107-150. Copyright © 1978 Editorial Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 13835 tokens, 11569 words, 3197 types.

PBOL1 Lins, Osman

1966 Nove, Novena. São Paulo: Livraria Martins Editora, pp 9-83. Copyright © 1966 Osman Lins. Used by permission of next of kin.
SOURCE: Portuguese, Brazil; 22156 tokens, 17808 words, 5435 types, 1277 alignment units.

1995 Nine, Novena. Translated by Adria Frizzi. Los Angeles: Sun & Moon Press, pp 25-106. Copyright © 1995 by Adria Frizzi. Used by permission of Adria Frizzi.
TRANSLATION: English, US; 24201 tokens, 20338 words, 4226 types.

EBDL1T1 Lodge, David

1995 Therapy London: Secker & Warburg, pp 3-97. Copyright © 1995 David Lodge. Used by permission of David Lodge. (Revised version by the COMPARA team, February 2003.)
SOURCE: English, UK; 43744 tokens, 37665 words, 6873 types. 2149 alignment units.

1995 Terapia translated by Maria do Carmo Figueira. Lisboa: Gradiva, pp 11-88. Copyright © 1995 Gradiva Publicações, Lda. Used by permission of Gradiva Publicações, Lda.
TRANSLATION: Portuguese, Portugal; 45169 tokens, 38995 words, 7920 types.

EBDL1T2 Lodge, David

1995 Therapy London: Secker & Warburg, pp 3-97. Copyright © 1995 David Lodge. Used by permission of David Lodge. (Revised version by the COMPARA team, February 2003.)
SOURCE: English, UK; 43744 tokens, 37665 words, 6873 types. 2149 alignment units.

1997 Terapia translated by Lídia Cavalcante-Luther. São Paulo: Scipione, pp 11-115. Copyright © 1997 Scipione. Used by permission of Editora Scipione.
TRANSLATION: Portuguese, Brazil; 45192 tokens, 39103 words, 7532 types.

EBDL2 Lodge, David

1989 Nice Work London: Penguin, pp 13-89. Copyright © 1988 by David Lodge. Used by permission of David Lodge.
SOURCE: English, UK; 28179 tokens, 24547 words, 5918 types. 1471 alignment units.

1996 Um almoço nunca é de graça. Translated by Maria Carlota Pracana. Lisboa: Gradiva, pp 13-67. Copyright © Gradiva Publicações, Lda. Used by permission of Gradiva Publicações, Lda.
TRANSLATION: Portuguese, Portugal; 29011 tokens, 24448 words, 6437 types.

EBDL3T1 Lodge, David

1975 Changing Places London: Secker & Warburg, pp 3-71. Copyright © David Lodge 1975. Used by permission of David Lodge.
SOURCE: English, UK; 30208 tokens, 25489 words, 6049 types. 1481 alignment units.

1995 A Troca translated by Helena Cardoso. Porto: Edições Asa, pp 9-67. Copyright © 1995 Edições Asa. Used by permission of Edições Asa.
TRANSLATION: Portuguese, Portugal; 29313 tokens, 24340 words, 6594 types.

EBDL3T2 Lodge, David

1975 Changing Places London: Secker & Warburg, pp 3-71. Copyright © David Lodge 1975. Used by permission of David Lodge.
SOURCE: English, UK; 30208 tokens, 25489 words, 6049 types. 1481 alignment units.

1998 Invertendo os Papéis translated by Lídia Cavalcante-Luther. São Paulo: Scipione, pp 15-93. Copyright © 1998 Scipione. Used by permission of Scipione.
TRANSLATION: Portuguese, Brazil; 30854 tokens, 26252 words, 6318 types.

EBDL4 Lodge, David

1980 How Far Can You Go? London: Secker & Warburg, pp 1-72. Copyright © David Lodge 1980. Used by permission of David Lodge.
SOURCE: English, UK; 34253 tokens, 29425 words, 5899 types. 1377 alignment units.

1997 Até Onde se Pode Ir? Translated by Helena Cardoso. Lisboa: Gradiva, pp 9-74. Copyright © Gradiva Publicações. Used by permission of Gradiva Publicações.
TRANSLATION: Portuguese, Portugal; 32428 tokens, 27619 words, 6708 types.

EBDL5 Lodge, David

1991 Paradise News. Corpus text based on Penguin 1992 edition, pp 3-97. Copyright © David Lodge 1991. Used by permission of David Lodge.
SOURCE: English, UK; 31914 tokens, 27517 words, 5372 types. 2013 alignment units.

1992 Notícias do Paraíso. Translated by Carlos Grifo Babo. Lisboa: Gradiva, pp 11-91. Copyright © Gradiva Publicações. Used by permission of Gradiva Publicações.
TRANSLATION: Portuguese, Portugal; 34082 tokens, 28194 words, 6486 types.

EBDL6 Lodge, David

1984 Small World. London: Penguin (1985), pp. 231-333. Copyright © David Lodge 1984. Used by permission of David Lodge.
SOURCE: English, UK; 48236 tokens, 39732 words, 7462 types. 2796 alignment units.

1996 O Mundo é Pequeno. Translated by Lucinda Santos Silva. Porto: Edições Asa, pp 251-355. Portuguese rights by Edições Asa. Used by permission of Edições Asa.
TRANSLATION: Portuguese, Portugal; 49467 tokens, 40924 words, 8909 types.

PBMA1 Machado de Assis, Joaquim

1886 Quincas Borba http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/quincasborba/quincasborba.html pp 1-21 [06/12/1999] Corpus text extracted from digital text prepared by Núcleo de Pesquisas em Informática, Literatura e Lingüística da Universidade Federal de Santa Catarina, based on Obra Completa, by Machado de Assis, vol. I, Nova Aguilar, Rio de Janeiro, 1994.
SOURCE: Portuguese, Brazil; 13760 tokens, 11140 words, 3179 types, 842 alignment units.

1998 Quincas Borba edited by Celso Favaretto and David T. Haberly; translated by Gregory Rabassa. Corpus text based on New York: Oxford University Press 1998, pp 5-45. Copyright © 1998 by Oxford University Press Inc. Used by permission of Oxford University Press, Inc.
TRANSLATION: English, US; 15778 tokens, 13216 words, 2661 types.

PBMA2 Machado de Assis, Joaquim

1881 Memórias póstumas de Brás Cubas http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/brascubas/cubas.html pp 1-24 [06/12/1999]. Corpus text extracted from digital text prepared by Renato Lima for Virtual bookstore, livros e literatura.
SOURCE: Portuguese, Brazil; 12410 tokens, 10218 words, 3390 types, 508 alignment units.

1998 The posthumous Memoirs of Brás Cubas translated by Gregory Rabassa. Corpus text based on New York: Oxford University Press, 1998, pp 5-34. Copyright © 1998 by Oxford University Press Inc. Used by permission of Oxford University Press, Inc.
TRANSLATION: English, US; 13365 tokens, 11466 words, 2935 types.

PBMA3 Machado de Assis, Joaquim

1899 Dom Casmurro http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/dcasmurro/dcasmurro.html pp 1-18 [06/12/1999]. Corpus text extracted from digital text prepared by Núcleo de Pesquisas em Informática, Literatura e Lingüística da Universidade Federal de Santa Catarina.
SOURCE: Portuguese, Brazil; 12025 tokens, 9903 words, 2989 types, 680 alignment units.

1997 Dom Casmurro translated by John Gledson. Corpus text based on New York: Oxford University Press, 1997, pp 3-39. Copyright © 1997 by Oxford University Press Inc. Used by permission of Oxford University Press, Inc.
TRANSLATION: English, US; 14075 tokens, 11825 words, 2558 types.

PBMA4 Machado de Assis, Joaquim

1904 Esaú e Jacó http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/esauejaco/esauejaco.html [08/08/2003], pp 147-149. Electronic edition based on Obra Completa, de Machado de Assis, vol. I, Nova Aguilar, Rio de Janeiro, 1994, by the Núcleo de Pesquisas em Informática, Literatura e Lingüística da Universidade Federal de Santa Catarina.
SOURCE: Portuguese, Brazil; 15317 tokens, 12734 words, 3440 types, 902 alignment units.

2000 Esau and Jacob, translated by Elizabeth Lowe and edited by Dain Borges, New York: Oxford University Press, pp 200-245. Copyright © 2000 Oxford University Press Inc. Used by permission of Oxford University Press Inc.
TRANSLATION: English, US; 17405 tokens, 15068 words, 2796 types.

PBMA5 Machado de Assis, Joaquim

1876 Helena http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/helena/helena.html [08/08/2003]. Electronic edition by Costa Flosi Ltda. for Biblioteca Virtual da Prefeitura de Ribeirão Preto, pp 118-174.
SOURCE: Portuguese, Brazil; 22477 tokens, 18760 words, 4554 types, 1322 alignment units.

1984 Helena: a novel translated by Helen Caldwell. Berkeley, Los Angeles and London: University of California Press, pp 134-197. Copyright © 1984 by the Regents of the University of California. Used by permission of the Regents of the University of California.
TRANSLATION: English, USA; 25195 tokens, 22199 words, 3658 types.

PBMA6 Machado de Assis, Joaquim

1908 Memorial de Aires. http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/memorial/ [22/04/2005], pp 1-29.
SOURCE: Portuguese, Brazil; 20360 tokens, 17222 words, 3907 types, 1118 alignment units.

1972 Counselor Ayres' Memorial. Translated by Helen Caldwell. Berkeley: University of California Press, pp 9-71. Copyright © The Regents of the University of California. Used by permission of The Regents of the University of California.
TRANSLATION: English, US; 24051 tokens, 20817 words, 3308 types.

EBIM1 McEwan, Ian

1992 Black Dogs. London: Picador (1993), pp 123-174. Copyright © Ian McEwan 1992. Used by permission of Ian McEwan.
SOURCE: English, UK; 17669 tokens, 15703 words, 3495 types. 978 alignment units.

1993 Cães Pretos. Translated by Fernanda Pinto Rodrigues. Lisboa: Gradiva, pp 109-152. Portuguese rights by Gradiva. Used by permission of Gradiva.
TRANSLATION: Portuguese, Portugal; 17729 tokens, 15224 words, 4436 types.

PBPM1 Melo, Patrícia

1998 O elogio da mentira Corpus text based on São Paulo: Companhia das Letras, 1998, pp 11-61. Copyright © 1998 by Patrícia Melo. Used by permission of Patrícia Melo.
SOURCE: Portuguese, Brazil; 15164 tokens, 12404 words, 3457 types, 1025 alignment units.

1999 In praise of lies translated by Clifford Landers. Corpus extract based on London: Bloomsbury, 1999, pp 1-53. Copyright © 1999 by Clifford Landers. Used by permission of Clifford Landers.
TRANSLATION: English, US; 16686 tokens, 14206 words, 2953 types.

PBPM2 Melo, Patrícia

1995 O Matador. São Paulo: Companhia das Letras, pp 9-71. Copyright 1995 by Patrícia Melo. Used by permission of Patrícia Melo.
SOURCE: Portuguese, Brazil; 23176 tokens, 18883 words, 3992 types, 1738 alignment units.

1998 The Killer translated by Clifford Landers. London: Bloomsbury, pp 3-71. Copyright 1997 by Clifford Landers. Used by permission of Clifford Landers.
TRANSLATION: English, US; 24653 tokens, 20477 words, 3303 types.

EUEP1 Poe, Edgar Allan

1837 The Narrative of Arthur Gordon Pym of Nantucket, Corpus extract based on WWW Project Gutenberg 2000, etext00/poe3v11.txt edition, pp. 12-43.
SOURCE: English, USA; 27593 tokens, 24412 words, 3663 types. 855 alignment units.

1988 Aventuras de Arthur Gordon Pym, translated by Eduardo Guerra Carneiro. Lisboa: Estampa, pp. 25-95. Copyright © 1988 Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 25188 tokens, 22123 words, 4883 types.

PBMR1 Rey, Marcos

1986 Memórias de um Gigolô. São Paulo: Ática Editorial, pp 159-224. Copyright Ática Editorial. Used by permission of Ática Editorial. http://www.marcosrey.com.br.
SOURCE: Portuguese, Brazil; 22774 tokens, 18468 words, 5365 types, 1979 alignment units.

1987 Memoirs of a Gigolo translated by Clifford Landers. New York: Avon pp 155-217. Copyright Clifford Landers. Used by permission of Clifford Landers.
TRANSLATION: English, US; 26409 tokens, 21669 words, 4363 types.

PPSC1 Sá-Carneiro, Mário de

1914 A confissão de Lúcio http://www.ipn.pt/opsis/litera/carneiro.htm pp 2-19 [10/12/1999]. Corpus text extracted from digital text prepared by Deolinda Cabrera for Projecto Vercial, based on orthographically updated 1st edition, Lisbon, 1914.
SOURCE: Portuguese, Portugal; 10261 tokens, 8244 words, 2797 types, 615 alignment units.

1993 Lucio's confession translated by Margaret Jull Costa. Corpus text based on Sawtry: Dedalus Ltd, 1993, pp 19-50. Copyright © 1993 by Dedalus Ltd. Used by permission of Dedalus Ltd.
TRANSLATION: English, UK; 11441 tokens, 9792 words, 2425 types.

PPSC2 Sá-Carneiro, Mário de

1915 A Grande sombra electronic version obtained from Obras completas de Sá-Carneiro, vol. 3 "Céu em fogo", 2nd edition, 1956. Lisboa: Editora Ática, pp 39-123. Updated orthography.
SOURCE: Portuguese, Portugal; 17214 tokens, 13620 words, 4298 types, 1124 alignment units.

1996 The Great Shadow (and other stories) translated by Margaret Jull Costa. Sawtry: Dedalus, pp 15-74. Copyright © Margaret Jull Costa. Used by permission of Margaret Jull Costa and Dedalus Press.
TRANSLATION: English, UK; 18526 tokens, 15712 words, 3460 types.

PPJSA1 Saramago, José

1995 Ensaio Sobre a Cegueira Lisboa: Caminho, pp. 11-98. Copyright © José Saramago and Editorial Caminho 1995. Used by permission of Editorial Caminho.
SOURCE: Portuguese, Portugal; 34437 tokens, 29237 words, 5864 types, 775 alignment units.

1997 Blindness translated by Giovanni Pontiero. Corpus text based on 1999 edition, London: Harvill Press, pp 3-90. Copyright © Professor Juan Sager, 1997. Used by permission of Juan Sager.
TRANSLATION: English, UK; 38107 tokens, 33276 words, 4260 types.

PPJS1 Sena, Jorge de

1978 Sinais de Fogo Lisboa: Edições 70, Lda., pp 25-136. Copyright © Mécia de Sena and Edições 70. Used by permission of Mécia de Sena and Edições 70.
SOURCE: Portuguese, Portugal; 52526 tokens, 42485 words, 7671 types, 3308 alignment units.

1999 Signs of Fire translated by John Byrne. Manchester: Carcanet Press, pp 1-115. Copyright © 1999 John Byrne (deceased). Used by permission of Carcanet Press.
TRANSLATION: English, UK; 59683 tokens, 52128 words, 5656 types.

EBJT1 Trollope, Joanna

1996 Next of kin Corpus text based on London: Black Swan, 1997, pp 7-87. Copyright © 1996 Joanna Trollope. Used by permission of Joanna Trollope.
SOURCE: English, UK; 32483 tokens, 28105 words, 4586 types, 2078 alignment units.

1998 Parentes próximos translated by Ana Falcão Bastos. Lisboa: Gradiva, 1998, pp 7-77. Copyright © 1998 Gradiva Publicações, Lda. Used by permission of Gradiva Publicações, Lda.
TRANSLATION: Portuguese, Portugal; 32612 tokens, 27216 words, 5708 types.

EBJT2 Trollope, Joanna

1993 A Spanish Lover London: Bloomsbury pp 223-313. Copyright © 1993 Joanna Trollope. Used by permission of Joanna Trollope.
SOURCE: English, UK; 37370 tokens, 32302 words, 4492 types, 2375 alignment units.

1999 Um Amante Espanhol translated by Ana Falcão Bastos. Lisboa: Gradiva, pp 223-310. Copyright © 1993 Joanna Trollope, Portuguese translation: Gradiva Publicações. Used by permission of Gradiva Publicações.
TRANSLATION: Portuguese, Portugal; 35926 tokens, 29710 words, 5686 types.

EBJT3 Trollope, Joanna

1995 The Best of Friends London: Black Swan, pp. 7-87. Copyright © 1995 Joanna Trollope. Used by permission of Joanna Trollope.
SOURCE: English, UK; 27827 tokens, 23956 words, 4301 types, 1862 alignment units.

1998 Melhores Amigos translated by Ana Falcão Bastos. Lisboa: Gradiva, pp 7-75. Copyright © 1995 Joanna Trollope, Portuguese translation: Gradiva Publicações. Used by permission of Gradiva Publicações.
TRANSLATION: Portuguese, Portugal; 27918 tokens, 23158 words, 5329 types.

EBOW1 Wilde, Oscar

1890 The Picture of Dorian Gray http://www.ibiblio.org/gutenberg/etext94/dgray10.txt. Project Gutenberg, 1994, chapters 8-20.
SOURCE: English, UK; 29961 tokens, 25652 words, 4193 types, 1736 alignment units.

1990 O Retrato de Dorian Gray translated by Januário Leite. Lisboa: Estampa, pp 115-194. Copyright © 1990 Editorial Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 27282 tokens, 22985 words, 5757 types.

EURZ1 Zimler, Richard

1998 The Last Kabbalist of Lisbon Corpus extract based on 1998 edition, London: Arcadia Books Ltd. pp 7 & 205-295 & 315-318. Copyright © Richard Zimler 1998. Used by permission of Richard Zimler.
SOURCE: English, USA; 44145 tokens, 36045 words, 5803 types, 3540 alignment units.

1996 O Último Cabalista de Lisboa translated by José Lima. Lisboa: Quetzal Editores, pp 17-20 & 205-294. Copyright © 1996 Quetzal. Used by permission of Quetzal Editores.
TRANSLATION: Portuguese, Portugal; 44825 tokens, 37207 words, 7483 types.

EURZ2 Zimler, Richard

2000 Angelic Darkness Corpus extract based on London: Arcadia, pp. 19-39, 70-86, 98-116, 170-192. Copyright © Richard Zimler 1999. Used by permission of Richard Zimler.
SOURCE: English, USA; 29244 tokens, 25150 words, 4177 types, 2303 alignment units.

1998 Trevas da Luz translated by José Lima. Lisboa: Quetzal, pp. 19-36, 53-69, 79-96, 140-162. Copyright © 1998 Richard Zimler and Quetzal. Used by permission of Richard Zimler and Quetzal. This translation is based on an earlier version of the published English edition.
TRANSLATION: Portuguese, Portugal; 28386 tokens, 23307 words, 5483 types.


Previous versions



Last update to this page: 28 Aug 2005
