[Esta página em português]

COMPARA corpus contents
version 5.1

(Last updated: 19-Nov-2003)

Text-pairs available
Quantitative summary of currently available texts
Other text-pairs to be included in COMPARA
Previous versions

Text-pairs available

Publication dates in bold refer to first editions

EBDL1T1 Lodge, David

1995 Therapy London: Secker & Warburg, pp 3-97. Copyright © 1995 David Lodge. Used by permission of David Lodge. (Revised version by the COMPARA team, February 2003.)
SOURCE: English, UK; 43754 tokens, 40451 words, 6871 types. 2149 alignment units.

1995 Terapia translated by Maria do Carmo Figueira. Lisboa: Gradiva, pp 11-88. Copyright © 1995 Gradiva Publicações, Lda. Used by permission of Gradiva Publicações, Lda.
TRANSLATION: Portuguese, Portugal; 45159 tokens, 42389 words, 7930 types.

EBDL1T2 Lodge, David

1995 Therapy London: Secker & Warburg, pp 3-97. Copyright © 1995 David Lodge. Used by permission of David Lodge. (Revised version by the COMPARA team, February 2003.)
SOURCE: English, UK; 43754 tokens, 40451 words, 6871 types. 2149 alignment units.

1997 Terapia translated by Lídia Cavalcante-Luther. São Paulo: Scipione, pp 11-115. Copyright © 1997 Scipione. Used by permission of Editora Scipione.
TRANSLATION: Portuguese, Brazil; 45199 tokens, 42489 words, 7550 types.

EBDL2 Lodge, David

1989 Nice Work London: Penguin, pp 13-89. Copyright © 1988 by David Lodge. Used by permission of David Lodge.
SOURCE: English, UK; 28178 tokens, 26557 words, 5918 types. 1471 alignment units.

1996 Um almoço nunca é de graça translated by Maria Carlota Pracana. Lisboa: Gradiva, pp 13-67. Copyright © Gradiva Publicações, Lda. Used by permission of Gradiva Publicações, Lda.
TRANSLATION: Portuguese, Portugal; 28988 tokens, 27194 words, 6470 types.

EBDL3T1 Lodge, David

1975 Changing PlacesLondon: Secker & Warburg, pp 3-71. Copyright © David Lodge 1975. Used by permission of David Lodge.
SOURCE: English, UK; 30206 tokens, 27662 words, 6049 types. 1481 alignment units.

1995 A Troca translated by Helena Cardoso. Porto: Edições Asa, pp 9-67. Copyright © 1995 Edições Asa. Used by permission of Edições Asa.
TRANSLATION: Portuguese, Portugal; 29283 tokens, 26664 words, 6620 types.

EBDL3T2 Lodge, David

1975 Changing PlacesLondon: Secker & Warburg, pp 3-71. Copyright © David Lodge 1975. Used by permission of David Lodge.
SOURCE: English, UK; 30206 tokens, 27662 words, 6049 types. 1481 alignment units.

1998 Invertendo os Papéis translated by Lídia Luther-Cavalcante. São Paulo: Scipione, pp 15-93. Copyright © 1998 Scipione. Used by permission of Scipione.
TRANSLATION: Portuguese, Brazil; 30863 tokens, 29109 words, 6350 types.

EBJB1 Barnes, Julian

1984 Flaubert's parrot Corpus text based on London: Picador, 1985, pp 11-65. Copyright © 1984 by Julian Barnes. Used by permission of Julian Barnes.
SOURCE: English, UK; 21437 tokens, 19925 words, 4615 types. 1199 alignment units.

1988 O papagaio de Flaubert translated by Ana Maria Amador. Corpus text based on Lisboa: Queztal Editores, 1988, pp 11-74. Copyright © 1988 by Quetzal Editores. Used by permission of Quetzal Editores.
TRANSLATION: Portuguese, Portugal; 20961 tokens, 19033 words, 5049 types.

EBJT1 Trollope, Joanna

1996 Next of kin Corpus text based on London: Black Swan, 1997, pp 7-87. Copyright © 1996 Joanna Trollope. Used by permission of Joanna Trollope.
SOURCE: English, UK; 32483 tokens, 30363 words, 4586 types. 2078 alignment units.

1998 Parentes próximos translated by Ana Falcão Bastos. Corpus text based on Lisboa: Gradiva, 1998, pp 7-77. Copyright © 1998 Gradiva Publicações, Lda. Used by permission of Gradiva Publicações, Lda.
TRANSLATION: Portuguese, Portugal; 32552 tokens, 30300 words, 5716 types.

EBJT2 Trollope, Joanna

1993 A Spanish Lover London: Bloomsbury pp 223-313. Copyright © 1993 Joanna Trollope. Used by permission of Joanna Trollope.
SOURCE: English, UK; 37369 tokens, 35086 words, 4493 types. 2375 alignment units.

1999 Um Amante Espanhol translated by Ana Falcão Bastos. Lisboa: Gradiva, pp 223-310. Copyright © 1993 Joanna Trollope, Portuguese translation: Gradiva Publicações. Used by permission of Gradiva Publicações.
TRANSLATION: Portuguese, Portugal; 35824 tokens, 33206 words, 5706 types.

EBLC1 Carrol, Lewis

1871 Through the Looking Glass Corpus extract based on WWW Project Gutenberg 1994 lglass18.txt edition pp 7-18 & 24-33.
SOURCE: English, UK; 12972 tokens, 12100 words, 1630 types. 621 alignment units.

1977 Alice do Outro Lado do Espelho translated by Yolanda Artiaga, Nina Videira and Luís Lobo. Lisboa: Estampa, pp 25-47 & 63-79. Copyright © 1996 Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 12160 tokens, 10292 words, 2203 types.

EBOW1 Wilde, Oscar

1890 The Picture of Dorian Gray http://www.ibiblio.org/gutenberg/etext94/dgray10.txt. Project Gutenberg, 1994, chapters 8-20.
SOURCE: English, UK; 29961 tokens, 27492 words, 4193 types. 1736 alignment units.

1990 O Retrato de Dorian Gray translated by Januário Leite. Lisboa: Estampa, pp 115-194. Copyright © 1990 Editorial Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 27239 tokens, 25305 words, 5764 types.

ESNG1 Gordimer, Nadine

1990 My son's story Corpus text extracted from digital text prepared by the ENPC based on London: Penguin, 1991, pp 3-46. Copyright © 1990 Nadine Gordimer. Used by permission of Nadine Gordimer.
SOURCE: English, South Africa; 15932 tokens, 15058 words, 3048 types. 674 alignment units.

1992 A história do meu filho translated by Geraldo Galvão Ferraz. Corpus text extracted from digital text prepared by the ENPC based on São Paulo: Editora Siciliano, 1992, pp 11-49. Copyright © 1992 Siciliano. Used by permission of Editora Siciliano.
TRANSLATION: Portuguese, Brazil; 15035 tokens, 14134 words, 3595 types.

EUHJ1 James, Henry

1880 Washington Square, http://www.ibiblio.org/gutenberg/etext01/wassq10.txt, Project Gutenberg, 2001, pp 6-43.
SOURCE: English, USA; 23431 tokens, 20920 words, 3010 types. 1078 alignment units.

1990 A Herdeira translated by M. F. Gonçalves de Azevedo. Lisboa: Editorial Estampa, pp 7-77. Copyright © 1990 Editorial Estampa. Used by permission of Editorial Estampa.
TRANSLATION: Portuguese, Portugal; 21128 tokens, 19459 words, 3881 types.

EURZ1 Zimler, Richard

1998 The Last Kabbalist of Lisbon Corpus extract based on 1998 edition, London: Arcadia Books Ltd. pp 7 & 205-295 & 315-318. Copyright © Richard Zimler 1998. Used by permission of Richard Zimler.
SOURCE: English, USA; 44145 tokens, 38096 words, 5803 types. 3540 alignment units.

1996 O Último Cabalista de Lisboa translated by José Lima. Lisboa: Quetzal Editores, pp 17-20 & 205-294. Copyright © 1996 Queztal. Used by permission of Quetzal Editores.
TRANSLATION: Portuguese, Portugal; 44730 tokens, 40543 words, 7491 types.

PAJA1 Agualusa, José Eduardo

1992 A Feira dos Assombrados. Lisbon: Vega 1992, pp 11-17. Copyright Vega © 1992. Used by permission of José Eduardo Agualusa and Vega.
SOURCE: Portuguese, Angola; 2076 tokens, 1965 words, 926 types, 91 alignment units.

1994 Shadowtown translated by Richard Zenith. Prague: Trafika. Digital text by Richard Zenith. Used by permission of Trafika and Richard Zenith.
TRANSLATION: English, US; 2146 tokens, 2019 words, 846 types.

PBAA1 Azevedo, Aluísio

1890 O Cortiço, http://www.bibvirt.futuro.usp.br/textos/autores/aluisioazevedo/cortico/cortico.html [06/12/1999].
SOURCE: Portuguese, Brazil; 8324 tokens, 7900 words, 2376 types, 292 alignment units.

2000 The slum: a novel translated by David Rosenthal. New York: Oxford University Press, pp 1-17. Copyright © 2000 by Oxford University Press. Used by permission of Oxford University Press, Inc.
TRANSLATION: English, US; 7449 tokens, 6932 words, 2025 types.

PBAD1 Dourado, Autran

1973 O Risco do Bordado Rio de Janeiro: Editora Expressão e Cultura, pp 13-68. Copyright © Autran Dourado 1970. Used by permission of Autran Dourado.
SOURCE: Portuguese, Brazil; 20245 tokens, 18918 words, 3782 types, 1273 alignment units.

1984 Pattern for a Tapestry translated by John Parker. London: Peter Owen, pp 13-54. Copyright © John M. Parker 1984. Used by permission of John Parker.
TRANSLATION: English, UK; 22348 tokens, 21019 words, 3376 types.

PBCB1 Buarque, Chico

1995 Benjamim São Paulo: Companhia das Letras, pp 9-52. Copyright © Chico Buarque 1995. Used by permission of Chico Buarque.
SOURCE: Portuguese, Brazil; 12129 tokens, 11542 words, 3513 types, 443 alignment units.

1997 Benjamin translated by Cliff Landers. London: Bloomsbury, pp 3-45. Copyright © Cliff Landers 1997. Used by permission of Cliff Landers.
TRANSLATION: English, UK; 13282 tokens, 12752 words, 3083 types.

PBJA1T1 Alencar, José de

1865 Iracema http://www.bibvirt.futuro.usp.br/textos/autores/josedealencar/iracema/iracema.html , pp 1-12 [06/12/1999]. Corpus text extracted from digital text prepared by the Biblioteca Virtual do Estudante Brasileiro, based on 24th ed., São Paulo: Ática, 1991.
SOURCE: Portuguese, Brazil; 3741 tokens, 3470 words, 1249 types, 217 alignment units.

2000 Iracema translated by Clifford Landers. Corpus text based on New York: Oxford University Press, 2000, pp 1-17. Copyright © 2000 by Oxford University Press. Used by permission of Oxford University Press.
TRANSLATION: English, US; 4197 tokens, 3860 words, 1176 types.

PBJA1T2 Alencar, José de

1865 Iracema http://www.bibvirt.futuro.usp.br/textos/autores/josedealencar/iracema/iracema.html , pp 1-12 [06/12/1999]. Corpus text extracted from digital text prepared by the Biblioteca Virtual do Estudante Brasileiro, based on 24th ed., São Paulo: Ática, 1991.
SOURCE: Portuguese, Brazil; 3741 tokens, 3470 words, 1249 types, 217 alignment units.

1886 Iracema, the honey lips: a legend of Brazil translated by Lady Isabel Burton. Corpus text based on London: Bickers, 1886, pp 1-17.
TRANSLATION: English, UK; 4291 tokens, 3938 words, 1302 types.

PBMA1 Machado de Assis, J.

1886 Quincas Borba http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/quincasborba/quincasborba.html pp 1-21 [06/12/1999] Corpus text extracted from digital text prepared by Núcleo de Pesquisas em Informática, Literatura e Lingüística da Universidade Federal de Santa Catarina, based on Obra Completa, by Machado de Assis, vol. I, Nova Aguilar, Rio de Janeiro, 1994.
SOURCE: Portuguese, Brazil; 13733 tokens, 12621 words, 3183 types, 841 alignment units.

1998 Quincas Borbaedited by Celso Favaretto and David T. Haberly; translated by Gregory Rabassa. Corpus text based on New York: Oxford University Press 1998, pp 5-45. Copyright © 1998 by Oxford University Press Inc. Used by permission of Oxford University Press, Inc.
TRANSLATION: English, US; 15780 tokens, 14296 words, 2661 types.

PBMA2 Machado de Assis, Joaquim

1881 Memórias póstumas de Brás Cubas http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/brascubas/cubas.html pp 1-24 [06/12/1999]. Corpus text extracted from digital text prepared by Renato Lima for Virtual bookstore, livros e literatura.
SOURCE: Portuguese, Brazil; 12390 tokens, 11610 words, 3404 types, 508 alignment units.

1998 The posthumous Memoirs of Brás Cubas translated by Gregory Rabassa. Corpus text based on New York: Oxford University Press, 1998, pp 5-34. Copyright © 1998 by Oxford University Press Inc. Used by permission of Oxford University Press, Inc.
TRANSLATION: English, US; 13379 tokens, 12504 words, 2935 types.

PBMA3 Machado de Assis, Joaquim

1899 Dom Casmurro http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/dcasmurro/dcasmurro.html pp 1-18 [06/12/1999]. Corpus text extracted from digital text prepared by Núcleo de Pesquisas em Informática, Literatura e Lingüística da Universidade Federal de Santa Catarina.
SOURCE: Portuguese, Brazil; 12003 tokens, 11099 words, 2997 types, 680 alignment units.

1997 Dom Casmurro translated by John Gledson. Corpus text based on New York: Oxford University Press, 1997, pp 3-39. Copyright © 1997 by Oxford University Press Inc. Used by permission of Oxford University Press, Inc.
TRANSLATION: English, US; 14074 tokens, 12884 words, 2572 types.

PBMA4 Machado de Assis, Joaquim

1904 Esaú e Jacó http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/esauejaco/esauejaco.html [08/08/2003], pp 147-149. Electronic edition based on Obra Completa, de Machado de Assis, vol. I, Nova Aguilar, Rio de Janeiro, 1994, by the Núcleo de Pesquisas em Informática, Literatura e Lingüística da Universidade Federal de Santa Catarina.
SOURCE: Portuguese, Brazil; 15291 tokens, 14230 words, 3455 types, 902 alignment units.

2000 Esau and Jacob, translated by Elizabeth Lowe and edited by Dain Borges, New York: Oxford University Press, pp 200-245. Copyright © 2000 Oxford University Press Inc. Used by permission of Oxford University Press Inc.
TRANSLATION: English, US; 17405 tokens, 16349 words, 2796 types.

PBMA5 Machado de Assis

1876 Helena http://www.bibvirt.futuro.usp.br/textos/autores/machadodeassis/helena/helena.html [08/08/2003]. Electronic edition by Costa Flosi Ltda. for Biblioteca Virtual da Prefeitura de Ribeirao Preto, pp 118-174.
SOURCE: Portuguese, Brazil; 22460 tokens, 20548 words, 4554 types, 1322 alignment units.

1984 Helena: a novel translated by Helen Caldwell. Berkeley, los Angeles and London: University of California Press, pp 134-197. Copyright © 1984 by the Regents of the University of California. Used by permission of the Regents of the University of California.
TRANSLATION: English, USA; 25199 tokens, 23440 words, 3658 types.

PBMAA1 Almeida, Manuel Antônio de

1852 Memórias de um sargento de milícias http://www.bibvirt.futuro.usp.br/textos/autores/manuelantonio/sargento/sargento.html [08/08/2003], pp 76-101. Electronic version by Artur Araújo for the Biblioteca Virtual do Estudante Brasileiro, based on the 25th edition, São Paulo: Ática, 1996.
SOURCE: Portuguese, Brazil; 13153 tokens, 12335 words, 2923 types, 503 alignment units.

1999 Memoirs of a Militia Sergeant, edited by Thomas Holloway and translated by Ronald W. Sousa, pp 100-130. Copyright © 1999 Oxford University Press Inc. Used by permission of Oxford University Press.
TRANSLATION: English, USA; 15097 tokens, 13916 words, 2585 types.

PBMR1 Rey, Marcos

1986 Memórias de um Gigolô. São Paulo: Ática Editorial, pp 159-224. Copyright Ática Editorial. Used by permission of Ática Editorial. http://www.marcosrey.com.br.
SOURCE: Portuguese, Brazil; 22763 tokens, 20616 words, 5370 types, 1979 alignment units.

1987 Memoirs of a Gigolo translated by Clifford Landers. New York: Avon pp 155-217. Copyright Clifford Landers. Used by permission of Clifford Landers.
TRANSLATION: English, US; 26420 tokens, 23170 words, 4365 types.

PBPC1 Coelho, Paulo

1988 O alquimista Corpus text based on Rio de Janeiro: Rocco, 2000, pp 21-79. Copyright © 1988 by Paulo Coelho. Used by permission of Paulo Coelho.
SOURCE: Portuguese, Brazil; 11742 tokens, 10775 words, 2242 types, 772 alignment units.

1993 The alquemist translated by Alan Clarke. Corpus text based on London: Thorsons, 1997, pp 3-50. Copyright © by Paulo Coelho & Alan Clarke. Used by permission of Paulo Coelho and Alan Clarke.
TRANSLATION: English, US; 13094 tokens, 11887 words, 1782 types.

PBPM1 Melo, Patrícia

1998 O elogio da mentira Corpus text based on São Paulo: Companhia das Letras, 1998, pp 11-61. Copyright © 1998 by Patrícia Melo. Used by permission of Patrícia Melo.
SOURCE: Portuguese, Brazil; 15152 tokens, 14036 words, 3457 types, 1025 alignment units.

1999 In praise of lies translated by Clifford Landers. Corpus extract based on London: Bloomsbury, 1999, pp 1-53. Copyright © 1999 by Clifford Landers. Used by permission of Clifford Landers.
TRANSLATION: English, US; 16691 tokens, 15396 words, 2954 types.

PBRF1 Fonseca, Rubem

1988 Vastas emoções e pensamentos imperfeitos São Paulo: Companhia das Letras,pp 7-102.Copyright © Rubem Fonseca. Used by permission of Rubem Fonseca.
SOURCE: Portuguese, Brazil; 34657 tokens, 29643 words, 5893 types, 2806 alignment units.

1997 The lost manuscript translated by Clifford Landers. London: Bloomsbury, pp 3-107. Copyright © 1997 by Clifford Landers. Used by permission of Clifford Landers.
TRANSLATION: English, US; 36104 tokens, 32958 words, 4767 types.

PMMC1 Couto, Mia

1987 Vozes Anoitecidas. Lisboa: Editorial Caminho, pp 23-64. Copyright © Editorial Caminho 1987. Used by permission of Editorial Caminho
SOURCE: Portuguese, Mozambique; 7617 tokens, 6792 words, 2189 types, 758 alignment units.

1990 Voices Made Night translated by David Brookshaw. Oxford: Heinneman, pp 1-28. Copyright © 1990 by David Brookshaw. Used by permission of David Brookshaw .
TRANSLATION: English, UK; 9356 tokens, 8562 words, 1982 types.

PMMC2 Couto, Mia

1990 Cada Homem é uma Raça. Lisboa: Editorial Caminho, pp 11-68. Copyright © Editorial Caminho 1990. Used by permission of Editorial Caminho
SOURCE: Portuguese, Mozambique; 12495 tokens, 11148 words, 3358 types, 1229 alignment units.

1993 Every Man is a Race translated by David Brookshaw. Oxford: Heinneman, pp 1-37. Copyright © 1993 by David Brookshaw. Used by permission of David Brookshaw .
TRANSLATION: English, UK; 15044 tokens, 13706 words, 2784 types.

PPCC1 Castelo Branco, Camilo

1862 Amor de Perdição corpus extract taken from Obras integrais de Camilo Castelo Branco em CD-ROM. Projecto Vercial 2000, pp 60-92.
SOURCE: Portuguese, Portugal; 19042 tokens, 17503 words, 3959 types, 1209 alignment units.

2000 Doomed Love translated by Alice Clemente. Providence: Gávea Brown, pp 96-144. Copyright © 2000 Alice Clemente and Gávea-Brown Publications. Used by permission of Alice Clemente and Gávea-Brown Publications.
TRANSLATION: English, USA; 20417 tokens, 18095 words, 3005 types.

PPCP1 Cardoso Pires, José

1983 Balada da Praia dos Cães Lisboa: Edições "O Jornal", pp 13-60. First published by Publicações Dom Quixote, 1982. Copyright © 1982 by José Cardoso Pires. Used by permission of Maria Edite Cardoso Pires.
SOURCE: Portuguese, Portugal; 17550 tokens, 16227 words, 4435 types, 899 alignment units.

1986 Ballad of Dog's Beach translated by Mary Fitton. London: John M. Dent, pp 5-38. Copyright © 1986 by John M. Dent. Used by permission of The Orion Publishing Group Ltd.
TRANSLATION: English, UK; 15505 tokens, 14401 words, 3874 types.

PPEQ1 Eça de Queirós, J. M.

1880 O mandarim http://www.ipn.pt/opsis/litera/queiros.htmpp 2-10 [10/12/1999]. Corpus text extracted from digital text prepared by Deolinda Cabrera for Projecto Vercial.
SOURCE: Portuguese, Portugal; 7274 tokens, 6754 words, 2500 types, 300 alignment units.

1993 The mandarinin The mandarin and other storiestranslated by Margaret Jull Costa. Corpus text based on Sawtry: Dedalus Ltd, 1993, pp 5-25. Used by permission of Margaret Jull Costa and Dedalus Press.
TRANSLATION: English, UK; 8213 tokens, 7806 words, 2416 types.

PPEQ2 Eça de Queirós

1887 A Relíquia corpus extract taken from 1887 edition prepared by Deolinda Rodrigues Cabrera. for the Projecto Vercial. The orthography has been updated. Obras integrais de Eça de Queirós em CD-ROM. Projecto Vercial 1996 - 2003, pp 1-42.
SOURCE: Portuguese, Portugal; 30268 tokens, 28277 words, 6609 types, 1409 alignment units.

1994 The Relic translated by Margaret Jull Costa. Sawtry: Dedalus, pp 7-89. Copyright © 1994 Margaret Jull Costa. Used by permission of Margaret Jull Costa.
TRANSLATION: English, UK; 34057 tokens, 32438 words, 5495 types.

PPJS1 Sena, Jorge de

1978 Sinais de Fogo Lisboa: Edições 70, Lda., pp 25-136. Copyright © Mécia de Sena and Edições 70. Used by permission of Mécia de Sena and Edições 70.
SOURCE: Portuguese, Portugal; 52505 tokens, 48391 words, 7674 types, 3308 alignment units.

1999 Signs of Fire translated by John Byrne. Manchester: Carcanet Press, pp 1-115. Copyright © 1999 John Byrne (deceased). Used by permission of Carcanet Press.
TRANSLATION: English, UK; 59691 tokens, 55872 words, 5658 types.

PPMC1 Carvalho, Mário de

1994 Um deus passeando pela brisa da tarde Corpus text based on Lisboa: Caminho, 1995, pp 13-87. Copyright © 1994 Editorial Caminho. Used by permission of Editorial Caminho.
SOURCE: Portuguese, Portugal; 25012 tokens, 23499 words, 6463 types, 1383 alignment units.

1997 A god strolling in the cool of the evening translated by Gregory Rabassa. Corpus text based on London: Phoenix, 1999, pp 1-79. Copyright © 1997 by Lousiana StateUniversity Press. Used by permission of Lousiana StateUniversity Press.
TRANSLATION: English, US; 27107 tokens, 25292 words, 4892 types.

PPSC1 Sá-Carneiro, Mário de

1914 A confissão de Lúcio http://www.ipn.pt/opsis/litera/carneiro.htm pp 2-19 [10/12/1999]. Corpus text extracted from digital text prepared by Deolinda Cabrera for Projecto Vercial, based on orthographically updated 1st edition, Lisbon, 1914.
SOURCE: Portuguese, Portugal; 10252 tokens, 9440 words, 2796 types, 615 alignment units.

1993 Lucio's confession translated by Margaret Jull Costa. Corpus text based on Sawtry: Dedalus Ltd, 1993, pp 19-50. Copyright © 1993 by Dedalus Ltd. Used by permission of Dedalus Ltd.
TRANSLATION: English, UK; 11449 tokens, 10801 words, 2425 types.

PPSC2 Sá Carneiro, Mário de

1915 A Grande sombra electronic version obtained from Obras completas de Sá-Carneiro, vol. 3 "Céu em fogo", 2nd edition, 1956. Lisboa: Editora Ática, pp 39-123. Updated orthography.
SOURCE: Portuguese, Portugal; 17187 tokens, 15643 words, 4302 types, 1124 alignment units.

1996 The Great Shadow (and other stories) translated by Margaret Jull Costa. Sawtry: Dedalus, pp 15-74. Copyright © Margaret Jull Costa. Used by permission of Margaret Jull Costa and Dedalus Press.
TRANSLATION: English, UK; 18546 tokens, 17398 words, 3460 types.


Quantitative summary of present corpus

Aligned texts Source texts Translations
Portuguese 25 13
English 11 26
Total 36 39

Alignment units
Portuguese to English direction 26105
English to Portuguese direction 22032
Total 48137

Words Source texts Translations Source texts & translations
Portuguese 388452 360117 748573
English 361823 431691 793514
Portuguese & English 750275 791808 1542087

Words/alignment unit Source texts Translations Source texts & translations
Portuguese 14.88 16.35 15.55
English 16.42 16.54 16.48
Portuguese & English 15.59 16.45 32.04

Types Source texts Translations Source texts & translations
Portuguese 46973 33410 54747
English 22666 29157 34838

Tokens Source texts Translations Source texts & translations
Portuguese 422802 389121 811927
English 393828 466341 860169
Portuguese & English 816630 855462 1672096

Type/token ratio Source texts Translations Source texts & translations
Portuguese 0.1209 0.0928 0.0731
English 0.0626 0.0675 0.0439

Translation notes Foreign words or expressions Named entities Titles Emphasis
Portuguese 50 1081 200 458 448
English 91 699 69 455 618
Total 141 1780 269 913 1066

Language variety Words in Portuguese Words in English Alignment units Texts
United States 322135 297397 1383 17
Angola 2076 2146 91 1
Brazil 312621 334702 2806 18
United Kingdom 522101 499491 1124 21
Portugal 477114 498920 1124 18
Mozambique 20112 24400 1229 2
South Africa 15932 15035 674 1


Other text-pairs to be included in COMPARA


Author Year
Source Text Place Source Publisher

Agualusa, José Eduardo 1990
D. Nicolau Água Rosada e outras histórias verdadeiras Lisboa Vega
Levitin, Alexis 1995
The incredible but true story of Prince Nicolau Agua-Rosada Madison The Literary Review

Azevedo, Aluísio 1881
O mulato WWW Virtual Bookstore, Livros e Literatura
MacNicoll, Graeme 1990
Mulatto Cranbury, NJ Associated University Presses

Barnes, Julian 1989
The history of the world in 10 1/2 chapters London Picador


Uma história do mundo em 10 1/2 Capítulos Lisboa Quetzal Editores

Buarque, Chico 1991
Estorvo São Paulo Companhia das Letras
Bush, Peter 1992
Turbulence London Bloomsbury

Coelho, Paulo 1987
Diário de um mago Rio de Janeiro Rocco
Clarke, Alan 1995
The pilgrimage: a contemporary quest for ancient wisdom New York HarperCollins

Coelho, Paulo 1996
O monte cinco Rio de Janeiro Objetiva
Landers, Clifford 1998
The fifth mountain London Harper

Conrad, Joseph 1902
Heart of darkness WWW Project Gutenberg
Fernandes, Aníbal 1983
O coração das trevas Lisboa Editorial Estampa

Dourado, Autran 1975
Os sinos da agonia Rio de Janeiro Editora Expressão e Cultura
Parker, John 1988
The bells of agony London Peter Owen

Eça de Queirós, J. M.
Alves e Co. WWW Projecto Vercial
Vetch, John 1993
The yellow sofa Manchester Carcanet Press Limited

Fonseca, Rubem 1983
A grande arte São Paulo Companhia das Letras
Watson, Ellen 1987
High art London Collins

Gordimer, Nadine 1974
The conservationist London Picador

1992
O conservador Porto Edições Asa

Gordimer, Nadine 1981
Burger's daughter London Penguin
Teixeira de Aguilar, J. 1992
A filha de Burger Porto Edições Asa

Gordimer, Nadine 1992
July's people Bath Chivers Press

1986
A gente de July Lisboa Editorial Teorema

Lodge, David 1984
Small world London Secker & Warburg

1996
O mundo é pequeno Porto Edições Asa

Lodge, David 1992
Paradise news London Penguin
Babo, Carlos Grifo 2000
Notícias do paraíso Lisboa Gradiva Publicações, Lda

Lodge, David 1993
How far can you go? London Secker & Warburg
Cardoso, Helena 1997
Até onde se pode ir? Lisboa Gradiva Publicações, Lda

Machado de Assis, J. 1908
Memorial de Aires WWW Bibioteca Virtual, USP
Caldwell, Helen 1972
Counselor Ayres' Memorial Berkeley University of California Press

Melo, Patrícia 1995
O matador São Paulo Companhia das Letras
Landers, Clifford 1997
The killer London Bloomsbury

Saramago, José 1989
História do cerco de Lisboa Lisboa Editorial Caminho
Pontiero, Giovanni 1996
The History of the siege of Lisbon London Harvill Press

Saramago, José 1995
Ensaio sobre a cegueira Lisboa Editorial Caminho
Pontiero, Giovanni 1997
Blindness London Harvill Press

Shelley, Mary 1818
Frankenstein WWW Project Gutenberg


Frankenstein Lisboa Editorial Estampa

Soares, Jô 1995
O xangô de Baker Street São Paulo Companhia das Letras
Landers, Clifford
A samba for sherlock New York Pantheon books

Trollope, Joanna 1996
The best of friends London Bloomsbury

1996
Os melhores amigos Lisboa Gradiva Publicações, Lda

Wilde, Oscar 1897
De profundis WWW Project Gutenberg
1991
De profundis Lisboa Editorial Estampa

Zimler, Richard 2000
The angelic darkness London Arcadia Books, Ltd.
Lima, José 1983
Trevas de luz Lisboa Quetzal Editores

Previous versions


START USING COMPARA MORE INFORMATION ABOUT COMPARA ACKNOWLEDGEMENTS   HOW TO CONTRIBUTE  
Simple Search Project team
Complex Search Contents
Questions from users Publications
Search Help The DISPARA interface
Building the corpus

Comments and feedback to compara@informatics.sintef.no