GSCP 2014 abstract
Comparing oral (transcribed) and written corpora in Portuguese
After shortly presenting AC/DC , http://www.linguateca.pt/ACDC/, a large repository and
service for many and diverse corpora of Portuguese (Santos, 2011), I will use it to illustrate possible research on orality and possible orality markers.
After Biber's (1988) work on features of writing done for English but
then also expanded to other languages, the time is ripe to do similar
explorations for Portuguese, like those included e.g. in Biber et al.
(1999) or Biber & Gray (2010), but not necessarily with the same features or even methodology. After all, I am aware that written conventions of Portuguese are rather different from those of English as beautifully pointed out by Bennett(2010).
In the presentation, I will, using quantitative data, look at three
This will serve as an apetizer for discussing a corpus-based grammar of Portuguese that is in the making, Gramateca, and which has as special feature the use of some semantic annotation as well. See Santos (2014) for methodological issues, and Maia & Santos (2012) for a preliminary example for the fear domain.
- vocative and second person use (extending Freitas & Santos 2014),
- lexical bundles,
- passive, extending Santos (2014).
The corpora I will be making use can be described at a glance by the following maps:
- Bennett, Karen. "Academic Discourse in Portugal: A whole different ballgame?", Journal of English for Academic Purposes, 9 (1), 2010, pp. 21-32.
- Biber, Douglas. Variation across speech and writing, Cambridge
University Press, 1988.
- Biber, Douglas, Stig Johansson, Geoffrey Leech, Susan Conrad & E.
Finegan. The Longman grammar of spoken and written English. 1999,
London: Longman. 1999.
- Biber, Douglas & Bethany Gray. "Challenging stereotypes about academic writing: Complexity, elaboration, explicitness". Journal of English for Academic Purposes, Vol. 9, 1, 2010, pp. 1-82.
- Freitas, Cláudia & Diana Santos. "Blogs, Amazônia e a Floresta
Sintá(c)tica: um corpus de um novo gênero?". In Simone Sarmento, Tony
Berber Sardinha, Livia Pretto Mottin & Ana Maria T. Ibanos (eds.), Pesquisas e perspetivas em linguistica de corpus, Campinas, Sao Paulo: Mercado de Letras. 2014.
- Maia, Belinda & Diana Santos. ""Who's afraid of ... what?" -- in English
and Portuguese". Aspects of corpus linguistics: compilation, annotation,
analysis, ed. by Signe Oksefjell Ebeling, Jarle Ebeling & Hilde
Hasselgård. (Studies in variation, contact and change in English 12).
Helsinki: Research Unit for Variation, Contacts, and Change in English.
- Santos, Diana. "Linguateca's infrastructure for Portuguese and how it allows the detailed study of language varieties". In J.B. Johannessen (ed.), Language Variation Infrastructure, OSLa: Oslo Studies in Language 3.2, 2011, pp. 113-128.
- Santos, Diana. "Podemos contar com as contas?", in Sandra Aluísio & Stella Tagnin (eds.), New language technologies and linguistic research: a two-way road, Cambridge Scholars Publishing, 2014.
Last modified: 13 February 2014