Size matters when it comes to corpora. At 220 million words of text, the corpus used to create the second edition of the COBUILD dictionary in 1995 was over ten times the size of the one used for the first edition, and 220 times bigger than the first electronic corpora developed in the 1960s and early 1970s. Yet it was tiny compared to those we use today, some of which amount to billions, not millions, of words.
In the 30 years since the publication of the first COBUILD dictionary, a whole flurry of new words has come into the language, and as they’ve caught on and become part of everyday usage, they’ve been added to the dictionary.
By the time I arrived at COBUILD as part of the 1993 intake recruited to work on the second edition of the dictionary, the whole project had been fully computerised for several years. This meant working on screen at terminals linked to mainframe computers that hummed away in a separate room, still with the green text on a black background, as described by Andrew Delahunty in Part 1.
Where were you 30 years ago? I was in the middle of my university studies, still to embark on my ELT career, and as such, a smidgin too late to be part of the intrepid and free-spirited COBUILD dictionary team. Led by the late John Sinclair, this large, young team was involved in bringing his vision to life: to create a dictionary for learners that was based on a large digital language database – or a corpus.
This article has been written by Penny Hands, who is one of the contributors to the Collins COBUILD English Grammar. If we’re going to talk about nonstandard English, it’s a good idea to start by asking what Standard English (SE) is.
This article has been written by Penny Hands, who is one of the contributors to the Collins COBUILD English Grammar. Most people who study and us...