Lancaster University Department of Linguistics and Modern English Language
Corpus Linguistics Home
Page index
WordSmith
Basic WordSmith
Using Concord
Frequency Lists and Keywords
Part-of-speech Tags
BNCweb
DIY Corpora
 
Page One
 
 
Page Two
 
 
Current page
 
 
Page Four
 
 

Meet the BNC

 
 
 

The BNC is the largest corpus we have at Lancaster. It is huge.

Look at the BNC folder and see just how huge the corpus is!

  1. Now go back to the root directory 'Bowland_back'.
  2. Go to the 'bnc_plain' directory.
  3. Browse a couple of directories and files.
  4. Check what each corpus file looks like.
    Can you distinguish the SGML header from the text?

We will look at the BNC in much great detail later in the course.