Computer Tools and Applications - Sommersemester 2000 - Anglistik

Step-by-step instructions

Using 'unmarked-up' corpora or text archives: 2

Now take a look at the files that you downloaded in the previous step. Open them in an ordinary word processor or text editor, such as Word or Notepad.

Examine them to see what is meant by "unmarked-up" text. Can you see any problems with the form of the texts as they have been downloaded?

Can we answer basic questions such as the following?

  1. how many times is "red" used in the two texts?
  2. in what kind of contexts does "red" appear?
  3. what kinds of things are "red"?
  4. are these kinds of things other colours too?
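
If you would like to try the first two questions outside a word processor, the following is a minimal sketch in Python, not part of the course materials. The function names `count_word` and `concordance` and the sample sentence are made up for illustration; to run it on your own downloads, you would read each file into a string first. It counts "red" only as a whole word (so "hired" is not counted) and prints each hit with a little context on either side.

```python
import re

def count_word(text, word):
    """Count whole-word, case-insensitive occurrences of `word` in `text`."""
    pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
    return len(pattern.findall(text))

def concordance(text, word, width=30):
    """Return one line per occurrence of `word`, with up to `width`
    characters of context on each side."""
    pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
    flat = " ".join(text.split())  # collapse line breaks and repeated spaces
    lines = []
    for m in pattern.finditer(flat):
        left = flat[max(0, m.start() - width):m.start()]
        right = flat[m.end():m.end() + width]
        # Right-align the left context so the keyword lines up in a column.
        lines.append(f"{left:>{width}}[{m.group()}]{right}")
    return lines

# Invented sample text; replace it with the contents of a downloaded file.
sample = "The red rose faded. Her hair was red, not brown. He hired a boat."

print(count_word(sample, "red"))
for line in concordance(sample, "red", width=15):
    print(line)
```

The count answers question 1 and the context lines give a starting point for question 2. Questions 3 and 4 still require you to read the contexts and judge for yourself, since plain text carries no information about what kind of thing each word refers to.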