The following XML files can be used as a data set for a corpus:

1 2 3 4 5 6 7 8 9

The complete set is also available as a zip file.