Here is a collection of XML files that can be used as a data set for a corpus:
1 2 3 4 5 6 7 8 9
The complete set as a zip file.