USING AND KEEPING LINGUISTIC DATA
WinterSemester 2004/5
The folder for this course in the copy shop is number 87. We wind up the course with a few quick glances to the future of XML applications... For some of the presentations, topics and annotations developed, see here. |
21.10.2004 |
Corpora and annotation - IReading: Leech & Fligelstone (1992); Biber, Conrad & Reppen (1998, Chap. 1) |
28.10 |
Corpus and annotation – IIActivity:
Constructing an XML file and checking validity, using XML
editors, etc. |
4.11 |
Deciding on a topic for investigation |
11.11 |
Annotation schemes for particular purposesReading:
Selected online XML documentation
(DTD examples) |
18.11 |
Presentation of annotation schemes and data - IActivity: Designing an annotation
scheme |
25.11 |
Presentation of annotation schemes and data - IIActivity:
Design of your
own transcription system and specification as a DTD: first approximation.
|
2.12 |
Tools for getting information out again: Xpath
|
9.12 |
Advanced topics in annotation: XSLTActivity:Introduction to XSLT for presenting results |
16.12 |
Advanced topics in annotation: stand-off annotation and automatic annotationActivity: (i)
Illustration of stand-off annotation (powerpoint); (ii) online stemmers, taggers and parsers (exercises
and materials) |
Christmas break |
|
6.1.2005 |
Progress reports and trouble shooting.Activity: Working with multiple documents: turning a set of XML documents into a database of linguistic results; using XSLT transform programs for creating webpages, plain text and other documents |
13.1 |
Collecting information together: XML databasesActivity: XSLT tricks and methods: variables |
20.1 |
Using the technology for presenting results: using XSLT transformations and style sheets as access to databasesActivity: XSLT tricks and methods: variables for storing node sets; grouping information according to attributes rather than according to files. |
27.1 |
Review of and comparison with traditional tools for linguistic data storage: converting legacy dataPROGRESS REPORTS: presentations
|
3.2 |
Practical working session: finding patternsPROGRESS REPORTS Corpus
alignment: Example
of practice: Multext |
11.2 |
Presentation of resultsAnd review of methods and applications so far. Future developments. Reading: selected sections from the
DBXML documentation |