Preparing a multimodal corpus within the GeM model

27.01.02


Start slide show


Table of contents

  • Context of work: goals and orientation
  • Page-based multicodal documents
  • Method:corpus orientation
  • Corpus basics
  • Well-known problem of annotation: intersecting hierarchies
  • Well-known problem of annotation: intersecting hierarchies
  • TEI ‘Corpus Encoding Standard’ approach
  • Using XML: current technology and ‘standardization’ is making this easier
  • Using XML: current technology and ‘standardization’ provides tools for working with the data
  • Using XML: current technology and ‘standardization’ provides tools for working with the data
  • Critical features of current annotation technologies and ‘standards’
  • An XML-compatible annotation scheme consists ideally of...
  • Multilayered annotation
  • Waller’s (1988) model of document design
  • The GeM annotation layers
  • The GeM annotation layers
  • Initial corpus selection
  • The GeM annotation layers
  • The GeM annotation layers
  • Derivation of Layout Structure
  • The GeM annotation layers
  • RST analysis: Tiger text
  • State of work and early ‘results’
  • Stages in Analysis
  • Content Analysis
  • Stages in Analysis
  • RST analysis
  • Stages in Analysis
  • Layout Structure
  • Stages in Analysis
  • Layout Structure
  • The ‘illusion of knowing’
  • ‘Conclusions’
  • ‘Conclusions...’
Authors:Bateman, Delin, Henschel

E-Mail: bateman@uni-bremen.de

Homepage: http://purl.org/net/gem

Presented at:
Overheads from the Salzburg Symposium on Multimodality, January 2002

Materials distributed with talk:
preliminary GeM annotation manual (updated February 2002)
detailed example of GeM annotation (Dec 2001)
(both of these are preliminary - comments welcome)

Best viewed with...
Microsoft Internet Explorer
click to start

View or download the Powerpoint file directly