Digital Text Markup

Structured Text Transcription

  • Description: A literal transcription of the text, encoded in XML. Delivery requires additional files and specialized server software, especially if the text must be searchable.
  • Format: XML.
  • Standard: TEI P4 (Text Encoding Initiative), with local modifications, following the UVA DTD.
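
As a rough illustration of what a structured transcription makes possible, the sketch below parses a small TEI P4-style fragment and pulls out its paragraph text. The sample document, its content, and the helper names (SAMPLE, title, paragraphs) are hypothetical and are not taken from the UVA DTD; the fragment is simplified and would not validate against a full TEI P4 DTD, which requires more header elements.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified TEI P4-style transcription.
# A real file would carry a fuller <teiHeader> and a DOCTYPE declaration.
SAMPLE = """<TEI.2>
  <teiHeader>
    <fileDesc>
      <titleStmt><title>Sample Letter</title></titleStmt>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <div1 type="letter">
        <head>A Letter</head>
        <p>Dear Sir,</p>
        <p>I write in haste.</p>
      </div1>
    </body>
  </text>
</TEI.2>"""


def title(xml_source):
    """Return the title recorded in the header, or None if absent."""
    root = ET.fromstring(xml_source)
    node = root.find("./teiHeader/fileDesc/titleStmt/title")
    return node.text if node is not None else None


def paragraphs(xml_source):
    """Return the text of every <p> element, in document order."""
    root = ET.fromstring(xml_source)
    return ["".join(p.itertext()).strip() for p in root.iter("p")]


if __name__ == "__main__":
    print(title(SAMPLE))
    for para in paragraphs(SAMPLE):
        print(para)
```

Because the markup names each structural unit, a delivery system can address the header, divisions, and paragraphs separately. This is the kind of work that indexing and search software would do at a larger scale.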

Unstructured Text Transcription

  • Description: Plain text that may include minimal structural or formatting information.
  • Format: XHTML or ASCII text (e.g., OCR output).