travel!digital Corpus

Document record

Discipline
Document type
Language
Identifier
  • 0000-000C-29F7-0
Relations

This document is related to:
travel!digital Collection

This document is related to:
https://arche.acdh.oeaw.ac.at/api/19641

Licenses

Attribution 4.0 International (CC BY 4.0), Austrian Centre for Digital Humanities and Cultural Heritage, https://arche.acdh.oeaw.ac.at/api/48505




Cite this document

Ulrike Czeitschner et al., "travel!digital Corpus", ARCHE: Austrian resource and data centre for the digital humanities


Abstract

A collection of XML/TEI transcriptions of early German travel guides to non-European countries, published by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). The OCR is based on the travel!digital Facsimiles collection. The texts are tokenized, lemmatized, labelled with part-of-speech tags (STTS tagset, produced with TreeTagger), and manually corrected. Semantic annotation covers personal names of historical, mythological/religious and literary figures (except occurrences in street names and company names), dates, and selected designations of groups and sights. Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). The transformation to TEI/P5 and the linguistic and semantic annotation were done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", project no. ÖAW0204.
