FAQ: Useful Hints for Using the Lemmatizer Module Morphisto

Wrong format or encoding

In general, a pre-processing step is performed prior to morphological analysis. The TextGrid Tokenizer can be used to fulfil this task. If you like to use another tokenizer, please make sure that the characters are encoded in UFT-8 and the lines are separated in UNIX style, i.e. by means of the UNIX-specific line operator \n.

Running Morphisto on a local computer

For processing large files (>1 MB) the tool Morphisto can be downloaded from the URL www.ids-mannheim.de/ll/TextGrid/morphisto.html and installed on a local computer. In this case, the time for transferring the data can be saved.