GoTagger is a GUI-based Part-Of-Speech (POS) tagger that is freely availabe for research and education. This software is written in Delphi and thus runs on Windows wihout relying on any ActiveX or DLLs. GoTagger annotates a text with POS information utilizing the rule files contained in Eric Brill‘s POS tagger. If you don‘t have it, please download at Eric Brill‘s website.
(1) Directory explorer (2) File explorer
You can select one or more files using the directory explore (1) and the file explore (2).
Double-clicking a file in (2) will put it into the right frame (5) of the main window.
Add ... | The files highlighted in (2) will be added to (5). |
Add all ... | All of the files listed in (2) will be added to (5). |
Remove ... | The files highlighted in (5) will be removed. |
Remove all ... | All of the files listed in (5) will be removed. |
Lexicon | Choose one of the Lexicon files. |
Contextual Rule | Choose one of the Contextual Rule files. |
Separator | Choose your preferred separator. |
Destination of output files | If "..\(original file)\Tagged\" is selected, the "Tagged" folder will be automatically created under the same folder as the original files. In this option, the output files will be saved there. If you are inclined to "Specify" the save folder, press the "locate" button to select a directory. NOTICE -- Any of the old files having the same name of newly created files will be automaticaly overwritten. |
Tokenizer | Check the box written "On" if you need to tokenize sentences before tagging them. |
Lemmatizer | Check the box written "On" if you need to lemmatize words. To enable this function, you need to download "e_lemma.txt", complied by Prof. Yasumasa Someya, and put it into "G_data". |
联系客服