Indonesian NLP experiments
POS tagging and Named-entity recognizing
Binary distribution can be downloaded here (JRE 1.7 or later required, Unix or Windows only)
Please find usage guide in the README
Prerequisites:
Building program:
$ cd java/nlp
$ mvn clean package
POS tagging with predefined training and test data:
$ cd python
$ python tagger.py ../data/pos-tagging/Indonesian_Manually_Tagged_Corpus_ID.tsv ../data/pos-tagging/Wikipedia.txt
POS tagging by splitting training data to training and test data:
$ cd python
$ python tagger.py ../data/pos-tagging/Indonesian_Manually_Tagged_Corpus_ID.tsv 1000 sentences.tag