Subjects: Wikipedia; text corpora; monolingual corpus
Record details
Subjects: diacritical marks generation; natural language correction
Subjects: part of speech; tagging; semi-supervised
Subjects: test data; parallel corpus; Vietnamese
Subjects: corpus; test data; medical
Subjects: machine translation; neural machine translation; transformer
Subjects: multilingual corpora
Subjects: tokenizer; POS tagger; lemmatization