Trankit: A Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

https://github.com/nlp-uoregon/trankit

GitHub – nlp-uoregon/trankit: Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Trankit outperforms the current state-of-the-art multilingual toolkit Stanza (StanfordNLP) in many tasks over 90 Universal Dependencies v2.5 treebanks of 56 different languages while still being efficient in memory usage and speed, making it usable for general users. In particular, for English, Trankit is significantly better than Stanza on sentence segmentation (+9.36%) and dependency parsing …
github.com

Publié

dans

par

Étiquettes :