TenTrans (www.github.com/TenTrans) Open-Source Toolkit Sponsor.
-
Tencent
- Beijing, China
- www.bojiehu.com
-
joeynmt Public
Forked from joeynmt/joeynmtMinimalist NMT for educational purposes
Python Apache License 2.0 UpdatedDec 21, 2020 -
espnet Public
Forked from espnet/espnetEnd-to-End Speech Processing Toolkit
Python Apache License 2.0 UpdatedJul 16, 2020 -
pragmatic_segmenter Public
Forked from diasks2/pragmatic_segmenterPragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
Ruby MIT License UpdatedFeb 24, 2019 -
flores Public
Forked from facebookresearch/floresFacebook Low Resource (FLoRes) MT Benchmark
Shell Creative Commons Attribution Share Alike 4.0 International UpdatedFeb 12, 2019 -
indic_nlp_library Public
Forked from anoopkunchukuttan/indic_nlp_libraryResources and tools for Indian language Natural Language Processing
Python GNU General Public License v3.0 UpdatedJan 30, 2019 -
sentencepiece Public
Forked from google/sentencepieceUnsupervised text tokenizer for Neural Network-based text generation.
C++ Apache License 2.0 UpdatedDec 25, 2018 -
pytext Public
Forked from facebookresearch/pytextA natural language modeling framework based on PyTorch
Python Other UpdatedDec 21, 2018 -
bpemb Public
Forked from bheinzerling/bpembPre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
Python MIT License UpdatedDec 5, 2018 -
langid.py Public
Forked from saffsd/langid.pyStand-alone language identification system
Python Other UpdatedNov 14, 2018
