BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages

BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages