mT5 Description
The multilingual T5 (mT5) is a highly versatile pretrained text-to-text transformer model, developed using a methodology akin to that of T5. This repository serves as a resource for replicating the findings outlined in the mT5 research paper.
mT5 has been trained on the extensive mC4 corpus, which encompasses 101 different languages, including but not limited to Afrikaans, Albanian, Amharic, Arabic, Armenian, Azerbaijani, Basque, Belarusian, Bengali, Bulgarian, Burmese, Catalan, Cebuano, Chichewa, Chinese, Corsican, Czech, Danish, Dutch, English, Esperanto, Estonian, Filipino, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hausa, Hawaiian, Hebrew, Hindi, Hmong, Hungarian, Icelandic, Igbo, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Kurdish, Kyrgyz, Lao, Latin, Latvian, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Maori, Marathi, Mongolian, Nepali, Norwegian, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Samoan, Scottish Gaelic, Serbian, Shona, Sindhi, and many others. This impressive range of languages makes mT5 a valuable tool for multilingual applications across various fields.
Pricing
Integrations
Company Details
Product Details
mT5 Features and Options
mT5 User Reviews
Write a Review- Previous
- Next