SCB 10X introduces Typhoon-7b, a groundbreaking Thai Language Model (LLM) text generator that surpasses all existing models in the Thai language. It boasts the same performance as GPT-3.5 in Thai.
To assess the model’s efficacy in the Thai language, SCB 10X developed a testing suite called ThaiExam. This suite includes exams for high school students and investment data from experts in Thailand. The results showed that the Typhoon-7b model outperforms all freely available Thai models and achieves comparable scores to GPT-3.5.
Moreover, SCB 10X offers the model for free use under the Apache License 2.0. It is a basic model that has not undergone any fine-tuning. Users are advised to perform fine-tuning with the desired data before actual implementation.
Digging deeper into the development process, the Typhoon-7b model evolves from Mistral-7B by incorporating 5,000 Thai words. It is then further trained with LoRA. Experimental findings reveal that Typhoon-7b significantly reduces the number of Thai language tokens by up to 2.62 times compared to GPT-4.
Access the model on HF: https://huggingface.co/scb10x/typhoon-7b
Source: Typhoon: Thai Large Language Models
TLDR: SCB 10X unveils Typhoon-7b, an impressive Thai language model that surpasses all others in performance. It equals the capabilities of GPT-3.5 in Thai. SCB 10X offers the model for free use, but users are recommended to fine-tune it before implementation. The Typhoon-7b model achieves remarkable token reduction in Thai compared to GPT-4.
Leave a Comment