
Trillion 7B Technical Report

Sungjun Han, Juyoung Suk, Suyeong An, Hyungguk Kim, Kyuseok Kim, Wonsuk Yang, Seungtaek Choi, Jamin Shin. No Venue, 2025

[Paper]
Training Techniques

We introduce Trillion-7B, the most token-efficient Korean-centric multilingual LLM available. Our novel Cross-lingual Document Attention (XLDA) mechanism enables highly efficient and effective knowledge transfer from English to target languages like Korean and Japanese. Combined with optimized data mixtures, language-specific filtering, and tailored tokenizer construction, Trillion-7B achieves competitive performance while dedicating only 10% of its 2T training tokens to multilingual data and requiring just 59.4K H100 GPU hours ($148K) for full training. Comprehensive evaluations across 27 benchmarks in four languages demonstrate Trillion-7B’s robust multilingual performance and exceptional cross-lingual consistency.
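The abstract does not spell out how XLDA modifies attention, but the general idea of cross-lingual document attention during packed pretraining can be illustrated with a toy mask. The sketch below is a minimal illustration under assumptions rather than the paper's actual formulation: it assumes documents in several languages are packed into one sequence and that tokens of a non-English document are additionally allowed to attend to earlier English tokens, where plain document masking would restrict attention to each document alone.

```python
import numpy as np

def packed_attention_mask(doc_ids, doc_langs, cross_lingual=True):
    """Build a causal attention mask for a packed sequence of documents.

    doc_ids   : per-token document index within the packed sequence
    doc_langs : per-token language tag ("en", "ko", ...)

    With plain document masking, a token attends only to earlier tokens of
    its own document. The cross-lingual variant sketched here additionally
    lets tokens of a non-English document attend to earlier English tokens.
    This illustrates (it does not reproduce) the kind of relaxation that a
    cross-lingual document attention scheme describes; the rule and names
    here are assumptions for the example.
    """
    n = len(doc_ids)
    mask = np.zeros((n, n), dtype=bool)
    for q in range(n):
        for k in range(q + 1):                      # causal: keys at or before the query
            same_doc = doc_ids[q] == doc_ids[k]
            cross = (cross_lingual
                     and doc_langs[q] != "en"       # non-English query document
                     and doc_langs[k] == "en")      # may attend to English context
            mask[q, k] = same_doc or cross
    return mask

# Toy packed sequence: an English doc (tokens 0-2) followed by a Korean doc (3-5).
doc_ids   = [0, 0, 0, 1, 1, 1]
doc_langs = ["en", "en", "en", "ko", "ko", "ko"]
print(packed_attention_mask(doc_ids, doc_langs).astype(int))
```

With `cross_lingual=False` the mask is block-diagonal (each document sees only itself); with the flag on, the Korean block also sees the English block, which is the intuition behind transferring knowledge from English-heavy data to the target language.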

https://huggingface.co/discussions/paper/680879ebd6dc8bf64565c9bb

Similar Work