Best Open Source LLM:Tencent Hunyuan-Large: The 389 Billion Parameter Model Outperforming Llama 3 and DeepSeek V2

Tiime@lemmy.ml · 7 months ago

Best Open Source LLM:Tencent Hunyuan-Large: The 389 Billion Parameter Model Outperforming Llama 3 and DeepSeek V2

Smorty [she/her]@lemmy.blahaj.zone · 7 months ago

So would the granite models count as “open source”? They do publish the training data they used.

hendrik@palaver.p3x.de · edit-2 7 months ago

Seems they’ve outlined the used datasets in Annex B of their paper. I haven’t checked if the list is exhaustive and if the training code and scripts to prepare the data are there… If they are, I’d say this is indeed a proper open-source model. And the weights are licensed under an Apache license.

Best Open Source LLM:Tencent Hunyuan-Large: The 389 Billion Parameter Model Outperforming Llama 3 and DeepSeek V2

Best Open Source LLM:Tencent Hunyuan-Large: The 389 Billion Parameter Model Outperforming Llama 3 and DeepSeek V2

Best Open Source LLM:Tencent Hunyuan-Large: The 389 Billion Parameter Model Outperforming Llama 3 and DeepSeek V2 -