• hendrik@palaver.p3x.de
      link
      fedilink
      English
      arrow-up
      0
      ·
      edit-2
      2 months ago

      Seems they’ve outlined the used datasets in Annex B of their paper. I haven’t checked if the list is exhaustive and if the training code and scripts to prepare the data are there… If they are, I’d say this is indeed a proper open-source model. And the weights are licensed under an Apache license.