pull down to refresh

Swiss-made, robots.txt compliant open-source LLMs
We present Apertus, a fully open suite of large language models (LLMs) designed to address two systemic shortcomings in today's open model ecosystem: data compliance and multilingual representation. Unlike many prior models that release weights without reproducible data pipelines or regard for content-owner rights, Apertus models are pretrained exclusively on openly available data, retroactively respecting robots.txt exclusions and filtering for non-permissive, toxic, and personally identifiable content.

Here we go!
Note that the main models are KYC gated 1 but also licensed under Apache 2.0, so ungated models are distributed by the community, see for example 8B-ungated

Footnotes

  1. This shit is totally getting out of hand. If you're in the EU or CH, remember that between a Russian Invasion and a Totalitarian Compliance Regime, the latter is already there and fucking you up.
reply