Apr 9 (edited) • 💬 General
Cogito v1 Preview: Introducing IDA as a path to general superintelligence
Iterated Distillation and Amplification (IDA) FTW.
"Takeaways:
  • We are releasing the strongest LLMs of sizes 3B, 8B, 14B, 32B and 70B under open license. Each model outperforms the best available open models of the same size, including counterparts from LLaMA, DeepSeek, and Qwen, across most standard benchmarks. In particular, the 70B model also outperforms the newly released Llama 4 109B MoE model.
  • The LLMs are trained using Iterated Distillation and Amplification (IDA) - a scalable and efficient alignment strategy for general superintelligence using iterative self-improvement.
  • Each model can answer directly (standard LLM), or self-reflect before answering (like reasoning models).
  • We plan to release larger models, including 109B, 400B, 671B, in the coming weeks / months, as well as improved checkpoints for each of these model sizes.
You can download the models on Huggingface or Ollama, or use them directly through the API on Fireworks AI or Together AI."
Marcio Pacheco