Cogito v1 Preview Introducing IDA as a path to general superintelligence · Data Alchemy

Marcio Pacheco

Apr 9 (edited) • 💬 General

Iterated Distillation and Amplification (IDA) FTW.

"Takeaways:

We are releasing the strongest LLMs of sizes 3B, 8B, 14B, 32B and 70B under open license. Each model outperforms the best available open models of the same size, including counterparts from LLaMA, DeepSeek, and Qwen, across most standard benchmarks. In particular, the 70B model also outperforms the newly released Llama 4 109B MoE model.
The LLMs are trained using Iterated Distillation and Amplification (IDA) - a scalable and efficient alignment strategy for general superintelligence using iterative self-improvement.
Each model can answer directly (standard LLM), or self-reflect before answering (like reasoning models).
We plan to release larger models, including 109B, 400B, 671B, in the coming weeks / months, as well as improved checkpoints for each of these model sizes.

You can download the models on Huggingface or Ollama, or use them directly through the API on Fireworks AI or Together AI."

https://www.deepcogito.com/research/cogito-v1-preview
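
For anyone new to the idea, here is a toy sketch of the amplify-then-distill loop that IDA refers to. It is only an illustration under loud assumptions, not Deep Cogito's training recipe: the "model" is a hypothetical lookup table of guesses for square roots, "amplification" is one Newton step of extra compute on top of the model's own answer, and "distillation" stores that improved answer so the next model returns it directly. The names `amplify`, `distill`, and `ida` are made up for this example.

```python
import math


def amplify(model, x, steps=1):
    """Spend extra compute on top of the model's direct answer.

    Toy assumption: 'extra compute' is Newton's iteration for sqrt(x),
    started from whatever the current model answers.
    """
    guess = model[x]
    for _ in range(steps):
        guess = 0.5 * (guess + x / guess)
    return guess


def distill(model, inputs):
    """Build a new model that gives the amplified answer directly.

    Toy assumption: the 'model' is just a lookup table, so distillation
    is storing the amplified answers.
    """
    return {x: amplify(model, x) for x in inputs}


def ida(inputs, rounds=6):
    """Alternate amplification and distillation for a fixed number of rounds."""
    model = {x: 1.0 for x in inputs}  # weak starting model: always answers 1.0
    history = []                      # worst-case error after each round
    for _ in range(rounds):
        model = distill(model, inputs)
        history.append(max(abs(model[x] - math.sqrt(x)) for x in inputs))
    return model, history


model, history = ida([2.0, 9.0, 50.0])
for round_number, error in enumerate(history, start=1):
    print(f"round {round_number}: worst error = {error:.6f}")
```

The point of the sketch is the shape of the loop: each round the model's direct answers get better because the previous round's amplified answers were folded back into it. In a real LLM setting both steps are far more involved than a Newton step and a table.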
