I just completed the 2nd largest statistical audit of scientific literature ever. Unlike academic papers that hide methods behind paywalls, here's EVERYTHING
THE ACHIEVEMENT:
✓ 4.27 million papers analyzed
✓ 210GB PostgreSQL database
✓ 155 minutes processing time
✓ Zero errors (validated on 60-paper sample)
THE DOCUMENTATION (all public):
→ Complete regex patterns (49 distinct regex patterns)
→ Database schema (every table, every field)
→ Processing code (500+ lines, commented)
→ Hardware specs (Ryzen 9 7900, 128GB RAM)
→ Rigor scoring algorithm (100-point system)
→ Validation methodology
→ Known limitations (CIs at 1.7% - likely in tables)
Why share everything? Because:
1. Real science is reproducible
2. Extraordinary claims need extraordinary evidence
3. Someone should verify this
Found: Only 1.7% of papers report confidence intervals. 0.6% do power analysis.
Verify it yourself. The code is there.
1
0 comments
Talon Neely
2
I just completed the 2nd largest statistical audit of scientific literature ever. Unlike academic papers that hide methods behind paywalls, here's EVERYTHING
powered by
Built Simple, We make it work
skool.com/built-simple-we-make-it-work-9812
We downloaded Stack Overflow and PubMed Central. Built a faster Windows Search. Automated entire workflows. Now we're sharing everything.
Build your own community
Bring people together around your passion and get paid.
Powered by