OurDojo K-12 GenAI Evaluation Report
ourdojo.org

The fastest, most affordable way to get an
independent AI content quality evaluation in K-12

1,000 samples · $2,500 (Pilot Pricing) · Ready in 2 weeks

Who This is For

Built for product teams selling into districts, schools, and administrators who ask hard questions about quality. If you’re making claims about learning outcomes, grade-level alignment, or AI accuracy — and you don’t yet have independent data to back them up — this is for you.

Scoping Call 30-min meeting to agree on evaluation dimensions and sampling method
1,000-Sample Evaluation AI content evaluated against Student Achievement Partners, CAST, and Achievement Network (ANet) established frameworksOurDojo designs the evaluation protocol, sampling approach, and business analysis, and uses open-source evaluators aligned to established instructional frameworks to assess outputs at scale.
Deliverables Report with diagnostic insights, raw data in CSV format, failure mode heatmap
Findings Walkthrough 1-hour meeting to review results and discuss how to leverage them for procurement, marketing, and product development

How We Compare

OurDojo ISTE / Common Sense LearnPlatform WestEd / SRI
Price $2.5K $5K–$10K $18K–$75K $150K–$500K+
Speed 2 weeks 3–6 months 3–6 months 12–36 months
Evidence Type Content quality Rubric-based review Usage data Efficacy / RCTs

Why Teams Use This

Your Team

Jay Syz — Founder & Lead Evaluator

3+ years implementing AI evaluations at venture-backed AI companies. Engineering foundations from Google. Built OurDojo because K-12 AI adoption is moving faster than the availability of independent evidence. Supported by an expert validation networkIncludes a staff engineer (14 YoE), a data scientist (5 YoE), a veteran language teacher (22 YoE), and a veteran math teacher (20 YoE). of four specialists across data science, engineering, and K–12 education.