their ability to concretelyevaluate student growth [12], [13], [33]. Direct assessments are complicated by three considerations: validity, reliability, andethical limitations on truly scientific study design. Validity asks: does the assessment measurewhat it is supposed to measure? Reliability asks: can writing be consistently and quantitativelyevaluated by different evaluators? Finally, ethics forbid writing centers from executing theclassic “treatment/no treatment” experimental design: true negative controls would requiredenial of writing center access to students who want it. Due to these three constraints, “thetypical evaluation of writing programs...usually fails to obtain statistically significant results” [34].For this reason