Figure 2From: Scaling up the evaluation of psychotherapy: evaluating motivational interviewing fidelity via statistical text classification Agreement of human raters and model raters. Session, reliability based on sums of codes across the entire session; Talk Turn, reliability based on unique codes in each talk turn.Back to article page