ML projects An extrinsic evaluation reveals what is truly important • Thinking back to our NER model: ◦ Gold annotation: "Harrisburg school district" ◦ Prediction: "Harrisburg" • In a typical "strict" evaluation setting, this would be a FP and a FN, resulting in lower precision and lower recall • In downstream processing however, the correct school could be identified, leading to no final error in the extrinsic evaluation 24