Breast imaging understanding how accuracy is measured when lesions are the unit of analysis review

Performance Metric

Author

Tea Tasting Lady

Published

2024-09-28

Info

Breast imaging: understanding how accuracy is measured when lesions are the unit of analysis. Breast J. Nov-Dec 2012;18(6):557-63. doi: 10.1111/tbj.12009.

Introduction

Breast Cancer Patients에서
- multiple lesions이 있거나 알려진 primary tumor 외의 다른 abnormal area를 찾는 경우가 있음
  - e.g, Cancer 관리를 위한 mammography 이후 추가 imaging 촬영, MRI로 수술전 계획(Preoperative planning)을 세우거나 staging of disease를 정하는 것 등
- 이 경우, Radiologists가
  - location of additional lesions을 정확히 찾고 이 lesions이 significant한 지 올바르게 판정하는 것이 중요함
  - Significant: lesion이 malignant하거나 treatment planning을 세워야 할 정도로 malignant로 발전할 가능성이 큼(high potential)을 의미
Lesion as the unit of analysis
- 위의 경우, lesion level analysis가 필요함
- 일반적인 SN, SP, ROC curve estimates의 사용은 한계가 있음
Purpose of the study
- 환자가 위치를 정확히 알아야 하는 multiple lesion이 있을 경우의 lesion level analysis에 대해 다룸
- lesion이 unit of analysis일 때는 일반적으로 사용하는(usual estimate) specificity나 ROC curve 사용이 어려움을 설명하고자 함

Materials and Methods

Illustrative Example

FDG PEM(F18-2-fluoro-2-deoxy-D-glucose positron emission) MMG study
- objective: FDG PEM의 accuracy 평가
- subjects: women newly diagnosed with breast cancer(invasive or in situ)
- MRI image와 FDG PEM image를 비교
  - 유방절제술(mastectomy)의 pathology information이 reference standard

Illustrative Figure

10 patients as an example
it is not meant to reflect the true accuracy of any imaging test
9 women with significant disease, one woman without breast cancer
- 17 significant lesions (10 true positives, 7 false negatives)
- 3 false-positive findings

Methods of Analysis

Sensitivity of a test
- the probability that the test indicates disease is present given that disease is truly present
- \(SN = P(Test + | Disease)\)
Specificity of a test
- the probability that the test indicates there is no disease given that disease is truly absent
- \(SP = P(Test - | No Disease)\)
Unit of Analysis가 무엇인지 중요함

Results

Patient-Level Analysis

Typical way of evaluating the accuracy of a medical test
Goal
- 질병이 있는 사람들과 없는 사람들을 test가 얼마나 잘 탐지(detect)하는 지를 평가함
2 by 2 table
\(SN = P(Test + | Disease) = \frac{P(Test + \& Disease)}{P(Disease)} = \frac{(8/10)}{(9/10)} = 8/9 = 89%\)
\(SP = P(Test -| No\ disease) = \frac{P(Test - \& No\ disease)}{P(No\ disease)} = \frac{(0/10)}{(1/10)} = 0%\)

Breast-Level Analysis

The breast as the unit of analysis
- 이 경우, patient level analysis에 비해 n수가 2배가 됨(each breast에 대해 평가)
2 by 2 Table
\(SN = P(Test + | Disease) = \frac{P(Test + \& Disease)}{P(Disease)} = \frac{(8/20)}{(10/20)} = 8/10 = 80%\)
\(SP = P(Test -| No\ disease) = \frac{P(Test - \& No\ disease)}{P(No\ disease)} = \frac{(9/20)}{(10/20)} = 90%\)
- Patient level analysis의 SP 0%가 Breast level에서는 90%가 됨

Lesion-Level Analysis

The lesion as the unit of analysis
- Figure 1에서 20 regions이 areas of interest였음
2 by 2 Table
Lesion level에서는 True Negative를 정의할 수 없음
- Radiologist가 식별하지 않고 또 Pathology에서도 암이 없다고 나온 lesion을 어떻게 ‘True Negative’하다고 lesion level에서 정의할 수 있을지?
  - “True Negative”라고 lesion level에서 정의할 수 있을 영역의 경우의 수가 너무 많기 때문에 위 table의 물음표(?)를 채울 명확한 방법이 없음
    - (There are infinitely many possibilities for defining areas that could be marked as true negatives. The end result is that we have no obvious way to fill in this cell)
- 따라서 Lesion level에서는 Specificity 계산을 할 수 없음 (민감도는 가능함)
\(SN = P(Test + | Disease) = \frac{P(Test + \& Disease)}{P(Disease)} = 10/17 = 59%\)

Receiver Operating Characteristic (ROC) Curve Analysis

Empirical ROC curve
- data are first ranked by the test result (값이 높을수록 질병이 있을 확률이 큼)
- the values above that threshold are considered positive
- For each threshold, SN and SP is calculated
  - ROC Curve is the plot of SN by 1 - SP
- Lesion level에서 SP 계산을 할 수 없으므로 ROC curve도 estimate 할 수 없음

Discussion

예시에서 Pathology information이 available 했음
- all women in our example had a bilateral mastectomy
- breast or lesion level 평가를 위해서는 confirmed case only 등으로 study design을 조정해야함
한 환자가 Multiple lesions이 있는 경우
- Patient level 또는 Breast level에서는 적절히 평가할 수 없음
- the location of the lesions의 평가 또한 patient/breast level에선 고려하지 않음
Correctly locate a lesion에 대한 test의 accuracy 평가
- LROC (Localization ROC)
  - only single lesion에 대해 평가
- FROC (Free Response ROC)
  - Observation은 mark-rating pairs의 형태
  - Mark: location identified by a radiologist as appearing suspicious of disease
  - Rating: the assessment of the probability disease is present at that location
  - the fraction of identified locations that are cancerous lesions by the fraction of the identified locations that are not의 plot임
- the average number of false-positives per patient(FPP)
  - Lesion level sensitivity와 함께 FPP를 보고
    - Proportion of patients who have any false-positive finding도 함께 보고
- the Poisson model for the number of false positives
  - the number of FPs 가 Poisson distribution을 따른다고 가정하고 Patient level에서의 Specificity를 추정함(the probability of no false-positive findings in a patient 추정에 Poisson 분포를 적용)
  - The lesion-level sensitivity와 patient level specificity에 대한 joint modeling을 random effects model로 설정할 수도 있음(by Zwinderman et al.)