A comparison of interobserver reproducibility of gleason grading of prostatic carcinoma in Japan and the United States

Tetsunari Oyama; William C. Allsbrook; Kohei Kurokawa; Hadzki Matsuda; Atsuki Segawa; Takaaki Sano; Keiji Suzuki; Jonathan I. Epstein

A comparison of interobserver reproducibility of gleason grading of prostatic carcinoma in Japan and the United States

Tetsunari Oyama, William C. Allsbrook, Kohei Kurokawa, Hadzki Matsuda, Atsuki Segawa, Takaaki Sano, Keiji Suzuki, Jonathan I. Epstein

School of Medicine

Research output: Contribution to journal › Review article › peer-review

49 Scopus citations

Abstract

Context. - Gleason grading is now the sole prostatic carcinoma grading system recommended by the World Health Organization. It is imperative that there be good interobserver reproducibility within this system worldwide. To our knowledge, there are no studies, using the same specimens, that compare the interobserver reproducibility of Gleason grading in Japan and the United States. Objective. - To compare the interobserver reproducibility of Gleason grading of prostatic carcinoma in Japan and the United States using, in Japan, images from the identical biopsy glass slides that were originally graded in the United States. Design. - Microsopic images from 37 needle biopsies of prostatic carcinoma were placed on CD-ROM and distributed to 14 Japanese pathologists for grading. These 14 physicians included 8 general pathologists and 6 pathologists with a special interest in urologic pathology. The needle biopsies had been previously reviewed so that a consensus diagnosis could be formed by a panel of urologic pathologists in the United States and Canada. Interobserver agreement with the consensus diagnoses was calculated by determining the overall κ coefficient for the Japanese pathologists and then compared to the interobserver agreement among American general pathologists who had previously graded identical needle biopsies from which the CD-ROM images had been taken. Results. - The interobserver agreement with the consensus diagnoses for the 4 Gleason grading groups (Gleason grades 2-4, 5-6, 7, and 8-10) among the Japanese urologic pathologists in this series of cases was substantial (overall κ = 0.68), and for the Japanese general pathologists, it was moderate (overall κ = 0.49), similar to that reported in the earlier study of American general pathologists (overall κ = 0.44). The major interobserver reproducibility problem for both Japanese and American general pathologists is undergrading. The major areas of undergrading are the underdiagnosis of Gleason scores 5-6 as Gleason scores 2-4, and the underdiagnosis of cribriform sheets and fragments of cribriform Gleason pattern 4 carcinoma as Gleason pattern 3. Conclusions. - The interobserver reproducibility of the Gleason grading for this collection of specimens was similar among Japanese and American general pathologists. The overall κ values for these generalists of 0.44 and 0.49 are only in the moderate (0.41-0.60) range of interobserver agreement when compared to 0.68, substantial (0.61-0.80) agreement, for Japanese urologic pathologists. Educational efforts to improve Gleason grading have been shown to be effective and are clearly warranted.

Original language	English (US)
Pages (from-to)	1004-1010
Number of pages	7
Journal	Archives of Pathology and Laboratory Medicine
Volume	129
Issue number	8
State	Published - Aug 2005

ASJC Scopus subject areas

Pathology and Forensic Medicine
Medical Laboratory Technology

Cite this

@article{ae24f3e0876d44efb8d7418f3113b60e,

title = "A comparison of interobserver reproducibility of gleason grading of prostatic carcinoma in Japan and the United States",

abstract = "Context. - Gleason grading is now the sole prostatic carcinoma grading system recommended by the World Health Organization. It is imperative that there be good interobserver reproducibility within this system worldwide. To our knowledge, there are no studies, using the same specimens, that compare the interobserver reproducibility of Gleason grading in Japan and the United States. Objective. - To compare the interobserver reproducibility of Gleason grading of prostatic carcinoma in Japan and the United States using, in Japan, images from the identical biopsy glass slides that were originally graded in the United States. Design. - Microsopic images from 37 needle biopsies of prostatic carcinoma were placed on CD-ROM and distributed to 14 Japanese pathologists for grading. These 14 physicians included 8 general pathologists and 6 pathologists with a special interest in urologic pathology. The needle biopsies had been previously reviewed so that a consensus diagnosis could be formed by a panel of urologic pathologists in the United States and Canada. Interobserver agreement with the consensus diagnoses was calculated by determining the overall κ coefficient for the Japanese pathologists and then compared to the interobserver agreement among American general pathologists who had previously graded identical needle biopsies from which the CD-ROM images had been taken. Results. - The interobserver agreement with the consensus diagnoses for the 4 Gleason grading groups (Gleason grades 2-4, 5-6, 7, and 8-10) among the Japanese urologic pathologists in this series of cases was substantial (overall κ = 0.68), and for the Japanese general pathologists, it was moderate (overall κ = 0.49), similar to that reported in the earlier study of American general pathologists (overall κ = 0.44). The major interobserver reproducibility problem for both Japanese and American general pathologists is undergrading. The major areas of undergrading are the underdiagnosis of Gleason scores 5-6 as Gleason scores 2-4, and the underdiagnosis of cribriform sheets and fragments of cribriform Gleason pattern 4 carcinoma as Gleason pattern 3. Conclusions. - The interobserver reproducibility of the Gleason grading for this collection of specimens was similar among Japanese and American general pathologists. The overall κ values for these generalists of 0.44 and 0.49 are only in the moderate (0.41-0.60) range of interobserver agreement when compared to 0.68, substantial (0.61-0.80) agreement, for Japanese urologic pathologists. Educational efforts to improve Gleason grading have been shown to be effective and are clearly warranted.",

author = "Tetsunari Oyama and Allsbrook, {William C.} and Kohei Kurokawa and Hadzki Matsuda and Atsuki Segawa and Takaaki Sano and Keiji Suzuki and Epstein, {Jonathan I.}",

year = "2005",

month = aug,

language = "English (US)",

volume = "129",

pages = "1004--1010",

journal = "Archives of Pathology and Laboratory Medicine",

issn = "0003-9985",

publisher = "College of American Pathologists",

number = "8",

}

TY - JOUR

T1 - A comparison of interobserver reproducibility of gleason grading of prostatic carcinoma in Japan and the United States

AU - Oyama, Tetsunari

AU - Allsbrook, William C.

AU - Kurokawa, Kohei

AU - Matsuda, Hadzki

AU - Segawa, Atsuki

AU - Sano, Takaaki

AU - Suzuki, Keiji

AU - Epstein, Jonathan I.

PY - 2005/8

Y1 - 2005/8

N2 - Context. - Gleason grading is now the sole prostatic carcinoma grading system recommended by the World Health Organization. It is imperative that there be good interobserver reproducibility within this system worldwide. To our knowledge, there are no studies, using the same specimens, that compare the interobserver reproducibility of Gleason grading in Japan and the United States. Objective. - To compare the interobserver reproducibility of Gleason grading of prostatic carcinoma in Japan and the United States using, in Japan, images from the identical biopsy glass slides that were originally graded in the United States. Design. - Microsopic images from 37 needle biopsies of prostatic carcinoma were placed on CD-ROM and distributed to 14 Japanese pathologists for grading. These 14 physicians included 8 general pathologists and 6 pathologists with a special interest in urologic pathology. The needle biopsies had been previously reviewed so that a consensus diagnosis could be formed by a panel of urologic pathologists in the United States and Canada. Interobserver agreement with the consensus diagnoses was calculated by determining the overall κ coefficient for the Japanese pathologists and then compared to the interobserver agreement among American general pathologists who had previously graded identical needle biopsies from which the CD-ROM images had been taken. Results. - The interobserver agreement with the consensus diagnoses for the 4 Gleason grading groups (Gleason grades 2-4, 5-6, 7, and 8-10) among the Japanese urologic pathologists in this series of cases was substantial (overall κ = 0.68), and for the Japanese general pathologists, it was moderate (overall κ = 0.49), similar to that reported in the earlier study of American general pathologists (overall κ = 0.44). The major interobserver reproducibility problem for both Japanese and American general pathologists is undergrading. The major areas of undergrading are the underdiagnosis of Gleason scores 5-6 as Gleason scores 2-4, and the underdiagnosis of cribriform sheets and fragments of cribriform Gleason pattern 4 carcinoma as Gleason pattern 3. Conclusions. - The interobserver reproducibility of the Gleason grading for this collection of specimens was similar among Japanese and American general pathologists. The overall κ values for these generalists of 0.44 and 0.49 are only in the moderate (0.41-0.60) range of interobserver agreement when compared to 0.68, substantial (0.61-0.80) agreement, for Japanese urologic pathologists. Educational efforts to improve Gleason grading have been shown to be effective and are clearly warranted.

AB - Context. - Gleason grading is now the sole prostatic carcinoma grading system recommended by the World Health Organization. It is imperative that there be good interobserver reproducibility within this system worldwide. To our knowledge, there are no studies, using the same specimens, that compare the interobserver reproducibility of Gleason grading in Japan and the United States. Objective. - To compare the interobserver reproducibility of Gleason grading of prostatic carcinoma in Japan and the United States using, in Japan, images from the identical biopsy glass slides that were originally graded in the United States. Design. - Microsopic images from 37 needle biopsies of prostatic carcinoma were placed on CD-ROM and distributed to 14 Japanese pathologists for grading. These 14 physicians included 8 general pathologists and 6 pathologists with a special interest in urologic pathology. The needle biopsies had been previously reviewed so that a consensus diagnosis could be formed by a panel of urologic pathologists in the United States and Canada. Interobserver agreement with the consensus diagnoses was calculated by determining the overall κ coefficient for the Japanese pathologists and then compared to the interobserver agreement among American general pathologists who had previously graded identical needle biopsies from which the CD-ROM images had been taken. Results. - The interobserver agreement with the consensus diagnoses for the 4 Gleason grading groups (Gleason grades 2-4, 5-6, 7, and 8-10) among the Japanese urologic pathologists in this series of cases was substantial (overall κ = 0.68), and for the Japanese general pathologists, it was moderate (overall κ = 0.49), similar to that reported in the earlier study of American general pathologists (overall κ = 0.44). The major interobserver reproducibility problem for both Japanese and American general pathologists is undergrading. The major areas of undergrading are the underdiagnosis of Gleason scores 5-6 as Gleason scores 2-4, and the underdiagnosis of cribriform sheets and fragments of cribriform Gleason pattern 4 carcinoma as Gleason pattern 3. Conclusions. - The interobserver reproducibility of the Gleason grading for this collection of specimens was similar among Japanese and American general pathologists. The overall κ values for these generalists of 0.44 and 0.49 are only in the moderate (0.41-0.60) range of interobserver agreement when compared to 0.68, substantial (0.61-0.80) agreement, for Japanese urologic pathologists. Educational efforts to improve Gleason grading have been shown to be effective and are clearly warranted.

UR - http://www.scopus.com/inward/record.url?scp=23044450720&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=23044450720&partnerID=8YFLogxK

M3 - Review article

C2 - 16048389

AN - SCOPUS:23044450720

SN - 0003-9985

VL - 129

SP - 1004

EP - 1010

JO - Archives of Pathology and Laboratory Medicine

JF - Archives of Pathology and Laboratory Medicine

IS - 8

ER -

A comparison of interobserver reproducibility of gleason grading of prostatic carcinoma in Japan and the United States

Abstract

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this