Research - (2021) Volume 12, Issue 6
Received: 29-May-2021
Published:
18-Jun-2021
, DOI: 10.37421/2157-7099.2021.12.576
Citation: Shikha Chopra. "Application of Milan System for Reporting Salivary Gland Cytopathology: A 7 Year Study." J Cytol Histol 12 (2021): 576.
Copyright: © 2021 Chopra S. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Introduction: Salivary gland shows various pathological conditions ranging from cystic, inflammatory, tumor like and neoplastic lesions. Fine needle aspiration cytology (FNAC) plays an important role in evaluating salivary gland (SG) tumors. Salivary gland tumors are one of the most heterogeneous groups of neoplasms with cytopathological features overlapping among the entities and making it difficult to assign to specific category. Due to these facts, salivary gland cytopathology is one of the most challenging areas of cytology. The lack of a uniform reported guidelines in salivary gland cytopathology leads to inter-observer variability and disagreements. The present study was undertaken to assess the degree of inter-observer reproducibility for diagnostic categorization of salivary gland lesions utilizing MSRSGC among pathologists with varying experience along with its role in providing a framework for reporting salivary gland lesions.
Materials and Methods: In this cross sectional study, total of 44 cases of salivary gland lesions subjected to FNAC over a period of 7 year were studied. The cases were critically reviewed by 2 pathologists and a pathology resident with variable experience in cytopathology using MSRSGC in our institution. Inter-observer variability was assessed by comparing the agreement between two cytopathologists and pathology resident by using Cohen’s kappa statistics (κ score) and interpretation of the results was done using scale of Landis and Koch.
Results: All the salivary gland aspirates were categorized according to MSRSGC.Out of 44 cases, maximum cases 22 (50%) were classified under IVA (BN) followed by 27.27% to 29.5% cases classified under II (NN), 2.27-4.55% of cases under Category IVB (SUMP), 4.55% under category V( SM) and 6.82% cases under category VI ( M). Inter-observer variability (IOV) was calculated for individual category in Milan system using Cohens kappa test, which was found to be in the almost perfect agreement range as per Landis and Koch, for categories II, IVA, V, VI ( κ score 0.89- 1). Kappa score ranged from 0.645 - 1 for category I (ND), which showed substantial to an almost perfect agreement. Whereas, category IVB (SUMP) showed variable results, with substantial agreement (κ score 0.656) to no agreement (κ score 0) between different observers.The overall IOV showed an almost perfect agreement with a kappa score of 0.861(obs1 vs 2) ,0.896 (obs1 vs R), 0.965 (obs 2 vs R). The data was found to be statistically significant (p=< .0001).
Conclusion: MSRSGC is a very efficient system and has the potential to standardize salivary gland FNA diagnoses, providing clear prognostic and management information to clinicians and surgeons. This system can be used with good reproducibility between observers with variable cytopathology experience. Placing lesions in categories using MSRSGC can result in optimal management of discordant cases without using a specific diagnosis. Application of MSRSGC has immense value for standardization of reporting of salivary gland FNAC.Hence we recommend the use of Milan system for reporting salivary gland cytopathology.
MSRSGC: Milan system for reporting salivary gland cytopathology • IOV: Inter-observer variability • SG: Salivary Gland • FNAC: Fine needle aspiration cytology • κ score: kappa score
Primary objective: To study and categorize the fine needle aspiration cytology of salivary gland lesion using the Milan system (MSRSGC) in ourinstitution.
Secondary objective: To assess inter-observer variability to determine applicability of thesystem.
Salivary gland (SG) shows various pathological conditions ranging from cystic, inflammatory, tumor like and neoplastic lesions. Salivary gland neoplasm are relatively uncommon and constitute only about 0.5% of all cancers. The global annual incidence of overall salivary gland tumors (SGT) varies approximately 0.4 to 13.5 cases per 100,000 people and 0.4-2.6 per 100,000 for malignant tumor. [1] Benign tumor accounts for 54-79% of all SGT, among them the parotid gland is the most common site. Malignant tumor accounts for 21-46% and more frequently seen in minor salivary glands and sublingual glands [2].
Fine needle aspiration (FNA) is minimally invasive, cost effective test. It can differentiate not only between neoplastic and non-neoplastic lesions, but also between low grade and high grade salivary gland lesions. [3] The accuracy of FNA to differentiate benign lesions from malignant lesions has been observed to be as high as 81% to 100%. However, the accuracy of FNA to provide a specific diagnosis has a wider range of 48% to 94%. [3-7]
Salivary gland tumors are one of the most heterogeneous groups of neoplasms with cytopathological features overlapping among the entities and making it difficult to assign to specific category. Due to these facts, salivary gland cytopathology is one of the most challenging areas of cytology. [8] These challenges are further complicated by the lack of a standardized, tiered diagnostic framework and resulted in inconsistent terminology among institutions, individual cytopathologists and lead to confusion in communication among cytopathologists. To address this need, collaborative efforts made by The American Society of Cytopathology (ASC) and International Academy of Cytology (IAC) have resulted in a evidence based tiered classification system to report salivary gland FNA. They proposed the Milan System for reporting salivary gland cytopathology (MSRSGC) which includes the six major diagnostic categories as well as a brief description of cytomorphologic features that can be applied in daily practice to accurately diagnose salivary gland lesions. [9,10]
For a diagnostic categorization system to be clinically useful, it must show good inter-observer reproducibility. The present study was undertaken to assess the degree of inter-observer reproducibility for diagnostic categorization of salivary gland lesions utilizing MSRSGC among pathologists with varying experience in reporting salivary gland cytopathology.
In this cross sectional study, total of 44 cases of salivary gland lesions subjected to FNAC over a period of 7 year (Jan 2013 to Dec 2019) were studied at Department of Pathology, St. Stephen’s Hospital, Delhi.
Sample Size & Sample Technique:
The study of SA Thiryayi et al. [11] observed that according to Milan system,10.5% were categorized into Malignant. Taking this value as reference, the minimum required sample size with 10% margin of error and 5% level of significance is 37 patients. To reduce margin of error, total sample size taken is 44.
Formula used is: N ≥ (p(1 -p))/(ME/zα)2
Where Zα is value of Z at two sided alpha error of 5%, ME is margin of error and p is proportion of patients categorized as Malignant.
Calculations: n>= ((.105*(1-.105))/(.1/1.96)2=36.10=37(approx.)
The estimated sample size comes out to be 37 cases. However, to reduce margin of error, 44 cases were included in the study.
Methodology: Detailed clinical history regarding age, sex, site, increase or decrease in size of swelling, associated symptoms was taken from the patient. All the patients were clinically examined in detail as per the study preformed & a careful palpation of the salivary gland was done to locate the site of aspiration. After brief explanation about the procedure to the patient, consent was taken. The procedure was done with the patient in supine or sitting position. Using the first finger and thumb of the free hand, the swelling was held firmly in place while the other hand was used to smoothly and rapidly insert the needle into the swelling .Under aseptic conditions, aspiration was done using 23 G needle attached to a 10cc syringe, followed by multiple short, quick back and forth movements with the needle for approximately 5-10 seconds without significantly changing the direction of the needle. Material was collected in the needle, which was withdrawn and transferred to clean labelled glass slides and smears were prepared. Smears were air dried, fixed in methanol for Giemsa staining as per standard techniques.
For retrospective cases, the slides were retrieved from the archives and the cytological diagnoses were reviewed and retrospectively classified by two senior cytopathologists and pathology resident. For prospective cases, slides were studied under light microcopy. The slides were independently and blindly reviewed by two cytopathologist (observer 1 and observer 2) and a pathology resident. All the cases were categorized by using the MSRSGC.
The Milan System for Reporting Salivary Gland Cytopathology (MSRSGC) consists of 6 broad categories.
1. Non Diagnostic (ND)
2. Non Neoplastic (NN)
3. Atypia of Undetermined Significance (AUS)
4. Neoplasm: (a)Benign(BN) ; (b)Salivary Gland Neoplasm of Uncertain Malignant Potential(SUMP)
5. Suspicious For Malignancy (SM)
6. Malignant (M)
At the end of review diagnostic categories assigned by the observers were tallied. Inter-observer variability was assessed by comparing the agreement between two cytopathologists and pathology resident by using Cohen’s kappa statistics (κ score) and interpretation of the results was done using scale of Landis and Koch, it was interpreted as [12,13] (Table 1)
Statistical Analysis: Categorical variables were presented in number and percentage (%) and continuous variables were presented as mean ± SD and median. Normality of data was tested by Kolmogorov-Smirnov test. If the normality was rejected then non parametric test was used. Statistical tests were applied as follows:
1. Quantitative variables were compared using Kruskal Wallis Test(as the data sets were not normally distributed) between the Milan category.
2. Qualitative variables were associated using Fisher’s exact test.
3. Inter-rater kappa agreement was used to assess the strength of agreement between observers.
A p value of <0.05 was considered statistically significant.
The data was entered in MS Excel spreadsheet and analysis was done using Statistical Package for Social Sciences (SPSS) version 21.0.
This is a cross sectional study of application of Milan system for reporting salivary gland lesions, which was conducted at St. Stephen‘s hospital, Delhi. During January 2013 to Dec 2019, a total of 44 cases of salivary gland lesions were included in the study. Detailed cytological features were evaluated in each case. The cases were identified on the basis of clinical features i.e. complaints of swelling including size and site of swelling
1. The youngest patient in the study was 16 years old and the oldest was 75 years old.
2. Maximum number of patients were in age group 51-60 years (22.73%) followed by more than 60 years (20.45%).
3. Mean age of the cases at presentation was 45.93 years with standard deviation of ± 16. (Table 2 and Chart 1)
1. In our study, Males dominated the study group i.e. 29 cases (65.91%), when compared to females i.e. 15 cases (34.09%). (Table 3 and Chart 2)
2. The male to female ratio is 1.93:1. (Table 3 and Chart 2)
Out of the 44 patients, 22 patients presented with complaint of parotid swelling and 13 patients with complaint of submandibular swelling. So, parotid region (50 %) was the most common site of presentation followed by submandibular region (29.55%).(Table 4 and Chart 3)
In our study, the size of the lesion ranged from 1cm -10cm with a mean size of 3.24cm. (Table 5 and Chart 4)
Out of 44 lesions, 35 lesions (79.55%) were firm in consistency followed by 7 lesions (15.91%) with soft consistency and 2 (4.55%) with hard consistence. (Table 6 and Chart 5)
Out of 44 cases, most of the lesions (25/44) were mobile in nature with an incidence of 56.82% followed by immobile lesions (43.18%). (Table 7 and Chart 6)
Blood tinged aspirate was obtained in most of the cases (38/44) with an incidence of 86.36%. (Table 8 and Chart 7)
Out of 44 cases, maximum cases 22 (50%) were classified under IVA i.e. Benign Neoplasm followed by 27.27% to 29.5% cases classified under II i.e. Non-neoplastic. Category IVB i.e., SUMP seen in almost 2.27- 4.55% of the cases. (Table 9 and Chart 8)
1. A total of 44 cases were categorized according to MSRSGC by two cytopathologists i.e. observer 1 and observer 2.
2. At the end of the review diagnostic categories assigned by the observers were tallied and concordance level among them was calculated using Cohen’s Kappa statistics (κ).
3. The κ score was calculated as 0.861, that falls in the almost perfect agreement range. (Table 10)
4. The findings were statistically significant (p = < .0001). (Table 10)
1. The inter-observer variability was calculated between observer 1 and Pathology Resident
2. The κ score was calculated as 0.896, which represents an almost perfect agreement according to Cohen’s kappa. (Table11)
3. The findings were statistically significant (p = < .0001). (Table 11)
1. The inter-observer variability was calculated between observer 2 and Pathology Resident.
2. The κ score was calculated as 0.965, which represents an almost perfect agreement. 3. The findings were statistically significant (p = < .0001). (Table 12)
1. The table shows the chance correct agreement (Cohen’s kappa statistics) for each category.
2. Using the scale of Landis and Koch, an almost perfect agreement was seen between observers for categories II, IVA,V,VI.
3. Agreement ranged from substantial to almost perfect for category I.
4. Category IVB showed variable results, with substantial agreements between observer 1 vs. resident to no agreement between observer 1vs 2 and observer 2 vs. resident. (Table 13)
ND- Non diagnostic, NN- Non neoplastic, BN - Benign Neoplasm, SUMP - Salivary gland neoplasm of uncertain malignant potential, SM - Suspicious of malignancy, M – Malignant (Figures 1-8).
Salivary gland tumors represent about 6% of head and neck neoplasm and about 0.5% of all the malignancies in humans. [1] Recommendations and guidelines have been published for a number of body sites including the thyroid, [14] pancreaticobiliary system, [15] urinary tract [16] and respiratory system. [17] Recently a reporting system for the cytology specimens from the salivary glands has been published. [3] This system proposes categorical diagnoses analogous to those used in the Bethesda System for reporting thyroid gland cytopathology. This reporting system is intended to improve communication between reporting cytopathologists and clinicians both within an institution and to external institutions. In our study, 44 cases of salivary gland lesion FNAC reported during January 2013 to Dec 2019 at St. Stephen’s hospital, Delhi were reviewed and categorized according to Milan system for reporting salivary gland cytopathology. An attempt was made to assess the inter-observer variability to determine the applicability of this reporting system.
Demographic Data in our Study
The mean age of patients in our study was 45.9 years. This is in concurrence with the study of Rohilla et al [18], where the mean age was 43.7 years. Similarly, Amita K et al.[19] , Chen et al [20] also observed mean age of their study sample as 48.3 years and 53 years respectively, which is in close concordance with our results.
In our study, the incidence of salivary gland lesions in male was (65.91%) and female (34.09%). This is in concurrence with the study done by Karuna et al.[21] who observed the incidence of 69.52% and 30.48% in males and females respectively.
Clinical Manifestations:
Parotid was the most common site to be involved (N=22; 50 %) followed by Submandibular gland (N=13; 29.55%). Similarly, Kala C et al.[22] also observed an incidence of 57.7% and 27.3% in parotid and submandibular gland respectively, which is in close concordance with our study. Most of the studies like Karuna et al.[21] ,Katta R et al [23] , Amita K et al.[19] ,Vishwanathan et al [24] and Rohilla et al [18] also observed parotid gland as the most common site of involvement in their studies.
Categorization and Inter-observer Variability (IOV)
Out of 44 cases, maximum cases 22 (50%) were classified under IVA i.e. Benign Neoplasm followed by 27.27% to 29.5% cases classified under II i.e. Non neoplastic. Category IVB i.e. SUMP seen in almost 2.27-4.55% of the cases.(Refer to Table 9 and Chart 8)
Discordant Cases Observer 1 vs Observer 2
IOV was seen in 4 / 44 cases, in which two cases were reported as SUMP by observer 1 and observer 2 categorized them as NN and BN respectively. Similarly two cases reported as ND by observer 2 has been categorized under NN and BN by observer 1.
Observer 1 vs Pathology Resident
IOV was observed in 3 / 44 cases. One case reported as SUMP by observer 1, the same has been reported as BN by resident. Similarly another two cases were reported as BN and NN by observer 1,that has been categorized under ND by resident.
Observer 2 vs Pathology Resident
IOV was seen in 1 / 44 cases. One case was categorized under NN by observer 2 and the same has been reported as SUMP by resident.
In our study, ND category represents 4.55-9.09% of cases. This is somewhat similar to incidence reported in other studies by Wei et al.,[4] (2%), Karuna et al.[21] (2.08%), Rohilla et al.[18] (2.2%), Park et al. [25] (9.9%), Savant et al. [26] (9.2%), Viswanathan et al.[24] (12%) and Song et al.[27] (13.5%). We reported 29.55% cases under NN category, with most frequent diagnosis as sialadenosis followed by few cases of sialadenitis. This is in concurrence with study by Chen et al. [20] ,Song et al.[27], Wu et al.[28] and Karuna et al.[21] where they reported 23.6%, 21.5%,16.1% and 16.20% cases respectively. The majority of our cases were classified as BN (50%). This is consistent with the frequency reported in the literature by various studies like Karuna et al.[21] reported 51.3%, Chen et al [20] reported 45.6% , Wu et al [28] reported 37.2% ,Song et al.,[27] reported 34.9%. Pleomorphic adenoma (34%) and Warthin’s tumours (15%) were the most commonly diagnosed entities under BN category. SUMP represents almost 2.27- 4.55% of the cases in our study. This is in concurrence with the study of Karuna et al.[20], in which they reported 4.76% cases under SUMP category. Similar findings were recorded in study conducted by Chen et al [20] (4.6%), Wu et al [28] (5.9%) and Dubucs C et al. [29] (5%).We could not find any case to be categorized under category III i.e. Atypia of undetermined significance (AUS) in duration of our study period. But various studies like Karuna et al.[21], Chen et al [20] , Song et al.[27] and Wu et al [28] reported cases under AUS category with a variable incidence of 2.85%, 3.2 %, 10.8% and 37.2% respectively.In our study, SM was seldomly observed in 4.55% cases and Malignant (M) in 6.82% cases, which is consistent with the study done by Karuna et al.[21] where they reported 4.76% and 16.20% of cases under SM and M respectively. Since the release of MSRSGC in March 2018, multiple studies have been put forth in establishing the ROM (risk of malignancy) for each of the 6 categories, correlating classification with outcomes and indicating the utility of this system in day to day practice.[18,24,27] Only a few studies explored the inter-observer variability in MSRSGC. We calculated the inter-observer variability between all the three observers by using cohen’s kappa statistics (κ score) & interpretation of the results was done using scale of Landis and Koch [12,13] , which can range between -1 to +1. The overall kappa score between two cytopathologists (Observer 1 vs 2),Observer 1 vs resident and observer 2 vs resident was calculated as 0.861, 0.896 and 0.965 respectively , which showed an almost perfect agreement according to Landis and Koch scale. The data was statistically significant with a p value of < 0.0001. This is in concurrence with the study conducted by Garg et al.[32] where they noted an almost perfect agreement between consultant and JR, with a kappa score of 0.812 . In a study conducted by Vishwanathan et al [33] inter-observer variability of indeterminate category in MSRSGC was calculated between cytopathologists & cytotechnologists. They calculated inter-observer agreement using a slightly modified version of previously published agreement scoring system described by Griffith et al.[34] into uniform, majority, divided, minimal or no agreement with an overall kappa score of 0.314 for the entire group, and 0.091-0.254 for cytotechnologists & 0.403-0.539 in case of cytopathologists. Furthermore, in our study kappa statistics for individual categories under Milan system were also calculated. There was considerable variability in agreement between categories. Some individual categories within the Milan system had excellent kappa scores within categories V (SM) and VI (M) having perfect agreement (kappa score = 1). SUMP category showed highly variable results, we found substantial agreement between observer 1 vs resident with a kappa score of 0.656. (Table 14)
Above mentioned studies had been done in various parts of the world, where they concluded the results with overall kappa score between the observers and few studies calculated kappa score only for indeterminate categories like AUS and SUMP. Whereas in our study, we have reported inter-observer variability in all the categories of Milan system and calculated three different values of kappa score with each value specific for variability between two observers.
It is to be noted that, before conducting the current study, the MSRSGC was not implemented at our institution; therefore, awareness among observers of this system was limited, thus a brief explanation of each category under Milan system was given before categorization.
MSRSGC has the potential to standardize salivary gland FNA diagnoses. This system can be used with good reproducibility between observers with variable cytopathology experience. Placing lesions in categories using MSRSGC can result in optimal management of discordant cases without using a specific diagnosis. Hence we recommend the use of Milan system for reporting salivary gland cytopathology.
Strengths of the Study
1. This is a cross sectional study.
2. Three different inter-observer variability agreements among three observers were calculated with three different kappa score values.
Limitations of the Study
1. Small number of sample size, due to study limited to one institution
2. Retrospective nature of the study.
3. No Histological correlation was done, so risk of malignancy was not calculated.
Recommendations:
1. The rate of inadequate samples can be reduced by the use of Diff - Quik staining, where the adequacy of the sample can be checked at the time of procedure.
2. Defining and implementing adequacy criteria for salivary gland FNA cases may help in standardizing pathologist interpretations of samples.
3. The ongoing discovery of genetic alterations and judicious use of assays such as immunohistochemistry and molecular testing will continue to increase the specificity of salivary gland FNA.
4. Studies of larger numbers of salivary gland fine needle aspirates will be necessary to fully establish the level of inter-observer agreement that can be expected while using MSRSGC.
Journal of Cytology & Histology received 2476 citations as per Google Scholar report