Introduction

Around 35% of colorectal cancer patients who undergo curative surgical treatment will develop recurrent disease, the majority of which occurring during the first 3 years after surgery [1]. Relapse mainly occurs in the liver (20%) and lungs (5–10%), while approximately 5–15% of patients will develop a locoregional recurrence [2]. These local recurrences are associated with significant morbidity and impaired quality of life. Whereas locally recurrent rectal cancer has long been regarded as a rarely curable disease, patients are now increasingly being considered for curative salvage surgery, often combined with (neo)adjuvant chemo and/or radiation treatment. The chances of cure after resection are highest when all resection margins are microscopically free of tumour [3]. To increase the chance of cure it is therefore crucial to diagnose local recurrences when still in an early stage. At present, it is not clear whether imaging is beneficial during the surveillance of patients after rectal cancer surgery and trials to establish the role of imaging are ongoing [47]. If, however, during surveillance a patient is suspected of a local recurrence based on clinical symptoms and/or rising carcinoembryonic antigen (CEA) levels, CT is the first investigation of choice to confirm or rule out the presence of a local or distant relapse [4, 7, 8]. A pelvic mass that enlarges on consecutive postoperative CT studies is highly suspicious for a local recurrence, although the diagnosis is not always easy to make [9, 10]. Often the CT findings are equivocal and in those cases 18F-Fluorodeoxygenase Positron Emission Tomography (FDG-PET) or combined PET/CT have proven beneficial in identifying local tumour re-growth [10, 11]. False positive FDG-PET findings may, however, occur in areas of postoperative inflammatory scar tissue, in the case of displaced small bowel loops and because of the proximity of the urinary bladder. Moreover, FDG-PET lacks accuracy in the evaluation of a recurrence of the mucinous tumour type and is less useful to evaluate tumour resectability due to its insufficient spatial resolution [11, 12]. MRI on the other hand has proven valuable for assessing the resectability of a diagnosed local recurrence of rectal cancer [13], but its role in the detection of recurrent tumour is not well established. The main difficulty with MRI is the identification of a small growing tumour in an area of fibrotic scar tissue [1416]. Diffusion-weighted MR imaging (DWI) is a promising technique for the detection of small tumour volumes and its benefit has also been shown in pelvic tumours [1719]. The role of DWI in the detection of locally recurrent rectal cancer has, however, not been investigated yet. As an addition to standard anatomical MRI, DWI could be useful to increase the diagnostic performance for detection of locoregional tumour recurrences [20]. Furthermore, previous studies have suggested that image fusion of anatomical MRI and DWI can contribute to a better detection of both primary malignancies and recurrent disease, though again its role in rectal cancer is not clear [20, 21].

The aim of this study therefore is to evaluate the accuracy of standard pelvic MRI for the diagnosis of locoregional tumour recurrence in patients with a clinical suspicion of a recurrence after curative rectal cancer surgery and to evaluate whether the addition of DWI and/or image fusion of standard MRI and DWI improves its performance.

Materials and methods

Patients

This study retrospectively evaluated 42 consecutive patients who underwent MR imaging for the evaluation of the presence of a locally recurrent rectal cancer between June 2007 and April 2010. Because of the retrospective nature of the study, informed consent was waived. The baseline patient characteristics are given in Table 1. Twenty-six patients were male and 16 were female. Median age was 66 years (range 22–87). All patients had previously undergone curative-intent surgical treatment for primary rectal cancer. Patients were referred for MRI when during follow-up there was a clinical suspicion for a locoregional tumour recurrence based on symptoms (pain, faecal blood loss, changed bowel habits), clinical examination, increased serum CEA and/or the presence of metachronous distant tumour lesions. Eleven patients first underwent a CT examination, before being referred for MRI. Nineteen patients were diagnosed with a locally recurrent rectal tumour, while in 23 patients no evidence of recurrence was found. The locations of recurrence were as follows: anastomotic (n = 2), presacral (n = 3), lateral pelvic nodes (n = 3), pelvic floor (n = 3), local excision scar (n = 5), rectal lumen (n = 1), peri-prostatic (n = 1), and gluteus muscle (n = 1). The median time interval between surgical treatment of the primary rectal tumour and the follow-up MRI for local recurrence evaluation was 40 months (range 3–320). Median follow-up between the MRI and present was 20 months (range 4–42).

Table 1 Baseline characteristics of study patients

MR imaging

Patients did not receive bowel preparation or spasmolytics. MR imaging was performed at 1.5-T (Intera; Philips Medical Systems, Best, The Netherlands). For signal reception, a phased-array body coil was used. The standard rectal MRI protocol consisted of 2D T2-weighted (T2W) fast spin-echo (FSE) sequences in three planes; sagittal, axial and coronal (TR/TE 8456/130 msec, 90° flip angle, 25 echotrain length, 6 NSA, 0.78 × 1.14 × 3.00 mm acquisition voxel size, 30 slices, 6.03 min acquisition time). Since 2006, a DWI sequence had been implemented as part of the standard rectal MRI protocol. The DWI sequence was an axial diffusion-weighted sequence with background body signal suppression (DWIBS [22], b-values 0,500,1000 s/mm2, TR/TE 4829/70 msec, EPI factor 53, 4 NSA, 2.50 × 3.11 × 5.00 mm acquisition voxel size, 50 slices, 10.37 min acquisition time). The axial T2-weighted and DWI sequences were obtained in identical planes. These were routinely angled perpendicular to the rectal lumen. When no remaining rectal lumen was present (after abdominoperineal or extended resection) or in case of a pelvic mass located outside the rectal lumen, the axial sequences were angled perpendicular to the interface between the local area of suspicion and the closest structure or organ. ADC maps including all three b-values were automatically generated in greyscale at the operating system.

Image fusion of T2-weighted MRI and DWI

MR images were exported in DICOM format and transferred to a Macintosh computer (iMac, Apple, Cupertino, CA, USA). Fusion images of the axial T2-weighted and b1000 diffusion-weighted images were generated using OsiriX Medical Imaging Software, an open-source post-processing software tool [23]. The field of view and slice level of the diffusion images were automatically adjusted to exactly match the T2-weighted MRI. The DWI images were then converted to a red-to-yellow colour scale and overlaid on the T2-weighted images.

Image evaluation

The MR images were independently analysed by two readers from an expert referral center: a pelvic MR expert with 13 years expertise in reading MRI of rectal cancer (RGHBT) and a junior GI radiologist with 3 years specific expertise in pelvic MRI (VCC). The readers were aware of the treatment, histological stage and tumour type of the primary rectal cancer, but were blinded to each other’s scoring results and to whether or not previous CT imaging for the recurrence was present. They were also blinded to the results from endoscopy, other imaging, surgery and histopathology. This study design was chosen to reproduce the clinical standard of practice, where radiologists are exposed to a patient’s history of the primary tumour. Image evaluation consisted of three separate reading protocols conducted within the same reading session: (a) the readers first evaluated the T2-weighted FSE images only (further referred to as ‘standard MRI’), (b) immediately thereafter the scoring was repeated after the b1000 diffusion images were added to the standard MRI (further referred to as ‘standard MRI+DWI’). The low b-value diffusion images and the ADC map were also at the readers’ disposal. Finally (c) the fusion images of T2-weighted MRI+DWI were assessed. Patients were evaluated in random order. The readers scored the likelihood of a local tumour recurrence using a 5-point confidence level scale (0 = definitely absent, 1 = probably absent, 2 = possibly absent/present, 3 = probably present, 4 = definitely present). On T2-weighted MRI, focal areas showing a mass effect, a signal intensity isointense to muscle and/or a nodular or infiltrative border were considered highly suspicious for tumour recurrence. On the high b-value (b1000) DWI, focal areas of high signal intensity, corresponding with possible tumour locations on standard MRI were considered a criterion for tumour recurrence (Fig. 1). On the fusion images focal areas of hyperintensity (yellow), corresponding with possible tumour on T2-weighted MRI were considered suspicious for recurrent tumour. When a recurrent tumour was suspected, the largest tumour diameter was measured on T2-weighted MRI.

Fig. 1
figure 1

T2-weighted, b1000 diffusion-weighted and T2W+DWI fusion images of a male patient with a local tumour recurrence situated in the left gluteus muscle (arrows). On T2-weighted MRI (a), the signal of the tumour is isointense compared to the muscles. On the DWI (b) and fusion images (c) the tumour is visible as a lesion with high signal intensity, compared to the suppressed signal of the surrounding structures. Note the necrotic changes in the left internal obturator muscle (arrowheads), resulting in slightly increased signal intensity on the DWI and fusion images

Standard reference

The presence of a local recurrence (n = 19) was histologically confirmed in 16 patients (surgical resection in 11 and positive biopsy of the suspected tumour area in 5 patients). In 3 patients the diagnosis of a local recurrence was made based on concordant positive FDG-PET/CT findings (i.e. positive FDG uptake at the exact location suspected for tumour on MRI) as well as follow-up MRI and/or FDG-PET/CT examinations. Furthermore, an inguinal lymph node metastasis was confirmed by biopsy in 1 patient. These 3 patients did not undergo surgical treatment of their recurrence, because of unresectable disease. The absence of a local recurrence (n = 23) was confirmed by a combination of the following: no abnormal findings on corresponding FDG/PET-CT (n = 8), no abnormal findings on colonoscopy and/or negative biopsy results (n = 4) and/or no signs of tumour recurrence on consecutive imaging examinations (CT, MRI and/or FDG-PET/CT) during a follow-up period of at least 1 year (n = 22). Median follow-up time for the patients without recurrence was 23 months (range 14–41).

Statistical analysis

Statistical analyses were performed using the Statistical Package for the Social Sciences (SPSS, version 16.0, Inc., Chicago, IL). Receiver operator characteristic (ROC) curve analyses were performed to evaluate the diagnostic performance for (a) standard MRI only, (b) standard MRI+DWI and (c) fused T2-weighted+DWI images. Corresponding areas under the ROC-curve (AUC), sensitivities, specificities, positive predictive values (PPV) and negative predictive values (NPV) with 95% confidence intervals (CI) were calculated. For these analyses it had been decided at the start of the study to dichotomise the confidence level scores between 1 (probably absent) and 2 (possibly absent/present). Differences in diagnostic performance between the 3 reading strategies were analysed by comparing the ROC-curves according to the method described by DeLong et al [24]. Weighted kappa-values with quadratic kappa weighting (0.00–0.20 poor, 0.21–0.40 fair, 0.41–0.60 moderate, 0.61–0.80 good and 0.81–1.00 excellent interobserver agreement) were calculated to determine interobserver agreement between the two readers [25]. P-values <0.05 were considered statistically significant.

Results

Diagnostic predictive values including 95% confidence intervals are provided in Table 2 for the three reading methods. The ROC-curves are displayed in Fig. 2.

Table 2 Diagnostic performance for identification of a local tumour recurrence
Fig. 2
figure 2

Receiver operator characteristics curves and areas under the ROC-curve (AUC) for reader 1 (R1) and reader 2 (R2) for detection of locally recurrent rectal cancer by using (1) standard MRI, (2) standard MRI+DWI, and (3) T2-weighted MRI+DWI fusion images, respectively. There were no significant differences in AUC between the three reading designs for either of the two readers

Standard MRI

When using only standard MRI, reader 1 achieved an AUC of 0.99, sensitivity 100% and specificity 83%. For reader 2 these figures were 0.87, 84% and 74%.

Added value of DWI and fusion images

After addition of DWI, reader 1 achieved an AUC of 0.98, sensitivity 100% and specificity 91%. Compared to standard MRI (AUC 0.99), the difference in AUC was not significant (p = 0.78). For reader 2, AUC improved from 0.87 on standard MRI to 0.91 after addition of DWI (p = 0.09). He achieved a sensitivity of 89% and specificity of 83%. The added value of DWI is illustrated in Fig. 3, in which a false positive finding on T2-weighted MRI was corrected after addition of DWI. The addition of fusion images did not improve the diagnostic performance as compared to standard MRI+DWI (p = 0.41 for reader 1 and p = 1.00 for reader 2).

Fig. 3
figure 3

Axial T2-weighted, b1000 diffusion-weighted and T2W+DWI fusion images of a male patient who had previously undergone a low anterior resection. On T2-weighted MRI (a) there is an area of intermediate signal intensity (arrows) that was erroneously interpreted as a tumour recurrence by the first reader. On the DWI (b) and fusion images (c), there is no focal high signal intensity in this area (circle) and the reader adjusted his score. Follow-up imaging revealed no tumour recurrence

Interobserver agreement

Interobserver agreement for standard MRI was good (κ 0.69). After the addition of DWI, agreement was excellent (κ 0.82). Agreement for the T2W+DWI fusion images was excellent (κ 0.84).

Lesion size

In the 19 patients with recurrence, median lesion size (largest tumour diameter) was 37 mm (range 13–81 mm). Seven patients had a ‘small’ recurrence (<20 mm). Reader 1 detected all 7 small recurrences, both with standard MRI and after addition of DWI. Reader 2 missed one small recurrence on standard MRI, but detected all 7 after addition of DWI (Fig. 4).

Fig. 4
figure 4

Axial T2-weighted, b1000 diffusion-weighted and T2W+DWI fusion images of a male patient who had previously undergone transanal endoscopic microsurgery. On T2-weighted MRI (a) there is a small area of intermediate signal intensity (arrow), that was overlooked by reader 2. On the DWI (b) and fusion images (c), there is a clear focal area of high signal intensity and the reader now identified the recurrence, which was later surgically removed and histologically confirmed

Eleven patients had undergone CT imaging before MRI. In Table 3 and Fig. 5 the discrepancies between the CT findings and MRI scorings are illustrated.

Table 3 Discrepancies between CT findings and MRI
Fig. 5
figure 5

CT, T2-weighted and T2W+DWI fusion images of a male patient who had previously undergone an abdominoperineal resection for a distally located rectal tumour. On CT (a) there is a presacrally located soft tissue mass. On CT it is not possible to discriminate between postoperative scar tissue and recurrent tumour. On the corresponding MRI (b) there is an area of hypointense signal intensity indicating fibrosis (arrowheads). Located anterior to this fibrosis, there is a bowel loop (*) This bowel loop was not opacified by the oral contrast on CT, because it was located distally from the patient’s ileostomy. On the fusion images (c) there are no areas of high signal intensity, suggestive of recurrent tumour. Follow-up imaging revealed no tumour recurrence

Discussion

The aim of this study was to assess the performance of MRI for the diagnosis of locally recurrent rectal cancer in patients with a clinical suspicion of a recurrence after rectal cancer surgery and to evaluate whether the addition of diffusion-weighted imaging and/or image fusion of standard MRI and DWI improves its performance.

Our results indicate that standard MRI on its own has a high accuracy for the diagnosis of recurrent tumour in this specific group of patients. The addition of DWI does not significantly improve its performance. The high accuracy using only standard MRI could partly be due to the relatively high levels of expertise of the readers in this study. Furthermore, some patients had relatively large tumours, which were already correctly identified on morphological MR imaging. The benefit of DWI might be more pronounced for the detection of smaller tumours which are more difficult to detect within the fibrotic scar tissue. Results from a recent study by Syk et al. showed that mainly anastomotic recurrences—which tend to be smaller in size—are missed with imaging, again suggesting that the benefit from additional functional imaging may be higher in the detection of these small tumours [26]. In our study we indeed observed that DWI was helpful in such a case: the less experienced radiologist detected one of the small recurrences on MRI+DWI which he had overlooked before (Fig. 4). Larger patient studies are, however, required to further investigate and confirm the potential role of DWI in small tumours.

Of interest was the fact that adding DWI improved the interobserver agreement and reduced the number of false positives. DWI thus seems to increase the confidence of radiologists to rule out the presence of a recurrence (Fig. 3). This a relevant finding, as it is known from previous studies that standard MRI generally tends to overestimate the presence of tumour within areas of fibrotic (postoperative) scar tissue [1416, 27]. De Lange et al. analysed the signal intensities of areas suspected for local tumour recurrence on T2-weighted MRI and reported equally low signal intensities for areas of desmoplastic reaction containing tumour and areas of desmoplasia only [27]. It is especially in these cases that the combination of the morphological information from MRI and the functional information from DWI can be beneficial in the diagnosis of recurrent tumour growth. Fibrotic tissue typically has a low cellular density and a large interstitial space, which results in low signal intensity on high b-value (b1000) diffusion images. Conversely, tumoural areas have a relatively high cellular density, which will result in high signal on DWI [28]. Our findings are in concordance with a previous report from Nishie et al, who compared standard MRI to DWI and fusion images for the diagnosis of pelvic tumour recurrences in a group of 28 patients with a variety of primary pelvic malignancies [20]. They found the fusion images of T2-weighted MRI and DWI to perform the best with an AUC of 0.95. The main gain from the addition of DWI was the improved specificity from 47% on standard to MRI to 72% with DWI, indicating that false positivity could be reduced by more than 20%.

As compared to the separate evaluation of the standard MRI and diffusion images, the addition of image fusion did not improve the performance of the readers in our study. Both readers, however, experienced the fusion images to be helpful for a better understanding of the anatomical correlation between standard MRI and DWI. MRI is presently mainly used as a pre-operative roadmap for surgeons to determine the resectability of recurrence in patients who are potential candidates for curative salvage surgery [14]. In this regard, image fusion of standard MRI and DWI might be beneficial and should be addressed by further studies.

In our study, 11 patients had undergone CT imaging prior to MRI. In eight (73%) cases CT showed equivocal findings and patients had to be referred for further imaging. The readers were not aware of the fact that a CT had been performed previous to the MRI, nor were any of the CT findings revealed. In this study setting, all 8 patients were correctly diagnosed by both readers, both on standard MRI and on MRI plus DWI, suggesting that MRI has superior sensitivity and specificity compared to CT. The limitation of CT remains its inferior soft tissue contrast as compared to MRI. Previous studies that have compared CT and MRI for evaluation of local recurrences have reported similar superiority of MRI, both for the detection of recurrent tumour in this specific group of patients as well as for the assessment of tumour resectability [29, 30]. In the current study, MRI without contrast was used. Some authors have suggested the use of dynamic contrast enhanced (DCE) Gadolinium sequences for a better diagnosis of recurrent tumour [31, 32]. Results are, however, somewhat conflicting as others have shown little improvement for DCE compared to standard morphological MRI [33].

Our study is limited because of its retrospective nature. Furthermore, we only observed a trend towards an improved performance for MRI+DWI, which could partly be due to the small sample size and the relatively high level of experience of both readers in our study. The results of our study are, however, promising and therefore warrant further prospective studies to confirm the value of MRI+DWI in larger patient groups and in a general clinical setting.

In conclusion, this study shows that standard MRI on its own has a high accuracy for the diagnosis of locally recurrent rectal cancer. Addition of DWI does not significantly improve its performance. However, addition of DWI might help radiologists to more confidently rule out the presence of a recurrence. Moreover, the use of DWI can increase interobserver agreement. MRI is therefore promising as a tool to diagnose local recurrences after rectal cancer surgery in the group of patients with a high clinical suspicion for recurrence. Fusion images of MRI+DWI do not further improve the diagnostic accuracy.