Original research | Open | Published:
Impact of point spread function modelling and time of flight on FDG uptake measurements in lung lesions using alternative filtering strategies
EJNMMI Physicsvolume 1, Article number: 99 (2014)
The use of maximum standardised uptake value (SUVmax) is commonplace in oncology positron emission tomography (PET). Point spread function (PSF) modelling and time-of-flight (TOF) reconstructions have a significant impact on SUVmax, presenting a challenge for centres with defined protocols for lesion classification based on SUVmax thresholds. This has perhaps led to the slow adoption of these reconstructions. This work evaluated the impact of PSF and/or TOF reconstructions on SUVmax, SUVpeak and total lesion glycolysis (TLG) under two different schemes of post-filtering.
Post-filters to match voxel variance or SUVmax were determined using a NEMA NU-2 phantom. Images from 68 consecutive lung cancer patients were reconstructed with the standard iterative algorithm along with TOF; PSF modelling - Siemens HD·PET (HD); and combined PSF modelling and TOF - Siemens ultraHD·PET (UHD) with the two post-filter sets. SUVmax, SUVpeak, TLG and signal-to-noise ratio of tumour relative to liver (SNR(T-L)) were measured in 74 lesions for each reconstruction. Relative differences in uptake measures were calculated, and the clinical impact of any changes was assessed using published guidelines and local practice.
When matching voxel variance, SUVmax increased substantially (mean increase +32% and +49% for HD and UHD, respectively), potentially impacting outcome in the majority of patients. Increases in SUVpeak were less notable (mean increase +17% and +23% for HD and UHD, respectively). Increases with TOF alone were far less for both measures. Mean changes to TLG were <10% for all algorithms for either set of post-filters. SNR(T-L) were greater than ordered subset expectation maximisation (OSEM) in all reconstructions using both post-filtering sets.
Matching image voxel variance with PSF and/or TOF reconstructions, particularly with PSF modelling and in small lesions, resulted in considerable increases in SUVmax, inhibiting the use of defined protocols for lesion classification based on SUVmax. However, reduced partial volume effects may increase lesion detectability. Matching SUVmax in phantoms translated well to patient studies for PSF reconstruction but less well with TOF, where a small positive bias was observed in patient images. Matching SUVmax significantly reduced voxel variance and potential variability of uptake measures. Finally, TLG may be less sensitive to reconstruction methods compared with either SUVmax or SUVpeak.
[18F]2-Fluoro-2-deoxy-d-glucose (FDG) positron emission tomography (PET) has been shown to play a key role in the management of patients with non-small cell lung cancer in terms of staging and prognosis - and monitoring response to therapy . In these applications, the uptake of FDG expressed as standardised uptake value (SUV) is of key importance, with SUVmax being the most commonly reported measure . The use of SUVmax for discrimination between benign and malignancy for soft tissue masses and lymph nodes has been demonstrated for lung cancer patients , and changes in SUVmax used as an indicator of response to therapy .
While the use of SUVmax is commonplace, it is known to be sensitive to both reconstruction parameters  and the amount of statistical image noise, leading to poorer test-retest consistency relative to other SUV-based metrics ,. Consequently, alternative metrics such as SUVpeak  and total lesion glycolysis (TLG), the product of SUVmean and metabolic tumour volume derived from the PET images, have been suggested for use, particularly in monitoring response to therapy ,. Recently, TLG has also been shown to offer superior prognostic information than SUVmax -.
In recent years, there have been significant advances in iterative image reconstruction algorithms and scanner hardware. Consequently, reconstruction algorithms that include point spread function (PSF) modelling , and time of flight (TOF)  have become commercially available on PET/CT scanners, with TOF also available on PET/MR .
The use of PSF modelling, with and without TOF, has been shown to improve signal-to-noise ratio (SNR) - and lesion detectability - partly through decreasing voxel variance. However, the implementation of PSF modelling, both within projection space and image space, from different manufacturers and also academic institutions has been shown to produce Gibbs artefacts ,- (Nick Vennart, personal communication). In patient imaging, the Gibbs artefact, combined with reduced partial volume effects, has a significant impact on SUVmax -. This is particularly evident with minimal or no post-reconstruction filtering, which has been shown in phantom studies with numerical observers to provide greater lesion detectability -. Changes to SUVmax as a consequence of PSF modelling present a challenge as changes to defined local practice for reporting may be required such as changing the thresholds used for the discrimination of malignancy. The scanner used in this study has been part of a multi-site network of scanners for routine FDG oncology imaging since 2009. SUVmax is the reported uptake metric, and the consensus amongst local reporting clinicians within the network is that lesions with SUVmax > 5.0 are considered highly suspicious of malignant disease.
It is necessary, in practice, to smooth clinical images to provide image quality that is deemed acceptable for clinical reporting. This degrades the spatial resolution but increases signal to noise. The degree of smoothing applied at any given centre is heavily influenced by the experience and personal preferences of the reporting clinicians, informed by the advice of physicists providing scientific support. Where several PET scanners serve the same patient population, it is also advantageous to match imaging performance across the network in terms of visual image quality and quantitative characteristics.
A trade-off curve of signal enhancement versus noise reduction when using PSF and/or TOF algorithms can be established by applying a range of reconstruction post-filters. It has been demonstrated that it is possible to match SUVmax from PSF-based reconstruction with traditional non-PSF algorithms by applying a particular post-filter. Lasnon et al.  showed that a 7.0-mm full-width-half-maximum (FWHM) post-filter with PSF reconstruction gave comparable recovery coefficients in phantom data to non-PSF reconstructions and brought the recovery coefficients in line with European recommendations . Another study proposed the application of a post-filter for the purpose of quantification . This study also demonstrated that despite a spatially dependent PSF, this approach of using a single post-filter choice was adequate for all lesions irrespective of their location in the field of view. The application of a relatively broad post-filter to PSF modelling images may seem counterintuitive as it will undo the improvements in partial volume effect, but there are likely to be other benefits that have not been reported such as a reduction in voxel variance in the images.
Another potential solution may be to use alternative uptake metrics to SUVmax. One study  suggested that TLG may be more stable when comparing PSF to non-PSF reconstruction, but this study only assessed ten lung lesions. Another study  has suggested the move to SUVmean based upon a 50% isocontour of SUVmax. To our knowledge, there are currently no studies that investigate the impact of these reconstructions with PSF modelling and TOF on TLG and SUVpeak.
The primary aim of this study was to evaluate the impact of PSF modelling and TOF on SUVmax-based lesion classification as implemented at the local institution. This was performed using Siemens reconstruction software including implementations for TOF and PSF modelling (HD, UHD). Implementations of reconstruction algorithms can differ, and therefore, the results might be specific to HD and UHD; however, we feel it is likely that findings may be generalisable to other reconstruction implementations with similar philosophies. Any change in FDG uptake measurements across different reconstruction protocols can hopefully allow other centres to assess how such changes may impact their approaches to lesion classification. Two set criteria for post-filtering the images were assessed based upon characteristic locations on a signal enhancement versus noise reduction trade-off curve. These two points are 1) matching image noise (voxel variance) which was expected to enhance signal and 2) matching signal (SUVmax) which, based on previous studies ,, was anticipated to require greater levels of post-filtering and hence reduce image noise. This latter approach is aimed to be particularly relevant to centres that wish to maintain uptake quantification for practical purposes, which is particularly important in multi-site imaging networks. In addition, this work aimed to expand on the results of previous studies - with the addition of TOF, evaluation of other uptake metrics such as SUVpeak and TLG, and determining gains in SNR for the two strategies.
The PET scanner used in this study was a Siemens Biograph mCT with 64 slice CT (Siemens Medical Solutions, Erlangen, Germany). The scanner has a four-ring extended axial field of view of 21.6 cm (TrueV) and includes options for PSF modelling (Siemens HD·PET) and combined PSF modelling with TOF (Siemens ultraHD·PET) in the image reconstruction. Performance data for the scanner has been published previously .
A NEMA NU-2 image quality (IQ) phantom (PTW, Freiburg, Germany) was filled with [18F]FDG so that the background compartment and all six hot spheres had activity concentrations of 5.19 and 41.7 kBq/ml, respectively. This 8:1 contrast was chosen to mimic lung lesion contrast, which is generally high. In order to divide the data into ten replicate datasets, a gated 60-min list-mode acquisition was performed using an ECG simulator as the gating input. Each replicate image contained 30 million (±0.2%) net true coincidences as this was typical of the number of counts measured over the thorax in our standard patient acquisitions. Images were reconstructed using four methods: standard 3-D ordinary Poisson ordered subset expectation maximisation (OSEM) reconstruction; OSEM with TOF (TOF); OSEM with PSF modelling - Siemens HD·PET (HD); and OSEM with both PSF and TOF - Siemens ultraHD·PET (UHD). For non-TOF reconstructions, 3 iterations and 24 subsets (3i24s) were used, while for TOF reconstructions, 2 iterations and 21 subsets (2i21s) were used.
Two iterations were chosen for TOF reconstructions as TOF has been shown to provide faster convergence with comparable signal to noise achieved in fewer iterations than non-TOF ,, and it has been shown in published performance data for the scanner that one fewer iteration with TOF is optimal , providing similar background variability and marginally superior contrast recovery in smaller objects. However, it is not possible to exactly match the number of subsets for TOF and non-TOF reconstructions. All images were reconstructed into a 256 × 256 matrix with voxel sizes of 3.2 mm × 3.2 mm × 2.0 mm. As is routinely performed with patient data, a 5.0-mm FWHM Gaussian post-filter was applied to the OSEM images. The baseline parameters of 3 iterations and 24 subsets and 5.0-mm post-filter for OSEM reconstruction have been in routine use since the scanner was commissioned in 2009. These parameters were selected to align SUVmax quantification and voxel variance with other scanners in the local oncology imaging network.
A variety of post-filters with different kernel widths was applied to the TOF, HD and UHD images with kernel widths ranging from 0 to 10 mm FWHM in step sizes for 0.1 mm.
Twelve circular regions of interest (ROIs) of 37-mm diameter were placed in the phantom background over five separate slices (60 ROIs in total) of the IQ phantom image in accordance with the NEMA NU-2-2007 standard . For each image replicate, the average coefficient of variation (COV) over the 60 ROIs was calculated as
where σ k,R and μ k,R are the voxel standard deviation and mean, respectively, within ROI k and replicate R. The mean and standard deviation of COV R was determined across all ten replicate images. The OSEM 3i24s 5.0-mm post-filter image was used to compute the reference COV value. For the three other reconstruction methods, the post-filter that gave the smallest difference in COV, relative to the OSEM image, was determined.
SUVmax is the uptake measure used in our routine patient reports and so was the measure chosen to match across the reconstruction algorithms. To achieve this, SUVmax was measured in each hot sphere in the phantom for the OSEM images using a 3-D volume of interest, equal in diameter to each true sphere size and centred on the sphere. As with the COV matching, a post-filter was incremented in 0.1-mm steps on the other three reconstructions until the summed squared difference of SUVmax for the six hot spheres relative to those in the OSEM image was minimised.
FDG patient acquisitions
Retrospective data from 68 (33 males; mean [range] weight: 72.5 kg [40 to 136]; mean [range] body mass index: 26.3 kg/m2 [14.1 to 51.8]) consecutive routine oncology patients referred for assessment of single pulmonary nodule or staging of non-small cell lung cancer were included in this study. All data were fully anonymised before inclusion. Patients fasted for 6 h prior to the injection of FDG and were asked to drink at least 500 ml of water before the scan. Blood glucose was measured with permissible limits of 3.0 to 12.5 mmol/l. Patients with a body weight <100 kg were prescribed 350 MBq of [18F]FDG, while those with body weight >100 kg (two in this study) were prescribed 400 MBq. The mean [range] administered activity of [18F]FDG was 365.5 MBq [242.0 to 423.1]. It can be noted that the minimum dose administered is considerably below the prescribed activity - this was due to a patient arriving late and insufficient remaining activity in the stock vial. The mean [range] time was 64.3 [59 to 87] min from the time of injection to commencing the scan. Advice from the local ethics committee deemed that the use of retrospective anonymised patient data did not require formal ethical approval.
The PET acquisition was performed from eyes to mid-thigh for all patients, requiring six or seven bed positions. The acquisition time for each bed position was 2.5 min. Attenuation correction was performed using a non-contrast CT acquisition performed prior to the PET acquisition. Scatter and random corrections were applied to all images. All images were reconstructed with OSEM 3i24s and 5.0-mm post-filter as the reference, along with the phantom-determined TOF, HD and UHD protocols, which match either voxel COV or SUVmax.
All images were viewed and the uptake quantified using Siemens TrueD image display software (Siemens Medical Solutions, Erlangen, Germany). In each patient, a 3-cm-diameter spherical volume of interest (VOI) was placed within an area of uniform FDG distribution in the liver, and the COV of the voxels within the VOI was calculated. Three FDG uptake measurements were derived for each identified lesion within the lung: SUVmax, SUVpeak (as defined in the PET response criteria in solid tumours (PERCIST) protocol ) and TLG. SUV was normalised to patient body weight only. Volume delineation for TLG was performed using a 40% threshold of SUVmax (TLG-40). Recent meta-analyses , have highlighted several methods for volume delineation - either using percentage or absolute SUV thresholds. The choice of a percentage threshold in this study was based on a hypothesis that as the magnitude of the partial volume effect varied with different reconstructions, the impact on the tumour volume and SUVmean would be inversely related. This may result in a more stable value for the TLG. It should be noted that other methods of delineation are likely to produce alternative results. Lesion volume was measured on the OSEM image using a 40% threshold of SUVmax.
Signal to noise
It is difficult to estimate SNR directly in a lesion due to inhomogeneous uptake; therefore, we have adopted the use of the liver as a source for the background and noise measurement. This technique has been performed previously  and is considered a reasonable relative surrogate for SNR in the lesion. For lesions with SUVmax above the PERCIST threshold of 1.5 times the mean SUV in the liver VOI + 2 standard deviations of the voxels within the liver VOI , the signal-to-noise ratio of the tumour, relative to the liver, (SNR(T-L)) was calculated as
where the Tumour refers to SUVmax in the lung lesion, Liver is the mean SUV measured in the liver VOI and σ L is the standard deviation of voxel values measured in the liver VOI. This method allows comparison to other studies, which have used the same metric ,. SNR(T-L) of all qualifying lesions was determined for each reconstruction using the two filtering schemes of matched voxel COV and matched SUVmax. The gain in SNR(T-L) was expressed for the TOF, HD and UHD reconstructions as the ratio to the SNR(T-L) measurements from the standard OSEM images of the same patient.
Relative percentage differences of the uptake metrics relative to OSEM were expressed as mean with 95% confidence intervals. Bland-Altman analysis was also performed on the data. Relative changes of >25% for SUVmax and >30% for SUVpeak were considered clinically significant based upon EORTC  and PERCIST  guidelines respectively. In addition, hypothetical changes to patient management as a consequence of SUVmax based on local practice were recorded. Differences in voxel COV in the liver VOI and gains in SNR(T-L) were assessed using a paired t test with a p value <0.05 considered to be significant.
The FWHM of the post-filters obtained for matching voxel COV to OSEM 3i24s and a 5.0-mm post-filter were 4.4, 3.8 and 2.9 mm for TOF, HD and UHD, respectively. The FWHM of the post-filters obtained for matching SUVmax were 4.8, 6.6 and 6.5 mm for TOF, HD and UHD, respectively. To provide an illustration of the underlying impact of each algorithm, SUVmax, expressed as a percentage of the true activity concentration, and noise data are first shown with no post-filter in Table 1. Data are then presented with the two post-filter sets as described in Table 2. From the data, it is seen that there is considerable increase in SUVmax in the two smallest spheres with HD and UHD with matched voxel COV. The variability of SUVmax was greater in the two smallest spheres at matched voxel COV, particularly with HD and UHD; the positive bias in the larger spheres with OSEM and TOF at matched voxel COV is likely to be due to image voxel variance, while with HD and UHD at matched voxel COV, Gibbs artefacts are also expected to contribute. This can be seen in Figure 1, which shows profiles through the centre of the 37-, 22- and 13-mm spheres.
With post-filters to match SUVmax recovery, variability is comparable or less with HD and UHD compared with OSEM. To verify the cross-calibration between the dose calibrator and scanner, the activity concentration, averaged across the 60 background ROIs, was measured as 5.14 ± 0.1 kBq/ml.
Figure 2 shows images from a single representative female patient with a BMI of 37 kg/m2. The image has been cropped to show only the lung lesion and liver. Voxel COV within the liver VOI was 16.3%, 15.0%, 16.5% and 15.4% for OSEM, TOF, HD and UHD, respectively, with matched voxel COV post-filters and 13.5%, 10.8% and 7.95% for TOF, HD and UHD, respectively, with matched SUVmax post-filters. SUVmax for the lesion in the right lung was 5.4, 6.0, 8.2 and 10.1 for OSEM, TOF, HD and UHD, respectively, with matched noise post-filters and 5.2, 5.7 and 5.7 for TOF, HD and UHD, respectively, with matched SUVmax post-filters. The visual reduction in voxel variance within the liver is evident in the HD and UHD images with the matched SUVmax protocol.
Table 3 shows the voxel COV data measured in the VOI within the patient livers. There were no significant differences for the PSF and TOF-based reconstructions versus OSEM when using the matched voxel COV post-filters. As with the phantom data, significant reductions of voxel COV were measured for PSF and TOF-based reconstructions compared with OSEM using the post-filters to match SUVmax recovery. The mean measurements of voxel COV in the liver VOI for TOF, HD and UHD were 90%, 65% and 56%, respectively, of the value measured using OSEM.
FDG uptake measurements
Tables 4 and 5 summarise the changes of the three uptake measures observed using the PSF and TOF-based reconstructions relative to OSEM. The data in Table 5 for the number of lesions with a change in SUVmax greater than 25% occurred in lesions with very low grade uptake (SUVmax <2.5). Bland-Altman plots for the relative differences are shown in Figures 3, 4 and 5, which, in addition to data in Tables 4 and 5, show that the smaller values of SUVmax and SUVpeak experience the greatest increase with matched voxel COV (Figure 3a,b,c and Figure 4a,b,c). For matched SUVmax filters, this is still present with TOF algorithms (Figure 3d,f and Figure 4d,f) but not with HD reconstruction.
For matched voxel COV, the increase in both SUVmax and SUVpeak ratio for PSF and TOF-based reconstructions versus OSEM was inversely related to lesion volume as shown in Figure 6. This reflects what was seen in the image quality phantom measurements. The gains in SUVmax were most pronounced with UHD, which is likely to be a consequence of reduced post-filtering compared with HD when voxel COV was matched (2.9 mm for UHD and 3.8 mm for HD). Differences in TLG-40 were not dependent on lesion volume. No relationship between SUV difference and lesion volume was observed for matched SUVmax post-filters.
Out of the 74 lesions, 59 had a SUVmax of >5.0 using OSEM reconstruction. No change to patient management would occur in these instances as a result of an increase of SUVmax when using the PSF and TOF-based reconstructions. A key group of ten patients was identified with low or borderline SUVmax (<5.0) for suspicion of malignancy using this institute's practice. The SUVmax for these 15 lesions in each of the reconstruction algorithms are shown in Table 6. The table shows that, with matched voxel COV, several of these lesions would change classification with HD and UHD, as would be expected from data in previous tables and figures. With matched SUVmax filters, there is only one lesion that would have changed classification according to local practice and only with the TOF reconstruction.
Fifty-nine lesions were found to have SUVmax above the threshold based on the liver uptake as measured on the OSEM images. Significant SNR(T-L) gains were found for PSF and TOF-based reconstructions with both matched voxel COV and matched SUVmax. With the addition of PSF modelling, either to OSEM or OSEM + TOF images, there is a more marked gain in SNR(T-L). For matched voxel COV, SNR(T-L) ratios relative to OSEM were 1.10 ± 0.11, 1.43 ± 0.23 and 1.67 ± 0.41 for TOF, HD and UHD, respectively, and for matched SUVmax, they were 1.19 ± 0.12, 1.58 ± 0.16, and 1.94 ± 0.29, respectively. For each reconstruction algorithm, the improvement in SNR(T-L) with matched SUVmax versus matched noise was also significant.
The deployment of PSF and TOF-based reconstruction methods into routine clinical practice for FDG imaging presents a challenge, particularly in centres or collaborative imaging networks with a defined protocol for classification of malignancy based upon SUV data. To our knowledge, this is the first study that has evaluated the performance of PSF and TOF-based reconstruction algorithms with two post-filtering strategies based on the objective criteria of matched image noise (voxel COV) or matched SUVmax, quantifying the impact on SUVmax, SUVpeak, TLG and SNR(T-L). Specific findings are applicable to Siemens HD and ultraHD reconstruction algorithms using the parameters applied in the study.
It is clear from the data in Tables 1 and 2 and Figure 3 that quantification differences occur in the phantom data for all algorithms applied in this study. There are several factors that will contribute to the differences: the effect of statistical noise, partial volume effect, the size (and hence number of voxels) of the region of interest and, for the HD and UHD algorithms, Gibbs artefacts. The contributions from these factors to the measurements of SUVmax will differ as reconstruction parameters are varied. We believe that the interactions between the various factors are complex and not completely separable. As such, we do not feel that it is possible to identify one single phenomenon as the source of quantification differences for any of the algorithms used.
It can be seen that overestimation occurs for all four reconstruction algorithms (Table 1) and requires the application of a post-filter to reduce this (Table 2). The smaller filter kernel applied to HD and UHD to match noise combined with voxel correlation leads to a lesser reduction of this overestimation. It can be seen that there appears to be a particular size of object where an overestimation with HD and UHD is particularly prominent with no or minimal levels of post-filtering, which, in part, may be due to overlapping Gibbs edge artefacts. Despite this, it can be seen from HD recovery data in Table 2 that, with matched voxel variance, there is very little dependence of recovery on sphere size for the 13- to 37-mm spheres, which is a desirable property. This highlights the importance of establishing a full understanding of the impact of these algorithms, and it is the duty of medical physics experts to educate clinicians on changes expected to quantification.
Ideally, the implementation of PSF modelling would not lead to Gibbs artefacts, but given the necessary compromises for PET imaging with limited statistics, an improvement in one area such as in image resolution is almost certainly going to lead to a deterioration in other aspects. Overall, whether the changes are desirable is application dependent, with our data showing smaller absolute errors for smaller spheres (but not for large spheres) and reduced dependency on quantification with lesion size.
Matching image noise produces marked increases in SUVmax, particularly with PSF reconstructions, that are potentially clinically significant, depending on local practice. This highlights the pitfalls of using uptake metrics such as SUVmax, that are so sensitive to partial volume effects and reconstruction parameters, with fixed thresholds for malignancy. The largest increases in SUVmax occur for small lesions, which typically have low SUVmax (less than 5), which is consistent with other studies ,. One potential solution may be to modify thresholds based on estimated tumour volume. It would be useful to extend the matching of SUVmax to smaller objects, but this is not possible due to the limitation of the current NEMA phantom, with 10 mm being the diameter of the smallest sphere insert. It is these small lesions, with SUVmax close to the typical cut-offs for discrimination of benign and malignant disease, that are arguably the most critical lesions for lung cancer staging as they are likely to be possible additional pulmonary nodules or lymph nodes. Determining whether a lymph node is malignant, particularly those in the mediastinum, has a considerable influence on the overall staging and will play a major role in patient management. This change in SUVmax is expected to require an adaptation of locally used thresholds for discrimination of disease. It was also noted from the phantom studies that variability of SUVmax was worse for PSF-based algorithms in the small spheres, which suggests worse test-retest performance in clinical data. This is suspected to be due to increased inter-voxel correlation that is introduced when using PSF-based algorithms . This increased correlation results in a reduction of voxel variance (and hence the voxel COV as used in this study as a noise metric), but it has been shown to potentially result in larger variability of uptake metrics within small ROIs . We feel that the impact of PSF modelling on variability for clinical data has yet to be explored fully, and while this is beyond the scope of this study, it is recommended that caution is observed when applying PSF modelling for assessing response to treatment with follow-up scans. Despite this, the reduced levels of post-filtering required with PSF and PSF + TOF have been shown to improve lesion visualisation -.
With matched voxel COV, SUVpeak experiences similar differences to those seen for SUVmax, albeit to a lesser extent. Quantification of peak uptake implicitly includes an additional filtering operation with a spherical kernel. The small mean relative differences for TLG suggests that it is a relatively robust uptake metric when comparing against OSEM images for either filtering strategies. The large degree of variability seen in the relative changes, as highlighted by the confidence intervals in Tables 4 and 5, may be concerning. However, it should also be noted that the total range of TLG observed in this study is approximately a full order of magnitude greater than SUVmax and SUVpeak. The use of TLG has been reported in assessment of therapy response and, recently, for prognosis in a small number of studies. The increased stability of TLG with a volume delineation based on a percentage of SUVmax suggests the metric may be more appropriate than SUVmax for staging and prognosis as the evidence base for this metric is established. We believe that this is the first time that the dependence of TLG on reconstruction algorithm has been explored in the literature.
Alternatively, post-filters for PSF and TOF-based algorithms can be determined to give SUVmax that, according to this institute's practice, would not alter the outcome of the study. For all lesions with borderline SUVmax for suspicion of malignancy, relative changes with PSF and TOF-based reconstructions were less than 20%.
Matching SUVmax between PSF-based algorithms and OSEM has been demonstrated previously . However, our study has also shown that matching SUVmax will significantly reduce the voxel variance in the image compared with OSEM, which we believe has yet to be demonstrated quantitatively. Combined with increased voxel correlation, this reduction of voxel variance alters the image appearance quite considerably and may be perceived as over-smoothing of images. Findings from this study are based upon an image matrix of 256 × 256 voxels, whereas other centres may use different parameters such as 200 × 200 or 400 × 400 voxels, which are common choices on the mCT due to the system's intrinsic 400 × 400 matrix. We believe that, when Gaussian post-filtering is applied, the dependence of both image noise and SUVmax on matrix choice is diminished. It has also been shown that the thickness of the walls of the fillable spheres of the NEMA phantom has an impact on SUVmax quantification ,. This is only seen to cause appreciable error with low sphere-to-background contrast and small spheres, and hence, we expect that the impact on the test objects used in this study is likely to be minimal.
It is noted that the degree of post-filtering for the HD and UHD algorithms (6.6 and 6.5 mm, respectively) will reduce spatial resolution for these PSF-based algorithms that are intended to provide superior spatial resolution. However, we feel that this approach may be beneficial when deploying a new PET/CT scanner to an existing clinical setting, comparing patient scans for follow-up with other systems or supporting the transition to a ‘new’ imaging facility with a catalogue or library of images with higher resolution.
In this study, the addition of TOF increased the variation in ratio values of image voxel variance for both phantom and patient data with either matched noise or matched SUVmax. In the patient data only, TOF appeared to introduce a slight positive bias and greater distribution of differences in the SUVmax data. This was not seen in the phantom studies and the cause of this is unclear. It could be due to a dependence on patient size, as TOF is associated with SNR gains proportional to the diameter of object . However, in this study and others , this did not appear to apply in lung images where the majority of tissue in the image has low density with very low uptake of FDG.
We believe this is the first study to demonstrate SNR gains with PSF and/or TOF using lesion uptake as a measure of signal with two different criteria for choosing post-filtering. A recent study has shown reductions in voxel variance and gains in SNR but measured only in uniform areas of uptake with patient livers . One study has evaluated SNR gains using lesion uptake as the signal  but only comparing images reconstructed with PSF and PSF + TOF, with the intention to demonstrate the SNR gains brought on by TOF. It was expected that SNR gains would be seen for PSF and TOF-based algorithms compared with conventional OSEM. However, it was not anticipated that the gains in SNR would be greater when parameters are chosen to match SUVmax. This may be of particular relevance for low-contrast lesions elsewhere in the body, such as the abdomen, which do not have the inherent high lesion to background contrast of lung lesions. The notion that increased levels of post-filtering may be superior in terms of SNR gains seems slightly at odds with published work on lesion detection that suggest less post-filtering results in optimal lesion detection ,. This may be due to fact that the definition of SNR in this study is not a direct indicator of lesion detectability.
There are two limitations with this study where future work is planned. Firstly, no histological correlation with FDG uptake measured in the lesions was performed as in other studies . Therefore, it is not possible to determine cut-off values and diagnostic accuracy of the uptake metrics in the two strategies of implementation. This is arguably outside the scope of this study as the purpose was not to determine such data. Secondly, we have only assessed lung lesions, and from other studies , it is likely that reconstruction will perform differently in other areas of the body.
The effect of PSF and TOF-based reconstruction on quantification, particularly SUVmax, has limited their introduction into routine clinical use despite demonstrated improvements in lesion detectability. This study extends existing studies  which have shown that the impact on SUVmax can be addressed with appropriate post-filters, by demonstrating that the same approach can be used for reconstructions with TOF reconstructions and also with alternative uptake metrics such as SUVpeak or TLG. Furthermore, we have demonstrated that this additional filtering to match SUVmax actually provides added gains in SNR over parameters to match image voxel COV. However, if the additional smoothing is visually undesirable, an alternative methodology can be used which performs the additional filtering required to match SUVmax only for quantification and is not visualised .
This work evaluated the impact of reconstructions that include PSF modelling and/or TOF on lesion classification according to a local protocol by assessing changes in FDG uptake measurements. Two objective strategies for post-filtering were investigated: matching image voxel COV versus matching SUVmax. For matched voxel COV, considerable increases in SUVmax and SUVpeak were observed compared with OSEM. Using post-filters to match SUVmax reduced the discrepancies of either SUVmax or SUVpeak across reconstructions, particularly with PSF modelling. This also resulted in a considerable reduction in voxel variance. Some small discrepancies in patient data still remained when TOF was incorporated, which was not seen in phantom data, warranting further investigation. The TLG metric appears to be more robust in either scheme of post-filtering despite a slightly larger variation in the amount of change, which may be less of a problem considering the large range of TLG data observed. This suggests TLG may be a more suitable metric to adopt instead of SUVmax as the evidence base develops. Gains in SNR were seen in both implementations with the greatest gains seen for matched SUVmax post-filters.
coefficient of variation
European Organization for Research and Treatment of cancer
full width at half the maximum
Siemens HD·PET reconstruction
ordered subset expectation maximisation
PET response criteria in solid tumours
positron emission tomography
point spread function
region of interest
standardised uptake value
total lesion glycolysis
time of flight
Siemens ultraHD·PET reconstruction
volume of interest
Cerfolio RJ, Bryant AS, Ohja B, Bartolucci AA: The maximum standardized uptake values on positron emission tomography of a non-small cell lung cancer predict stage, recurrence, and survival. J Thorac Cardiovasc Surg 2005, 130: 151–159. 10.1016/j.jtcvs.2004.11.007
Cerfolio RJ, Bryant AS, Ojha B, Eloubeidi M: Improving the inaccuracies of clinical staging of patients with NSCLC: a prospective trial. Ann Thorac Surg 2005, 80: 1207–1214. 10.1016/j.athoracsur.2005.04.019
Subedi N, Scarsbrook A, Darby M, Korde K, Mc Shane P, Muers MF: The clinical impact of integrated FDG PET–CT on management decisions in patients with lung cancer. Lung Cancer 2009,64(3):301–307. 10.1016/j.lungcan.2008.09.006
Dijkman B, Schuurbiers O, Vriens D, Looijen-Salamon M, Bussink J, Timmer-Bonte J, Snoeren M, Oyen W, van der Heijden H, de Geus-Oei L-F: The role of 18F-FDG PET in the differentiation between lung metastases and synchronous second primary lung tumours. Eur J Nucl Med Mol Imaging 2010,37(11):2037–2047. 10.1007/s00259-010-1505-2
Gregory DL, Hicks RJ, Hogg A, Binns DS, Shum PL, Milner A, Link E, Ball DL, Mac Manus MP: Effect of PET/CT on management of patients with non-small cell lung cancer: results of a prospective study with 5-year survival data. J Nucl Med 2012,53(7):1007–1015. 10.2967/jnumed.111.099713
Erdi YE, Macapinlac H, Rosenzweig KE, Humm JL, Larson SM, Erdi AK, Yorke ED: Use of PET to monitor the response of lung cancer to radiation treatment. Eur J Nucl Med Mol Imaging 2000,27(7):861–866. 10.1007/s002590000258
Beyer T, Czernin J, Freudenberg LS: Variations in clinical PET/CT operations: results of an international survey of active PET/CT users. J Nucl Med 2011,52(2):303–310. 10.2967/jnumed.110.079624
Bryant AS, Cerfolio RJ: The maximum standardized uptake values on integrated FDG-PET/CT is useful in differentiating benign from malignant pulmonary nodules. Ann Thorac Surg 2006,82(3):1016–1020. 10.1016/j.athoracsur.2006.03.095
Nambu A, Kato S, Sato Y, Okuwaki H, Nishikawa K, Saito A, Matsumoto K, Ichikawa T, Araki T: Relationship between maximum standardized uptake value (SUVmax) of lung cancer and lymph node metastasis on FDG-PET. Ann Nucl Med 2009,23(3):269–275. 10.1007/s12149-009-0237-5
Young H, Baum R, Cremerius U, Herholz K, Hoekstra O, Lammertsma AA, Pruim J, Price P: Measurement of clinical and subclinical tumour response using [18F]-fluorodeoxyglucose and positron emission tomography: review and 1999 EORTC recommendations. European Organization for Research and Treatment of Cancer (EORTC) PET Study Group. Eur J Cancer 1999, 35: 1773–1782. 10.1016/S0959-8049(99)00229-4
Boellaard R, Krak NC, Hoekstra OS, Lammertsma AA: Effects of noise, image resolution, and ROI definition on the accuracy of standard uptake values: a simulation study. J Nucl Med 2004, 45: 1519–1527.
Nahmias C, Wahl LM: Reproducibility of standardized uptake value measurements determined by 18F-FDG PET in malignant tumors. J Nucl Med 2008, 49: 1804–1808. 10.2967/jnumed.108.054239
Lodge MA, Chaudhry MA, Wahl RL: Noise considerations for PET quantification using maximum and peak standardized uptake value. J Nucl Med 2012, 53: 1041–1047. 10.2967/jnumed.111.101733
Wahl RL, Jacene H, Kasamon Y, Lodge MA: From RECIST to PERCIST: evolving considerations for PET response criteria in solid tumors. J Nucl Med 2009, 50: 122S-150S. 10.2967/jnumed.108.057307
Larson SM, Erdi Y, Akhurst T, Mazumdar M, Macapinlac HA, Finn RD, Casilla C, Fazzari M, Srivastava N, Yeung HW, Humm JL, Guillem J, Downey R, Karpeh M, Cohen AE, Ginsberg R: Tumor treatment response based on visual and quantitative changes in global tumor glycolysis using PET-FDG imaging. The visual response score and the change in total lesion glycolysis. Clin Positron Imaging 1999, 2: 159–171. 10.1016/S1095-0397(99)00016-3
Wiele C, Kruse V, Smeets P, Sathekge M, Maes A: Predictive and prognostic value of metabolic tumour volume and total lesion glycolysis in solid tumours. Eur J Nucl Med Mol Imaging 2013, 40: 290–301. 10.1007/s00259-012-2280-z
Pak K, Cheon GI, Nam H-Y, Kim S-J, Kang KW, Chung J-K, Kim EE, Lee DS: Prognostic value of metabolic tumor volume and total lesion glycolysis in head and neck cancer: a systematic review and meta-analysis. J Nucl Med 2014, 55: 884–890. 10.2967/jnumed.113.133801
Chung MDHH, PD, Kwon MDHW, Kang MDKW, Park MDN-H, Song MDY-S, Chung MDJ-K, Kang MDS-B, Kim MDJW: Prognostic value of preoperative metabolic tumor volume and total lesion glycolysis in patients with epithelial ovarian cancer. Ann Surg Oncol 2012, 19: 1966–1972. 10.1245/s10434-011-2153-x
Hyun S, Ahn H, Kim H, Ahn M-J, Park K, Ahn Y, Kim J, Shim Y, Choi J: Volume-based assessment by 18F-FDG PET/CT predicts survival in patients with stage III non-small-cell lung cancer. Eur J Nucl Med Mol Imaging 2014, 41: 50–58. 10.1007/s00259-013-2530-8
Panin VY, Kehren F, Michel C, Casey M: Fully 3-D PET reconstruction with system matrix derived from point source measurements. Med Imaging, IEEE Trans 2006, 25: 907–921. 10.1109/TMI.2006.876171
Alessio AM, Stearns CW, Shan T, Ross SG, Kohlmyer S, Ganin A, Kinahan PE: Application and evaluation of a measured spatially variant system model for PET image reconstruction. Med Imaging IEEE Trans 2010, 29: 938–949. 10.1109/TMI.2010.2040188
Conti M, Bendriem B, Casey M, Chen M, Kehren F, Michel C, Panin V: First experimental results of time-of-flight reconstruction on an LSO PET scanner. Phys Med Biol 2005, 50: 4507. 10.1088/0031-9155/50/19/006
Kalemis A, Delattre BMA, Heinzer S: Sequential whole-body PET/MR scanner: concept, clinical use, and optimisation after two years in the clinic. The manufacturer's perspective. Magn Reson Mater Phy 2013, 26: 5–23. 10.1007/s10334-012-0330-y
Karp JS, Surti S, Daube-Witherspoon ME, Muehllehner G: Benefit of time-of-flight in PET: experimental and clinical results. J Nucl Med 2008, 49: 462–470. 10.2967/jnumed.107.044834
Lois C, Jakoby BW, Long MJ, Hubner KF, Barker DW, Casey ME, Conti M, Panin VY, Kadrmas DJ, Townsend DW: An assessment of the impact of incorporating time-of-flight information into clinical PET/CT imaging. J Nucl Med 2010, 51: 237–245. 10.2967/jnumed.109.068098
El Fakhri G, Surti S, Trott CM, Scheuermann J, Karp JS: Improvement in lesion detection with whole-body oncologic time-of-flight PET. J Nucl Med 2011, 52: 347–353. 10.2967/jnumed.110.080382
Akamatsu G, Ishikawa K, Mitsumoto K, Taniguchi T, Ohya N, Baba S, Abe K, Sasaki M: Improvement in PET/CT image quality with a combination of point-spread function and time-of-flight in relation to reconstruction parameters. J Nucl Med 2012, 53: 1716–1722. 10.2967/jnumed.112.103861
Kadrmas DJ, Casey ME, Black NF, Hamill JJ, Panin VY, Conti M: Experimental comparison of lesion detectability for four fully-3D PET reconstruction schemes. Med Imaging IEEE Trans 2009, 28: 523–534. 10.1109/TMI.2008.2006520
Schaefferkoetter J, Casey ME, Townsend DW, El Fakhri G: Clinical impact of time-of-flight and point response modeling in PET reconstructions: a lesion detection study. Phys Med Biol 2013, 58: 1465–1478. 10.1088/0031-9155/58/5/1465
Kadrmas DJ, Casey ME, Conti M, Jakoby BW, Lois C, Townsend DW: Impact of time-of-flight on PET tumor detection. J Nucl Med 2009, 50: 1315–1323. 10.2967/jnumed.109.063016
Tong S, Alessio AM, Thielemans K, Stearns C, Ross S, Kinahan PE: Properties and mitigation of edge artifacts in PSF-based PET reconstruction. Nucl Sci IEEE Trans 2011, 58: 2264–2275. 10.1109/TNS.2011.2164579
Rahmim A, Qi J, Sossi V: Resolution modeling in PET imaging: theory, practice, benefits, and pitfalls. Med Phys 2013, 40: 064301–064315. 10.1118/1.4800806
Rapisarda E, Bettinardi V, Thielemans K, Gilardi MC: Image-based point spread function implementation in a fully 3D OSEM reconstruction algorithm for PET. Phys Med Biol 2010, 55: 4131–4151. 10.1088/0031-9155/55/14/012
Watson CC: Estimating effective model kernel widths for PSF reconstruction in PET. Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC), 2011 IEEE; 23–29 Oct. 2011 2011, 2368–2374.
Kotasidis FA, Matthews JC, Angelis GI, Noonan PJ, Jackson A, Price P, Lionheart WR, Reader AJ: Single scan parameterization of space-variant point spread functions in image space via a printed array: the impact for two PET/CT scanners. Phys Med Biol 2011, 56: 2917–2942. 10.1088/0031-9155/56/10/003
Lasnon C, Hicks RJ, Beauregard J-M, Milner A, Paciencia M, Guizard A-V, Bardet S, Gervais R, Lemoel G, Zalcman G, Aide N: Impact of point spread function reconstruction on thoracic lymph node staging with 18F-FDG PET/CT in non–small cell lung cancer. Clin Nucl Med 2012, 37: 971–976. 10.1097/RLU.0b013e318251e3d1
Andersen FL, Klausen TL, Loft A, Beyer T, Holm S: Clinical evaluation of PET image reconstruction using a spatial resolution model. Eur J Radiol 2013, 82: 862–869. 10.1016/j.ejrad.2012.11.015
Prieto E, Dominguez-Prado I, Garcia-Velloso MJ, Penuelas I, Richter JA, Marti-Climent JM: Impact of time-of-flight and point-spread-function in SUV quantification for oncological PET. Clin Nucl Med 2013, 38: 103–109. 10.1097/RLU.0b013e318279b9df
Lasnon C, Desmonts C, Quak E, Gervais R, Do P, Dubos-Arvis C, Aide N: Harmonizing SUVs in multicentre trials when using different generation PET systems: prospective validation in non-small cell lung cancer patients. Eur J Nucl Med Mol Imaging 2013, 40: 985–996. 10.1007/s00259-013-2391-1
Boellaard R, O'Doherty MJ, Weber WA, Mottaghy FM, Lonsdale MN, Stroobants SG, Oyen WJ, Kotzerke J, Hoekstra OS, Pruim J, Marsden PK, Tatsch K, Hoekstra CJ, Visser EP, Arends B, Verzijlbergen FJ, Zijlstra JM, Comans EF, Lammertsma AA, Paans AM, Willemsen AT, Beyer T, Bockisch A, Schaefer-Prokop C, Delbeke D, Baum RP, Chiti A, Krause BJ: FDG PET and PET/CT: EANM procedure guidelines for tumour PET imaging: version 1.0. Eur J Nucl Med Mol Imaging 2010, 37: 181–200. 10.1007/s00259-009-1297-4
Kelly MD, Declerck JM: SUVref: reducing reconstruction-dependent variation in PET SUV. EJNMMI Res 2011, 1: 16. 10.1186/2191-219X-1-16
Jakoby BW, Bercier Y, Conti M, Bendriem B, Townsend D: Physical and clinical performance of the mCT time-of-flight PET/CT scanner. Phys Med Biol 2011, 56: 2375–2389. 10.1088/0031-9155/56/8/004
Conti M, Bendriem B, Casey M, Mu C, Kehren F, Michel C, Panin V: Implementation of time-of-flight on CPS HiRez PET scanner. Nuclear Science Symposium Conference Record, 2004 IEEE; 16–22 Oct. 2004 2004, 2796–2800.
National Electrical Manufacturers Association: NEMA Standards Publication NU 2–2007: Performance Measurements of Positron Emission Tomographs. NEMA 2007.
Rahmim A, Tang J: Noise propagation in resolution modeled PET imaging and its impact on detectability. Phys Med Biol 2013, 58: 6945–6968. 10.1088/0031-9155/58/19/6945
Hofheinz F, Dittrich S, Potzsch C, Hoff J: Effects of cold sphere walls in PET phantom measurements on the volume reproducing threshold. Phys Med Biol 2010, 55: 1099–1113. 10.1088/0031-9155/55/4/013
Berthon B, Marshall C, Edwards A, Evans M, Spezi E: Influence of cold walls on PET image quantification and volume segmentation: a phantom study. Med Phys 2013, 40: 082505. 10.1118/1.4813302
Budinger TF: Time-of-flight positron emission tomography: status relative to conventional PET. J Nucl Med 1983, 24: 73–78.
This study was performed as part of the first author's (IA) PhD project, which receives financial support (course fees) from Siemens Healthcare that is paid to the nuclear medicine department and then directly to the University of Manchester.
IA managed and processed all image data and wrote the manuscript. MK assisted with data analysis (MATLAB code) and critically appraised and modified the draft manuscript. HW critically appraised and modified the draft manuscript. JM is a PhD supervisor and critically appraised and modified the draft manuscript. All authors read and approved the final manuscript.