Skip to main content

Concept development of an on-chip PET system



Organs-on-Chips (OOCs), microdevices mimicking in vivo organs, find growing applications in disease modeling and drug discovery. With the increasing number of uses comes a strong demand for imaging capabilities of OOCs as monitoring physiologic processes within OOCs is vital for the continuous improvement of this technology. Positron Emission Tomography (PET) would be ideal for OOC imaging, however, current PET systems are insufficient for this task due to their inadequate spatial resolution. In this work, we propose the concept of an On-Chip PET system capable of imaging OOCs and optimize its design using a Monte Carlo Simulation (MCS).

Material and methods

The proposed system consists of four detectors arranged around the OOC device. Each detector is made of two monolithic LYSO crystals and covered with Silicon photomultipliers (SiPMs) on multiple surfaces. We use a Convolutional Neural Network (CNN) trained with data from a MCS to predict the first gamma-ray interaction position inside the detector from the light patterns that are recorded by the SiPMs on the detector’s surfaces.


The CNN achieves a mean average prediction error of 0.80 mm in the best configuration. The proposed system achieves a sensitivity of 34.81% for 13 mm thick crystals and does not show a prediction degradation near the boundaries of the detector. We use the trained network to reconstruct an image of a grid of 21 point sources spread across the field-of-view and obtain a mean spatial resolution of 0.55 mm. We show that 25,000 Line of Responses (LORs) are needed to reconstruct a realistic OOC phantom with adequate image quality.


We demonstrate that it is possible to achieve a spatial resolution of almost 0.5 mm in a PET system made of multiple monolithic LYSO crystals by directly predicting the scintillation position from light patterns created with SiPMs. We observe that a thinner crystal performs better than a thicker one, that increasing the SiPM size from 3 mm to 6 mm only slightly decreases the prediction performance, and that certain surfaces encode significantly more information for the scintillation-point prediction than others.


Organs-on-Chips (OOCs) are microdevices that mimic in vivo organs. They contain 3D tissue cultures connected by microfluidic channels that add biomechanical forces to the system [1]. OOCs have sparked the interest of researchers in the past decade, especially in the drug discovery and development process, as they can enhance several steps in this process [2].

With the growing number of use cases for OOCs comes an increasing demand for novel measurement capabilities. Monitoring metabolism or other physiologic and pathophysiologic processes in OOCs is critical to refining the technology to closely resemble in vivo physiology and promote its application in new biological models.

Positron Emission Tomography (PET) would be the ideal candidate for OOC imaging due to its ability to retrieve in vivo information about metabolism and molecular pathways [3]. However, current imaging devices for measuring PET tracer uptake in either small animals or cell cultures are inadequate for the task of OOC imaging due to their limited spatial resolution [4]. Several degrading factors limit PET systems’ spatial resolution, such as the distance that a positron travels before annihilating with an electron, scattering of the emerging gamma-rays in the tissue, and the detector’s resolution.

In recent years, there has been a trend in pre-clinical PET research toward using monolithic instead of pixelated crystals as detectors to increase the spatial resolution. The resolution in monolithic crystals is not inherently limited by the pixel size in contrast to pixelated detectors but can be improved with more advanced readout schemes and data processing methods. The key to increasing the resolution is to predict the first gamma-ray interaction position in the detector as precisely as possible and thus improve the estimation of the Line of Responses (LORs) [5].

In literature, several works have tackled the problem of predicting the gamma-ray interaction position in monolithic crystals with either analytical or data-driven approaches.

Clement et al. [6] (not related to the first author of this work) implemented one of the earliest Neural Network (NN) based methods to predict the Depth of Interaction (DOI) in monolithic crystals. They used signals of solid-state photodetectors that fully cover the crystal as input for three multilayer NNs. Each NN predicts two of the three coordinates of the gamma-ray interaction position. The authors set up a Monte Carlo Simulation (MCS) to generate training data for the networks and demonstrated that the NN-based approach yields better results than a baseline method using Anger logic with a Full Width at Half Maximum (FWHM) of 2.0 mm compared to 3.5 mm.

The work by Wang et al. [7] introduced a monolithic PET detector system that can estimate gamma-ray interaction positions with NNs. They trained one network to estimate the plane position and another to predict the DOI. The input data were created with a simplified readout scheme with signals from a Photomultiplier Tube (PMT) on one side of the crystal. The system achieves spatial and DOI resolutions of 2.0 mm.

Marcinkowski et al. [8] investigated a high-resolution small-animal PET system based on a continuous crystal. They coupled a Lutetium-yttrium oxyorthosilicate (LYSO) crystal with a Digital Photon Counter and determined the gamma-ray interaction position using mean-nearest-neighbor positioning. The system reaches a spatial FWHM of 0.60 mm and a DOI FWHM of 1.66 mm.

The work by Tao et al. [9] compared four different NN architectures that estimate gamma-ray interaction positions in monolithic crystals. They trained fully connected and Convolutional Neural Networks (CNNs) with regression and classification heads with mean detector response functions as input. The different networks reached prediction errors between 2.0 mm and 2.6 mm. The authors found that deep learning methods reduce the memory cost by a factor of one to two orders of magnitude compared to searching-based methods.

Sanaat and Zaidi [10] presented another approach to estimate the DOI in a monolithic crystal using a NN. They trained a multilayer perceptron that outputs the 3D gamma-ray interaction position with data from an MCS. Their proposed approach reaches a spatial resolution of 1.54 mm in the x-y plane, which is better than an Anger logic-based method.

Decuyper et al. [11] simulated the interaction of a gamma-ray grid with a monolithic LYSO crystal. With the simulated data, they trained a NN that estimates the first gamma-ray interaction position. The optimal amount of training data and network design was determined to overcome the problem of overfitting. With the NN, they achieved a median positioning error of 0.77 mm and a 2D FWHM of 0.46 mm, which was an improvement of 17% compared to a nearest-neighbor algorithm. The prediction performance was even more improved when only using non-Compton scattered events.

Jaliparthi et al. [12] developed AnnPET, a monolithic annular PET system consisting of a single annulus-shaped LYSO crystal with Silicon photomultiplier (SiPM) arrays attached to its outer surfaces. They employed a ten-layer CNN to estimate the gamma-ray interaction position and reached single dimension Mean Absolute Error (MAE) values between 0.42 mm and 0.54 mm for the position prediction. When using the trained network for reconstruction, they achieved FWHM values between 0.71 and 0.80 mm, which are around 0.4 mm better than the results of a center-of-mass algorithm.

The work by Liu et al. [13] introduces a dedicated PET scanner for microfluidic radiobioassays that is made up of two detector panels placed above and below the microfluidic chip. The system achieves a spatial resolution of around 1.8 mm and the authors demonstrate its capability to image cellular pharmacokinetics.

In this work, we propose an On-Chip PET system to make functional imaging of OOCs possible. The novelties presented in this work are twofold. First, we design a scanner made up of four detectors that consist of two glued-together monolithic crystals each. Second, we train a CNN directly with the light pattern images that emerge on the surfaces of the detectors to predict the first scintillation positions inside the detectors. We optimize the design of the system with an MCS to create datasets of light pattern images emerging on the surfaces of the detectors through scintillation. With these datasets, we train and evaluate CNNs that predict the first interaction positions of the gamma rays inside the detector. With the predicted scintillation positions, we reconstruct the insides of the detector using Simultaneous Algebraic Reconstruction Technique (SART) [14].

Our proposed system would support two important applications of OOCs in preclinical use - disease modeling and precision medicine [15]. Measuring cellular pharmacokinetics helps to understand human diseases by modeling biochemical and genetic manipulations. The analysis of patient-derived organoids enables finding the most suitable drug on a per-patient basis.


Monte Carlo simulation

We model the interaction of the proposed system with a Fluorine-18 positron source in an MCS built with the Geant4 Application for Emission Tomography (GATE) tool [16,17,18]. GATE enables the creation of MCSs in the field of nuclear medicine through a macro language that controls the experimental settings. It is built as a wrapper around the Geant4 simulation toolkit that enables simulating ”the passage of particles through matter” for a wide range of physics processes, particles, and materials over a broad energy spectrum [19,20,21].

A GATE simulation consists of the following parts that are described in more detail in the next sections: a scanner geometry, a phantom, material properties, physics processes, boundary surfaces, and a primary particle source.


In GATE, the concept of a system plays a crucial role if one wants to store information about particles and physical processes in the simulation. A system is defined as a template for predefined scanner types with specific geometries. Different geometrical shapes are organized in a tree-level structure for these scanners to build up the final geometry.

In this work, we use the most generic system in GATE, the scanner. In our case, the scanner is defined as a box-shaped volume and placed in the world volume. The purpose of the scanner volume is to encapsulate our proposed PET system. The box-shaped detector volume is placed inside the scanner volume and repeated four times with a ring repeater around the z-axis. Inside the detector volume, the box-shaped crystal volume is placed and repeated two times with a linear repeater. Table 1 contains the lengths, translations, and materials of each volume. We evaluate two different crystal thicknesses, 13 mm and 26 mm. All dimensions of the crystals are chosen such that arrays of commercially available SiPMs fit on the surfaces. The properties of the respective materials of the different volumes are described in the next section.

Table 1 Geometry setup of the GATE simulation

In GATE, it is important to attach the created volumes to the system to be able to record hits in them. In our case, the detector volume is attached to the first level of the scanner system and the crystal volume to the second one. To both of these volumes, we attach a crystal sensitive detector that records the hits in them.

Figure 1 depicts the simulation setup viewed from the front and the side.

Fig. 1
figure 1

Simulation setup viewed from the front and the side. The yellow volumes represent the crystals, the green ones the epoxy layer around the crystal, and the blue one the volume from which the source position is sampled to create the training dataset


We use different types of phantoms depending on the purpose of the dataset that is created by the simulation.

For training, we use a simplified model of an OOC device consisting of a box-shaped water volume surrounded by a PMMA box that is placed in the middle of the detector. The PMMA box has side lengths of 10 mm \(\times\) 26 mmm \(\times\) 76 mm while those of the water box are 4 mm shorter each.

For evaluating the sensitivity of the system, we instead add a 1 mm thick and 104 mm long water cylinder to the simulation that is placed along the z-axis of the scanner.

To demonstrate that our system is able to capture the small volumes found in OOC devices, we create a more complex OOC phantom consisting of the PMMA box mentioned above that contains four water spheres with a radius of 2 mm each, which are distributed along the z-axis.

In “Sources” section, the sources that are combined with each of the phantoms are described.


The materials of the world and scanner volumes are set to vacuum, the detector’s to epoxy, and the crystal’s to LYSO. In this way, one detector consists of two LYSO crystals that are surrounded by a 0.1 mm thick layer of epoxy, which acts as the glue and optical medium between the crystals and SiPMs.

In Table 2, the material properties are described, and in Table 3, the scintillation properties of LYSO are shown.

Table 2 Properties of the materials used in the GATE simulation
Table 3 Scintillation properties of LYSO

Physics and cuts

As a physics list, the Electromagnetics (EM) constructor with option four is chosen, which uses the most accurate standard and low-energy models available in Geant4. Table 4 shows the enabled physics processes with their selected models. We set a cut of 0.1 mm for gammas, electrons, and positrons in the scanner volume.

Table 4 Added processes in the GATE simulation


We use Geant4’s unified model to define the surfaces in the simulation. The surfaces between the detector and crystal volumes are dielectric-dielectric ones with a ground finish and a sigmaalpha value of 0.01 corresponding to a typical polished crystal. Their specular lobe constant is set to 1.0.

To detect optical photons in GATE, it is necessary to use dielectric-metal boundaries. As we want to detect the optical photons that are leaving the Epoxy layer and thus entering the SiPMs, we add dielectric-metal surfaces to the boundaries between the scanner and detector volumes. These surfaces have specular lobe, specular spike, and backscatter constants of 0.0, a reflectivity of 0.0, and an efficiency of 1.0.


To create the training and testing datasets, we add a source that emits Fluorine-18 positrons with an activity of 1000 Bq to the simulation setup. The position of the source is sampled from the box-shaped water volume described in “Phantoms” section.

To evaluate the spatial resolution of the system, we create another dataset where 21 F18-point sources with a total activity of 1000 Bq are arranged in a 7 \(\times\) 3 grid with a distance of 10 mm in between each source, which are placed in the box-shaped water volume.

A 104 mm long line source is placed along the z-axis inside the water cylinder to determine the sensitivity of the system.

To create the more complex OOC phantom, we place four hot sphere-shaped sources with radii of 0.4 mm, 0.5 mm, 0.6 mm, and 0.7 mm inside the four water spheres described in “Phantoms” section. These hot sources each have an activity concentration of 1000 Bq/mm\(^3\), and are surrounded by cold sphere-shaped sources with a radius of 2 mm and an activity concentration of 100 Bq/mm\(^3\).

Dataset creation

In the first step, we run the GATE simulation described in the previous section in parallel. In the second step, we post-process the hits output files from the GATE simulation runs. For each primary event, in which one positron is emitted, zero to two samples of the training dataset are created. The amount of created samples depends on the number of detectors in which the gamma-rays interact. If there is a scintillation event in at least one detector, the position of the first interaction of the gamma-ray as well as the corresponding light patterns that emerge on the surfaces of the detectors, are saved. To create the light patterns, we simulate the SiPMs that are attached to the surfaces of the detector. This procedure is described in more detail in the next section. Table 5 gives an overview of the created datasets. Depending on how many different datasets are needed to evaluate a design choice, we use either the smaller (100k) or larger (1M) versions.

Table 5 Overview of created datasets

Light pattern creation

To create the light patterns that emerge on the surfaces of the detector, we simulate the SiPMs that cover the surfaces of the detector with SimSiPM, a C++ library for SiPM simulation [22]. We want to evaluate the performance of our proposed system with three different options for the size of the photosensitive area of the SiPMs. The SiPMs simulated in this work come from Hamamatsu’s S14161 series with photosensitive area sizes of either 3, 4, or 6 mm [23]. From now on, we refer to the SiPMs as small (3 mm), medium (4 mm), and large (6 mm). The properties of the SiPMs are described in Table 6.

Table 6 Properties of the simulated Hamamatsu SiPMs [23]

The light patterns are created with the following procedure:

  1. 1.

    Assign the corresponding SiPM to each photon leaving the epoxy layer depending on the photon’s position.

  2. 2.

    Add photon times for each SiPM on each surface.

  3. 3.

    Integrate over the SiPM signal to compute the output value of each SiPM on each surface.

  4. 4.

    Pad the resulting light patterns with zeros such that they all have the same shape. This step is necessary to be able to use the stacked light patterns as input to the CNN.

Six light patterns recorded with different SiPMs sizes of 13 mm and 26 mm thick crystals are depicted in Fig. 2.

Fig. 2
figure 2

Light patterns recorded with different SiPMs sizes of 13 mm thick crystals (left) and 26 mm thick crystals (right). The light patterns recorded with the SiPMs on all surfaces of the detector except the front one are shown. The top light patterns are created with a photosensitive area size of the SiPMs of 3 mm, the middle ones with 4 mm, and the bottom ones with 6 mm. The light patterns are padded such that they all have the same square shape. The red dot is the scintillation position of the gamma-ray inside the crystal projected onto each surface

Scintillation position prediction

With the created dataset described in the previous section, we train a CNN that predicts the gamma-ray interaction position inside the crystal. The input to the network is the stacked light patterns recorded with SiPMs on the detectors’ surfaces. With the light pattern images as input, the network should predict the gamma-ray interaction positions.

Baseline method

In addition to our deep learning-based approach, we implement a simple centroiding-based method to serve as a baseline.

For every sample in the dataset, the baseline method performs the following steps to determine the scintillation position:

  1. 1.

    Compute the centroids of every light pattern with image moments.

  2. 2.

    Take the mean of the centroids of corresponding light patterns (top-bottom, left-right) to compute x-z and x-y centroids. This step is not applicable for the back light pattern as there is no corresponding front light pattern.

  3. 3.

    Take the mean of the corresponding dimensions of the x-y, x-z, and y-z centroids to compute the x-, y-, and z-centroid.

  4. 4.

    Stack the x-, y-, and z-centroid to create the predicted x-y-z position.

The predicted position is then compared to the ground truth value for every sample in the dataset to compute the MAE value of the baseline method.

Training pipeline

We implement the CNN training pipeline with PyTorch Lightning [24]. The code is organized with subclasses of the LightningDataModule and the LightningModule. In the LightningDataModule subclass, the train and test datasets are loaded, the train dataset is split into a train and a validation dataset, and the PyTorch DataLoaders are set up. In the LightningModule subclass, the network architecture, loss function, and optimizer are initialized. Additionally, the training, validation, and test steps are defined, including logging the loss and metrics. Weights & Biases [25] is used to create experiment sweeps and log results. The training pipeline of the scintillation position prediction is depicted in Fig. 3.

Fig. 3
figure 3

Training pipeline of the scintillation position prediction. The light patterns recorded on each surface are stacked along the channels dimension and used as input to the CNN. The CNN should predict the gamma-ray interaction position from the stacked light patterns. The MAE loss is computed with the predicted and ground truth positions and backpropagated through the network to update its weights

Experiment setup

In this work, the design of our proposed On-Chip PET system is optimized. We investigate the influences of the network architecture, crystal thickness, SiPM size, and the number of surfaces covered with SiPMs on the scintillation-position prediction performance. For the network architecture, we selected variants from the ConvNeXt [26], ResNet [27], and EfficientNet [28] families with different depths, as these architectures have shown great performances on image data. The parameters given in Table 7 are used for every experiment run. The experiments are performed on a machine with four NVIDIA GeForce RTX 3090 GPUs where one training takes approximately one hour to run.

Table 7 Parameters used for every experiment run


We use the line source along the z-axis described in “Sources” section to determine the sensitivity of the system. The source has an activity of 1000 Bq and is simulated for 10 s. The sensitivity is computed by dividing the number of coincidence events where two interactions are recorded in two different detectors by the total number of events.


We evaluate the reconstruction performance of our proposed system with a grid of point sources and the more complex OOC phantom, as described in “Phantoms” and “Sources” sections.

For the reconstruction, it is necessary to be able to create LORs, which is only possible if the back-to-back gamma-rays interact in two detectors at the same time. For each of those sample pairs, the two corresponding scintillation positions are predicted with a trained network. All pairs of predicted scintillation positions are stored for further processing.

We perform the following steps for SART, an algebraic reconstruction method that shows good reconstruction performance in cases where there is limited data available [14]:

  1. 1.

    Load all predicted pairs of scintillation positions from disk.

  2. 2.

    Compute the distance and angle of the LOR to the origin for each LOR that is defined by the pair of predicted scintillation positions.

  3. 3.

    Create the sinogram from the LORs by computing the 2D histogram of the distances and angles with a bin size of 400 in both dimensions.

  4. 4.

    Generate the corresponding reconstructed image by running five iterations of SART implemented in scikit-image [30].


Scintillation position prediction

Baseline method

The baseline method using a centroiding-based approach as described in “Baseline method” section achieves MAE values on the test dataset ranging from 2.72 mm for large SiPMs to 3.18 mm for small ones for a crystal thickness of 13 mm. The results for a crystal thickness of 26 mm are around 2.3 mm higher. Table 8 contains the prediction errors for all six combinations of crystal thicknesses and SiPM sizes.

Table 8 Scintillation-position prediction results of different SiPM sizes and crystal thicknesses of the baseline method

Crystal thickness and SiPM size

With the first set of CNN training runs, we determined the optimal crystal thickness and SiPM size. For the crystal thickness, we evaluate two options, 13 and 26 mm, as those could easily be realized in reality with stock SiPM arrays. For this set of experiment runs, the light patterns from all five surfaces covered with SiPMs are used as input to the network and the training datasets with 100k samples are used. Table 7 depicts the other training parameters used for these runs. Table 9 shows the prediction performances on the validation dataset. The MAE achieved with 13 mm thick crystals is 0.98 mm for the small SiPM size, 0.99 mm for the medium, and 1.00 mm for the large ones. The results for 26 mm thick crystals are around 0.20 mm worse for each SiPM size.

Table 9 Scintillation-position prediction results of different SiPM sizes and crystal thicknesses of the first set of experiment runs

Network architecture

With the second set of CNN training runs, we determine the best working network architecture among variants from the ConvNeXt, ResNet, and EfficientNet families with different depths. For this set of experiment runs, the GATE simulation is run with a crystal thickness of 13 mm, the light patterns are created with the small SiPMs, the training dataset with one million samples is used, and the light patterns from all five surfaces covered with SiPMs are used as input to the network. Table 7 depicts the other training parameters used for these runs. Table 10 shows the prediction performances of the different network architectures on the validation dataset. ConvNeXt Tiny, ConvNeXt Large, and ConvNeXt Base are the architectures that perform best with an MAE value of 0.82 mm closely followed by ResNet50 with 0.83 mm and ResNet18 and ResNet101 with 0.84 mm. The three architectures from the EfficientNet family achieve results between 0.88 and 0.90 mm.

Table 10 Scintillation-position prediction results of different network architectures for the small SiPM size

Combination of light patterns as input

With the third set of CNN training runs, we determined the optimal combination of input light patterns. All possible combinations of the five surfaces, in total 31, are evaluated for a crystal thickness of 13 mm, the training dataset with one million samples, and the small SiPM size. The training parameters are those from Table 7. Table 11 contains the prediction results. The best ten combinations achieve results close together with MAE values between 0.8237 mm and 0.8370 mm. The worst performances are achieved with the top and bottom surfaces with an MAE of around 3.9 mm.

Table 11 Scintillation-position prediction results of different surfaces as input

Best result on test dataset

We trained a network with the best-performing design choices from the three experiment runs—a crystal thickness of 13 mm, a SiPM size of 3 mm, the ConvNeXt Tiny architecture, and the back-left-right surface combination—with the training dataset consisting of one million samples. This network achieves an MAE of 0.80 mm on the test dataset and is used for the reconstruction pipelines described in the following section. The training and validation loss curves are shown in Fig. 4.

Fig. 4
figure 4

Training curves of the network with the best performing design choices. The train MAE is depicted in blue and the validation MAE in orange

Anisotropy of positioning error

Figure 5 shows the anisotropy of the scintillation-position error at different crystal depths. The network taking all five surfaces covered with SiPMs from Table 11 is used to predict the scintillation positions. Figure 6 depicts the positioning errors for different input surface combinations at depth ranges. The mean positioning error is computed for depth intervals of 3.25 mm. The best performing combination for every number of surface light patterns from Table 11 is used to predict the first interaction positions.

Fig. 5
figure 5

Anisotropy of the predicted interaction position at different crystal depths. The positioning error between the predicted and ground truth positions is plotted across the x-z dimensions for crystal depths between 0 mm and 3.25 mm (top-left), 3.25 mm and 6.5 mm (top-right), 6.5 and 9.75 mm (bottom-left), and 9.75 mm and 13 mm (bottom-right)

Fig. 6
figure 6

Positioning errors for different input surface combinations at depth ranges. The mean positioning error is computed for depth intervals of 3.25 mm. The best performing combination for every number of surface light pattern from Table 11 is used to predict the first interaction positions


We recorded 3481 coincidence events from the line source simulation with 13 mm thick crystals where a total of 10,000 events were created and 4072 coincidence events for 26 mm thick crystals. For the 13 mm thick crystals, there we no random coincidence events, and for the 26 mm ones, there were four. This leaves us with sensitivities of 34.81% and 40.68% for the 13 mm and 26 mm thick crystals, respectively.

Spatial resolution

We use the best-performing model from “Best result on test dataset” section to predict the pairs of interaction positions of the point-sources grid dataset described in Table 5. The SART steps from “Reconstruction” section are performed to create the sinogram and reconstructed image shown in Fig. 7.

Fig. 7
figure 7

Sinogram and corresponding reconstructed image of point sources arranged in a 7 \(\times\) 3 grid with a distance of 10 mm between each source. The best performing model from “Best result on test dataset” section is used to predict the scintillation positions. The reconstructed image is created with SART [14]

Table 12 depicts the spatial resolution FWHM values for each of the 21 point sources. The mean FWHM value is 0.55 mm with a standard deviation of 0.19 mm. The FWHM values are computed by averaging the peak half widths of x-, y-, and z-line profiles drawn through the reconstructed image. The profiles are Gaussian filtered with a sigma of 0.5. Figure 8 depicts the z-line profiles drawn through the reconstructed image from Fig. 7.

Fig. 8
figure 8

Z-Line profiles drawn through the reconstructed image Fig. 7. The profiles are Gaussian filtered with a sigma of 0.5

Table 12 Spatial resolution FWHM values in mm of the reconstructed image of the grid of point sources shown in Fig. 7

OOC phantom reconstruction

We reconstruct images of the OOC phantom described section 2.1.2 in the same manner as the grid of point sources in the previous section. However, in this case, we perform multiple reconstructions with different numbers of LORs to determine the minimum number of LORs needed for an adequate image quality. Figure 9 shows the sinogram and corresponding reconstructed image where 100k LORs are used. In Fig. 10, the Signal-to-Noise Ratios (SNRs) for different numbers of LORs and source-volume sizes are shown. The SNRs values are computed by taking the mean from the x-, y-, and z-line profiles drawn through the center of each sphere source. The sphere-shaped source with a radius of 0.7 mm and a volume of 1.44 mm\(^3\) achieves the best SNR of 30.36 dB when using 100k LORs. The SNR drop-off for the smallest source with a radius of 0.4 mm and a volume of 0.27 mm\(^3\) when using 6250 and 3125 LORs comes from SNR values of 0 dB in one of the dimensions of the line profiles. The smallest SNR where there are three non-zero values from the line profiles is 10.49 dB achieved by the source with a radius of 0.5 mm and a volume of 0.52 mm\(^3\).

Fig. 9
figure 9

Sinogram and corresponding reconstructed image of the OOC phantom described in “Best result on test dataset” section. The phantom consists of four sphere-shaped hot sources with radii of 0.4 mm, 0.5 mm, 0.6 mm, and 0.7 mm. The hot sources are surrounded by sphere-shaped cold sources with a radius of 2 mm each. The activity concentration of the hot sources is 1000 Bq/mm\(^3\) and 100 Bq/mm\(^3\) for the cold sources

Fig. 10
figure 10

SNRs for different number of LORs and source-volume sizes. The OOC phantom described in “Phantoms” section is reconstructed with different numbers of LORs to determine the minimum number of LORs needed for an adequate image quality. The hot parts of the phantom are sphere-shaped volumes with radius ranging from 0.4 to 0.7 mm leading to the volume sizes used in the figure


Influence of the crystal thickness and SiPM size on the prediction performance

The results from the baseline method in “Baseline method” section show an inverse relationship between the prediction error and the SiPM size: the larger the SiPM size, the smaller the MAE value. This can be explained as follows: the larger the size of the sensitive area of the SiPM, the more optical photons are detected by the SiPM, and therefore, the higher the output value of the SiPM. Thus, the maximum values of the light patterns created with larger SiPMs are higher, and the positions of the centroids are closer to the positions of the maxima. This in turn enables a better prediction performance, as the scintillation position correlates with the position of the maxima of the light patterns.

We implemented the baseline method to serve as a comparison to the deep learning-based approaches, which give almost one order of magnitude better results. The deep learning-based method in “Crystal thickness and SiPM size” section shows the opposite relationship compared to the baseline method between the prediction errors and the SiPM sizes: the smaller the SiPM size, the smaller the MAE value. This can be explained by the observation that CNNs perform better with increasing image dimensions. In our case, the light patterns created with small SiPMs have dimensions of 32 \(\times\) 32, the medium ones 24 \(\times\) 24, and the large ones 16 \(\times\) 16.

When building an actual prototype of the system, we cannot simply choose the option with the best prediction performance but also have to take practical considerations into account. As Table 13 shows, the number of channels that need to be read out individually for one detector covered with small SiPMs is four times higher than when using large SiPMs, and 1.8 times higher compared when using medium SiPMs. At the same time, the MAE value for 13 mm thick crystals of the large SiPMs is only 2% higher than the one of the small SiPMs. Therefore, we need to find the right balance between system performance and technical feasibility as well as costs for the prototype.

Table 13 Number of channels for one detector for the different SiPM sizes

With regards to the crystal thickness, the design choice is easier to make. The results from the baseline method as well as the deep-learning based method show that the MAE values are around 20% higher for 26 mm thick crystals compared to 13 mm thick ones. Additionally, thinner crystals are also cheaper to manufacture. One downside of thinner crystals is their lower sensitivity compared to thicker ones.

Influence of the network architecture on the prediction performance

The results from the network architecture runs described in “Network architecture” section show that the family of ConvNeXt networks is better suited to the task of predicting the first scintillation position than the one of ResNet and EfficientNet. Determining the best overall architecture is a close call, as the MAE values within the ConvNeXt family are quite close. The results for the three ConvNeXt variants lie within 0.0025 mm and are closely followed by ResNet50 and ResNet18. As we are dealing with relatively small images (32 \(\times\) 32) compared and to increase the training and inference time of the network, it makes sense to choose the architecture with the smallest number of parameters, ConvNeXt Tiny.

Influence of the number of input light patterns on the prediction performance

The results from the experiments with varying numbers of input light patterns described in “Combination of light patterns as input” section indicate that light patterns from certain surfaces encode significantly more information about the scintillation position than others. We observe that the back surface achieves a prediction error that is only 0.0133 mm higher than the best overall performance that uses three surfaces as input. The result from the back surface is also better than many combinations of two, three, and four surfaces. Another indication of the high amount of information that is encoded in the back surface is that the back surface is part of every combination of surfaces in the ten best results. This observation should be expected as the back surface is the largest of all surfaces and therefore should encode the most information about the scintillation position for the network.

From these results, we observe a further practical implication: we do not need to cover all five surfaces with SiPMs to achieve good scintillation-position prediction performances.

Anisotropy of the predicted interaction positions

Figures 5 and 6 indicate that the positioning error decreases the deeper the scintillation takes place in the crystal. From Fig. 5 it is not clearly visible that the x- and z-positions of the scintillation influence the prediction performance. The heatmaps of the positioning errors do not show a degradation of the prediction performance near the boundaries of the detector as good (dark) and bad (light) spots are spread equally across the entire view. From Fig. 6 we observe that having SiPMs on additional surfaces instead of only on the back one increases the prediction performance more the closer the scintillation takes place to the back surface. From this figure, we also get an explanation of why the thicker crystals perform worse than the thinner ones. There is a clear relation visible between decreasing positioning error and increasing scintillation-position depth. The closer the scintillation takes place to the back surface the better the prediction performance. For the 26 mm thick crystals, the amount of scintillations that take place further away from the back surface is higher than for 13 mm thick crystals and therefore the prediction performance is worse for the thicker crystals.

Spatial resolution

The SART reconstruction of the grid of point sources described in “Reconstruction” section shows good reconstruction performance over the entire OOC device. We observe that the FWHM values from Table 12 slightly drop off when moving from the middle row (4), where the best FWHM values are reached, to lower and higher rows. The worst results are achieved close to the upper and lower boundaries of the field of view of the scanner.

OOC imaging capability

A qualitative evaluation of Fig. 9 lets us conclude that our proposed system is capable of imaging OOC devices containing volumes between 0.27 mm\(^3\) and 1.44 mm\(^3\). From Fig. 10, we observe a linear relationship between the SNRs and numbers of LORs for the four sphere sources with different volumes except for the drop-off described in section 3.4.

If we set the SNR threshold for an acceptable reconstructed image to above 16 dB, 25,000 LORs are needed to be able to reconstruct all four sphere sources with adequate quality. With an activity concentration of 1000 Bq/mm\(^3\) in the hot regions (four spheres with different radii) and 100 Bq/mm\(^3\) in the cold regions (four spheres with radii of 2 mm each), there is a total activity of 16,537 Bq in the system. With a sensitivity of 34.81%, around 73,500 events are needed for 25,000 LORs, which equals a recording time of fewer than 5 seconds. This means that it would be possible to perform pharmacokinetic analyses of OOCs with our proposed system, similar to the work of Liu et al. [13] with a higher resolution and smaller time interval. With the growing use of 3D models in radiopharmaceutical research, our system would enable deeper analyses of these models and provide a further tool for radiopharmacists to develop radiotheranostics [31].


In this work, we introduced the concept of an On-Chip PET system that makes OOC imaging possible. The main challenge to overcome for PET systems in this task is their limited spatial resolution, which lies in the range of slightly more than 1 mm. Previous works have shown that it is possible to achieve resolutions of less than 1 mm with setups consisting of a monolithic crystal combined with advanced data-analysis methods using deep learning-based approaches.

In this work, we designed a system consisting of four detectors each made up of two monolithic LYSO crystals with SiPMs attached to multiple surfaces. We generated training, testing, and reconstruction datasets with a MCS of the system and observed that the ConvNeXt Tiny architecture achieved the best scintillation-position prediction results with a MAE value of 0.80 mm on the test dataset. The proposed system achieves a sensitivity of 34.81% for 13 mm thick crystals and 40.68 % for 26 mm thick ones. It does not show a interaction-position prediction degradation near the boundaries of the detector. With the trained network, we reconstructed a grid of point sources using SART and reached a mean FWHM value of 0.55 mm, which is close to the lower bound of PET spatial resolution without positron range correction.

The results from the scintillation-position prediction and reconstruction demonstrate the capability of our system to achieve a resolution of almost 0.5 mm for a large field-of-view using out-of-the-box reconstruction methods. We showed that it is possible to reconstruct the small volumes found on OOC devices and that our proposed system would be able to perform pharmacokinetic analyses of OOCs. As next steps, we will shift our focus from interaction-position prediction to reconstruction to achieve an even better resolution. We will develop a list mode-based reconstruction method incorporating the geometrical priors that our system and OOC devices are constrained by.

One limitation of the proposed system in its current state is that it would not be able to differentiate signals coming from an individual organoids. However, it would still be possible to capture the spread of the fluid inside the vessel system and measure uptake differences between different compartments on the OOC device. This would enable pharmacokinetics analysis of OOCs for disease modeling and precision medicine applications.

Our results showed practical implications that play a crucial role in the next project steps, where we are going to build a prototype of the proposed system. We compared the performances of different SiPM sizes and observed that using larger SiPM sizes results in a slight decrease in prediction performance only while reducing the number of channels that need to be read out individually. We also saw that using thinner crystals is advantageous and that the back surface encodes significantly more information about the scintillation position compared to other surfaces. Therefore, we concluded that not all five surfaces of each detector need to be covered with SiPMs. This reduces the number of channels that need to be read out individually further.

Availability of data and materials

The code and datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.


  1. Low LA, Mummery C, Berridge BR, Austin CP, Tagle DA. Organs-on-chips: into the next decade. Nat Rev Drug Discov. 2021;20(5):345–61.

    Article  CAS  Google Scholar 

  2. Esch EW, Bahinski A, Huh D. Organs-on-chips at the frontiers of drug discovery. Nat Rev Drug Discov. 2015;14(4):248–60.

    Article  CAS  Google Scholar 

  3. Gambhir SS. Molecular imaging of cancer with positron emission tomography. Nat Rev Cancer. 2002;2(9):683–93.

    Article  CAS  Google Scholar 

  4. Pratx G, Chen K, Sun C, Martin L, Carpenter CM, Olcott PD, et al. Radioluminescence microscopy: measuring the heterogeneous uptake of radiotracers in single living cells. PLoS ONE. 2012;7(10):e46285.

    Article  CAS  Google Scholar 

  5. Jones T, Townsend D. History and future technical innovation in positron emission tomography. J Med Imaging (Bellingham). 2017;4(1):011013.

    Article  Google Scholar 

  6. Clement D, Frei R, Loude JF, Morel C. Development of a 3D position sensitive scintillation detector using neural networks. In: 1998 IEEE nuclear science symposium conference record. 1998 IEEE nuclear science symposium and medical imaging conference (Cat. No.98CH36255). vol. 3; 1998. p. 1448–52.

  7. Wang Y, Zhu W, Cheng X, Li D. 3D position estimation using an artificial neural network for a continuous scintillator PET detector. Phys Med Biol. 2013;58(5):1375–90.

    Article  CAS  Google Scholar 

  8. Marcinkowski R, Mollet P, Holen RV, Vandenberghe S. Sub-millimetre DOI detector based on monolithic LYSO and digital SiPM for a dedicated small-animal PET system. Phys Med Biol. 2016;61(5):2196–212.

    Article  CAS  Google Scholar 

  9. Tao L, Li X, Furenlid LR, Levin CS. Deep learning techniques for gamma ray interaction location estimation in monolithic scintillation crystal detectors. In: 2018 IEEE Nuclear science symposium and medical imaging conference proceedings (NSS/MIC); 2018. p. 1–3.

  10. Sanaat A, Zaidi H. Accurate estimation of depth of interaction in PET on monolithic crystal coupled to SiPMs using a deep neural network and Monte Carlo simulations. In: 2019 IEEE nuclear science symposium and medical imaging conference (NSS/MIC); 2019. p. 1–3.

  11. Decuyper M, Stockhoff M, Vandenberghe S, Holen RV. Artificial neural networks for positioning of gamma interactions in monolithic PET detectors. Phys Med Biol. 2021;66(7):075001.

    Article  CAS  Google Scholar 

  12. Jaliparthi G, Martone PF, Stolin AV, Raylman RR. Deep residual-convolutional neural networks for event positioning in a monolithic annular PET scanner. Phys Med Biol. 2021;66(14):145008.

    Article  Google Scholar 

  13. Liu Z, Zhang P, Ji H, Long Y, Jing B, Wan L, et al. A mini-panel PET scanner-based microfluidic radiobioassay system allowing high-throughput imaging of real-time cellular pharmacokinetics. Lab Chip. 2020;20:1110–23.

    Article  CAS  PubMed  Google Scholar 

  14. Andersen AH, Kak AC. Simultaneous algebraic reconstruction technique (SART): a superior implementation of the art algorithm. Ultrason Imaging. 1984;6(1):81–94.

    Article  CAS  Google Scholar 

  15. Kim J, Koo BK, Knoblich JA. Human organoids: model systems for human biology and medicine. Nat Rev Mol Cell Biol. 2020;21(10):571–84.

    Article  CAS  Google Scholar 

  16. Jan S, Santin G, Strul D, Staelens S, Assié K, Autret D, et al. GATE: a simulation toolkit for PET and SPECT. Phys Med Biol. 2004;49(19):4543–61.

    Article  CAS  Google Scholar 

  17. Jan S, Benoit D, Becheva E, Carlier T, Cassol F, Descourt P, et al. GATE V6: a major enhancement of the GATE simulation platform enabling modelling of CT and radiotherapy. Phys Med Biol. 2011;56(4):881–901.

    Article  CAS  Google Scholar 

  18. Sarrut D, Bardiès M, Boussion N, Freud N, Jan S, Létang JM, et al. A review of the use and potential of the GATE Monte Carlo simulation code for radiation therapy and dosimetry applications. Med Phys. 2014;41(6):064301.

    Article  Google Scholar 

  19. Agostinelli S, Allison J, Amako K, Apostolakis J, Araujo H, Arce P, et al. Geant4—a simulation toolkit. Nucl Instrum Methods Phys Res A Accel Spectrom Detect Assoc Equip. 2003;506(3):250–303.

    Article  CAS  Google Scholar 

  20. Allison J, Amako K, Apostolakis J, Araujo H, Arce Dubois P, Asai M, et al. Geant4 developments and applications. IEEE Trans Nucl Sci. 2006;53(1):270–8.

    Article  Google Scholar 

  21. Allison J, Amako K, Apostolakis J, Arce P, Asai M, Aso T, et al. Recent developments in Geant4. Nucl Instrum Methods Phys Res A Accel Spectrom Detect Assoc Equip. 2016;835:186–225.

    Article  CAS  Google Scholar 

  22. Edoardo P, Romualdo S. SimSiPM: a library for SiPM simulation; 2021.

  23. Photonics H. S14160/S14161 Series; 2020.

  24. Falcon W. PyTorch Lightning. GitHub Note: 2019;3.

  25. Biewald L. Experiment Tracking with Weights and Biases; 2020. Software available from

  26. Liu Z, Mao H, Wu C, Feichtenhofer C, Darrell T, Xie S. A ConvNet for the 2020s. CoRR. 2022; arXiv:2201.03545.

  27. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. arXiv:151203385 [cs]. 2015.

  28. Tan M, Le QV. EfficientNet: rethinking model scaling for convolutional neural networks. arXiv:190511946 [cs, stat]. 2020.

  29. Kingma DP, Ba J. Adam: a method for stochastic optimization. arXiv:14126980 [cs]. 2017.

  30. van der Walt S, Schönberger JL, Nunez-Iglesias J, Boulogne F, Warner JD, Yager N, et al. Scikit-Image: image processing in Python. PeerJ. 2014;2:e453.

    Article  Google Scholar 

  31. Doctor A, Seifert V, Ullrich M, Hauser S, Pietzsch J. Three-dimensional cell culture systems in radiopharmaceutical cancer research. Cancers. 2020;12(10):2765.

    Article  CAS  Google Scholar 

Download references


Calculations were performed on UBELIX (, the HPC cluster at the University of Bern. This project receives support by the CERN Budget for Knowledge Transfer to Medical Applications.


This work was supported by the Swiss National Science Foundation under Grant SNFN 200021_188914.

Author information

Authors and Affiliations



CC wrote the code to create and analyze the data and wrote the manuscript. GB created the first version of the Monte Carlo Simulation and supported its further development. MP, FP, and EA consulted the development of the simulation and provided material properties. MKJ, SZ, EA, AR, and KS developed the concept of the On-Chip PET system and provided guidance in the development process. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Christoph Clement.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Clement, C., Birindelli, G., Pizzichemi, M. et al. Concept development of an on-chip PET system. EJNMMI Phys 9, 38 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: