Joran Schoorlemmer


Dynamic modelling of PSA trajectories to personalise follow-up care for prostate cancer patients

Co-authors: Yvonne M. Geurts, Marcel Verheij, Henk van der Poel, Lonneke van de Poll-Franse, Iris Walraven

As part of the SPACE project, we aim to forecast the prostate-specific antigen (PSA) trajectory of prostate cancer (PCa) patients after prostatectomy. SPACE is part of the PersOn consortium.

Abstract

Background:
Current follow-up (FU) guidelines for prostate cancer patients after prostatectomy include frequent prostate-specific antigen (PSA) checks, which place a high burden on patients and healthcare systems. Existing tools to optimize FU frequency cannot dynamically integrate longitudinal PSA data, which limits their clinical use. We developed a method that maps sparse longitudinal PSA data to functions, so that PSA trajectories can be forecast dynamically during FU.
Methods:
We used data from the prospective Dutch Prostate Cancer Network, including 832 patients treated with prostatectomy from 2005 to 2021. We first performed a functional principal component analysis (fPCA) to capture the variance in the PSA data in multiple eigenfunctions and fPCA scores. Next, an ensemble regression model was trained to predict individual fPCA scores from clinical variables (e.g., age and Gleason score). Finally, patients' PSA trajectories were forecast at the first FU check by combining the population eigenfunctions with the predicted fPCA scores.
Results:
Median age and PSA level at the time of prostatectomy were 64 years (IQR 59-68) and 8.0 ng/mL (IQR 5.6-11.7), respectively. Median follow-up was 3.2 years (IQR 1.7-5.0) with 6 PSA tests (IQR 4-10). Preliminary results showed that fPCA could describe the PSA trajectory of patients, capturing 97% of the variance in the first eigenfunction. However, ensemble model performance was low, with the best regressor in the ensemble explaining only 10% of the variance.
Conclusion:
Preliminary results showed that fPCA could capture data variance for dynamic forecasting of PSA trajectories. Next steps include improving model performance and assessing to what extent the trajectories can reduce PSA test frequency in the clinic.
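
The forecasting idea described in the Methods can be illustrated with a minimal sketch. It is not the study's pipeline. It assumes synthetic log-PSA curves observed on a shared, regular time grid (the real data are sparse and irregular), extracts only the first eigenfunction by power iteration, and reuses each patient's own fPCA score where the study would plug in a score predicted from clinical variables. All names (`first_fpc`, `reconstruct`) and all simulated values are hypothetical.

```python
import math
import random


def first_fpc(curves):
    """Return mean curve, first eigenfunction, scores and explained-variance fraction.

    `curves` is a list of equally long lists sampled on one shared time grid.
    The eigenfunction is the leading eigenvector of the sample covariance
    between grid points (no quadrature weights, which is fine for a regular grid).
    """
    n, m = len(curves), len(curves[0])
    mean = [sum(c[j] for c in curves) / n for j in range(m)]
    centred = [[c[j] - mean[j] for j in range(m)] for c in curves]
    cov = [[sum(r[a] * r[b] for r in centred) / (n - 1) for b in range(m)]
           for a in range(m)]

    # Power iteration: repeatedly apply the covariance and renormalise.
    # Assumes the start vector is not orthogonal to the leading eigenvector.
    phi = [1.0 / math.sqrt(m)] * m
    for _ in range(200):
        nxt = [sum(cov[a][b] * phi[b] for b in range(m)) for a in range(m)]
        norm = math.sqrt(sum(x * x for x in nxt))
        phi = [x / norm for x in nxt]

    eigval = sum(phi[a] * cov[a][b] * phi[b] for a in range(m) for b in range(m))
    explained = eigval / sum(cov[a][a] for a in range(m))
    scores = [sum(r[j] * phi[j] for j in range(m)) for r in centred]
    return mean, phi, scores, explained


def reconstruct(mean, phi, score):
    """Trajectory = population mean + patient score * population eigenfunction."""
    return [mu + score * p for mu, p in zip(mean, phi)]


# Synthetic cohort (assumption): log-PSA rises linearly with a patient-specific slope.
rng = random.Random(0)
grid = [0.5 * k for k in range(11)]  # 0 to 5 years after surgery
curves = []
for _ in range(200):
    slope = rng.gauss(0.3, 0.3)
    curves.append([slope * t + rng.gauss(0.0, 0.05) for t in grid])

mean, phi, scores, explained = first_fpc(curves)
fitted = reconstruct(mean, phi, scores[0])
max_err = max(abs(f - y) for f, y in zip(fitted, curves[0]))
print(f"variance explained by first eigenfunction: {explained:.3f}")
print(f"largest reconstruction error for patient 0: {max_err:.3f}")
```

In this toy setting a single score per patient is enough to redraw the whole curve, which is why the quality of the score prediction matters so much for the final forecast.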


Characterizing HEK293 metabolism for varying growth media using metabolic modelling

During my time at Boehringer Ingelheim, I worked in the Bioprocess Development Biologicals department. I designed and executed lab experiments and wrote scripts to analyze and model the resulting data. The abstract of this project is shown below.

Abstract

Biopharmaceuticals such as antibodies and viral vectors are extremely potent medicines due to their high specificity and efficacy. While antibodies are already produced on a large scale, the production of novel viral therapeutics is more complicated. These viral vectors and vaccines require multiple post-translational modifications, which are not feasible in the current internal production platforms of the human pharma business unit of Boehringer Ingelheim. HEK293 cells will therefore form the basis of a suitable platform, since this cell line of human origin allows production of these medicines. Creating this platform requires a thorough understanding of the process and of cell metabolism. In this project, a model is created to describe and predict the behavior of HEK293 cells using Metabolic Flux Analysis. The model can also be used to optimize media formulations by identifying bottlenecks in cell metabolism, thereby substantially increasing development speed and success. The model is tested and verified in a Design of Experiments setup using the ambr®15 bioreactor. The investigated factors are the seeding cell density, glucose limit, glutamine source, glutamine source concentration and tyrosine concentration of the growth medium. The seeding cell density was found to be highly important for controlling and shifting the metabolism of the cells into a stationary phase. The tyrosine and glutamine source concentrations were also relevant for more specific reactions such as alanine transaminase. This model provides a basis for more in-depth research on the differences between growth phases and, eventually, for implementing virus production.
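
The abstract does not spell out the reaction network or the solver, so the following is only the textbook steady-state formulation on which Metabolic Flux Analysis is usually built, not the project's actual model. The symbols ($S$, $\mathbf{v}$, and the split into measured and unknown fluxes) are assumptions introduced for illustration.

```latex
\begin{align}
  S\,\mathbf{v} &= \mathbf{0}
    && \text{pseudo-steady state: no net accumulation of intracellular metabolites} \\
  S_u\,\mathbf{v}_u + S_m\,\mathbf{v}_m &= \mathbf{0}
    && \text{columns split into unknown ($u$) and measured ($m$) fluxes} \\
  \hat{\mathbf{v}}_u &= -\,S_u^{+}\,S_m\,\mathbf{v}_m
    && \text{least-squares estimate via the pseudoinverse } S_u^{+}
\end{align}
```

Here $S$ is the stoichiometric matrix (metabolites by reactions) and $\mathbf{v}_m$ would hold measured exchange rates such as glucose uptake. For a hypothetical single pyruvate node, fed by glycolysis (two pyruvate per glucose) and drained only by lactate secretion and entry into the TCA cycle, the balance reduces to $v_{\mathrm{TCA}} = 2\,v_{\mathrm{glc}} - v_{\mathrm{lac}}$.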


Phages in different thermal stages. A machine learning study on thermal stability and diversity of bacteriophage protein structures using AlphaFold.

For my Master's thesis, I worked on uncovering the underlying characteristics of thermophilic bacteriophages. The project focused on the surfaces of structural proteins, for which AlphaFold structures were used. The abstract of the project and the full thesis are shown below.

Abstract

Phages, viruses that infect bacteria, are found in a wide range of environments and have developed advanced strategies to survive in these surroundings. To get a better understanding of their stability under thermal stress, this project analysed the strategies phages use to withstand heat. Structural phage protein surfaces were classified and compared with each other. This gave insight into the wide diversity of these proteins and was used to predict thermostability using machine learning models. Using two ways of assessing thermostability, random forest models were created for proteins separated by structural class. Proteins were characterized using structural features, such as the compactness or surface charge density of the protein, and using sequence features retrieved from the deep learning embedding of UniRep. 75,567 protein structures were retrieved from the novel AlphaFold database and were checked for inaccuracies using a custom filtering pipeline, which filtered out 22,843 low-confidence entries and 23,454 loose structures. Model performance was found to correlate inversely with protein class diversity, indicating that different strategies are used within protein classes to withstand thermal stress. Combining different classes in one model led to lower predictive performance, confirming the high diversity between phage protein classes. The best performing model, with an F1 score of 0.52, used structural features and temperatures estimated from 16S rRNA GC% for the shaft class. This is far better than forced positive classification (F1=0.08) and showed the importance of charged and turn surface residues in shaft proteins for thermostability. The use of phages in phage therapy to battle antibiotic-resistant bacteria and medicine delivery through phage design are very promising. However, problems regarding preparation and stabilization currently complicate the implementation of these phage applications. The novel characterizations of phage proteins in this project can be used to depict phages more accurately for phage therapy and design.
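
To put the two F1 scores in context, the sketch below computes F1 for a classifier that labels every protein as positive, which is how "forced positive classification" is interpreted here (an assumption). In that case recall is 1 and precision equals the fraction of positives p, so F1 = 2p / (1 + p). The label vectors and function names are hypothetical, and the implied positive fraction is an inference from the reported baseline, not a number stated in the thesis abstract.

```python
def f1_score(y_true, y_pred):
    """F1 for binary labels (1 = thermostable, 0 = not), computed from counts."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def all_positive_f1(prevalence):
    """F1 of the always-positive classifier: precision = prevalence, recall = 1."""
    return 2 * prevalence / (1 + prevalence)


def prevalence_from_baseline_f1(f1):
    """Invert F1 = 2p / (1 + p) to recover the positive fraction p."""
    return f1 / (2 - f1)


# Hypothetical labels: 1 positive among 24 proteins, everything predicted positive.
y_true = [1] + [0] * 23
y_pred = [1] * 24

baseline = f1_score(y_true, y_pred)
implied = prevalence_from_baseline_f1(0.08)
print(f"all-positive F1 at 1/24 prevalence: {baseline:.2f}")
print(f"positive fraction implied by a baseline F1 of 0.08: {implied:.3f}")
```

Under this reading, a baseline of 0.08 corresponds to roughly one positive in 24 proteins, which shows how imbalanced the classification task is and why an F1 of 0.52 is a substantial improvement.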

Click here for the full thesis.


Simulating ice nucleating proteins.

In this project, Molecular Dynamics simulations were used to find a stable ice nucleating protein. PyRosetta was used to iterate over different configurations of a backbone and specific motifs. The abstract and report can be found below.

Abstract

Ice binding proteins have many applications, ranging from snow cannons to microvalves in microtubing. However, these proteins are currently impossible to synthesize. Specific bacteria can transport these proteins on their membrane, but creating a pure ice binding protein is impossible. This is because the proteins contain large amounts of β-helices, which are very unstable in synthesis. In this project, ice binding proteins are synthesized on a much more stable backbone, α-helices. An ice binding motif (TxxxAxxxAxx)n is bound to the α-helix and is therefore far easier to stabilize. These proteins can then be synthesized and used in this wide range of applications, without the hassle of dealing with living bacteria. A stable structure containing the ice binding motif was found during the design phase. Multiple configurations were found with low score values. The coil radius was found to be the most impactful parameter, while the twist and the phase of the protein seem to have less effect. Three local minima were found for the radius: 5.2-6 Å, 6.3-6.9 Å and 6.9-7.5 Å. The larger radii decreased the attraction between the three chains. At the same time, larger, more hydrophobic amino acids could be sampled in the core, resulting in a lower score. However, the fold and dock step shows that the likelihood of creating this structure from scratch is very low. Three different sequences were examined; all resulted in a slightly different structure with RMSD = 1.5 Å. For all three, another, completely different structure was present at RMSD = 8 or 10 Å. This indicates that the probability of the desired structure forming in the lab is very low. In the future, a broader range of input parameters must be taken into account. These could include a higher number of chains, fixing hydrogen bonds in the core or other parameters.
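
The RMSD values quoted above measure how far a predicted structure lies from the designed one. As a minimal sketch of the quantity itself, the function below computes the root-mean-square deviation between two equally long lists of atom coordinates. It assumes the structures are already superimposed and does no alignment, whereas structure-prediction tools normally superimpose first. The coordinates and names are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation (same unit as the input, e.g. Å).

    Assumes both structures list the same atoms in the same order and are
    already superimposed; no rotation or translation fitting is performed.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Hypothetical four-atom trace; only the last atom is displaced, by 3 Å.
designed = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
predicted = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 3.0, 0.0)]

value = rmsd(designed, predicted)
print(f"RMSD = {value:.1f} Å")  # sqrt(9 / 4) = 1.5
```

Because the deviation is averaged over all atoms, an RMSD of 1.5 Å can reflect small differences spread over the whole structure, while values of 8 to 10 Å indicate a different fold.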

Click here for the full report.

Dynamic modelling of PSA trajectories to personalise follow-up care for prostate cancer patients

Characterizing HEK293 metabolism for varying growth media using metabolic modelling

Phages in different thermal stages. A machine learning study on thermal stability and diversity of bacteriophage protein structures using AlphaFold.

The Effect of the Stoichiometry on the 3D-structure of C3Ms

Academic Consultancy Training: Biomarker testing through lab-on-a-chip technology.

Effect of bacterial inoculation on gene expression in Arabidopsis thaliana.

Bonds and Breakups, a stressful model. Modeling Cell Adhesion to Extra-cellular Matrix with Focal Adhesion Complexes

Simulating ice nucleating proteins.

The perfect fantasy cycling team.

Design and implementation of a SQL database for startup Urban Funghi