Blood plasma proteins maintain physiological and chemical homeostasis in the human body. Isolated from plasma donations, plasma proteins are necessary to treat conditions from immunodeficiencies to rare diseases.1,2 However, on scale, these proteins are isolated from laborious multi-step cold ethanol fractionation processes, resulting in low product recoveries and large volumes of waste.2
Alternative purification methods like affinity chromatography (AC), are being explored to improve purification efficiency, to meet the rising demand for plasma protein therapies.4 However, challenges in obtaining peptide-affinity ligands with high specificity and selectivity towards targeted plasma proteins must be addressed to make purification processes more cost-effective and streamlined.
The current strategy for compound discovery relies on resource-intensive ‘trial-and-error’ methods requiring extensive synthetic endeavour to identify small numbers of ‘hits’ within large arrays of low-quality and low-quantity ligands.6 Alternatively, machine learning (ML) techniques can be used to accurately predict the properties of ligands, expedite selection of ligands with requisite functionality, and design bespoke ligands.7,8
A ML model is being developed to predict binding affinities of peptide-affinity ligand candidates for their use in AC towards the isolation of targeted plasma proteins. A dataset of varying ligands and their binding affinities has been collated from literature, open-sourced data bases, and results from novel experimental data. This dataset has been used to train, test, and validate several ML models based on widely used fingerprint descriptors that represent the chemical structure and information on bond types and atom connectivity.
This presentation will highlight the key challenges faced in the development of a robust ML model, ensuring comparability across reported and currently collected datasets, and overcoming difficulties relating to the synthesis, purification, and analysis of peptide-ligand candidates. This industry-aligned project will facilitate maximal therapeutic yields from donations and increase treatment availability for patients who depend on them for survival.