Improving Viral Manufacture through Advanced PAT · 2019-11-28 · Improving Viral Manufacture...

Improving Viral Manufacture through Advanced PAT

Dr Michael DelahayeLead Scientist – Industrialisation

AMC October 2019

The Cell and Gene Therapy Catapult

£70m Development Facility

• 1,200m2 Custom designed cell and gene therapy development facility

• Prime location in the heart of the London clinical research cluster

• >200 permanent staff

£55m large scale manufacture center

• 7,200m2 manufacturing centre designed specifically for cell and gene therapies

• Located in the Stevenage biocatalyst

• 12 independent manufacturing modules

Artificial Intelligence

3

ARTIFICIALIntelligence

Predictive analytics

Deep learning

Digital twins

Machine Learning

Translation

Data extraction

Natural Language processing

Image recognition

Machine vision

Diagnostics

Autonomousmachines

Planning & optimisation

AI application – cell therapy

4

In 2016 Google Deepmind health began collaborating with Moorfields eye hospital

Develop an AI platform which can analyse OCT images and diagnose the early stages of 50 eye disease and recommend treatment

Has a 94% accuracy rate (comparable to clinician with 20 years experience)

Also explains how it arrives at it recommendations to clinicians

In 2018 the FDA approved the marketing of the first medical device that uses artificial intelligence to detect diabetic retinopathy (DR) in adults with diabetes

Uses a deep learning system to analyse images and identify DR with an 89.5% accuracy.

Microsoft in collaboration with MIT have developed 2 systems which use machine learning to design guide RNA sequences for CRISPR based gene editing.

Elevation – predicts off target effects and provides a score for how likely a guide is to disrupt the cell

Azimuth - predicts which part of a gene to target for optimal KO

AI and big data

5

• AI works best when large amounts of rich, big data are available. The more facets the data covers, the faster the algorithms can learn and fine-tune their predictive analyses.

• Big Data is the raw input – it needs to be cleaned, structured and integrated before it becomes useful,

• Artificial intelligence is the output, the intelligence that results from the processed data.

• To solve a specific set of problems, AI needs to be trained properly in order to build intelligence and allow systems to be validated

• IDx – 495,000 images to train and validate the AI system for identifying diabetic retinopathy

Data generation – biopharmaceutical production

6

Upstream FormulationDownstream

Process and material insightsActionable data analytics>500 QC entries

>2000 batch record entries

>500 million continuous data point

Upstream FormulationDownstream

Data driven processing

Data driven decision making

Bus

ines

s V

alue

Complexity

Hindsight

InsightForesight

Descriptive AnalyticsWhat happened?

Diagnostic AnalyticsWhy did it happen?

Predictive AnalyticsWhat will happen?

Prescriptive AnalyticsHow can we make it

happen?

Data driven gene therapy bioprocessing

9

• Temp• pH• DO• Raman spectroscopy• RI spectroscopy

Multivariateanalysis

100’s samples

• 2 production processes• 12 production runs

• ~10 million data point

• Metabolomics• Transcriptomics

Cell metabolism

10

Glycine

Cysteinyl disulfide

Cysteinyl glycine

Cysteinyl disulphide

Cysteinyl glycine disulfide

Cystathionine

Cysteine

Serine

Isoleucine

.γ -glutamyl serine

Cysteine sulfinic acid

Lipoic acidGlucose

Pyruvate

Acetyl CoA

Mevalonate N-butyryl isoleucine

Cystathionine

Citrate

.α -ketoglutarate

SuccinateFumarate

Malate

.α -lipoate

TCA CycleCholosterol

N-aspartyl glutamic acid

N-acetyl aspartate

N-acetyl glutamate

N-acetyl glutamine

Histidine N-acetyl histidine

4-imadazoleacetate

Imadizole lactate

.γ -glutamyl glutamine

.α -ketoglutaramate

3-hydroxyisobutyrate

N-acetyl valine

.γ -glutamyl methionine

Xanthine Purine Metabolism

Aspartate

Proline

Glutamate

Aconitate

Glutamine

Valine

Methionine

Betaine

Homocystine

.γ -glutamyl valine

Alanine

Lactate

Isoleucine

Trytophan

lysine

Leucine

Carboxyethyltyrosine

Tyrosine

Thymine

Thymidine

DNA Synthesis

Petrin

Phenyl-pyruvate

4-hydroxyphenylpyruvate

1-carboxyethylphenylalanineArginosuccinate

Phenylalanine

Aspartate

Asparagine

2-aminoadipate

Methyl-lysine

Aminoadipic acid

Pipecolic acid

N-butyryl-leucine

.γ -glutamyl Isoleucine

N-carbomylalanine

Cholesterol

Glycerophosphoserine

Spermidine

Membrane formation

Glycerophosphocholine

Phosphocholine

Choline

Phosphate

Phosphoethanolamine

N-acetylputrescine

Putrescine

Cadaverine

N-acetyl-cadaverine

Process 1

Process 2

Supervised learning: model performance and optimisation

11

All data (144 samples, 300 metabolites)

Training (70%) Test (30%)

Model performance

𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴 =𝑁𝑁𝑁𝑁. 𝑁𝑁𝑜𝑜 𝐴𝐴𝑁𝑁𝐴𝐴𝐴𝐴𝑐𝑐𝐴𝐴𝑐𝑐 𝑝𝑝𝐴𝐴𝑐𝑐𝑝𝑝𝑝𝑝𝐴𝐴𝑐𝑐𝑝𝑝𝑁𝑁𝑝𝑝𝑝𝑝𝑇𝑇𝑁𝑁𝑐𝑐𝐴𝐴𝑇𝑇 𝑝𝑝𝑁𝑁. 𝑁𝑁𝑜𝑜 𝑝𝑝𝐴𝐴𝑐𝑐𝑝𝑝𝑝𝑝𝐴𝐴𝑐𝑐𝑝𝑝𝑁𝑁𝑝𝑝𝑝𝑝

• Model performance measured on “test” data to assess ability to generalise (stop overfitting)

• Model parameters estimated using “training” data

x10

Model optimisation usingk-fold cross validation Parameter Optimisation

• Model parameter optimisation carried out using repeated k-fold cross validation

• Provides most mileage on training data• Accuracy averaged across 3 repeats and

10-folds

Cross Validation - Training Cv- Test

Machine learning algorithms

12

What to measure –• Application of machine learning to select markers for process monitoring• Combined with metabolomics to incorporate biological function

Marker Function

1 Proline Amino acid 812 Glucose Glycolosis 753 Hydroxyphenyl lactate lactic acid 714 Asparegine Nitrogen donor 675 Histidine Amino acid 666 Spermine Cholesterol metabolism 647 Malate TCA cycle by-product 568 Glutamine Amino acid 529 Glycine Formation of Serine 47

10 Lactate Glycolosis by-product 4611 Serine Amino acid 4412 Pyruvate Glycolosis 33

Variable importance (%)


Bus

ines

s V

alue

Complexity

Hindsight

InsightForesight





happen?

Process Analytical Technologies (PAT)

14

PAT is a framework for

“designing, analysing, and controlling manufacturing through timely measurements (i.e., during processing) of critical quality and performance attributes of raw and in-process materials and processes, with the goal of ensuring final product quality”

The aim of PAT is to obtain better process control by

• identifying and managing sources of variability,

• reducing cost by optimising the use of raw materials

• minimising product cycle times through the use of measurements that are • in-line (analysed in place),

• on-line (sample removed, analysed and returned to the process stream)

• at-line (sample removed and analysed close to the process stream)

FDA - PAT—a framework for innovative pharmaceutical development, manufacturing and quality assurance. 2004

Real time data acquisition

Raman SpectroscopyRaman Spectroscopy is a technique used to observe molecular vibrations that can identify and quantitate molecules

By measuring changes in the wavelength of laser light its possible to identify what molecules are present in the cell culture media

• Spectra collected at regular intervals.

• Matched to nearest offline / at-line sampling time points.

• Pre-processed using a number of algorithms.

• Modelling of at-line and offline data performed using multivariate chemometric methods such as projection to latent structures (PLS) and artificial neural networks.

-1

0

1

2

3

4

5

0 1 2 3 4 5 6 7 8 9 10 11 12

glucose

-1

0

1

2

3

4

0 1 2 3 4 5 6 7 8 9 10 11 12

lactate

00.20.40.60.8

11.21.4

0 1 2 3 4 5 6 7 8 9 10 11 12

glutamine

0

20

40

60

80

100

0 1 2 3 4 5 6 7 8 9 10 11 12

ammonia

days days

conc

entr

atio

n (g

/L)

conc

entr

atio

n (g

/L)

days days

conc

entr

atio

n (g

/L)

conc

entr

atio

n (m

g/L)

Raman spectroscopy

Process run 1Process run 2Process run 3Process run 4Raman Data

-1

0

1

2

3

4

5

0 1 2 3 4 5 6 7 8 9 10 11 12

glucose

Offline Data

Raman model development

17

Run 1Run 2Run 3Run 4Run 5Run 6Run 7Run 8Run 9Run 10Run 11Run 12

Metabolomics Data Raman spectroscopy

Random selection of ~20% of the data to allow Raman model development


Run 1Run 2Run 3Run 4Run 5Run 6Run 7Run 8Run 9Run 10Run 11Run 12

Training data Test data Training data Test data

x50PLS regression modelling for each metabolite

Model building was performed using 5-fold cross-validation.


19

Example of model performance for main metabolic markers:

𝑅𝑅2 = 1 −𝑅𝑅𝑐𝑐𝑝𝑝𝑝𝑝𝑝𝑝𝐴𝐴𝐴𝐴𝑇𝑇 𝑆𝑆𝐴𝐴𝑆𝑆 𝑁𝑁𝑜𝑜 𝑆𝑆𝑆𝑆𝐴𝐴𝐴𝐴𝐴𝐴𝑐𝑐𝑝𝑝𝑇𝑇𝑁𝑁𝑐𝑐𝐴𝐴𝑇𝑇 𝑆𝑆𝐴𝐴𝑆𝑆 𝑁𝑁𝑜𝑜 𝑆𝑆𝑆𝑆𝐴𝐴𝐴𝐴𝐴𝐴𝑐𝑐𝑝𝑝

• ~55 metabolites have R2 >0.85.

• ~88 metabolites have R2 >0.80.

Analyte Raman R2 Raman SD

Glucose 0.8354 0.0842

Lactate 0.9166 0.0440

Glutamine 0.8330 0.0846

Glutamate 0.8490 0.0602


20

Performance of Raman models for marker selected using machine learning algorithm

Marker Function

1 Proline Amino acid 81 0.942 Glucose Glycolosis 75 0.843 Hydroxyphenyl lactate lactic acid 71 0.894 Asparegine Nitrogen donor 67 0.905 Histidine Amino acid 66 0.826 Spermine Cholesterol metabolism 64 0.587 Malate TCA cycle by-product 56 0.928 Glutamine Amino acid 52 0.839 Glycine Formation of Serine 47 0.92

10 Lactate Glycolosis by-product 46 0.9211 Serine Amino acid 44 0.8812 Pyruvate Glycolosis 33 0.87

Variable importance (%)

Raman Mean R2

Real time monitoring of viral titre

21


Bus

ines

s V

alue

Complexity

Hindsight

InsightForesight





happen?

Going forward

Device manager

Data Parser

Digital twin

Data Processing

Manual upload

Data store

AnthaHubfile watcher

Data acquisition and processing using Antha

24

Real time monitoring of metabolites

25

Real-time online monitoring

26

Future looking - In-silico digital twin to support process improvement

27

Targeted and reduced experimentation

Robust, optimised processes

Improved process understanding

Soft sensors and augmented reality

In-silico modelling, process simulation, CFD

Digital Twin

Acknowledgements

John ChurchwellOliver DewhirstIsobelle Evie Marco BaradezAnna Williams

Peter JonesNicholas Clarkson Laura McCloskeyLee DaviesRui SanchesThomas WilliamsRobert LawAnn Justino

Markus GershaterJames ArpinoJames RutleyAllen Shaw

Improving Viral Manufacture through Advanced PATThe Cell and Gene Therapy CatapultArtificial IntelligenceAI application – cell therapyAI and big dataData generation – biopharmaceutical productionData driven processingData driven decision makingData driven gene therapy bioprocessingCell metabolism Supervised learning: model performance and optimisation Machine learning algorithmsData driven decision makingProcess Analytical Technologies (PAT)Real time data acquisitionRaman spectroscopyRaman model developmentRaman model developmentRaman model developmentRaman model developmentReal time monitoring of viral titreData driven decision makingGoing forwardData acquisition and processing using AnthaReal time monitoring of metabolitesReal-time online monitoring Future looking - In-silico digital twin to support process improvementAcknowledgements

Improving Viral Manufacture through Advanced PAT · 2019-11-28 · Improving Viral Manufacture...

Documents

Transcript of Improving Viral Manufacture through Advanced PAT · 2019-11-28 · Improving Viral Manufacture...