SlideShare una empresa de Scribd logo
1 de 35
Novozymes Enzyme Stability Prediction
Help identify the thermostable mutations in enzymes
組員:11莊O禾、18郭O智、19林O泰、20張O豪
About Presentation
Content
About This
Project !
Execute
How To Do
This Project ?
Motivation
Why Are We Doing
This Project ?
Content
About This Project !
Competition Enzyme
Novozymes
Enzyme Stability
Prediction
Kaggle is biggest data science
competition web
The goal of this competition is
to predict the thermostability
of enzyme variants (Tm)
Total 1331 teams
4
5
What is Protein
A A U P
P U C
I
Primary structure
Secondary structure
Tertiary structure
Quaternary structure
A large group of
proteins
Can accelerating
chemical reactions
6
What is Enzyme
Can be a Biologic
Therapy
pH and Tm is
crucial factors
Motivation
Why Are We Doing This Project ?
Market Demand
Market
The global biologics therapy market was estimated at US$ 366.43 billion in 2021 and it is expected to hit
over US$ 719.84 billion by 2030 with a noteworthy CAGR of 7.8% from 2022 to 2030.
資料來源:precedenceresearch/Biologics Market
Biologic Therapy Global Market 2021-2030
2030
2021
Market growth will
ACCELERATE
at a CAGR of
Growth Contributed by
NORTH AMERICA
Incremental growth ($B) Growth for 2021
7.8% 57%
719.84 196.2%
Regulatory Approval
Clinical Trials
Pre-clinical
Drug discovery
3-5 years
Demand
1-2 years 6-7 years 1-2 years
least over 10 years
Regulatory Approval
Clinical Trials
Pre-clinical
Drug discovery
1-2 years 2-3 years
Traditional development
AI
AI development
AI
5 years
Execute
How To Do This Project ?
Process Scoring
Process
Input Trained Output
Protein sequence
X
protein melting point
VPVNPEPDATSVENVALKTGS
GDSQSDPIKADLEVKGQSALPF
DVDCWAILCKGAPN...
Models ŷTm
Models
12
XG Boost
Baese on ML, we use
different feature which
form protein sequence
Py Rosetta
Compare 3D structural
between wild type and
mutant type
The protein sequence
sees as “Natural
Language”
Protein BERT
Analysis protein 3D
structural to extract
protein’s features
3D-CNN
XG Boost Process
Training data
Kaggle
Training
Data cleaning
Outcome
Protein sequence
X
y Tm
Testing data
1. Drop pH > 9 & PH
< 5.5
2. embedding:
length
entropy
aaindex1
atc
aac
Model
ŷTm
protein melting point
Data cleaning
Length
Sequence length in
amino acids
Entropy
Shannon entropy for
each sequence in the
dataset
atc
sum of atomic and bond
compositions for each
amino acid sequence
pH
Drop pH > 10 & PH < 6
aaindex1
is a set of 20
numerical values
representing 566
physicochemical and
biological properties
of amino acids
aac
the frequency of
amino acids for each
sequence in the
dataset
Data Cleaning
Py Rosetta Process
Training data Training Outcome
Protein sequence
X
Testing data
1. Input data
2. Import model
3. Predict scores
PyRosetta
Test data
Single-Point
Mutation pdb
Energy
Score
Function
scores
Wild type sequence:
VPVNPEPDATSVENVALKTGSGDSQSDPIKADL
EVKGQSALPFDVDCWAILCKGAPN...
Mutant sequence:
VPVNPEPDATSVENVALKTGSGDSASDPIKADL
EVKGQSALPFDVDCWAILCKGAPN...
Protein BERT Process
Training data Training Outcome
Protein sequence
X
1. Input data 2. Import model
ŷTm
protein melting point
4. Predict protein melting point
seq_id tm
0 75.7
1 50.5
2 40.5
3 47.2
4 49.5
5 48.4
6 45.7
7 55.9
8 48.1
◍ Protein Sequence vs NLP
○ Meaningful sentence
○ Bio-language
◍ BERT, stronger NLP model
pretrained on two tasks
○ language modeling
○ next sentence prediction
◍ Use Transfer Learning
○ Less training time
○ Significantly improve the
efficiency of reinforcement
learning
NLP Model
Transfer Learning
◍ BERT vs Protein
○ protein sequence not have
“chunk”
○ protein have 3D structural are
more complex than sentences
◍ Import Gene Ontology (GO)
○ Define protein’s space distance in
organism
○ Separate global and local, then
process each
Protein BERT
Input protein
BERT module
carry out
Finetune
ranking test
data
Input pretrained module
Build Finetune module
Setting Finetune Parameter
參考:https://github.com/nadavbra/protein_bert/blob/master/ProteinBERT demo.ipynb
Protein BERT
3D-CNN Process
Training data Training Outcome
Protein sequence
X
2. 3D structural 3D-CNN
1. Input data
3. Import model
4. PredictΔΔG
seq_id ΔΔG
31390 1.404995
31391 1.343793
31392 0.241666
31393 0.534203
31394 0.134588
31395 0.697623
31396 1.346896
31397 1.001297
31398 1.083319
資料來源: Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks Bian Li,Yucheng T. Yang,John A. Capra ,Mark B. Gerstein
3D Convolutional Neural Networks
資料來源: https://www.youtube.com/watch?v=f0t-OCG79-U
Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)
資料來源: https://www.louisbouchard.ai/densenet-explained/
資料來源: https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53
3D Convolutional Neural Networks (CNN) Architecture
3D Convolutional Neural Networks (CNN) Architecture
ΔΔG
ΔG
資料來源:https://pubs.acs.org/doi/10.1021/ja100744h
0
0.1
0.2
0.3
0.4
0.5
0.6
XGBoost ProteinBERT Rosetta 3D-CNN
Score
0.494
0.471
0.168
Table of ranking Score
0.292
Process
Training Outcome
Ranking
table of ranking
Sorting
Protein BERT
3D-CNN
Training model
Ranking
Output
Data X
Data PR
Data PB
Data CN
Ensemble
𝛼1
𝛼2
𝛼3
𝛼4
Table of ranking Score
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
XGBoost ProteinBERT Rosetta 3D-CNN Ensemble
Score
0.592
0.494
0.471
0.292
0.168
Table of ranking Score
References
Kaggle:
 https://www.kaggle.com/code/cdeotte/protein-bert-finetune-lb-0-30
 https://www.kaggle.com/code/dschettler8845/novo-esp-residue-depth-and-more-w-biopython
 https://www.kaggle.com/code/vslaykovsky/nesp-thermonet
 https://www.kaggle.com/code/lucasmorin/nesp-changes-eda-and-baseline/notebook#Submission
ESM:
 https://www.pnas.org/doi/full/10.1073/pnas.2016239118
EVE:
 https://www.nature.com/articles/s41586-021-04043-8
Alphafold2:
 https://github.com/deepmind/alphafold
RoseTTAFold:
 https://github.com/RosettaCommons/RoseTTAFold
THE END
What is Enzyme
 酵素是一類大分子生物催化劑,大部分的酵
素是屬於蛋白質。能加快化學反應的速度,
在現代工業中,酵素可以取代化學品作為重
要的生產催化劑, 以少量的資源,創造更多
的產品,同時節省能源,並減少浪費、加速
生產過程。
 然而,酵素作用的溫度有著十分嚴苛的條件,
加熱時或與化學變性劑接觸時,酵素結構會
發生去摺疊(即變性),原有的結構被打亂,
活性也往往隨之喪失。這限制酵素場景使用
的廣泛程度。
 蛋白質的分子結構可劃分為四
級,以描述其不同的方面:
胺基酸 3-字母 1-字母
丙氨酸(Alanine) Ala A
精氨酸(Arginine) Arg R
天冬醯胺(Asparagine) Asn N
天冬氨酸(Aspartate) Asp D
半胱氨酸(Cysteine) Cys C
穀氨酸(Glutamic acid) Glu E
穀氨醯胺(Glutamine) Gln Q
甘氨酸(Glycine) Gly G
組氨酸(Histidine) His H
異亮氨酸(Isoleucine) Ile I
亮氨酸(Leucine) Leu L
賴氨酸(Lysine) Lys K
甲硫氨酸(Methionine) Met M
苯丙氨酸(Phenylalanine) Phe F
脯氨酸(Proline) Pro P
絲氨酸(Serine) Ser S
蘇氨酸(Threonine) Thr T
色氨酸(Tryptophan) Trp W
酪氨酸(Tyrosine) Tyr Y
纈氨酸(Valine) Val V
Spearman's rank
https://www.google.com/imgres?imgurl=https%3A%2F%2Fimg-
blog.csdnimg.cn%2F2019032717063990.png&imgrefurl=https%3A%2F%2Fblog.csdn.net%2Fgaifuxi9518%2Farticle%2Fdetails%2F88849283&tbnid=SH0Pg3IWoAOU
6M&vet=12ahUKEwj7t9Ker6_7AhWOAaYKHYVuCeAQMyhEegQIARBk..i&docid=JkOIIPPh0E6MzM&w=446&h=162&q=spearman%E7%9B%B8%E9%97%9C%E4%B
F%82%E6%95%B8&ved=2ahUKEwj7t9Ker6_7AhWOAaYKHYVuCeAQMyhEegQIARBk
https://www.tes.com/teaching-resource/spearman-s-rank-correlation-cie-a-level-biology-12411879
36

Más contenido relacionado

La actualidad más candente

VSCodeで作るPostgreSQL開発環境(第25回 PostgreSQLアンカンファレンス@オンライン 発表資料)
VSCodeで作るPostgreSQL開発環境(第25回 PostgreSQLアンカンファレンス@オンライン 発表資料)VSCodeで作るPostgreSQL開発環境(第25回 PostgreSQLアンカンファレンス@オンライン 発表資料)
VSCodeで作るPostgreSQL開発環境(第25回 PostgreSQLアンカンファレンス@オンライン 発表資料)NTT DATA Technology & Innovation
 
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~SSII
 
transformer解説~Chat-GPTの源流~
transformer解説~Chat-GPTの源流~transformer解説~Chat-GPTの源流~
transformer解説~Chat-GPTの源流~MasayoshiTsutsui
 
2015年度GPGPU実践プログラミング 第7回 総和計算
2015年度GPGPU実践プログラミング 第7回 総和計算2015年度GPGPU実践プログラミング 第7回 総和計算
2015年度GPGPU実践プログラミング 第7回 総和計算智啓 出川
 
SMOTE resampling method slides 02-19-2018
SMOTE resampling method slides 02-19-2018SMOTE resampling method slides 02-19-2018
SMOTE resampling method slides 02-19-2018Shuma Ishigami
 
いまさら聞けない!CUDA高速化入門
いまさら聞けない!CUDA高速化入門いまさら聞けない!CUDA高速化入門
いまさら聞けない!CUDA高速化入門Fixstars Corporation
 
フックを使ったPostgreSQLの拡張機能を作ってみよう!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
フックを使ったPostgreSQLの拡張機能を作ってみよう!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)フックを使ったPostgreSQLの拡張機能を作ってみよう!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
フックを使ったPostgreSQLの拡張機能を作ってみよう!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)NTT DATA Technology & Innovation
 
PostgreSQLでpg_bigmを使って日本語全文検索 (MySQLとPostgreSQLの日本語全文検索勉強会 発表資料)
PostgreSQLでpg_bigmを使って日本語全文検索 (MySQLとPostgreSQLの日本語全文検索勉強会 発表資料)PostgreSQLでpg_bigmを使って日本語全文検索 (MySQLとPostgreSQLの日本語全文検索勉強会 発表資料)
PostgreSQLでpg_bigmを使って日本語全文検索 (MySQLとPostgreSQLの日本語全文検索勉強会 発表資料)NTT DATA OSS Professional Services
 
CUDAプログラミング入門
CUDAプログラミング入門CUDAプログラミング入門
CUDAプログラミング入門NVIDIA Japan
 
200611material ozaki
200611material ozaki200611material ozaki
200611material ozakiRCCSRENKEI
 
グラフデータ分析 入門編
グラフデータ分析 入門編グラフデータ分析 入門編
グラフデータ分析 入門編順也 山口
 
Alpha fold 2
Alpha fold 2Alpha fold 2
Alpha fold 2Vishwas N
 
NVIDIA Modulus: Physics ML 開発のためのフレームワーク
NVIDIA Modulus: Physics ML 開発のためのフレームワークNVIDIA Modulus: Physics ML 開発のためのフレームワーク
NVIDIA Modulus: Physics ML 開発のためのフレームワークNVIDIA Japan
 
2015年度先端GPGPUシミュレーション工学特論 第15回 CPUとGPUの協調
2015年度先端GPGPUシミュレーション工学特論 第15回 CPUとGPUの協調2015年度先端GPGPUシミュレーション工学特論 第15回 CPUとGPUの協調
2015年度先端GPGPUシミュレーション工学特論 第15回 CPUとGPUの協調智啓 出川
 
PFN Summer Internship 2021 / Kohei Shinohara: Charge Transfer Modeling in Neu...
PFN Summer Internship 2021 / Kohei Shinohara: Charge Transfer Modeling in Neu...PFN Summer Internship 2021 / Kohei Shinohara: Charge Transfer Modeling in Neu...
PFN Summer Internship 2021 / Kohei Shinohara: Charge Transfer Modeling in Neu...Preferred Networks
 
A3Cという強化学習アルゴリズムで遊んでみた話
A3Cという強化学習アルゴリズムで遊んでみた話A3Cという強化学習アルゴリズムで遊んでみた話
A3Cという強化学習アルゴリズムで遊んでみた話mooopan
 
第9回 配信講義 計算科学技術特論B(2022)
 第9回 配信講義 計算科学技術特論B(2022) 第9回 配信講義 計算科学技術特論B(2022)
第9回 配信講義 計算科学技術特論B(2022)RCCSRENKEI
 
大規模言語モデルとChatGPT
大規模言語モデルとChatGPT大規模言語モデルとChatGPT
大規模言語モデルとChatGPTnlab_utokyo
 
【DL輪読会】事前学習用データセットについて
【DL輪読会】事前学習用データセットについて【DL輪読会】事前学習用データセットについて
【DL輪読会】事前学習用データセットについてDeep Learning JP
 
PostgreSQL - C言語によるユーザ定義関数の作り方
PostgreSQL - C言語によるユーザ定義関数の作り方PostgreSQL - C言語によるユーザ定義関数の作り方
PostgreSQL - C言語によるユーザ定義関数の作り方Satoshi Nagayasu
 

La actualidad más candente (20)

VSCodeで作るPostgreSQL開発環境(第25回 PostgreSQLアンカンファレンス@オンライン 発表資料)
VSCodeで作るPostgreSQL開発環境(第25回 PostgreSQLアンカンファレンス@オンライン 発表資料)VSCodeで作るPostgreSQL開発環境(第25回 PostgreSQLアンカンファレンス@オンライン 発表資料)
VSCodeで作るPostgreSQL開発環境(第25回 PostgreSQLアンカンファレンス@オンライン 発表資料)
 
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
SSII2019OS: 深層学習にかかる時間を短くしてみませんか? ~分散学習の勧め~
 
transformer解説~Chat-GPTの源流~
transformer解説~Chat-GPTの源流~transformer解説~Chat-GPTの源流~
transformer解説~Chat-GPTの源流~
 
2015年度GPGPU実践プログラミング 第7回 総和計算
2015年度GPGPU実践プログラミング 第7回 総和計算2015年度GPGPU実践プログラミング 第7回 総和計算
2015年度GPGPU実践プログラミング 第7回 総和計算
 
SMOTE resampling method slides 02-19-2018
SMOTE resampling method slides 02-19-2018SMOTE resampling method slides 02-19-2018
SMOTE resampling method slides 02-19-2018
 
いまさら聞けない!CUDA高速化入門
いまさら聞けない!CUDA高速化入門いまさら聞けない!CUDA高速化入門
いまさら聞けない!CUDA高速化入門
 
フックを使ったPostgreSQLの拡張機能を作ってみよう!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
フックを使ったPostgreSQLの拡張機能を作ってみよう!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)フックを使ったPostgreSQLの拡張機能を作ってみよう!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
フックを使ったPostgreSQLの拡張機能を作ってみよう!(第33回PostgreSQLアンカンファレンス@オンライン 発表資料)
 
PostgreSQLでpg_bigmを使って日本語全文検索 (MySQLとPostgreSQLの日本語全文検索勉強会 発表資料)
PostgreSQLでpg_bigmを使って日本語全文検索 (MySQLとPostgreSQLの日本語全文検索勉強会 発表資料)PostgreSQLでpg_bigmを使って日本語全文検索 (MySQLとPostgreSQLの日本語全文検索勉強会 発表資料)
PostgreSQLでpg_bigmを使って日本語全文検索 (MySQLとPostgreSQLの日本語全文検索勉強会 発表資料)
 
CUDAプログラミング入門
CUDAプログラミング入門CUDAプログラミング入門
CUDAプログラミング入門
 
200611material ozaki
200611material ozaki200611material ozaki
200611material ozaki
 
グラフデータ分析 入門編
グラフデータ分析 入門編グラフデータ分析 入門編
グラフデータ分析 入門編
 
Alpha fold 2
Alpha fold 2Alpha fold 2
Alpha fold 2
 
NVIDIA Modulus: Physics ML 開発のためのフレームワーク
NVIDIA Modulus: Physics ML 開発のためのフレームワークNVIDIA Modulus: Physics ML 開発のためのフレームワーク
NVIDIA Modulus: Physics ML 開発のためのフレームワーク
 
2015年度先端GPGPUシミュレーション工学特論 第15回 CPUとGPUの協調
2015年度先端GPGPUシミュレーション工学特論 第15回 CPUとGPUの協調2015年度先端GPGPUシミュレーション工学特論 第15回 CPUとGPUの協調
2015年度先端GPGPUシミュレーション工学特論 第15回 CPUとGPUの協調
 
PFN Summer Internship 2021 / Kohei Shinohara: Charge Transfer Modeling in Neu...
PFN Summer Internship 2021 / Kohei Shinohara: Charge Transfer Modeling in Neu...PFN Summer Internship 2021 / Kohei Shinohara: Charge Transfer Modeling in Neu...
PFN Summer Internship 2021 / Kohei Shinohara: Charge Transfer Modeling in Neu...
 
A3Cという強化学習アルゴリズムで遊んでみた話
A3Cという強化学習アルゴリズムで遊んでみた話A3Cという強化学習アルゴリズムで遊んでみた話
A3Cという強化学習アルゴリズムで遊んでみた話
 
第9回 配信講義 計算科学技術特論B(2022)
 第9回 配信講義 計算科学技術特論B(2022) 第9回 配信講義 計算科学技術特論B(2022)
第9回 配信講義 計算科学技術特論B(2022)
 
大規模言語モデルとChatGPT
大規模言語モデルとChatGPT大規模言語モデルとChatGPT
大規模言語モデルとChatGPT
 
【DL輪読会】事前学習用データセットについて
【DL輪読会】事前学習用データセットについて【DL輪読会】事前学習用データセットについて
【DL輪読会】事前学習用データセットについて
 
PostgreSQL - C言語によるユーザ定義関数の作り方
PostgreSQL - C言語によるユーザ定義関数の作り方PostgreSQL - C言語によるユーザ定義関数の作り方
PostgreSQL - C言語によるユーザ定義関数の作り方
 

Similar a Novozymes Enzyme Stability Prediction

Auxin proteo controller
Auxin proteo controllerAuxin proteo controller
Auxin proteo controllerYumi Saiki
 
Protein Qualitative Analysis Services
Protein Qualitative Analysis ServicesProtein Qualitative Analysis Services
Protein Qualitative Analysis ServicesCreative Proteomics
 
Functional annotation
Functional annotationFunctional annotation
Functional annotationRavi Gandham
 
Eradicating diseases (genome)
Eradicating diseases (genome)Eradicating diseases (genome)
Eradicating diseases (genome)Utkarsh Verma
 
Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Andrei KUCHARAVY
 
Multi Target Bioactivity Models in Pipeline Pilot
Multi Target Bioactivity Models in Pipeline PilotMulti Target Bioactivity Models in Pipeline Pilot
Multi Target Bioactivity Models in Pipeline PilotGerard van Westen
 
Recent trends in bioinformatics
Recent trends in bioinformaticsRecent trends in bioinformatics
Recent trends in bioinformaticsZeeshan Hanjra
 
Prediction of proteins for insecticidal activity using python toolkit iFeature
Prediction of proteins for insecticidal activity using python toolkit iFeaturePrediction of proteins for insecticidal activity using python toolkit iFeature
Prediction of proteins for insecticidal activity using python toolkit iFeatureKarnam Vasudeva Rao, PhD
 
A next generation sequencing based sample-to-result pharmacogenomics research...
A next generation sequencing based sample-to-result pharmacogenomics research...A next generation sequencing based sample-to-result pharmacogenomics research...
A next generation sequencing based sample-to-result pharmacogenomics research...Thermo Fisher Scientific
 
Proteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeProteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeSumanthBT1
 
dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...
dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...
dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...dkNET
 
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics WorkshopLopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics WorkshopNuria Lopez-Bigas
 
TheUniProtKBpptx__2022_03_30_13_07_41.pptx
TheUniProtKBpptx__2022_03_30_13_07_41.pptxTheUniProtKBpptx__2022_03_30_13_07_41.pptx
TheUniProtKBpptx__2022_03_30_13_07_41.pptxPRIYANKAZALA9
 
Joe parker-benchmarking-bioinformatics
Joe parker-benchmarking-bioinformaticsJoe parker-benchmarking-bioinformatics
Joe parker-benchmarking-bioinformaticsJoe Parker
 
Pcr plus and predictor 2013
Pcr plus and predictor 2013Pcr plus and predictor 2013
Pcr plus and predictor 2013Elsa von Licy
 

Similar a Novozymes Enzyme Stability Prediction (20)

Auxin proteo controller
Auxin proteo controllerAuxin proteo controller
Auxin proteo controller
 
Protein Qualitative Analysis Services
Protein Qualitative Analysis ServicesProtein Qualitative Analysis Services
Protein Qualitative Analysis Services
 
Functional annotation
Functional annotationFunctional annotation
Functional annotation
 
Biopharmaceutical
Biopharmaceutical Biopharmaceutical
Biopharmaceutical
 
Eradicating diseases (genome)
Eradicating diseases (genome)Eradicating diseases (genome)
Eradicating diseases (genome)
 
Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...Systems biology in polypharmacology: explaining and predicting drug secondary...
Systems biology in polypharmacology: explaining and predicting drug secondary...
 
Multi Target Bioactivity Models in Pipeline Pilot
Multi Target Bioactivity Models in Pipeline PilotMulti Target Bioactivity Models in Pipeline Pilot
Multi Target Bioactivity Models in Pipeline Pilot
 
Biopharma VS Small Molecules Therapeutic
Biopharma VS Small Molecules TherapeuticBiopharma VS Small Molecules Therapeutic
Biopharma VS Small Molecules Therapeutic
 
Recent trends in bioinformatics
Recent trends in bioinformaticsRecent trends in bioinformatics
Recent trends in bioinformatics
 
Prediction of proteins for insecticidal activity using python toolkit iFeature
Prediction of proteins for insecticidal activity using python toolkit iFeaturePrediction of proteins for insecticidal activity using python toolkit iFeature
Prediction of proteins for insecticidal activity using python toolkit iFeature
 
Group b
Group bGroup b
Group b
 
A next generation sequencing based sample-to-result pharmacogenomics research...
A next generation sequencing based sample-to-result pharmacogenomics research...A next generation sequencing based sample-to-result pharmacogenomics research...
A next generation sequencing based sample-to-result pharmacogenomics research...
 
Multiplex Assays for Studying Gene Regulation and Cell Function
Multiplex Assays for Studying Gene Regulation and Cell FunctionMultiplex Assays for Studying Gene Regulation and Cell Function
Multiplex Assays for Studying Gene Regulation and Cell Function
 
2014 11 03_bioinformatics_case_studies
2014 11 03_bioinformatics_case_studies2014 11 03_bioinformatics_case_studies
2014 11 03_bioinformatics_case_studies
 
Proteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programmeProteomics in VSC for crop improvement programme
Proteomics in VSC for crop improvement programme
 
dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...
dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...
dkNET Webinar: Multi-Omics Data Integration for Phenotype Prediction of Type-...
 
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics WorkshopLopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
Lopez-Bigas talk at the EBI/EMBL Cancer Genomics Workshop
 
TheUniProtKBpptx__2022_03_30_13_07_41.pptx
TheUniProtKBpptx__2022_03_30_13_07_41.pptxTheUniProtKBpptx__2022_03_30_13_07_41.pptx
TheUniProtKBpptx__2022_03_30_13_07_41.pptx
 
Joe parker-benchmarking-bioinformatics
Joe parker-benchmarking-bioinformaticsJoe parker-benchmarking-bioinformatics
Joe parker-benchmarking-bioinformatics
 
Pcr plus and predictor 2013
Pcr plus and predictor 2013Pcr plus and predictor 2013
Pcr plus and predictor 2013
 

Más de IttrainingIttraining

【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院IttrainingIttraining
 
商品辨識定位系統_艾鍗學院-AIoT智能行動服務物聯網班
商品辨識定位系統_艾鍗學院-AIoT智能行動服務物聯網班商品辨識定位系統_艾鍗學院-AIoT智能行動服務物聯網班
商品辨識定位系統_艾鍗學院-AIoT智能行動服務物聯網班IttrainingIttraining
 
居家雲端照護系統_艾鍗學院-AIoT智能行動服務物聯網班
居家雲端照護系統_艾鍗學院-AIoT智能行動服務物聯網班居家雲端照護系統_艾鍗學院-AIoT智能行動服務物聯網班
居家雲端照護系統_艾鍗學院-AIoT智能行動服務物聯網班IttrainingIttraining
 
AIoT 智能商店_艾鍗學院-AIoT智能行動服務物聯網班
AIoT 智能商店_艾鍗學院-AIoT智能行動服務物聯網班AIoT 智能商店_艾鍗學院-AIoT智能行動服務物聯網班
AIoT 智能商店_艾鍗學院-AIoT智能行動服務物聯網班IttrainingIttraining
 
04_動物姿態識別Pet pose classification
04_動物姿態識別Pet pose classification04_動物姿態識別Pet pose classification
04_動物姿態識別Pet pose classificationIttrainingIttraining
 
01_智慧英文評分系統 Evaluate Student Summaries
01_智慧英文評分系統 Evaluate Student Summaries01_智慧英文評分系統 Evaluate Student Summaries
01_智慧英文評分系統 Evaluate Student SummariesIttrainingIttraining
 
03_AI旅宿訂房系統-AI智慧訂房方案
03_AI旅宿訂房系統-AI智慧訂房方案03_AI旅宿訂房系統-AI智慧訂房方案
03_AI旅宿訂房系統-AI智慧訂房方案IttrainingIttraining
 
02_永豐AI GO競賽攻房戰 Dr. House Team
02_永豐AI GO競賽攻房戰 Dr. House Team02_永豐AI GO競賽攻房戰 Dr. House Team
02_永豐AI GO競賽攻房戰 Dr. House TeamIttrainingIttraining
 
05_關鍵字提取與廣告文稿評分系統_以化妝品產業為例
05_關鍵字提取與廣告文稿評分系統_以化妝品產業為例05_關鍵字提取與廣告文稿評分系統_以化妝品產業為例
05_關鍵字提取與廣告文稿評分系統_以化妝品產業為例IttrainingIttraining
 
旅遊推薦系統Tourist Line Spot
旅遊推薦系統Tourist Line Spot旅遊推薦系統Tourist Line Spot
旅遊推薦系統Tourist Line SpotIttrainingIttraining
 
APCS程式設計實作題(Python解題)-物品堆疊/邏輯運算子
APCS程式設計實作題(Python解題)-物品堆疊/邏輯運算子APCS程式設計實作題(Python解題)-物品堆疊/邏輯運算子
APCS程式設計實作題(Python解題)-物品堆疊/邏輯運算子IttrainingIttraining
 
APCS程式設計實作題(Python解題)-解題技巧/特殊編碼/完全奇數
APCS程式設計實作題(Python解題)-解題技巧/特殊編碼/完全奇數APCS程式設計實作題(Python解題)-解題技巧/特殊編碼/完全奇數
APCS程式設計實作題(Python解題)-解題技巧/特殊編碼/完全奇數IttrainingIttraining
 

Más de IttrainingIttraining (20)

【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
 
商品辨識定位系統_艾鍗學院-AIoT智能行動服務物聯網班
商品辨識定位系統_艾鍗學院-AIoT智能行動服務物聯網班商品辨識定位系統_艾鍗學院-AIoT智能行動服務物聯網班
商品辨識定位系統_艾鍗學院-AIoT智能行動服務物聯網班
 
居家雲端照護系統_艾鍗學院-AIoT智能行動服務物聯網班
居家雲端照護系統_艾鍗學院-AIoT智能行動服務物聯網班居家雲端照護系統_艾鍗學院-AIoT智能行動服務物聯網班
居家雲端照護系統_艾鍗學院-AIoT智能行動服務物聯網班
 
AIoT 智能商店_艾鍗學院-AIoT智能行動服務物聯網班
AIoT 智能商店_艾鍗學院-AIoT智能行動服務物聯網班AIoT 智能商店_艾鍗學院-AIoT智能行動服務物聯網班
AIoT 智能商店_艾鍗學院-AIoT智能行動服務物聯網班
 
04_動物姿態識別Pet pose classification
04_動物姿態識別Pet pose classification04_動物姿態識別Pet pose classification
04_動物姿態識別Pet pose classification
 
01_智慧英文評分系統 Evaluate Student Summaries
01_智慧英文評分系統 Evaluate Student Summaries01_智慧英文評分系統 Evaluate Student Summaries
01_智慧英文評分系統 Evaluate Student Summaries
 
03_AI旅宿訂房系統-AI智慧訂房方案
03_AI旅宿訂房系統-AI智慧訂房方案03_AI旅宿訂房系統-AI智慧訂房方案
03_AI旅宿訂房系統-AI智慧訂房方案
 
02_永豐AI GO競賽攻房戰 Dr. House Team
02_永豐AI GO競賽攻房戰 Dr. House Team02_永豐AI GO競賽攻房戰 Dr. House Team
02_永豐AI GO競賽攻房戰 Dr. House Team
 
05_關鍵字提取與廣告文稿評分系統_以化妝品產業為例
05_關鍵字提取與廣告文稿評分系統_以化妝品產業為例05_關鍵字提取與廣告文稿評分系統_以化妝品產業為例
05_關鍵字提取與廣告文稿評分系統_以化妝品產業為例
 
旅遊推薦系統Tourist Line Spot
旅遊推薦系統Tourist Line Spot旅遊推薦系統Tourist Line Spot
旅遊推薦系統Tourist Line Spot
 
無人商店Unmanned Store
無人商店Unmanned Store無人商店Unmanned Store
無人商店Unmanned Store
 
物件辨識攝影機
物件辨識攝影機物件辨識攝影機
物件辨識攝影機
 
ChatBot問答機器人
ChatBot問答機器人ChatBot問答機器人
ChatBot問答機器人
 
街景門牌實價登錄
街景門牌實價登錄街景門牌實價登錄
街景門牌實價登錄
 
APCS程式設計實作題(Python解題)-物品堆疊/邏輯運算子
APCS程式設計實作題(Python解題)-物品堆疊/邏輯運算子APCS程式設計實作題(Python解題)-物品堆疊/邏輯運算子
APCS程式設計實作題(Python解題)-物品堆疊/邏輯運算子
 
APCS程式設計實作題(Python解題)-解題技巧/特殊編碼/完全奇數
APCS程式設計實作題(Python解題)-解題技巧/特殊編碼/完全奇數APCS程式設計實作題(Python解題)-解題技巧/特殊編碼/完全奇數
APCS程式設計實作題(Python解題)-解題技巧/特殊編碼/完全奇數
 
物聯智慧杯墊 Smart Cup Pad
物聯智慧杯墊 Smart Cup Pad物聯智慧杯墊 Smart Cup Pad
物聯智慧杯墊 Smart Cup Pad
 
口罩偵測機 Mask-detection
口罩偵測機 Mask-detection口罩偵測機 Mask-detection
口罩偵測機 Mask-detection
 
居家智能LED調光燈
居家智能LED調光燈居家智能LED調光燈
居家智能LED調光燈
 
智能居家監控Smart House
智能居家監控Smart House智能居家監控Smart House
智能居家監控Smart House
 

Último

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Último (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Novozymes Enzyme Stability Prediction

Notas del editor

  1. 先在大型設備進行pretraing建立模型,再由已建立的模型在自己的資料集執行finetue 早期NLP只做文字的的單向說明,會造成詞不達意。BERT會考量上下文,再決定持定文字所要表達的意義 將已經在一個特定資料集上訓練好的模型,在其他的資料集進行訓練
  2. 蛋白質的長度比句子的變化更大,會有超過數萬個氨基酸 由於它們的 3D 結構,所以字母前後結構的影響,比一般字句還長 定義全蛋白質在所有生物體的蛋白質空間上的內容 將全局(Global)和局部(Local)分開處理
  3. '疏水','芳香','h鍵受體','h鍵供體','正電離','負電離','佔用'
  4. 資料來源: https://www.louisbouchard.ai/densenet-explained/ 資料來源: https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53
  5. 資料來源: https://journals.plos.org/ploscompbiol/article/figure?id=10.1371/journal.pcbi.1006514.g001 https://www.youtube.com/watch?v=_v-7gZUWjmQ
  6. https://pubs.acs.org/doi/10.1021/ja100744h