Product Details
SoluProt: prediction of soluble protein expression in Escherichia coli
Created: 2021
Czech title
SoluProt: predikce rozpustné exprese proteinů v Escherichia coli
Type
software
License
optional - free
Authors
Hon Jiří, Ing., Ph.D. (DIFS FIT BUT)
Marušiak Martin, Ing. (FIT BUT)
Martínek Tomáš, doc. Ing., Ph.D. (DCSY FIT BUT)
Kunka Antonín, Mgr., Ph.D. (LL)
Zendulka Jaroslav, doc. Ing., CSc. (DIFS FIT BUT)
Bednář David, Mgr. (LL)
Damborský Jiří, prof. Mgr., Dr. (LL)
Keywords
protein solubility, machine learning
Description
SoluProt is a new tool for sequence-based prediction of soluble protein expression in Escherichia coli. It was built using the gradient boosting machine technique, with the TargetTrack database as the training set. When evaluated against a balanced independent test set derived from the NESG database, SoluProt's accuracy of 58.4% and AUC of 0.60 exceeded those of a suite of alternative solubility prediction tools. There is also evidence that it could significantly increase the success rate of experimental protein studies. SoluProt is freely available as a standalone program and a user-friendly web server at https://loschmidt.chemi.muni.cz/soluprot/.
Location
Projects
Application of AI methods to cyber security and control systems (FIT-S-20-6293)