Publikationen

Hier finden Sie von Know-Center MitarbeiterInnen verfasste wissenschaftliche Publikationen

2019

Lovric Mario, Žuvela Petar, Kern Roman, Lucic, Bono, J. Jay Liu, Tomasz Bączek

Machine learning methods for cross-column prediction of retention time in reversed-phased liquid chromatography

8th World Conference on Physico Chemical Methods in Drug Discovery and Developmen, IAPC, Split, Croatia, 2019

Konferenz
Quantitative structure-retention relationships (QSRR) were employed to build global models for prediction of chromatographic retention time of synthetic peptides across six RP-LC-MS/MS columns and varied experimental conditions. The global QSRR models were based on only three a priori selected molecular descriptors: sum of gradient retention times of 20 natural amino acids (logSumAA), van der Waals volume (logvdWvol.), and hydrophobicity (clogP) related to the retention mechanism of RP-LC separation of peptides. Three machine learning regression methods were compared: random forests (RF), partial least squares (PLS), and adaptive boosting (ADA). All the models were comprehensively optimized through 3-fold cross-validation (CV) and validated through an external validation set. The chemical domain of applicability was also defined. Percentage root mean square error of prediction (%RMSEP) was used as an external validation metric. Results have shown that RF exhibited a %RMSEP of 14.99 %; PLS exhibited a %RMSEP of 40.561 %; whereas ADA exhibited a %RMSEP of 26.35 %. The ensemble models considerably outperform the conventional PLS-based QSRR model. Novel methods of tree-based model explainability were employed to reveal mechanisms behind black-box global ensemble QSRR models. The models revelead the highest feature importance for sum of gradient retention times (logSumAA), followed by van der Waals volume (logvdWvol.), and hydrophobicity (clogP). The promising results of this study show the potential of machine learning for improved peptide identification, retention time standardization and integration into state-of-the-art LC-MS/MS proteomics workflows.
2019

Lovric Mario, Petar Žuvela, Bono Lucic, J. Jay Liu, Kern Roman, Tomasz Bączek

Machine learning methods for cross-column prediction of retention time in reversed-phased liquid chromatography

8th World Conference on Physico Chemical Methods in Drug Discovery and Developmen, International Association of Physical Chemists (IAPC, Split, Croatia, 2019

Konferenz
Quantitative structure-retention relationships (QSRR) were employed to build global models for prediction of chromatographic retention time of synthetic peptides across six RP-LC-MS/MS columns and varied experimental conditions. The global QSRR models were based on only three a priori selected molecular descriptors: sum of gradient retention times of 20 natural amino acids (logSumAA), van der Waals volume (logvdWvol.), and hydrophobicity (clogP) related to the retention mechanism of RP-LC separation of peptides. Three machine learning regression methods were compared: random forests (RF), partial least squares (PLS), and adaptive boosting (ADA). All the models were comprehensively optimized through 3-fold cross-validation (CV) and validated through an external validation set. The chemical domain of applicability was also defined. Percentage root mean square error of prediction (%RMSEP) was used as an external validation metric. Results have shown that RF exhibited a %RMSEP of 14.99 %; PLS exhibited a %RMSEP of 40.561 %; whereas ADA exhibited a %RMSEP of 26.35 %. The ensemble models considerably outperform the conventional PLS-based QSRR model. Novel methods of tree-based model explainability were employed to reveal mechanisms behind black-box global ensemble QSRR models. The models revelead the highest feature importance for sum of gradient retention times (logSumAA), followed by van der Waals volume (logvdWvol.), and hydrophobicity (clogP). The promising results of this study show the potential of machine learning for improved peptide identification, retention time standardization and integration into state-of-the-art LC-MS/MS proteomics workflows.
Kontakt Karriere

Hiermit erkläre ich ausdrücklich meine Einwilligung zum Einsatz und zur Speicherung von Cookies. Weiter Informationen finden sich unter Datenschutzerklärung

The cookie settings on this website are set to "allow cookies" to give you the best browsing experience possible. If you continue to use this website without changing your cookie settings or you click "Accept" below then you are consenting to this.

Close