Skip to main contentSkip to search and navigation

UEF eREPOSITORY

    • English
    • suomi
  • English 
    • English
    • suomi
  • Login
View Item 
  •   Home
  • Artikkelit
  • Terveystieteiden tiedekunta
  • View Item
  •   Home
  • Artikkelit
  • Terveystieteiden tiedekunta
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

MaNGA: a novel multi-objective multi-niche genetic algorithm for QSAR modelling

Thumbnail
Files
Article (713.4Kb)
Self archived version
final draft
Date
2019
Author(s)
Serra, Angela
Önlü, Serli
Festa, Paola
Fortino, Vittorio
Greco, Dario
Unique identifier
10.1093/bioinformatics/btz521
Metadata
Show full item record
More information
Research Database SoleCris

Self-archived article

Citation
Serra, Angela. Önlü, Serli. Festa, Paola. Fortino, Vittorio. Greco, Dario. (2019). MaNGA: a novel multi-objective multi-niche genetic algorithm for QSAR modelling.  Bioinformatics, 2020; 36 (1) , 145-153. 10.1093/bioinformatics/btz521.
Rights
© The Author(s) 2019
Licensed under
All rights reserved
Abstract

Quantitative structure–activity relationship (QSAR) modelling is currently used in multiple fields to relate structural properties of compounds to their biological activities. This technique is also used for drug design purposes with the aim of predicting parameters that determine drug behaviour. To this end, a sophisticated process, involving various analytical steps concatenated in series, is employed to identify and fine-tune the optimal set of predictors from a large dataset of molecular descriptors (MDs). The search of the optimal model requires to optimize multiple objectives at the same time, as the aim is to obtain the minimal set of features that maximizes the goodness of fit and the applicability domain (AD). Hence, a multi-objective optimization strategy, improving multiple parameters in parallel, can be applied. Here we propose a new multi-niche multi-objective genetic algorithm that simultaneously enables stable feature selection as well as obtaining robust and validated regression models with maximized AD. We benchmarked our method on two simulated datasets. Moreover, we analyzed an aquatic acute toxicity dataset and compared the performances of single- and multi-objective fitness functions on different regression models. Our results show that our multi-objective algorithm is a valid alternative to classical QSAR modelling strategy, for continuous response values, since it automatically finds the model with the best compromise between statistical robustness, predictive performance, widest AD, and the smallest number of MDs.

URI
https://erepo.uef.fi/handle/123456789/8183
Link to the original item
http://dx.doi.org/10.1093/bioinformatics/btz521
Publisher
Oxford University Press (OUP)
Collections
  • Terveystieteiden tiedekunta [1324]
University of Eastern Finland
OpenAccess
eRepo
erepo@uef.fi
OpenUEF
Service provided by
the University of Eastern Finland Library
Library web pages
Twitter
Facebook
Youtube
Library blog
 sitemap
Search

Browse

All of the ArchiveResource types & CollectionsBy Issue DateAuthorsTitlesSubjectsFacultyDepartmentFull organizationSeriesMain subjectThis CollectionBy Issue DateAuthorsTitlesSubjectsFacultyDepartmentFull organizationSeriesMain subject

My Account

Login
University of Eastern Finland
OpenAccess
eRepo
erepo@uef.fi
OpenUEF
Service provided by
the University of Eastern Finland Library
Library web pages
Twitter
Facebook
Youtube
Library blog
 sitemap