Normal view MARC view ISBD view

Performance analysis of the Survival-SVM classifier applied to gene-expression databases

By:

Camele, Genaro

Contributor(s):

Hasperué, Waldo

Material type: Article

ArticleDescription: 1 archivo (781,6 kB)Subject(s):

Online resources:

Click here to access online

Summary: The analysis of epigenetic information for the diagnosis and prognosis of patients has been gaining relevance in recent years due to the technological progress that entails a decrease in information extraction and processing costs. One of the tasks most commonly carried out in this area is obtaining models that allow using patient epigenetic information to make inferences about survival analysis. As a result, optimizing these models turns into a problem of great interest today. In this article, the evaluation of different metrics and execution times for the Survival Support Vector Machines model is carried out through survival analysis applied to gene expression databases. Different experiments were performed varying the number of genes used for training to measure the correlation between model performance and data growth. The results showed that linear and polynomial kernels offer a better balance between execution time and model predictive power when the number of genes to be evaluated is less than 2000, while the cosine and RBF kernels are better candidates otherwise.

Average rating: 0.0 (0 votes)

Holdings ( 1 )
Title notes ( 3 )

Holdings
Item type	Home library	Collection	Call number	URL	Status	Date due	Barcode
Capítulo de libro	Biblioteca de la Facultad de Informática	Biblioteca digital	A1330 (Browse shelf(Opens below))	Link to resource	No corresponde

Formato de archivo PDF. -- Este documento es producción intelectual de la Facultad de Informática - UNLP (Colección BIPA/Biblioteca)

The analysis of epigenetic information for the diagnosis and prognosis of patients has been gaining relevance in recent years due to the technological progress that entails a decrease in information extraction and processing costs. One of the tasks most commonly carried out in this area is obtaining models that allow using patient epigenetic information to make inferences about survival analysis. As a result, optimizing these models turns into a problem of great interest today. In this article, the evaluation of different metrics and execution times for the Survival Support Vector Machines model is carried out through survival analysis applied to gene expression databases. Different experiments were performed varying the number of genes used for training to measure the correlation between model performance and data growth. The results showed that linear and polynomial kernels offer a better balance between execution time and model predictive power when the number of genes to be evaluated is less than 2000, while the cosine and RBF kernels are better candidates otherwise.

Congreso Argentino de Ciencias de la Computación (29no : 2023 : Luján, Argentina)