loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Enhancing the Efficiency of the Grouping-Scoring-Modeling Framework with Statistical Pre-Scoring Component for Transcriptomic Data Analysis

Topics: Biostatistics and Stochastic Models; Genomics and Proteomics; Machine Learning Algorithms, Data Mining Techniques and Deep Learning Tools; Precision Medicine and Data-Driven Healthcare; Transcriptomics

Authors: Maham Khokhar 1 ; Burcu Bakir-Gungor 2 and Malik Yousef 3

Affiliations: 1 Department of Data Science, Social Sciences Institute, Abdullah Gul University, Kayseri, 38080, Turkey ; 2 Department of Computer Engineering, Faculty of Engineering, Abdullah Gul University, Kayseri, 38080, Turkey ; 3 Department of Information Systems, Galilee Digital Health Research Center, Zefat Academic College, 13206, Zefat, Israel

Keyword(s): Transcriptomics Data Analysis, Feature Selection, Machine Learning, Biomarker Discovery.

Abstract: The advent of high-throughput transcriptomic technologies has generated vast transcriptomic datasets, challenging current analytical methodologies with their sheer volume and complexity. The Grouping-Scoring-Modeling (G-S-M) approach is one of the recent approaches that treat groups of genes (or clusters of genes) by embedding prior biological knowledge with machine learning in order to detect the most significant groups for classification tasks. The G-S-M might need to treat thousand ten thousand of groups (scoring those groups) which might affect the speed and performance of the algorithm. In response, this study introduces the Pre-Scoring G-S-M model, an enhancement of the established Grouping-Scoring-Modeling (G-S-M) framework. This approach incorporates a Pre-Scoring component that leverages the Limma package for its empirical Bayes methods to optimize initial transcriptomic data evaluation through a percentage-based selection of statistically significant gene groups. Aimed at r educing computational demand and streamlining feature selection, the model also addresses data redundancy by eliminating duplicate gene-disease associations. Application to nine human gene expression datasets from the GEO database showed promising results. It demonstrated improvements in computational efficiency and analytical precision while reducing the number of features selected per dataset compared to the traditional G-S-M approach, without compromising accuracy. These initial findings highlight the Pre-Scoring G-S-M model's potential to enhance transcriptomic data analysis, indicating a promising direction for future bioinformatics research. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.243

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Khokhar, M., Bakir-Gungor, B. and Yousef, M. (2025). Enhancing the Efficiency of the Grouping-Scoring-Modeling Framework with Statistical Pre-Scoring Component for Transcriptomic Data Analysis. In Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOINFORMATICS; ISBN 978-989-758-731-3; ISSN 2184-4305, SciTePress, pages 479-488. DOI: 10.5220/0013192600003911

@conference{bioinformatics25,
author={Maham Khokhar and Burcu Bakir{-}Gungor and Malik Yousef},
title={Enhancing the Efficiency of the Grouping-Scoring-Modeling Framework with Statistical Pre-Scoring Component for Transcriptomic Data Analysis},
booktitle={Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOINFORMATICS},
year={2025},
pages={479-488},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013192600003911},
isbn={978-989-758-731-3},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOINFORMATICS
TI - Enhancing the Efficiency of the Grouping-Scoring-Modeling Framework with Statistical Pre-Scoring Component for Transcriptomic Data Analysis
SN - 978-989-758-731-3
IS - 2184-4305
AU - Khokhar, M.
AU - Bakir-Gungor, B.
AU - Yousef, M.
PY - 2025
SP - 479
EP - 488
DO - 10.5220/0013192600003911
PB - SciTePress