Prediction of cancer driver genes and mutations: the potential of integrative computational frameworks

Authors: Mona Nourbakhsh (Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark), Kristine Degn (Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark), Astrid Saksager (Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark), Matteo Tiberti (Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark), Elena Papaleo (Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark)

arXiv: 2303.17402v1 - DOI (q-bio.GN)
35 pages, 7 figures
License: CC BY-NC-ND 4.0

Abstract: The vast amount of sequencing data presently available allow the scientific community to explore a range of genetic variables that may drive and progress cancer. A myriad of predictive tools has been proposed, allowing researchers and clinicians to compare and prioritize driver genes and mutations and their relative pathogenicity. However, there is little consensus on the computational approach or a golden standard for comparison. Hence, benchmarking the different tools depends highly on the input data, indicating that overfitting is still a massive problem. One of the solutions is to limit the scope and usage of specific tool. However, such limitations forces researchers to walk on a tightrope between creating and using high-quality tools for a specific purpose and describing the complex alterations driving cancer. While the knowledge of cancer development increases every day, many bioinformatic pipelines rely on single nucleotide variants or alterations in a vacuum without accounting for cellular compartment, mutational burden, or disease progression. Even within bioinformatics and computational cancer biology, the research fields work in silos, risking overlooking potential synergies or breakthroughs. Here, we provide an overview of databases and datasets for building or testing predictive tools for discovery of cancer drivers. We introduce predictive tools for driver genes, driver mutations, and the impact of these based on structural analysis. Additionally, we suggest and recommend directions in the field to avoid silo-research, moving in the direction of integrative frameworks.

Submitted to arXiv on 30 Mar. 2023

Explore the paper tree

Click on the tree nodes to be redirected to a given paper and access their summaries and virtual assistant

Also access our AI generated Summaries, or ask questions about this paper to our AI assistant.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.