Hope you can help me. For the analysis I am counting the number of mutated genes in all cell lines. For this I count the number of different gene names using the “Hugo_Symbol” column from the “CCLE_mutations.csv” file (Cellular Models Mutation Public 21Q1). As a result I got the 19540 genes the have description of mutations in file.
However, the description of the “CCLE_mutations.csv” file indicates the number 18788 of genes. Are 18788 are the number of mutated in cell lines genes? If yes, then, maybe you can help me to understand which parametr I should use to count number of mutated genes correctly.
Thank you in advance,