I am curious to know how TRUE/FALSE is determined in the isDeleterious column of the CCLE_mutations.csv file. Is it a combined score of multiple variant predictors? Apologies if this is already written up in a publication.
The isDeleterious column is an outdated annotation that we do not recommend using. We will drop it in a future release. Please use the "Variant_annotation" column instead.
Thanks for getting back to me @jnoorbak. Are there plans to include SIFT/PolyPhen scores for these mutations? Perhaps this was the original intent of isDeleterious?
The original goal was to capture the damaging effect of the mutation. In most cases, this column corresponds to one of the following mutation types from the "Variant_Classification" column (Oncotator annotations), though there are exceptions:
"De_novo_Start_OutOfFrame", "Nonsense_Mutation", "Nonstop_Mutation", "Splice_Site", "Frame_Shift_Del", "Frame_Shift_Ins", "Start_Codon_Del", "Start_Codon_Ins", "Stop_Codon_Del", "Stop_Codon_Ins"
We do plan to update the mutation functional annotations in the future, but the timeline is unclear right now. We do generate SIFT/PolyPhen scores, but they are not currently reported. I'll keep in mind including them once we get to that. Thanks
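For anyone who wants to approximate the old flag in the meantime, here is a minimal sketch in Python that marks a row as likely damaging when its Variant_Classification value is one of the types listed above. It is not the pipeline's actual logic, and because of the exceptions mentioned above it will not reproduce isDeleterious exactly. The function names, the "likely_damaging" key, the gene names, and the sample rows are all made up for illustration, and it assumes the file has a Hugo_Symbol column.

```python
import csv
import io

# Variant_Classification values listed in this thread as usually
# corresponding to isDeleterious == TRUE (exceptions exist).
DAMAGING_CLASSES = frozenset({
    "De_novo_Start_OutOfFrame",
    "Nonsense_Mutation",
    "Nonstop_Mutation",
    "Splice_Site",
    "Frame_Shift_Del",
    "Frame_Shift_Ins",
    "Start_Codon_Del",
    "Start_Codon_Ins",
    "Stop_Codon_Del",
    "Stop_Codon_Ins",
})


def is_likely_damaging(variant_classification):
    """Return True if the classification is in the list from this thread."""
    return variant_classification.strip() in DAMAGING_CLASSES


def flag_rows(csv_file):
    """Yield each CSV row as a dict with an added 'likely_damaging' key.

    'likely_damaging' is a hypothetical name, not a column in the real file.
    """
    for row in csv.DictReader(csv_file):
        row["likely_damaging"] = is_likely_damaging(row["Variant_Classification"])
        yield row


# Made-up sample rows standing in for CCLE_mutations.csv.
SAMPLE = """Hugo_Symbol,Variant_Classification
GENE_A,Nonsense_Mutation
GENE_B,Missense_Mutation
GENE_C,Frame_Shift_Del
"""

flagged = list(flag_rows(io.StringIO(SAMPLE)))
for row in flagged:
    print(row["Hugo_Symbol"], row["likely_damaging"])
```

To run it on the real file, pass an open file handle, e.g. open("CCLE_mutations.csv", newline=""), in place of the io.StringIO sample.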
Thank you for the clarification. I will mark your response here as the solution.
Cheers