Absolute CN = Line ploidy * relative CN = Line ploidy * ((2 ** relative CN from the dataset) - 1)
I then rounded the results and compared some of them to the data from Absolute Copy Number section in the data explorer. While most of my results (est. 90%) agreed with the data explorer, other results were off. For example:
Additionally all absolute calls were made on the CCLE2 data. It is likely that some lines got more up to date sequencing (going from WES->WGS or Affymetrix → WES/WGS) which would also create differences between ABSOLUTE values and the ones you are getting from the latest datasets.
Overall CN are not super precise measurements and finding this variance in CN measurements of Cell Line data is completely normal.
Would it be possible by any chance to publish a dataset with absolute copy numbers, considering DepMap already has the data? It would be very much appreciated. In the meantime, I’ll check the article and the algorithm.
The file you are using (CCLE_ABSOLUTE_combined_20181227) is the absolute dataset.
We do not create new absolute data because there is a manual step involved, to choose between possible profile, which is quite arbitrary. Also it is not really made for cell lines (where purity is expected to be always 100%).
We are working on a simple replacement to infer karyotype based on relative CN but there is no deadline set yet.
Hi. If I want to perform absolute analysis, how to modify the segment file downloaded from DepMap? I think the relative copy ratio in this segment file is different from that column in the standard segment file.
We are planning to release Absolute CN pretty soon. We also have some absolute calls already available from the CCLE2 paper on the CCLE2 cell lines.
One simple way to find absolute calls is to find, for each line, the scaling value that sets the relative CN calls to integer values. This is way oversimplified but might do the trick for most cell lines.