Classifying copy number alterations

vonnwalter · February 12, 2023, 2:01am

Hi Everyone,

The documentation states that the gene-level copy number values in the DepMap download are log2(copy_ratio + 1). Presumably copy_ratio = copy_number/2, in which case it is easy to compute quantitative gene-level copy number values. Can anyone provide guidance on thresholds for classifying gains and losses? For example, the GISTIC pipeline produced both quantitative and discrete copy number calls (-2 = homozygous deletion, -1 = heterozygous deletion, 0 = copy neutral, 1 = low gain, 2 = high gain) for the various TCGA cohorts. The general consensus is that -0.1 and 0.1 were thresholds for classifying gains and losses, respectively, based on the quantitative gene-level copy number values from GISTIC. However, these values went through an extensive normalization procedure, so I don’t think there’s no reason to expect the same thresholds to be applicable to the CCLE data. Any thoughts would be greatly appreciated.

simz · February 13, 2023, 4:27pm

Hello,

We currently don’t have a specific threshold recommendation, but perhaps the discussion in this thread could be helpful.

And in case you are interested in absolute copy number calls instead of relative to classify CNAs, we currently have data generated by the ABSOLUTE algorithm on the CCLE lines. You can find the file called CCLE_ABSOLUTE_combined_20181227 in the “CCLE 2019” data set on the download page. We are also working on running PureCN on all of our current DepMap data for absolute copy number calls, and we’re hoping to include it as part of the release in the future.

Hope this helps!
Simone

Angelina_Yershova · December 8, 2023, 11:31pm

Dear simz,

could you please explain what Modal_HSCN_1, Modal_HSCN_2 and LOH mean? I struggle to understand what “major” and “minor” means here. Is LOH a deletion of one copy of the segment?

Also, is ploidy in ABSOLUTE - absolute or is it a divisible of these numbers?
Thank you!

simz · January 2, 2024, 9:27pm

Hi Angelina,

HSCN = haplotype-specific copy number. HSCN1 and HSCN2 correspond to allelic absolute copy numbers and add their sum is Modal_Total_CN. In this context, the major allele is the more prevalent allele and the minor allele is the less prevalent.

Since ABSOLUTE is not developed by depmap, I’d recommend referring to the paper for details on the method: Absolute quantification of somatic DNA alterations in human cancer - PMC. According to the paper, “LOH (loss of heterozygosity) was defined as 0 allelic copies. Amplification was defined as > 1allelic copy for samples with 0 genome doublings, and as > 2 allelic copies for those with 1genome doubling. Calls were made based on the modal allelic copy numbers of each chromosome arm.”

Best,
Simone

Topic		Replies	Views
Defining "deep" deletions and amplifications Q&A	9	6718	March 22, 2024
Determining Copy Number Alterations for Genes/, as boolean, in DepMap Public 22Q2 Q&A	1	387	February 28, 2024
How to calculate absolute copy number from relative copy number? Q&A	6	2173	March 24, 2022
What is relative copy number/copy number ratio? Q&A omics , documentation	13	14727	February 9, 2022
Question about segtab annotations in ABSOLUTE CN file Q&A	3	211	May 9, 2024

Classifying copy number alterations

Related topics