You did already a great work in annotating the cell lines, nevertheless, I found some inconsistencies with other databases (for example for the NCI-H226 cell line). Do you plan to find a common disease annotation? For example that NCI-H226 is a mesothelioma cell line (as https://cellmodelpassports.sanger.ac.uk/ and ATCC say) and here in depmap it’s a NSCLC adenocarcinoma line, which should be wrong).
I am looking forward for your answer!
I did a quick check in the celligner tool and see that NCIH226 is closest to other cell lines that are annotated as mesothelioma. Might be interesting to. check if the TCGA samples it is closest to represent mesothelioma (is that in TCGA?) or something else.
(blue circle is NCIH226, gray circles are other cell lines, mostly mesothelioma, blue crosses are TCGA samples mostly lung)
FYI here’s a list of the closest TCGA samples: