I have downloaded the Batch_corrected_Expression_Public_24Q2 data. I understand that it integrates the Project Achilles and Project Score datasets, but how? What is the full preprocessing and data cleaning pipeline? I tried to find publications about this, but only found the 2021 paper on the integrated cross-study dataset. Are they following the same methods?
The effort to integrate the Achilles and “Project Score” datasets was focused on integrating the data from the CRISPR screens, not expression data.
I can’t find a file on the portal named “Batch_corrected_Expression_Public_24Q2” (maybe you generated it by downloading through the “custom downloads” UI?), but from the name, I suspect you’re looking at the data from OmicsExpressionProteinCodingGenesTPMLogp1BatchCorrected.
Clicking the link OmicsExpressionProteinCodingGenesTPMLogp1BatchCorrected will take you to the file description, which includes an explanation of how the file was generated.
Thanks,
Phil