Do others have problems opening RNAseq transcript .csv files? I have tried everything to open them, and my computer keeps “running out of memory” before the file opens.
Why does the CCLE have genes in columns and cell lines in rows? It seems like it would be easier to work with the genes if it were the other way around
Agreed. Excel only allows 16,384 columns of data, and most all of the download files exceed that in gene number. At least in the custom downloads page, would it be possible to default to the matrix where the genes are in rows and cell lines in columns? Or at least provide this as an option? Then there would be no issues for the Excel users.
FYI / be aware - if you use excel for this data in general - regardless of column/row orientation - excel will convert some of the gene symbols into dates/times, and then you won’t easily be able to match them up with other datasets and/or will have trouble interpreting the results.
It might make sense to check out R / Rstudio, the interface has gotten quite user friendly. It’s not excel and there will be a learning curve, but you’ll also gain a ton of functionality.
I took your advice to learn R. Working on downloading the CCLE_RNAseq_reads and getting this data into a R dataframe to work with.
Can you get all of the data from a .txt file to an R dataframe? If this is not good, how do you download the data from ccle in a format that can be imported into R?
Thanks