I noticed that there are two entries for ACH-003186 in the OmicsExpressionTPMLogp1HumanProteinCodingGenes.csv file, each with different SequencingID and IsDefaultEntryForModel values.
Could you clarify why there are duplicate entries for this model? Additionally, should we use the entry where IsDefaultEntryForModel is set to “yes”?
There can be multiple reasons why a model can have more than one Sequencing IDs (different growth conditions within one model, repeated sequencing, etc.). Each SequencingID represents a unique sequencing and those within the same model can be considered replicates to some extent. The expression processing pipeline is identical across all samples in each release. If your goal is to map which sequencing output represents each Model, setting IsDefaultEntryForModel to “Yes” would be the way to go.
I’d recommend checking out the more detailed walkthrough in the overview tab of our download page.