Skip to content

Data Schema

What is the standardized schema of OmixAtlas?

Data schema: the data available within OmixAtlas is curated within defined indexes on the basis of the information it contains. These indexes are:

  • Dataset-level metadata (index: files): Contains curated fields like drug, disease, tissue organism, etc., for each dataset.
  • Sample-level metadata (index: gct_metadata, h5ad_metadata, and biom_metadata): Contains curated fields like cell lines, experimental design, etc., for each sample.
  • Feature level metadata (gct_row_metadata, h5ad_data, and biom_data): Contains the gene/molecule symbol along with the feature intensity for each sample.
  • Variant-related data (index: variant_data): Contains the schema for variant-related information present in vcf files