Query Examples
Querying datasets in GEO OmixAtlas
- Identify datasets for the tissue Breast, the disease Breast Neoplasms, and the organism Homo sapiens
- Fetch datasets from Depmap that contain gene dependency information from CRISPR screening experiments
- Identify all transcriptome datasets for the disease Hepatocellular Carcinoma in human and mouse
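
The three dataset-level queries listed above could be sketched roughly as follows. This is an illustrative sketch, not the documented queries. The table names `geo.datasets` and `depmap.datasets` are assumed by analogy with `tcga.datasets` used elsewhere on this page. The columns `curated_tissue`, `curated_organism`, and `platform`, the Presto-style `CONTAINS` predicate on array-valued curated columns, and the literal values (`'Gene dependency'`, `'CRISPR Screening'`, `'Transcriptomics'`, `'Carcinoma, Hepatocellular'`, `'Mus musculus'`) are also assumptions that should be checked against the schema of your OmixAtlas.

```python
# Sketch only: table names, curated_* columns, the platform column,
# and all literal values below are assumptions, not confirmed schema.

# Tissue Breast, disease Breast Neoplasms, organism Homo sapiens.
breast_query = """SELECT dataset_id, curated_disease, curated_tissue, curated_organism
FROM geo.datasets
WHERE CONTAINS(curated_tissue, 'Breast')
AND CONTAINS(curated_disease, 'Breast Neoplasms')
AND CONTAINS(curated_organism, 'Homo sapiens')"""

# Gene dependency datasets derived from CRISPR screening experiments.
depmap_query = """SELECT dataset_id, data_type, platform
FROM depmap.datasets
WHERE data_type = 'Gene dependency'
AND platform = 'CRISPR Screening'"""

# Transcriptome datasets for Hepatocellular Carcinoma in human or mouse.
# The parentheses keep the OR scoped to the organism condition only.
hcc_query = """SELECT dataset_id, curated_disease, curated_organism
FROM geo.datasets
WHERE data_type = 'Transcriptomics'
AND CONTAINS(curated_disease, 'Carcinoma, Hepatocellular')
AND (CONTAINS(curated_organism, 'Homo sapiens')
     OR CONTAINS(curated_organism, 'Mus musculus'))"""
```

Each string can then be passed to whichever query call your client exposes. Selecting a few named columns rather than `*` keeps the result easier to inspect.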
Querying samples in GEO OmixAtlas
- Get the sample name, dataset ID, and extract_protocol_ch1 for samples whose extract_protocol_ch1 mentions spectroscopy
- Get each disease name and the number of samples for which that disease is curated
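
The two sample-level queries listed above could look something like the sketch below. The table name `geo.samples` is assumed by analogy with `tcga.datasets`, and `src_dataset_id` is assumed to be the dataset ID column on samples, as it is on `tcga.features`. The second query further assumes that `curated_disease` is an array column and that the engine supports the Presto-style `CARDINALITY` function. Verify all of these against your schema.

```python
# Sketch only: geo.samples, src_dataset_id on samples, and an
# array-typed curated_disease column are assumptions.

# Samples whose extraction protocol mentions spectroscopy.
# LOWER(...) makes the substring match case-insensitive.
spectroscopy_query = """SELECT name, src_dataset_id, extract_protocol_ch1
FROM geo.samples
WHERE LOWER(extract_protocol_ch1) LIKE '%spectroscopy%'"""

# Number of samples per curated disease, skipping samples
# where no disease has been curated (empty array).
disease_count_query = """SELECT curated_disease, COUNT(*) AS sample_count
FROM geo.samples
WHERE CARDINALITY(curated_disease) > 0
GROUP BY curated_disease
ORDER BY sample_count DESC"""
```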
Querying data matrix in GEO OmixAtlas
Fetch data matrix for selected genes for a dataset ID of interest
```
# The tuple's repr doubles as the SQL IN list; genes are lowercase to match LOWER(rid)
gene = ('hbb-y', 'fth1', 'bbip1', 'actb')
query = (
    "SELECT * FROM data_matrices.geo__GSE4230_GPL1261 "
    f"WHERE LOWER(rid) IN {gene}"
)
```
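
One caveat with interpolating a Python tuple into the query: a one-element tuple renders with a trailing comma, as `('actb',)`, which is not a valid SQL list. A minimal sketch of a workaround is shown below. The helper name `sql_in_list` is hypothetical and not part of any library.

```python
def sql_in_list(values):
    """Render values as a parenthesised, single-quoted, lowercased SQL list."""
    if not values:
        raise ValueError("at least one value is required")
    # Lowercase to match LOWER(rid); double any embedded single quote,
    # which is the standard SQL escape for string literals.
    quoted = ", ".join(
        "'" + str(v).lower().replace("'", "''") + "'" for v in values
    )
    return f"({quoted})"


genes = ["Actb"]  # a single gene, the case where a raw tuple would break
query = (
    "SELECT * FROM data_matrices.geo__GSE4230_GPL1261 "
    f"WHERE LOWER(rid) IN {sql_in_list(genes)}"
)
print(query)
```

Building the list explicitly also lets callers pass gene symbols in any case, since the helper lowercases them before quoting.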
Other Query Examples
Select a few feature-level metadata fields for selected genes from Mutation datasets of TCGA where dataset_id contains BRCA
```
query = """SELECT src_dataset_id, disease, protein_position, amino_acids, sequencer, impact, variant_class, consequence, name
FROM tcga.features AS features
JOIN (
SELECT dataset_id AS dataset_id, curated_disease AS disease FROM tcga.datasets WHERE data_type LIKE 'Mutation') AS datasets
ON features.src_dataset_id = datasets.dataset_id
WHERE hugo_symbol IN ('TP53','PIK3CA','CDH1','GATA3') AND features.src_dataset_id LIKE '%BRCA%'
ORDER BY features.src_dataset_id"""
```
Tutorial Notebooks
Example queries are provided in various notebooks in this GitHub folder.