BiomarkerBenchmark_GSE2109_Lung

Status: Complete

Testing Directory . . .

Results: PASS


Testing Configuration File . . .

✅ config.yaml contains all necessary configurations.

✅ Title is less than 100 characters

✅ description.md contains a description.

Results: PASS


Running Install . . .

Executing install.sh: Success

Results: PASS


Testing file paths:

✅ test_data.tsv exists.

✅ test_metadata.tsv exists.

✅ download.sh exists.

✅ install.sh exists.

✅ parse.sh exists.

✅ cleanup.sh exists.

✅ description.md exists.

✅ config.yaml exists.

Running user code . . .

Executing download.sh: Success

Executing parse.sh: Success

✅ data.tsv.gz was created and zipped correctly.

✅ metadata.tsv.gz was created and zipped correctly.

Results: PASS


Testing Key Files:

✅ test_data.tsv contains enough unique samples to test

✅ test_data.tsv contains enough test cases (8; min: 8)

✅ test_metadata.tsv contains enough unique samples to test

✅ test_metadata.tsv contains enough test cases (8; min: 8)

Results: PASS


First 5 columns and 5 rows of data.tsv.gz:

Sample    ENSG00000000003   ENSG00000000005  ENSG00000000419   ENSG00000000457
GSM38100  1.40368261764706  -0.25811851375   2.23105485222222  0.4314917628125
GSM38103  1.92959003882353  0.02999554625    1.84192114111111  0.8087353546875
GSM38104  1.85159212117647  -0.2049791175    2.02616687        0.3431711746875
GSM46824  1.90478119705882  -0.2592340275    2.22703217444444  0.7706966209375

Columns: 20025 Rows: 104


“data.tsv.gz” Test Cases (from rows in test file). . .

✅ First column of file is titled “Sample”

✅ Row 1: Success

✅ Row 2: Success

✅ Row 3: Success

✅ Row 4: Success

✅ Row 5: Success

✅ Row 6: Success

✅ Row 7: Success

✅ Row 8: Success

Results: PASS


First 3 columns and 5 rows of metadata.tsv.gz:

Sample    Variable                                 Value
GSM38100  Alcohol_Consumption                      Yes
GSM38100  Days_from_Patient_Diagnosis_to_Excision  24
GSM38100  Ethnic_Background                        Caucasian
GSM38100  Family_History_of_Cancer                 Yes

Columns: 3 Rows: 1921


“metadata.tsv.gz” Test Cases (from rows in test file). . .

✅ First column of file is titled “Sample”

The value of variable "Ethnic_Background" is the same for all samples ("Caucasian"). This variable has been removed from metadata.tsv.gz.

The value of variable "Pathological_Stage_During_or_Following_Multimodality_Therapy" is the same for all samples ("No"). This variable has been removed from metadata.tsv.gz.

The value of variable "Primary_Site" is the same for all samples ("Lung"). This variable has been removed from metadata.tsv.gz.

✅ Row 1: Success

✅ Row 2: Success

✅ Row 3: Success

✅ Row 4: Success

✅ Row 5: Success

✅ Row 6: Success

✅ Row 7: Success

✅ Row 8: Success

Results: PASS
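
The removal of single-valued variables reported above can be sketched as follows. This is a minimal illustration, not the validator's actual code: the function name `drop_constant_variables` is hypothetical, the rows are assumed to be (Sample, Variable, Value) triples as in the preview, and the GSM38103 values are made up for the example.

```python
from collections import defaultdict


def drop_constant_variables(rows):
    """Remove variables whose value is identical across all rows.

    rows: list of (sample, variable, value) tuples in long format.
    Returns (kept_rows, sorted list of removed variable names).
    """
    values_by_variable = defaultdict(set)
    for _sample, variable, value in rows:
        values_by_variable[variable].add(value)

    # A variable with exactly one distinct value carries no information.
    constant = {name for name, values in values_by_variable.items()
                if len(values) == 1}
    kept = [row for row in rows if row[1] not in constant]
    return kept, sorted(constant)


# Illustrative rows; the GSM38103 values are hypothetical.
rows = [
    ("GSM38100", "Ethnic_Background", "Caucasian"),
    ("GSM38103", "Ethnic_Background", "Caucasian"),
    ("GSM38100", "Alcohol_Consumption", "Yes"),
    ("GSM38103", "Alcohol_Consumption", "No"),
]
kept, removed = drop_constant_variables(rows)
```

In this sketch `removed` lists "Ethnic_Background", and `kept` retains only the two "Alcohol_Consumption" rows.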


Comparing samples in both files . . .

✅ Samples are the same in both “data.tsv.gz” & “metadata.tsv.gz”

Results: PASS
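
A sample-consistency check like the one above could be reproduced with the sketch below. It assumes only what the log shows: both files are gzipped, tab-separated, and have a first column titled "Sample". The helper names `read_samples` and `compare_samples` are hypothetical and are not taken from the validator.

```python
import csv
import gzip


def read_samples(path):
    """Return the set of values in the 'Sample' column of a gzipped TSV."""
    with gzip.open(path, "rt", newline="") as handle:
        reader = csv.DictReader(handle, delimiter="\t")
        return {row["Sample"] for row in reader}


def compare_samples(data_path, metadata_path):
    """Return (samples only in data, samples only in metadata)."""
    data_samples = read_samples(data_path)
    metadata_samples = read_samples(metadata_path)
    return data_samples - metadata_samples, metadata_samples - data_samples
```

Calling `compare_samples("data.tsv.gz", "metadata.tsv.gz")` returns two sets; both are empty when the files describe the same samples.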


Testing Directory after cleanup . . .

Results: PASS