1 | | = Gene, exon and transcript counts = |
2 | | |
3 | | |
4 | | == Counts and Spearman correlations for run 1 == |
5 | | |
6 | | Date: 06-november-2013[[BR]] |
7 | | Analysis by: Peter-Bram 't Hoen[[BR]] |
8 | | |
9 | | The combined gene counts for the 2330 samples from run 1 are available on the VM: /virdir/Backup/run_1_gene_counts/combined_gene_count_run_1.txt and were generated using this script: [raw-attachment:merge_count_script.r R script for merging gene count tables][[BR]] |
10 | | Subsequently, pairwise Spearman correlations were calculated: /virdir/Backup/run_1_gene_counts/Spearman_correlations_complete_gene_data_run_1.txt[[BR]] |
11 | | From these the median Spearman correlation for each sample to each other sample was calculated. This is also called the D-statistic. The D-statistics (ranked from low to high) can be found in this file [raw-attachment:Median_pairwise_spearman_correlations_complete_gene_data_run_1.txt Median Spearman correlations][[BR]] |
12 | | |
13 | | [raw-attachment:Median_pairwise_spearman_correlations_by_flowcell_complete_gene_data_run_1.pdf Boxplot of median Spearman correlations grouped by flowcell] (Martijn Vermaat)[[BR]] |
14 | | |
15 | | [raw-attachment:Dstat_biobank_boxplot.pdf Boxplot of median Spearman correlations grouped by biobank] [[BR]] |
16 | | |
17 | | After removing the two samples with very low Spearman correlations to all other samples, the distance matrix was calculated (1 - correlation matrix), and a two-dimensional MDS plot was created using the R function cmdscale. [raw-attachment:mdsplot_filt_colored_biobank.pdf This is the resulting mdsplot]. The plot was colored according to the following color scheme: [[BR]] |
18 | | "LL" - gold[[BR]] |
19 | | "RS" - blue[[BR]] |
20 | | "CODAM" - orange[[BR]] |
21 | | "LLS" - pink[[BR]] |
22 | | "Amsterdam" - darkred[[BR]] |
23 | | |
24 | | Same mds plot but now colored according to mean GC percentage: [raw-attachment:mdsplot_filt_colored_gc.pdf mdsplot GC] |
25 | | |
26 | | |
| 1 | This page has been moved to [wiki:FgGeneExonTranscriptCounts Gene, exon and transcript counts]. |