42 | | The last step in preparing the immunochip data for comparison with the sequence data is to liftover the VCF file to the same Human Genome Reference as the Sequence data so that comparisons can be made. All is explained here: [[LiftOver_Genome_Assemblies]] |
| 42 | The last step in preparing the immunochip data for comparison with the sequence data is to liftover the VCF file to the same Human Genome Reference as the Sequence data so that comparisons can be made. Here is how: |
| 43 | # Get the appropriate chain files |
| 44 | #* From the [ftp://gsapubftp-anonymous@ftp.broadinstitute.org Broad GSA ftp] (password is blank) |
| 45 | #* From the [http://hgdownload.cse.ucsc.edu/downloads.html#human USCS Genome Browser] |
| 46 | # Get the appropriate fasta files (you'll need both from-and-to builds fasta files) |
| 47 | #* From [ftp://ftp.ncbi.nlm.nih.gov/genomes/H_sapiens/ NCBI] |
| 48 | #* From [http://hgdownload.cse.ucsc.edu/downloads.html#human USCS Genome Browser] |
| 49 | # Index the fasta files appropriately to get .fai (samtools) and .dict files (Picard) as described on the [http://www.broadinstitute.org/gsa/wiki/index.php/Preparing_the_essential_GATK_input_files:_the_reference_genome GATK wiki] |
| 50 | # Get and run the [http://www.broadinstitute.org/gsa/wiki/index.php/LiftOverVCF.pl GATK LiftOver tool] |