Difference between revisions of "Eland"

From Genome Technology Core (GTC) wiki - Sequencing and Microarray
Jump to: navigation, search
Line 1: Line 1:
 
ELAND - E fficient L arge-Scale A lignment of N ucleotide D atabases - is an alignment tool integrated in Illumina-Solexa data processing package, can do ungapped alignment for reads with size up to 32 bp (Cox, unpublished). Although the core program is a binary executable but we have built certain wrapper scripts to be able to run the ELAND program independently of the pipeline.
 
ELAND - E fficient L arge-Scale A lignment of N ucleotide D atabases - is an alignment tool integrated in Illumina-Solexa data processing package, can do ungapped alignment for reads with size up to 32 bp (Cox, unpublished). Although the core program is a binary executable but we have built certain wrapper scripts to be able to run the ELAND program independently of the pipeline.
 
== To run the program ==
 
 
Step 1: Please copy the script titled - "run_eland_v1.pl" - from "\\Gobo\wicmt_public\public_apps\ELAND" folder to your home directory.
 
 
Step 2: Copy the *_sequence.txt files from the "quality score" folders to your working directory where you have write permissions.
 
 
Step 3: Open the "run_eland_v1.pl" in any text editor and modify the parameter according to the instructions in the file. Please email Sumeet Gupta, sgupta@wi.mit.edu, if you have any questions.
 
 
Step 4: Logon to your tak account.
 
 
Step 5: Run the perl script "run_eland_v1.pl".
 
 
== Output Files ==
 
 
1. [Prefix]_[GenomeUsed]_[SeedUsed]_eland_extended.txt - Gives all the positions a read maps, if it matches to less than 10 positions.
 
 
2. [Prefix]_[GenomeUsed]_[SeedUsed]_export.txt - Contains all read, quality value and alignment information for a lane of data.
 
 
3. [Prefix]_[GenomeUsed]_[SeedUsed].ylf - (Y)oung (L)ab (F)ormat files - These are used as input for the young lab error model for ChIP-Seq analysis.
 
 
4. [Prefix]_[GenomeUsed]_[SeedUsed]_sorted.txt - Contains all the uniquely mapped reads, sorted based on the position on the genome.
 
 
5. [Prefix]_[GenomeUsed]_[SeedUsed]_genomesize.xml - Summary for the genome being used for the alignment
 
 
6. TechnicalAnalysis-[Prefix]_[GenomeUsed]_[SeedUsed].xls - QC Report for the ELAND alignment - Please refer here for details - http://jura.wi.mit.edu/genomecorewiki/index.php/QCOutputFormat
 
 
7. [Prefix]_[GenomeUsed]_[SeedUsed]_tagcount.txt - Gives a count for all the reads, their corresponding counts, position on the genome and the gene name if it is in the transcribed region.
 

Revision as of 11:06, 6 October 2009

ELAND - E fficient L arge-Scale A lignment of N ucleotide D atabases - is an alignment tool integrated in Illumina-Solexa data processing package, can do ungapped alignment for reads with size up to 32 bp (Cox, unpublished). Although the core program is a binary executable but we have built certain wrapper scripts to be able to run the ELAND program independently of the pipeline.