Eland

From Genome Technology Core (GTC) wiki - Sequencing and Microarray
Revision as of 18:15, 1 September 2009 by Sgupta (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

ELAND - E fficient L arge-Scale A lignment of N ucleotide D atabases - is an alignment tool integrated in Illumina-Solexa data processing package, can do ungapped alignment for reads with size up to 32 bp (Cox, unpublished). Although the core program is a binary executable but we have built certain wrapper scripts to be able to run the ELAND program independently of the pipeline.

To run the program

Step 1: Please copy the script titled - "run_eland.pl" - from "\\Gobo\wicmt_public\public_apps\ELAND" folder to your home directory.

Step 2: Copy the *_sequence.txt files from the "quality score" folders to your working directory where you have write permissions.

Step 3: Open the "run_eland.pl" in any text editor and modify the parameter according to the instructions in the file. Please email Sumeet Gupta, sgupta@wi.mit.edu, if you have any questions.

Step 4: Logon to your tak account.

Step 5: Run the perl script "run_eland.pl".

Output Files

1. *_eland_extended.txt - Gives all the positions a read maps, if it matches to less than 10 positions.

2. *_export.txt - Contains all read, quality value and alignment information for a lane of data.

3. *.ylf - (Y)oung (L)ab (F)ormat files - These are used as input for the young lab error model for ChIP-Seq analysis.

4. *_sorted.txt - Contains all the uniquely mapped reads, sorted based on the position on the genome.

5. *_genomesize.xml - Summary for the genome being used for the alignment

6. TechnicalAnalysis-(*).xls - QC Report for the ELAND alignment - Please refer here for details - http://jura.wi.mit.edu/genomecorewiki/index.php/QCOutputFormat