Ψ-RA: a parallel sparse index for genomic read alignment

Kulekci, M. Oguzhan; Hon, Wing-Kai; Shah, Rahul; Vitter, Jeffrey Scott; Xu, Bojian

dc.contributor.author	Kulekci, M. Oguzhan
dc.contributor.author	Hon, Wing-Kai
dc.contributor.author	Shah, Rahul
dc.contributor.author	Vitter, Jeffrey Scott
dc.contributor.author	Xu, Bojian
dc.date.accessioned	2015-11-11T15:15:01Z
dc.date.available	2015-11-11T15:15:01Z
dc.date.issued	2011-07-27
dc.identifier.citation	Külekci, M. Oğuzhan, Wing-Kai Hon, Rahul Shah, Jeffrey Scott Vitter, and Bojian Xu. "Ψ-RA: A Parallel Sparse Index for Genomic Read Alignment." BMC Genomics 12.Suppl 2 (2011). http://dx.doi.org/10.1186/1471-2164-12-S2-S7.	en_US
dc.identifier.uri	http://hdl.handle.net/1808/18877
dc.description	This is the published version. Copyright © 2011 Oğuzhan Külekci et al; licensee BioMed Central Ltd.	en_US
dc.description.abstract	Background Genomic read alignment involves mapping (exactly or approximately) short reads from a particular individual onto a pre-sequenced reference genome of the same species. Because all individuals of the same species share the majority of their genomes, short reads alignment provides an alternative and much more efficient way to sequence the genome of a particular individual than does direct sequencing. Among many strategies proposed for this alignment process, indexing the reference genome and short read searching over the index is a dominant technique. Our goal is to design a space-efficient indexing structure with fast searching capability to catch the massive short reads produced by the next generation high-throughput DNA sequencing technology. Results We concentrate on indexing DNA sequences via sparse suffix arrays (SSAs) and propose a new short read aligner named Ψ-RA (PSI-RA: parallel sparse index read aligner). The motivation in using SSAs is the ability to trade memory against time. It is possible to fine tune the space consumption of the index based on the available memory of the machine and the minimum length of the arriving pattern queries. Although SSAs have been studied before for exact matching of short reads, an elegant way of approximate matching capability was missing. We provide this by defining the rightmost mismatch criteria that prioritize the errors towards the end of the reads, where errors are more probable. Ψ-RA supports any number of mismatches in aligning reads. We give comparisons with some of the well-known short read aligners, and show that indexing a genome with SSA is a good alternative to the Burrows-Wheeler transform or seed-based solutions. Conclusions Ψ-RA is expected to serve as a valuable tool in the alignment of short reads generated by the next generation high-throughput sequencing technology. Ψ-RA is very fast in exact matching and also supports rightmost approximate matching. The SSA structure that Ψ-RA is built on naturally incorporates the modern multicore architecture and thus further speed-up can be gained. All the information, including the source code of Ψ-RA, can be downloaded at: http://www.busillis.com/o_kulekci/PSIRA.zip webcite.	en_US
dc.publisher	BioMed Central	en_US
dc.title	Ψ-RA: a parallel sparse index for genomic read alignment	en_US
dc.type	Article
kusw.kuauthor	Vitter, Jeffrey
kusw.kudepartment	Electrical Engr & Comp Science	en_US
dc.identifier.doi	10.1186/1471-2164-12-S2-S7
kusw.oaversion	Scholarly/refereed, publisher version
kusw.oapolicy	This item meets KU Open Access policy criteria.
dc.rights.accessrights	openAccess

Files in this item

Name:: Kulekci_genomic_read_alignment ...
Size:: 285.1Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

The University of Kansas prohibits discrimination on the basis of race, color, ethnicity, religion, sex, national origin, age, ancestry, disability, status as a veteran, sexual orientation, marital status, parental status, gender identity, gender expression and genetic information in the University’s programs and activities. The following person has been designated to handle inquiries regarding the non-discrimination policies: Director of the Office of Institutional Opportunity and Access, IOA@ku.edu, 1246 W. Campus Road, Room 153A, Lawrence, KS, 66045, (785)864-6414, 711 TTY.