Comparative Studies on Genomic Sequences Compressibility of Different Organisms
Loading...
Files
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The eukaryotic DNA is highly complex depending upon the organisms. The genome complexity can be analysed by reassociation kinetics which in turn is related to the genomic contents such as coding sequences and repeat sequences. The coding sequences are usually unique i.e., they do not contain repetitive sequences whereas non coding sequences usually consist of repetitive DNA sequences. In this study genome complexity has been studied by DNA sequence compression. Lossless sequence compressibility depends upon the repetition of sequences. It has been found that in comparison with RLE method, LZ algorithm is more efficient in sequence compression. DNA sequence compression by either of the two methods could not show much difference among various genomes with varying evolutionary lineages. However, Percentage Compression Ratio (PCR) exhibited significant correlations with G+C content and sequence heterogeneity of different genomes studied.
Description
Master of Science -Biotechnology
