Effect of Base Composition on DNA Sequence Traits
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The eukaryotic DNA is highly complex depending upon the organisms. The genome complexity
can be analyzed by re-association kinetics which in turn is related to the genomic contents such
as coding sequences and repeat sequences. The coding sequences are usually unique i.e., they do
not contain repetitive sequences whereas non coding sequences usually consist of repetitive
DNA sequences. In this study, genome complexity has been studied by DNA sequence
compression. Lossless sequence compressibility depends upon the repetition of sequences. It has
been found that in comparison with Run Length Encoding (RLE) method DNA sequence
compression by either of the two methods could not show much difference among various
genomes with varying evolutionary lineages.
However, Percentage Compression Ratio (PCR) exhibited significant correlations with G+C
content and sequence heterogeneity of different genomes studied.
Description
Master of Science- Biotechnology
