Comparative Studies on Genomic Sequences Compressibility of Different Organisms

dc.contributor.authorKaur, Japleen
dc.contributor.supervisorHanda, Vikas
dc.date.accessioned2017-08-16T11:55:01Z
dc.date.available2017-08-16T11:55:01Z
dc.date.issued2017-08-16
dc.descriptionMaster of Science -Biotechnologyen_US
dc.description.abstractThe eukaryotic DNA is highly complex depending upon the organisms. The genome complexity can be analysed by reassociation kinetics which in turn is related to the genomic contents such as coding sequences and repeat sequences. The coding sequences are usually unique i.e., they do not contain repetitive sequences whereas non coding sequences usually consist of repetitive DNA sequences. In this study genome complexity has been studied by DNA sequence compression. Lossless sequence compressibility depends upon the repetition of sequences. It has been found that in comparison with RLE method, LZ algorithm is more efficient in sequence compression. DNA sequence compression by either of the two methods could not show much difference among various genomes with varying evolutionary lineages. However, Percentage Compression Ratio (PCR) exhibited significant correlations with G+C content and sequence heterogeneity of different genomes studied.en_US
dc.identifier.urihttp://hdl.handle.net/10266/4670
dc.language.isoenen_US
dc.subjectDNA sequenceen_US
dc.subjectRLEen_US
dc.subjectLZ algorithmen_US
dc.subjectsequence compressionen_US
dc.subjectrepetitive sequencesen_US
dc.titleComparative Studies on Genomic Sequences Compressibility of Different Organismsen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
4670.pdf
Size:
3.83 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.03 KB
Format:
Item-specific license agreed upon to submission
Description: