Comparative Studies on Genomic Sequences Compressibility of Different Organisms
| dc.contributor.author | Kaur, Japleen | |
| dc.contributor.supervisor | Handa, Vikas | |
| dc.date.accessioned | 2017-08-16T11:55:01Z | |
| dc.date.available | 2017-08-16T11:55:01Z | |
| dc.date.issued | 2017-08-16 | |
| dc.description | Master of Science -Biotechnology | en_US |
| dc.description.abstract | The eukaryotic DNA is highly complex depending upon the organisms. The genome complexity can be analysed by reassociation kinetics which in turn is related to the genomic contents such as coding sequences and repeat sequences. The coding sequences are usually unique i.e., they do not contain repetitive sequences whereas non coding sequences usually consist of repetitive DNA sequences. In this study genome complexity has been studied by DNA sequence compression. Lossless sequence compressibility depends upon the repetition of sequences. It has been found that in comparison with RLE method, LZ algorithm is more efficient in sequence compression. DNA sequence compression by either of the two methods could not show much difference among various genomes with varying evolutionary lineages. However, Percentage Compression Ratio (PCR) exhibited significant correlations with G+C content and sequence heterogeneity of different genomes studied. | en_US |
| dc.identifier.uri | http://hdl.handle.net/10266/4670 | |
| dc.language.iso | en | en_US |
| dc.subject | DNA sequence | en_US |
| dc.subject | RLE | en_US |
| dc.subject | LZ algorithm | en_US |
| dc.subject | sequence compression | en_US |
| dc.subject | repetitive sequences | en_US |
| dc.title | Comparative Studies on Genomic Sequences Compressibility of Different Organisms | en_US |
| dc.type | Thesis | en_US |
