000 | 02482nam a22003017a 4500 | ||
---|---|---|---|
003 | KOHA | ||
005 | 20231103122052.0 | ||
008 | 231103d2023 cy ||||| m||| 00| 0 eng d | ||
040 | _aCY-NiCIU _beng _cCY-NiCIU _erda | ||
041 | _aeng | ||
090 | _aYL 3117 _bA46 2023 | ||
100 | 1 | _aAfolabi, Gbolahan Salam | |
245 | 1 | 0 | _aDNA SEQUENCE ANALYSIS BASED ON DNABIT ALGORITHM _cGBOLAHAN SALAM AFOLABI; SUPERVISOR: PROF. DR. AHMET ADALIER |
264 | _c2023 | ||
300 | _aviii, 71 sheets; _c31 cm. _e1 CD-ROM | ||
336 | _2rdacontent _atext _btxt | ||
337 | _2rdamedia _aunmediated _bn | ||
338 | _2rdacarrier _avolume _bnc | ||
502 | _aThesis (MSc) - Cyprus International University. Institute of Graduate Studies and Research. Information Technologies Department | ||
504 | _aIncludes bibliography (sheets 46-50) | ||
520 | _aABSTRACT: The amount of DNA data changes globally in different data banks. DNA datasets get bigger because more of them are being collected. Two significant problems result from the difficulty of keeping huge DNA data sets: the amount of space needed to store these data sets and the amount of time needed to encode and decode them. Algorithms for encryption, decryption and compression have been developed in order to store and process DNA data effectively. DNA compression entails utilizing fewer bits than the original representation to encode information. This thesis introduces a DNA sequence compression approach based on the DNABIT algorithm. By evaluating the pre-defined protein sequences and replacing them with a binary entity, the compression method decreases the number of characters in a DNA sequence. This study was carried out utilizing DNA information from the National Center for Biotechnology Information (NCBI). We were able to achieve an average compression ratio of 0.797 and an average space saving of 80% with minimal compression time. The DNA sequence of any living organism, virus or bacterium can therefore be easily determined with the use of this type of compression of DNA sequences. Keywords: Algorithm, Binary, Bit, Byte, Conversion, Compression, Decryption, DNA, Encryption, Patterns, Repetitive, Sequence. | ||
650 | 0 | _aAlgorithms _vDissertations, Academic | |
650 | 0 | _aConversion _vDissertations, Academic | |
650 | 0 | _aData compression (Computer science) _vDissertations, Academic | |
650 | 0 | _aDNA _vDissertations, Academic | |
700 | 1 | _aAdalıer, Ahmet | |
942 | _2ddc _cTS | ||
999 | _c291646 _d291646 | ||