000 02404nam a22002777a 4500
003 KOHA
005 20230712090642.0
008 230712d2023 cy ||||| m||| 00| 0 eng d
040 _aCY-NiCIU
_beng
_cCY-NiCIU
_erda
041 _aeng
090 _aYL 2949
_bA48 2023
100 1 _aAfuohsaa, Boris Neba
245 1 0 _aDNA sequence analysis based on repetitive patterns /
_cBoris Neba Afuohsaa ; supervisor: Prof. Dr. Ahmet Adalıer
264 _c2023
300 _aviii, 65 sheets;
_c31 cm.
_eIncludes CD
336 _2rdacontent
_atext
_btxt
337 _2rdamedia
_aunmediated
_bn
338 _2rdacarrier
_avolume
_bnc
502 _aThesis (MSc) - Cyprus International University. Institute of Graduate Studies and Research. Information Technologies Department
504 _aIncludes bibliography (sheets 51-55)
520 _aABSTRACT The volume of DNA data held in databanks worldwide keeps growing, and DNA data sets increase in size as more data are gathered. Storing large DNA data sets raises two major problems: the space needed to store them and the time required to encode and decode them. To store and process DNA data effectively, encryption and compression algorithms have been developed. The aim of this thesis is to implement a DNA sequence compression algorithm based on repetitive patterns. The technique not only reduces the number of characters in the DNA sequence but also converts the sequence to binary, reducing the storage cost of each character from 1 byte to 2 bits. The experiments are conducted using DNA data from the National Center for Biotechnology Information (NCBI). Compression also reduces the time needed to transmit the DNA data. The results show an average compression ratio of 0.77, a compression of 1.8 bpb, and an average compression time of 3 seconds. The average space saving was 77%. The disarrangement of the DNA sequence of any living organism could therefore be easily determined using this type of DNA sequence compression. Keywords: Algorithm, Compression, DNA, Repetitive Patterns, Sequence.
650 0 _aAlgorithms
_vDissertations, Academic
650 0 _aDNA
_vDissertations, Academic
700 1 _aAdalıer, Ahmet
_esupervisor
942 _2ddc
_cTS
999 _c290552
_d290552