University of Bahrain
Scientific Journals

Deduplication using Modified Dynamic File Chunking for Big Data Mining

dc.contributor.author Taha Ahmed, Saja
dc.date.accessioned 2023-07-16T10:38:47Z
dc.date.available 2023-07-16T10:38:47Z
dc.date.issued 2023-07-16
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/5001
dc.description.abstract Unpredictable data growth makes data management necessary for making optimum use of storage capacity. This study proposes a new strategy for data deduplication. The fixed-size deduplication algorithm splits data into blocks of a predefined size. Its main drawback is that inserting new content at the beginning or middle of a file shifts all subsequent content away from its original position. As a result, the generated chunks have new hash values, which lowers the deduplication ratio. To overcome this drawback, this study proposes using multiple characters as content-defined chunking breakpoints; these depend mostly on the file's internal representation and produce variable chunk sizes. The experimental results show a significant improvement in the redundancy removal ratio on the Linux dataset. A comparison of fixed and dynamic deduplication shows that double-character chunking yields a smaller average chunk size and achieves a much higher deduplication ratio. en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.subject big data en_US
dc.subject data mining en_US
dc.subject deduplication en_US
dc.subject dynamic chunking en_US
dc.subject fixed chunking en_US
dc.title Deduplication using Modified Dynamic File Chunking for Big Data Mining en_US
dc.identifier.doi http://dx.doi.org/10.12785/ijcds/160105
dc.volume 16 en_US
dc.issue 1 en_US
dc.pagestart 57 en_US
dc.pageend 66 en_US
dc.contributor.authorcountry Iraq en_US
dc.contributor.authoraffiliation Ministry of Education, Vocational Education Department en_US
dc.source.title International Journal of Computing and Digital Systems en_US
dc.abbreviatedsourcetitle IJCDS en_US
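
The contrast the abstract draws between fixed-size and content-defined chunking can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names, the 8-byte block size, and the two-byte breakpoint `;\n` are all assumptions chosen for illustration, since the record does not state which characters the study uses. The sketch inserts a short prefix at the front of a file and counts how many chunks of the edited file still match chunks of the original.

```python
import hashlib


def fixed_chunks(data: bytes, size: int) -> list[bytes]:
    """Split data into blocks of a predefined size (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def content_defined_chunks(data: bytes, marker: bytes = b";\n") -> list[bytes]:
    """Cut a chunk after every occurrence of a two-byte marker.

    The marker value is a hypothetical choice, not taken from the paper.
    Chunk sizes vary because boundaries depend on the content itself.
    """
    chunks = []
    start = 0
    while True:
        idx = data.find(marker, start)
        if idx == -1:
            break
        end = idx + len(marker)
        chunks.append(data[start:end])
        start = end
    if start < len(data):
        chunks.append(data[start:])  # trailing bytes with no marker
    return chunks


def shared_chunks(old: list[bytes], new: list[bytes]) -> int:
    """Count chunks of `new` whose SHA-1 hash already occurs among `old`."""
    known = {hashlib.sha1(c).digest() for c in old}
    return sum(1 for c in new if hashlib.sha1(c).digest() in known)


original = b"int a = 1;\nint b = 2;\nint c = 3;\nint d = 4;\n"
edited = b"// new\n" + original  # insertion at the front of the file

fixed_hits = shared_chunks(fixed_chunks(original, 8), fixed_chunks(edited, 8))
cdc_hits = shared_chunks(content_defined_chunks(original),
                         content_defined_chunks(edited))
print("fixed-size chunks reused:", fixed_hits)
print("content-defined chunks reused:", cdc_hits)
```

In this toy input the front insertion shifts every fixed-size block, so none of them is reused, while the content-defined boundaries realign after the first marker and three of the four chunks are reused. Real systems usually also bound the minimum and maximum chunk size, which this sketch omits.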

