Manuscript Number : IJSRSET229425
Big Data Backup Deduplication : A Survey
Authors: Hashem Bedr Jehlol, Loay E. George
The explosive growth of data such as images, video, audio, and text has caused significant problems in data storage and retrieval. Companies and organizations spend heavily to store and manage this data, so there is an urgent need for efficient technologies to handle it. Data deduplication, one of the essential data reduction techniques, is the most effective approach for eliminating redundant data: by removing duplicates, it reduces bandwidth consumption, hard disk drive utilization, and backup costs. This paper surveys the research literature on data deduplication and the various techniques that researchers have proposed. It summarizes the main concepts and techniques related to deduplication and the methods used to improve storage efficiency. The stages of the deduplication process are examined in detail, including data chunking, hashing, indexing, and writing. The study also discusses the most critical problems facing data deduplication algorithms.
Hashem Bedr Jehlol
Keywords: Big Data, Data Deduplication, Data Reduction, Redundant Data, Data Chunking, Hashing.
Publication Details
Published in :
Volume 9 | Issue 4 | July-August 2022
Iraqi Commission for Computers and Informatics, Informatics Institute of Postgraduate Studies, Baghdad-Iraq
Loay E. George
University of Information Technology and Communication (UoITC), Baghdad-Iraq
Date of Publication :
2022-08-30
License: This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) :
174-191
Publisher : Technoscience Academy
Journal URL :
https://res.ijsrset.com/IJSRSET229425