In a typical enterprise backup situation, compression may reduce backup size by a ratio of 2:1 or 3:1, while deduplication can reduce backup size by up to 25:1, depending on how much duplicate data is in the systems. Also, compression doesn’t get rid of duplicated data - the storage system could still contain multiple copies of compressed files.ĭeduplication often has a larger impact on backup file size than compression. By contrast, deduplication only eliminates extra copies of data none of the original data is lost. Some compression is lossless, meaning that no data is lost in the process, but “lossy” compression, which is frequently used with audio and video files, actually deletes some of the less-important data included in a file in order to save space. While deduplication eliminates redundant data, compression uses algorithms to save data more concisely. Compressionĭeduplication is sometimes confused with compression, another technique for reducing storage requirements. Eliminating extra copies of data saves money not only on direct disk hardware costs, but also on related costs, like electricity, cooling, maintenance, floor space, etc.ĭeduplication can also reduce the amount of network bandwidth required for backup processes, and in some cases, it can speed up the backup and recovery process. While deduplication can be applied to data stored on tape, the relatively high costs of disk storage make deduplication a very popular option for disk-based systems. NetApp reports that in some cases, deduplication can reduce storage requirements up to 95 percent, but the type of data you’re trying to deduplicate and the amount of file sharing your organization does will influence your own deduplication ratio. The primary benefit of data deduplication is that it reduces the amount of disk or tape that organizations need to buy, which in turn reduces costs. Often the phrase data deduplication is used as a synonym for block-level or variable length deduplication. Deduplication can also take place on the block level, eliminating duplicated blocks of data that occur in non-identical files.īlock-level deduplication frees up more space than SIS, and a particular type known as variable block or variable length deduplication has become very popular. This kind of deduplication is sometimes called file-level deduplication or single instance storage (SIS). In its simplest form, deduplication takes place on the file level that is, it eliminates duplicate copies of the same file. Companies frequently use deduplication in backup and disaster recovery applications, but it can be used to free up space in primary storage as well. For example, the same file may be saved in several different places by different users, or two or more files that aren’t identical may still include much of the same data.ĭeduplication eliminates these extra copies by saving just one copy of the data and replacing the other copies with pointers that lead back to the original copy. In most organizations, the storage systems contain duplicate copies of many pieces of data. Data deduplication is a technique used to reduce the amount of storage space an organization needs to save its data.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |