DistCp2
In this post, we cover the remaining part of the Hadoop DistCp command. DistCp has one disadvantage: it has no option to merge data. The available modes either copy only the part that is missing or overwrite the whole data. Newer versions of DistCp add an -append option, which can be used together with -update, but even this works only as part of the update operation. To skip the CRC check between source and target files, the -skipcrccheck option can be used with Hadoop DistCp. There are a few limitations with the Hadoop DistCp command, as listed below. When copying data from multiple sources, DistCp fails with an error if two sources collide, but collisions at the destination can be resolved by using certain options. By default, files that already exist at the destination are skipped. ...
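To make the interaction between these options concrete, here is a minimal sketch of a helper that assembles a DistCp command line. The function name build_distcp_args and the cluster paths are hypothetical and used only for illustration. The flags -update, -overwrite, -append and -skipcrccheck are the DistCp options discussed above. The validation rules encoded here (for example, that -append is only accepted together with -update) reflect our understanding of how DistCp checks its options, so treat them as assumptions and confirm them against the documentation for your Hadoop version.

```python
from typing import List


def build_distcp_args(
    sources: List[str],
    target: str,
    update: bool = False,
    overwrite: bool = False,
    append: bool = False,
    skip_crc_check: bool = False,
) -> List[str]:
    """Build the argument list for a 'hadoop distcp' call (hypothetical helper).

    The combination rules below are assumptions about DistCp's option
    validation; verify them for the Hadoop version you run.
    """
    if not sources:
        raise ValueError("at least one source path is required")
    if update and overwrite:
        raise ValueError("-update and -overwrite cannot be combined")
    if append and not update:
        raise ValueError("-append is only valid together with -update")
    if skip_crc_check and not update:
        raise ValueError("-skipcrccheck is only valid together with -update")
    if append and skip_crc_check:
        raise ValueError("-append cannot be combined with -skipcrccheck")

    args = ["hadoop", "distcp"]
    if update:
        args.append("-update")
    if overwrite:
        args.append("-overwrite")
    if append:
        args.append("-append")
    if skip_crc_check:
        args.append("-skipcrccheck")
    # All source paths come first; the target path is always last.
    args.extend(sources)
    args.append(target)
    return args


if __name__ == "__main__":
    # Hypothetical namenode addresses and paths, for illustration only.
    cmd = build_distcp_args(
        ["hdfs://nn1:8020/source/a", "hdfs://nn1:8020/source/b"],
        "hdfs://nn2:8020/target",
        update=True,
        append=True,
    )
    print(" ".join(cmd))
```

With neither update nor overwrite set, the helper emits a plain copy, which corresponds to the default behaviour described above: files already present at the destination are skipped. The resulting list can be passed to subprocess.run on a machine where the hadoop client is installed.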