Merged PDF file shown as duplicate

I noticed that if I merge two large PDF documents (around 80 MB each), the resulting file and the first of the two originals are marked as duplicates. This should not happen, since only the initial part of the merged document matches the first source PDF. How are two files determined to be duplicates? They should have to be identical; otherwise this could lead to erasing files that are unique in the DB.
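
To make clear what I mean by "identical", here is a minimal Python sketch of the strict check I would expect: two files count as duplicates only if their sizes match and a hash of their entire contents matches. This is purely an illustration and an assumption on my part, not how the application actually detects duplicates; the function names `file_digest` and `are_identical` are hypothetical.

```python
import hashlib
import os


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of the whole file, read in 1 MiB chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Read until f.read() returns an empty bytes object (end of file).
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def are_identical(path_a, path_b):
    """Treat two files as duplicates only if their full contents match."""
    # Cheap early exit: files of different sizes cannot be identical.
    if os.path.getsize(path_a) != os.path.getsize(path_b):
        return False
    return file_digest(path_a) == file_digest(path_b)
```

With a check like this, a merged document that merely starts with the same content as the first source file would not be reported as a duplicate, because the comparison covers the whole file rather than only its beginning.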