The Impact of Near-Duplicate Documents on Information Retrieval Evaluation