Remove duplicates in pairwise comparison
dup_rm_pairwise.Rd
This function ensures that the most recent record in each duplicate pair is kept. When a peer-reviewed publication coexists with a preprint or a conference proceeding, the peer-reviewed version is kept.
Arguments
- ls_df
A list of data frames containing the partitioned dataset (i.e., output #1 of simi_ptn_pair()).
- id_dup_pair
A data frame listing record id and partition id of duplicate pairs after resolving checked duplicates (i.e., output of dup_resolve_pairwise()).
- to_dataframe
Logical: Should we merge the list of data frames into a single data frame? Defaults to TRUE.
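Examples
The retention rule in the description can be illustrated with a minimal base-R sketch. This is not the implementation of dup_rm_pairwise(): the helper keep_preferred() and the columns id, year, and type are hypothetical names chosen for illustration, and the sketch assumes that peer-reviewed status takes precedence over recency when the two conflict.

```r
# Illustrative sketch only. keep_preferred() and the columns id, year, and
# type are hypothetical; they are not part of the package's API or data.
keep_preferred <- function(pair) {
  # Assumption: type == "article" marks the peer-reviewed version.
  is_peer_reviewed <- pair$type == "article"
  if (any(is_peer_reviewed) && !all(is_peer_reviewed)) {
    # Mixed pair: keep the peer-reviewed record regardless of year.
    return(pair[is_peer_reviewed, , drop = FALSE])
  }
  # Both records are of the same kind: keep the most recent one.
  pair[which.max(pair$year), , drop = FALSE]
}

# A preprint paired with its peer-reviewed article.
pair_mixed <- data.frame(
  id = c(1, 2),
  year = c(2021, 2020),
  type = c("preprint", "article"),
  stringsAsFactors = FALSE
)

# Two peer-reviewed records that differ only in year.
pair_same <- data.frame(
  id = c(3, 4),
  year = c(2018, 2019),
  type = c("article", "article"),
  stringsAsFactors = FALSE
)

kept_mixed <- keep_preferred(pair_mixed)  # the article row (id 2)
kept_same <- keep_preferred(pair_same)    # the more recent row (id 4)
```

In the first pair the article is kept even though the preprint carries a later year; in the second pair, where both records are of the same type, the more recent record is kept.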