Skip to contents

Output potential duplicates determined as requiring manual check by the decision tree

Usage

dup_screen_pairwise(ls_df, id_dup_pair)

Arguments

ls_df

A list of data frames containing the partitioned dataset (i.e., output #1 of simi_ptn_pair()).

id_dup_pair

A data frame listing record id and partition id of duplicate pairs after getting automatic decisions (i.e., output of decision_tree_pairwise()).

Value

A data frame of duplicate pairs for manual review. Pairing is indicated in "match" column.

Examples

if (FALSE) {
df_check <- dup_screen_pairwise(ls_df, id_dup_pair)
}