Skip to contents

Make decisions for potential duplicates

Usage

decision_tree_pairwise(ls_df, id_dup_pair)

Arguments

ls_df

A list of data frames containing the partitioned dataset (i.e., output #1 of simi_ptn_pair()).

id_dup_pair

A data frame listing record id and partition id of duplicate pairs (i.e., output of dup_find_fuzzy_pairwise()).

Value

The input id_dup_pair with "decision" column added.

Examples

if (FALSE) {
id_dup_pair <- decision_tree_pairwise(ls_df, id_dup_pair)
}