Skip to contents

Change decision to "duplicate" or "not duplicate" according to manual review results.

Usage

dup_resolve_pairwise(id_dup_pair, df_check, match_index, result)

Arguments

id_dup_pair

A data frame listing record id and partition id of duplicate pairs after getting automatic decisions (i.e., output of decision_tree_pairwise()).

df_check

A data frame of duplicate pairs for manual review (i.e., output of dup_screen_pairwise()).

match_index

Numeric: a vector of "match" numbers to change the decision for.

result

Character: "duplicate" or "not duplicate".

Value

A data frame: the input id_dup_pair with "decision" column modified accordingly.

Examples

if (FALSE) {
id_dup_pair <- dup_resolve_pairwise(
id_dup_pair,
df_check,
match_index = c(1, 2, 3, 4),
result = "not duplicate")
}