All functions

bib_example_complete bib_example_small
Sample data frames containing bibliographic information
decision_tree_adj()
Make decisions for potential duplicates
decision_tree_pairwise()
Make decisions for potential duplicates
dedu_exact()
Find duplicates by exact match and remove them
dup_find_exact()
Find duplicates by exact match
dup_find_fuzzy_adj()
Find duplicates by fuzzy match of string similarity between adjacent rows
dup_find_fuzzy_pairwise()
Find duplicates by fuzzy match of string similarity between pairwise records
dup_resolve_pairwise()
Manually resolve potential duplicate pairs requiring "check"
dup_rm_adj()
Remove duplicates between adjacent rows
dup_rm_pairwise()
Remove duplicates in pairwise comparison
dup_screen_pairwise()
Output potential duplicates that the decision tree flags as requiring a manual check
extract_initialism()
Extract initialism: keep the first letter of each word and delete all the other letters
norm_abstract()
Clean and normalize abstract in bibliography
norm_author()
Clean and normalize author in bibliography
norm_df()
Clean and normalize the entire data frame with bibliographic information
norm_journal()
Clean and normalize journal in bibliography
norm_title()
Clean and normalize title in bibliography
norm_transliteration()
Transliterate a text file
plot_simi_dist()
Generate similarity distribution plot
simi_order_adj()
Calculate string similarity between adjacent rows
simi_ptn_pair()
Calculate pairwise string similarity
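
Example: initialism extraction

The description of extract_initialism() in the list above (keep the first letter of each word, drop the rest) can be illustrated with a small base R sketch. This is not the package's implementation: the helper name initialism_sketch() is hypothetical, and it assumes that words are separated by whitespace, which the package function may handle differently.

```r
# Hypothetical helper for illustration only; not the package's extract_initialism().
# Assumption: "words" are runs of characters separated by whitespace.
initialism_sketch <- function(x) {
  # Trim the ends, then split each string on runs of whitespace.
  words <- strsplit(trimws(x), "\\s+")
  # Keep the first character of every non-empty word and paste them together.
  vapply(words, function(w) {
    w <- w[nzchar(w)]
    paste(substr(w, 1, 1), collapse = "")
  }, character(1))
}

initialism_sketch(c("Journal of Applied Ecology", "Nature"))
```

Under these assumptions, the call returns "JoAE" and "N". Case is left unchanged, so any lowercasing or punctuation stripping would need to happen beforehand.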