Function reference
-
bib_example_complete
bib_example_small
- Sample data frames containing bibliographic information
-
decision_tree_adj()
- Make decisions for potential duplicates
-
decision_tree_pairwise()
- Make decisions for potential duplicates
-
dedu_exact()
- Find duplicates by exact match and remove them
-
dup_find_exact()
- Find duplicates by exact match
-
dup_find_fuzzy_adj()
- Find duplicates by fuzzy match of string similarity between adjacent rows
-
dup_find_fuzzy_pairwise()
- Find duplicates by fuzzy match of string similarity between pairwise records
-
dup_resolve_pairwise()
- Manually resolve potential duplicate pairs requiring "check"
-
dup_rm_adj()
- Remove duplicates between adjacent rows
-
dup_rm_pairwise()
- Remove duplicates in pairwise comparison
-
dup_screen_pairwise()
- Output potential duplicates determined as requiring manual check by the decision tree
-
extract_initialism()
- Extract initialism Extract 1st letter of each word and delete all the other letters
-
norm_abstract()
- Clean and normalize abstract in bibliography
-
norm_author()
- Clean and normalize author in bibliography
-
norm_df()
- Clean and normalize the entire data frame with bibliographic information
-
norm_journal()
- Clean and normalize journal in bibliography
-
norm_title()
- Clean and normalize title in bibliography
-
norm_transliteration()
- Transliterate a text file
-
plot_simi_dist()
- Generate similarity distribution plot
-
simi_order_adj()
- Calculate string similarity between adjacent rows
-
simi_ptn_pair()
- Calculate pairwise string similarity