
Journal names are normalized with the following string operations:

  • convert letters to lowercase

  • remove punctuation

  • remove English stop words

  • trim whitespace from the start and end of the string, and collapse repeated whitespace inside the string
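
The steps above can be approximated in base R. The sketch below is illustrative only and is not the package's source: the function name norm_journal_sketch and the very short stop_words vector are assumptions made for this example, and the real function may rely on a much longer English stop-word list and different helpers.

```r
# Hypothetical re-implementation for illustration; not the package's code.
# Assumption: a deliberately tiny stop-word list. A real English list is longer.
stop_words <- c("a", "an", "and", "for", "in", "of", "on", "the", "to")

norm_journal_sketch <- function(journal) {
  # 1. Convert letters to lowercase
  x <- tolower(journal)
  # 2. Remove punctuation
  x <- gsub("[[:punct:]]+", "", x)
  # 3. Remove stop words, matching whole words only
  pattern <- paste0("\\b(", paste(stop_words, collapse = "|"), ")\\b")
  x <- gsub(pattern, "", x, perl = TRUE)
  # 4. Collapse repeated whitespace, then trim both ends
  x <- gsub("\\s+", " ", x, perl = TRUE)
  trimws(x)
}

norm_journal_sketch(c("Proteins: Structure, Function and Bioinformatics",
                      "Zoonoses and Public Health"))
```

Punctuation is removed before stop words so that a token such as "and," is still recognized as a stop word, and whitespace is collapsed last because the earlier steps can leave double spaces behind.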

Usage

norm_journal(journal)

Arguments

journal

A character vector (e.g., a column in a data frame)

Value

Normalized character vector

Examples

journal <- c("Proteins: Structure, Function and Bioinformatics",
             "Zoonoses Public Health",
             "Zoonoses and Public Health")

norm_journal(journal)
#> [1] "proteins structure function bioinformatics"
#> [2] "zoonoses public health"                    
#> [3] "zoonoses public health"