Shiny app for locating and excluding duplicated entries in a dataset
This is a simple app for identifying, displaying and screening potential duplicates in bibliographic data.
screen_duplicates(x, max_file_size)
x | An (optional) object of class
max_file_size | Optional argument to set the maximum file size (in MB) that the app will accept.
This app is effectively a wrapper for find_duplicates, with the added option to manually screen pairs of duplicates to check the results. Consequently, it is a more reliable way than extract_unique_references of dealing with the duplicates identified by find_duplicates, and it also lets you test whether that function has returned sensible results for a given dataset.
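For comparison, the non-interactive route that this app wraps can be sketched as follows. This is a minimal illustration, not the app's internal code: it assumes the revtools package is installed, relies on the default arguments of find_duplicates, and appends five copied rows purely so that there is something to detect. The object names x_duplicated, matches and x_unique are chosen for this example only.

```r
# Non-interactive sketch; assumes the revtools package is installed.
if (requireNamespace("revtools", quietly = TRUE)) {
  file_location <- system.file(
    "extdata",
    "avian_ecology_bibliography.ris",
    package = "revtools")
  x <- revtools::read_bibliography(file_location)

  # Append five copied rows so there are known duplicates to find
  # (illustrative only; real data would be used as-is).
  x_duplicated <- rbind(x, x[1:5, ])

  # One group ID per row; rows sharing an ID are candidate duplicates.
  matches <- revtools::find_duplicates(x_duplicated)

  # Collapse each group to a single entry without manual screening.
  x_unique <- revtools::extract_unique_references(
    x_duplicated,
    matches = matches)

  # Compare the number of entries before and after removing duplicates.
  print(nrow(x_duplicated))
  print(nrow(x_unique))
}
```

Nothing in this route checks whether the matched pairs are true duplicates, which is the step that screen_duplicates adds.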
This function launches a Shiny app in the user's default browser, allowing the user to customize the parameters for duplicate detection and visualise the results.
screen_titles or screen_abstracts for manual screening of individual articles.
# to run the app and upload data interactively
## Not run: screen_duplicates()

# or to specify data from the workspace
file_location <- system.file(
  "extdata",
  "avian_ecology_bibliography.ris",
  package = "revtools")
x <- read_bibliography(file_location)

# to run the app using these data:
## Not run: screen_duplicates(x)

# or to run the app & save results to the workspace:
## Not run: result <- screen_duplicates(x)