Package: textreuse Type: Package Title: Detect Text Reuse and Document Similarity Version: 1.0.1 Date: 2026-05-06 Authors@R: c( person("Lincoln", "Mullen", role = "aut", comment = c(ORCID = "0000-0001-5103-6917") ), person("Yaoxiang", "Li", role = c("aut", "cre"), email = "liyaoxiang@outlook.com", comment = c(ORCID = "0000-0001-9200-1016"))) Description: Tools for measuring similarity among documents and detecting passages which have been reused. Implements shingled n-gram, skip n-gram, and other tokenizers; similarity/dissimilarity functions; pairwise comparisons; minhash and locality sensitive hashing algorithms; and a version of the Smith-Waterman local alignment algorithm suitable for natural language. License: MIT + file LICENSE URL: https://docs.ropensci.org/textreuse/, https://github.com/ropensci/textreuse BugReports: https://github.com/ropensci/textreuse/issues VignetteBuilder: knitr Depends: R (>= 3.1.1) Imports: assertthat (>= 0.1), digest (>= 0.6.8), dplyr (>= 0.8.0), NLP (>= 0.1.8), Matrix, Rcpp (>= 0.12.0), RcppProgress (>= 0.1), stringr (>= 1.0.0), tibble (>= 3.0.1), tidyr (>= 1.0.0) Suggests: testthat (>= 0.11.0), knitr (>= 1.11), rmarkdown (>= 0.8), covr LinkingTo: BH, Rcpp, RcppProgress RoxygenNote: 7.3.2 Encoding: UTF-8 Config/pak/sysreqs: libicu-dev Repository: https://ropensci.r-universe.dev Date/Publication: 2026-05-06 16:00:52 UTC RemoteUrl: https://github.com/ropensci/textreuse RemoteRef: master RemoteSha: 6f8cbe38029502e7cc6baeb169f587a6da0da655 NeedsCompilation: yes Packaged: 2026-07-01 08:15:44 UTC; root Author: Lincoln Mullen [aut] (ORCID: ), Yaoxiang Li [aut, cre] (ORCID: ) Maintainer: Yaoxiang Li