Skip to contents

A deterministic, LLM-free screening method that uses regular expression matching on work titles and abstracts. Works matching any include_terms (case-insensitive) and none of the exclude_terms are classified as "include"; works matching an exclude term are "exclude"; works matching no include term are "exclude".

Usage

sm_screen_regex(
  corpus,
  include_terms,
  exclude_terms = NULL,
  call = rlang::caller_env()
)

Arguments

corpus

An sm_corpus object.

include_terms

Character vector of regex patterns. A work must match at least one to be included.

exclude_terms

Character vector of regex patterns. A work matching any is excluded. Default NULL (no exclusions).

call

Caller environment for error reporting.

Value

A modified sm_corpus with updated screening table.