Skip to contents

Parse Scopus CSV export files into an sm_corpus object. Handles the standard Scopus CSV format with columns for Authors, Title, Year, Source title, DOI, Abstract, and more.

Usage

sm_read_scopus(
  path,
  encoding = "UTF-8",
  engine = c("native", "bibliometrix", "auto"),
  verbose = TRUE,
  call = rlang::caller_env()
)

Arguments

path

Character scalar. Path to a Scopus CSV file.

encoding

Character scalar. File encoding (default "UTF-8").

engine

Character scalar. One of "native" (built-in parser), "bibliometrix" (delegate to bibliometrix::convert2df()), or "auto" (try bibliometrix first, fall back to native).

verbose

Logical. Print progress messages?

call

Caller environment for error reporting.

Value

An sm_corpus object.

Implementation

The native parser reads the Scopus CSV export using readr::read_csv(). Scopus CSV column names may vary between export versions; the parser matches on known column name patterns. Key columns: Authors, Title, Year, Source title, DOI, Abstract, Document Type, Cited by, Author Keywords, Index Keywords, ISSN, Language of Original Document, EID.

References

Aria, M. & Cuccurullo, C. (2017). bibliometrix: An R-tool for comprehensive science mapping analysis. Journal of Informetrics, 11(4), 959–975. doi:10.1016/j.joi.2017.08.007

Examples

if (FALSE) { # \dontrun{
corpus <- sm_read_scopus("scopus.csv")
corpus$works
} # }