
Parse EndNote XML export files into an sm_corpus object. Uses xml2 to extract record metadata from the EndNote XML format.

Usage

sm_read_endnote(
  path,
  encoding = "UTF-8",
  verbose = TRUE,
  call = rlang::caller_env()
)

Arguments

path

Character scalar. Path to an EndNote XML file.

encoding

Character scalar. File encoding (default "UTF-8").

verbose

Logical. Print progress messages?

call

Caller environment for error reporting.

Value

An sm_corpus object.

Implementation

The parser reads the file with xml2::read_xml() and processes each record element, located at xml/records/record or records/record. The following fields are extracted from each record:

  • titles/title/style for the title

  • contributors/authors/author/style for author names

  • dates/year/style for year

  • periodical/full-title/style or secondary-title/style for journal

  • electronic-resource-num/style for DOI

  • abstract/style for abstract

  • ref-type attribute for document type

  • keywords/keyword/style for keywords

  • accession-num/style for record identifier
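
The field paths listed above can be illustrated with a small xml2 sketch. This is not the package's internal code: the sample record is made up, first_text() is a hypothetical helper, and reading the document type from the name attribute of ref-type is an assumption about how the attribute is used.

```r
library(xml2)

# A tiny, made-up EndNote-style record used only for illustration.
endnote_xml <- '<xml><records><record>
  <ref-type name="Journal Article">17</ref-type>
  <contributors><authors>
    <author><style face="normal">Doe, J.</style></author>
    <author><style face="normal">Roe, R.</style></author>
  </authors></contributors>
  <titles>
    <title><style face="normal">An example title</style></title>
    <secondary-title><style face="normal">Journal of Examples</style></secondary-title>
  </titles>
  <dates><year><style face="normal">2020</style></year></dates>
  <electronic-resource-num><style face="normal">10.1000/example</style></electronic-resource-num>
</record></records></xml>'

doc <- read_xml(endnote_xml)

# Matches both xml/records/record and records/record layouts.
records <- xml_find_all(doc, "//records/record")

# Hypothetical helper: text of the first matching node, or NA if absent.
first_text <- function(node, xpath) {
  xml_text(xml_find_first(node, xpath))
}

rows <- lapply(seq_along(records), function(i) {
  rec <- records[[i]]

  # Prefer periodical/full-title, fall back to secondary-title.
  journal <- first_text(rec, "periodical/full-title/style")
  if (is.na(journal)) {
    journal <- first_text(rec, ".//secondary-title/style")
  }

  # A record can have several authors; join them into one string here.
  authors <- xml_text(xml_find_all(rec, "contributors/authors/author/style"))

  data.frame(
    title    = first_text(rec, "titles/title/style"),
    authors  = paste(authors, collapse = "; "),
    year     = first_text(rec, "dates/year/style"),
    journal  = journal,
    doi      = first_text(rec, "electronic-resource-num/style"),
    abstract = first_text(rec, "abstract/style"),
    # Assumption: the document type is the "name" attribute of ref-type.
    doc_type = xml_attr(xml_find_first(rec, "ref-type"), "name"),
    stringsAsFactors = FALSE
  )
})

works <- do.call(rbind, rows)
```

Fields missing from a record, such as the abstract in this sample, come back as NA, because xml_find_first() returns a missing node when nothing matches. How sm_read_endnote() itself represents missing fields or multiple authors may differ.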

No bibliometrix engine is available for this reader, because bibliometrix::convert2df() does not directly support the EndNote XML format.

References

Aria, M. & Cuccurullo, C. (2017). bibliometrix: An R-tool for comprehensive science mapping analysis. Journal of Informetrics, 11(4), 959–975. doi:10.1016/j.joi.2017.08.007

Examples

if (FALSE) { # \dontrun{
corpus <- sm_read_endnote("library.xml")
corpus$works
} # }