Query the arXiv API (Atom feed) for preprints and return the results as an sm_corpus.

Uses start/max_results pagination to retrieve up to n_max results. The arXiv API is free and requires no authentication.
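The pagination can be pictured as a series of requests whose start offset advances by one page at a time until n_max is reached. The sketch below only builds the request URLs and makes no network calls. The helper name arxiv_page_urls and the page size of 100 are assumptions made for illustration and may not match what sm_fetch_arxiv() does internally; the endpoint and the search_query, start, and max_results parameters are those of the public arXiv API.

```r
# Hypothetical helper (not part of the package): build one arXiv API URL
# per page of results. `page_size = 100L` is an assumed value.
arxiv_page_urls <- function(query, n_max = 200L, page_size = 100L) {
  stopifnot(n_max >= 1, page_size >= 1)
  base <- "http://export.arxiv.org/api/query"
  # Zero-based offsets of the first record on each page.
  starts <- seq(0L, n_max - 1L, by = page_size)
  # Shorten the last page so the total never exceeds n_max.
  sizes <- pmin(page_size, n_max - starts)
  sprintf(
    "%s?search_query=%s&start=%d&max_results=%d",
    base,
    utils::URLencode(query, reserved = TRUE),
    starts,
    sizes
  )
}

urls <- arxiv_page_urls("all:bibliometrics", n_max = 250L)
print(urls)
```

With n_max = 250 and a page size of 100, this yields three requests with start offsets 0, 100, and 200, the last one asking for only 50 results.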

Usage

sm_fetch_arxiv(query, n_max = 200L, verbose = TRUE, call = rlang::caller_env())

Arguments

query

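Search query string in the arXiv API query syntax. Terms can be restricted with field prefixes such as all: (all fields), ti: (title), au: (author), abs: (abstract), and cat: (subject category), and combined with the Boolean operators AND, OR, and ANDNOT.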

n_max

Maximum number of results to return (default 200).

verbose

Logical. If TRUE (the default), print progress messages.

call

Caller environment for error reporting.
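As a sketch of what the query argument can look like, the strings below use arXiv field prefixes and a Boolean operator. The search terms and the cs.DL category are arbitrary illustrations, and the commented-out call assumes network access.

```r
# Illustrative query strings; the search terms are arbitrary examples.
queries <- c(
  title    = "ti:bibliometrics",
  phrase   = "abs:\"citation analysis\"",
  category = "cat:cs.DL",
  combined = "cat:cs.DL AND abs:citation"
)
print(queries)

# Each string would be passed as the `query` argument, for example:
# corpus <- sm_fetch_arxiv(query = queries[["combined"]], n_max = 10)
```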

Value

An sm_corpus object.

Examples

if (FALSE) { # \dontrun{
corpus <- sm_fetch_arxiv(query = "all:bibliometrics", n_max = 10)
print(corpus)
} # }