Computes per-work collaboration indicators for the corpus. Corpus-level measures such as the Collaboration Index (CI, the mean number of authors per paper), the proportion of multi-authored papers, and the international collaboration rate can be derived from the result.

Usage

sm_metric_collab_index(corpus, call = rlang::caller_env())

Arguments

corpus

An sm_corpus object with a populated authorships table.

call

Caller environment for error reporting.

Value

A tibble with one row per work and columns:

work_id

The work identifier.

n_authors

Number of authors on the work.

n_countries

Number of distinct countries among co-authors.

n_institutions

Number of distinct institutions among co-authors.

is_international

Logical; TRUE if authors from more than one country contributed.

is_multi_authored

Logical; TRUE if the work has more than one author.

Details

The Collaboration Index (CI) for a set of works is typically defined as the mean number of authors per paper. This function does not return that aggregate directly. It returns per-work data, from which the CI can be computed as the mean of n_authors across the returned rows.
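
The aggregates named in the description can be derived from the per-work columns in a few lines. The following is a minimal sketch, not package output: the data frame is a hand-built stand-in that only mimics the documented column names, the values are illustrative, and the name summary_stats is hypothetical. It also assumes the international collaboration rate is taken over all works rather than over multi-authored works only.

```r
# Toy stand-in for the tibble returned by sm_metric_collab_index().
# Column names follow the Value section; the values are made up.
collab <- data.frame(
  work_id     = c("W1", "W2", "W3", "W4"),
  n_authors   = c(1L, 6L, 4L, 5L),
  n_countries = c(1L, 4L, 1L, 2L),
  stringsAsFactors = FALSE
)

# Flags defined as in the Value section.
collab$is_international  <- collab$n_countries > 1
collab$is_multi_authored <- collab$n_authors > 1

# The mean of a logical vector is the share of TRUE values.
summary_stats <- c(
  collaboration_index = mean(collab$n_authors),
  prop_multi_authored = mean(collab$is_multi_authored),
  intl_collab_rate    = mean(collab$is_international)
)

print(summary_stats)
```

With real output, replace the toy data frame with the result of sm_metric_collab_index(corpus). If the international rate should be restricted to multi-authored works, subset on is_multi_authored before taking the mean.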

Examples

corpus <- sm_example_corpus()
collab <- sm_metric_collab_index(corpus)
head(collab)
#> # A tibble: 6 × 6
#>   work_id    n_authors n_countries n_institutions is_international
#>   <chr>          <int>       <int>          <int> <lgl>           
#> 1 W000000001         1           1              0 FALSE           
#> 2 W000000002         6           4              0 TRUE            
#> 3 W000000003         4           4              0 TRUE            
#> 4 W000000004         5           5              0 TRUE            
#> 5 W000000005         4           3              0 TRUE            
#> 6 W000000006         1           1              0 FALSE           
#> # ℹ 1 more variable: is_multi_authored <lgl>
# Aggregate collaboration index:
mean(collab$n_authors)
#> [1] 3.775