The `doc_terms` dataset contains and edge list of rulings and lemmas (or processed words).
Format
A data frame with three variables:
id
Ruling ID
lemma
A standardized word
n
Number of times used in the document
Examples
docterms
#> # A tibble: 24,831,504 × 3
#> id lemma n
#> <fct> <fct> <int>
#> 1 C-001-18 abierto 2
#> 2 C-001-18 abordar 1
#> 3 C-001-18 absoluto 1
#> 4 C-001-18 abstracto 12
#> 5 C-001-18 academia 3
#> 6 C-001-18 academico 1
#> 7 C-001-18 acapite 2
#> 8 C-001-18 acceso 1
#> 9 C-001-18 accion 7
#> 10 C-001-18 accionante 3
#> # ℹ 24,831,494 more rows