Get links from a specific section of a Wikipedia page
Source:R/tw_get_wikipedia_page_section_links.R
tw_get_wikipedia_page_section_links.Rd
Get links from a specific section of a Wikipedia page
Usage
tw_get_wikipedia_page_section_links(
url = NULL,
title = NULL,
section_title = NULL,
section_index = NULL,
language = tidywikidatar::tw_get_language(),
cache = NULL,
overwrite_cache = FALSE,
cache_connection = NULL,
disconnect_db = TRUE,
wait = 1,
attempts = 10,
wikipedia_page_qid_df = NULL
)
Arguments
- url
Full URL to a Wikipedia page. If given, title and language can be left empty.
- title
Title of a Wikipedia page or final parts of its url. If given, url can be left empty, but language must be provided.
- section_title
Defaults to NULL. If given, it should correspond to the human-readable title of a section of the relevant Wikipedia page. See also
tw_get_wikipedia_page_sections()
- section_index
Defaults to NULL. If given, it should correspond to the ordinal of a section of the relevant Wikipedia page. See also
tw_get_wikipedia_page_sections()
- language
Two-letter language code used to define the Wikipedia version to use. Defaults to language set with
tw_set_language()
; if not set, "en". If url given, this can be left empty.- cache
Defaults to NULL. If given, it should be given either TRUE or FALSE. Typically set with
tw_enable_cache()
ortw_disable_cache()
.- overwrite_cache
Logical, defaults to FALSE. If TRUE, it overwrites the table in the local sqlite database. Useful if the original Wikidata object has been updated.
- cache_connection
Defaults to NULL. If NULL, and caching is enabled,
tidywikidatar
will use a local sqlite database. A custom connection to other databases can be given (see vignettecaching
for details).- disconnect_db
Defaults to TRUE. If FALSE, leaves the connection to cache open.
- wait
In seconds, defaults to 1 due to time-outs with frequent queries. Time to wait between queries to the APIs. If data are cached locally, wait time is not applied. If you are running many queries systematically you may want to add some waiting time between queries.
- attempts
Defaults to 10. Number of times it re-attempts to reach the API before failing.
- wikipedia_page_qid_df
Defaults to NULL. If given, used to reduce calls to cache. A data frame
Examples
if (interactive()) {
tw_get_wikipedia_page_section_links(title = "Margaret Mead", language = "en", section_index = 1)
}