--- title: "Caching requests from the UN Comtrade API" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{caching} %\VignetteEncoding{UTF-8} %\VignetteEngine{knitr::rmarkdown} editor_options: markdown: wrap: 72 --- ## Caching requests from the UN Comtrade API There is four reasons why we might want to use caching for our requests to the UN Comtrade API: - The UN Comtrade API at times can be a bit slow đ - Free users only have a limited amount of requests đ¸ - We might want to insulate our analysis against updates to the official data đĻ - It saves energy and is climate friendly to not repeat requests that return the same data every time. đ The package offers a caching option, that is fairly easy to use. ### The basics of caching đŊ When you set the respective argument to `TRUE`, a directory named `comtradr` will be set up in your cache directory. This is determined by the `tools::R_user_dir(which = 'cache')` function. It has the default values for a cache as used by `cachem::cache_disk`. See here: <https://cachem.r-lib.org/reference/cache_disk.html> and here:<https://stat.ethz.ch/R-manual/R-devel/library/tools/html/userdir.html> ```{r, eval = FALSE} library(comtradr) #### Now q <- ct_get_data(reporter = "USA", partner = c("DEU", "FRA", "JPN", "MEX"), flow_direction = "import", start_date = 2020, end_date = 2023, cache = TRUE) # <----- set this argument to TRUE ``` If you want to modify these parameters, you need to set environment variables. You can use the `usethis::edit_r_environ(scope = 'project)` function to set them only for your current project. âšī¸ Restart your R Session after setting the environment variables! đĄ The parameters are: - `COMTRADR_CACHE_MAX_SIZE` = Maximum size of the cache, in bytes. If the cache exceeds this size, cached objects will be removed according to the value of the `evict`. Use `Inf` for no size limit. The default is 1 gigabyte. - `COMTRADR_CACHE_MAX_AGE` = Maximum age of files in cache before they are evicted, in seconds. Use `Inf` for no age limit. - `COMTRADR_CACHE_MAX_N` = Maximum number of objects in the cache. If the number of objects exceeds this value, then cached objects will be removed according to the value of `evict`. Use `Inf` for no limit of number of items. - `R_USER_CACHE_DIR` = Directory where the cached files will be saved. See here for the details on pruning and other functions of `cachem`: <https://cachem.r-lib.org/reference/cache_disk.html> ### Careful with outdated data! â ī¸ The current default for the cache age is set to `Inf`, hence when you have requested data once, it will never downloaded again. For recent data, this can be a very bad idea, as the United Nations might update their data and you will only work with outdated data. The recommended way is to set the `COMTRADR_CACHE_MAX_AGE` environment variable as described above. E.g.: to `60*60*24*365` for one year. To find your cache and empty it by yourself, you can do: ``` {r, eval = FALSE} ## to delete all files in your cache tools::R_user_dir('comtradr', which = 'cache') |> fs::dir_delete() ``` and ``` {r, eval = FALSE} ## to delete all files in your cache tools::R_user_dir('comtradr_bulk', which = 'cache') |> fs::dir_delete() ``` You will have to restart `comtradr` to re-create the cache and be able to use the package.