Skip to content

dubicube: Calculation and Interpretation of Data Cube Indicator Uncertainty

dubicube website

repo
status Release dubicube status
badge CRAN
status R-CMD-check codecov DOI name status
badge funder

The dubicube package aims to deliver measures for assessing the applicability of biodiversity data cubes, whether for general use or specific biodiversity indicators. These measures facilitate data exploration by providing insights into data quality and reliability. Additionally, the package includes functions for calculating indicator uncertainty using bootstrapping, as well as tools for interpreting and visualising uncertainty in biodiversity indicators derived from occurrence cubes.

Install dubicube in R:

install.packages("dubicube", repos = c("https://b-cubed-eu.r-universe.dev", "https://cloud.r-project.org"))

You can install the development version from GitHub with:

# install.packages("remotes")
remotes::install_github("b-cubed-eu/dubicube")

The role of dubicube in the indicator calculation workflow

Section titled “The role of dubicube in the indicator calculation workflow”

The functionality of the dubicube package is useful throughout the occurrence cube indicator calculation workflow. Occurrence cubes can be created from GBIF data using the rgbif package. They are processed using the process_cube() function from the b3gbi package. This ensures data standardisation and verifies that the cube’s format is correct. dubicube facilitates data exploration and filtering (1) which is an iterative process with cube generation and processing. After a number of iterations, data evaluation is successful and the final data cube can be used for indicator calculation. Indicator calculation packages can use dubicube as a dependency for uncertainty interval calculation via bootstrapping (2) but the package can also be used on its own. Finally, the package provides tools and tutorials to help with indicator visualisation and interpretation (3).

dubicube indicator calculation workflow.

The dubicube package offers:

🔍 1. Data Exploration & Variability Assessment

Section titled “🔍 1. Data Exploration & Variability Assessment”

Gain insights into the structure and sensitivity of biodiversity data cubes.

  • diagnose_cube()
    Evaluate the quality of a processed data cube by applying a set of diagnostic rules.

  • filter_cube()
    Remove observations from a processed cube that do not meet defined quality criteria.
    📘 Read the tutorial →

  • cross_validate_cube()
    Perform cross-validation (leave-one-out or k-fold) to assess group-level sensitivity of indicators and evaluate how individual categories influence results.
    📘 Read the tutorial →

Use bootstrap methods to understand variability, bias, and confidence in your indicators.

  • bootstrap_cube()
    Create bootstrap replicates to estimate indicator variability, bias, and standard error.
    📘 Read the tutorial →

  • calculate_bootstrap_ci()
    Compute confidence intervals (percentile, BCa, normal, basic), with optional transformations and bias correction.
    📘 Read the tutorial →

Put your results in context with reference values and uncertainty thresholds.


🔗 Learn more at our website or explore the documentation.