dubicube: Calculation and Interpretation of Data Cube Indicator Uncertainty

Last updated: Apr 29, 2026

The dubicube package aims to deliver measures for assessing the applicability of biodiversity data cubes, whether for general use or specific biodiversity indicators. These measures facilitate data exploration by providing insights into data quality and reliability. Additionally, the package includes functions for calculating indicator uncertainty using bootstrapping, as well as tools for interpreting and visualising uncertainty in biodiversity indicators derived from occurrence cubes.

Installation

Install dubicube in R:

install.packages("dubicube", repos = c("https://b-cubed-eu.r-universe.dev", "https://cloud.r-project.org"))

You can install the development version from GitHub with:

# install.packages("remotes")
remotes::install_github("b-cubed-eu/dubicube")

The role of dubicube in the indicator calculation workflow

The functionality of the dubicube package is useful throughout the occurrence cube indicator calculation workflow. Occurrence cubes can be created from GBIF data using the rgbif package. They are processed using the process_cube() function from the b3gbi package. This ensures data standardisation and verifies that the cube’s format is correct. dubicube facilitates data exploration and filtering (1) which is an iterative process with cube generation and processing. After a number of iterations, data evaluation is successful and the final data cube can be used for indicator calculation. Indicator calculation packages can use dubicube as a dependency for uncertainty interval calculation via bootstrapping (2) but the package can also be used on its own. Finally, the package provides tools and tutorials to help with indicator visualisation and interpretation (3).

dubicube indicator calculation workflow.

Key Features

The dubicube package offers:

🔍 1. Data Exploration & Variability Assessment

Gain insights into the structure and sensitivity of biodiversity data cubes.

diagnose_cube()
Evaluate the quality of a processed data cube by applying a set of diagnostic rules.
filter_cube()
Remove observations from a processed cube that do not meet defined quality criteria.
📘 Read the tutorial →
cross_validate_cube()
Perform cross-validation (leave-one-out or k-fold) to assess group-level sensitivity of indicators and evaluate how individual categories influence results.
📘 Read the tutorial →

📈 2. Estimating Indicator Uncertainty

Use bootstrap methods to understand variability, bias, and confidence in your indicators.

bootstrap_cube()
Create bootstrap replicates to estimate indicator variability, bias, and standard error.
📘 Read the tutorial →
calculate_bootstrap_ci()
Compute confidence intervals (percentile, BCa, normal, basic), with optional transformations and bias correction.
📘 Read the tutorial →

🧠 3. Interpretation & Visualisation

Put your results in context with reference values and uncertainty thresholds.

add_effect_classification()
Classify indicator trends (e.g. increase, stable, decrease) by comparing confidence intervals to thresholds.
📘 Read the tutorial →
Spatial and temporal interpretation
Learn how to visualise and assess patterns across space and time using indicator uncertainty.
📘 Best practices for temporal trends →
📘 Best practices for spatial trends →

🔗 Learn more at our website or explore the documentation.