Cloud-native access and downloading of NOAA's GEFS data. Caution package is highly experimental and not yet suitable for general use. Direct access to EFI-processed GEFS forecasts for NEON sites may be available through the neon4cast
package.
Example scripts can be seen in the inst/examples
directory.
For variable definitions, see the NOAA tables (If selecting band numbers from the grib files, note band numbers differ between the 000 hour and > 003 forecast)
Use arrow
for quick remote access to the database. Specifying the reference_datetime={date}
is optional but will give the best performance.
date <- "2020-09-24"
bucket <- "bio230014-bucket01"
path <- "neon4cast-drivers/noaa/gefs-v12/stage1"
endpoint <- "https://sdsc.osn.xsede.org"
stage1 <-
glue::glue("{bucket}/{path}/reference_datetime={date}") |>
arrow::s3_bucket(endpoint_override = endpoint, anonymous = TRUE) |>
arrow::open_dataset()
Now we can use dplyr
commands to subset the desired data without downloading the entire data product.
library(dplyr)
q <- stage1 |>
filter(variable == "TMP",
site_id == "BART") |>
group_by(datetime) |>
summarise(temp = mean(prediction)) |>
arrange(datetime)
df <- q |> collect()