The hardware and bandwidth for this mirror is donated by METANET, the Webhosting and Full Service-Cloud Provider.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]metanet.ch.

dataset: Create Data Frames for Exchange and Reuse

The 'dataset' package extends tidy data frames with machine-readable metadata, semantic definitions, and provenance information. It supports incremental semantic stabilization, interoperable dataset exchange, and FAIR-oriented publication workflows by preserving contextual metadata directly within R objects. The package facilitates the creation, exchange, reuse, and RDF serialization of datasets in line with ISO and W3C standards.

Version: 0.4.5
Depends: R (≥ 3.5)
Imports: assertthat, haven, ISOcodes, labelled, pillar, stats, tibble, utils, vctrs
Suggests: dplyr, jsonld, knitr, rdflib, rmarkdown, spelling, tidyr, testthat (≥ 3.0.0)
Published: 2026-06-03
DOI: 10.32614/CRAN.package.dataset
Author: Daniel Antal ORCID iD [aut, cre], Marcelo Perlin ORCID iD [rev], Anna Márta Mester ORCID iD [rev], Mauro Lepore ORCID iD [rev]
Maintainer: Daniel Antal <daniel.antal at dataobservatory.eu>
BugReports: https://github.com/ropensci/dataset/issues
License: GPL (≥ 3)
URL: https://docs.ropensci.org/dataset/, https://github.com/ropensci/dataset, https://dataset.dataobservatory.eu
NeedsCompilation: no
Language: en-GB
Citation: dataset citation info
Materials: README, NEWS
CRAN checks: dataset results

Documentation:

Reference manual: dataset.html , dataset.pdf
Vignettes: Modernising Citation Metadata in R: Introducing 'bibrecord' (source, R code)
dataset_df: Create Datasets that are Easy to Share Exchange and Extend (source, R code)
defined: Semantically Enriched Vectors (source, R code)
Design Principles & Future Work Semantically Enriched, Standards-Aligned Datasets in R (source, R code)
Example Dataset Definitions (source)
An Introduction to the dataset Package (source, R code)
Handling Semantic Ambiguity with prelabelled Vectors (source, R code)
From R to RDF (source, R code)

Downloads:

Package source: dataset_0.4.5.tar.gz
Windows binaries: r-devel: dataset_0.4.5.zip, r-release: dataset_0.4.5.zip, r-oldrel: dataset_0.4.5.zip
macOS binaries: r-release (arm64): dataset_0.4.5.tgz, r-oldrel (arm64): dataset_0.4.5.tgz, r-release (x86_64): dataset_0.4.5.tgz, r-oldrel (x86_64): dataset_0.4.5.tgz
Old sources: dataset archive

Reverse dependencies:

Reverse imports: retroharmonize

Linking:

Please use the canonical form https://CRAN.R-project.org/package=dataset to link to this page.

These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.