The hardware and bandwidth for this mirror is donated by METANET, the Webhosting and Full Service-Cloud Provider.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]metanet.ch.
get_eurostat()
use.data.table
argument. This is especially useful with big
datasets that would otherwise take a long time to go through the
different data cleaning functions or crash R with their large memory
footprint. (issue #277, PR #278)httr
package to httr2
(issue
#273, PR #276)list_eurostat_cache_items()
. (Affects issues mentioned in
#144, #257, #258, fixed in PR #267).eurostatTOC
object (returned by
get_eurostat_toc()
) now use dots instead of spaces in the
style of base::make.names()
, e.g. turning
last update of data
to last.update.of.data
(PR
#271).eurostatTOC
object includes a new hierarchy column
that represents the position of each folder, dataset and table in the
folder structure.search_eurostat()
includes the option to search Table
of Content items by dataset codes in addition to titles. This makes it
possible to make further queries from similar datasets
(e.g. “nama_10_gdp”, “nama_10r_2gdp”, “nama_10r_3popgdp”) that might
have different titles.label_eurostat_tables()
has been rewritten to use the
new SDMX API instead of table_dic.dic
file in Eurostat Bulk
Download Listing (PR #271)get_eurostat_geospatial()
now leverages on
giscoR::gisco_get_nuts()
for downloading geospatial data
(PR #264, thanks to @dieghernan):
"spdf"
output class soft-deprecated, it would return a
sf
object with a message.make_valid
parameter soft-deprecated....
to the function so additional parametes can
be passed to giscoR::gisco_get_nuts()
.eurostat_geodata_60_2016
updated.get_eurostat_geospatial()
now requires sf package to
work at all (PR #280, thanks to @dieghernan)get_eurostat()
and get_eurostat_json()
functions. The documentation now warns users about potential problems
caused by time
/ TIME_PERIOD
parameters when
used to query datasets that contain quarterly data (issue #260)get_eurostat_dic()
and
label_eurostat()
functions.get_bibentry()
now outputs “Accessed YYYY-MM-DD” and
“dataset last updated YYYY-MM-DD” in note field as otherwise it would be
sporadically printed or not at all printed from urldate
field.sp
, methods
and broom
packages from dependencies.giscoR
to Suggests. (PR #264)get_eurostat_interactive()
for
interactively searching and downloading data from Eurostat SDMX API. The
function aims to make good data citation practices more prominently
visible and also make it easier to explore what different arguments in
get_eurostat()
function do.eurostat:::fixity_checksum()
to easily calculate a fixity
checksum for datasets downloaded from Eurostat. The fixity checksum can,
for example, be saved in research notes and reported in as part of data
appendices. Printing the fixity checksum is encouraged by including an
option to print it in every get_eurostat_interactive()
query.clean_eurostat_toc()
for
easy removal of TOC objects from .EurostatEnv environment. (PR
#278)check_lang()
(PR #270)get_eurostat()
function now explicity accepts a ‘lang’
argument, for passing onwards to get_eurostat_json()
and
label_eurostat()
(PR #270)get_eurostat_folder()
for
downloading all datasets in a folder. The function is limited to
downloading folders that contain at maximum 20 datasets. This function
relies on new internal helper functions:
toc_count_whitespace()
,
toc_determine_hierarchy()
,
toc_count_children()
and toc_list_children()
.
(PR #270)get_eurostat_toc()
and
set_eurostat_toc()
now have experimental features that
support downloading TOCs in French and German as well. This support, in
turn, is leveraged in get_bibentry()
which now has a
language parameter: lang
(PR #270)get_eurostat_toc()
,
search_eurostat()
now supports searching from French and
German TOC-files as well (PR #270)grepEurostatTOC()
is completely marked as defunct and
is enroute to being removed from the package as
search_eurostat()
is now the only way to fetch Eurostat TOC
items and search (grep) them (PR #270)label_eurostat_vars2
that has been removed
in the final version, as promised earlier: “The old function will be
completely removed after October 2023 when Eurostat Bulk Download
Listing website is retired and label_eurostat_vars2
will be
renamed to label_eurostat_vars()
”. The new
label_eurostat_vars()
function uses the new SDMX API to
retrieve names for dataset columns. Function evolution is subject to
ongoing Eurostat API developments. (PR #270)get_bibentry()
returns correct codes for titles and
warns the user if some / all of the requested codes were not found in
the TOC (PR #270)get_bibentry()
uses the date field with the internal
BibEntry format that can be easily translated to other formats: bibtex,
bibentry (PR #270)get_bibentry()
now outputs dataset codes in titles
correctly so that bibtex
and biblatex
entries
can be copypasted into bibliographies without adding escape characters
manually (PR #270)eurotime2date()
when handling big
datasets containing weekly data and tens of millions of rows (dataset
used for testing mentioned in issue #200).get_eurostat_json()
and
eurotime2date()
functions (issue #251, reported by @lz1nwm). The
get_eurostat_json()
function uses the temporary
eurotime2date()
function for date handling until the old
bulk download API is deprecated.curl::curl_download
on Windows platforms instead of
utils::download.file
as the latter causes the following
error: “error reading from the connection […] invalid or incomplete
compressed data”. This affects only files downloaded from the new
API.get_eurostat()
and its assorted functions to
download data from the new dissemination API (related to issues #251,
#243). See Eurostat web page Transition - from Eurostat Bulk Download to
API for a list of differences between old and new data sources:
https://wikis.ec.europa.eu/display/EUROSTATHELP/Transition+-+from+Eurostat+Bulk+Download+to+APIget_eurostat_raw2
,
tidy_eurostat2
, convert_time_col2
,
eurotime2date2
, eurotime2num2
and
label_eurostat2
. When the old bulk download facilities are
decommissioned, these functions will replace the old functions with old
naming schemes (without the 2s at the end).tidy_eurostat2
function is now able to handle multiple
time frequencies in one call: For example, you can download annual,
quarterly, and monthly data simply by using a vector c(“A”, “Q”, “M”) in
select_time instead of using these singular frequencies in separate
calls. The function will also return multiple time series in one dataset
if select_time is NULL (as it is by default). If the dataset contains
multiple time series and these are explicitly downloaded / no
select_time parameter is given, a message will be printed.eurotime2num
can now handle monthly and weekly data as
well.get_eurostat()
function:
legacy_bulk_download (default = TRUE). By setting this parameter to
FALSE the user can download data from the new dissemination API. If you
want to test the new API before it becomes the only way to download the
data (and we very much encourage you to do so), set this parameter to
FALSE.get_eurostat_json()
to migrate from JSON web
service to API Statistics (addressed in issues #243, #251). Please note
that the output from JSON API is now slightly different than before: the
datasets now contain a freq column to indicate the frequency with which
data has been collected, for example annually “A”, monthly “M” or
quarterly “Q”. See Eurostat - Data browser online help website for more
information:
https://wikis.ec.europa.eu/display/EUROSTATHELP/API+Statistics+-+migrating+from+JSON+web+service+to+API+Statisticsget_bibentry()
and
get_eurostat_geospatial()
eurostat_geodata_60_2016
to
fix the issue of old-style crs object (#237)eurostat_geodata_60_2016
so that the dataset is more
understandable and usable for testing purposes. Added the same
information to get_eurostat_geospatial()
documentation as
well.eurostat_geodata_60_2016
and
get_eurostat_geospatial()
documentation.get_eurostat_geospatial()
by setting content
encoding to UTF-8 when httr::content()
function is
called\code{}
,
\link{}
syntax to markdown (issue #230, PR #231 by @dieghernan)options()
command is
no longer needed and the cache dir can be modified persistently with a
custom function (issue #223, PR #228 by @dieghernan)add_nuts_level()
,
harmonize_geo_code()
, recode_to_nuts_2016()
and recode_to_nuts_2013()
; these functions were moved to
the new package regions. The problem of sub-national geo codes is
explained in the new vignette “Mapping Regional Data, Mapping Metadata
Problems”, which replaces the “Regional data examples for the eurostat R
package” vignette. This is a shared vignette, but the new regions
package has more articles on how to work with sub-national data. (issues
#218 and #219, PR #220 by @antaldaniel)get_eurostat_geospatial()
return a message if sf is not
installed. This is to increase compatibility of eurostat-package on
systems that have trouble installing sf (issue #213)\dontrun{}
for
a quick CRAN releaselabel_eurostat()
has new countrycode and
countrycode_nomatch arguments to label with countrycode package and
custom_dic argument to add custom dictionary.label_eurostat()
gives always an error by default,
if labelling introduces duplicated labels. A new
fix_duplicated
argument is add to fix duplicated labels
automatically. (#79, #90)search_eurostat()
accepts new argument
fixed
: if TRUE
(default), pattern
provided will used as is; if FALSE
, pattern
will be interpreted as a true regex pattern.get_eurostat()
returns tibbles (#52)get_eurostat_dic()
and
get_eurostat_toc()
return tibblesread_tsv()
is used instead of
read.csv()
(#29)harmonize_country_code()
which
converts the raw data values from EL to GR and from UK to GB.get_eurostat()
can now get data also from the
Eurostat JSON API via get_eurostat_json()
. It also have a
new argument type
to select labels for variable values
instead of codes.tidyr 0.4.0
(#47).select_time
argument for
get_eurostat()
to select a time frequency in case of
multi-frequency datasets. Now the get_eurostat()
also gives
an error if you try to get multi-frequency with other time formats than
time_format = "raw"
. (#30) time
column is also
now in ascending order.get_eurostat()
gets a new argument
compress_file
to control compression of the cache file.
Also cache filenames includes now all relevant arguments. (#28)search_eurostat()
a new type option
type = "all"
to search all types.label_eurostat()
new arguments. A code
to retain also codes for specified columns. A eu_order
to
order factor levels in Eurostat order, which uses the new function
dic_order()
.label_eurostat_vars(x)
gives labels for names, if x
is other than a character or a factor and
label_eurostat_tables(x)
does not accept other than a
character or a factor.get_eurostat()
a new argument
stringsAsFactors
to control the factor conversion of
variables.eurotime2date
(and get_eurostat
) convers
now also daily data.These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.