The hardware and bandwidth for this mirror is donated by METANET, the Webhosting and Full Service-Cloud Provider.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]metanet.ch.

rcausim: An R package to generate causally-simulated data

rcausim is an R package designed to generate causally-simulated data to serve as ground truth for evaluating methods in causal discovery and effect estimation. This is particularly useful for researchers in fields such as artificial intelligence, statistics, biology, medicine, epidemiology, economics, and social sciences, who are developing a general or a domain-specific methods to discover causal structures and estimate causal effects.

Features

Installation

You can install the development version of rcausim from GitHub with:

# install.packages("devtools")
devtools::install_github("herdiantrisufriyana/rcausim")

Quick Start

Define Functions and Edges

Defining Causal Structure

Start by defining the causal structure as a data frame of edges:

library(rcausim)
# Load predefined edge data
data(edges)
print(edges)

Assist in setting up functions based on these edges:

# Generate function setups from edge definitions
functions <- function_from_edge(edges)
print(functions)

Define specific functions:

# Define a function for vertex B
function_B <- function(n){ rnorm(n, mean = 90, sd = 5) }
functions <- define(functions, 'B', function_B)
print(functions)

Alternatively, Start by Defining Functions

You can also start by defining functions directly:

# Define a function for vertex B
function_B <- function(n){ rnorm(n, mean = 90, sd = 5) }

# Define a function for vertex A
function_A <- function(B){ ifelse(B>=95, 1, 0) }

# Combine functions in a list
functions <- list(A = function_A, B = function_B)
functions <- function_from_user(functions)

Ensure the causal structure is a directed acyclic graph (DAG):

library(igraph)
# Set up edges based on functions
edges <- edge_from_function(functions)

# Check if the resulting edges form a DAG
g <- graph_from_data_frame(edges, directed = TRUE)
is_dag(g)

Data Simulation

Generate simulated data based on the predefined functions:

# Assume completed functions setup
data(functions)

# Generate simulated data
set.seed(1)
simulated_data <- data_from_function(functions, n = 100)
print(simulated_data)

Vignettes

Explore detailed examples and methodologies in the following vignettes:

License

rcausim is licensed under the GNU General Public License v3.0 (GPL-3), which ensures that all derivatives of the software are free to use under the same terms. See the LICENSE file for more details.

Citation

If you use rcausim in your research, please consider citing it:

@misc{rcausim2024,
  author = {Herdiantri Sufriyana and Emily Chia-Yu Su},
  title = {rcausim: An R package to generate causally-simulated data},
  year = {2024},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\\url{https://github.com/herdiantrisufriyana/rcausim}}
}

Contact

For questions or support, please contact herdi[at]tmu.edu.tw.

These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.