This document walks through a small data preprocessing example. Since version 1.0.1, the package ships with a folder of example files that can be used to test the tool: 20 ham SMS files and 20 spam SMS files. Their contents can also be viewed by loading bdparData, the dataset bundled with the package.
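As a quick check before running anything, the example folder and the bundled dataset can be inspected from R. This is a minimal sketch: it assumes bdpar may or may not be installed, so every step is guarded, and it makes no claim about the layout of the folder or the columns of bdparData beyond what the calls print.

```r
# Locate the example folder shipped with bdpar.
# system.file() returns "" when the package or folder is not found.
example_dir <- system.file("example", package = "bdpar")

if (nzchar(example_dir)) {
  # List the bundled files and show the first few paths.
  example_files <- list.files(example_dir, recursive = TRUE)
  print(length(example_files))
  print(head(example_files))
} else {
  message("bdpar example folder not found; is the package installed?")
}

# Load the bundled dataset only when the package is available.
if (requireNamespace("bdpar", quietly = TRUE)) {
  data(bdparData, package = "bdpar")
  str(bdparData)
}
```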
For this preprocessing we will use both the default preprocessing flow (DefaultPipeline) and the default way to create the Instances (ExtractorFactory).
With these elements configuring the preprocessing, the pipeline is launched as follows.
library(bdpar)

runPipeline(path = system.file(package = "bdpar", "example"),
            cache = FALSE,
            verbose = FALSE,
            summary = FALSE)
This function generates a CSV file in the user's working directory. The file contains the characteristics extracted from each text, including the cleaned text produced by the preprocessing.
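One way to find and preview the generated file is sketched below. The name of the output file is not stated here, so the sketch simply picks the most recently modified CSV in the working directory, which is an assumption and may select an unrelated file. It reads raw lines rather than parsing columns, since the field separator is not stated either.

```r
# Look for CSV files in the current working directory.
csv_files <- list.files(getwd(), pattern = "\\.csv$", full.names = TRUE)

if (length(csv_files) > 0) {
  # Assumption: the pipeline output is the most recently modified CSV.
  mtimes <- file.info(csv_files)$mtime
  newest_csv <- csv_files[which.max(mtimes)]

  # Preview the header and the first rows without assuming a separator.
  preview <- readLines(newest_csv, n = 3, warn = FALSE)
  print(newest_csv)
  print(preview)
} else {
  message("No CSV file found in ", getwd())
}
```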