Synopsis
Generates huge amounts of data for testing purposes.
Description
Generates huge amounts of data in either sparse or dense format. This operator can be used to check if huge amounts of data can be handled by RapidMiner for a given process setup without creating the correct format / writing special purpose input operators.
Input
Output
- output:
Parameters
- number examples: The number of generated examples.
- number attributes: The number of attributes.
- sparse fraction: The fraction of default attributes.
- sparse representation: Indicates if the example should be internally represented in a sparse format.
- use local random seed: Indicates if a local random seed should be used.
- local random seed: Specifies the local random seed