Read AML


Synopsis

This operator reads an example set from file. The operator can be configured to read almost all line based file formats.


Description

This operator reads an example set from (a) file(s). Probably you can use the default parameter values for the most file formats (including the format produced by the ExampleSetWriter, CSV, ...). Please refer to section First steps/File formats for details on the attribute description file set by the parameter attributes used to specify attribute types. You can use the wizard of this operator or the tool Attribute Editor in order to create those meta data .aml files for your datasets.

This operator supports the reading of data from multiple source files. Each attribute (including special attributes like labels, weights, ...) might be read from another file. Please note that only the minimum number of lines of all files will be read, i.e. if one of the data source files has less lines than the others, only this number of examples will be read.

The split points can be defined with regular expressions (please refer to one of the plenty tutorials available on the web for an introduction). The default split parameter ",\s*|;\s*|\s+" should work for most file formats. This regular expression describes the following column separators

A logical XOR is defined by "|". Other useful separators might be "\t" for tabulars, " " for a single whitespace, and "\s" for any whitespace. </p>

<p> Quoting is also possible with ". You can escape quotes with a backslash, i.e. \". Please note that you can change these characters by adjusting the corresponding settings.

Additionally you can specify comment characters which can be used at arbitrary locations of the data lines. Any content after the comment character will be ignored. Unknown attribute values can be marked with empty strings (if this is possible for your column separators) or by a question mark (recommended).


Input


Output


Parameters


ExampleProcess