Data Parser
Classes
DataParser(schema: IdSchemaObjectT, mapping: Mapping[SchemaField, str] | None = None)
: A DataParser describes the interface to get a source data to the format of a defined schema with mapping support.
Attributes:
mapping (Mapping[SchemaField, str], optional): Source to SchemaField mapping rules
as `SchemaField`-`str` pairs such as `{movie_schema.title: "movie_title"}`.
Initialize DataParser
Get the desired output schema and initialize a default mapping
that can be extended by DataParser realizations.
Args:
schema (IdSchemaObjectT): SchemaObject describing the desired output.
mapping (Mapping[SchemaField, str], optional): Realizations can use the `SchemaField` to `str` mapping
to define their custom mapping logic.
Raises:
InitializationException: Parameter `schema` is of invalid type.
### Ancestors (in MRO)
* abc.ABC
* typing.Generic
### Descendants
* superlinked.framework.common.parser.dataframe_parser.DataFrameParser
* superlinked.framework.common.parser.json_parser.JsonParser
### Instance variables
`allow_bytes_input: bool`
:
`blob_loader: BlobLoader`
:
### Methods
`marshal(self, parsed_schemas: ParsedSchema | list[ParsedSchema]) ‑> list[~SourceTypeT]`
: Get a previously parsed data and return it to it's input format.
Args:
parsed_schemas: Previously parsed data that follows the schema of the `DataParser`.
Returns:
list[SourceTypeT]: A list of the original source data format after marshalling the parsed data.
`set_allow_bytes_input(self, value: bool) ‑> None`
:
`unmarshal(self, data: SourceTypeT) ‑> list[superlinked.framework.common.parser.parsed_schema.ParsedSchema]`
: Get the source data and parse it to the desired Schema with the defined mapping.
Args:
data (TSourceType): Source data that corresponds to the DataParser's type.
Returns:
list[ParsedSchema]: A list of ParsedSchema objects.
Last updated