Module containing helper methods/traits related to orchestrating file serialization, streaming to object store, parallelization, and abort handling
Enums
- SerializedRecordBatchResult 🔒 - Result of calling serialize_rb_stream_to_object_store
Functions
- serialize_rb_stream_to_object_store 🔒 - Serializes a single data stream in parallel and writes to an ObjectStore concurrently. Data order is preserved.
- spawn_writer_tasks_and_join - Orchestrates multipart put of a dynamic number of output files from a single input stream for any statelessly serialized file type. That is, any file type for which each [RecordBatch] can be serialized independently of all other [RecordBatch]s.
- stateless_serialize_and_write_files 🔒 - Contains the common logic for serializing RecordBatches and writing the resulting bytes to an ObjectStore. Serialization is assumed to be stateless, i.e. each RecordBatch can be serialized without any dependency on the RecordBatches before or after.
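
The combination of stateless serialization, parallelism, and preserved data order described above can be illustrated with a minimal sketch. It is not this module's implementation: it assumes plain `std::thread` workers in place of async tasks, a `Vec<u8>` in place of an ObjectStore writer, and a hypothetical `Batch` type in place of RecordBatch. The names `Batch`, `serialize_batch`, and `serialize_in_order` are invented for illustration only.

```rust
use std::thread;

/// Hypothetical stand-in for a RecordBatch: a list of rows.
type Batch = Vec<String>;

/// Stateless serializer: the output depends only on the batch passed in,
/// never on the batches before or after it.
fn serialize_batch(batch: &Batch) -> Vec<u8> {
    let mut out = Vec::new();
    for row in batch {
        out.extend_from_slice(row.as_bytes());
        out.push(b'\n');
    }
    out
}

/// Serializes every batch on its own thread, then appends the results to
/// `sink` in input order. Returns the total number of bytes written.
fn serialize_in_order(batches: Vec<Batch>, sink: &mut Vec<u8>) -> thread::Result<usize> {
    // Spawn all workers before joining any of them, so serialization
    // of different batches can overlap.
    let handles: Vec<thread::JoinHandle<Vec<u8>>> = batches
        .into_iter()
        .map(|batch| thread::spawn(move || serialize_batch(&batch)))
        .collect();

    // Join in spawn order: a later batch may finish first, but its bytes
    // are only written after every earlier batch has been written.
    let mut written = 0;
    for handle in handles {
        let bytes = handle.join()?; // propagates a worker panic as Err
        written += bytes.len();
        sink.extend_from_slice(&bytes);
    }
    Ok(written)
}
```

Because each batch is serialized independently, the only ordering constraint is on the write side, which is why joining the handles in spawn order is enough to keep the output identical to a sequential run.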
Type Aliases
- FileWriteBundle 🔒
- SerializerType 🔒
- WriterType 🔒