Module orchestration

Module orchestration 

Source
Expand description

Module containing helper methods/traits related to orchestrating file serialization, streaming to object store, parallelization, and abort handling

Enumsยง

SerializedRecordBatchResult ๐Ÿ”’
Result of calling serialize_rb_stream_to_object_store

Functionsยง

serialize_rb_stream_to_object_store ๐Ÿ”’
Serializes a single data stream in parallel and writes to an ObjectStore concurrently. Data order is preserved.
spawn_writer_tasks_and_join
Orchestrates multipart put of a dynamic number of output files from a single input stream for any statelessly serialized file type. That is, any file type for which each [RecordBatch] can be serialized independently of all other [RecordBatch]s.
stateless_serialize_and_write_files ๐Ÿ”’
Contains the common logic for serializing RecordBatches and writing the resulting bytes to an ObjectStore. Serialization is assumed to be stateless, i.e. each RecordBatch can be serialized without any dependency on the RecordBatches before or after.

Type Aliasesยง

FileWriteBundle ๐Ÿ”’
SerializerType ๐Ÿ”’
WriterType ๐Ÿ”’