Module demux

Module demux 

Source
Expand description

Module containing helper methods/traits related to enabling dividing input stream into multiple output files at execution time

Functionsยง

compute_hive_style_file_path ๐Ÿ”’
compute_partition_keys_by_row ๐Ÿ”’
compute_take_arrays ๐Ÿ”’
create_new_file_stream ๐Ÿ”’
Helper for row count demuxer
generate_file_path ๐Ÿ”’
Helper for row count demuxer
hive_style_partitions_demuxer ๐Ÿ”’
Splits an input stream based on the distinct values of a set of columns Assumes standard hive style partition paths such as /col1=val1/col2=val2/outputfile.parquet
remove_partition_by_columns ๐Ÿ”’
row_count_demuxer ๐Ÿ”’
Dynamically partitions input stream to achieve desired maximum rows per file
start_demuxer_task ๐Ÿ”’
Splits a single [SendableRecordBatchStream] into a dynamically determined number of partitions at execution time.

Type Aliasesยง

DemuxedStreamReceiver
RecordBatchReceiver ๐Ÿ”’