Module statistics


Use statistics to optimize physical planning.

Currently, this module houses code to sort file groups if they are non-overlapping with respect to the required sort order. See MinMaxStatistics.

Structs

MinMaxStatistics 🔒
A normalized representation of file min/max statistics that allows for efficient sorting and comparison. The min/max values are ordered by Self::sort_order. Furthermore, any columns that are reversed in the sort order have their min/max values swapped.
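
The non-overlap check described above can be sketched roughly as follows. This is a simplified illustration, not the actual MinMaxStatistics implementation: `FileRange`, `cmp_in_order`, and `sort_if_non_overlapping` are hypothetical names, the sketch handles a single `i64` sort column rather than a multi-column sort order, and it assumes that ranges which merely touch at a boundary count as non-overlapping.

```rust
use std::cmp::Ordering;

/// Hypothetical, simplified per-file statistics for one i64 sort column.
#[derive(Debug, Clone, Copy, PartialEq)]
struct FileRange {
    min: i64,
    max: i64,
}

/// Compare two values in the direction of the sort order.
fn cmp_in_order(a: i64, b: i64, descending: bool) -> Ordering {
    if descending { b.cmp(&a) } else { a.cmp(&b) }
}

/// Returns the files ordered along the sort order if no two files overlap,
/// or `None` if any two adjacent ranges overlap.
fn sort_if_non_overlapping(files: &[FileRange], descending: bool) -> Option<Vec<FileRange>> {
    // Normalize each file to (first, last) in sort order. For a descending
    // column the min and max are swapped, so `first` is the max.
    let mut normalized: Vec<(i64, i64, FileRange)> = files
        .iter()
        .map(|f| if descending { (f.max, f.min, *f) } else { (f.min, f.max, *f) })
        .collect();

    // Order the files by their first value in sort order.
    normalized.sort_by(|a, b| cmp_in_order(a.0, b.0, descending));

    // Every file must start no earlier than the previous file ends.
    // Assumption: equal boundary values do not count as overlap.
    for pair in normalized.windows(2) {
        if cmp_in_order(pair[1].0, pair[0].1, descending) == Ordering::Less {
            return None;
        }
    }
    Some(normalized.into_iter().map(|(_, _, f)| f).collect())
}
```

Swapping min and max for descending columns lets the same "sort by first value, then compare neighbours" logic serve both directions; only the comparison direction changes.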

Functions

add_row_stats (Deprecated)
compute_all_files_statistics
Computes statistics for all files across multiple file groups.
compute_file_group_statistics
Computes the summary statistics for a group of files (FileGroup-level statistics).
get_statistics_with_limit (Deprecated)
Get all files as well as the file-level summary statistics (no statistics for partition columns). If the optional limit is provided, includes only the files needed to read up to limit number of rows. collect_stats is passed down from the configuration parameter on ListingTable. If it is false, we only construct bare statistics and skip a potentially expensive call to multiunzip for constructing file-level summary statistics.
sort_columns_from_physical_sort_exprs 🔒
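
The limit behaviour described for get_statistics_with_limit, together with the idea of merging per-file statistics into a summary, might look something like the sketch below. Everything here is hypothetical: `FileEntry`, `Summary`, and `select_files_with_limit` are illustrative names, not this module's API. The sketch also assumes that every file has an exact row count and exact min/max values for a single `i64` column; it makes no claim about how the real function handles absent or inexact statistics.

```rust
/// Hypothetical per-file statistics with exact values.
#[derive(Debug, Clone, PartialEq)]
struct FileEntry {
    num_rows: usize,
    min: i64,
    max: i64,
}

/// Hypothetical summary over the selected files.
#[derive(Debug, Default, PartialEq)]
struct Summary {
    num_rows: usize,
    min: Option<i64>,
    max: Option<i64>,
}

/// Selects files in order until the accumulated row count covers `limit`
/// (or all files if `limit` is `None`) and merges their statistics.
fn select_files_with_limit(files: &[FileEntry], limit: Option<usize>) -> (Vec<FileEntry>, Summary) {
    let mut selected = Vec::new();
    let mut summary = Summary::default();

    for file in files {
        // Stop once the files already selected are enough to satisfy the limit.
        if let Some(limit) = limit {
            if summary.num_rows >= limit {
                break;
            }
        }
        // Merge this file's statistics into the running summary.
        summary.num_rows += file.num_rows;
        summary.min = Some(summary.min.map_or(file.min, |m| m.min(file.min)));
        summary.max = Some(summary.max.map_or(file.max, |m| m.max(file.max)));
        selected.push(file.clone());
    }
    (selected, summary)
}
```

Checking the limit before adding each file means the last selected file may push the total past the limit, which matches the "only the files needed to read up to limit rows" wording: the reader still has to stop at the limit itself.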