get_statistics_with_limit

Function get_statistics_with_limit 

Source
pub async fn get_statistics_with_limit(
    all_files: impl Stream<Item = Result<(PartitionedFile, Arc<Statistics>)>>,
    file_schema: SchemaRef,
    limit: Option<usize>,
    collect_stats: bool,
) -> Result<(FileGroup, Statistics)>
👎Deprecated since 47.0.0: Please use get_files_with_limit and compute_all_files_statistics instead
Expand description

Get all files as well as the file level summary statistics (no statistic for partition columns). If the optional limit is provided, includes only sufficient files. Needed to read up to limit number of rows. collect_stats is passed down from the configuration parameter on ListingTable. If it is false we only construct bare statistics and skip a potentially expensive call to multiunzip for constructing file level summary statistics.