Expand description
Reexports the datafusion_datasource_parquet crate, containing Parquet based FileSource.
Modules§
- access_
plan - file_
format ParquetFormat: ParquetFileFormatabstractions- metadata
DFParquetMetadatafor fetching Parquet file metadata, statistics and schema information.- source
- ParquetSource implementation for reading parquet files
Structs§
- Cached
Parquet File Reader - Implements [
AsyncFileReader] for a Parquet file in object storage. Reads the file metadata from theFileMetadataCache, if available, otherwise reads it directly from the file and then updates the cache. - Cached
Parquet File Reader Factory - Implementation of
ParquetFileReaderFactorysupporting the caching of footer and page metadata. Reads and updates theFileMetadataCachewith the [ParquetMetaData] data. This reader always loads the entire metadata (including page index, unless the file is encrypted), even if not required by the current query, to ensure it is always available for those that need it. - Cached
Parquet Meta Data - Wrapper to implement
FileMetadatafor [ParquetMetaData]. - Default
Parquet File Reader Factory - Default implementation of
ParquetFileReaderFactory - Object
Store Fetch - [
MetadataFetch] adapter for reading bytes from an [ObjectStore] - Page
Pruning Access Plan Filter - Filters a
ParquetAccessPlanbased on the Parquet PageIndex, if present - Parquet
Access Plan - A selection of rows and row groups within a ParquetFile to decode.
- Parquet
File Metrics - Stores metrics about the parquet execution for a particular parquet file.
- Parquet
File Reader - Implements [
AsyncFileReader] for a parquet file in object storage. - Parquet
Format - The Apache Parquet
FileFormatimplementation - Parquet
Format Factory - Factory struct used to create ParquetFormat
- Parquet
Sink - Implements
DataSinkfor writing to a parquet file. - RowGroup
Access Plan Filter - Reduces the
ParquetAccessPlanbased on row group level metadata.
Enums§
- RowGroup
Access - Describes how the parquet reader will access a row group
Traits§
- Parquet
File Reader Factory - Interface for reading parquet files.
Functions§
- apply_
file_ schema_ type_ coercions - Apply necessary schema type coercions to make file schema match table schema.
- build_
row_ filter - Build a [
RowFilter] from the given predicateExprif possible - can_
expr_ be_ pushed_ down_ with_ schemas - Recurses through expr as a tree, finds all
columns, and checks if any of them would prevent this expression from being predicate pushed down. If any of them would, this returns false. Otherwise, true. Note that the schema passed in here is not the physical file schema (as it is not available at that point in time); it is the schema of the table that this expression is being evaluated against minus any projected columns and partition columns. - coerce_
file_ schema_ to_ string_ type Deprecated - If the table schema uses a string type, coerce the file schema to use a string type.
- coerce_
file_ schema_ to_ view_ type Deprecated - Coerces the file schema if the table schema uses a view type.
- coerce_
int96_ to_ resolution - Coerces the file schema’s Timestamps to the provided TimeUnit if Parquet schema contains INT96.
- fetch_
parquet_ metadata Deprecated - Fetches parquet metadata from ObjectStore for given object
- fetch_
statistics Deprecated - Read and parse the statistics of the Parquet file at location
path - plan_
to_ parquet - Executes a query and writes the results to a partitioned Parquet file.
- statistics_
from_ parquet_ meta_ calc Deprecated - transform_
binary_ to_ string - Transform a schema so that any binary types are strings
- transform_
schema_ to_ view - Transform a schema to use view types for Utf8 and Binary