ParquetReadOptions

Struct ParquetReadOptions 

Source
pub struct ParquetReadOptions<'a> {
    pub file_extension: &'a str,
    pub table_partition_cols: Vec<(String, DataType)>,
    pub parquet_pruning: Option<bool>,
    pub skip_metadata: Option<bool>,
    pub schema: Option<&'a Schema>,
    pub file_sort_order: Vec<Vec<SortExpr>>,
    pub file_decryption_properties: Option<ConfigFileDecryptionProperties>,
    pub metadata_size_hint: Option<usize>,
}
Expand description

Options that control the reading of Parquet files.

Note this structure is supplied when a datasource is created and can not not vary from statement to statement. For settings that can vary statement to statement see ConfigOptions.

Fields§

§file_extension: &'a str

File extension; only files with this extension are selected for data input. Defaults to “.parquet”.

§table_partition_cols: Vec<(String, DataType)>

Partition Columns

§parquet_pruning: Option<bool>

Should the parquet reader use the predicate to prune row groups? If None, uses value in SessionConfig

§skip_metadata: Option<bool>

Should the parquet reader to skip any metadata that may be in the file Schema? This can help avoid schema conflicts due to metadata.

If None specified, uses value in SessionConfig

§schema: Option<&'a Schema>

An optional schema representing the parquet files. If None, parquet reader will try to infer it based on data in file.

§file_sort_order: Vec<Vec<SortExpr>>

Indicates how the file is sorted

§file_decryption_properties: Option<ConfigFileDecryptionProperties>

Properties for decryption of Parquet files that use modular encryption

§metadata_size_hint: Option<usize>

Metadata size hint for Parquet files reading (in bytes)

Implementations§

Source§

impl<'a> ParquetReadOptions<'a>

Source

pub fn new() -> Self

Create a new ParquetReadOptions with default values

Source

pub fn file_extension(self, file_extension: &'a str) -> Self

Specify file_extension

Source

pub fn parquet_pruning(self, parquet_pruning: bool) -> Self

Specify parquet_pruning

Source

pub fn skip_metadata(self, skip_metadata: bool) -> Self

Tell the parquet reader to skip any metadata that may be in the file Schema. This can help avoid schema conflicts due to metadata. Defaults to true.

Source

pub fn schema(self, schema: &'a Schema) -> Self

Specify schema to use for parquet read

Source

pub fn table_partition_cols( self, table_partition_cols: Vec<(String, DataType)>, ) -> Self

Specify table_partition_cols for partition pruning

Source

pub fn file_sort_order(self, file_sort_order: Vec<Vec<SortExpr>>) -> Self

Configure if file has known sort order

Source

pub fn file_decryption_properties( self, file_decryption_properties: ConfigFileDecryptionProperties, ) -> Self

Configure file decryption properties for reading encrypted Parquet files

Source

pub fn metadata_size_hint(self, size_hint: Option<usize>) -> Self

Configure metadata size hint for Parquet files reading (in bytes)

Trait Implementations§

Source§

impl<'a> Clone for ParquetReadOptions<'a>

Source§

fn clone(&self) -> ParquetReadOptions<'a>

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Default for ParquetReadOptions<'_>

Source§

fn default() -> Self

Returns the “default value” for a type. Read more
Source§

impl ReadOptions<'_> for ParquetReadOptions<'_>

Source§

fn to_listing_options( &self, config: &SessionConfig, table_options: TableOptions, ) -> ListingOptions

Helper to convert these user facing options to ListingTable options
Source§

fn get_resolved_schema<'life0, 'life1, 'async_trait>( &'life0 self, config: &'life1 SessionConfig, state: SessionState, table_path: ListingTableUrl, ) -> Pin<Box<dyn Future<Output = Result<SchemaRef>> + Send + 'async_trait>>
where Self: 'async_trait, 'life0: 'async_trait, 'life1: 'async_trait,

Infer and resolve the schema from the files/sources provided.
Source§

fn _get_resolved_schema<'life0, 'async_trait>( &'a self, config: &'life0 SessionConfig, state: SessionState, table_path: ListingTableUrl, schema: Option<&'a Schema>, ) -> Pin<Box<dyn Future<Output = Result<SchemaRef>> + Send + 'async_trait>>
where Self: Sync + 'async_trait, 'a: 'async_trait, 'life0: 'async_trait,

helper function to reduce repetitive code. Infers the schema from sources if not provided. Infinite data sources not supported through this function.

Auto Trait Implementations§

§

impl<'a> Freeze for ParquetReadOptions<'a>

§

impl<'a> !RefUnwindSafe for ParquetReadOptions<'a>

§

impl<'a> Send for ParquetReadOptions<'a>

§

impl<'a> Sync for ParquetReadOptions<'a>

§

impl<'a> Unpin for ParquetReadOptions<'a>

§

impl<'a> !UnwindSafe for ParquetReadOptions<'a>

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

§

impl<T> Instrument for T

§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided [Span], returning an Instrumented wrapper. Read more
§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
§

impl<T> PolicyExt for T
where T: ?Sized,

§

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns [Action::Follow] only if self and other return Action::Follow. Read more
§

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns [Action::Follow] if either self or other returns Action::Follow. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

§

fn vzip(self) -> V

§

impl<T> WithSubscriber for T

§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

impl<T> ErasedDestructor for T
where T: 'static,