Kartothek

Latest version: v5.3.0

Safety actively analyzes 627484 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 7 of 7

3.3.0

==========================
- Fix rejection of bool predicates in :func:`~kartothek.serialization.filter_array_like` when bool columns contains
``None``
- Streamline behavior of `store_dataset_from_ddf` when passing empty ddf.
- Fix an issue where a segmentation fault may be raised when comparing MetaPartition instances
- Expose a ``date_as_object`` flag in ``kartothek.core.index.as_flat_series``

3.2.0

==========================
- Fix gh:66 where predicate pushdown may evalute false results if evaluated
using improper types. The behavior now is to raise in these situations.
- Predicate pushdown and :func:`~kartothek.serialization.filter_array_like` will now properly handle pandas Categoricals.
- Add :meth:`~kartothek.io.dask.bag.read_dataset_as_dataframe_bag`
- Add `kartothek.io.dask.bag.read_dataset_as_metapartitions_bag`

3.1.1

==========================

- make :func:`~kartothek.io.dask.bag.build_dataset_indices__bag` more efficient
- make :func:`~kartothek.io.eager.build_dataset_indices` more efficient
- fix pseudo-private :meth:`~kartothek.io_components.read.dispatch_metapartitions` handling of
``concat_partitions_on_primary_index``
- fix internal errors if querying (e.g. via :meth:`~kartothek.io.eager.read_dataset_as_dataframes`) with
``datetime.date`` predicates that use the dataset index; this affects all code paths using
:meth:`~kartothek.io_components.metapartition.MetaPartition.load_dataframes`

3.1.0

==========================

- fix ``getargspec`` ``DeprecationWarning``
- fix ``FutureWarning`` in ``filter_array_like``
- remove ``funcsigs`` requirement
- Implement reference ``io.eager`` implementation, adding the functions:

- :meth:`~kartothek.io.eager.garbage_collect_dataset`
- :meth:`~kartothek.io.eager.build_dataset_indices`
- :meth:`~kartothek.io.eager.update_dataset_from_dataframes`

- fix ``_apply_partition_key_predicates`` ``FutureWarning``
- serialize :class:`~kartothek.core.index.ExplicitSecondaryIndex` to parquet
- improve messages for schema violation errors
- Ensure binary column names are read as type ``str``:

- Ensure dataframe columns are of type ``str`` in :func:`~kartothek.core.common_metadata.empty_dataframe_from_schema`
- Testing: create :func:`~kartothek.io.testing.read.test_binary_column_metadata` which checks column names stored as
``bytes`` objects are read as type ``str``

- fix issue where it was possible to add an index to an existing dataset by using update functions and partition indices
(https://github.com/JDASoftwareGroup/kartothek/issues/16).

- fix issue where unreferenced files were not being removed when deleting an entire dataset

- support nested :class:`~kartothek.io_components.metapartition.MetaPartition`
in :meth:`~kartothek.io_components.metapartition.MetaPartition.add_metapartition`.
This fixes issue https://github.com/JDASoftwareGroup/kartothek/issues/40 .

- Add :meth:`~kartothek.io.dask.bag.build_dataset_indices__bag`

- Return `dask.bag.Item` object from :meth:`~kartothek.io.dask.bag.store_bag_as_dataset` to avoid misoptimization

**Breaking:**

- categorical normalization was moved from :meth:`~kartothek.core.common_metadata.make_meta` to
:meth:`~kartothek.core.common_metadata.normalize_type`.
- :meth:`kartothek.core.common_metadata.SchemaWrapper.origin` is now a set of of strings instead of a single string
- ``Partition.from_v2_dict`` was removed, use :meth:`kartothek.core.partition.Partition.from_dict` instead

3.0.0

==========================

- Initial public release

Page 7 of 7

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.