Sdv

Latest version: v1.13.1

Safety actively analyzes 630566 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 4 of 10

0.18.0

This release adds suppport for Python 3.10 and drops support for 3.6.

Maintenance

* Drop support for python 3.6 - Issue [1177](https://github.com/sdv-dev/SDV/issues/1177) by amontanez24
* Support for python 3.10 - Issue [939](https://github.com/sdv-dev/SDV/issues/939) by amontanez24
* Support Python >=3.10,<4 - Issue [1000](https://github.com/sdv-dev/SDV/issues/1000) by amontanez24

0.17.2

This release fixes a bug in the demo module related to loading the demo data with constraints. It also adds a name to the demo datasets. Finally, it bumps the version of `SDMetrics` used.

Maintenance

* Upgrade SDMetrics requirement to 0.8.0 - Issue [1125](https://github.com/sdv-dev/SDV/issues/1125) by katxiao

New Features

* Provide a name for the default demo datasets - Issue [1124](https://github.com/sdv-dev/SDV/issues/1124) by amontanez24

Bugs Fixed

* Cannot load_tabular_demo with metadata - Issue [1123](https://github.com/sdv-dev/SDV/issues/1123) by amontanez24

0.17.1

This release bumps the dependency requirements to use the latest version of `SDMetrics`.

Maintenance

* Patch release: Bump required version for SDMetrics - Issue [1010](https://github.com/sdv-dev/SDV/issues/1010) by katxiao

0.17.0

This release updates the code to use RDT version 1.2.0 and greater, so that those new features are now available in SDV. This changes the transformers that are available in SDV models to be those that are in RDT version 1.2.0. As a result, some arguments for initializing models have changed.

Additionally, this release fixes bugs related to loading models with custom constraints. It also fixes a bug that added `NaNs` to the index of sampled data when using `sample_remaining_columns`.

Bugs Fixed

* Incorrect rounding in Custom Constraint example - Issue [941](https://github.com/sdv-dev/SDV/issues/941) by amontanez24
* Can't save the model if use the custom constraint - Issue [928](https://github.com/sdv-dev/SDV/issues/928) by pvk-developer
* User Guide code fixes - Issue [983](https://github.com/sdv-dev/SDV/issues/983) by amontanez24
* Index contains NaNs when using sample_remaining_columns - Issue [985](https://github.com/sdv-dev/SDV/issues/985) by amontanez24
* Cannot sample after loading a model with custom constraints: TypeError - Issue [984](https://github.com/sdv-dev/SDV/issues/984) by pvk-developer
* Set HyperTransformer config manually, based on Metadata if given - Issue [982](https://github.com/sdv-dev/SDV/issues/982) by pvk-developer

New Features

* Change default metrics for evaluate - Issue [949](https://github.com/sdv-dev/SDV/issues/949) by fealho

Maintenance

* Update the RDT version to 1.0 - Issue [897](https://github.com/sdv-dev/SDV/issues/897) by pvk-developer

0.16.0

This release brings user friendly improvements and bug fixes on the `SDV` constraints, to help
users generate their synthetic data easily.

Some predefined constraints have been renamed and redefined to be more user friendly & consistent.
The custom constraint API has also been updated for usability. The SDV now automatically determines
the best `handling_strategy` to use for each constraint, attempting `transform` by default and
falling back to `reject_sampling` otherwise. The `handling_strategy` parameters are no longer
included in the API.

Finally, this version of `SDV` also unifies the parameters for all sampling related methods for
all models (including TabularPreset).

Changes to Constraints

* `GreatherThan` constraint is now separated in two new constraints: `Inequality`, which is
intended to be used between two columns, and `ScalarInequality`, which is intended to be used
between a column and a scalar.

* `Between` constraint is now separated in two new constraints: `Range`, which is intended to
be used between three columns, and `ScalarRange`, which is intended to be used between a column
and low and high scalar values.

* `FixedIncrements` a new constraint that makes the data increment by a certain value.
* New `create_custom_constraint` function available to create custom constraints.

Removed Constraints
* `Rounding` Rounding is automatically being handled by the ``rdt.HyperTransformer``.
* `ColumnFormula` the `create_custom_constraint` takes place over this one and allows more
advanced usage for the end users.

New Features

* Improve error message for invalid constraints - Issue [801](https://github.com/sdv-dev/SDV/issues/801) by fealho
* Numerical Instability in Constrained GaussianCopula - Issue [806](https://github.com/sdv-dev/SDV/issues/806) by fealho
* Unify sampling params for reject sampling - Issue [809](https://github.com/sdv-dev/SDV/issues/809) by amontanez24
* Split `GreaterThan` constraint into `Inequality` and `ScalarInequality` - Issue [814](https://github.com/sdv-dev/SDV/issues/814) by fealho
* Split `Between` constraint into `Range` and `ScalarRange` - Issue [815](https://github.com/sdv-dev/SDV/issues/815) pvk-developer
* Change `columns` to `column_names` in `OneHotEncoding` and `Unique` constraints - Issue [816](https://github.com/sdv-dev/SDV/issues/816) by amontanez24
* Update columns parameter in `Positive` and `Negative` constraint - Issue [817](https://github.com/sdv-dev/SDV/issues/817) by fealho
* Create `FixedIncrements` constraint - Issue [818](https://github.com/sdv-dev/SDV/issues/818) by amontanez24
* Improve datetime handling in `ScalarInequality` and `ScalarRange` constraints - Issue [819](https://github.com/sdv-dev/SDV/issues/819) by pvk-developer
* Support strict boundaries even when transform strategy is used - Issue [820](https://github.com/sdv-dev/SDV/issues/820) by fealho
* Add `create_custom_constraint` factory method - Issue [836](https://github.com/sdv-dev/SDV/issues/836) by fealho

Internal Improvements
* Remove `handling_strategy` parameter - Issue [833](https://github.com/sdv-dev/SDV/issues/833) by amontanez24
* Remove `fit_columns_model` parameter - Issue [834](https://github.com/sdv-dev/SDV/issues/834) by pvk-developer
* Remove the `ColumnFormula` constraint - Issue [837](https://github.com/sdv-dev/SDV/issues/837) by amontanez24
* Move `table_data.copy` to base class of constraints - Issue [845](https://github.com/sdv-dev/SDV/issues/845) by fealho

Bugs Fixed
* Numerical Instability in Constrained GaussianCopula - Issue [801](https://github.com/sdv-dev/SDV/issues/801) by tlranda and fealho
* Fix error message for `FixedIncrements` - Issue [865](https://github.com/sdv-dev/SDV/issues/865) by pvk-developer
* Fix constraints with conditional sampling - Issue [866](https://github.com/sdv-dev/SDV/issues/866) by amontanez24
* Fix error message in `ScalarInequality` - Issue [868](https://github.com/sdv-dev/SDV/issues/868) by pvk-developer
* Cannot use `max_tries_per_batch` on sample: `TypeError: sample() got an unexpected keyword argument 'max_tries_per_batch'` - Issue [885](https://github.com/sdv-dev/SDV/issues/885) by amontanez24
* Conditional sampling + batch size: `ValueError: Length of values (1) does not match length of index (5)` - Issue [886](https://github.com/sdv-dev/SDV/issues/886) by amontanez24
* `TabularPreset` doesn't support new sampling parameters - Issue [887](https://github.com/sdv-dev/SDV/issues/887) by fealho
* Conditional Sampling: `batch_size` is being set to `None` by default? - Issue [889](https://github.com/sdv-dev/SDV/issues/889) by amontanez24
* Conditional sampling using GaussianCopula inefficient when categories are noised - Issue [910](https://github.com/sdv-dev/SDV/issues/910) by amontanez24

Documentation Changes
* Show the `API` for `TabularPreset` models - Issue [854](https://github.com/sdv-dev/SDV/issues/854) by katxiao
* Update handling constraints doc - Pull Request [856](https://github.com/sdv-dev/SDV/issues/856) by amontanez24
* Update custom costraints documentation - Pull Request [857](https://github.com/sdv-dev/SDV/issues/857) by pvk-developer

0.15.0

This release improves the speed of the `GaussianCopula` model by removing logic that previously searched for the appropriate distribution to
use. It also fixes a bug that was happening when conditional sampling was used with the `TabularPreset`.

The rest of the release focuses on making changes to improve constraints including changing the `UniqueCombinations` constraint to `FixedCombinations`,
making the `Unique` constraint work with missing values and erroring when null values are seen in the `OneHotEncoding` constraint.

New Features
* Silence warnings coming from univariate fit in copulas - Issue [769](https://github.com/sdv-dev/SDV/issues/769) by pvk-developer
* Remove parameters related to distribution search and change default - Issue [767](https://github.com/sdv-dev/SDV/issues/767) by fealho
* Update the UniqueCombinations constraint - Issue [793](https://github.com/sdv-dev/SDV/issues/793) by fealho
* Make Unique constraint works with nans - Issue [797](https://github.com/sdv-dev/SDV/issues/797) by fealho
* Error out if nans in OneHotEncoding - Issue [800](https://github.com/sdv-dev/SDV/issues/800) by amontanez24

Bugs Fixed
* Unable to sample conditionally in Tabular_Preset model - Issue [796](https://github.com/sdv-dev/SDV/issues/796) by katxiao

Documentation Changes
* Support GPU computing and progress track? - Issue [478](https://github.com/sdv-dev/SDV/issues/478) by fealho

Page 4 of 10

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.