Sdv

Latest version: v1.13.1

Safety actively analyzes 630656 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 5 of 10

0.14.1

This release adds a `TabularPreset`, available in the `sdv.lite` module, which allows users to easily optimize a tabular model for speed.
In this release, we also include bug fixes for sampling with conditions, an unresolved warning, and setting field distributions. Finally,
we include documentation updates for sampling and the new `TabularPreset`.

Bugs Fixed
* Sampling with conditions={column: 0.0} for float columns doesn't work - Issue [525](https://github.com/sdv-dev/SDV/issues/525) by shlomihod and tssbas
* resolved FutureWarning with Pandas replaced append by concat - Issue [759](https://github.com/sdv-dev/SDV/issues/759) by Deathn0t
* Field distributions bug in CopulaGAN - Issue [747](https://github.com/sdv-dev/SDV/issues/747) by katxiao
* Field distributions bug in GaussianCopula - Issue [746](https://github.com/sdv-dev/SDV/issues/746) by katxiao

New Features
* Set default transformer to categorical_fuzzy - Issue [768](https://github.com/sdv-dev/SDV/issues/768) by amontanez24
* Model nulls normally when tabular preset has constraints - Issue [764](https://github.com/sdv-dev/SDV/issues/764) by katxiao
* Don't modify my metadata object - Issue [754](https://github.com/sdv-dev/SDV/issues/754) by amontanez24
* Presets should be able to handle constraints - Issue [753](https://github.com/sdv-dev/SDV/issues/753) by katxiao
* Change preset optimize_for --> name - Issue [749](https://github.com/sdv-dev/SDV/issues/749) by katxiao
* Create a speed optimized Preset - Issue [716](https://github.com/sdv-dev/SDV/issues/716) by katxiao

Documentation Changes
* Add tabular preset docs - Issue [777](https://github.com/sdv-dev/SDV/issues/777) by katxiao
* sdv.sampling module is missing from the API - Issue [740](https://github.com/sdv-dev/SDV/issues/740) by katxiao

0.14.0

This release updates the sampling API and splits the existing functionality into three methods - `sample`, `sample_conditions`,
and `sample_remaining_columns`. We also add support for sampling in batches, displaying a progress bar when sampling with more than one batch,
sampling deterministically, and writing the sampled results to an output file. Finally, we include fixes for sampling with conditions
and updates to the documentation.

Bugs Fixed
* Fix write to file in sampling - Issue [732](https://github.com/sdv-dev/SDV/issues/732) by katxiao
* Conditional sampling doesn't work if the model has a CustomConstraint - Issue [696](https://github.com/sdv-dev/SDV/issues/696) by katxiao

New Features
* Updates to GaussianCopula conditional sampling methods - Issue [729](https://github.com/sdv-dev/SDV/issues/729) by katxiao
* Update conditional sampling errors - Issue [730](https://github.com/sdv-dev/SDV/issues/730) by katxiao
* Enable Batch Sampling + Progress Bar - Issue [693](https://github.com/sdv-dev/SDV/issues/693) by katxiao
* Create sample_remaining_columns() method - Issue [692](https://github.com/sdv-dev/SDV/issues/692) by katxiao
* Create sample_conditions() method - Issue [691](https://github.com/sdv-dev/SDV/issues/691) by katxiao
* Improve sample() method - Issue [690](https://github.com/sdv-dev/SDV/issues/690) by katxiao
* Create Condition object - Issue [689](https://github.com/sdv-dev/SDV/issues/689) by katxiao
* Is it possible to generate data with new set of primary keys? - Issue [686](https://github.com/sdv-dev/SDV/issues/686) by katxiao
* No way to fix the random seed? - Issue [157](https://github.com/sdv-dev/SDV/issues/157) by katxiao
* Can you set a random state for the sdv.tabular.ctgan.CTGAN.sample method? - Issue [515](https://github.com/sdv-dev/SDV/issues/515) by katxiao
* generating different synthetic data while training the model multiple times. - Issue [299](https://github.com/sdv-dev/SDV/issues/299) by katxiao

Documentation Changes
* Typo in the document documentation - Issue [680](https://github.com/sdv-dev/SDV/issues/680) by katxiao

0.13.1

This release adds support for passing tabular constraints to the HMA1 model, and adds more explicit error handling for
metric evaluation. It also includes a fix for using categorical columns in the PAR model and documentation updates
for metadata and HMA1.

Bugs Fixed

* Categorical column after sequence_index column - Issue [314](https://github.com/sdv-dev/SDV/issues/314) by fealho

New Features

* Support passing tabular constraints to the HMA1 model - Issue [296](https://github.com/sdv-dev/SDV/issues/296) by katxiao
* Metric evaluation error handling metrics - Issue [638](https://github.com/sdv-dev/SDV/issues/638) by katxiao

Documentation Changes

* Make true/false values lowercase in Metadata Schema specification - Issue [664](https://github.com/sdv-dev/SDV/issues/664) by katxiao
* Update docstrings for hma1 methods - Issue [642](https://github.com/sdv-dev/SDV/issues/642) by katxiao

0.13.0

This release makes multiple improvements to different `Constraint` classes. The `Unique` constraint can now
handle columns with the name `index` and no longer crashes on subsets of the original data. The `Between`
constraint can now handle columns with nulls properly. The memory of all constraints was also improved.

Various other features and fixes were added. Conditional sampling no longer crashes when the `num_rows` argument
is not provided. Multiple localizations can now be used for PII fields. Scaffolding for integration tests was added
and the workflows now run `pip check`.

Additionally, this release adds support for Python 3.9!

Bugs Fixed

* Gaussian Copula – Memory Issue in Release 0.10.0 - Issue [459](https://github.com/sdv-dev/SDV/issues/459) by xamm
* Applying Unique Constraint errors when calling model.fit() on a subset of data - Issue [610](https://github.com/sdv-dev/SDV/issues/610) by xamm
* Calling sampling with conditions and without num_rows crashes - Issue [614](https://github.com/sdv-dev/SDV/issues/614) by xamm
* Metadata.visualize with path parameter throws AttributeError - Issue [634](https://github.com/sdv-dev/SDV/issues/634) by xamm
* The Unique constraint crashes when the data contains a column called index - Issue [616](https://github.com/sdv-dev/SDV/issues/616) by xamm
* The Unique constraint cannot handle non-default index - Issue [617](https://github.com/sdv-dev/SDV/issues/617) by xamm
* ConstraintsNotMetError when applying Between constraint on datetime columns containing null values - Issue [632](https://github.com/sdv-dev/SDV/issues/632) by katxiao

New Features

* Adds Multi localisations feature for PII fields defined in 308 - PR [609](https://github.com/sdv-dev/SDV/pull/609) by xamm

Housekeeping Tasks

* Support latest version of Faker - Issue [621](https://github.com/sdv-dev/SDV/issues/621) by katxiao
* Add scaffolding for Metadata integration tests - Issue [624](https://github.com/sdv-dev/SDV/issues/624) by katxiao
* Add support for Python 3.9 - Issue [631](https://github.com/sdv-dev/SDV/issues/631) by amontanez24

Internal Improvements

* Add pip check to CI workflows - Issue [626](https://github.com/sdv-dev/SDV/issues/626) by pvk-developer

Documentation Changes

* Anonymizing PII in single table tutorials states address field as e-mail type - Issue [604](https://github.com/sdv-dev/SDV/issues/604) by xamm

Special thanks to xamm, katxiao, pvk-developer and amontanez24 for all the work that made this release possible!

0.12.1

This release fixes bugs in constraints, metadata behavior, and SDV documentation. Specifically, we added
proper handling of data containing null values for constraints and timeseries data, and updated the
default metadata detection behavior.

Bugs Fixed

* ValueError: The parameter loc has invalid values - Issue [353](https://github.com/sdv-dev/SDV/issues/353) by fealho
* Gaussian Copula is generating different data with metadata and without metadata - Issue [576](https://github.com/sdv-dev/SDV/issues/576) by katxiao
* Make pomegranate an optional dependency - Issue [567](https://github.com/sdv-dev/SDV/issues/567) by katxiao
* Small wording change for Question Issue Template - Issue [571](https://github.com/sdv-dev/SDV/issues/571) by katxiao
* ConstraintsNotMetError when using GreaterThan constraint with datetime - Issue [590](https://github.com/sdv-dev/SDV/issues/590) by katxiao
* GreaterThan constraint crashing with NaN values - Issue [592](https://github.com/sdv-dev/SDV/issues/592) by katxiao
* Null values in GreaterThan constraint raises error - Issue [589](https://github.com/sdv-dev/SDV/issues/589) by katxiao
* ColumnFormula raises ConstraintsNotMetError when checking NaN values - Issue [593](https://github.com/sdv-dev/SDV/issues/593) by katxiao
* GreaterThan constraint raises TypeError when using datetime - Issue [596](https://github.com/sdv-dev/SDV/issues/596) by katxiao
* Fix repository language - Issue [464](https://github.com/sdv-dev/SDV/issues/464) by fealho
* Update __init__.py - Issue [578](https://github.com/sdv-dev/SDV/issues/578) by dyuliu
* IndexingError: Unalignable boolean - Issue [446](https://github.com/sdv-dev/SDV/issues/446) by fealho

0.12.0

This release focuses on improving and expanding upon the existing constraints. More specifically, the users can now
(1) specify multiple columns in `Positive` and `Negative` constraints, (2) use the new `Unique`constraint and
(3) use datetime data with the `Between` constraint. Additionaly, error messages have been added and updated
to provide more useful feedback to the user.

Besides the added features, several bugs regarding the `UniqueCombinations` and `ColumnFormula` constraints have been fixed,
and an error in the metadata.json for the `student_placements` dataset was corrected. The release also added documentation
for the `fit_columns_model` which affects the majority of the available constraints.

New Features

* Change default fit_columns_model to False - Issue [550](https://github.com/sdv-dev/SDV/issues/550) by katxiao
* Support multi-column specification for positive and negative constraint - Issue [545](https://github.com/sdv-dev/SDV/issues/545) by sarahmish
* Raise error when multiple constraints can't be enforced - Issue [541](https://github.com/sdv-dev/SDV/issues/541) by amontanez24
* Create Unique Constraint - Issue [532](https://github.com/sdv-dev/SDV/issues/532) by amontanez24
* Passing invalid conditions when using constraints produces unreadable errors - Issue [511](https://github.com/sdv-dev/SDV/issues/511) by katxiao
* Improve error message for ColumnFormula constraint when constraint column used in formula - Issue [508](https://github.com/sdv-dev/SDV/issues/508) by katxiao
* Add datetime functionality to Between constraint - Issue [504](https://github.com/sdv-dev/SDV/issues/504) by katxiao

Bugs Fixed

* UniqueCombinations constraint with handling_strategy = 'transform' yields synthetic data with nan values - Issue [521](https://github.com/sdv-dev/SDV/issues/521) by katxiao and csala
* UniqueCombinations constraint outputting wrong data type - Issue [510](https://github.com/sdv-dev/SDV/issues/510) by katxiao and csala
* UniqueCombinations constraint on only one column gets stuck in an infinite loop - Issue [509](https://github.com/sdv-dev/SDV/issues/509) by katxiao
* Conditioning on a non-constraint column using the ColumnFormula constraint - Issue [507](https://github.com/sdv-dev/SDV/issues/507) by katxiao
* Conditioning on the constraint column of the ColumnFormula constraint - Issue [506](https://github.com/sdv-dev/SDV/issues/506) by katxiao
* Update metadata.json for duration of student_placements dataset - Issue [503](https://github.com/sdv-dev/SDV/issues/503) by amontanez24
* Unit test for HMA1 when working with a single child row per parent row - Issue [497](https://github.com/sdv-dev/SDV/issues/497) by pvk-developer
* UniqueCombinations constraint for more than 2 columns - Issue [494](https://github.com/sdv-dev/SDV/issues/494) by katxiao and csala

Documentation Changes

* Add explanation of fit_columns_model to API docs - Issue [517](https://github.com/sdv-dev/SDV/issues/517) by katxiao

Page 5 of 10

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.