Recordlinkage

Latest version: v0.16

Safety actively analyzes 630052 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 4 of 4

0.3.1

0.3

This version contains a lot of changes to the API. Hopefully, there are no large API changes needed for now.
- Total restructure of compare functions (The end of changing the API is close to now.)
- Compare method `numerical` is now named `numeric` and `fuzzy` is now named `string`.
- Add haversine formula to compare geographical records.
- Use numexpr for computing numeric comparisons.
- Add step, linear and squared comparing.
- Add eye index method.
- Improve, update and add new tests.
- Remove iterative indexing functions.
- New add chunks for indexing functions. These chunks are defined in the class Pairs. If chunks are defined, then the indexing functions returns a generator with an Index for each element.
- Update documentation.
- Various bug fixes.

0.2

- Full Python3 support
- Update the parameters of the Logistic Regression Classifier manually. In literature, this is often denoted as the _deterministic record linkage_.
- Expectation/Conditional Maxisation algorithm completely rewritten. The performance of the algorithm is much better now. The algorithm is still experimental.
- New string comparison metrics: Q-gram string comparing and Cosine string comparing.
- New indexing algorithm: Q-gram indexing.
- Several internal tests.
- Updated documenation.
- BernoulliNBClassifier is now named NaiveBayesClassifier. No changes to the algorithm.
- Arguments order in compare functions corrected.
- Function to clean phone numbers
- Return the result of the classifier as index, numpy array or pandas series.
- Many bug fixes

0.1.2

In the version are the following things added or changed:
- Arguments in compare functions renamed.
- Remove exact comparing of dataframes and add efficiency tricks for exact comparing.
- Update documentation about comparing, classifying and evaluation.

0.1.1

This update includes:
- Updated documentation about indexing, comparing and classification
- Improved performance for some indexing methods
- Random indexing returns now exact number of record pairs
- Argumens renamed in comparing functions

0.1.0

The is the first big release of the record linkage package. See the [documentation](http://recordlinkage.readthedocs.org/en/latest/) for information about the available functions. The framework needs to be extended with more functions, but there is a stable, easily extendable, framework to do that. More information how to do that is coming.

Page 4 of 4

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.