Dataprep

Latest version: v0.4.5

0.4.4

Bugfixes 🐛
* eda: type error for npartitions (57db1ede)
* eda.create-db-report: remove pystache dependency and replace it with jinja2 (676fff1a)
* eda.create-db-report: add missing style files previously ignored by gitignore (75361915)
* eda: jinja2.markup import broken with 3.1 (b9b60a0a)
* eda: fixed create_report browser sort rendering issue, returned context values directly instead of selecting by css class (331a9644)
* eda: report for empty df (485e58d3)
* eda: plot_diff when columns are not aligned (7e53dbf6)
* eda: scipy version issue (8798a146)
* eda: NA column name when upgrading dask (43fdd1a6)
* eda: pd grouper issue when upgrading dask (761c4455)
* clean: delete redundant print (0e072a80)
* eda.plot: fix display issue in notebook (6ed13b09)
* eda.plot: fix pagination styling issues (8396f2d9)
* eda: restyled plots into same row, set height + width of plots to be same (c6ffcd4d)
* eda: interaction error in report for cat-only df (e60239a0)
* eda: fix cat-cat error (94f70ef6)
* eda: fix stat layout issue (5bb535d7)
* eda.create_report: fix display issue in notebook (487659fd)
* clean: remove usaddress library (c192ab43)
* clean: fix the bug of am, pm (4c3b2312)
* clean: fix the bug of am, pm (caf2b372)
* eda: fixed issue where plots weren't rendering twice (fd3fd573)
* eda: wordcloud setting in terminal (00901699)

Features ✨
* clean: add updated version of rapidfuzz and python-crfsuite (59f35066)
* eda.create-db-report: add save report functionality (2fb16ad6)
* eda: add get_db_names (a7bf8206)
* eda: added sorting feature for create_diff_report (8b187a6c)
* eda: add running total for time series test (d0940726)
* eda: add create_db_report submodule (9784cceb)
* eda.plot: add pagination threshold and add auto jump in pagination navigation (cfdd0dec)
* eda.create_report: add sort by approximate unique (5738db2a)
* eda: add sort variables by alphabetical and missing (fb93493a)
* clean: New version of GUI (6828807b)
* eda: enriched show details tab by adding plots and overview statistics (eeb210db)

Code Quality + Testing 💯
* eda: add test for npartition type error (5affd75a)
* eda: add tests for intermediate compute functions (700add77)

Documentation 📃
* eda: add the use-case of dataprep.eda for spark dataframe with ray (4bf14e7c)
* clean: revise __init__.py (02ede811)
* clean: add doc of clean GUI (5e2f38ac)
* eda.plot: add pagination for plot (c4cd4b97)
* eda.create_report: remove old doc file (e1153cb1)
* eda.create_report: convert rst docs file to ipynb and add additional docs for variables sort (bf39a568)
* eda: add doc for getting imdt result (6fbcfe4c)
* eda: add doc for running dataprep.eda on Hadoop YARN (628686d5)

Contributors this release 🏆

The following users contributed code to DataPrep since the last release.

* Andrey Pham \<andrey.phammove.com\> (First time contributor) ⭐️
* astellarius \<zak.lake0gmail.com\> (First time contributor) ⭐️
* Bowen0729 \<bowen0729qq.com\> (First time contributor) ⭐️
* Danrui Qi \<qidanruigmail.com\> (First time contributor) ⭐️
* Danrui QI \<danruiqiDanruis-MBP.hitronhub.home\> (First time contributor) ⭐️
* dependabot[bot] \<49699333+dependabot[bot]users.noreply.github.com\> (First time contributor) ⭐️
* Devin \<devinlluhotmail.com\> (First time contributor) ⭐️
* Devin Lu \<ludevinlsfu.ca\>
* Grey Murav \<65895033+gremurusers.noreply.github.com\> (First time contributor) ⭐️
* henryye \<yixuysfu.ca\> (First time contributor) ⭐️
* Jinglin Peng \<jlpengcsgmail.com\>
* jwa345 \<jwa345sfu.ca\> (First time contributor) ⭐️
* qidanrui \<qidanruigmail.com\>
* Sultan Orazbayev \<contacteconpoint.com\> (First time contributor) ⭐️
* Weiyuan Wu \<youngwsfu.ca\>

🎉🎉 Thank you! 🎉🎉

0.4.4alpha.1

Bugfixes 🐛
* eda.create-db-report: add missing style files previously ignored by gitignore (75361915)
* eda: jinja2.markup import broken with 3.1 (b9b60a0a)
* eda: fixed create_report browser sort rendering issue, returned context values directly instead of selecting by css class (331a9644)
* eda: report for empty df (485e58d3)
* eda: plot_diff when columns are not aligned (7e53dbf6)
* eda: scipy version issue (8798a146)
* eda: NA column name when upgrading dask (43fdd1a6)
* eda: pd grouper issue when upgrading dask (761c4455)
* clean: delete redundant print (0e072a80)
* eda.plot: fix display issue in notebook (6ed13b09)
* eda.plot: fix pagination styling issues (8396f2d9)
* eda: restyled plots into same row, set height + width of plots to be same (c6ffcd4d)
* eda: interaction error in report for cat-only df (e60239a0)
* eda: fix cat-cat error (94f70ef6)
* eda: fix stat layout issue (5bb535d7)
* eda.create_report: fix display issue in notebook (487659fd)
* clean: remove usaddress library (c192ab43)
* clean: fix the bug of am, pm (4c3b2312)
* clean: fix the bug of am, pm (caf2b372)
* eda: fixed issue where plots weren't rendering twice (fd3fd573)
* eda: wordcloud setting in terminal (00901699)

Features ✨
* eda: added sorting feature for create_diff_report (8b187a6c)
* eda: add running total for time series test (d0940726)
* eda: add create_db_report submodule (9784cceb)
* eda.plot: add pagination threshold and add auto jump in pagination navigation (cfdd0dec)
* eda.create_report: add sort by approximate unique (5738db2a)
* eda: add sort variables by alphabetical and missing (fb93493a)
* clean: New version of GUI (6828807b)
* eda: enriched show details tab by adding plots and overview statistics (eeb210db)

Code Quality + Testing 💯
* eda: add tests for intermediate compute functions (700add77)

Documentation 📃
* clean: revise __init__.py (02ede811)
* clean: add doc of clean GUI (5e2f38ac)
* eda.plot: add pagination for plot (c4cd4b97)
* eda.create_report: remove old doc file (e1153cb1)
* eda.create_report: convert rst docs file to ipynb and add additional docs for variables sort (bf39a568)
* eda: add doc for getting imdt result (6fbcfe4c)
* eda: add doc for running dataprep.eda on Hadoop YARN (628686d5)

Contributors this release 🏆

The following users contributed code to DataPrep since the last release.

* Andrey Pham \<andrey.phammove.com\> (First time contributor) ⭐️
* Bowen0729 \<bowen0729qq.com\> (First time contributor) ⭐️
* Danrui Qi \<qidanruigmail.com\> (First time contributor) ⭐️
* Danrui QI \<danruiqiDanruis-MBP.hitronhub.home\> (First time contributor) ⭐️
* dependabot[bot] \<49699333+dependabot[bot]users.noreply.github.com\> (First time contributor) ⭐️
* Devin \<devinlluhotmail.com\> (First time contributor) ⭐️
* Devin Lu \<ludevinlsfu.ca\>
* Grey Murav \<65895033+gremurusers.noreply.github.com\> (First time contributor) ⭐️
* henryye \<yixuysfu.ca\> (First time contributor) ⭐️
* Jinglin Peng \<jlpengcsgmail.com\>
* jwa345 \<jwa345sfu.ca\> (First time contributor) ⭐️
* qidanrui \<qidanruigmail.com\>
* Weiyuan Wu \<youngwsfu.ca\>

🎉🎉 Thank you! 🎉🎉

0.4.3

Bugfixes 🐛
* eda: fixed create_report browser sort rendering issue, returned context values directly instead of selecting by css class (331a9644)
* eda: report for empty df (485e58d3)
* eda: plot_diff when columns are not aligned (7e53dbf6)
* eda: scipy version issue (8798a146)
* eda: NA column name when upgrading dask (43fdd1a6)
* eda: pd grouper issue when upgrading dask (761c4455)
* clean: delete redundant print (0e072a80)
* eda.plot: fix display issue in notebook (6ed13b09)
* eda.plot: fix pagination styling issues (8396f2d9)
* eda: restyled plots into same row, set height + width of plots to be same (c6ffcd4d)
* eda: interaction error in report for cat-only df (e60239a0)
* eda: fix cat-cat error (94f70ef6)
* eda: fix stat layout issue (5bb535d7)
* eda.create_report: fix display issue in notebook (487659fd)
* clean: remove usaddress library (c192ab43)
* clean: fix the bug of am, pm (4c3b2312)
* clean: fix the bug of am, pm (caf2b372)
* eda: fixed issue where plots weren't rendering twice (fd3fd573)
* eda: wordcloud setting in terminal (00901699)

Features ✨
* eda: added sorting feature for create_diff_report (8b187a6c)
* eda: add running total for time series test (d0940726)
* eda: add create_db_report submodule (9784cceb)
* eda.plot: add pagination threshold and add auto jump in pagination navigation (cfdd0dec)
* eda.create_report: add sort by approximate unique (5738db2a)
* eda: add sort variables by alphabetical and missing (fb93493a)
* clean: New version of GUI (6828807b)
* eda: enriched show details tab by adding plots and overview statistics (eeb210db)

Code Quality + Testing 💯
* eda: add tests for intermediate compute functions (700add77)

Documentation 📃
* clean: revise __init__.py (02ede811)
* clean: add doc of clean GUI (5e2f38ac)
* eda.plot: add pagination for plot (c4cd4b97)
* eda.create_report: remove old doc file (e1153cb1)
* eda.create_report: convert rst docs file to ipynb and add additional docs for variables sort (bf39a568)
* eda: add doc for getting imdt result (6fbcfe4c)
* eda: add doc for running dataprep.eda on Hadoop YARN (628686d5)

Contributors this release 🏆

The following users contributed code to DataPrep since the last release.

* Andrey Pham \<andrey.phammove.com\> (First time contributor) ⭐️
* Bowen0729 \<bowen0729qq.com\> (First time contributor) ⭐️
* Danrui Qi \<qidanruigmail.com\> (First time contributor) ⭐️
* Danrui QI \<danruiqiDanruis-MBP.hitronhub.home\> (First time contributor) ⭐️
* dependabot[bot] \<49699333+dependabot[bot]users.noreply.github.com\> (First time contributor) ⭐️
* Devin \<devinlluhotmail.com\> (First time contributor) ⭐️
* Devin Lu \<ludevinlsfu.ca\>
* Grey Murav \<65895033+gremurusers.noreply.github.com\> (First time contributor) ⭐️
* henryye \<yixuysfu.ca\> (First time contributor) ⭐️
* Jinglin Peng \<jlpengcsgmail.com\>
* jwa345 \<jwa345sfu.ca\> (First time contributor) ⭐️
* qidanrui \<qidanruigmail.com\>
* Weiyuan Wu \<youngwsfu.ca\>

🎉🎉 Thank you! 🎉🎉

0.4.2

Bugfixes 🐛
* eda: NA column name when upgrading dask (43fdd1a6)
* eda: pd grouper issue when upgrading dask (761c4455)
* clean: delete redundant print (0e072a80)
* eda.plot: fix display issue in notebook (6ed13b09)
* eda.plot: fix pagination styling issues (8396f2d9)
* eda: restyled plots into same row, set height + width of plots to be same (c6ffcd4d)
* eda: interaction error in report for cat-only df (e60239a0)
* eda: fix cat-cat error (94f70ef6)
* eda: fix stat layout issue (5bb535d7)
* eda.create_report: fix display issue in notebook (487659fd)
* clean: remove usaddress library (c192ab43)
* clean: fix the bug of am, pm (4c3b2312)
* clean: fix the bug of am, pm (caf2b372)
* eda: fixed issue where plots weren't rendering twice (fd3fd573)
* eda: wordcloud setting in terminal (00901699)

Features ✨
* eda.plot: add pagination threshold and add auto jump in pagination navigation (cfdd0dec)
* eda.create_report: add sort by approximate unique (5738db2a)
* eda: add sort variables by alphabetical and missing (fb93493a)
* clean: New version of GUI (6828807b)
* eda: enriched show details tab by adding plots and overview statistics (eeb210db)

Code Quality + Testing 💯
* eda: add tests for intermediate compute functions (700add77)

Documentation 📃
* clean: add doc of clean GUI (5e2f38ac)
* eda.plot: add pagination for plot (c4cd4b97)
* eda.create_report: remove old doc file (e1153cb1)
* eda.create_report: convert rst docs file to ipynb and add additional docs for variables sort (bf39a568)
* eda: add doc for getting imdt result (6fbcfe4c)
* eda: add doc for running dataprep.eda on Hadoop YARN (628686d5)

Contributors this release 🏆

The following users contributed code to DataPrep since the last release.

* Andrey Pham \<andrey.phammove.com\> (First time contributor) ⭐️
* Bowen0729 \<bowen0729qq.com\> (First time contributor) ⭐️
* dependabot[bot] \<49699333+dependabot[bot]users.noreply.github.com\> (First time contributor) ⭐️
* Devin \<devinlluhotmail.com\> (First time contributor) ⭐️
* Devin Lu \<ludevinlsfu.ca\>
* Grey Murav \<65895033+gremurusers.noreply.github.com\> (First time contributor) ⭐️
* henryye \<yixuysfu.ca\> (First time contributor) ⭐️
* Jinglin Peng \<jlpengcsgmail.com\>
* jwa345 \<jwa345sfu.ca\> (First time contributor) ⭐️
* qidanrui \<qidanruigmail.com\>
* Weiyuan Wu \<youngwsfu.ca\>

🎉🎉 Thank you! 🎉🎉

0.4.1

Bugfixes 🐛
* eda: stat layout in plot (946319f7)
* eda: fix display in plot(df) (c11bb94c)
* eda: report for pandas extension type (2cbb3873)
* eda: fix saving imdt as json file (5ee6529f)

Features ✨
* clean: Add wiki and simple GUI (7f4ab12a)
* eda: added overview and variables section for create_diff_report (dc4cf7da)
* eda: add categorical interaction in create_report (7f13cd57)

Code Quality + Testing 💯
* eda: added basic automated tests (3a0653e0)

Documentation 📃
* eda: link create_diff_report to intro (05d9850b)
* eda: added docs for create_diff_report (d8fc9d4b)
* eda: enrich parameters in report (3d0a148a)

Contributors this release 🏆

The following users contributed code to DataPrep since the last release.

* Devin Lu \<ludevinlsfu.ca\>
* Jinglin Peng \<jlpengcsgmail.com\>
* qidanrui \<qidanruigmail.com\>
* waterpine \<songbianzju.edu.cn\>
* Weiyuan Wu \<youngwsfu.ca\>
* Xiaoying Wang \<xiaoying_wangsfu.ca\> (First time contributor) ⭐️
* Xiaoying Wang \<wangxiaoying0369gmail.com\>

🎉🎉 Thank you! 🎉🎉

0.4.0

Bugfixes 🐛
* eda: fix string type (b7e3321f)
* eda: fix value table display (57281bc2)
* eda: remove imdt output from plot (5c227e15)
* eda: adjusted save report method to accept one parameter (4ceefcc1)
* eda: clean config code and fix scatter sample param (8ab27f92)
* plot_diff: fix ci issue (44ce81cf)
* clean: clean_duplication issue 646 (ca9f7085)
* eda: fix category type error (9750694a)

Features ✨
* eda: refactored code and added density parameter to plot_diff(df) (323ae6b0)
* eda: save imdt as json file (78673867)
* connector: integrate connectorx into connector (106457e3, a64e3563, 9f89d3bf)
* clean: add clean_ml function (909cd196)
* clean: add multiple clean functions for number types (3c05be58)
* eda.diff: add plot_diff([df1..dfn], continuous) (3bfb4f57)
* clean: support conversion into packed binary format in clean_ip (7e30f93f, 37a83b03)

Code Quality + Testing 💯
* eda: add densify test and doc for diff (f8d2054d)
* eda: add test for config (ab3172f5)

Performance 🚀
* clean: update documentation of clean_duplication (50f90fa9)

Documentation 📃
* clean: change the introduction (862b4478)
* eda: change eda colab position (ce25b17d, d00b0bd5)
* clean: add documentation for multiple clean functions for number types (732480f1)
* clean: add documentation for clean_ml function (0c139db6)
* eda: scatter.sample_rate added to documentation (549b3193)
* eda: fix plot show (0b40a40f)
* readme: add benchmark link (e807f798)
* readme: small text change on clean and connector (e193a6a7)
* readme: fix titanic link (29cc06cc)

Contributors this release 🏆

The following users contributed code to DataPrep since the last release.

* Devin Lu \<devinlluhotmail.com\> (First time contributor) ⭐️
* dylanzxc \<zca92sfu.ca\>
* Jinglin Peng \<jlpengcsgmail.com\>
* Noir Tree \<2515744793qq.com\> (First time contributor) ⭐️
* pwwang \<1188067+pwwangusers.noreply.github.com\> (First time contributor) ⭐️
* qidanrui \<qidanruigmail.com\>
* sahmad11 \<53022377+sahmad11users.noreply.github.com\> (First time contributor) ⭐️
* waterpine \<songbianzju.edu.cn\>
* Weiyuan Wu \<youngwsfu.ca\>
* Xiaoying Wang \<wangxiaoying0369gmail.com\> (First time contributor) ⭐️

🎉🎉 Thank you! 🎉🎉
