Great-expectations

Latest version: v0.18.12

Safety actively analyzes 621931 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 17 of 40

0.15.7

* [FEATURE] Add Rule for TEXT semantic domains within the Onboarding Assistant (5144)
* [FEATURE] Helper method to determine whether Expectation is self-initializing (5159)
* [FEATURE] OnboardingDataAssistantResult plotting feature parity with VolumeDataAssistantResult (5145)
* [FEATURE] Example Notebook for self-initializing `Expectations` (5169)
* [FEATURE] DataAssistant: Enable passing directives to run() method using runtime_environment argument (5187)
* [FEATURE] Adding DataAssistantResult.get_expectation_suite(expectation_suite_name) method (5191)
* [FEATURE] Cronjob to automatically create release PR (5181)
* [BUGFIX] Insure TABLE Domain Metrics Do Not Get Column Key From Column Type Rule Domain Builder (5166)
* [BUGFIX] Update name for stdev expectation in `OnboardingDataAssistant` backend (5193)
* [BUGFIX] OnboardingDataAssistant and Underlying Metrics: Add Defensive Programming Into Metric Implementations So As To Avoid Warnings About Incompatible Data (5195)
* [BUGFIX] Insure that Histogram Metric in Pandas operates on numerical columns that do not have NULL values (5199)
* [BUGFIX] RuleBasedProfiler: Ensure that run() method runtime environment directives are handled correctly when existing setting is None (by default) (5202)
* [BUGFIX] In aggregate metrics, Spark Implementation already gets Column type as argument -- no need for F.col() as the operand is not a string. (5207)
* [DOCS] Update ToC with category links (5155)
* [DOCS] update on availability and parameters of conditional expectations (5150)
* [MAINTENANCE] Helper method for RBP Notebook tests that does clean-up (5171)
* [MAINTENANCE] Increase timeout for longer stages in Azure pipelines (5175)
* [MAINTENANCE] Rule-Based Profiler -- In ParameterBuilder insure that metrics are validated for conversion to numpy array (to avoid deprecation warnings) (5173)
* [MAINTENANCE] Increase timeout in packaging & installation pipeline (5178)
* [MAINTENANCE] OnboardingDataAssistant handle multiple expectations per domain (5170)
* [MAINTENANCE] Update timeout in pipelines to fit Azure syntax (5180)
* [MAINTENANCE] Error message when `Validator` is instantiated with Incorrect `BatchRequest` (5172)
* [MAINTENANCE] Don't include infinity in rendered string for diagnostics (5190)
* [MAINTENANCE] Mark Great Expectations Cloud tests and add stage to CI/CD (5186)
* [MAINTENANCE] Trigger expectation gallery build with scheduled CI/CD runs (5192)
* [MAINTENANCE] `expectation_gallery` Azure pipeline (5194)
* [MAINTENANCE] General cleanup/refactor of `DataAssistantResult` (5198)

0.15.6

* [FEATURE] `NumericMetricRangeMultiBatchParameterBuilder` kernel density estimation (5084)
* [FEATURE] Splitters and limit sample work on AWS Athena (5024)
* [FEATURE] `ColumnValuesLengthMin` and `ColumnValuesLengthMax` metrics (5107)
* [FEATURE] Use `batch_identifiers` in plot tooltips (5091)
* [FEATURE] Updated `DataAssistantResult` plotting API (5117)
* [FEATURE] Onboarding DataAssistant: Numeric Rules and Relevant Metrics (5120)
* [FEATURE] DateTime Rule for OnboardingDataAssistant (5121)
* [FEATURE] Categorical Rule is added to OnboardingDataAssistant (5134)
* [FEATURE] OnboardingDataAssistant: Introduce MeanTableColumnsSetMatchMultiBatchParameterBuilder (to enable expect_table_columns_to_match_set) (5135)
* [FEATURE] Giving the "expect_table_columns_to_match_set" Expectation Self-Initializing Capabilities. (5136)
* [FEATURE] For OnboardingDataAssistant: Implement a TABLE Domain level rule to output "expect_table_columns_to_match_set" (5137)
* [FEATURE] Enable self-initializing `ExpectColumnValueLengthsToBeBetween` (4985)
* [FEATURE] `DataAssistant` plotting for non-sequential batches (5126)
* [BUGFIX] Insure that Batch IDs are accessible in the order in which they were loaded in Validator (5112)
* [BUGFIX] Update `DataAssistant` notebook for new plotting API (5118)
* [BUGFIX] For DataAssistants, added try-except for Notebook tests (5124)
* [BUGFIX] CategoricalColumnDomainBuilder needs to accept limit_mode with dictionary type (5127)
* [BUGFIX] Use `external_sqldialect` mark to skip during lightweight runs (5139)
* [BUGFIX] Use RANDOM_STATE in fixture to make tests deterministic (5142)
* [BUGFIX] Read deployment_version instead of using versioneer in deprecation tests (5147)
* [MAINTENANCE] DataAssistant: Refactoring Access to common ParameterBuilder instances (5108)
* [MAINTENANCE] Refactor of`MetricTypes` and `AttributedResolvedMetrics` (5100)
* [MAINTENANCE] Remove references to show_cta_footer except in schemas.py (5111)
* [MAINTENANCE] Adding unit tests for sqlalchemy limit sampler part 1 (5109)
* [MAINTENANCE] Don't re-raise connection errors in CI (5115)
* [MAINTENANCE] Sqlite specific tests for splitting and sampling (5119)
* [MAINTENANCE] Add Trino dialect in SqlAlchemyDataset (5085) (thanks ms32035)
* [MAINTENANCE] Move upper bound on sqlalchemy to <2.0.0. (5140)
* [MAINTENANCE] Update primary pipeline to cut releases with tags (5128)
* [MAINTENANCE] Improve handling of "expect_column_unique_values_count_to_be_between" in VolumeDataAssistant (5146)
* [MAINTENANCE] Simplify DataAssistant Operation to not Depend on Self-Initializing Expectations (5148)
* [MAINTENANCE] Improvements to Trino support (5152)
* [MAINTENANCE] Update how_to_configure_a_new_checkpoint_using_test_yaml_config.md (5157)
* [MAINTENANCE] Speed up the site builder (5125) (thanks tanelk)
* [MAINTENANCE] remove account id deprecation notice (5158)

0.15.5

* [FEATURE] Add subset operation to Domain class (5049)
* [FEATURE] In DataAssistant: Use Domain instead of domain_type as key for Metrics Parameter Builders (5057)
* [FEATURE] Self-initializing `ExpectColumnStddevToBeBetween` (5065)
* [FEATURE] Enum used by DateSplitter able to be represented as YAML (5073)
* [FEATURE] Implementation of auto-complete for DataAssistant class names in Jupyter notebooks (5077)
* [FEATURE] Provide display ("friendly") names for batch identifiers (5086)
* [FEATURE] Onboarding DataAssistant -- Initial Rule Implementations (Data Aspects) (5101)
* [FEATURE] OnboardingDataAssistant: Implement Nullity/Non-nullity Rules and Associated Metrics (5104)
* [BUGFIX] `self_check()` now also checks for `aws_config_file` (5040)
* [BUGFIX] `multi_batch_rule_based_profiler` test up to date with RBP changes (5066)
* [BUGFIX] Splitting Support at Asset level (5026)
* [BUGFIX] Make self-initialization in expect_column_values_to_be_between truly multi batch (5068)
* [BUGFIX] databricks engine create temporary view (4994) (thanks gvillafanetapia)
* [BUGFIX] Patch broken Expectation gallery script (5090)
* [BUGFIX] Sampling support at asset level (5092)
* [DOCS] Update process and configurations in OpenLineage Action guide. (5039)
* [DOCS] Update process and config examples in Opsgenie guide (5037)
* [DOCS] Correct name of `openlineage-integration-common` package (5041) (thanks mobuchowski)
* [DOCS] Remove reference to validation operator process from how to trigger slack notifications guide (5034)
* [DOCS] Update process and configuration examples in email Action guide. (5036)
* [DOCS] Update Docusaurus version (5063)
* [MAINTENANCE] Saved output of usage stats schema script in repo (5053)
* [MAINTENANCE] Apply Altair custom themes to return objects (5044)
* [MAINTENANCE] Introducing RuleBasedProfilerResult -- neither expectation suite name nor expectation suite must be passed to RuleBasedProfiler.run() (5061)
* [MAINTENANCE] Refactor `DataAssistant` plotting to leverage utility dataclasses (5022)
* [MAINTENANCE] Check that a passed string is parseable as an integer (mssql limit param) (5071)
* [MAINTENANCE] Clean up mssql limit sampling code path and comments (5074)
* [MAINTENANCE] Make saving bootstraps histogram for NumericMetricRangeMultiBatchParameterBuilder optional (absent by default) (5075)
* [MAINTENANCE] Make self-initializing expectations return estimated kwargs with auto-generation timestamp and Great Expectation version (5076)
* [MAINTENANCE] Adding a unit test for batch_id mapping to batch display names (5087)
* [MAINTENANCE] `pypandoc` version constraint added (`< 1.8`) (5093)
* [MAINTENANCE] Utilize Rule objects in Profiler construction in DataAssistant (5089)
* [MAINTENANCE] Turn off metric calculation progress bars in `RuleBasedProfiler` and `DataAssistant` workflows (5080)
* [MAINTENANCE] A small refactor of ParamerBuilder management used in DataAssistant classes (5102)
* [MAINTENANCE] Convenience method refactor for Onboarding DataAssistant (5103)

0.15.4

* [FEATURE] Enable self-initializing `ExpectColumnMeanToBeBetween` (4986)
* [FEATURE] Enable self-initializing `ExpectColumnMedianToBeBetween` (4987)
* [FEATURE] Enable self-initializing `ExpectColumnSumToBeBetween` (4988)
* [FEATURE] New MetricSingleBatchParameterBuilder for specifically single-Batch Rule-Based Profiler scenarios (5003)
* [FEATURE] Enable Pandas DataFrame and Series as MetricValues Output of Metric ParameterBuilder Classes (5008)
* [FEATURE] Notebook for `VolumeDataAssistant` Example (5010)
* [FEATURE] Histogram/Partition Single-Batch ParameterBuilder (5011)
* [FEATURE] Update `DataAssistantResult.plot()` return value to emit `PlotResult` wrapper dataclass (4962)
* [FEATURE] Limit samplers work with supported sqlalchemy backends (5014)
* [FEATURE] trino support (5021)
* [BUGFIX] RBP Profiling Dataset ProgressBar Fix (4999)
* [BUGFIX] Fix DataAssistantResult serialization issue (5020)
* [DOCS] Update slack notification guide to not use validation operators. (4978)
* [MAINTENANCE] Update `autoupdate` GitHub action (5001)
* [MAINTENANCE] Move `DataAssistant` registry capabilities into `DataAssistantRegistry` to enable user aliasing (4991)
* [MAINTENANCE] Fix continuous partition example (4939) (thanks andyjessen)
* [MAINTENANCE] Preliminary refactors for data samplers. (4996)
* [MAINTENANCE] Clean up unused imports and enforce through `flake8` in CI/CD (5005)
* [MAINTENANCE] ParameterBuilder tests should maximally utilize polymorphism (5007)
* [MAINTENANCE] Clean up type hints in CLI (5006)
* [MAINTENANCE] Making ParameterBuilder metric computations robust to failures through logging and exception handling (5009)
* [MAINTENANCE] Condense column-level `vconcat` plots into one interactive plot (5002)
* [MAINTENANCE] Update version of `black` in pre-commit config (5019)
* [MAINTENANCE] Improve tooltips and formatting for distinct column values chart in VolumeDataAssistantResult (5017)
* [MAINTENANCE] Enhance configuring serialization for DotDict type classes (5023)
* [MAINTENANCE] Pyarrow upper bound (5028)

0.15.3

* [FEATURE] Enable self-initializing capabilities for `ExpectColumnProportionOfUniqueValuesToBeBetween` (4929)
* [FEATURE] Enable support for plotting both Table and Column charts in `VolumeDataAssistant` (4930)
* [FEATURE] BigQuery Temp Table Support (4925)
* [FEATURE] Registry for DataAssistant classes with ability to execute from DataContext by registered name (4966)
* [FEATURE] Enable self-intializing capabilities for `ExpectColumnValuesToMatchRegex`/`ExpectColumnValuesToNotMatchRegex` (4958)
* [FEATURE] Provide "estimation histogram" ParameterBuilder output details . (4975)
* [FEATURE] Enable self-initializing `ExpectColumnValuesToMatchStrftimeFormat` (4977)
* [BUGFIX] check contrib requirements (4922)
* [BUGFIX] Use `monkeypatch` to set a consistent bootstrap seed in tests (4960)
* [BUGFIX] Make all Builder Configuration classes of Rule-Based Profiler Configuration Serializable (4972)
* [BUGFIX] extras_require (4968)
* [BUGFIX] Fix broken packaging test and update `dgtest-overrides` (4976)
* [MAINTENANCE] Add timeout to `great_expectations` pipeline stages to prevent false positive build failures (4957)
* [MAINTENANCE] Defining Common Test Fixtures for DataAssistant Testing (4959)
* [MAINTENANCE] Temporarily pin `cryptography` package (4963)
* [MAINTENANCE] Type annotate relevant functions with `-> None` (per PEP 484) (4969)
* [MAINTENANCE] Handle edge cases where `false_positive_rate` is not in range [0, 1] or very close to bounds (4946)
* [MAINTENANCE] fix a typo (4974)

0.15.2

* [FEATURE] Split data assets using sql datetime columns (4871)
* [FEATURE] Plot metrics with `DataAssistantResult.plot()` (4873)
* [FEATURE] RuleBasedProfiler/DataAssistant/MetricMultiBatchParameterBuilder: Enable Returning Metric Computation Results with batch_id Attribution (4862)
* [FEATURE] Enable variables to be specified at both Profiler and its constituent individual Rule levels (4912)
* [FEATURE] Enable self-initializing `ExpectColumnUniqueValueCountToBeBetween` (4902)
* [FEATURE] Improve diagnostic testing process (4816)
* [FEATURE] Add Azure CI/CD action to aid with style guide enforcement (type hints) (4878)
* [FEATURE] Add Azure CI/CD action to aid with style guide enforcement (docstrings) (4617)
* [FEATURE] Use formal interfaces to clean up DataAssistant and DataAssistantResult modules/classes (4901)
* [BUGFIX] fix validation issue for column domain type and implement expect_column_unique_value_count_to_be_between for VolumeDataAssistant (4914)
* [BUGFIX] Fix issue with not using the generated table name on read (4905)
* [BUGFIX] Add deprecation comment to RuntimeDataConnector
* [BUGFIX] Ensure proper class_name within all RuleBasedProfilerConfig instantiations
* [BUGFIX] fix rounding directive handling (4887)
* [BUGFIX] `great_expectations` import fails when SQL Alchemy is not installed (4880)
* [MAINTENANCE] Altair types cleanup (4916)
* [MAINTENANCE] test: update test time (4911)
* [MAINTENANCE] Add module docstring and simplify access to DatePart (4910)
* [MAINTENANCE] Chip away at type hint violations around data context (4897)
* [MAINTENANCE] Improve error message outputted to user in DocstringChecker action (4895)
* [MAINTENANCE] Re-enable bigquery tests (4903)
* [MAINTENANCE] Unit tests for sqlalchemy splitter methods, docs and other improvements (4900)
* [MAINTENANCE] Move plot logic from `DataAssistant` into `DataAssistantResult` (4896)
* [MAINTENANCE] Add condition to primary pipeline to ensure `import_ge` stage doesn't cause misleading Slack notifications (4898)
* [MAINTENANCE] Refactor `RuleBasedProfilerConfig` (4882)
* [MAINTENANCE] Refactor DataAssistant Access to Parameter Computation Results and Plotting Utilities (4893)
* [MAINTENANCE] Update `dgtest-overrides` list to include all test files not captured by primary strategy (4891)
* [MAINTENANCE] Add dgtest-overrides section to dependency_graph Azure pipeline
* [MAINTENANCE] Datasource and DataContext-level tests for RuntimeDataConnector changes (4866)
* [MAINTENANCE] Temporarily disable bigquery tests. (4888)
* [MAINTENANCE] Import GE after running `ge init` in packaging CI pipeline (4885)
* [MAINTENANCE] Add CI stage importing GE with only required dependencies installed (4884)
* [MAINTENANCE] `DataAssistantResult.plot()` conditional formatting and tooltips (4881)
* [MAINTENANCE] split data context files (4879)
* [MAINTENANCE] Add Tanner to CODEOWNERS for schemas.py (4875)
* [MAINTENANCE] Use defined constants for ParameterNode accessor keys (4872)

Page 17 of 40

Links

Releases

Has known vulnerabilities

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.