Sheetwork

Latest version: v1.0.7

Safety actively analyzes 629811 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 2 of 2

1.0.1

=============================================================================

Bug Fixes
---------

- `252 <https://github.com/bastienboutonnet/sheetwork/issues/252>`_: The logger file handler now always prints ``DEBUG``-and-up level messages



Features
--------

- `231 <https://github.com/bastienboutonnet/sheetwork/issues/231>`_: * Users can now use their **end user/personal** account. Previously, it was only possible to authenticate using a **service_account**. Thanks for recent changes in ``gspread`` the auth flow allows for differenciation between end user and service account that is much simpler so we ported this to here https://github.com/burnash/gspread/pull/762

* **Internal Note/Curiosity**: To make ``oauth`` work we had to patch ``gspread``'s default credential paths (see https://github.com/burnash/gspread/issues/826). Hopefully, this is temporary.


- `253 <https://github.com/bastienboutonnet/sheetwork/issues/253>`_: Success of failure message for database table creation now fully qualifies the table (``<database>.<schema>.<table>``). This makes the messages a lot more usable to a user who might want to copy paste and check that the table has been correctly created. (Also some ugly hardcoding in the catalog queries have been squashed).


- `257 <https://github.com/bastienboutonnet/sheetwork/issues/257>`_: Allows more granular control over table (and schema) creation via the ``sheetwork_project.yml`` file:
- ``always_create_table`` is now the new way to ensure that tables always get created. Whether the table is going to be replaced or trucated is governed by the ``desctructive_create_table`` flag. Closes 251

**IMPORTANT NOTE**: ``always_create`` is now internally remapped to ``always_create_table`` and will be deprecated in a future major release.


- `262 <https://github.com/bastienboutonnet/sheetwork/issues/262>`_: The target schema now gets created if ``always_create_schema`` is ``True``. Under the hood: SnowflakeAdaptor checks if the schema already exists on the database before creating it.


- `265 <https://github.com/bastienboutonnet/sheetwork/issues/265>`_: In order to override (or set on demand) object creation (tables and schemas) we now provide the following CLI arguments:
- ``create-table``
- ``create-schema``
- ``destructive-create-table``

These arguments are "companions" which can override the following project configuration arguments:
- ``always_create_table``
- ``always_create_schema``
- ``destructive_create_table``

**IMPORTANT**:CLI flags will override whatever is already present in the project config


- `282 <https://github.com/bastienboutonnet/sheetwork/issues/282>`_: Sheetwork now retries obtaining google sheets up to 3 times (max duration 10s) if it hits an ``APIError`` because the end-user or service account was rate limited or other common service availability errors encountered by end users.

Check the PR to see the exact set of ``APIError`` that sheetwork will attempt retrying for.

Under The Hood/Misc
-------------------

- `229 <https://github.com/bastienboutonnet/sheetwork/issues/229>`_: Uses the new ``service_account()`` wrapper from ``gspread`` to authenticate to a service account. This brings some stability with regards to the deprecation of ``oauth2`` by google as the converstion is now handled by ``gspread``.

1.0.0

=========================================

Bug Fixes
---------

- `150 <https://github.com/bastienboutonnet/sheetwork/issues/150>`_: Columns that are now mentioned in the sheet.yml are first checked for presence in the sheet and ignored or skipped if not present with warning.


- `155 <https://github.com/bastienboutonnet/sheetwork/issues/155>`_: Schema specification hierarchy is fixed: Flags > Config > Project.


- `206 <https://github.com/bastienboutonnet/sheetwork/issues/206>`_: Pandas dataframe casting is disabled due to issues with mixed ints and strings (see #205, 204)


- `221 <https://github.com/bastienboutonnet/sheetwork/issues/221>`_: Attempts to reintroduce datatype casting to solve issue with dates converion (see #216 for issue). Since the mixed str and ints issue is not solved on pandas side, int conversion doesn't actually happen (for now Snowflake deals with it ok and converts to the reguested int format).



Features
--------

- `151 <https://github.com/bastienboutonnet/sheetwork/issues/151>`_: Raises errors when a sheet contains duplicate columns


- `156 <https://github.com/bastienboutonnet/sheetwork/issues/156>`_: Interactive cleanup is a bit more intereactive


- `169 <https://github.com/bastienboutonnet/sheetwork/issues/169>`_: Adds ``InitTask`` to ``sheetwork`` to ease users set their projects up.


- `195 <https://github.com/bastienboutonnet/sheetwork/issues/195>`_: Sheetwork now checks for available updates on start (provided you have an internet connection)



Under The Hood/Misc
-------------------

- `154 <https://github.com/bastienboutonnet/sheetwork/issues/154>`_: Logging to file always debug, logging messages in CLI look more like pretty prints.


- `161 <https://github.com/bastienboutonnet/sheetwork/issues/161>`_: Simplify ``SheetBag`` internals: ``check_table`` is moved to the db adapter


- `163 <https://github.com/bastienboutonnet/sheetwork/issues/163>`_: Fixes broken interactive flow of asking whether to push to db.


- `171 <https://github.com/bastienboutonnet/sheetwork/issues/171>`_: CLI logging/progress messages are now timed


- `173 <https://github.com/bastienboutonnet/sheetwork/issues/173>`_: Sheetwork now uses an adaptor/plugin design to allow and facilitate extensions of the tool to other databases.


- `193 <https://github.com/bastienboutonnet/sheetwork/issues/193>`_: CLI arguments are now POSIX


- `207 <https://github.com/bastienboutonnet/sheetwork/issues/207>`_: An proper sheetwork error is thrown when you do not provide a command to ``sheetwork`` in CLI


- `208 <https://github.com/bastienboutonnet/sheetwork/issues/208>`_: Profile error messages are now a bit more helpful and more nicely formatted


- `210 <https://github.com/bastienboutonnet/sheetwork/issues/210>`_: Use and try to fix most warnings from Pylance in an attempt to have more strict typing


- `215 <https://github.com/bastienboutonnet/sheetwork/issues/215>`_: Poetry is now used a the package and dependencies manager


- `218 <https://github.com/bastienboutonnet/sheetwork/issues/218>`_: When passing ``--log-level debug`` in CLI the format of the console output looks more like proper logs instead of the pretty prints to make following logs more easy


Previous releases
=================

There have been releases before. But at the time we were managing things differently. The old changelog can be consulted in `_old_changelog.md <_old_changelog.md>`_

0.2.1

Documentation Missed and was not usable.

0.1.0

Overview

This release focuses on making `sheetwork` a lot more flexible and robust. The main addition lies with the ability to read from a config file columns which data types need to be changed in table creation when pushed to database.

Features

- Is able to read configuratio info from a configuration file (`sheets.yml`) which contains (amongst other things) column typing often required after pulling a sheet from google as the data loaded by pandas is often interpreted as strings. [20](https://github.com/bastienboutonnet/sheetwork/pull/20)
- Allows for interactive CLI interface on whether to run default cleanups when passing `--i`. This cleanup was previously applied to all sheets by default. [17](https://github.com/bastienboutonnet/sheetwork/pull/17)
- Under the hood: validates the config file in a pretty strict way for missing tags which are required when reading from config or for unalowable tags which could potentially break things down the line. [24](https://github.com/bastienboutonnet/sheetwork/pull/24)

0.0.0

Overview

This is the very first version of sheetwork. It is named after the German electronic pioneers **"Kraftwerk"** as it is the pioneer (first version) of the package that will set and pave the way for future generations to come and we will all live happy forever ever after...

It loads Google Sheets into Snowflake, from the comand line and avoids the fast multiplication of `<insert_non_creative_name>_sheet_importer.py` type scripts.

Features

- Loads google sheet into a pandas DataFrame. [9](https://github.com/bastienboutonnet/sheetwork/pull/9)
- Performs basic cleaning.[10](https://github.com/bastienboutonnet/sheetwork/pull/10)
- Pushes data to Snowflake using [data_tools.push_pandas_to_snowflake()](https://github.com/tripactions/data_tooling/blob/master/data_tools/db/pandas.py#L230). [9](https://github.com/bastienboutonnet/sheetwork/pull/9)
- `--dry_run` functionality skips pushes to database and offers preview of datatypes, and head of dataframe that would be uploaded to database. [12](https://github.com/bastienboutonnet/sheetwork/pull/12)
- `--mode dev` overrides target schema to "sand" to avoid pushing the wrong data to a production table or to allow for full flow testing while behind user permissions.[11](https://github.com/bastienboutonnet/sheetwork/pull/12)
- `--force` can be added when running in `dev` mode to force a push to otherwise overidden target schema (see bullet above). [12](https://github.com/bastienboutonnet/sheetwork/pull/12)

Page 2 of 2

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.