Textdirectory

Latest version: v0.3.3

0.3.3
==================
* added filter_by_filenames
* added filter_by_filename_not_contains
* added transform_to_files
* added transformation_eebop4_to_plaintext
* added transformation_replace_digits
* added transformation_ftfy
* added fast and skip_checkpoint options to load_files
* added __repr__ method to TextDirectory
* added examples
* upgraded to spaCy 3
* improved the test suite
* fixed some minor bugs

0.3.2
==================
* added transformation_expand_english_contractions
* fixed some minor bugs
* added __str__ method to TextDirectory
* added filename to __str__ output
* added ``get_text`` method

0.3.1
==================
* added long_description_content_type to setup.py

0.3.0
==================
* added transformation_remove_weird_tokens
* added transformation_lemmatizer
* fixed some minor bugs
* added a function to revert applied filters
* added a function that prints the current pipeline
* added a function that clears all transformations
* added helper functions to list available filters and transformations
* fixed a bug in which ``tabulate_flat_list_of_dicts`` would fail if the dictionary was empty
* ``self.aggregation`` no longer holds a copy of the files; it now holds references to the entries in ``self.files``
* transformations relying on spaCy now estimate a max_length based on available memory
* TextDirectory objects are now iterable
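
The two structural changes in this release (``self.aggregation`` referencing ``self.files`` instead of copying it, and iterable objects) can be illustrated with a minimal sketch. ``MiniTextDirectory`` and ``keep_min_tokens`` are hypothetical stand-ins written for this illustration, not the textdirectory implementation or its API; the sketch also assumes each file is represented as a dictionary.

```python
class MiniTextDirectory:
    """Hypothetical stand-in that mimics the behaviour described in 0.3.0."""

    def __init__(self, files):
        # One dict per file (assumed representation).
        self.files = list(files)
        # A separate list that points at the same dict objects, so no
        # file data is duplicated.
        self.aggregation = list(self.files)

    def keep_min_tokens(self, min_tokens):
        # Hypothetical filter: narrows the aggregation, leaves self.files intact.
        self.aggregation = [f for f in self.aggregation if f["tokens"] >= min_tokens]

    def __iter__(self):
        # Iterating the object walks the currently aggregated files.
        return iter(self.aggregation)


td = MiniTextDirectory([
    {"filename": "a.txt", "tokens": 3},
    {"filename": "b.txt", "tokens": 5},
])
td.keep_min_tokens(4)
names = [f["filename"] for f in td]
```

Because the aggregation only references entries in ``files``, a filter can shrink the aggregation and later be reverted without reloading anything from disk.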

0.2.2
==================
* changed the data packaging

0.2.1
==================
* added transformation_remove_stopwords
* added transformation_remove_htmltags
* fixed some minor bugs
