Trafilatura

Latest version: v1.9.0

Safety actively analyzes 629359 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 7 of 8

0.2.0

- better handling of nested elements, quotes and tables
- validation of XML TEI documents
- bulk download and processing

0.1.1

- handling of line breaks
- element trimming simplified

0.1.0

- first release used in production and meant to be archived for reproducibility and citability
- better extraction precision


0.0.5: last version compatible with Python 3.4
- optional dependencies
- bugs in parsing removed

0.0.4

- code profiling and speed-up

0.0.3

- tables included in extraction
- bypass justext in arguments
- better handling of non-p elements

0.0.2

- better handling of text nodes
- improvements in extraction recall

Page 7 of 8

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.