Goose3

Latest version: v3.1.19


3.1.7

* Fixed author configuration [see PR 96](https://github.com/goose3/goose3/pull/96)
* Improve parent node scoring to get more of the correct data [see PR 102](https://github.com/goose3/goose3/pull/102) Thanks [skruse](https://github.com/skruse)
* Use `__slots__` for configuration helper classes

3.1.6

* Improved handling of page encoding [see PR 92](https://github.com/goose3/goose3/pull/92)
* Improved author and published date extraction [see PR 93](https://github.com/goose3/goose3/pull/93) Thanks [timoilya](https://github.com/timoilya)!
* Added additional schema extractors for schema.org parser [see PR 89](https://github.com/goose3/goose3/pull/89)
* Allow pulling more than the first `og:type` value for Opengraph [see PR 90](https://github.com/goose3/goose3/pull/90)

3.1.5

* Added additional date parsing [see PR 71](https://github.com/goose3/goose3/pull/71) Thanks [dlrobertson](https://github.com/dlrobertson)!
* Added datetime representation of the publish date `publish_datetime_utc` [see issue 72](https://github.com/goose3/goose3/issues/72)
* Fixed mismatched encoding error [see issue 74](https://github.com/goose3/goose3/issues/74)
* Fixed og_type with NoneType error [see issue 81](https://github.com/goose3/goose3/issues/81) Thanks [dust0x](https://github.com/dust0x)!
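
As a rough illustration of what a UTC datetime representation of a publish date involves, the sketch below normalizes an ISO 8601 string to UTC using only the standard library. The helper name `to_utc` and the choice to treat naive timestamps as UTC are assumptions made for this example; this is not goose3's own code.

```python
from datetime import datetime, timezone


def to_utc(publish_date: str) -> datetime:
    """Hypothetical helper: parse an ISO 8601 string and normalize it to UTC."""
    parsed = datetime.fromisoformat(publish_date)
    if parsed.tzinfo is None:
        # Assumption for this sketch: a timestamp without an offset is already UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


if __name__ == "__main__":
    print(to_utc("2019-03-01T12:30:00+02:00").isoformat())
```

A timezone-aware value like this can be compared and sorted safely across articles, which a raw date string cannot.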

3.1.4

* Fix IndexError when the title contains only a title splitter or is the site name [see issue 59](https://github.com/goose3/goose3/issues/48) Thanks [dlrobertson](https://github.com/dlrobertson)!
* Retry the `calculate_top_node` function with the root node if the first pass fails to find an article; this can happen when one or more known article patterns are found but none contain content [see PR 66](https://github.com/goose3/goose3/issues/48) Thanks [dlrobertson](https://github.com/dlrobertson)!
* Add parsing of schema.org's ReportageNewsArticle tags [see PR 67](https://github.com/goose3/goose3/issues/48) Thanks [dlrobertson](https://github.com/dlrobertson)!
* Add additional parsing of opengraph tags [see PR 64](https://github.com/goose3/goose3/issues/48) Thanks [dlrobertson](https://github.com/dlrobertson)!
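
The retry-with-root behaviour described in this release can be pictured with a small, self-contained sketch. Everything here is hypothetical: nodes are plain dictionaries, scoring is just text length, and the `calculate_top_node` below is a stand-in that only shares its name with the library function. It shows the fallback pattern, not goose3's actual scoring.

```python
from typing import List, Optional


def calculate_top_node(start: dict) -> Optional[dict]:
    """Stand-in scorer: return the node under `start` with the most text, or None."""
    best, best_len = None, 0
    stack = [start]
    while stack:
        node = stack.pop()
        text_len = len(node.get("text", "").strip())
        if text_len > best_len:
            best, best_len = node, text_len
        stack.extend(node.get("children", []))
    return best


def find_article(root: dict, known_pattern_nodes: List[dict]) -> Optional[dict]:
    """First pass over nodes matching known article patterns, then retry from the root."""
    for node in known_pattern_nodes:
        top = calculate_top_node(node)
        if top is not None:
            return top
    # None of the known patterns held content, so fall back to the whole document.
    return calculate_top_node(root)
```

With an empty `article` element next to a `div` that holds the body text, the first pass returns nothing and the retry from the root finds the `div`.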

3.1.3

* Catch mismatched encoding between the meta tag and the document encoding [see pull request 53](https://github.com/goose3/goose3/pull/53) Thanks [jeffquach](https://github.com/jeffquach)!
* Parse headers and include in `cleaned_text`
* Additional configuration options:
  * Parse Headers: `parse_headers`
  * Parse Lists: `parse_lists`
  * Pretty Lists: `pretty_lists`
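
A minimal sketch of how these options might be set, assuming the installed goose3 version accepts a plain dictionary of configuration values in the `Goose` constructor. The option names come from the list above; the values and the helper `extract_cleaned_text` are illustrative only.

```python
# Option names are taken from the changelog; the values are examples, not recommendations.
config = {
    "parse_headers": True,
    "parse_lists": True,
    "pretty_lists": True,
}


def extract_cleaned_text(raw_html: str, options: dict) -> str:
    """Hypothetical helper: run an extraction with the given options."""
    # Imported lazily so the config above can be inspected without goose3 installed.
    from goose3 import Goose

    g = Goose(options)
    try:
        article = g.extract(raw_html=raw_html)
        return article.cleaned_text
    finally:
        g.close()
```

Setting an option to `False` should leave the corresponding elements out of `cleaned_text`, though the exact output depends on the page and the goose3 version in use.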

3.1.2

* Parse lists out if present in the main article
* Added configuration option `pretty_lists` to specify if a list should be represented as text or made to read like a list; default is `True`
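
To make the difference between the two representations concrete, here is a small illustration. The function `render_list` and the `* ` bullet marker are assumptions made for this example; the exact formatting goose3 produces may differ.

```python
from typing import List


def render_list(items: List[str], pretty: bool = True) -> str:
    """Hypothetical renderer: list-like output when pretty, running text otherwise."""
    if pretty:
        # One item per line with a bullet marker, so the text still reads like a list.
        return "\n".join(f"* {item}" for item in items)
    # Plain text: items simply run together, separated by spaces.
    return " ".join(items)


if __name__ == "__main__":
    print(render_list(["first point", "second point"]))
    print(render_list(["first point", "second point"], pretty=False))
```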

