- fix regression for fast extraction introduced in e8b3538 (96) - fix setup by making backports-datetime-fromisoformat optional (95)
1.5.0
- slightly higher accuracy with revised heuristics - simplified code structure for better performance - setup: support for 3.12, fromisoformat backport if applicable - HTML parsing fixes: more lenient parsing, pinned LXML version for MacOS
- support min_date/max_date as datetimes or datetime strings with kernc (73) - add date attributes to HTML extraction with kernc (74) - fix for extraction of updated and original dates in time elements - code refactoring and maintenance
1.4.1
- better coverage of relevant HTML attributes - automatically define upper time bound at each function call (70) - reviewed and simplified extraction code - cache validation for format diverging from `%Y-%m-%d` - updated dependencies and removed real-world tests from package
1.4.0
- additional search of free text in whole document (67) - optional parameter for subdaily precision with getorca (66) - fix for HTML doctype parsing (44) - cleaner code for multilingual month expressions - extended expressions for extraction in HTML meta fields - update of dependencies and evaluation