- XML sentence splitting: Added hr tag to default sentence breaks - Recognize Reddit links in shorthand notation - Improved robustness of XML processing
1.10.7
- Make recognition of gender star case insensitive - Fix problem with “nasty” character as last character of text unit
1.10.6
- Recognize gender star. - Improve recognition of lists of numbers, section numbers and IPv4 addresses
1.10.5
- Correctly tokenize flags followed by a variation selector. - Delete variation selector that occurs on its own.
1.10.4
- Bugfix related to the --version option.
1.10.3
- New option -v/--version to output version information. - Explicitly specify input encoding as UTF-8.