Subword-nmt

Latest version: v0.3.8

Page 1 of 2

0.3.8

- multiprocessing support (get_vocab and apply_bpe)
- progress bar for learn_bpe
- seed parameter for deterministic BPE dropout
- ignore some unicode line separators which would crash subword-nmt
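The last item refers to characters such as U+2028 (LINE SEPARATOR). The sketch below is not taken from subword-nmt. It only illustrates, with hypothetical helper names, why such characters are awkward for line-oriented tools: Python's `str.splitlines()` treats them as line boundaries, while splitting on "\n" alone does not, so the two approaches can disagree on how many lines a corpus contains.

```python
# Hypothetical helpers for illustration; not part of subword-nmt.

def lines_unicode(text):
    # str.splitlines() breaks on "\n" and also on U+2028, U+2029, U+0085, etc.
    return text.splitlines()

def lines_newline_only(text):
    # Break on "\n" only; other Unicode separators stay inside the line.
    return text.rstrip("\n").split("\n")

corpus = "first sentence\u2028still first\nsecond sentence\n"
unicode_lines = lines_unicode(corpus)       # three entries
newline_lines = lines_newline_only(corpus)  # two entries
```

In a parallel corpus, a mismatch like this on one side would shift every following line out of alignment with its translation.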

0.3.7

- BPE dropout (Provilkov et al., 2019)
- more efficient glossaries (https://github.com/rsennrich/subword-nmt/pull/69)
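For readers unfamiliar with BPE dropout, here is a minimal, simplified sketch of the idea: while applying the learned merges, each candidate merge is skipped with probability `dropout`, so the same word can be segmented differently on different passes. The function name, the toy merge table, and the omission of end-of-word markers are all assumptions made for illustration; this is not subword-nmt's implementation. Passing a seeded `random.Random` makes the result repeatable, which is the purpose of the seed parameter mentioned under 0.3.8.

```python
import random

def bpe_segment(word, merges, dropout=0.0, rng=None):
    """Segment `word` with ranked `merges`, skipping each candidate
    merge with probability `dropout` (hypothetical helper)."""
    rng = rng or random.Random()
    ranks = {pair: i for i, pair in enumerate(merges)}
    symbols = list(word)
    while len(symbols) > 1:
        # Collect applicable merges, randomly dropping some of them.
        candidates = [
            (ranks[pair], pair)
            for pair in zip(symbols, symbols[1:])
            if pair in ranks and (dropout == 0.0 or rng.random() >= dropout)
        ]
        if not candidates:
            break
        _, best = min(candidates)  # highest-priority surviving merge
        # Merge every occurrence of the chosen pair, left to right.
        merged, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                merged.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                merged.append(symbols[i])
                i += 1
        symbols = merged
    return symbols

# Toy merge table, ordered from highest to lowest priority.
merges = [("l", "o"), ("lo", "w"), ("e", "r"), ("low", "er")]
full = bpe_segment("lower", merges)                # no dropout
chars = bpe_segment("lower", merges, dropout=1.0)  # every merge dropped
a = bpe_segment("lower", merges, dropout=0.5, rng=random.Random(13))
b = bpe_segment("lower", merges, dropout=0.5, rng=random.Random(13))
```

With `dropout=0.0` the word is merged all the way to a single symbol; with `dropout=1.0` it stays split into characters; values in between give a random segmentation that is identical across runs when the same seed is used.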

0.3.6

- fix to subword-bpe command encoding

0.3.5

- fix to subword-bpe command under Python 2
- wider support of --total-symbols argument

0.3.4

- segment_tokens method to improve library usability (https://github.com/rsennrich/subword-nmt/pull/52)
- support regex glossaries (https://github.com/rsennrich/subword-nmt/pull/56)
- allow unicode separators (https://github.com/rsennrich/subword-nmt/pull/57)
- new option --total-symbols in learn-bpe (commit 61ad8)
- fix documentation (best practices) (https://github.com/rsennrich/subword-nmt/pull/60)
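The regex glossary item can be pictured as follows: substrings matching any glossary pattern are kept whole, and only the remaining text is passed to the segmenter. The sketch below is a rough illustration under that assumption. The function names are hypothetical, and `list` stands in for a real BPE segmenter; none of this is subword-nmt's own code.

```python
import re

def split_protected(token, patterns):
    """Split `token` into (piece, is_protected) pairs, where protected
    pieces match one of the glossary regexes (hypothetical helper)."""
    combined = re.compile("|".join("(?:%s)" % p for p in patterns))
    pieces, pos = [], 0
    for match in combined.finditer(token):
        if match.start() == match.end():
            continue  # ignore empty matches
        if match.start() > pos:
            pieces.append((token[pos:match.start()], False))
        pieces.append((match.group(0), True))
        pos = match.end()
    if pos < len(token):
        pieces.append((token[pos:], False))
    return pieces

def segment_with_glossary(token, patterns, segmenter):
    """Apply `segmenter` only to the unprotected pieces of `token`."""
    out = []
    for piece, protected in split_protected(token, patterns):
        out.extend([piece] if protected else segmenter(piece))
    return out

# Example glossary: runs of digits and the literal string "USA".
patterns = [r"\d+", "USA"]
# `list` splits a string into characters, standing in for BPE here.
result = segment_with_glossary("USA2024election", patterns, list)
```

Here "USA" and "2024" survive intact, while "election" is handed to the stand-in segmenter.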

0.3

- library is now installable via pip
- fix occasional problems with UTF-8 whitespace and new lines in learn_bpe and apply_bpe:
  - do not silently convert UTF-8 newline characters into "\n"
  - do not silently convert UTF-8 whitespace characters into " "
  - UTF-8 whitespace and newline characters are now considered part of a word, and segmented by BPE
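To make the whitespace change concrete, the sketch below contrasts two tokenization strategies in plain Python. It is an illustration only, with hypothetical function names, and does not reproduce subword-nmt's code. Splitting on any Unicode whitespace silently turns a no-break space (U+00A0) into a token boundary, whereas splitting on the ASCII space alone keeps it inside the word, where BPE can then segment it like any other character.

```python
# Hypothetical helpers for illustration; not part of subword-nmt.

def tokens_any_whitespace(line):
    # str.split() with no argument splits on all Unicode whitespace,
    # including U+00A0 (no-break space).
    return line.split()

def tokens_ascii_space(line):
    # Strip the line ending, then split on the ASCII space only.
    return line.strip("\r\n ").split(" ")

line = "New\u00a0York is big\n"
loose = tokens_any_whitespace(line)   # no-break space becomes a boundary
strict = tokens_ascii_space(line)     # no-break space stays in the word
```

Joining `strict` with single spaces reproduces the original line content exactly, which is not true of `loose`.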
