Pythainlp

Latest version: v5.0.3

Safety actively analyzes 630052 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 10 of 21

2.2.0

English

Hello World. Today, we're happy to announce the availability of PyThaiNLP. It has been four years since PyThaiNLP's the first release. Thank you very much for supporting PyThaiNLP.

Summary – Release Highlights

New Features

Tokenizer

- Fix **longest** engine, last character is now consumed
- Add **CRFCut** sentence segmentation

Transliteration

- Add Thai Grapheme-to-Phoneme (Thai G2P) deep learning sequence-to-sequence model

Normalization

- Add more normalize functions, like remove zero-width characters, remove duplicate spaces, etc.

Utilities

- Add thaiword_to_date() and thaiword_to_time()
- Fix countthai() to handle a case where the text has only numbers and symbols

Command line

- Update command and sub-command syntax - see[ command line docs](https://github.com/PyThaiNLP/pythainlp/blob/dev/docs/notes/command_line.rst)

**Others**

- Code improvement: Move non-init code out of __init__.py files, etc.
- Remove dependency: Unigram POS tagger no longer need NLTK module



Installation

You can install or upgrade using *pip install -U pythainlp*





Change log: https://github.com/PyThaiNLP/pythainlp/issues/330

2.2.0beta1

This the first beta version of PyThaiNLP 2.2.

**Installation**

pip install --pre pythainlp

2.2.0dev1

**Dev version For developer only**

2.2.0dev0

**Dev version For developer only**

2.1.4

This release is a bug fix release.

- Remove NumPy and pandas requirements from base install (353)
- Fix longest matching bug (fail when the entire input text is a full word) (357)

2.1.3

This release is a bug fix release.

- **`numtoword`** number to thai word (350)


Installation

You can install or upgarde from *pip install -U pythainlp*


Change log: https://github.com/PyThaiNLP/pythainlp/issues/181

Page 10 of 21

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.