Cltk

Latest version: v1.3.0

Safety actively analyzes 628969 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 8 of 12

0.1.43

• 386 Port the Collatinus Latin decliner by PonteIneptique (Thibaut Clérice). (See more at https://github.com/cltk/cltk/issues/385.)
• 390 Small addition to Latin word tokenizer by diyclassics
• 383 Docs added for Greek Accentuation package
• 380 Docs for Grk alphabet by michaalbert
• 377 Added Sefaria corpus for dl
• 376 add docstrings by souravsingh
• 372 English exception words to Latin tokenizer by diyclassics
• 371 Fix to path for cltk_data by diyclassics
• 370 Speed improvement to Latin macronizer by TylerKirby
• 364 Update punjabi.rst by RatulGhosh
• 363 Punjabi docs by nimitbhardwaj
• 362 New Bengali corpus from RatulGhosh
• 356 remove most corpus downloads from test by kylepjohnson

0.1.42

Fixes from ryanfb 351 and TylerKirby 350

0.1.41

Behind-the-scenes changes for easy-to-use corpus reader by diyclassics (Patrick Burns), from PR 347

0.1.40

The release is for Tyler Kirby's ( TylerKirby ) Latin macronizer. Use example:

python

In [1]: from cltk.prosody.latin.macronizer import Macronizer

In [2]: macronizer = Macronizer('tag_ngram_123_backoff')

In [3]: text = 'Quo usque tandem, O Catilina, abutere nostra patientia?'

In [4]: macronizer.macronize_text(text)
Out[4]: 'quō usque tandem , ō catilīnā , abūtēre nostrā patientia ?

In [5]: macronizer.macronize_tags(text)
Out[5]: [('quo', 'd--------', 'quō'), ('usque', 'd--------', 'usque'), ('tandem', 'd--------', 'tandem'), (',', 'u--------', ','), ('o', 'e--------', 'ō'), ('catilina', 'n-s---mb-', 'catilīnā'), (',', 'u--------', ','), ('abutere', 'v2sfip---', 'abūtēre'), ('nostra', 'a-s---fb-', 'nostrā'), ('patientia', 'n-s---fn-', 'patientia'), ('?', None, '?')]

0.1.39

This PR allows users to define repositories/corpora that are not hosted at https://github.com/cltk. They can do so by putting a file at `~/cltk_data/distributed_corpora.yaml` with markup like so for each repo:

python
example_distributed_latin_corpus:
git_remote: gitgithub.com:kylepjohnson/latin_corpus_newton_example.git
language: latin
type: text

0.1.38

Add syllabifier and tokenizer for Indian languages

By soumyag213 who ported some code of anoopkunchukuttan's [indic_nlp_library](https://github.com/anoopkunchukuttan/indic_nlp_library).

https://github.com/cltk/cltk/issues/245

Thank you to both!!!

Page 8 of 12

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.