Nltk

Latest version: v3.8.1

2.0b8

Not secure

NLTK:
* fixed copyright and license statements
* removed PyYAML, and added it as a dependency in the installers and download instructions
* updates to LogicParser, DRT (Dan Garrette)
* WordNet similarity metrics return None instead of -1 when they fail to find a path (Steve Bethard)
* shortest_path_distance uses instance hypernyms (Jordan Boyd-Graber)
* clean_html improved (Bjorn Maeland)
* batch_parse, batch_interpret and batch_evaluate functions allow grammar or grammar filename as argument
* more Portuguese examples (portuguese_en.doctest, examples/pt.py)
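
The change to the WordNet similarity metrics matters for callers that used to compare the result against -1. The following is a minimal sketch of a defensive wrapper, not NLTK code: the helper name `normalize_similarity` and its `default` parameter are hypothetical, and the block does not import NLTK. It only illustrates treating both the old and the new "no path found" conventions the same way.

```python
from typing import Optional


def normalize_similarity(score: Optional[float], default: float = 0.0) -> float:
    """Map a 'no path found' result to a default value.

    Hypothetical helper, not part of NLTK. It treats both the newer
    convention (None) and the older convention (-1) as 'no path'.
    """
    if score is None or score == -1:
        return default
    return score
```

With NLTK and the WordNet data installed, the argument would typically be the result of a call such as `synset_a.path_similarity(synset_b)`.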

NLTK-Contrib:
* Aligner implementations (Christopher Crowner, Torsten Marek)
* ScriptTranscriber package (Richard Sproat and Kristy Hollingshead)

Book:
* updates for second printing, correcting errata: https://nltk.googlecode.com/svn/trunk/nltk/doc/book/errata.txt

Data:
* added Europarl sample, with 10 docs for each of 11 langs (Nitin Madnani)
* added SMULTRON sample corpus (Torsten Marek, Martin Volk)

2.0b7

Not secure

NLTK:
* minor bugfixes and enhancements: data loader, inference package, FreqDist, Punkt
* added Portuguese example module, similar to nltk.book for English (examples/pt.py)
* added all_lemma_names() method to WordNet corpus reader
* added update() and __add__() extensions to FreqDist (enhances alignment with Python 3.0 counters)
* reimplemented clean_html
* added test-suite runner for automatic/manual regression testing
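
The counter-style behavior that the update() and __add__() extensions align with can be illustrated with the standard library's `collections.Counter`. This is a sketch using `Counter` as a stand-in; it does not import FreqDist, and it is an assumption that FreqDist in this release behaves the same way in every detail.

```python
from collections import Counter

# Count tokens from a first batch of text.
counts = Counter("the cat sat on the mat".split())

# update() adds counts from more samples in place.
counts.update("the dog".split())

# "+" (implemented by __add__) returns a new counter with summed counts.
more = Counter(["cat", "cat"])
combined = counts + more
```

Note that `+` leaves both operands unchanged, while `update()` modifies the counter it is called on.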

NLTK-Data:
* updated Punkt models for sentence segmentation
* added corpus of the works of Machado de Assis (Brazilian Portuguese)

Book:
* Added translation of preface into Portuguese, contributed by Tiago Tresoldi.

2.0b6

Not secure

NLTK:
* minor fixes for Python 2.4 compatibility
* added words() method to XML corpus reader
* minor bugfixes and code clean-ups
* fixed downloader to put data in %APPDATA% on Windows
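
As a rough illustration of the %APPDATA% fix, the sketch below chooses a data directory from the platform name and the environment. The function name `pick_data_dir`, the `nltk_data` folder name, and the home-directory fallback are assumptions made for this example; the downloader's actual logic in this release may differ.

```python
import os


def pick_data_dir(platform: str, environ: dict) -> str:
    """Return a directory for downloaded data (hypothetical helper)."""
    # On Windows, prefer the per-user application data folder when it is set.
    if platform.startswith("win") and "APPDATA" in environ:
        return os.path.join(environ["APPDATA"], "nltk_data")
    # Elsewhere (assumed fallback), use a folder in the user's home directory.
    return os.path.join(os.path.expanduser("~"), "nltk_data")
```

A caller would typically pass `sys.platform` and `os.environ` as the two arguments.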

Data:
* Updated Punkt models
* Fixed utf8 encoding issues with UDHR and Stopwords Corpora
* Renamed CoNLL "cat" files to "esp" (different language)
* Added Alvey NLT feature-based grammar
* Added Polish PL196x corpus

2.0b5

Not secure

NLTK:
* minor bugfixes (including FreqDist, Python eggs)
* added reader for Europarl Corpora (contributed by Nitin Madnani)
* added reader for IPI PAN Polish Corpus (contributed by Konrad Goluchowski)
* fixed data.py so that it doesn't generate a warning for Windows Python 2.6

NLTK-Contrib:
* updated Praat reader (contributed by Margaret Mitchell)

2.0b4

Not secure

NLTK:
* switched to Apache License, Version 2.0
* minor bugfixes in semantics and inference packages
* support for Python eggs
* fixed stale regression tests

Data:
* added NomBank 1.0
* uppercased feature names in some grammars

2.0b3

NLTK:
* several bugfixes
* added nombank corpus reader (Paul Bedaride)
