Gensim

Latest version: v4.3.2

Safety actively analyzes 629723 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 9 of 15

0.11.0

* added "topic ranking" to sort topics by coherence in LdaModel (jtmcmc, 311)
* new fast ShardedCorpus out-of-core corpus (Jan Hajic jr., 284)
* utils.smart_open now uses the smart_open package (316)
* new wrapper for LDA in Vowpal Wabbit (Dave Challis, 304)
* improvements to the DtmModel wrapper (Yang Han, 272, 277)
* move wrappers for external modeling programs into a submodule (Christopher Corley, 295)
* allow transparent compression of NumPy files in save/load (Christopher Corley, 248)
* save/load methods now accept file handles, in addition to file names (macks22, 292)
* fixes to LdaMulticore on Windows (Feng Mai, 305)
* lots of small fixes & py3k compatibility improvements (Chyi-Kwei Yau, Daniel Nouri, Timothy Emerick, Juarez Bochi, Christopher Corley, Chirag Nagpal, Jan Hajic jr., Flávio Codeço Coelho)
* re-released as 0.11.1 and 0.11.1-1 because of a packaging bug

0.10.3

* added streamed phrases = collocation detection (Miguel Cabrera, 258)
* added param for multiple word2vec epochs (sebastienj, 243)
* added doc2vec (=paragraph2vec = extension of word2vec) model (Timothy Emerick, 231)
* initialize word2vec deterministically, for increased experiment reproducibility (KCzar, 240)
* all indexed corpora now allow full Python slicing syntax (Christopher Corley, 246)
* update distributed code for new Pyro4 API and py3k (Michael Brooks, Marco Bonzanini, 255, 249)
* fixes to six module version (Lars Buitinck, 259)
* fixes to setup.py (Maxim Avanov and Christopher Corley, 260, 251)
* ...and lots of minor fixes & updates all around

0.10.2

* new parallelized, LdaMulticore implementation (Jan Zikes, 232)
* Dynamic Topic Models (DTM) wrapper (Arttii, 205)
* word2vec compiled from bundled C file at install time: no more pyximport (233)
* standardize show_/print_topics in LdaMallet (Benjamin Bray, 223)
* add new word2vec multiplicative objective (3CosMul) of Levy & Goldberg (Gordon Mohr, 224)
* preserve case in MALLET wrapper (mcburton, 222)
* support for matrix-valued topic/word prior eta in LdaModel (mjwillson, 208)
* py3k fix to SparseCorpus (Andreas Madsen, 234)
* fix to LowCorpus when switching dictionaries (Christopher Corley, 237)

0.10.1

* word2vec: new n_similarity method for comparing two sets of words (François Scharffe, 219)
* make LDA print/show topics parameters consistent with LSI (Bram Vandekerckhove, 201)
* add option for efficient word2vec subsampling (Gordon Mohr, 206)
* fix length calculation for corpora on empty files (Christopher Corley, 209)
* improve file cleanup of unit tests (Christopher Corley)
* more unit tests
* unicode now stored everywhere in gensim internally; accepted input stays either utf8 or unicode
* various fixes to the py3k ported code
* allow any dict-like input in Dictionary.from_corpus (Andreas Madsen)
* error checking improvements to the MALLET wrapper
* ignore non-articles during wiki parsig
* utils.lemmatize now (optionally) ignores stopwords

0.10.0

* full Python 3 support (targeting 3.3+, 196)
* all internal methods now expect & store unicode, instead of utf8
* new optimized word2vec functionality: negative sampling, cbow (sebastien-j, 162)
* allow by-frequency sort in Dictionary.save_as_text (Renaud Richardet, 192)
* add topic printing to HDP model (Tiepes, 190)
* new gensim_addons package = optional install-time Cython compilations (Björn Esser, 197)
* added py3.3 and 3.4 to Travis CI tests
* fix a cbow word2vec bug (Liang-Chi Hsieh)

0.9.1

* MmCorpus fix for Windows
* LdaMallet support for printing/showing topics
* fix LdaMallet bug when user specified a file prefix (Victor, 184)
* fix LdaMallet output when input is single vector (Suvir)
* added LdaMallet unit tests
* more py3k fixes (Lars Buitinck)
* change order of LDA topic printing (Fayimora Femi-Balogun, 188)

Page 9 of 15

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.