Jieba

Latest version: v0.39

Safety actively analyzes 621803 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 1 of 5

0.39

1. del_word支持强行拆开词语; by gumblex,fxsjy
2. 修复百分数的切词; by fxsjy
3. 修复HMM=False在多进程模式下的bug; by huntzhan

0.38

1. 通过pkg_resources载入默认词典,支持在Spark等平台上运行, by gumblex;
2. 扩充识别的汉字unicode范围:[\u4E00-\u9FD5], by gumblex;
3. 关键词提取支持返回词性,修复posseg分词得到的pair做dict关键字的问题,by jerryday;
4. 修复load_userdict加载用户词典不能识别含有空格等特殊字符的问题, by gumblex;
5. 命令行分词支持返回词性, by gumblex;

0.37

1. 代码重构,分词器封装为Class,支持实例化,by gumblex (https://github.com/fxsjy/jieba/commit/94840a734c32cfece05c0c3ec236ffc3d36b4ae6)
2. 修复cut_for_search的bug,完善posseg; by gumblex
3. 修复posseg在0.36中引入的一处bug; by wangbin
4. 修复load_userdict异常处理的bug; by gip0
5. 修复生成词典二进制cache文件时跨文件系统的bug, 支持自定义; by gumblex

0.36

1. 代码同时兼容python2与python3, 若干性能优化; by gumblex
2. 解决用户添加词的概率自动计算问题,分词更加准确;by gumblex
3. 可自定义cache_file的文件系统路径; by changyy
4. TextRank算法实现完善; by sing1ee,walkskyer

0.35.1

1. 修复 Python 3.2 的兼容性问题

0.35

1. 改进词典cache的dump和加载机制;by gumblex
2. 提升关键词提取的性能; by gumblex
3. 关键词提取新增基于textrank算法的子模块; by singlee
4. 修复自定义stopwords功能的bug; by walkskyer

Page 1 of 5

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.