Trl

Latest version: v0.8.6

Page 2 of 6

0.8.0

New Contributors
* nautsimon made their first contribution in https://github.com/huggingface/trl/pull/1333
* fancyerii made their first contribution in https://github.com/huggingface/trl/pull/1353
* samuki made their first contribution in https://github.com/huggingface/trl/pull/1363
* yuanwu2017 made their first contribution in https://github.com/huggingface/trl/pull/1366
* kawine made their first contribution in https://github.com/huggingface/trl/pull/1380
* skavulya made their first contribution in https://github.com/huggingface/trl/pull/1391
* pengwei715 made their first contribution in https://github.com/huggingface/trl/pull/1439

**Full Changelog**: https://github.com/huggingface/trl/compare/v0.7.11...v0.8.0

0.7.11

New Contributors
* otlaitil made their first contribution in https://github.com/huggingface/trl/pull/1269
* JohnGiorgi made their first contribution in https://github.com/huggingface/trl/pull/1255
* ouhenio made their first contribution in https://github.com/huggingface/trl/pull/1280
* imraviagrawal made their first contribution in https://github.com/huggingface/trl/pull/1286
* akx made their first contribution in https://github.com/huggingface/trl/pull/1297
* esceptico made their first contribution in https://github.com/huggingface/trl/pull/1307
* johnowhitaker made their first contribution in https://github.com/huggingface/trl/pull/1308
* elhusseiniali made their first contribution in https://github.com/huggingface/trl/pull/1312
* maliozer made their first contribution in https://github.com/huggingface/trl/pull/1313
* j-cb made their first contribution in https://github.com/huggingface/trl/pull/1321
* R-seny made their first contribution in https://github.com/huggingface/trl/pull/1325
* markstur made their first contribution in https://github.com/huggingface/trl/pull/1315

**Full Changelog**: https://github.com/huggingface/trl/compare/v0.7.10...v0.7.11

0.7.10

New Contributors
* yuta0x89 made their first contribution in https://github.com/huggingface/trl/pull/1204
* danielhanchen made their first contribution in https://github.com/huggingface/trl/pull/1213
* zspo made their first contribution in https://github.com/huggingface/trl/pull/1214
* philschmid made their first contribution in https://github.com/huggingface/trl/pull/1208
* kykim0 made their first contribution in https://github.com/huggingface/trl/pull/1226
* AjayP13 made their first contribution in https://github.com/huggingface/trl/pull/1160
* yumemio made their first contribution in https://github.com/huggingface/trl/pull/1152

**Full Changelog**: https://github.com/huggingface/trl/compare/v0.7.9...v0.7.10

0.7.9

**Full Changelog**: https://github.com/huggingface/trl/compare/v0.7.8...v0.7.9

0.7.8

Unsloth tag for `xxxTrainer`

If you use the Unsloth library, the `unsloth` tag is automatically pushed to the Hub.

* [`xxxTrainer`] Add unsloth tag by younesbelkada in https://github.com/huggingface/trl/pull/1130
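
The tagging behavior can be pictured roughly as follows. This is a minimal sketch, not TRL's actual implementation: `collect_hub_tags`, `ToyConfig`, and the `unsloth_version` marker attribute are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ToyConfig:
    # Hypothetical marker: assume a patched model records a version string here.
    unsloth_version: Optional[str] = None


def collect_hub_tags(config: ToyConfig, base_tags: List[str]) -> List[str]:
    """Return the tags to push, adding "unsloth" when the marker is set."""
    tags = list(base_tags)  # copy so the caller's list is not mutated
    if config.unsloth_version is not None and "unsloth" not in tags:
        tags.append("unsloth")
    return tags


plain_tags = collect_hub_tags(ToyConfig(), ["trl", "sft"])
patched_tags = collect_hub_tags(ToyConfig(unsloth_version="2024.1"), ["trl", "sft"])
print(plain_tags)    # ['trl', 'sft']
print(patched_tags)  # ['trl', 'sft', 'unsloth']
```

The point of the pattern is that the user never passes the tag explicitly; it is derived from the state of the model at push time.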

DPO fixes

Some important fixes for DPO have been introduced to address https://twitter.com/jon_durbin/status/1743575483365699809 and to make DPO faster.

* Allow separate devices for target/ref models. by jondurbin in https://github.com/huggingface/trl/pull/1190
* Allow swapping PEFT adapters for target/ref model. by jondurbin in https://github.com/huggingface/trl/pull/1193
* Change device access order for speedup of calculating metrics in DPOTrainer by brcps12 in https://github.com/huggingface/trl/pull/1154
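
The idea behind adapter swapping is to keep a single copy of the base weights and switch between two named adapters, one for the policy being trained and one for the frozen reference, rather than loading a second full model. The toy sketch below illustrates only that pattern, with no real model involved; `ToyAdapterModel`, `use_adapter`, and the adapter names `"train"` and `"reference"` are hypothetical and are not TRL or PEFT APIs.

```python
from contextlib import contextmanager
from typing import Dict, Iterator


class ToyAdapterModel:
    """Stand-in for a base model with several named adapters (hypothetical)."""

    def __init__(self, base_weight: float, adapters: Dict[str, float]) -> None:
        self.base_weight = base_weight
        self.adapters = adapters
        self.active = next(iter(adapters))  # first adapter is active by default

    def set_adapter(self, name: str) -> None:
        if name not in self.adapters:
            raise KeyError(f"unknown adapter: {name}")
        self.active = name

    def score(self, x: float) -> float:
        # Shared base weight plus the active adapter's delta.
        return (self.base_weight + self.adapters[self.active]) * x


@contextmanager
def use_adapter(model: ToyAdapterModel, name: str) -> Iterator[None]:
    """Temporarily activate `name`, restoring the previous adapter afterwards."""
    previous = model.active
    model.set_adapter(name)
    try:
        yield
    finally:
        model.set_adapter(previous)


model = ToyAdapterModel(base_weight=1.0, adapters={"train": 0.5, "reference": 0.0})

policy_score = model.score(2.0)  # uses the "train" adapter
with use_adapter(model, "reference"):
    reference_score = model.score(2.0)  # same base weights, reference adapter

print(policy_score, reference_score, model.active)
```

Restoring the previous adapter in a `finally` clause matters here: if the reference pass raises, training should not silently continue with the reference adapter active.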

DDPO + PEFT

DDPO now supports PEFT.

* add: support for `peft` in ddpo. by sayakpaul in https://github.com/huggingface/trl/pull/1165

Other fixes

* add peft_module_casting_to_bf16 in DPOTrainer by sywangyi in https://github.com/huggingface/trl/pull/1143
* SFT Tokenizer Fix by ChrisCates in https://github.com/huggingface/trl/pull/1142
* Minor fixes to some comments in some examples. by mattholl in https://github.com/huggingface/trl/pull/1156
* Correct shapes in docstring of PPOTrainer's train_minibatch method by nikihowe in https://github.com/huggingface/trl/pull/1170
* Update sft_trainer.py by Hemanthkumar2112 in https://github.com/huggingface/trl/pull/1162
* Fix batch all gather by vwxyzjn in https://github.com/huggingface/trl/pull/1177
* Address issue 1122 by maneandrea in https://github.com/huggingface/trl/pull/1174
* Fix misleading variable "epoch" from the training loop from PPOTrainer Doc. by Jfhseh in https://github.com/huggingface/trl/pull/1171
* SFTTrainer: follow args.remove_unused_columns by mgerstgrasser in https://github.com/huggingface/trl/pull/1188
* Handle last token from generation prompt by pablovicente in https://github.com/huggingface/trl/pull/1153

New Contributors
* ChrisCates made their first contribution in https://github.com/huggingface/trl/pull/1142
* brcps12 made their first contribution in https://github.com/huggingface/trl/pull/1154
* mattholl made their first contribution in https://github.com/huggingface/trl/pull/1156
* sayakpaul made their first contribution in https://github.com/huggingface/trl/pull/1165
* nikihowe made their first contribution in https://github.com/huggingface/trl/pull/1170
* Hemanthkumar2112 made their first contribution in https://github.com/huggingface/trl/pull/1162
* maneandrea made their first contribution in https://github.com/huggingface/trl/pull/1174
* Jfhseh made their first contribution in https://github.com/huggingface/trl/pull/1171
* mgerstgrasser made their first contribution in https://github.com/huggingface/trl/pull/1188
* pablovicente made their first contribution in https://github.com/huggingface/trl/pull/1153
* jondurbin made their first contribution in https://github.com/huggingface/trl/pull/1190

**Full Changelog**: https://github.com/huggingface/trl/compare/v0.7.7...v0.7.8

0.7.7

A fix has been introduced for a breaking change in `PPOTrainer.push_to_hub()` and `DDPOTrainer.push_to_hub()`.

* [`PPOTrainer` / `DDPOTrainer`] Fix ppo & ddpo push to Hub by younesbelkada in https://github.com/huggingface/trl/pull/1141

What's Changed
