Sagemaker-training

Latest version: v4.7.4

Safety actively analyzes 630566 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 8 of 22

3.9.3

Breaking Changes

* Added `py38`, Removed `py36` and `py27` support

Bug Fixes and Other Changes

* Use asyncio to read stdout and stderr streams in realtime
* Fix delayed logging issues
* Convey user informative message if process gets OOM Killed
* Filter out stderr to look for error messages and report
* Report Exit code on training job failures
* Prepend tags to MPI logs to enable easy filtering in CloudWatch
* All the changes are from PR 108

Documentation Changes

* Update SM doc urls
* Update Amazon Licensing
Testing and Release Infrastructure

* Install libssl1.1 and openssl packages in Dockerfiles
* Added `asyncio` package
* Updated tests to use `asyncio` package

3.9.2

Bug Fixes and Other Changes

* Reverted -x FI_EFA_USE_DEVICE_RDMA=1 to fix a crash on PyTorch Dataloaders for Distributed training

3.9.1

Bug Fixes and Other Changes

* [smdataparallel] better messages to establish the SSH connection between workers

3.9.0

Features

* smdataparallel enable EFA RDMA flag

3.8.0

Features

* smdataparallel custom mpi options support

3.7.5

Page 8 of 22

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.