Cupy

Latest version: v13.1.0


Page 6 of 25

11.0.0a2

Not secure

This is the release note of v11.0.0a2. See [here](https://github.com/cupy/cupy/pulls?q=is%3Apr+is%3Aclosed+milestone%3Av11.0.0a2) for the complete list of solved issues and merged PRs.

We are running a [Gitter chat](https://gitter.im/cupy/community) for general discussions and quick questions. Feel free to join the channel to talk with developers and users!

Highlights

Improved NumPy functions coverage (6078)

A series of NumPy routines has been proposed as good-first-issue tasks, and as a result a growing number of contributors have sent pull requests that increase the number of available APIs. The routines implemented so far are tracked in issue 6078.

Initial support for `cupy.typing` (6251)

An API equivalent to [`numpy.typing`](https://numpy.org/devdocs/reference/typing.html) has been added so that type annotations can be introduced in CuPy itself and in user code.
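As a rough illustration of what such annotations can look like, the sketch below annotates a small helper with `ArrayLike` and `NDArray`. It assumes that `cupy.typing` exposes these two names in the same way `numpy.typing` does; the helper `normalize` is hypothetical, and the NumPy fallback is only there so that the snippet also runs on a machine without a usable GPU.

```py
try:
    import cupy as xp
    from cupy.typing import ArrayLike, NDArray  # assumed to mirror numpy.typing
    xp.zeros(1)  # probe: raises if CuPy cannot reach a GPU
except Exception:
    # CPU fallback so the sketch stays runnable without CuPy or a GPU
    import numpy as xp
    from numpy.typing import ArrayLike, NDArray


def normalize(values: ArrayLike) -> NDArray:
    """Hypothetical helper: scale the input so that its entries sum to 1."""
    arr = xp.asarray(values, dtype=xp.float64)
    return arr / arr.sum()


weights = normalize([1, 1, 2])
print(weights.tolist())  # [0.25, 0.25, 0.5]
```

A static checker such as `mypy` can then flag calls that pass something that is not array-like; at runtime the annotations have no effect.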

Support for CUDA 11.6 (6349)

Initial support for CUDA 11.6 has been added as of this release. However, binary wheels are not yet distributed, so users are expected to build CuPy from source in the meantime.

Support for ROCm 5.0 (6466)

Initial support for ROCm 5.0 has been added as of this release. However, binary wheels are not yet distributed, so users are expected to build CuPy from source in the meantime.

Changes without compatibility

Drop support for ROCm 4.0 (6420)

CuPy v11 will drop support for ROCm 4.0. We recommend that users move to ROCm 4.2/4.3 instead.

Changes

New Features

- Add `cupy.isneginf` and `cupy.isposinf` (6089)
- Add `cupy.typing` (6251)
- Add `asarray_chkfinite` API. (6275)
- Add Box-Cox transformations to `cupyx.scipy.special` (6302)
- Use CUDA's `log1p` for `cupyx.scipy.special.log1p` (6315)
- Add special functions from the CUDA Math API (6317)
- Add `beta` functions to `cupyx.scipy.special` (6318)
- Add `cupy.union1d` API. (6357)
- Add `cupy.float_power` (6371)
- Add `cupy.intersect1d` API. (6402)
- Add `cupy.setdiff1d` API. (6433)
- Add `cupy.format_float_scientific` API (6474)
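
A few of the routines listed above can be tried as follows. This is a minimal sketch that assumes the CuPy functions take the same arguments as their NumPy counterparts; it falls back to NumPy when CuPy or a GPU is not available, so the printed values are the same either way.

```py
try:
    import cupy as xp
    xp.zeros(1)  # probe: raises if CuPy cannot reach a GPU
except Exception:
    import numpy as xp  # CPU fallback so the sketch stays runnable

a = xp.asarray([1, 2, 3])
b = xp.asarray([2, 3, 4])

# Set routines return sorted, unique 1-D arrays.
union = xp.union1d(a, b).tolist()
common = xp.intersect1d(a, b).tolist()
only_a = xp.setdiff1d(a, b).tolist()
print(union, common, only_a)  # [1, 2, 3, 4] [2, 3] [1]

# Element-wise tests for negative and positive infinity.
x = xp.asarray([-xp.inf, 0.0, xp.inf])
neg = xp.isneginf(x).tolist()
pos = xp.isposinf(x).tolist()
print(neg, pos)  # [True, False, False] [False, False, True]
```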

Enhancements

- First step of `mypy` introduction (4955)
- Fix CI failure to support SciPy 1.8.0 (6249)
- Implement `overwrite_input` in `cupy.{percentile,quantile}` (6298)
- Avoid DeprecationWarning from SciPy 1.8 (`cupyx.scipy.sparse`) (6321)
- Support NumPy 1.22 (6323)
- Remove batched QR solver's experimental mark (6327)
- Make scipy.special ufuncs work with CuPy inputs (6341)
- Fix thrust related build issue with CUDA 11.6 (6346)
- Support CUDA 11.6 (6349)
- Fix CI failure to support SciPy 1.8.0 (6362)
- Fix type annotations in installer (6382)
- Add `__cupy_get_ndarray__` dunder method to transform objects to arrays (6414)
- Bump Jitify version to fix memory leak (6430)
- Support cuSPARSELt 0.2.0 (repost) (6436)
- Support ROCm 5.0 (6466)
- Warn if device count detection unexpectedly fails in `cupy.show_config()` (6472)
- Fix verbose LOBPCG for SciPy 1.8 (6388)

Performance Improvements

- Reduce memory usage in `cupy.sort` (6392)

Bug Fixes

- Fix JIT to support notebook environment (6329)
- Fix `cupyx.ndimage.spline_filter1d` for HIP (6406)
- Fix `cupy.nan_to_num` (6408)
- Fix `cupyx.special.gammainc`, `lpmv` and `sph_harm` for HIP (6409)
- Fix boolean views for HIP (6412)
- Fix reduction contiguous size calculation (6457)

Code Fixes

- Remove global `use_hip` flag in setup (6391)
- Hide private names in `cupyx.scipy.linalg` (6449)
- Hide private names in `cupyx.scipy.ndimage` (6450)
- Hide private names in `cupyx.scipy.signal` (6451)
- Hide private names in `cupyx.scipy.sparse` (6454)
- Hide private names in `cupyx.scipy.stats` (6456)

Documentation

- Use `cupy.__version__` instead of `pkg_resources` (6332)
- Tentatively pin intersphinx to SciPy 1.7.1 docs (6440)
- Revert "Tentatively pin intersphinx to SciPy 1.7.1 docs" (6479)

Installation

- Avoid monkeypatching distutils (6273)
- Eliminate unnecessary configuration pass in setup (6389)
- Remove `CUPY_SETUP_ENABLE_THRUST=0` environment variable (6390)
- Drop support for ROCm 4.0 (6420)
- Bump version to v11.0.0a2 (6501)

Tests

- CI: allow discarding docker image cache manually (6269)
- Add slow tests for stable branch (6340)
- Parameterize library installer tests (6343)
- Fix tests for `eigh()` for CUDA 11.6 (6347)
- Avoid empty notification message for scheduled tests (6363)
- Support SciPy 1.8 (6365)
- Add `cupy.testing.installed` (6381)
- Mark XFAIL for SciPy 1.8 release candidate (6385)
- CI: Bump ROCm version from 4.3 to 4.3.1 (6415)
- CI: build docs in parallel (6416)
- CI: Add HEAD tests for stable branch (6423)
- CI: Use default schema/matrix path in `generate.py` (6424)
- Skip hfft related tests in HIP (6427)
- CI: Manage test tags in yaml (6429)
- CI: coverage in reST (6445)
- CI: fix NCCL 2.10 unit test not covered (6448)
- CI: Fix CUDA 11.6 driver update steps (6467)
- Ignore warnings from Optuna 3.0 pre-releases (6470)
- Fix failing tests in ROCm (6482)

Others

- CI: allow specifying special `skip` tag (6468)

Contributors

The CuPy Team would like to thank all those who contributed to this release!

amanchhaparia anaruse asi1024 emcastillo grlee77 IvanYashchuk khushi-411 kmaehashi pri1311 saswatpp takagi

11.0.0a1

Not secure

This is the release note of v11.0.0a1. See [here](https://github.com/cupy/cupy/pulls?q=is%3Apr+is%3Aclosed+milestone%3Av11.0.0a1) for the complete list of solved issues and merged PRs.

We are running a [Gitter chat](https://gitter.im/cupy/community) for general discussions and quick questions. Feel free to join the channel to talk with developers and users!

Highlights

Improved NumPy functions coverage (6078)

A series of NumPy routines has been proposed as good-first-issue tasks, and as a result a growing number of contributors have sent pull requests that increase the number of available APIs. The routines implemented so far are tracked in issue 6078.

Add `cupyx.scipy.special` functions (5687)

Spherical harmonics, Legendre and Gamma functions are implemented using dedicated, high-performance CUDA kernels. Thanks to grlee77!

Initial support for CUDA Graph API by means of stream capture API (4567)

This PR adds the ability to use the CUDA Graph API to greatly reduce kernel launch overhead. It works through the stream capture API; an example follows.
Thanks to leofang!

```py
import cupy as cp
import numpy as np  # needed for the dtype below

a = cp.random.randint(0, 10, 100, dtype=np.int32)
s = cp.cuda.Stream(non_blocking=True)

with s:
    s.begin_capture()
    a += 3
    a = cp.abs(a)
    g = s.end_capture()  # work is queued, but not yet launched
    g.launch()
    s.synchronize()
```

Support `__device__` function in CuPy JIT (6265)

The new interface `cupyx.jit.rawkernel(device=True)` defines a CUDA device function.

```py
from cupyx import jit

@jit.rawkernel(device=True)
def getitem(x, tid):
    return x[tid]

@jit.rawkernel()
def elementwise_copy(x, y):
    tid = jit.threadIdx.x + jit.blockDim.x * jit.blockIdx.x
    y[tid] = getitem(x, tid)
```

The following CUDA code is generated from the above Python code.

```cpp
__device__ int getitem_1(CArray<int, 1, true, true> x, unsigned int tid) {
  return x[tid];
}
extern "C" __global__ void elementwise_copy(CArray<int, 1, true, true> x, CArray<int, 1, true, true> y) {
  unsigned int tid;
  tid = (threadIdx.x + (blockDim.x * blockIdx.x));
  y[tid] = getitem_1(x, tid);
}
```

Changes

New Features
- Support stream capture (4567)
- Add additional special functions (spherical harmonics, Legendre, Gamma functions) (5687)
- Add `cupy.asfarray` (6085)
- Add `cupy.trapz` (6107)
- Add `cupy.array_api.linalg` (6131)
- Add `cupy.mask_indices` (6156)
- Add `cupy.array_equiv` API. (6254)
- Add `cupy.cublas.syrk` and `cupy.cublas.sbmv` (6278)
- Add `cupy.vander` API. (6279)
- Add `cupy.ediff1d` API. (6280)
- Add `cupy.fabs` API. (6282)
- Add discrete cosine and sine transforms to `cupyx.scipy.fft` (6288)
- Add `logit`, `expit` and `log_expit` to `cupyx.scipy.special` (6300)
- Add `xlogy` and `xlog1py` to `cupyx.scipy.special` (6301)
- Add `tril_indices` and `tril_indices_from` API. (6305)
- Add `cupy.format_float_positional` (6308)
- Add `cupy.row_stack` API. (6312)
- Add `triu_indices` and `triu_indices_from` API. (6316)

Enhancements
- Raise better message when importing CPU array via DLPack (6051)
- Borrow more non-GPU APIs from NumPy (6074)
- Add more aliases for compatibility with NumPy (6075)
- Import more dtype aliases from NumPy (6076)
- Borrow indexing APIs from NumPy (6077)
- Apply upstream patch to `cupy.array_api` (6086)
- Compile cub/thrust with no unique symbol (6106)
- Support cuDNN 8.3.0 (6108)
- Support all advanced indexing (6127)
- Support CUDA 11.5.1 (6166)
- Support lambda function in `cupy.vectorize` (6170)
- Support eigenvalue solver 64bit API (6178)
- Support cuTENSOR 1.4.0 (6187)
- Make `matmul` support ufunc kwargs (6195)
- Alias NumPy error classes (6212)
- Support comparison to `None` and `Ellipsis` (6222)
- JIT: Fix if expr typing rule (6234)
- Support comparison with more objects (6250)
- JIT: Support `__device__` function (6265)
- More clear warning message (6283)
- Make streams hashable (6285)
- Check isinstance before comparison in `__eq__` (6287)
- Support cuDNN 8.3.2 (6314)
- Deprecate MachAr (support NumPy 1.22) (6188)
- Fix `cupy.linalg.qr` to align with NumPy 1.22 (6225)
- Change a parameter name in `percentile` and `quantile` to support NumPy 1.22 (6228)

Performance Improvements
- Avoid 64bit division to reduce register consumption (6019)
- Remove memory copy in matmul (6179)

Bug Fixes
- Detect repeated axis in reduction (5964)
- Fix `__all__` in `cupyx.scipy.fft` (6071)
- Fix `__getitem__` on Ellipsis and advanced indexing dimension (6081)
- Allow leading unit dimensions in copy source (6118)
- Always test broadcast in `copyto` (6121)
- Fix overloading ambiguity in ndimage filters (6162)
- Fix empty Cholesky (6164)
- Fix empty `solve` (6167)
- Allow `flip` on `()`-shaped arrays (6169)
- Handle infinities of the same sign in `logaddexp` and `logaddexp2` (6172)
- Fix 4675 on resolving TODO in 4198 (6197)
- Eigenvalue solver 64bit API on CUDA 11.1 (6201)
- Fix edge case compatibility in `cupy.eye()` (6208)
- Fix `linalg.eigh` and `linalg.eigvalsh` on empty inputs (6210)
- Fix overlapping `out` in `matmul` and `(tensor)dot` (6216)
- Fix `compile_with_cache` returning None (6232)
- Fixing index calculation for random constructor (6257)
- BUG: Fix the .T attribute in the `array_api` namespace (6289)
- Fix stream capture in ROCm (6296)
- Fix cuDNN installer not working (6337)

Code Fixes
- Remove `__all__` from `cupyx/scipy/*` (6149)
- Delete `from os import path` (6152)
- Remove legacy `cp.linalg.solve()` implementation (6161)

Documentation
- Add link to compatibility matrix (6055)
- Update upgrade guide (6058)
- Add v11 to compatibility matrix (6067)
- Exclude `kernel_version` from comparison table (6072)
- Doc: Add more footnotes to comparison table (6073)
- Add polynomial modules to comparison table (6082)
- Add CITATION.bib and update README (6091)
- Remove LLVM_PATH note on document (6093)
- Docs: Update linkcode implementation (6126)
- Update footnotes in comparison table (6142)
- Update conda-forge installation guide (6186)
- Revise Overview for CuPy v10 (6209)
- Docs: CentOS installation from source (6218)
- Fix `cupy.trapz` docstring (6239)
- Fix `eigsh` doc (6266)
- Add `cupy.positive` in API Reference (6274)

Installation
- Replace `distutils` with `setuptools` in Windows `cl.exe` detection (6025)
- Fix for cuDNN directory structure in Windows (6342)

Tests
- Fix `testing.multi_gpu` to add pytest marker (6015)
- CI: add link to ROCm projects in CI coverage matrix (6037)
- CI: use separate project for multi-GPU tests (6050)
- Fix CI result notification message format (6066)
- Fix CI cannot override cuSPARSELt/cuTENSOR version preinstalled (6084)
- Workaround DeprecationWarning raised from pkg_resources (6094)
- Fix missing `multi_gpu` annotation in tests (6098)
- Fix exception handling in cupyx.distributed (6114)
- Improve FlexCI test scripts (6117)
- CI: Add timeout to show_config (6120)
- Trigger FlexCI from GitHub Actions (6130)
- CI: Fix package override sometimes fails in CentOS (6141)
- CI: Need to update CUDA driver in cuda115.multi (6144)
- Add tests for `convolve2d` (6171)
- CI: Update limits to reduce cache size (6174)
- CI: Fix unquoted specifiers (6175)
- Support pre-release NumPy version in tests (6190)
- Remove XFAIL for XPASS tests on ROCm (6259)
- Tentatively pin to `setuptools<60` in Windows CI (6260)
- Fix cache key for github actions (6281)
- Use NVIDIA docker images for CUDA 11.5 (6303)
- Tentatively pin to CUDA Driver 495 (6310)
- Remove unused dtype parameterizing in `tril_indices` test (6322)
- Use `get_include` instead of `array_equiv` for fallback test (6333)
- CI: Add `cuda-slow` test in FlexCI (6335)
- CI: use CUDA docker images for CUDA Python CI (6336)

Others
- Add doc issue template (6294)
- Bump version to v11.0.0a1 (6344)

Contributors

The CuPy Team would like to thank all those who contributed to this release!

akochepasov amanchhaparia asi1024 ColmTalbot emcastillo eternalphane grlee77 haesleinhuepf khushi-411 kmaehashi leofang okuta ptim0626 SauravMaheshkar shwina takagi thomasjpfan tom24d toslunar twmht WiseroOrb Yutaro-Sanada

10.6.0

Not secure

This is the release note of v10.6.0. See [here](https://github.com/cupy/cupy/pulls?q=is%3Apr+is%3Aclosed+milestone%3Av10.6.0) for the complete list of solved issues and merged PRs.

**This is the last planned release for the CuPy v10 series. We are going to release v11.0.0 on July 28th. Please start testing your workload with the v11 release candidate (`pip install --pre cupy-cuda11x -f https://pip.cupy.dev/pre`). See the [Upgrade Guide](https://docs.cupy.dev/en/latest/upgrade.html#cupy-v11) for the list of possible breaking changes in v11.**

We are running a [Gitter chat](https://gitter.im/cupy/community) for general discussions and quick questions. Feel free to join the channel to talk with developers and users!

Highlights

Support CUDA 11.7 (6767)

Full support for CUDA 11.7 has been added as of this release. Binary packages can be installed with the following command: `pip install cupy-cuda117`

Changes without compatibility

Changes

Enhancements

- Improve warning message in sparse (6675)
- Support CUDA 11.7 (6794)
- Make the warning for `cupy.array_api` say "cupy" instead of "numpy" (6795)
- cupy-wheel: Use NVRTC to infer the toolkit version (6831)

Bug Fixes

- Fix `cupy.median` for NaN inputs (6760)
- Fix batched matmul for integral numbers (6777)

Documentation

- Add CUDA 11.7 on documents (6801)

Tests

- Fix Dockerfile broken for array-api tests (6518)
- Skip `ndimage.filter` tests for ROCm 4.0 (6676)
- Xfail a test of LOBPCG on ROCm 5.0+ (6733)
- CI: Fix prep script to show build failure details (6784)
- Fix a potential variable misuse bug (6788)
- Fix CI Docker image build failing in head test (6808)

Contributors

The CuPy Team would like to thank all those who contributed to this release!

asi1024 asmeurer emcastillo kmaehashi LostBenjamin takagi

10.5.0

Not secure

This is the release note of v10.5.0. See [here](https://github.com/cupy/cupy/pulls?q=is%3Apr+is%3Aclosed+milestone%3Av10.5.0) for the complete list of solved issues and merged PRs.

We are running a [Gitter chat](https://gitter.im/cupy/community) for general discussions and quick questions. Feel free to join the channel to talk with developers and users!

**Update (2022-06-17)**: Wheels for CUDA 11.5 Arm SBSA are now available in the Assets section below. (6705)

Changes

Enhancements
- Fix compilation warning caused by `ifdef` (6740)
- Support cuDNN 8.4 (6741)

Bug Fixes
- Fix memory leak in the FFT plan cache during multi-threading (6732)
- Fix `ifdef` for ROCm >= 4.2 (6751)

Documentation
- Minor improvement on the array API docs (6714)
- Document the returned benchmark object (6742)

Tests
- CI: Update repo for libcudnn7 in cuda10.2 (6709)
- Pin mypy version in setup.py (6711)
- Follow `scipy==1.8.1` sparse dot bugfix (6728)
- Support testing CUDA 11.6+ in FlexCI (6737)
- Fix GPG key issue in FlexCI base image (6743)

Contributors

The CuPy Team would like to thank all those who contributed to this release!

asi1024 emcastillo kmaehashi leofang takagi

10.4.0

Not secure

This is the release note of v10.4.0. See [here](https://github.com/cupy/cupy/pulls?q=is%3Apr+is%3Aclosed+milestone%3Av10.4.0) for the complete list of solved issues and merged PRs.

We are running a [Gitter chat](https://gitter.im/cupy/community) for general discussions and quick questions. Feel free to join the channel to talk with developers and users!

Announcements

Introduction of generic `cupy-wheel` (EXPERIMENTAL) (6012)

We have added a new package on PyPI called `cupy-wheel`. This meta package lets other libraries declare a dependency on CuPy and have the CuPy binary wheel that matches the user's environment installed transparently. Users can also install CuPy through this package instead of manually specifying a CUDA/ROCm version.

pip install cupy-wheel

This package is only available for stable releases, as the current pre-release wheels are not hosted on PyPI.

This feature is currently experimental and subject to change, so we recommend that users not distribute packages relying on it for now. Suggestions and comments are very welcome (please visit 6688).

Changes

Enhancements
- Add missing `cudaDevAttrMemoryPoolsSupported` to hip (6626)
- Add CC 3.2 to Tegra arch list (6647)
- Add a few driver/runtime/nvrtc API wrappers (6651)

Bug Fixes
- Define `float16::operator-()` only for ROCm 5.0+ (6629)
- JIT: fix access to cached codes (6642)
- [v10] Fix Mempool attr for CUDA Python (6654)
- Fix int64 overflow in `cupy.polyval` (6666)

Documentation
- Documentation update for ROCm 5.0 (6607)
- Add `--pre` option to instructions installing pre-releases (6614)
- Fix typo in performance guide (6659)
- JIT: fix function signatures in the docs (6660)

Installation
- Add universal CuPy package (6683)

Tests
- Remove `jenkins` requirements (6634)
- CI: Trigger FlexCI for hotfix branches (6636)
- Fix `TestIncludesCompileCUDA` for HEAD tests (6650)
- Trigger CUDA Python tests with `/test mini` (6655)
- Fix missing f prefix on f-strings fix (6679)

Contributors

The CuPy Team would like to thank all those who contributed to this release!

asi1024 code-review-doctor danielg1111 emcastillo kmaehashi leofang takagi

10.3.1

Not secure

This is the release note of v10.3.1. See [here](https://github.com/cupy/cupy/pulls?q=is%3Apr+is%3Aclosed+milestone%3Av10.3.1) for the complete list of solved issues and merged PRs.

This is a hotfix release for v10.3.0, which contained a regression that prevented CuPy from working on older CUDA GPUs (Maxwell or earlier).

Changes

Bug Fixes

- Define float16::operator-() only for ROCm 5.0+ (6630)

Installation

- Bump version to v10.3.1 (6633)

Contributors

The CuPy Team would like to thank all those who contributed to this release!

kmaehashi takagi
