###############################################################
cuTENSOR: A High-Performance CUDA Library For Tensor Primitives
###############################################################
`cuTENSOR <https://developer.nvidia.com/cutensor>`_ is a high-performance CUDA library for tensor primitives.
Key Features
============
* Extensive mixed-precision support:
* FP64 inputs with FP32 compute.
* FP32 inputs with FP16, BF16, or TF32 compute.
* Complex-times-real operations.
* Conjugate (without transpose) support.
* Support for up to 64-dimensional tensors.
* Arbitrary data layouts.
* Trivially serializable data structures.
* Main computational routines:
* Direct (i.e., transpose-free) tensor contractions.
* Support just-in-time compilation of dedicated kernels.
* Tensor reductions (including partial reductions).
* Element-wise tensor operations:
* Support for various activation functions.
* Support for padding of the output tensor
* Arbitrary tensor permutations.
* Conversion between different data types.
Documentation
=============
Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.
Installation
============
The cuTENSOR wheel can be installed as follows:
.. code-block:: bash
pip install cutensor-cuXX
where XX is the CUDA major version (currently CUDA 11 & 12 are supported).
The package ``cutensor`` (without the ``-cuXX`` suffix) is deprecated. If you have
``cutensor`` installed, please remove it prior to installing ``cutensor-cuXX``.
Raw data
{
"_id": null,
"home_page": "https://developer.nvidia.com/cutensor",
"name": "cutensor-cu12",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "cuda,nvidia,machine learning,tensor network,high-performance computing",
"author": "NVIDIA Corporation",
"author_email": "cuda_installer@nvidia.com",
"download_url": "",
"platform": null,
"description": "###############################################################\ncuTENSOR: A High-Performance CUDA Library For Tensor Primitives\n###############################################################\n\n`cuTENSOR <https://developer.nvidia.com/cutensor>`_ is a high-performance CUDA library for tensor primitives.\n\nKey Features\n============\n\n* Extensive mixed-precision support:\n\n * FP64 inputs with FP32 compute.\n * FP32 inputs with FP16, BF16, or TF32 compute.\n * Complex-times-real operations.\n * Conjugate (without transpose) support.\n\n* Support for up to 64-dimensional tensors.\n* Arbitrary data layouts.\n* Trivially serializable data structures.\n* Main computational routines:\n\n * Direct (i.e., transpose-free) tensor contractions.\n\n * Support just-in-time compilation of dedicated kernels.\n\n * Tensor reductions (including partial reductions).\n * Element-wise tensor operations:\n\n * Support for various activation functions.\n * Support for padding of the output tensor\n * Arbitrary tensor permutations.\n * Conversion between different data types.\n\nDocumentation\n=============\n\nPlease refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.\n\nInstallation\n============\n\nThe cuTENSOR wheel can be installed as follows:\n\n.. code-block:: bash\n\n pip install cutensor-cuXX\n\nwhere XX is the CUDA major version (currently CUDA 11 & 12 are supported).\nThe package ``cutensor`` (without the ``-cuXX`` suffix) is deprecated. If you have\n``cutensor`` installed, please remove it prior to installing ``cutensor-cuXX``.\n",
"bugtrack_url": null,
"license": "NVIDIA Proprietary Software",
"summary": "NVIDIA cuTENSOR",
"version": "2.0.1",
"project_urls": {
"Homepage": "https://developer.nvidia.com/cutensor"
},
"split_keywords": [
"cuda",
"nvidia",
"machine learning",
"tensor network",
"high-performance computing"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "447333bf1dfddf31ae8419d2af48537fcca8ac172e355f6a0e9be9b282c39e42",
"md5": "2f632b11b0f1f71d7a86b200fdcce9d9",
"sha256": "eceef4c91e4fd9d95bd4859de300074534cc1997c21b132446fc4be771f3e4fd"
},
"downloads": -1,
"filename": "cutensor_cu12-2.0.1-py3-none-manylinux2014_aarch64.whl",
"has_sig": false,
"md5_digest": "2f632b11b0f1f71d7a86b200fdcce9d9",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 145387295,
"upload_time": "2024-02-08T18:53:56",
"upload_time_iso_8601": "2024-02-08T18:53:56.038953Z",
"url": "https://files.pythonhosted.org/packages/44/73/33bf1dfddf31ae8419d2af48537fcca8ac172e355f6a0e9be9b282c39e42/cutensor_cu12-2.0.1-py3-none-manylinux2014_aarch64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "d1c5e5a0616154e03f72ed2e641cdea479bf246fb2e5fc66967c70d1c1493dd2",
"md5": "b1160e3261ff3117de7ee785eeb2705e",
"sha256": "c4345d99b3dba3cef0b44199b094e40c24f4cc14ae8f259addb9288d1cba5023"
},
"downloads": -1,
"filename": "cutensor_cu12-2.0.1-py3-none-manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "b1160e3261ff3117de7ee785eeb2705e",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 152020905,
"upload_time": "2024-02-08T18:54:53",
"upload_time_iso_8601": "2024-02-08T18:54:53.218077Z",
"url": "https://files.pythonhosted.org/packages/d1/c5/e5a0616154e03f72ed2e641cdea479bf246fb2e5fc66967c70d1c1493dd2/cutensor_cu12-2.0.1-py3-none-manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "66691351b43555c26c42d799228ffe5a2f2350b098e724ee9a6f8cce7d78aff7",
"md5": "93ae5963383bcabd5bc4d6d153ce7d49",
"sha256": "b5ae232d4c4a422a6c17864a13ef5e99a1134b056dcaf9ad9e38afead8791e0b"
},
"downloads": -1,
"filename": "cutensor_cu12-2.0.1-py3-none-win_amd64.whl",
"has_sig": false,
"md5_digest": "93ae5963383bcabd5bc4d6d153ce7d49",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 139401271,
"upload_time": "2024-02-08T18:52:30",
"upload_time_iso_8601": "2024-02-08T18:52:30.683121Z",
"url": "https://files.pythonhosted.org/packages/66/69/1351b43555c26c42d799228ffe5a2f2350b098e724ee9a6f8cce7d78aff7/cutensor_cu12-2.0.1-py3-none-win_amd64.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-02-08 18:53:56",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "cutensor-cu12"
}