###############################################################
cuTENSOR: A High-Performance CUDA Library For Tensor Primitives
###############################################################
`cuTENSOR <https://developer.nvidia.com/cutensor>`_ is a high-performance CUDA library for tensor primitives.
Key Features
============
* Extensive mixed-precision support:
* FP64 inputs with FP32 compute.
* FP32 inputs with FP16, BF16, or TF32 compute.
* Complex-times-real operations.
* Conjugate (without transpose) support.
* Support for up to 64-dimensional tensors.
* Arbitrary data layouts.
* Trivially serializable data structures.
* Main computational routines:
* Direct (i.e., transpose-free) tensor contractions.
* Support just-in-time compilation of dedicated kernels.
* Tensor reductions (including partial reductions).
* Element-wise tensor operations:
* Support for various activation functions.
* Support for padding of the output tensor
* Arbitrary tensor permutations.
* Conversion between different data types.
Documentation
=============
Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.
Installation
============
The cuTENSOR wheel can be installed as follows:
.. code-block:: bash
pip install cutensor-cuXX
where XX is the CUDA major version (currently CUDA 11 & 12 are supported).
The package ``cutensor`` (without the ``-cuXX`` suffix) is deprecated. If you have
``cutensor`` installed, please remove it prior to installing ``cutensor-cuXX``.
Raw data
{
"_id": null,
"home_page": "https://developer.nvidia.com/cutensor",
"name": "cutensor-cu12",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "cuda, nvidia, machine learning, tensor network, high-performance computing",
"author": "NVIDIA Corporation",
"author_email": "cuda_installer@nvidia.com",
"download_url": null,
"platform": null,
"description": "###############################################################\ncuTENSOR: A High-Performance CUDA Library For Tensor Primitives\n###############################################################\n\n`cuTENSOR <https://developer.nvidia.com/cutensor>`_ is a high-performance CUDA library for tensor primitives.\n\nKey Features\n============\n\n* Extensive mixed-precision support:\n\n * FP64 inputs with FP32 compute.\n * FP32 inputs with FP16, BF16, or TF32 compute.\n * Complex-times-real operations.\n * Conjugate (without transpose) support.\n\n* Support for up to 64-dimensional tensors.\n* Arbitrary data layouts.\n* Trivially serializable data structures.\n* Main computational routines:\n\n * Direct (i.e., transpose-free) tensor contractions.\n\n * Support just-in-time compilation of dedicated kernels.\n\n * Tensor reductions (including partial reductions).\n * Element-wise tensor operations:\n\n * Support for various activation functions.\n * Support for padding of the output tensor\n * Arbitrary tensor permutations.\n * Conversion between different data types.\n\nDocumentation\n=============\n\nPlease refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.\n\nInstallation\n============\n\nThe cuTENSOR wheel can be installed as follows:\n\n.. code-block:: bash\n\n pip install cutensor-cuXX\n\nwhere XX is the CUDA major version (currently CUDA 11 & 12 are supported).\nThe package ``cutensor`` (without the ``-cuXX`` suffix) is deprecated. If you have\n``cutensor`` installed, please remove it prior to installing ``cutensor-cuXX``.\n",
"bugtrack_url": null,
"license": "NVIDIA Proprietary Software",
"summary": "NVIDIA cuTENSOR",
"version": "2.0.2",
"project_urls": {
"Homepage": "https://developer.nvidia.com/cutensor"
},
"split_keywords": [
"cuda",
" nvidia",
" machine learning",
" tensor network",
" high-performance computing"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "f751786c275bc675e3f5d8d207c378652bfbd4c4103174ce857f1a04ff194211",
"md5": "6fb2971ae31c6dbb75a284618de6355f",
"sha256": "1db559bdfe4345ac19ee66ab7ee49a54e98b1529fc96de812ade3dbc0a90ef47"
},
"downloads": -1,
"filename": "cutensor_cu12-2.0.2-py3-none-manylinux2014_aarch64.whl",
"has_sig": false,
"md5_digest": "6fb2971ae31c6dbb75a284618de6355f",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 149947131,
"upload_time": "2024-07-09T23:42:57",
"upload_time_iso_8601": "2024-07-09T23:42:57.124092Z",
"url": "https://files.pythonhosted.org/packages/f7/51/786c275bc675e3f5d8d207c378652bfbd4c4103174ce857f1a04ff194211/cutensor_cu12-2.0.2-py3-none-manylinux2014_aarch64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "edd661fc3511bc9e4cdb423b69964e3d344090b4093cbf9d3c8cc469ef4642d0",
"md5": "1cc1e67fe05b55aae6f604f5518efc44",
"sha256": "18c96a4f1e8a559eec626527f5928d5f5b575f6c2b9c45e87309a025ae682334"
},
"downloads": -1,
"filename": "cutensor_cu12-2.0.2-py3-none-manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "1cc1e67fe05b55aae6f604f5518efc44",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 156915608,
"upload_time": "2024-07-09T23:43:06",
"upload_time_iso_8601": "2024-07-09T23:43:06.782819Z",
"url": "https://files.pythonhosted.org/packages/ed/d6/61fc3511bc9e4cdb423b69964e3d344090b4093cbf9d3c8cc469ef4642d0/cutensor_cu12-2.0.2-py3-none-manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "08a13fb72bd0593dc4e451d5e6f81c43562b38622a24d68642ff9bda8df35ac0",
"md5": "f413c9a16db6dc129c90c44beeb47ee4",
"sha256": "e2ae37dc9e4a1643dee9318ffdbd212097660e69826328953830cead567fd543"
},
"downloads": -1,
"filename": "cutensor_cu12-2.0.2-py3-none-win_amd64.whl",
"has_sig": false,
"md5_digest": "f413c9a16db6dc129c90c44beeb47ee4",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 144971092,
"upload_time": "2024-07-09T23:43:16",
"upload_time_iso_8601": "2024-07-09T23:43:16.514270Z",
"url": "https://files.pythonhosted.org/packages/08/a1/3fb72bd0593dc4e451d5e6f81c43562b38622a24d68642ff9bda8df35ac0/cutensor_cu12-2.0.2-py3-none-win_amd64.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-07-09 23:42:57",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "cutensor-cu12"
}