###############################################################
cuTENSOR: A High-Performance CUDA Library For Tensor Primitives
###############################################################
`cuTENSOR <https://developer.nvidia.com/cutensor>`_ is a high-performance CUDA library for tensor primitives.
Key Features
============
* Extensive mixed-precision support:
* FP64 inputs with FP32 compute.
* FP32 inputs with FP16, BF16, or TF32 compute.
* Complex-times-real operations.
* Conjugate (without transpose) support.
* Support for up to 64-dimensional tensors.
* Arbitrary data layouts.
* Trivially serializable data structures.
* Main computational routines:
* Direct (i.e., transpose-free) tensor contractions.
* Support just-in-time compilation of dedicated kernels.
* Tensor reductions (including partial reductions).
* Element-wise tensor operations:
* Support for various activation functions.
* Support for padding of the output tensor
* Arbitrary tensor permutations.
* Conversion between different data types.
Documentation
=============
Please refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.
Installation
============
The cuTENSOR wheel can be installed as follows:
.. code-block:: bash
pip install cutensor-cuXX
where XX is the CUDA major version (currently CUDA 11 & 12 are supported).
The package ``cutensor`` (without the ``-cuXX`` suffix) is deprecated. If you have
``cutensor`` installed, please remove it prior to installing ``cutensor-cuXX``.
Raw data
{
"_id": null,
"home_page": "https://developer.nvidia.com/cutensor",
"name": "cutensor-cu11",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "cuda, nvidia, machine learning, tensor network, high-performance computing",
"author": "NVIDIA Corporation",
"author_email": "cuda_installer@nvidia.com",
"download_url": null,
"platform": null,
"description": "###############################################################\ncuTENSOR: A High-Performance CUDA Library For Tensor Primitives\n###############################################################\n\n`cuTENSOR <https://developer.nvidia.com/cutensor>`_ is a high-performance CUDA library for tensor primitives.\n\nKey Features\n============\n\n* Extensive mixed-precision support:\n\n * FP64 inputs with FP32 compute.\n * FP32 inputs with FP16, BF16, or TF32 compute.\n * Complex-times-real operations.\n * Conjugate (without transpose) support.\n\n* Support for up to 64-dimensional tensors.\n* Arbitrary data layouts.\n* Trivially serializable data structures.\n* Main computational routines:\n\n * Direct (i.e., transpose-free) tensor contractions.\n\n * Support just-in-time compilation of dedicated kernels.\n\n * Tensor reductions (including partial reductions).\n * Element-wise tensor operations:\n\n * Support for various activation functions.\n * Support for padding of the output tensor\n * Arbitrary tensor permutations.\n * Conversion between different data types.\n\nDocumentation\n=============\n\nPlease refer to https://docs.nvidia.com/cuda/cutensor/index.html for the cuTENSOR documentation.\n\nInstallation\n============\n\nThe cuTENSOR wheel can be installed as follows:\n\n.. code-block:: bash\n\n pip install cutensor-cuXX\n\nwhere XX is the CUDA major version (currently CUDA 11 & 12 are supported).\nThe package ``cutensor`` (without the ``-cuXX`` suffix) is deprecated. If you have\n``cutensor`` installed, please remove it prior to installing ``cutensor-cuXX``.\n",
"bugtrack_url": null,
"license": "NVIDIA Proprietary Software",
"summary": "NVIDIA cuTENSOR",
"version": "2.0.2",
"project_urls": {
"Homepage": "https://developer.nvidia.com/cutensor"
},
"split_keywords": [
"cuda",
" nvidia",
" machine learning",
" tensor network",
" high-performance computing"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "3193d8ee8ac22b83e004c6d1f8e16a6f50834ffb300f4db032dac257e77e8ba8",
"md5": "f063f5299c4da6ccba3fa60aa8f4a2fa",
"sha256": "e17003e5f5cf0e83292e9e7e380b64c87a311f8096b3a287a630cbab743ef52f"
},
"downloads": -1,
"filename": "cutensor_cu11-2.0.2-py3-none-manylinux2014_aarch64.whl",
"has_sig": false,
"md5_digest": "f063f5299c4da6ccba3fa60aa8f4a2fa",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 148066837,
"upload_time": "2024-07-09T23:42:30",
"upload_time_iso_8601": "2024-07-09T23:42:30.568008Z",
"url": "https://files.pythonhosted.org/packages/31/93/d8ee8ac22b83e004c6d1f8e16a6f50834ffb300f4db032dac257e77e8ba8/cutensor_cu11-2.0.2-py3-none-manylinux2014_aarch64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "3d005eb39fbd12ecfe727f15749337ecda5585977ae9d969c2f7c69a12f55649",
"md5": "594dd2e6bb48303b91df94281603a172",
"sha256": "6d37a1164cb02d74322b35b09f018ce51aff078dedee10823820b9d878ebb8c3"
},
"downloads": -1,
"filename": "cutensor_cu11-2.0.2-py3-none-manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "594dd2e6bb48303b91df94281603a172",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 148843815,
"upload_time": "2024-07-09T23:42:39",
"upload_time_iso_8601": "2024-07-09T23:42:39.836064Z",
"url": "https://files.pythonhosted.org/packages/3d/00/5eb39fbd12ecfe727f15749337ecda5585977ae9d969c2f7c69a12f55649/cutensor_cu11-2.0.2-py3-none-manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "f0b928977c56495b4847b98fb0348c70fff0fd74c3e7535407c6eb0cbae28a26",
"md5": "97fdf0934f6eb00e93bfd34937040a15",
"sha256": "4576723d94b81bdc733e1cdb30808551ed1ddeb7d0440df58f56b2555d639f02"
},
"downloads": -1,
"filename": "cutensor_cu11-2.0.2-py3-none-win_amd64.whl",
"has_sig": false,
"md5_digest": "97fdf0934f6eb00e93bfd34937040a15",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 143398925,
"upload_time": "2024-07-09T23:42:47",
"upload_time_iso_8601": "2024-07-09T23:42:47.873779Z",
"url": "https://files.pythonhosted.org/packages/f0/b9/28977c56495b4847b98fb0348c70fff0fd74c3e7535407c6eb0cbae28a26/cutensor_cu11-2.0.2-py3-none-win_amd64.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-07-09 23:42:30",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "cutensor-cu11"
}