aiacc-nccl


Nameaiacc-nccl JSON
Version 2.0.0 PyPI version JSON
download
home_pagehttps://www.aliyun.com
SummaryAIACC-NCCL is an AI-Accelerator communication framework for NVIDIA-NCCL. It implements optimized all-reduce, all-gather, reduce, broadcast, reduce-scatter, all-to-all,as well as any send/receive based communication pattern.It has been optimized to achieve high bandwidth on aliyun machines using PCIe, NVLink, NVswitch,as well as networking using InfiniBand Verbs, eRDMA or TCP/IP sockets.
upload_time2023-09-08 02:20:32
maintainer
docs_urlNone
authorAlibaba Cloud
requires_python
licenseCopyright (C) Alibaba Group Holding Limited
keywords distributed deep learning communication nccl aiacc
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # AIACC-NCCL

Optimized primitives for inter-GPU communication on aliyun machines.

## Introduction

AIACC-NCCL is an AI-Accelerator communication framework for NVIDIA-NCCL.
It implements optimized all-reduce, all-gather, reduce, broadcast, reduce-scatter, all-to-all, as well as any send/receive based communication pattern.
It has been optimized to achieve high bandwidth on aliyun machines using PCIe, NVLink, NVswitch, as well as networking using InfiniBand Verbs, eRDMA or TCP/IP sockets.

## Install

To install AIACC NCCL on the system, create a package then install it as root as follow two methods:

- method1: rpm/deb (Recommended)
```sh
# Centos:
wget http://mirrors.aliyun.com/aiacc/aiacc-nccl/aiacc_nccl-1.0.rpm
rpm -i aiacc-nccl-1.0.rpm
# Ubuntu:
wget http://mirrors.aliyun.com/aiacc/aiacc-nccl/aiacc_nccl-1.0.deb
dpkg -i aiacc-nccl-1.0.deb
```
- method2: python-offline
```sh
wget http://mirrors.aliyun.com/aiacc/aiacc-nccl/aiacc_nccl-2.0.0.tar.gz
pip install aiacc_nccl-2.0.0.tar.gz
# notes: must download and then pip install, cannot merge in oneline `pip install aiacc_xxx_url` 
# Both method1 and method2 can run concurrently.
```

- method3: python-pypi
```sh
pip install aiacc_nccl==2.0
```

After install aiacc-nccl package, you need do nothing to change code!


## Environment

* ***AIACC_FASTTUNING***: Enable Fasttuning for LLMs, default=1 is to enable.
* ***NCCL_AIACC_ALLREDUCE_DISABLE***: Disable allreduce algo, default=0 is to enable.
* ***NCCL_AIACC_ALLGATHER_DISABLE***: Disable allgather algo, default=0 is to enable.
* ***NCCL_AIACC_REDUCE_SCATTER_DISABLE***: Disable reduce_scatter algo, default=0 is to enable.
* ***AIACC_UPDATE_ALGO_DISABLE***: Disable update aiacc nccl algo from aiacc-sql-server, default=0 is to enable.

## Copyright

All source code and accompanying documentation is copyright (c) 2015-2020, NVIDIA CORPORATION. All rights reserved.
All modifications are copyright (c) 2020-2024, ALIYUN CORPORATION. All rights reserved.



            

Raw data

            {
    "_id": null,
    "home_page": "https://www.aliyun.com",
    "name": "aiacc-nccl",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "Distributed,Deep Learning,Communication,NCCL,AIACC",
    "author": "Alibaba Cloud",
    "author_email": "ziqi.yzq@alibaba-inc.com",
    "download_url": "https://files.pythonhosted.org/packages/c1/16/abb9e29ed3fb06b872994753ca8eb7bf2aedcfa077a7bafd99ec143148bf/aiacc_nccl-2.0.0.tar.gz",
    "platform": null,
    "description": "# AIACC-NCCL\n\nOptimized primitives for inter-GPU communication on aliyun machines.\n\n## Introduction\n\nAIACC-NCCL is an AI-Accelerator communication framework for NVIDIA-NCCL.\nIt implements optimized all-reduce, all-gather, reduce, broadcast, reduce-scatter, all-to-all, as well as any send/receive based communication pattern.\nIt has been optimized to achieve high bandwidth on aliyun machines using PCIe, NVLink, NVswitch, as well as networking using InfiniBand Verbs, eRDMA or TCP/IP sockets.\n\n## Install\n\nTo install AIACC NCCL on the system, create a package then install it as root as follow two methods:\n\n- method1: rpm/deb (Recommended)\n```sh\n# Centos:\nwget http://mirrors.aliyun.com/aiacc/aiacc-nccl/aiacc_nccl-1.0.rpm\nrpm -i aiacc-nccl-1.0.rpm\n# Ubuntu:\nwget http://mirrors.aliyun.com/aiacc/aiacc-nccl/aiacc_nccl-1.0.deb\ndpkg -i aiacc-nccl-1.0.deb\n```\n- method2: python-offline\n```sh\nwget http://mirrors.aliyun.com/aiacc/aiacc-nccl/aiacc_nccl-2.0.0.tar.gz\npip install aiacc_nccl-2.0.0.tar.gz\n# notes: must download and then pip install, cannot merge in oneline `pip install aiacc_xxx_url` \n# Both method1 and method2 can run concurrently.\n```\n\n- method3: python-pypi\n```sh\npip install aiacc_nccl==2.0\n```\n\nAfter install aiacc-nccl package, you need do nothing to change code!\n\n\n## Environment\n\n* ***AIACC_FASTTUNING***: Enable Fasttuning for LLMs, default=1 is to enable.\n* ***NCCL_AIACC_ALLREDUCE_DISABLE***: Disable allreduce algo, default=0 is to enable.\n* ***NCCL_AIACC_ALLGATHER_DISABLE***: Disable allgather algo, default=0 is to enable.\n* ***NCCL_AIACC_REDUCE_SCATTER_DISABLE***: Disable reduce_scatter algo, default=0 is to enable.\n* ***AIACC_UPDATE_ALGO_DISABLE***: Disable update aiacc nccl algo from aiacc-sql-server, default=0 is to enable.\n\n## Copyright\n\nAll source code and accompanying documentation is copyright (c) 2015-2020, NVIDIA CORPORATION. All rights reserved.\nAll modifications are copyright (c) 2020-2024, ALIYUN CORPORATION. All rights reserved.\n\n\n",
    "bugtrack_url": null,
    "license": "Copyright (C) Alibaba Group Holding Limited",
    "summary": "AIACC-NCCL is an AI-Accelerator communication framework for NVIDIA-NCCL. It implements optimized all-reduce, all-gather, reduce, broadcast, reduce-scatter, all-to-all,as well as any send/receive based communication pattern.It has been optimized to achieve high bandwidth on aliyun machines using PCIe, NVLink, NVswitch,as well as networking using InfiniBand Verbs, eRDMA or TCP/IP sockets.",
    "version": "2.0.0",
    "project_urls": {
        "Homepage": "https://www.aliyun.com"
    },
    "split_keywords": [
        "distributed",
        "deep learning",
        "communication",
        "nccl",
        "aiacc"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "c116abb9e29ed3fb06b872994753ca8eb7bf2aedcfa077a7bafd99ec143148bf",
                "md5": "7b128aee689b05a3d84db9137427234d",
                "sha256": "ad5ab83252ccb2d2694fdc914f76735f654be1fe62b249c945e49296ec3e0ded"
            },
            "downloads": -1,
            "filename": "aiacc_nccl-2.0.0.tar.gz",
            "has_sig": false,
            "md5_digest": "7b128aee689b05a3d84db9137427234d",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 88364750,
            "upload_time": "2023-09-08T02:20:32",
            "upload_time_iso_8601": "2023-09-08T02:20:32.150141Z",
            "url": "https://files.pythonhosted.org/packages/c1/16/abb9e29ed3fb06b872994753ca8eb7bf2aedcfa077a7bafd99ec143148bf/aiacc_nccl-2.0.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-09-08 02:20:32",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "aiacc-nccl"
}
        
Elapsed time: 0.13331s