oracle-ads


- Name: oracle-ads
- Version: 2.12.7
- Summary: Oracle Accelerated Data Science SDK
- Upload time: 2024-11-20 19:19:02
- Author: Oracle Data Science
- Requires Python: >=3.8
- Keywords: oracle cloud infrastructure, oci, machine learning, ml, artificial intelligence, ai, data science, cloud, oracle
- Requirements: No requirements were recorded.

# Oracle Accelerated Data Science (ADS)

[![PyPI](https://img.shields.io/pypi/v/oracle-ads.svg?style=for-the-badge&logo=pypi&logoColor=white)](https://pypi.org/project/oracle-ads/) [![Python](https://img.shields.io/pypi/pyversions/oracle-ads.svg?style=for-the-badge&logo=pypi&logoColor=white)](https://pypi.org/project/oracle-ads/) [![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg?style=for-the-badge&logo=pypi&logoColor=white)](https://github.com/ambv/black)


The [Oracle Accelerated Data Science (ADS) SDK](https://accelerated-data-science.readthedocs.io/en/latest/index.html) is maintained by the Oracle Cloud Infrastructure (OCI) [Data Science service](https://docs.oracle.com/en-us/iaas/data-science/using/data-science.htm) team. It speeds up common data science activities by providing tools that automate and simplify routine tasks. It also gives data scientists a friendly, Pythonic interface to OCI services. Some of the more notable services are OCI Data Science, Model Catalog, Model Deployment, Jobs, ML Pipelines, Data Flow, Object Storage, Vault, Big Data Service, Data Catalog, and the Autonomous Database. ADS gives you an interface to manage the life cycle of machine learning models, from data acquisition to model evaluation, interpretation, and model deployment.

With ADS you can:

- Read datasets from Oracle Object Storage, Oracle RDBMS (ATP/ADW/On-prem), AWS S3, and other sources into pandas `DataFrame` objects.
- Tune models using hyperparameter optimization with the `ADSTuner` tool.
- Generate detailed evaluation reports of your model candidates with the `ADSEvaluator` module.
- Save machine learning models to the [OCI Data Science Model Catalog](https://docs.oracle.com/en-us/iaas/data-science/using/models-about.htm).
- Deploy models as HTTP endpoints with [Model Deployment](https://docs.oracle.com/en-us/iaas/data-science/using/model-dep-about.htm).
- Launch distributed ETL, data processing, and model training jobs in Spark with [OCI Data Flow](https://docs.oracle.com/en-us/iaas/data-flow/using/home.htm).
- Train machine learning models in OCI Data Science [Jobs](https://docs.oracle.com/en-us/iaas/data-science/using/jobs-about.htm).
- Define and run end-to-end machine learning orchestration that covers every step of the machine learning lifecycle as repeatable, continuous [ML Pipelines](https://accelerated-data-science.readthedocs.io/en/latest/user_guide/pipeline/overview.html#).
- Manage the life cycle of conda environments through the `ads conda` command line interface (CLI).

## Installation

You have various options when installing ADS.

### Installing the oracle-ads base package

```bash
  python3 -m pip install oracle-ads
```
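
After installing, one way to confirm which version is present is to query the package metadata from Python. The sketch below uses only the standard library's `importlib.metadata`; the helper name `installed_version` is made up for illustration and is not part of ADS.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str = "oracle-ads") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    print(found if found else "oracle-ads is not installed in this environment")
```

Returning `None` instead of raising keeps the check usable in setup scripts that decide whether to run `pip install` at all.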

### Installing OCI AI Operators

To use the AI Forecast Operator, install the "forecast" dependencies using the following command:

```bash
  python3 -m pip install 'oracle_ads[forecast]>=2.9.0'
```

### Installing extra libraries

To work with gradient boosting models, install the `boosted` module. This module includes XGBoost and LightGBM model classes.

```bash
  python3 -m pip install 'oracle-ads[boosted]'
```

For big data use cases with Oracle Big Data Service (BDS), install the `bds` module. It includes the `ibis-framework[impala]`, `hdfs[kerberos]`, and `sqlalchemy` libraries.

```bash
  python3 -m pip install 'oracle-ads[bds]'
```

To work with a broad set of data formats (for example, Excel and Avro), install the `data` module. It includes the `fastavro`, `openpyxl`, `pandavro`, `asteval`, `datefinder`, `htmllistparse`, and `sqlalchemy` libraries.

```bash
  python3 -m pip install 'oracle-ads[data]'
```

To work with geospatial data, install the `geo` module. It includes `geopandas` and the libraries from the `viz` module.

```bash
  python3 -m pip install 'oracle-ads[geo]'
```

Install the `notebook` module to use ADS within an OCI Data Science service [notebook session](https://docs.oracle.com/en-us/iaas/data-science/using/manage-notebook-sessions.htm). This module installs the `ipywidgets` and `ipython` libraries.

```bash
  python3 -m pip install 'oracle-ads[notebook]'
```

To work with ONNX-compatible runtimes and libraries designed to maximize performance and model portability, install the `onnx` module. It includes the `onnx`, `onnxruntime`, `onnxmltools`, `skl2onnx`, `xgboost`, and `lightgbm` libraries, plus the libraries from the `viz` module.

```bash
  python3 -m pip install 'oracle-ads[onnx]'
```

For infrastructure tasks, install the `opctl` module. It includes the `oci-cli`, `docker`, `conda-pack`, `nbconvert`, `nbformat`, and `inflection` libraries.

```bash
  python3 -m pip install 'oracle-ads[opctl]'
```

For hyperparameter optimization tasks, install the `optuna` module. It includes `optuna` and the libraries from the `viz` module.

```bash
  python3 -m pip install 'oracle-ads[optuna]'
```

Install the `tensorflow` module to include `tensorflow` and libraries from the `viz` module.

```bash
  python3 -m pip install 'oracle-ads[tensorflow]'
```

For text-related tasks, install the `text` module. It includes the `wordcloud` and `spacy` libraries.

```bash
  python3 -m pip install 'oracle-ads[text]'
```

Install the `torch` module to include `pytorch` and libraries from the `viz` module.

```bash
  python3 -m pip install 'oracle-ads[torch]'
```

Install the `viz` module to include libraries for visualization tasks. Key packages include `bokeh`, `folium`, `seaborn`, and related packages.

```bash
  python3 -m pip install 'oracle-ads[viz]'
```

See the `[project.optional-dependencies]` section of the `pyproject.toml` file for the full list of modules and the extra libraries each one installs.
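
If the package is already installed, the declared extras can also be read from its metadata without opening `pyproject.toml`. This is a minimal sketch using the standard library; `declared_extras` is a hypothetical helper name, and the result depends on whichever `oracle-ads` version is installed locally.

```python
from importlib.metadata import PackageNotFoundError, metadata
from typing import List


def declared_extras(distribution: str = "oracle-ads") -> List[str]:
    """List the extras a distribution declares, or [] if it is not installed."""
    try:
        meta = metadata(distribution)
    except PackageNotFoundError:
        return []
    # "Provides-Extra" is the core-metadata field that records each extra.
    return sorted(meta.get_all("Provides-Extra") or [])


if __name__ == "__main__":
    for extra in declared_extras():
        print(extra)
```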

**Note**

Multiple extra dependencies can be installed together. For example:

```bash
  python3 -m pip install 'oracle-ads[notebook,viz,text]'
```

## Documentation

  - [Oracle Accelerated Data Science SDK (ADS) Documentation](https://accelerated-data-science.readthedocs.io/en/latest/index.html)
  - [OCI Data Science and AI services Examples](https://github.com/oracle/oci-data-science-ai-samples)
  - [Oracle AI & Data Science Blog](https://blogs.oracle.com/ai-and-datascience/)
  - [OCI Documentation](https://docs.oracle.com/en-us/iaas/data-science/using/data-science.htm)

## Examples

### Load data from Object Storage

```python
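import ads
import oci
import pandas as pd
from ads.common.auth import default_signer

# Authenticate with the API key from the default OCI config file and profile.
ads.set_auth(auth="api_key", oci_config_location=oci.config.DEFAULT_LOCATION, profile="DEFAULT")

# Replace the placeholders with your own values.
bucket_name = "<bucket_name>"
key = "<key>"
namespace = "<namespace>"

# Read the object directly into a pandas DataFrame.
df = pd.read_csv(f"oci://{bucket_name}@{namespace}/{key}", storage_options=default_signer())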
```
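
The `oci://` URI passed to `pd.read_csv` follows the pattern `oci://<bucket_name>@<namespace>/<key>`. When several objects are read, a small helper can keep that pattern in one place. The function below is an illustrative sketch (the name `oci_uri` is not part of ADS) and needs nothing beyond the standard library.

```python
def oci_uri(bucket_name: str, namespace: str, key: str) -> str:
    """Build an oci://<bucket_name>@<namespace>/<key> URI."""
    if not bucket_name or not namespace:
        raise ValueError("bucket_name and namespace must be non-empty")
    # Drop leading slashes from the object key so the path has a single separator.
    return f"oci://{bucket_name}@{namespace}/{key.lstrip('/')}"


print(oci_uri("my-bucket", "my-namespace", "data/train.csv"))
# oci://my-bucket@my-namespace/data/train.csv
```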

### Load data from ADB

This example uses bind variables, which keep user-supplied values out of the SQL text and so protect against SQL injection.

```python
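import ads  # provides the pd.DataFrame.ads accessor used below
import pandas as pd

# Replace the placeholders with your database credentials and wallet path.
connection_parameters = {
    "user_name": "<user_name>",
    "password": "<password>",
    "service_name": "<tns_name>",
    "wallet_location": "<file_path>",
}

# :max_rows is a bind variable; its value is supplied through bind_variables.
df = pd.DataFrame.ads.read_sql(
    """
    SELECT *
    FROM SH.SALES
    WHERE ROWNUM <= :max_rows
    """,
    bind_variables={"max_rows": 100},
    connection_parameters=connection_parameters,
)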
```
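
To see why bind variables are safe, it can help to try the same idea against a local database. The sketch below uses the standard library's `sqlite3` module as a stand-in; it is not the ADS API, and the table and the `sales_in_region` helper are invented for illustration. The value is passed separately from the SQL text, so a hostile string is compared as plain data instead of being parsed as SQL.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (id INTEGER, region TEXT)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [(1, "EMEA"), (2, "APAC"), (3, "EMEA")],
)


def sales_in_region(connection, region):
    """Return rows for one region, passing the value as a named bind variable."""
    query = "SELECT id, region FROM sales WHERE region = :region ORDER BY id"
    return connection.execute(query, {"region": region}).fetchall()


print(sales_in_region(conn, "EMEA"))          # [(1, 'EMEA'), (3, 'EMEA')]
print(sales_in_region(conn, "x' OR '1'='1"))  # []
```

Had the second value been pasted into the query with string formatting, the `OR '1'='1'` fragment would have matched every row.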

## Contributing

This project welcomes contributions from the community. Before submitting a pull request, please [review our contribution guide](./CONTRIBUTING.md).

Getting started instructions for developers are in [README-development.md](https://github.com/oracle/accelerated-data-science/blob/main/README-development.md).

## Security

Consult the security guide [SECURITY.md](https://github.com/oracle/accelerated-data-science/blob/main/SECURITY.md) for our responsible security vulnerability disclosure process.

## License

Copyright (c) 2020, 2024 Oracle and/or its affiliates. Licensed under the [Universal Permissive License v1.0](https://oss.oracle.com/licenses/upl/).


            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "oracle-ads",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "Oracle Cloud Infrastructure, OCI, Machine Learning, ML, Artificial Intelligence, AI, Data Science, Cloud, Oracle",
    "author": "Oracle Data Science",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/03/62/83700a7bf7fcd2a0b365beb135ac4bdbfdd7ec97999eded0bfbf402b5b13/oracle_ads-2.12.7.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": null,
    "summary": "Oracle Accelerated Data Science SDK",
    "version": "2.12.7",
    "project_urls": {
        "Documentation": "https://accelerated-data-science.readthedocs.io/en/latest/index.html",
        "Github": "https://github.com/oracle/accelerated-data-science"
    },
    "split_keywords": [
        "oracle cloud infrastructure",
        " oci",
        " machine learning",
        " ml",
        " artificial intelligence",
        " ai",
        " data science",
        " cloud",
        " oracle"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "5357038223af5e03e03759041fb432c14e756427f2638db17edf1405296962a6",
                "md5": "a0e7e048c1fe4e0475d012648b15b644",
                "sha256": "6670e49ec9f6b80e117517492dc2208ff647b0307a4d5c7211f95a0b83af3d8f"
            },
            "downloads": -1,
            "filename": "oracle_ads-2.12.7-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "a0e7e048c1fe4e0475d012648b15b644",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 23015414,
            "upload_time": "2024-11-20T19:18:58",
            "upload_time_iso_8601": "2024-11-20T19:18:58.766808Z",
            "url": "https://files.pythonhosted.org/packages/53/57/038223af5e03e03759041fb432c14e756427f2638db17edf1405296962a6/oracle_ads-2.12.7-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "036283700a7bf7fcd2a0b365beb135ac4bdbfdd7ec97999eded0bfbf402b5b13",
                "md5": "d7addd0cb1cf7294cd4551145199ad6b",
                "sha256": "eaf89116fd6b8ef7e09769f4aac3429339e13c3a5553169a604556146b4da76c"
            },
            "downloads": -1,
            "filename": "oracle_ads-2.12.7.tar.gz",
            "has_sig": false,
            "md5_digest": "d7addd0cb1cf7294cd4551145199ad6b",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 22567743,
            "upload_time": "2024-11-20T19:19:02",
            "upload_time_iso_8601": "2024-11-20T19:19:02.528251Z",
            "url": "https://files.pythonhosted.org/packages/03/62/83700a7bf7fcd2a0b365beb135ac4bdbfdd7ec97999eded0bfbf402b5b13/oracle_ads-2.12.7.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-11-20 19:19:02",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "oracle",
    "github_project": "accelerated-data-science",
    "travis_ci": false,
    "coveralls": true,
    "github_actions": true,
    "lcname": "oracle-ads"
}
        