gptchem


Namegptchem JSON
Version 0.0.5 PyPI version JSON
download
home_pagehttps://github.com/kjappelbaum/gptchem
SummaryUse GPT-3 to solve chemistry problems
upload_time2023-10-04 11:28:07
maintainerKevin M. Jablonka
docs_urlNone
authorKevin M. Jablonka
requires_python>=3.8
licenseMIT
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            
<p align="center">
  <img src="https://github.com/kjappelbaum/gptchem/raw/main/docs/source/static/grid_0.png" height="150">
</p>


<h1 align="center">
  gptchem
</h1>

<p align="center">
    <a href="https://github.com/kjappelbaum/gptchem/actions?query=workflow%3ATests">
        <img alt="Tests" src="https://github.com/kjappelbaum/gptchem/workflows/Tests/badge.svg" />
    </a>
    <!-- <a href="https://pypi.org/project/gptchem">
        <img alt="PyPI" src="https://img.shields.io/pypi/v/gptchem" />
    </a>
    <a href="https://pypi.org/project/gptchem">
        <img alt="PyPI - Python Version" src="https://img.shields.io/pypi/pyversions/gptchem" />
    </a> -->
    <a href="https://github.com/kjappelbaum/gptchem/blob/main/LICENSE">
        <img alt="PyPI - License" src="https://img.shields.io/pypi/l/gptchem" />
    </a>
    <a href='https://gptchem.readthedocs.io/en/latest/?badge=latest'>
        <img src='https://readthedocs.org/projects/gptchem/badge/?version=latest' alt='Documentation Status' />
    </a>
    <a href="https://codecov.io/gh/kjappelbaum/gptchem/branch/main">
        <img src="https://codecov.io/gh/kjappelbaum/gptchem/branch/main/graph/badge.svg" alt="Codecov status" />
    </a>  
    <a href="https://github.com/cthoyt/cookiecutter-python-package">
        <img alt="Cookiecutter template from @cthoyt" src="https://img.shields.io/badge/Cookiecutter-snekpack-blue" /> 
    </a>
    <a href='https://github.com/psf/black'>
        <img src='https://img.shields.io/badge/code%20style-black-000000.svg' alt='Code style: black' />
    </a>
    <a href="https://github.com/kjappelbaum/gptchem/blob/main/.github/CODE_OF_CONDUCT.md">
        <img src="https://img.shields.io/badge/Contributor%20Covenant-2.1-4baaaa.svg" alt="Contributor Covenant"/>
    </a>
</p>

Use GPT-3 to solve chemistry problems.
Most of the repo is currently not intended for use as library but as documentation of our experiments. 
We'll factor out the experiments (that come with tricky dependencies) into its own repository over time.

## 💪 Getting Started

```python 
from gptchem.gpt_classifier import GPTClassifier 
from gptchem.tuner import Tuner 

classifier = GPTClassifier(
    property_name="transition wavelength", # this is the property name we will use in the prompt template
    tuner=Tuner(n_epochs=8, learning_rate_multiplier=0.02, wandb_sync=False),
)

classifier.fit(["CC", "CDDFSS"], [0, 1])
predictions = classifier.predict(['CCCC', 'CCCCCCCC'])
```

The time these call take can vary as the methods call the OpenAI API under the hood. Therefore, in situation of high load, we also experienced hours of waiting time in the queue. 

## 🚀 Installation

<!-- Uncomment this section after your first ``tox -e finish``
The most recent release can be installed from
[PyPI](https://pypi.org/project/gptchem/) with:

```bash
$ pip install gptchem
```
-->

The most recent code and data can be installed directly from GitHub with:

```bash
$ pip install git+https://github.com/kjappelbaum/gptchem.git
```

The installation should only take a few seconds to minutes. You can install additional depenencies using the extras `experiments` and `eval`.

## 👐 Contributing

Contributions, whether filing an issue, making a pull request, or forking, are appreciated. See
[CONTRIBUTING.md](https://github.com/kjappelbaum/gptchem/blob/master/.github/CONTRIBUTING.md) for more information on getting involved.

## 👋 Attribution

### ⚖️ License

The code in this package is licensed under the MIT License.


### 📖 Citation

If you found this package useful, please cite our preprint 

```
@inproceedings{Jablonka_2023,
	doi = {10.26434/chemrxiv-2023-fw8n4},
	url = {https://doi.org/10.26434%2Fchemrxiv-2023-fw8n4},
	year = 2023,
	month = {feb},
	booktitle = {ChemRxiv},
	author = {Kevin Maik Jablonka and Philippe Schwaller and Andres Ortega-Guerrero and Berend Smit},
	title = {Is {GPT} all you need for low-data discovery in chemistry?}
}
```


<!--
### 🎁 Support

This project has been supported by the following organizations (in alphabetical order):

- [Harvard Program in Therapeutic Science - Laboratory of Systems Pharmacology](https://hits.harvard.edu/the-program/laboratory-of-systems-pharmacology/)

-->

<!--
### 💰 Funding

This project has been supported by the following grants:

| Funding Body | Program                                                                                                                       | Grant         |
| ------------ | ----------------------------------------------------------------------------------------------------------------------------- | ------------- |
| DARPA        | [Automating Scientific Knowledge Extraction (ASKE)](https://www.darpa.mil/program/automating-scientific-knowledge-extraction) | HR00111990009 |
-->

## 🛠️ For Developers

<details>
  <summary>See developer instructions</summary>


The final section of the README is for if you want to get involved by making a code contribution.

### Development Installation

To install in development mode, use the following:

```bash
$ git clone git+https://github.com/kjappelbaum/gptchem.git
$ cd gptchem
$ pip install -e .
```

### 🥼 Testing

After cloning the repository and installing `tox` with `pip install tox`, the unit tests in the `tests/` folder can be
run reproducibly with:

```shell
$ tox
```

Additionally, these tests are automatically re-run with each commit in a [GitHub Action](https://github.com/kjappelbaum/gptchem/actions?query=workflow%3ATests).

### 📖 Building the Documentation

The documentation can be built locally using the following:

```shell
$ git clone git+https://github.com/kjappelbaum/gptchem.git
$ cd gptchem
$ tox -e docs
$ open docs/build/html/index.html
``` 

The documentation automatically installs the package as well as the `docs`
extra specified in the [`setup.cfg`](setup.cfg). `sphinx` plugins
like `texext` can be added there. Additionally, they need to be added to the
`extensions` list in [`docs/source/conf.py`](docs/source/conf.py).

### 📦 Making a Release

After installing the package in development mode and installing
`tox` with `pip install tox`, the commands for making a new release are contained within the `finish` environment
in `tox.ini`. Run the following from the shell:

```shell
$ tox -e finish
```

This script does the following:

1. Uses [Bump2Version](https://github.com/c4urself/bump2version) to switch the version number in the `setup.cfg`,
   `src/gptchem/version.py`, and [`docs/source/conf.py`](docs/source/conf.py) to not have the `-dev` suffix
2. Packages the code in both a tar archive and a wheel using [`build`](https://github.com/pypa/build)
3. Uploads to PyPI using [`twine`](https://github.com/pypa/twine). Be sure to have a `.pypirc` file configured to avoid the need for manual input at this
   step
4. Push to GitHub. You'll need to make a release going with the commit where the version was bumped.
5. Bump the version to the next patch. If you made big changes and want to bump the version by minor, you can
   use `tox -e bumpversion minor` after.
</details>

### 🍪 Cookiecutter

This package was created with [@audreyfeldroy](https://github.com/audreyfeldroy)'s
[cookiecutter](https://github.com/cookiecutter/cookiecutter) package using [@cthoyt](https://github.com/cthoyt)'s
[cookiecutter-snekpack](https://github.com/cthoyt/cookiecutter-snekpack) template.

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/kjappelbaum/gptchem",
    "name": "gptchem",
    "maintainer": "Kevin M. Jablonka",
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": "mail@kjablonka.com",
    "keywords": "",
    "author": "Kevin M. Jablonka",
    "author_email": "mail@kjablonka.com",
    "download_url": "https://github.com/kjappelbaum/gptchem/releases",
    "platform": null,
    "description": "\n<p align=\"center\">\n  <img src=\"https://github.com/kjappelbaum/gptchem/raw/main/docs/source/static/grid_0.png\" height=\"150\">\n</p>\n\n\n<h1 align=\"center\">\n  gptchem\n</h1>\n\n<p align=\"center\">\n    <a href=\"https://github.com/kjappelbaum/gptchem/actions?query=workflow%3ATests\">\n        <img alt=\"Tests\" src=\"https://github.com/kjappelbaum/gptchem/workflows/Tests/badge.svg\" />\n    </a>\n    <!-- <a href=\"https://pypi.org/project/gptchem\">\n        <img alt=\"PyPI\" src=\"https://img.shields.io/pypi/v/gptchem\" />\n    </a>\n    <a href=\"https://pypi.org/project/gptchem\">\n        <img alt=\"PyPI - Python Version\" src=\"https://img.shields.io/pypi/pyversions/gptchem\" />\n    </a> -->\n    <a href=\"https://github.com/kjappelbaum/gptchem/blob/main/LICENSE\">\n        <img alt=\"PyPI - License\" src=\"https://img.shields.io/pypi/l/gptchem\" />\n    </a>\n    <a href='https://gptchem.readthedocs.io/en/latest/?badge=latest'>\n        <img src='https://readthedocs.org/projects/gptchem/badge/?version=latest' alt='Documentation Status' />\n    </a>\n    <a href=\"https://codecov.io/gh/kjappelbaum/gptchem/branch/main\">\n        <img src=\"https://codecov.io/gh/kjappelbaum/gptchem/branch/main/graph/badge.svg\" alt=\"Codecov status\" />\n    </a>  \n    <a href=\"https://github.com/cthoyt/cookiecutter-python-package\">\n        <img alt=\"Cookiecutter template from @cthoyt\" src=\"https://img.shields.io/badge/Cookiecutter-snekpack-blue\" /> \n    </a>\n    <a href='https://github.com/psf/black'>\n        <img src='https://img.shields.io/badge/code%20style-black-000000.svg' alt='Code style: black' />\n    </a>\n    <a href=\"https://github.com/kjappelbaum/gptchem/blob/main/.github/CODE_OF_CONDUCT.md\">\n        <img src=\"https://img.shields.io/badge/Contributor%20Covenant-2.1-4baaaa.svg\" alt=\"Contributor Covenant\"/>\n    </a>\n</p>\n\nUse GPT-3 to solve chemistry problems.\nMost of the repo is currently not intended for use as library but as documentation of our experiments. \nWe'll factor out the experiments (that come with tricky dependencies) into its own repository over time.\n\n## \ud83d\udcaa Getting Started\n\n```python \nfrom gptchem.gpt_classifier import GPTClassifier \nfrom gptchem.tuner import Tuner \n\nclassifier = GPTClassifier(\n    property_name=\"transition wavelength\", # this is the property name we will use in the prompt template\n    tuner=Tuner(n_epochs=8, learning_rate_multiplier=0.02, wandb_sync=False),\n)\n\nclassifier.fit([\"CC\", \"CDDFSS\"], [0, 1])\npredictions = classifier.predict(['CCCC', 'CCCCCCCC'])\n```\n\nThe time these call take can vary as the methods call the OpenAI API under the hood. Therefore, in situation of high load, we also experienced hours of waiting time in the queue. \n\n## \ud83d\ude80 Installation\n\n<!-- Uncomment this section after your first ``tox -e finish``\nThe most recent release can be installed from\n[PyPI](https://pypi.org/project/gptchem/) with:\n\n```bash\n$ pip install gptchem\n```\n-->\n\nThe most recent code and data can be installed directly from GitHub with:\n\n```bash\n$ pip install git+https://github.com/kjappelbaum/gptchem.git\n```\n\nThe installation should only take a few seconds to minutes. You can install additional depenencies using the extras `experiments` and `eval`.\n\n## \ud83d\udc50 Contributing\n\nContributions, whether filing an issue, making a pull request, or forking, are appreciated. See\n[CONTRIBUTING.md](https://github.com/kjappelbaum/gptchem/blob/master/.github/CONTRIBUTING.md) for more information on getting involved.\n\n## \ud83d\udc4b Attribution\n\n### \u2696\ufe0f License\n\nThe code in this package is licensed under the MIT License.\n\n\n### \ud83d\udcd6 Citation\n\nIf you found this package useful, please cite our preprint \n\n```\n@inproceedings{Jablonka_2023,\n\tdoi = {10.26434/chemrxiv-2023-fw8n4},\n\turl = {https://doi.org/10.26434%2Fchemrxiv-2023-fw8n4},\n\tyear = 2023,\n\tmonth = {feb},\n\tbooktitle = {ChemRxiv},\n\tauthor = {Kevin Maik Jablonka and Philippe Schwaller and Andres Ortega-Guerrero and Berend Smit},\n\ttitle = {Is {GPT} all you need for low-data discovery in chemistry?}\n}\n```\n\n\n<!--\n### \ud83c\udf81 Support\n\nThis project has been supported by the following organizations (in alphabetical order):\n\n- [Harvard Program in Therapeutic Science - Laboratory of Systems Pharmacology](https://hits.harvard.edu/the-program/laboratory-of-systems-pharmacology/)\n\n-->\n\n<!--\n### \ud83d\udcb0 Funding\n\nThis project has been supported by the following grants:\n\n| Funding Body | Program                                                                                                                       | Grant         |\n| ------------ | ----------------------------------------------------------------------------------------------------------------------------- | ------------- |\n| DARPA        | [Automating Scientific Knowledge Extraction (ASKE)](https://www.darpa.mil/program/automating-scientific-knowledge-extraction) | HR00111990009 |\n-->\n\n## \ud83d\udee0\ufe0f For Developers\n\n<details>\n  <summary>See developer instructions</summary>\n\n\nThe final section of the README is for if you want to get involved by making a code contribution.\n\n### Development Installation\n\nTo install in development mode, use the following:\n\n```bash\n$ git clone git+https://github.com/kjappelbaum/gptchem.git\n$ cd gptchem\n$ pip install -e .\n```\n\n### \ud83e\udd7c Testing\n\nAfter cloning the repository and installing `tox` with `pip install tox`, the unit tests in the `tests/` folder can be\nrun reproducibly with:\n\n```shell\n$ tox\n```\n\nAdditionally, these tests are automatically re-run with each commit in a [GitHub Action](https://github.com/kjappelbaum/gptchem/actions?query=workflow%3ATests).\n\n### \ud83d\udcd6 Building the Documentation\n\nThe documentation can be built locally using the following:\n\n```shell\n$ git clone git+https://github.com/kjappelbaum/gptchem.git\n$ cd gptchem\n$ tox -e docs\n$ open docs/build/html/index.html\n``` \n\nThe documentation automatically installs the package as well as the `docs`\nextra specified in the [`setup.cfg`](setup.cfg). `sphinx` plugins\nlike `texext` can be added there. Additionally, they need to be added to the\n`extensions` list in [`docs/source/conf.py`](docs/source/conf.py).\n\n### \ud83d\udce6 Making a Release\n\nAfter installing the package in development mode and installing\n`tox` with `pip install tox`, the commands for making a new release are contained within the `finish` environment\nin `tox.ini`. Run the following from the shell:\n\n```shell\n$ tox -e finish\n```\n\nThis script does the following:\n\n1. Uses [Bump2Version](https://github.com/c4urself/bump2version) to switch the version number in the `setup.cfg`,\n   `src/gptchem/version.py`, and [`docs/source/conf.py`](docs/source/conf.py) to not have the `-dev` suffix\n2. Packages the code in both a tar archive and a wheel using [`build`](https://github.com/pypa/build)\n3. Uploads to PyPI using [`twine`](https://github.com/pypa/twine). Be sure to have a `.pypirc` file configured to avoid the need for manual input at this\n   step\n4. Push to GitHub. You'll need to make a release going with the commit where the version was bumped.\n5. Bump the version to the next patch. If you made big changes and want to bump the version by minor, you can\n   use `tox -e bumpversion minor` after.\n</details>\n\n### \ud83c\udf6a Cookiecutter\n\nThis package was created with [@audreyfeldroy](https://github.com/audreyfeldroy)'s\n[cookiecutter](https://github.com/cookiecutter/cookiecutter) package using [@cthoyt](https://github.com/cthoyt)'s\n[cookiecutter-snekpack](https://github.com/cthoyt/cookiecutter-snekpack) template.\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Use GPT-3 to solve chemistry problems",
    "version": "0.0.5",
    "project_urls": {
        "Bug Tracker": "https://github.com/kjappelbaum/gptchem/issues",
        "Download": "https://github.com/kjappelbaum/gptchem/releases",
        "Homepage": "https://github.com/kjappelbaum/gptchem",
        "Source Code": "https://github.com/kjappelbaum/gptchem"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "3bb552b79a84e70d39481b7f0fa4fb528b364b38aff2a7ebbd9ee760528fda1a",
                "md5": "1070b7ba75a646c4f730721f61eb4245",
                "sha256": "c57aa1f11c32fd56e6d3c205ebf8c995f66699b8cb1e305242f5dabe1884a110"
            },
            "downloads": -1,
            "filename": "gptchem-0.0.5-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "1070b7ba75a646c4f730721f61eb4245",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 64946,
            "upload_time": "2023-10-04T11:28:07",
            "upload_time_iso_8601": "2023-10-04T11:28:07.055890Z",
            "url": "https://files.pythonhosted.org/packages/3b/b5/52b79a84e70d39481b7f0fa4fb528b364b38aff2a7ebbd9ee760528fda1a/gptchem-0.0.5-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-10-04 11:28:07",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "kjappelbaum",
    "github_project": "gptchem",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "tox": true,
    "lcname": "gptchem"
}
        
Elapsed time: 0.16299s