cereslib


Namecereslib JSON
Version 0.5.6 PyPI version JSON
download
home_pagehttps://chaoss.github.io/grimoirelab/
SummaryGrimoireLab: Unify, eventize and enrich information from Perceval
upload_time2024-03-27 12:11:17
maintainerNone
docs_urlNone
authorGrimoireLab Developers
requires_python<4.0,>=3.8
licenseGPL-3.0+
keywords development grimoirelab
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Ceres [![Build Status](https://github.com/chaoss/grimoirelab-cereslib/workflows/tests/badge.svg)](https://github.com/chaoss/grimoirelab-cereslib/actions?query=workflow:tests+branch:master+event:push) [![Coverage Status](https://coveralls.io/repos/github/chaoss/grimoirelab-cereslib/badge.svg?branch=master)](https://coveralls.io/github/chaoss/grimoirelab-cereslib?branch=master) [![PyPI version](https://badge.fury.io/py/cereslib.svg)](https://badge.fury.io/py/cereslib)

Ceres is a library that aims at dealing with data in general,
and software development data in particular.

The initial goal of Ceres is to parse information in several ways
from the [Perceval](https://github.com/grimoirelab/perceval) tool
in the [GrimoireLab project](https://github.com/grimoirelab).

However, the more code is added to this project, the more generic
methods are found to be useful in other areas of analysis.

The following are the areas of analysis that Ceres can help at:

## Eventize

The 'eventizer' helps to split information coming from Perceval.
In short, Perceval produces JSON documents and those can be consumed
by Ceres and by the 'eventizing' side of the library.

By 'eventizing', this means the process to parse a full Perceval JSON
document and produce a Pandas DataFrame with certain amount of information.

As an example, a commit contains information about the commit itself, and
the files that were 'touched' at some point. Depending on the granularity
of the analysis Ceres will work in the following way:

* Granularity = 1: This is the first level and produces 1 to 1 relationship
  with the main items in the original data source. For example 1 commit would 
  be just 1 row in the resultant dataframe. This would be a similar case for
  a code review process in Gerrit or in Bugzilla for tickets.
* Granularity = 2: This is the second level and depends on the data source
  how in depth this goes. In the specific case of commits, this would return
  n rows in the dataframe. And there will be as many rows as files where 
  'touched' in the original data source.


## Format

The format part of the library contains some utils that are useful for
some basic formatting actions such as having a whole column in the Pandas
dataframe with the same string format.

Another example would be the use of the format utils to cast from string
to date using datetuils and applying the method to a whole column of a 
given dataframe.

## Filter

The filter utility basically removes rows based on certain values in
certain cells of a dataframe.

## Data Enrich

This is the utility most context-related together with the eventizing
actions. This will add or modify one or more columns in several ways.

There are several examples such as taking care of the surrogates enabling
UTF8, adding new columns based on some actions on others, adding the gender
of the name provided in another column, and others.


# How can you help here?

This project is still quite new, and the development is really slow, so
any extra hand would be really awesome, even giving directions, pieces
of advice or feature requests :).

And of course, using the software would be great!

# Where to start?

The examples folder contains some of the clients I've used for some
analysis such as the gender analysis or to produce dataframes that help
to understand the areas of the code where developers are working.

Those are probably a good place to have a look at.

## Requirements

 * Python >= 3.8

You will also need some other libraries for running the tool, you can find the
whole list of dependencies in [pyproject.toml](pyproject.toml) file.

## Installation

There are several ways to install Cereslib on your system: packages or source 
code using Poetry or pip.

### PyPI

Cereslib can be installed using pip, a tool for installing Python packages. 
To do it, run the next command:
```
$ pip install cereslib
```

### Source code

To install from the source code you will need to clone the repository first:
```
$ git clone https://github.com/chaoss/grimoirelab-cereslib
$ cd grimoirelab-cereslib
```

Then use pip or Poetry to install the package along with its dependencies.

#### Pip
To install the package from local directory run the following command:
```
$ pip install .
```
In case you are a developer, you should install cereslib in editable mode:
```
$ pip install -e .
```

#### Poetry
We use [poetry](https://python-poetry.org/) for dependency management and 
packaging. You can install it following its [documentation](https://python-poetry.org/docs/#installation).
Once you have installed it, you can install cereslib and the dependencies in 
a project isolated environment using:
```
$ poetry install
```
To spaw a new shell within the virtual environment use:
```
$ poetry shell
```

## License

Licensed under GNU General Public License (GPL), version 3 or later.

            

Raw data

            {
    "_id": null,
    "home_page": "https://chaoss.github.io/grimoirelab/",
    "name": "cereslib",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<4.0,>=3.8",
    "maintainer_email": null,
    "keywords": "development, grimoirelab",
    "author": "GrimoireLab Developers",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/8e/17/a0d43f74d7b57cf572f968d8dff9367f28b64466b5630aaa7bf3334fcd0e/cereslib-0.5.6.tar.gz",
    "platform": null,
    "description": "# Ceres [![Build Status](https://github.com/chaoss/grimoirelab-cereslib/workflows/tests/badge.svg)](https://github.com/chaoss/grimoirelab-cereslib/actions?query=workflow:tests+branch:master+event:push) [![Coverage Status](https://coveralls.io/repos/github/chaoss/grimoirelab-cereslib/badge.svg?branch=master)](https://coveralls.io/github/chaoss/grimoirelab-cereslib?branch=master) [![PyPI version](https://badge.fury.io/py/cereslib.svg)](https://badge.fury.io/py/cereslib)\n\nCeres is a library that aims at dealing with data in general,\nand software development data in particular.\n\nThe initial goal of Ceres is to parse information in several ways\nfrom the [Perceval](https://github.com/grimoirelab/perceval) tool\nin the [GrimoireLab project](https://github.com/grimoirelab).\n\nHowever, the more code is added to this project, the more generic\nmethods are found to be useful in other areas of analysis.\n\nThe following are the areas of analysis that Ceres can help at:\n\n## Eventize\n\nThe 'eventizer' helps to split information coming from Perceval.\nIn short, Perceval produces JSON documents and those can be consumed\nby Ceres and by the 'eventizing' side of the library.\n\nBy 'eventizing', this means the process to parse a full Perceval JSON\ndocument and produce a Pandas DataFrame with certain amount of information.\n\nAs an example, a commit contains information about the commit itself, and\nthe files that were 'touched' at some point. Depending on the granularity\nof the analysis Ceres will work in the following way:\n\n* Granularity = 1: This is the first level and produces 1 to 1 relationship\n  with the main items in the original data source. For example 1 commit would \n  be just 1 row in the resultant dataframe. This would be a similar case for\n  a code review process in Gerrit or in Bugzilla for tickets.\n* Granularity = 2: This is the second level and depends on the data source\n  how in depth this goes. In the specific case of commits, this would return\n  n rows in the dataframe. And there will be as many rows as files where \n  'touched' in the original data source.\n\n\n## Format\n\nThe format part of the library contains some utils that are useful for\nsome basic formatting actions such as having a whole column in the Pandas\ndataframe with the same string format.\n\nAnother example would be the use of the format utils to cast from string\nto date using datetuils and applying the method to a whole column of a \ngiven dataframe.\n\n## Filter\n\nThe filter utility basically removes rows based on certain values in\ncertain cells of a dataframe.\n\n## Data Enrich\n\nThis is the utility most context-related together with the eventizing\nactions. This will add or modify one or more columns in several ways.\n\nThere are several examples such as taking care of the surrogates enabling\nUTF8, adding new columns based on some actions on others, adding the gender\nof the name provided in another column, and others.\n\n\n# How can you help here?\n\nThis project is still quite new, and the development is really slow, so\nany extra hand would be really awesome, even giving directions, pieces\nof advice or feature requests :).\n\nAnd of course, using the software would be great!\n\n# Where to start?\n\nThe examples folder contains some of the clients I've used for some\nanalysis such as the gender analysis or to produce dataframes that help\nto understand the areas of the code where developers are working.\n\nThose are probably a good place to have a look at.\n\n## Requirements\n\n * Python >= 3.8\n\nYou will also need some other libraries for running the tool, you can find the\nwhole list of dependencies in [pyproject.toml](pyproject.toml) file.\n\n## Installation\n\nThere are several ways to install Cereslib on your system: packages or source \ncode using Poetry or pip.\n\n### PyPI\n\nCereslib can be installed using pip, a tool for installing Python packages. \nTo do it, run the next command:\n```\n$ pip install cereslib\n```\n\n### Source code\n\nTo install from the source code you will need to clone the repository first:\n```\n$ git clone https://github.com/chaoss/grimoirelab-cereslib\n$ cd grimoirelab-cereslib\n```\n\nThen use pip or Poetry to install the package along with its dependencies.\n\n#### Pip\nTo install the package from local directory run the following command:\n```\n$ pip install .\n```\nIn case you are a developer, you should install cereslib in editable mode:\n```\n$ pip install -e .\n```\n\n#### Poetry\nWe use [poetry](https://python-poetry.org/) for dependency management and \npackaging. You can install it following its [documentation](https://python-poetry.org/docs/#installation).\nOnce you have installed it, you can install cereslib and the dependencies in \na project isolated environment using:\n```\n$ poetry install\n```\nTo spaw a new shell within the virtual environment use:\n```\n$ poetry shell\n```\n\n## License\n\nLicensed under GNU General Public License (GPL), version 3 or later.\n",
    "bugtrack_url": null,
    "license": "GPL-3.0+",
    "summary": "GrimoireLab: Unify, eventize and enrich information from Perceval",
    "version": "0.5.6",
    "project_urls": {
        "Bug Tracker": "https://github.com/chaoss/grimoirelab-cereslib/issues",
        "Homepage": "https://chaoss.github.io/grimoirelab/",
        "Repository": "https://github.com/chaoss/grimoirelab-cereslib"
    },
    "split_keywords": [
        "development",
        " grimoirelab"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "66dd55a3ee2ade6573e1aae34c5eef01406be5a48b39a12b075a51cfb485056b",
                "md5": "eccf322aca380d8df7307d43ceee256e",
                "sha256": "0545e6453b9f413d888e5eb59d256ac7a59245a4c234c2abfb860c780946194f"
            },
            "downloads": -1,
            "filename": "cereslib-0.5.6-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "eccf322aca380d8df7307d43ceee256e",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.8",
            "size": 23491,
            "upload_time": "2024-03-27T12:11:15",
            "upload_time_iso_8601": "2024-03-27T12:11:15.525423Z",
            "url": "https://files.pythonhosted.org/packages/66/dd/55a3ee2ade6573e1aae34c5eef01406be5a48b39a12b075a51cfb485056b/cereslib-0.5.6-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "8e17a0d43f74d7b57cf572f968d8dff9367f28b64466b5630aaa7bf3334fcd0e",
                "md5": "69ac291c08ffd0bacaea1a5462413e46",
                "sha256": "7aea49599f18597f4fd124887ebed0a2b59bca567380d7512a228a9649011212"
            },
            "downloads": -1,
            "filename": "cereslib-0.5.6.tar.gz",
            "has_sig": false,
            "md5_digest": "69ac291c08ffd0bacaea1a5462413e46",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.8",
            "size": 29422,
            "upload_time": "2024-03-27T12:11:17",
            "upload_time_iso_8601": "2024-03-27T12:11:17.305315Z",
            "url": "https://files.pythonhosted.org/packages/8e/17/a0d43f74d7b57cf572f968d8dff9367f28b64466b5630aaa7bf3334fcd0e/cereslib-0.5.6.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-03-27 12:11:17",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "chaoss",
    "github_project": "grimoirelab-cereslib",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "cereslib"
}
        
Elapsed time: 0.38740s