git2doc


Namegit2doc JSON
Version 0.2.3 PyPI version JSON
download
home_pagehttps://github.com/voynow/git2doc
SummaryA tool for converting git repositories into documents
upload_time2023-06-25 20:19:32
maintainer
docs_urlNone
authorJamie Voynow
requires_python
license
keywords
VCS
bugtrack_url
requirements gitpython python-dotenv
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # git2doc 📚

A powerful Python library for converting git repositories into documents. git2doc allows you to extract and analyze code from GitHub repositories, making it easier to understand and work with large codebases.

## Why git2doc? 🚀

Working with large repositories can be overwhelming, especially when trying to understand the structure and content of the code. git2doc simplifies this process by converting repositories into documents, allowing you to easily search, analyze, and understand the codebase.

## Table of Contents 📖

- [Installation](#installation)
- [Usage](#usage)
  - [Fetching Repositories](#fetching-repositories)
  - [Loading Repository Data](#loading-repository-data)
  - [Writing Data to Parquet Files](#writing-data-to-parquet-files)
- [Badges](#badges)
- [Contributing](#contributing)
- [License](#license)

## Installation 💻

```bash
pip install git2doc
```

## Usage 🛠️

### Fetching Repositories

```python
from git2doc import get_repos_orchestrator

repos = get_repos_orchestrator(
    n_repos=10,
    last_n_days=30,
    language="Python"
)
```

### Loading Repository Data

```python
from git2doc import pull_code_from_repo

repo_data = pull_code_from_repo(
    repo="https://github.com/voynow/git2doc",
    branch="main"
)
```

### Writing Data to Parquet Files

```python
from git2doc import pipeline_fetch_and_load

pipeline_fetch_and_load(
    n_repos=1000,
    last_n_days=365,
    language="Python",
    write_batch_size=100,
    delete=True,
)
```

## Badges 🏅

[![PyPI version](https://badge.fury.io/py/git2doc.svg)](https://badge.fury.io/py/git2doc)
[![GitHub stars](https://img.shields.io/github/stars/voynow/git2doc)](https://github.com/voynow/git2doc/stargazers)
[![GitHub forks](https://img.shields.io/github/forks/voynow/git2doc)](https://github.com/voynow/git2doc/network)
[![GitHub issues](https://img.shields.io/github/issues/voynow/git2doc)](https://github.com/voynow/git2doc/issues)

## Contributing 🤝

Contributions are welcome! Please feel free to submit a pull request or open an issue on GitHub.

## License 📄

This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for more details.

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/voynow/git2doc",
    "name": "git2doc",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "",
    "author": "Jamie Voynow",
    "author_email": "voynow99@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/be/d9/815d50cc4f5e4518332438e926373529a45cbba01f8b1eef9a2a80719bcb/git2doc-0.2.3.tar.gz",
    "platform": null,
    "description": "# git2doc \ud83d\udcda\r\n\r\nA powerful Python library for converting git repositories into documents. git2doc allows you to extract and analyze code from GitHub repositories, making it easier to understand and work with large codebases.\r\n\r\n## Why git2doc? \ud83d\ude80\r\n\r\nWorking with large repositories can be overwhelming, especially when trying to understand the structure and content of the code. git2doc simplifies this process by converting repositories into documents, allowing you to easily search, analyze, and understand the codebase.\r\n\r\n## Table of Contents \ud83d\udcd6\r\n\r\n- [Installation](#installation)\r\n- [Usage](#usage)\r\n  - [Fetching Repositories](#fetching-repositories)\r\n  - [Loading Repository Data](#loading-repository-data)\r\n  - [Writing Data to Parquet Files](#writing-data-to-parquet-files)\r\n- [Badges](#badges)\r\n- [Contributing](#contributing)\r\n- [License](#license)\r\n\r\n## Installation \ud83d\udcbb\r\n\r\n```bash\r\npip install git2doc\r\n```\r\n\r\n## Usage \ud83d\udee0\ufe0f\r\n\r\n### Fetching Repositories\r\n\r\n```python\r\nfrom git2doc import get_repos_orchestrator\r\n\r\nrepos = get_repos_orchestrator(\r\n    n_repos=10,\r\n    last_n_days=30,\r\n    language=\"Python\"\r\n)\r\n```\r\n\r\n### Loading Repository Data\r\n\r\n```python\r\nfrom git2doc import pull_code_from_repo\r\n\r\nrepo_data = pull_code_from_repo(\r\n    repo=\"https://github.com/voynow/git2doc\",\r\n    branch=\"main\"\r\n)\r\n```\r\n\r\n### Writing Data to Parquet Files\r\n\r\n```python\r\nfrom git2doc import pipeline_fetch_and_load\r\n\r\npipeline_fetch_and_load(\r\n    n_repos=1000,\r\n    last_n_days=365,\r\n    language=\"Python\",\r\n    write_batch_size=100,\r\n    delete=True,\r\n)\r\n```\r\n\r\n## Badges \ud83c\udfc5\r\n\r\n[![PyPI version](https://badge.fury.io/py/git2doc.svg)](https://badge.fury.io/py/git2doc)\r\n[![GitHub stars](https://img.shields.io/github/stars/voynow/git2doc)](https://github.com/voynow/git2doc/stargazers)\r\n[![GitHub forks](https://img.shields.io/github/forks/voynow/git2doc)](https://github.com/voynow/git2doc/network)\r\n[![GitHub issues](https://img.shields.io/github/issues/voynow/git2doc)](https://github.com/voynow/git2doc/issues)\r\n\r\n## Contributing \ud83e\udd1d\r\n\r\nContributions are welcome! Please feel free to submit a pull request or open an issue on GitHub.\r\n\r\n## License \ud83d\udcc4\r\n\r\nThis project is licensed under the MIT License. See the [LICENSE](LICENSE) file for more details.\r\n",
    "bugtrack_url": null,
    "license": "",
    "summary": "A tool for converting git repositories into documents",
    "version": "0.2.3",
    "project_urls": {
        "Homepage": "https://github.com/voynow/git2doc"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "a5dc4571065865d7db022c9e22fca92c6c9ca77d0b5d0a6cf0605b56754c08b6",
                "md5": "066b774df4d9a05d49c0856927256182",
                "sha256": "46cb2813bad39c33f7cbed7394bb16816207e4070eb4e208e5c7e72949ca1661"
            },
            "downloads": -1,
            "filename": "git2doc-0.2.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "066b774df4d9a05d49c0856927256182",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 10846,
            "upload_time": "2023-06-25T20:19:29",
            "upload_time_iso_8601": "2023-06-25T20:19:29.858898Z",
            "url": "https://files.pythonhosted.org/packages/a5/dc/4571065865d7db022c9e22fca92c6c9ca77d0b5d0a6cf0605b56754c08b6/git2doc-0.2.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "bed9815d50cc4f5e4518332438e926373529a45cbba01f8b1eef9a2a80719bcb",
                "md5": "189f976ca4e0ae7d0089b7977c60aa61",
                "sha256": "e6a9d25da0a1b6ae940dc45183333489347cf33c9d99df39bdb25d42c2ccfc21"
            },
            "downloads": -1,
            "filename": "git2doc-0.2.3.tar.gz",
            "has_sig": false,
            "md5_digest": "189f976ca4e0ae7d0089b7977c60aa61",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 6122,
            "upload_time": "2023-06-25T20:19:32",
            "upload_time_iso_8601": "2023-06-25T20:19:32.370948Z",
            "url": "https://files.pythonhosted.org/packages/be/d9/815d50cc4f5e4518332438e926373529a45cbba01f8b1eef9a2a80719bcb/git2doc-0.2.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-06-25 20:19:32",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "voynow",
    "github_project": "git2doc",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [
        {
            "name": "gitpython",
            "specs": []
        },
        {
            "name": "python-dotenv",
            "specs": []
        }
    ],
    "lcname": "git2doc"
}
        
Elapsed time: 1.11126s