kgcreator


Namekgcreator JSON
Version 0.21 PyPI version JSON
download
home_pagehttps://github.com/mark-watson/kgcreator
SummaryKnowledge Graph Creator: converts text to RDF triples
upload_time2022-12-09 22:02:55
maintainer
docs_urlNone
authorMark Watson
requires_python>=3.7
licenseApache License, Version 2.0
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Knowledge Graph Creator command line tool: kgcreator

[![PyPI](https://img.shields.io/pypi/v/kgcreator.svg)](https://pypi.org/project/kgcreator/)
[![Changelog](https://img.shields.io/github/v/release/mark-watson/kgcreator?include_prereleases&label=changelog)](https://github.com/mark-watson/kgcreator/releases)
[![Tests](https://github.com/mark-watson/kgcreator/workflows/Test/badge.svg)](https://github.com/mark-watson/kgcreator/actions?query=workflow%3ATest)
[![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](https://github.com/mark-watson/kgcreator/blob/master/LICENSE)

### Knowledge Graph Creator: converts text to RDF triples and Neo4J graph and Memgraph database data

The Knowledge Graph Creator (kgcreator) is a tool for automating the generation of RDF data for Knowledge Graphs from raw text data read from an input directory path.

The Knowledge Graph Creator creates an output file containing RDF triples suitable for loading into any linked data/semantic web data store.

This Python command line utility is one of the example programs in my book
[Practical Python Artificial Intelligence Programming](https://leanpub.com/pythonai) that is available on [Leanpub.com](https://leanpub.com/pythonai) or available (in the future, this book is under development) to read for free on my web site [https://markwatson.com](https://markwatson.com).


## Installation

Install this tool using `pip`:

    pip install kgcreator
    pip install spacy
    python -m spacy download en_core_web_sm

## Usage

For help, run:

    kgcreator --help
    kgcreator --inputdir=test_data --outputfile=out.rdf  --outputfileneo4j=out.cypher

You can also use:

    python -m kgcreator --help

## Development

To contribute to this tool, first checkout the code. Then create a new virtual environment:

    cd kgcreator
    python -m venv venv
    source venv/bin/activate

Now install the dependencies and test dependencies:

    pip install -e '.[test]'

To run the tests:

    pytest

## Notes on generating Neo4J graph data

Start by identifying:

- Names of entities (node labels).
- Names of relationships.
- Names of properties for nodes and relationships.
- Constraints to be defined.
- Indexes required.
- The most important queries?

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/mark-watson/kgcreator",
    "name": "kgcreator",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.7",
    "maintainer_email": "",
    "keywords": "",
    "author": "Mark Watson",
    "author_email": "",
    "download_url": "https://files.pythonhosted.org/packages/8c/e2/70d885131bf36f74d667b3d6332e69b5fbf892a9d342e69ce7b918c650d3/kgcreator-0.21.tar.gz",
    "platform": null,
    "description": "# Knowledge Graph Creator command line tool: kgcreator\n\n[![PyPI](https://img.shields.io/pypi/v/kgcreator.svg)](https://pypi.org/project/kgcreator/)\n[![Changelog](https://img.shields.io/github/v/release/mark-watson/kgcreator?include_prereleases&label=changelog)](https://github.com/mark-watson/kgcreator/releases)\n[![Tests](https://github.com/mark-watson/kgcreator/workflows/Test/badge.svg)](https://github.com/mark-watson/kgcreator/actions?query=workflow%3ATest)\n[![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](https://github.com/mark-watson/kgcreator/blob/master/LICENSE)\n\n### Knowledge Graph Creator: converts text to RDF triples and Neo4J graph and Memgraph database data\n\nThe Knowledge Graph Creator (kgcreator) is a tool for automating the generation of RDF data for Knowledge Graphs from raw text data read from an input directory path.\n\nThe Knowledge Graph Creator creates an output file containing RDF triples suitable for loading into any linked data/semantic web data store.\n\nThis Python command line utility is one of the example programs in my book\n[Practical Python Artificial Intelligence Programming](https://leanpub.com/pythonai) that is available on [Leanpub.com](https://leanpub.com/pythonai) or available (in the future, this book is under development) to read for free on my web site [https://markwatson.com](https://markwatson.com).\n\n\n## Installation\n\nInstall this tool using `pip`:\n\n    pip install kgcreator\n    pip install spacy\n    python -m spacy download en_core_web_sm\n\n## Usage\n\nFor help, run:\n\n    kgcreator --help\n    kgcreator --inputdir=test_data --outputfile=out.rdf  --outputfileneo4j=out.cypher\n\nYou can also use:\n\n    python -m kgcreator --help\n\n## Development\n\nTo contribute to this tool, first checkout the code. Then create a new virtual environment:\n\n    cd kgcreator\n    python -m venv venv\n    source venv/bin/activate\n\nNow install the dependencies and test dependencies:\n\n    pip install -e '.[test]'\n\nTo run the tests:\n\n    pytest\n\n## Notes on generating Neo4J graph data\n\nStart by identifying:\n\n- Names of entities (node labels).\n- Names of relationships.\n- Names of properties for nodes and relationships.\n- Constraints to be defined.\n- Indexes required.\n- The most important queries?\n",
    "bugtrack_url": null,
    "license": "Apache License, Version 2.0",
    "summary": "Knowledge Graph Creator: converts text to RDF triples",
    "version": "0.21",
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "md5": "b6bbc0f5cf8a1504631fcc2d28033b45",
                "sha256": "c12fd90e77a978b20d910557407043ca8443794d15d18297d1bcc463ed52a3f6"
            },
            "downloads": -1,
            "filename": "kgcreator-0.21.tar.gz",
            "has_sig": false,
            "md5_digest": "b6bbc0f5cf8a1504631fcc2d28033b45",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.7",
            "size": 8070,
            "upload_time": "2022-12-09T22:02:55",
            "upload_time_iso_8601": "2022-12-09T22:02:55.726709Z",
            "url": "https://files.pythonhosted.org/packages/8c/e2/70d885131bf36f74d667b3d6332e69b5fbf892a9d342e69ce7b918c650d3/kgcreator-0.21.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2022-12-09 22:02:55",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "github_user": "mark-watson",
    "github_project": "kgcreator",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "kgcreator"
}
        
Elapsed time: 0.02937s