general-conference-extractor


| Field | Value |
|---|---|
| Name | general-conference-extractor |
| Version | 0.0.1 |
| Home page | https://github.com/c-a-s-t-l-e/general-conference-extractor |
| Summary | This library allows you to extract all the speeches given for the general conferences of the Church of Jesus Christ of Latter-Day Saints from April 1971 to the most recent month and year. |
| Upload time | 2024-04-19 01:58:25 |
| Maintainer | None |
| Docs URL | None |
| Author | c-a-s-t-l-e |
| Requires Python | >=3.7 |
| License | Apache Software License 2.0 |
| Keywords | nbdev, jupyter, notebook, python |
| Requirements | No requirements were recorded. |
| Travis-CI | No Travis. |
| Coveralls test coverage | No coveralls. |

# general-conference-extractor



## Install

``` sh
pip install general_conference_extractor
```

## How to Use

#### Example 1 - Just One Talk URL

Given a single talk URL, you can extract the talk's metadata and text:

``` python
from general_conference_extractor.GeneralConferenceTalk import GeneralConferenceTalk

url = "https://www.churchofjesuschrist.org/study/general-conference/2024/04/15dushku?lang=eng"

talk = GeneralConferenceTalk(url, title=True, author=True, calling=True)

# Print the extracted text
print("**** Metadata ****  \n")
print(talk.metadata)
print("\n")

print("**** Extracted Text **** \n")
print(talk.text[0:300])
```

    **** Metadata ****  

    {'title': 'Pillars and Rays', 'author': 'Alexander Dushku', 'calling': 'Of the Seventy', 'year': 2024, 'month': 4, 'url': 'https://www.churchofjesuschrist.org/study/general-conference/2024/04/15dushku?lang=eng'}


    **** Extracted Text **** 

    Pillars and Rays

    By Elder Alexander Dushku

    Of the Seventy

    My message is for those who worry about their testimony because they haven’t had overwhelming spiritual experiences. I pray that I can provide some peace and assurance.

    The Restoration of the gospel of Jesus Christ began with an explosion
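
The `year` and `month` values in the metadata above also appear in the talk URL's path. As a minimal sketch, assuming the path always has the shape `study/general-conference/<year>/<month>/<slug>` seen in the example URL, they could be recovered with the standard library alone. The helper `parse_talk_url` is hypothetical and is not part of this package; the library may derive these fields differently.

``` python
from urllib.parse import urlparse


def parse_talk_url(url):
    """Return (year, month, slug) from a talk URL (hypothetical helper)."""
    # Split the path into its non-empty segments, ignoring the query string
    parts = [p for p in urlparse(url).path.split("/") if p]
    # Assumed path shape: study/general-conference/<year>/<month>/<slug>
    idx = parts.index("general-conference")
    year, month, slug = parts[idx + 1 : idx + 4]
    return int(year), int(month), slug


url = "https://www.churchofjesuschrist.org/study/general-conference/2024/04/15dushku?lang=eng"
print(parse_talk_url(url))
```

A URL that does not match the assumed shape raises `ValueError`, which makes unexpected inputs visible instead of silently producing wrong metadata.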

#### Example 2 - Get All the Talks for One General Conference

Or, here’s an example of extracting every talk from a specific General
Conference (April 2017 in this instance):

``` python
from general_conference_extractor.extract_URLs import generate_conference_url, extract_talk_urls
from general_conference_extractor.data_output import extract_conference_talks

# Step 1 - Get the URLs for the talks

# get the page URL that shows all the talks for that specific General Conference
gen_conf_page_url = generate_conference_url(2017, '04')

# get all the URLs for the talks that were given for that conference
talk_urls = extract_talk_urls(gen_conf_page_url)

# Step 2 - Save the talks as txt docs in folders and their respective metadata in a separate csv file
output_folder = './conference_talks'
metadata_csv_path = './metadata.csv'

# Uncomment the next line to produce the respective folders and documents
# extract_conference_talks(talk_urls, output_folder, metadata_csv_path)
```
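
To illustrate what Step 1 works with, here is a rough sketch of how a conference page URL could be assembled. It assumes the conference index lives at the same base path as the talk URL in Example 1, with the talk slug dropped; that URL shape is an assumption, and `build_conference_url` is a hypothetical stand-in rather than the package's own `generate_conference_url`.

``` python
# Base path taken from the talk URL shown in Example 1
BASE_URL = "https://www.churchofjesuschrist.org/study/general-conference"


def build_conference_url(year, month, lang="eng"):
    """Build a conference index URL (hypothetical helper, assumed URL shape)."""
    month = int(month)  # accept both 4 and '04'
    if month not in (4, 10):
        # General Conference is held in April and October
        raise ValueError("month must be 4 (April) or 10 (October)")
    return f"{BASE_URL}/{int(year)}/{month:02d}?lang={lang}"


print(build_conference_url(2017, "04"))
```

Normalizing the month to an integer first lets the helper accept either `4` or `'04'`, the string form used in the example above.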

#### Example 3 - Get All the Talks for a Specific Year

``` python
from general_conference_extractor.extract_URLs import extract_multiconference_talk_urls
from general_conference_extractor.data_output import extract_conference_talks

# Step 1 - Get the talk URLs for every conference in 2017 (same start and end year)
multiconference_talk_urls = extract_multiconference_talk_urls(2017, 2017)

# Step 2 - Save the talks as txt docs and their metadata in a csv file
output_folder = './conference_talks'
metadata_csv_path = './metadata.csv'

# Uncomment the next line to produce the respective folders and documents
# extract_conference_talks(multiconference_talk_urls, output_folder, metadata_csv_path)
```
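
A year range expands into two conferences per year, one in April and one in October. The sketch below enumerates them, assuming both the start and end years are included (as the single-year call above suggests). `conference_sessions` is a hypothetical helper, not a function from this package.

``` python
FIRST_YEAR = 1971  # earliest year covered, per the package summary
CONFERENCE_MONTHS = (4, 10)  # April and October


def conference_sessions(start_year, end_year):
    """List (year, month) pairs from start_year through end_year, inclusive."""
    if start_year < FIRST_YEAR or end_year < start_year:
        raise ValueError("expected 1971 <= start_year <= end_year")
    return [
        (year, month)
        for year in range(start_year, end_year + 1)
        for month in CONFERENCE_MONTHS
    ]


print(conference_sessions(2017, 2017))       # two conferences in one year
print(len(conference_sessions(2010, 2019)))  # twenty conferences in a decade
```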

#### Example 4 - Get All the Talks for a Specific Decade

``` python
from general_conference_extractor.extract_URLs import extract_multiconference_talk_urls
from general_conference_extractor.data_output import extract_conference_talks

# Step 1 - Get the talk URLs for every conference from 2010 through 2019
multiconference_talk_urls = extract_multiconference_talk_urls(2010, 2019)

# Step 2 - Save the talks as txt docs and their metadata in a csv file
output_folder = './conference_talks'
metadata_csv_path = './metadata.csv'

# Uncomment the next line to produce the respective folders and documents
# extract_conference_talks(multiconference_talk_urls, output_folder, metadata_csv_path)
```
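
Once a metadata CSV has been written, it can be summarized with the standard library. The sketch below counts talks per author and assumes the CSV columns match the metadata keys shown in Example 1 (`title`, `author`, `calling`, `year`, `month`, `url`); the actual column layout is not documented here, so check it against a real file. The rows are made-up placeholders, not real talks.

``` python
import csv
import io
from collections import Counter


def talks_per_author(csv_file):
    """Count rows per author in an open metadata CSV (assumed 'author' column)."""
    reader = csv.DictReader(csv_file)
    return Counter(row["author"] for row in reader)


# Placeholder data standing in for './metadata.csv'; not real talks
sample = io.StringIO(
    "title,author,calling,year,month,url\n"
    "Talk A,Speaker One,Of the Seventy,2017,4,https://example.org/a\n"
    "Talk B,Speaker Two,Of the Seventy,2017,4,https://example.org/b\n"
    "Talk C,Speaker One,Of the Seventy,2017,10,https://example.org/c\n"
)
counts = talks_per_author(sample)
print(counts.most_common())
```

For a real file, replace `sample` with `open('./metadata.csv', newline='', encoding='utf-8')`.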



            

Raw data

{
    "_id": null,
    "home_page": "https://github.com/c-a-s-t-l-e/general-conference-extractor",
    "name": "general-conference-extractor",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.7",
    "maintainer_email": null,
    "keywords": "nbdev jupyter notebook python",
    "author": "c-a-s-t-l-e",
    "author_email": "castle676767@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/2c/ac/977b46c714547e28ca4c1b5bab65358b4eae71a4d80e195b92c2498e8165/general-conference-extractor-0.0.1.tar.gz",
    "platform": null,
    "description": "# general-conference-extractor\n\n\n<!-- WARNING: THIS FILE WAS AUTOGENERATED! DO NOT EDIT! -->\n\n## Install\n\n``` sh\npip install general_conference_extractor\n```\n\n## How to Use\n\n#### Example 1 - Just One Talk URL\n\nHere\u2019s what you could do with just one talk URL:\n\n``` python\nfrom general_conference_extractor.GeneralConferenceTalk import GeneralConferenceTalk\n\nurl = \"https://www.churchofjesuschrist.org/study/general-conference/2024/04/15dushku?lang=eng\"\n\ntalk = GeneralConferenceTalk(url, title=True, author=True, calling=True)\n\n# Print the extracted text\nprint(\"**** Metadata ****  \\n\")\nprint(talk.metadata)\nprint(\"\\n\")\n\nprint(\"**** Extracted Text **** \\n\")\nprint(talk.text[0:300])\n```\n\n    **** Metadata ****  \n\n    {'title': 'Pillars and Rays', 'author': 'Alexander Dushku', 'calling': 'Of the Seventy', 'year': 2024, 'month': 4, 'url': 'https://www.churchofjesuschrist.org/study/general-conference/2024/04/15dushku?lang=eng'}\n\n\n    **** Extracted Text **** \n\n    Pillars and Rays\n\n    By Elder Alexander Dushku\n\n    Of the Seventy\n\n    My message is for those who worry about their testimony because they haven\u2019t had overwhelming spiritual experiences. 
I pray that I can provide some peace and assurance.\n\n    The Restoration of the gospel of Jesus Christ began with an explosion\n\n#### Example 2 - Get All the Talks for One General Conference\n\nOr, here\u2019s an example of extracting every talk from a specific General\nConference (i.e.\u00a0April 2017 in this instance):\n\n``` python\nfrom general_conference_extractor.extract_URLs import generate_conference_url, extract_talk_urls\nfrom general_conference_extractor.data_output import extract_conference_talks\n\n# Step 1 - Get the URLs for the talks\n\n# get the page URL that shows all the talks for that specific General Conference\ngen_conf_page_url = generate_conference_url(2017, '04')\n\n# get all the URLs for the talks that were given for that conference\ntalk_urls = extract_talk_urls(gen_conf_page_url)\n\n# Step 2 - Save the talks as txt docs in folders and then their respective metadata in a seperate csv file\noutput_folder = './conference_talks'\nmetadata_csv_path = './metadata.csv'\n\n# to produce the respective folders and documents\n# extract_conference_talks(talk_urls, output_folder, metadata_csv_path)\n```\n\n#### Example 3 - Get All the Talks for a Specific Year\n\n``` python\nfrom general_conference_extractor.extract_URLs import extract_multiconference_talk_urls\nfrom general_conference_extractor.data_output import extract_conference_talks\n\n# As an example\nmulticonference_talk_urls = extract_multiconference_talk_urls(2017,2017)\n\n# Step 2 - Save the talks as txt docs and their metadata in a csv file\noutput_folder = './conference_talks'\nmetadata_csv_path = './metadata.csv'\n\n# to produce the respective folders and documents\n# extract_conference_talks(multiconference_talk_urls, output_folder, metadata_csv_path)\n```\n\n#### Example 4 - Get All the Talks for a Specific Decade\n\n``` python\nfrom general_conference_extractor.extract_URLs import extract_multiconference_talk_urls\nfrom general_conference_extractor.data_output import 
extract_conference_talks\n\n# As an example\nmulticonference_talk_urls = extract_multiconference_talk_urls(2010,2019)\n\n# Step 2 - Save the talks as txt docs and their metadata in a csv file\noutput_folder = './conference_talks'\nmetadata_csv_path = './metadata.csv'\n\n# to produce the respective folders and documents\n# extract_conference_talks(multiconference_talk_urls, output_folder, metadata_csv_path)\n```\n\n\n",
    "bugtrack_url": null,
    "license": "Apache Software License 2.0",
    "summary": "This library allows you to extract all the speeches given for the general conferences of the Church of Jesus Christ of Latter-Day Saints from April 1971 to the most recent month and year.",
    "version": "0.0.1",
    "project_urls": {
        "Homepage": "https://github.com/c-a-s-t-l-e/general-conference-extractor"
    },
    "split_keywords": [
        "nbdev",
        "jupyter",
        "notebook",
        "python"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "59c97fd4d0d7030e39c6512ba005b14bbd7e401fd8aa9523cac974cc77ea9363",
                "md5": "5dc3dec2f49ad3d48072708a352372d5",
                "sha256": "68330dcc243657e270035ce6311ee18a5779f3e1e08d31883ad60de32bf83891"
            },
            "downloads": -1,
            "filename": "general_conference_extractor-0.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "5dc3dec2f49ad3d48072708a352372d5",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.7",
            "size": 14151,
            "upload_time": "2024-04-19T01:58:23",
            "upload_time_iso_8601": "2024-04-19T01:58:23.492112Z",
            "url": "https://files.pythonhosted.org/packages/59/c9/7fd4d0d7030e39c6512ba005b14bbd7e401fd8aa9523cac974cc77ea9363/general_conference_extractor-0.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "2cac977b46c714547e28ca4c1b5bab65358b4eae71a4d80e195b92c2498e8165",
                "md5": "369d5f2c908166dbf5b7daba7fd13d8a",
                "sha256": "51f59dd04b44e321357d6b580904f25c14656ea52cedd33e162d92dfb3187456"
            },
            "downloads": -1,
            "filename": "general-conference-extractor-0.0.1.tar.gz",
            "has_sig": false,
            "md5_digest": "369d5f2c908166dbf5b7daba7fd13d8a",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.7",
            "size": 13275,
            "upload_time": "2024-04-19T01:58:25",
            "upload_time_iso_8601": "2024-04-19T01:58:25.795481Z",
            "url": "https://files.pythonhosted.org/packages/2c/ac/977b46c714547e28ca4c1b5bab65358b4eae71a4d80e195b92c2498e8165/general-conference-extractor-0.0.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-04-19 01:58:25",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "c-a-s-t-l-e",
    "github_project": "general-conference-extractor",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "general-conference-extractor"
}
        