# general-conference-extractor
<!-- WARNING: THIS FILE WAS AUTOGENERATED! DO NOT EDIT! -->
## Install
``` sh
pip install general_conference_extractor
```
## How to Use
#### Example 1 - Just One Talk URL
Here’s what you could do with just one talk URL:
``` python
from general_conference_extractor.GeneralConferenceTalk import GeneralConferenceTalk
url = "https://www.churchofjesuschrist.org/study/general-conference/2024/04/15dushku?lang=eng"
talk = GeneralConferenceTalk(url, title=True, author=True, calling=True)
# Print the extracted text
print("**** Metadata **** \n")
print(talk.metadata)
print("\n")
print("**** Extracted Text **** \n")
print(talk.text[0:300])
```
**** Metadata ****
{'title': 'Pillars and Rays', 'author': 'Alexander Dushku', 'calling': 'Of the Seventy', 'year': 2024, 'month': 4, 'url': 'https://www.churchofjesuschrist.org/study/general-conference/2024/04/15dushku?lang=eng'}
**** Extracted Text ****
Pillars and Rays
By Elder Alexander Dushku
Of the Seventy
My message is for those who worry about their testimony because they haven’t had overwhelming spiritual experiences. I pray that I can provide some peace and assurance.
The Restoration of the gospel of Jesus Christ began with an explosion
#### Example 2 - Get All the Talks for One General Conference
Or, here’s an example of extracting every talk from a specific General
Conference (i.e. April 2017 in this instance):
``` python
from general_conference_extractor.extract_URLs import generate_conference_url, extract_talk_urls
from general_conference_extractor.data_output import extract_conference_talks
# Step 1 - Get the URLs for the talks
# get the page URL that shows all the talks for that specific General Conference
gen_conf_page_url = generate_conference_url(2017, '04')
# get all the URLs for the talks that were given for that conference
talk_urls = extract_talk_urls(gen_conf_page_url)
# Step 2 - Save the talks as txt docs in folders and then their respective metadata in a seperate csv file
output_folder = './conference_talks'
metadata_csv_path = './metadata.csv'
# to produce the respective folders and documents
# extract_conference_talks(talk_urls, output_folder, metadata_csv_path)
```
#### Example 3 - Get All the Talks for a Specific Year
``` python
from general_conference_extractor.extract_URLs import extract_multiconference_talk_urls
from general_conference_extractor.data_output import extract_conference_talks
# As an example
multiconference_talk_urls = extract_multiconference_talk_urls(2017,2017)
# Step 2 - Save the talks as txt docs and their metadata in a csv file
output_folder = './conference_talks'
metadata_csv_path = './metadata.csv'
# to produce the respective folders and documents
# extract_conference_talks(multiconference_talk_urls, output_folder, metadata_csv_path)
```
#### Example 4 - Get All the Talks for a Specific Decade
``` python
from general_conference_extractor.extract_URLs import extract_multiconference_talk_urls
from general_conference_extractor.data_output import extract_conference_talks
# As an example
multiconference_talk_urls = extract_multiconference_talk_urls(2010,2019)
# Step 2 - Save the talks as txt docs and their metadata in a csv file
output_folder = './conference_talks'
metadata_csv_path = './metadata.csv'
# to produce the respective folders and documents
# extract_conference_talks(multiconference_talk_urls, output_folder, metadata_csv_path)
```
Raw data
{
"_id": null,
"home_page": "https://github.com/c-a-s-t-l-e/general-conference-extractor",
"name": "general-conference-extractor",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.7",
"maintainer_email": null,
"keywords": "nbdev jupyter notebook python",
"author": "c-a-s-t-l-e",
"author_email": "castle676767@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/2c/ac/977b46c714547e28ca4c1b5bab65358b4eae71a4d80e195b92c2498e8165/general-conference-extractor-0.0.1.tar.gz",
"platform": null,
"description": "# general-conference-extractor\n\n\n<!-- WARNING: THIS FILE WAS AUTOGENERATED! DO NOT EDIT! -->\n\n## Install\n\n``` sh\npip install general_conference_extractor\n```\n\n## How to Use\n\n#### Example 1 - Just One Talk URL\n\nHere\u2019s what you could do with just one talk URL:\n\n``` python\nfrom general_conference_extractor.GeneralConferenceTalk import GeneralConferenceTalk\n\nurl = \"https://www.churchofjesuschrist.org/study/general-conference/2024/04/15dushku?lang=eng\"\n\ntalk = GeneralConferenceTalk(url, title=True, author=True, calling=True)\n\n# Print the extracted text\nprint(\"**** Metadata **** \\n\")\nprint(talk.metadata)\nprint(\"\\n\")\n\nprint(\"**** Extracted Text **** \\n\")\nprint(talk.text[0:300])\n```\n\n **** Metadata **** \n\n {'title': 'Pillars and Rays', 'author': 'Alexander Dushku', 'calling': 'Of the Seventy', 'year': 2024, 'month': 4, 'url': 'https://www.churchofjesuschrist.org/study/general-conference/2024/04/15dushku?lang=eng'}\n\n\n **** Extracted Text **** \n\n Pillars and Rays\n\n By Elder Alexander Dushku\n\n Of the Seventy\n\n My message is for those who worry about their testimony because they haven\u2019t had overwhelming spiritual experiences. I pray that I can provide some peace and assurance.\n\n The Restoration of the gospel of Jesus Christ began with an explosion\n\n#### Example 2 - Get All the Talks for One General Conference\n\nOr, here\u2019s an example of extracting every talk from a specific General\nConference (i.e.\u00a0April 2017 in this instance):\n\n``` python\nfrom general_conference_extractor.extract_URLs import generate_conference_url, extract_talk_urls\nfrom general_conference_extractor.data_output import extract_conference_talks\n\n# Step 1 - Get the URLs for the talks\n\n# get the page URL that shows all the talks for that specific General Conference\ngen_conf_page_url = generate_conference_url(2017, '04')\n\n# get all the URLs for the talks that were given for that conference\ntalk_urls = extract_talk_urls(gen_conf_page_url)\n\n# Step 2 - Save the talks as txt docs in folders and then their respective metadata in a seperate csv file\noutput_folder = './conference_talks'\nmetadata_csv_path = './metadata.csv'\n\n# to produce the respective folders and documents\n# extract_conference_talks(talk_urls, output_folder, metadata_csv_path)\n```\n\n#### Example 3 - Get All the Talks for a Specific Year\n\n``` python\nfrom general_conference_extractor.extract_URLs import extract_multiconference_talk_urls\nfrom general_conference_extractor.data_output import extract_conference_talks\n\n# As an example\nmulticonference_talk_urls = extract_multiconference_talk_urls(2017,2017)\n\n# Step 2 - Save the talks as txt docs and their metadata in a csv file\noutput_folder = './conference_talks'\nmetadata_csv_path = './metadata.csv'\n\n# to produce the respective folders and documents\n# extract_conference_talks(multiconference_talk_urls, output_folder, metadata_csv_path)\n```\n\n#### Example 4 - Get All the Talks for a Specific Decade\n\n``` python\nfrom general_conference_extractor.extract_URLs import extract_multiconference_talk_urls\nfrom general_conference_extractor.data_output import extract_conference_talks\n\n# As an example\nmulticonference_talk_urls = extract_multiconference_talk_urls(2010,2019)\n\n# Step 2 - Save the talks as txt docs and their metadata in a csv file\noutput_folder = './conference_talks'\nmetadata_csv_path = './metadata.csv'\n\n# to produce the respective folders and documents\n# extract_conference_talks(multiconference_talk_urls, output_folder, metadata_csv_path)\n```\n\n\n",
"bugtrack_url": null,
"license": "Apache Software License 2.0",
"summary": "This library allows you to extract all the speeches given for the general conferences of the Church of Jesus Christ of Latter-Day Saints from April 1971 to the most recent month and year.",
"version": "0.0.1",
"project_urls": {
"Homepage": "https://github.com/c-a-s-t-l-e/general-conference-extractor"
},
"split_keywords": [
"nbdev",
"jupyter",
"notebook",
"python"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "59c97fd4d0d7030e39c6512ba005b14bbd7e401fd8aa9523cac974cc77ea9363",
"md5": "5dc3dec2f49ad3d48072708a352372d5",
"sha256": "68330dcc243657e270035ce6311ee18a5779f3e1e08d31883ad60de32bf83891"
},
"downloads": -1,
"filename": "general_conference_extractor-0.0.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "5dc3dec2f49ad3d48072708a352372d5",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.7",
"size": 14151,
"upload_time": "2024-04-19T01:58:23",
"upload_time_iso_8601": "2024-04-19T01:58:23.492112Z",
"url": "https://files.pythonhosted.org/packages/59/c9/7fd4d0d7030e39c6512ba005b14bbd7e401fd8aa9523cac974cc77ea9363/general_conference_extractor-0.0.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "2cac977b46c714547e28ca4c1b5bab65358b4eae71a4d80e195b92c2498e8165",
"md5": "369d5f2c908166dbf5b7daba7fd13d8a",
"sha256": "51f59dd04b44e321357d6b580904f25c14656ea52cedd33e162d92dfb3187456"
},
"downloads": -1,
"filename": "general-conference-extractor-0.0.1.tar.gz",
"has_sig": false,
"md5_digest": "369d5f2c908166dbf5b7daba7fd13d8a",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.7",
"size": 13275,
"upload_time": "2024-04-19T01:58:25",
"upload_time_iso_8601": "2024-04-19T01:58:25.795481Z",
"url": "https://files.pythonhosted.org/packages/2c/ac/977b46c714547e28ca4c1b5bab65358b4eae71a4d80e195b92c2498e8165/general-conference-extractor-0.0.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-04-19 01:58:25",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "c-a-s-t-l-e",
"github_project": "general-conference-extractor",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "general-conference-extractor"
}