[![Documentation Status](https://readthedocs.org/projects/ao3-api/badge/?version=latest)](https://ao3-api.readthedocs.io/en/latest/?badge=latest)
# AO3 API
This is an unofficial API that lets you access some of AO3's (archiveofourown.org) data through Python.
## Installation
Use the package manager [pip](https://pip.pypa.io/en/stable/) to install AO3 API.
```bash
pip install ao3_api
```
# GitHub
https://github.com/wendytg/ao3_api
# Usage
This package is divided into 9 core modules: works, chapters, users, series, search, session, comments, extra, and utils.
## Works
One of the most basic things you might want to do with this package is loading a work and checking its statistics and information. To do that, you'll need the `AO3.Work` class.
We start by finding the _workid_ of the work we want to load. We do that either by using `AO3.utils.workid_from_url(url)` or by just looking at the url ourselves. Let's take a look:
```py3
import AO3
url = "https://archiveofourown.org/works/14392692/chapters/33236241"
workid = AO3.utils.workid_from_url(url)
print(f"Work ID: {workid}")
work = AO3.Work(workid)
print(f"Chapters: {work.nchapters}")
```
After running this snippet, we get the output:
```
Work ID: 14392692
Chapters: 46
```
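To make the idea of "looking at the URL ourselves" concrete, here is a minimal sketch that pulls the numeric ID out of a work URL with a regular expression. The helper name `extract_workid` is hypothetical and is not part of this package; it only assumes that the ID is the run of digits following `/works/`, as in the URL above.

```python
import re
from typing import Optional


def extract_workid(url: str) -> Optional[int]:
    """Hypothetical helper: return the digits after '/works/', or None."""
    match = re.search(r"/works/(\d+)", url)
    return int(match.group(1)) if match else None


url = "https://archiveofourown.org/works/14392692/chapters/33236241"
print(f"Work ID: {extract_workid(url)}")
```

In real code you would normally prefer `AO3.utils.workid_from_url(url)`; the sketch is only meant to show what that lookup amounts to.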
Note that some works are not accessible to guest users. In that case, you will get 0 chapters as an output, and the error `AO3.utils.AuthError: This work is only available to registered users of the Archive` if you try to load it. Nonetheless, we can still do a lot more with this Work object. Let's try to get the first 20 words of the second chapter.
```py3
import AO3
work = AO3.Work(14392692)
print(work.chapters[1].title) # Second chapter name
text = work.chapters[1].text # Second chapter text
print(' '.join(text.split(" ")[:20]))
```
```
What Branches Grow Meaning
December 27, 2018
Christmas sucked this year, and Shouto’s got the black eye to prove it.
Things had started out well enough,
```
The objects in `work.chapters` are of type `AO3.Chapter`. They have a lot of the same properties as a `Work` object would.
Another thing you can do with the work object is download the entire work as a PDF or e-book. At the moment you can download works as AZW3, EPUB, HTML, MOBI, and PDF files.
```py3
import AO3
work = AO3.Work(14392692)
with open(f"{work.title}.pdf", "wb") as file:
    file.write(work.download("PDF"))
```
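Work titles can contain characters that are not valid in file names on some systems (for example `/` or `?`), so using `work.title` directly as a file name may fail. The following is a minimal sketch of one way to clean a title first. The helper `safe_filename` is hypothetical and not part of this package, and the set of replaced characters is an assumption based on common Windows restrictions.

```python
import re


def safe_filename(title: str, extension: str) -> str:
    """Hypothetical helper: replace characters that are unsafe in file names."""
    cleaned = re.sub(r'[\\/:*?"<>|]', "_", title).strip()
    # Fall back to a fixed name if nothing usable is left.
    return f"{cleaned or 'untitled'}.{extension.lower()}"


print(safe_filename("Who/What: A Story?", "PDF"))
```

You could then pass the result to `open(...)` instead of building the name from `work.title` directly.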
__Advanced functionality__
Usually, when you call the constructor for the `Work` class, all info about it is loaded in the `__init__()` function. However, this process takes quite some time (~1-1.5 seconds) and if you want to load a list of works from a series, for example, you might be waiting for upwards of 30 seconds. To avoid this problem, the `Work.reload()` function, called on initialization, is a "threadable" function, which means that if you call it with the argument `threaded=True`, it will return a `Thread` object and work in parallel, meaning you can load multiple works at the same time. Let's take a look at an implementation:
```py3
import AO3
import time
series = AO3.Series(1295090)
works = []
threads = []
start = time.time()
for work in series.work_list:
    works.append(work)
    threads.append(work.reload(threaded=True))
for thread in threads:
    thread.join()
print(f"Loaded {len(works)} works in {round(time.time()-start, 1)} seconds.")
```
`Loaded 29 works in 2.2 seconds.`
The works in `series.work_list` have not been fully loaded yet, so calling `reload(threaded=True)` on each one starts loading it in a separate thread. In the end, we iterate over every thread and wait for each one to finish using `.join()`. Let's compare this method with the standard way of loading AO3 works:
```py3
import AO3
import time
series = AO3.Series(1295090)
works = []
start = time.time()
for work in series.work_list:
    work.reload()
    works.append(work)
print(f"Loaded {len(works)} works in {round(time.time()-start, 1)} seconds.")
```
`Loaded 29 works in 21.6 seconds.`
As we can see, there is a significant performance increase. There are other functions in this package which have this functionality. To see if a function is "threadable", either use `hasattr(function, "_threadable")` or check its `__doc__` string.
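To illustrate what a "threadable" function could look like, here is a minimal sketch of a decorator that adds a `threaded` keyword argument and a `_threadable` marker. This is an assumption about the general technique, not the package's actual implementation; the names `threadable` and `load` are hypothetical.

```python
import threading
from functools import wraps


def threadable(func):
    """Hypothetical decorator: run func in a background thread if threaded=True."""
    @wraps(func)
    def wrapper(*args, threaded=False, **kwargs):
        if threaded:
            thread = threading.Thread(target=func, args=args, kwargs=kwargs)
            thread.start()
            return thread  # the caller is expected to join() it
        return func(*args, **kwargs)

    wrapper._threadable = True  # marker that can be checked with hasattr()
    return wrapper


@threadable
def load(results, key):
    results[key] = key * 2  # stand-in for a slow network request


results = {}
threads = [load(results, n, threaded=True) for n in range(3)]
for thread in threads:
    thread.join()
print(hasattr(load, "_threadable"), sorted(results.items()))
```

Note that in this sketch a threaded call returns the `Thread` rather than the function's result, so any output has to be written somewhere shared, such as the `results` dictionary.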
To save even more time, if you're only interested in metadata, you can load a work with the `load_chapters` option set to False. Also, be aware that some functions (like `Series.work_list` or `Search.results`) might return semi-loaded `Work` objects. This means that no requests have been made to load this work (so you don't have access to chapter text, notes, etc...) but almost all of its metadata will already have been cached, and you might not need to call `Work.reload()` at all.
The last important thing to know about the `Work` class is that most of its properties (like the number of bookmarks, kudos, the authors' names, etc.) are cached properties. That means that once you access one, its value is stored and won't change, even if the underlying value on AO3 does. To update these values, you will need to call `Work.reload()`. See the example below:
```py3
import AO3
sess = AO3.GuestSession()
work = AO3.Work(16721367, sess)
print(work.kudos)
work.leave_kudos()
work.reload()
print(work.kudos)
```
```
392
393
```
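The caching behaviour can be reproduced without any network access. The sketch below uses the standard library's `functools.cached_property` on a hypothetical `FakeWork` class; how this package implements its own caching is not shown here, so treat this only as an illustration of the pattern.

```python
from functools import cached_property


class FakeWork:
    """Hypothetical stand-in for an object with cached properties."""

    def __init__(self):
        self._remote_kudos = 392  # pretend this value lives on the server

    @cached_property
    def kudos(self):
        # Computed on first access, then stored on the instance.
        return self._remote_kudos

    def reload(self):
        # Drop the cached value so the next access recomputes it.
        self.__dict__.pop("kudos", None)


work = FakeWork()
first = work.kudos
work._remote_kudos = 393  # the "server" value changes
stale = work.kudos        # still served from the cache
work.reload()
fresh = work.kudos        # recomputed after the cache is cleared
print(first, stale, fresh)
```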
## Users
Another useful thing you might want to do is get information on who wrote which works / comments. For that, we use the `AO3.User` class.
```py3
import AO3
user = AO3.User("bothersomepotato")
print(user.url)
print(user.bio)
print(user.works) # Number of works published
```
```
https://archiveofourown.org/users/bothersomepotato
University student, opening documents to write essays but writing this stuff instead. No regrets though. My Tumblr, come chat with -or yell at- me if you feel like it! :)
2
```
## Search
To search for works, you can either use the `AO3.search()` function and parse the BeautifulSoup object returned yourself, or use the `AO3.Search` class to automatically do that for you.
```py3
import AO3
search = AO3.Search(any_field="Clarke Lexa", word_count=AO3.utils.Constraint(5000, 15000))
search.update()
print(search.total_results)
for result in search.results:
    print(result)
```
```
3074
<Work [five times lexa falls for clarke]>
<Work [an incomplete list of reasons (why Clarke loves Lexa)]>
<Work [five times clarke and lexa aren’t sure if they're a couple or not]>
<Work [Chemistry]>
<Work [The New Commander (Lexa Joining Camp Jaha)]>
<Work [Ode to Clarke]>
<Work [it's always been (right in front of me)]>
<Work [The Girlfriend Tag]>
<Work [The After-Heda Chronicles]>
<Work [The Counter]>
<Work [May We Meet Again]>
<Work [No Filter]>
<Work [The Games We Play]>
<Work [A l'épreuve des balles]>
<Work [Celebration]>
<Work [Another level of fucked up]>
<Work [(Don't Ever Want to Tame) This Wild Heart]>
<Work [Self Control]>
<Work [Winter]>
<Work [My only wish]>
```
You can then use the workid to load one of the works you searched for. To get more than the first 20 works, change the page number using
```py3
search.page = 2
```
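If you want to walk through every page, it helps to know how many pages there are. The sketch below computes that from the total number of results, assuming 20 results per page (as in the output above). The helper `page_count` is hypothetical and not part of this package.

```python
import math

RESULTS_PER_PAGE = 20  # assumption: matches the 20 results printed above


def page_count(total_results: int, per_page: int = RESULTS_PER_PAGE) -> int:
    """Hypothetical helper: number of pages needed to show all results."""
    return math.ceil(total_results / per_page)


print(page_count(3074))
```

With the 3074 results from the example search, this gives 154 pages.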
## Session
Many actions require an AO3 account. If you already have one, you can access those actions using an `AO3.Session` object. You start by logging in using your username and password, and then you can use that object to access restricted content.
```py3
import AO3
session = AO3.Session("username", "password")
print(f"Bookmarks: {session.bookmarks}")
session.refresh_auth_token()
print(session.kudos(AO3.Work(18001499, load=False)))
```
```
Bookmarks: 67
True
```
We successfully left kudos on a work and checked our bookmarks. Calling `session.refresh_auth_token()` is needed for some activities, such as leaving kudos and comments. If the token has expired or you forget to call this function, the error `AO3.utils.AuthError: Invalid authentication token. Try calling session.refresh_auth_token()` will be raised.
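One way to handle an expired token is to refresh it and retry the action once. The sketch below shows that pattern in a generic form so it can run without an account: `with_token_refresh`, `FakeAuthError`, and the stand-in functions are all hypothetical. In real code you would pass something like `session.refresh_auth_token` as `refresh` and `AO3.utils.AuthError` as `auth_error`.

```python
def with_token_refresh(action, refresh, auth_error):
    """Hypothetical helper: run action, refreshing the token once on failure."""
    try:
        return action()
    except auth_error:
        refresh()
        return action()  # a second failure is allowed to propagate


class FakeAuthError(Exception):
    """Stand-in for an authentication error."""


state = {"token_valid": False}


def leave_kudos():
    if not state["token_valid"]:
        raise FakeAuthError("Invalid authentication token")
    return True


def refresh_token():
    state["token_valid"] = True


print(with_token_refresh(leave_kudos, refresh_token, FakeAuthError))
```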
You can also comment on or leave kudos on a work by calling `Work.comment()` or `Work.leave_kudos()`, provided you have instantiated that object with a session already (`AO3.Work(xxxxxx, session=sess)`) or attached one using `Work.set_session()`. This is probably the best way to do so because you will run into fewer authentication issues (as the work's authenticity token will be used instead).
If you would prefer to leave a comment or kudos anonymously, you can use an `AO3.GuestSession` in the same way you'd use a normal session, except you won't be able to check your bookmarks, subscriptions, etc. because you're not actually logged in.
## Comments
To retrieve and process comment threads, you might want to look at the `Work.get_comments()` method. It returns all the comments in a specific chapter and their respective threads. You can then process them however you want. Let's take a look:
```py3
from time import time
import AO3
work = AO3.Work(24560008)
work.load_chapters()
start = time()
comments = work.get_comments(5)
print(f"Loaded {len(comments)} comment threads in {round(time()-start, 1)} seconds\n")
for comment in comments:
    print(f"Comment ID: {comment.id}\nReplies: {len(comment.get_thread())}")
```
```
Loaded 5 comment threads in 1.8 seconds
Comment ID: 312237184
Replies: 1
Comment ID: 312245032
Replies: 1
Comment ID: 312257098
Replies: 1
Comment ID: 312257860
Replies: 1
Comment ID: 312285673
Replies: 2
```
Loading comments takes a very long time, so you should do it as little as possible. It also causes lots of requests to be sent to the AO3 servers, which might result in getting the error `utils.HTTPError: We are being rate-limited. Try again in a while or reduce the number of requests`. If that happens, you should try to space out your requests or reduce their number. There is also the option to enable request limiting using `AO3.utils.limit_requests()`, which makes it so you can't make more than x requests in a certain time window.
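To show what "no more than x requests in a certain time window" means in practice, here is a minimal sliding-window limiter built only on the standard library. It is a sketch of the general technique, not the code behind `AO3.utils.limit_requests()`; the class name `SlidingWindowLimiter` and its parameters are hypothetical.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Hypothetical limiter: allow at most max_calls per window (in seconds)."""

    def __init__(self, max_calls, window, clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.window = window
        self.clock = clock  # injectable so the limiter can be tested
        self.sleep = sleep
        self.calls = deque()  # timestamps of recent calls, oldest first

    def wait(self):
        """Block until another call is allowed, then record it."""
        now = self.clock()
        # Forget calls that have already left the window.
        while self.calls and now - self.calls[0] >= self.window:
            self.calls.popleft()
        if len(self.calls) >= self.max_calls:
            # Sleep until the oldest recorded call leaves the window.
            self.sleep(self.window - (now - self.calls[0]))
            now = self.clock()
            self.calls.popleft()
        self.calls.append(now)


limiter = SlidingWindowLimiter(max_calls=2, window=0.1)
for request_number in range(4):
    limiter.wait()  # call this before each request
    print(f"request {request_number} allowed")
```

Calling `limiter.wait()` before each request spaces the requests out automatically instead of failing once the limit is reached.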
You can also reply to comments using the `Comment.reply()` function, or delete one (if it's yours) using `Comment.delete()`.
## Extra
`AO3.extra` contains the code to download some extra resources that are not core to the functionality of this package and don't change very often. One example would be the list of fandoms recognized by AO3.
To download a resource, simply use `AO3.extra.download(resource_name)`. To download every resource, you can use `AO3.extra.download_all()`. To see the list of available resources, use `AO3.extra.get_resources()`.
# Contact info
For information or bug reports, please create an issue or start a discussion.
# License
[MIT](https://choosealicense.com/licenses/mit/)
Raw data
{
"_id": null,
"home_page": null,
"name": "ao3-api",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.8",
"maintainer_email": null,
"keywords": "ao3, fanfiction, Archive of Our Own",
"author": "Wendy",
"author_email": null,
"download_url": "https://files.pythonhosted.org/packages/ea/ee/fac4294a45e59adeaa5a10713e9aeaca8462ac1489354b0539f8dcd69966/ao3_api-2.3.1.tar.gz",
"platform": null,
"bugtrack_url": null,
"license": null,
"summary": "An unofficial AO3 (archiveofourown.org) API",
"version": "2.3.1",
"project_urls": {
"Documentation": "https://ao3-api.readthedocs.io/",
"Homepage": "https://github.com/wendytg/ao3_api",
"Issues": "https://github.com/wendytg/ao3_api/issues"
},
"split_keywords": [
"ao3",
" fanfiction",
" archive of our own"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "a605ca929be16698e355f6fb237fcc1dbc560f0b0dc447e13760c1bc4466f368",
"md5": "aaf6e94e38b62662cc9fc820c88bec70",
"sha256": "bab5a621cdeee387bf7a245fdc263f700d382eb1db2a3ec6c290ec1e61225089"
},
"downloads": -1,
"filename": "ao3_api-2.3.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "aaf6e94e38b62662cc9fc820c88bec70",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.8",
"size": 41621,
"upload_time": "2025-01-20T23:57:48",
"upload_time_iso_8601": "2025-01-20T23:57:48.474548Z",
"url": "https://files.pythonhosted.org/packages/a6/05/ca929be16698e355f6fb237fcc1dbc560f0b0dc447e13760c1bc4466f368/ao3_api-2.3.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "eaeefac4294a45e59adeaa5a10713e9aeaca8462ac1489354b0539f8dcd69966",
"md5": "11c1ab9a388304aca1836d6ba294f50b",
"sha256": "0fa80a905fdd698202369daf09b43c50459c1f473d46bdaf71da5ae46e5dcd76"
},
"downloads": -1,
"filename": "ao3_api-2.3.1.tar.gz",
"has_sig": false,
"md5_digest": "11c1ab9a388304aca1836d6ba294f50b",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.8",
"size": 35009,
"upload_time": "2025-01-20T23:57:50",
"upload_time_iso_8601": "2025-01-20T23:57:50.155728Z",
"url": "https://files.pythonhosted.org/packages/ea/ee/fac4294a45e59adeaa5a10713e9aeaca8462ac1489354b0539f8dcd69966/ao3_api-2.3.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-01-20 23:57:50",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "wendytg",
"github_project": "ao3_api",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"requirements": [
{
"name": "BeautifulSoup4",
"specs": []
},
{
"name": "lxml",
"specs": []
},
{
"name": "requests",
"specs": []
}
],
"lcname": "ao3-api"
}