malexport


Namemalexport JSON
Version 0.1.5 PyPI version JSON
download
home_pagehttps://github.com/seanbreckenridge/malexport
Summarybacks up info from your MAL account
upload_time2024-07-03 05:48:15
maintainerNone
docs_urlNone
authorSean Breckenridge
requires_python>=3.8
licenseMIT
keywords anime data webscraping
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # malexport

This uses multiple methods to save your personal data from a MAL (MyAnimeList) account, focused on [anime|manga]lists/episode history and forum posts your account has made.

I wanted to use the API whenever possible here, but the information returned by the API is so scarce, or endpoints don't really exist at all, so you can't really get a lot of info out of it. As far as I could figure out, it doesn't have a history endpoint, or any way to retrieve how many times you've rewatched a show, so this uses:

- `malexport update lists` - The `load.json` endpoint (unauthenticated) to backup my `anime`/`manga` list (by most recently updated, as that's useful in many contexts)
- `malexport update api-lists` - The MAL API endpoint (authenticated) to backup my `anime`/`manga` list. This includes a lot of metadata for each entry, and also works for private lists (assuming you go through the OAuth flow with the private account logged in your browser)
- Selenium (so requires your MAL Username/Password; stored locally) to:
  - `malexport update history` - Individually grab episode/chapter history data (i.e., [this](https://i.imgur.com/2h5ZFng.png)). Note: the datetimes on these depend on what timezone you have set in your MAL settings
  - `malexport update export` - Download the MAL export (the giant XML files), since those have rewatch information, and better dates
  - `malexport update messages` - Downloads/Updates your received and sent messages (DMs)
- `malexport update forum` - Uses the MAL API ([docs](https://myanimelist.net/apiconfig/references/api/v2)) to grab forum posts
- `malexport update friends` - Uses [Jikan](https://jikan.moe/) to update your friends

The defaults here are far more on the safe side when scraping. If data fails to download you may have been flagged as a bot and may need to open MAL in your browser to solve a captcha.

For most people, this'll take a few hours to populate the initial cache, and then, and then a few minutes every few days (would recommend doing it at least once every 3 weeks, since it uses recent history to update history entries) to update it.

## Installation

Requires `python3.8+`

To install with pip, run:

```
pip install malexport
```

For your MyAnimeList [API Info](https://myanimelist.net/apiconfig), you can use 'other' as the 'App Type', 'hobbyist' as 'Purpose of Use', and `http://localhost` as the redirect URI. This only requires a Client ID, not both a Client ID and a Secret

Since this uses selenium, that requires a `chromedriver` binary somewhere on your system. That's typically available in package repositories, else see [here](https://gist.github.com/seanbreckenridge/709a824b8c56ea22dbf4e86a7804287d). If this isn't able to find the file, set the `MALEXPORT_CHROMEDRIVER_LOCATION` environment variable, like: `MALEXPORT_CHROMEDRIVER_LOCATION=C:\\Downloads\\chromedriver.exe malexport ...`

I left some shell functions I commonly use to query my data in `malexport.sh`, to use that set the `MAL_USERNAME` variable to your account name, and then `source malexport.sh` in your shell startup. Should work on both `bash`/`zsh`

## Usage

### update

`malexport update all` can be run to run all the updaters or `malexport update [forum|history|lists|export|friends|messages]` can be run to update one of them. Each of those require you to pass a `-u malUsername`. This stores everything (except for the MAL API Client ID) on an account-by-account basis, so its possible to backup multiple accounts

If you want to hide the chromedriver, you can run this like `MALEXPORT_CHROMEDRIVER_HIDDEN=1 malexport update ...`

For the `update lists` command, this uses the unauthenticated `load.json` endpoint, which is what is used on modern lists as MAL. Therefore, its contents might be slightly different depending on your settings. To get the most info out of it, I'd recommend going to your [list preferences](https://myanimelist.net/editprofile.php?go=listpreferences) and enabling all of the columns so that metadata is returned. Also, this assumes the [European date format](https://myanimelist.net/editprofile.php?go=listpreferences) for lists.

Credentials are asked for the first time they're needed, and then stored in `~/.config/malexport` (overwrite with `MALEXPORT_CFG`). Data by default is stored in `~/.local/share/malexport` (overwrite with `MALEXPORT_DIR`). Lots of other things here are configurable with environment variables:

```
malexport/common.py:18:REQUEST_WAIT_TIME: int = int(os.environ.get("MALEXPORT_REQUEST_WAIT_TIME", 10))
malexport/exporter/messages.py:27:TILL_SAME_LIMIT = int(os.environ.get("MALEXPORT_THREAD_LIMIT", 10))
malexport/exporter/driver.py:26:HIDDEN_CHROMEDRIVER = bool(int(os.environ.get("MALEXPORT_CHROMEDRIVER_HIDDEN", 0)))
malexport/exporter/driver.py:27:CHROME_LOCATION: Optional[str] = os.environ.get("MALEXPORT_CHROMEDRIVER_LOCATION")
malexport/exporter/driver.py:30:TEMP_DOWNLOAD_BASE = os.environ.get("MALEXPORT_TEMPDIR", tempfile.gettempdir())
malexport/exporter/history.py:47:TILL_SAME_LIMIT = int(os.environ.get("MALEXPORT_EPISODE_LIMIT", 5))
malexport/exporter/export_downloader.py:21:TRY_EXPORT_TIMES = int(os.environ.get("MALEXPORT_EXPORT_TRIES", 3))
malexport/exporter/mal_session.py:31:MALEXPORT_REDIRECT_URI = os.environ.get("MALEXPORT_REDIRECT_URI", "http://localhost")
malexport/paths.py:24:mal_id_cache_dir = os.environ.get("MAL_ID_CACHE_DIR", os.path.join(cache_dir, "mal-id-cache"))
malexport/paths.py:29:    os.environ.get("MALEXPORT_ZIP_BACKUPS", os.path.join(local_directory, "malexport_zips"))
malexport/parse/common.py:30:CUTOFF_DATE = int(os.environ.get("MALEXPORT_CUTOFF_DATE", date.today().year + 5))
```

To show debug logs set `export MALEXPORT_LOGS=10` (uses [logging levels](https://docs.python.org/3/library/logging.html#logging-levels)).

If you use 2FA you can set the `MALEXPORT_2FA` variable, like `MALEXPORT_2FA=1 malexport update ...` when running this, that adds a prompt to wait for you to login before continuing

### parse

I generally don't interface with the CLI interface here and instead use the `my.mal.export` in [HPI](https://github.com/seanbreckenridge/HPI). That handles configuring accounts/locating my data on disk

The `parse` subcommand includes corresponding commands which take the saved data, clean it up a bit into easier to manipulate representations. Those each have python functions which can be imported from `malexport.parse`, or called from the CLI to produce JSON.

The most useful is probably `combine`, which combines the `xml`, `api-lists`, `history` and `lists` data.

Otherwise, this acts on the data files (Reminder that data by default is stored in `~/.local/share/malexport`):

`$ malexport parse xml ./animelist.xml | jq '.entries[106]'`

```json
{
  "anime_id": 31646,
  "title": "3-gatsu no Lion",
  "media_type": "TV",
  "episodes": 22,
  "my_id": 0,
  "watched_episodes": 22,
  "start_date": "2020-07-01",
  "finish_date": "2020-08-09",
  "rated": null,
  "score": 9,
  "storage": null,
  "storage_value": 0,
  "status": "Completed",
  "comments": "",
  "times_watched": 0,
  "rewatch_value": null,
  "priority": "LOW",
  "tags": "",
  "rewatching": false,
  "rewatching_ep": 0,
  "discuss": true,
  "sns": "default",
  "update_on_import": false
}
```

`parse list` converts some of the status int enumerations (status/airing status) into the corresponding string values, and parses date strings like '04-09-20' to '09-04-2020':

`malexport parse list ./animelist.json | jq '.[0]'`:

```json
{
  "status": "On Hold",
  "score": 6,
  "tags": "Slice of Life",
  "rewatching": false,
  "watched_episodes": 8,
  "title": "Shiroi Suna no Aquatope",
  "episodes": 24,
  "airing_status": "Currently Airing",
  "id": 46093,
  "studios": [
    {
      "id": 132,
      "name": "P.A. Works"
    }
  ],
  "licensors": [],
  "season": {
    "year": 2021,
    "season": "Summer"
  },
  "has_episode_video": true,
  "has_promotion_video": true,
  "has_video": true,
  "video_url": "/anime/46093/Shiroi_Suna_no_Aquatope/video",
  "url": "/anime/46093/Shiroi_Suna_no_Aquatope",
  "image_path": "https://cdn.myanimelist.net/r/96x136/images/anime/1932/114952.jpg?s=12d30d08dd16eb006e02f73d9dc14a8f",
  "is_added_to_list": false,
  "media_type": "TV",
  "rating": "PG-13",
  "start_date": "2021-07-10",
  "finish_date": null,
  "air_start_date": "2021-07-09",
  "air_end_date": null,
  "days": 53,
  "storage": "",
  "priority": "Low"
}
```

If you want exact dates, I'd recommend using the `xml` export, as there's some estimation that has to done for the `list` export since the dates aren't absolute (e.g. `04-09-20` could be `2020` or `1920`)

`malexport parse forum -u malUsername` extracts posts made by your user to JSON

`malexport parse history -u malUsername | jq '.[] | select(.title == "Akira")'`

```json
{
  "mal_id": 47,
  "list_type": "anime",
  "title": "Akira",
  "entries": [
    {
      "at": "2016-02-02 21:47:00",
      "number": 1
    }
  ]
}
```

'number' in this case refers to the chapter or episode number

---

As some random examples, using this from the python, or the CLI:

_Which season do I have the most completed from?_

```python
>>> Counter([a.season for a in malexport.parse.parse_list("animelist.json", malexport.parse.ListType.ANIME) if a.score is not None and a.status == "Completed" if a.season is not None]).most_common(1)
[(Season(year=2016, season='Spring'), 73)]
```

Or, you can use [`jq`](https://github.com/stedolan/jq) to mangle it into whatever you want. Heres a mess of pipes to create a graph of your `Completed` ratings, using [`termgraph`](https://github.com/mkaz/termgraph):

```
$ malexport parse list ./animelist.json | jq '.[] | select(.status == "Completed") | .score' | grep -vx 0 | sort | uniq -c | awk '{ print $2 " " $1}' | termgraph | sort -n
1 : ▇▇▇▇▇▇▇▇▇ 158.00
2 : ▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇ 652.00
3 : ▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇ 847.00
4 : ▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇ 791.00
5 : ▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇ 562.00
6 : ▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇ 384.00
7 : ▇▇▇▇▇▇▇▇▇▇▇▇▇▇▇ 263.00
8 : ▇▇▇▇▇▇ 103.00
9 : ▇▇ 47.00
10: ▏ 5.00
```

### recover_deleted

This includes a command to recover deleted MAL entries (in other words, a MAL moderator completely deleted the entry from the site, which automatically removes it from your list) which you previously had on your list, by recovering deleted items from zipfiles.

This requires [`hpi`](https://pypi.org/project/HPI/) to automatically unzip files, install with `python3 -m pip install malexport[recover]` or directly with `pip install hpi`

`malexport recover-deleted backup`, saves to `~/.local/share/malexport_zips` (can overwrite default location with `MALEXPORT_ZIP_BACKUPS`)

To figure out which entries are deleted, this uses [mal-id-cache](https://github.com/seanbreckenridge/mal-id-cache). To update the local cache of IDs, run:

```bash
$ malexport recover-deleted approved-update
Updated mal-id-cache to commit 9c0cbdeac567671c0970c79ee99531edc2d89b0b
$ malexport recover-deleted approved-ids-stats
Approved Anime: 23930
Approved Manga: 62212
```

Then, you can run `malexport recover-deleted recover` to find deleted entries:

```
malexport recover-deleted recover
```

For example, mine look like:

```bash
python3 -m malexport recover-deleted recover -F \
    | jq 'values[].anime | .[].XMLData | "\(.anime_id) \(.title) \(.score)/10 \(.start_date)"' -r
```

```
6852 Ahiru no Otegara 3/10 2016-09-08
42142 X-Men Openings 4/10 2020-06-04
37879 Benghuai 3: Reburn 4/10 2018-09-14
38411 Eiga Daisuki Pompo-san 2/10 2018-09-27
29293 Isu 6/10 2016-09-10
38426 Koutetsujou no Kabaneri: Ran - Hajimaru Michiato 4/10 2019-02-13
10584 Mononoke Dance 4/10 2017-01-15
29949 Nami 1/10 2016-08-12
13675 Taisei Kensetsu: Bosporus Kaikyou Tunnel 4/10 2016-09-02
38065 Taisei Kensetsu: Singapore 4/10 2018-07-31
21441 Taisei Kensetsu: Sri Lanka Kousokudouro 3/10 2016-10-24
25883 Taisei Kensetsu: Vietnam Noi Bai Kuukou 4/10 2016-09-02
33234 Kaibutsu-kun: Suna Majin wo Yattsukero no Maki / Kaibutsu-kun to Haniwa Kaishin no Maki 4/10 2018-09-27
30245 Kamaishi no "Kiseki": Inochi wo Mamoru Tokubetsu Jugyou 2/10 2016-12-21
23399 Minami no Shima no Dera-chan 3/10 2016-04-26
```

None of those IDs exist anymore on MAL, so these backups are the only way to get metadata or my history/data for them

I backup my list once every 3 days, and have a corresponding [`bleanser`](https://github.com/seanbreckenridge/bleanser/blob/master/src/bleanser_sean/modules/mal_zips.py) (backup cleanser) file to remove redundant backups (ones that don't introduce 'new deleted' entries).

### manual-history

The `manual-history` command lets you locally save episode/chapter (history) data for anime or manga.

Since its not trivial to mark a single episode as watched on MAL; you have to:

- go to the anime page
- update the entry to the episode you watched
- then, reset the episode count to completed
- then, go to advanced edit, click the 'History' button and remove the latest entry created by you resetting the episode count

Sometimes I just rewatch particular episodes and not an entire show, this lets me do that and save that to a file.

That requires `autotui` and `pyfzf-iter`, which can be installed with `pip install malexport[manual]`

There is a `parse` command to parse the manual history file, and its combined into the `history` data when using `combine`

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/seanbreckenridge/malexport",
    "name": "malexport",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "anime, data, webscraping",
    "author": "Sean Breckenridge",
    "author_email": "\"seanbrecke@gmail.com\"",
    "download_url": "https://files.pythonhosted.org/packages/db/62/be5c99806fe37594d2483832dc278088db17dafaf99d48e44e1911c4593a/malexport-0.1.5.tar.gz",
    "platform": null,
    "description": "# malexport\n\nThis uses multiple methods to save your personal data from a MAL (MyAnimeList) account, focused on [anime|manga]lists/episode history and forum posts your account has made.\n\nI wanted to use the API whenever possible here, but the information returned by the API is so scarce, or endpoints don't really exist at all, so you can't really get a lot of info out of it. As far as I could figure out, it doesn't have a history endpoint, or any way to retrieve how many times you've rewatched a show, so this uses:\n\n- `malexport update lists` - The `load.json` endpoint (unauthenticated) to backup my `anime`/`manga` list (by most recently updated, as that's useful in many contexts)\n- `malexport update api-lists` - The MAL API endpoint (authenticated) to backup my `anime`/`manga` list. This includes a lot of metadata for each entry, and also works for private lists (assuming you go through the OAuth flow with the private account logged in your browser)\n- Selenium (so requires your MAL Username/Password; stored locally) to:\n  - `malexport update history` - Individually grab episode/chapter history data (i.e., [this](https://i.imgur.com/2h5ZFng.png)). Note: the datetimes on these depend on what timezone you have set in your MAL settings\n  - `malexport update export` - Download the MAL export (the giant XML files), since those have rewatch information, and better dates\n  - `malexport update messages` - Downloads/Updates your received and sent messages (DMs)\n- `malexport update forum` - Uses the MAL API ([docs](https://myanimelist.net/apiconfig/references/api/v2)) to grab forum posts\n- `malexport update friends` - Uses [Jikan](https://jikan.moe/) to update your friends\n\nThe defaults here are far more on the safe side when scraping. If data fails to download you may have been flagged as a bot and may need to open MAL in your browser to solve a captcha.\n\nFor most people, this'll take a few hours to populate the initial cache, and then, and then a few minutes every few days (would recommend doing it at least once every 3 weeks, since it uses recent history to update history entries) to update it.\n\n## Installation\n\nRequires `python3.8+`\n\nTo install with pip, run:\n\n```\npip install malexport\n```\n\nFor your MyAnimeList [API Info](https://myanimelist.net/apiconfig), you can use 'other' as the 'App Type', 'hobbyist' as 'Purpose of Use', and `http://localhost` as the redirect URI. This only requires a Client ID, not both a Client ID and a Secret\n\nSince this uses selenium, that requires a `chromedriver` binary somewhere on your system. That's typically available in package repositories, else see [here](https://gist.github.com/seanbreckenridge/709a824b8c56ea22dbf4e86a7804287d). If this isn't able to find the file, set the `MALEXPORT_CHROMEDRIVER_LOCATION` environment variable, like: `MALEXPORT_CHROMEDRIVER_LOCATION=C:\\\\Downloads\\\\chromedriver.exe malexport ...`\n\nI left some shell functions I commonly use to query my data in `malexport.sh`, to use that set the `MAL_USERNAME` variable to your account name, and then `source malexport.sh` in your shell startup. Should work on both `bash`/`zsh`\n\n## Usage\n\n### update\n\n`malexport update all` can be run to run all the updaters or `malexport update [forum|history|lists|export|friends|messages]` can be run to update one of them. Each of those require you to pass a `-u malUsername`. This stores everything (except for the MAL API Client ID) on an account-by-account basis, so its possible to backup multiple accounts\n\nIf you want to hide the chromedriver, you can run this like `MALEXPORT_CHROMEDRIVER_HIDDEN=1 malexport update ...`\n\nFor the `update lists` command, this uses the unauthenticated `load.json` endpoint, which is what is used on modern lists as MAL. Therefore, its contents might be slightly different depending on your settings. To get the most info out of it, I'd recommend going to your [list preferences](https://myanimelist.net/editprofile.php?go=listpreferences) and enabling all of the columns so that metadata is returned. Also, this assumes the [European date format](https://myanimelist.net/editprofile.php?go=listpreferences) for lists.\n\nCredentials are asked for the first time they're needed, and then stored in `~/.config/malexport` (overwrite with `MALEXPORT_CFG`). Data by default is stored in `~/.local/share/malexport` (overwrite with `MALEXPORT_DIR`). Lots of other things here are configurable with environment variables:\n\n```\nmalexport/common.py:18:REQUEST_WAIT_TIME: int = int(os.environ.get(\"MALEXPORT_REQUEST_WAIT_TIME\", 10))\nmalexport/exporter/messages.py:27:TILL_SAME_LIMIT = int(os.environ.get(\"MALEXPORT_THREAD_LIMIT\", 10))\nmalexport/exporter/driver.py:26:HIDDEN_CHROMEDRIVER = bool(int(os.environ.get(\"MALEXPORT_CHROMEDRIVER_HIDDEN\", 0)))\nmalexport/exporter/driver.py:27:CHROME_LOCATION: Optional[str] = os.environ.get(\"MALEXPORT_CHROMEDRIVER_LOCATION\")\nmalexport/exporter/driver.py:30:TEMP_DOWNLOAD_BASE = os.environ.get(\"MALEXPORT_TEMPDIR\", tempfile.gettempdir())\nmalexport/exporter/history.py:47:TILL_SAME_LIMIT = int(os.environ.get(\"MALEXPORT_EPISODE_LIMIT\", 5))\nmalexport/exporter/export_downloader.py:21:TRY_EXPORT_TIMES = int(os.environ.get(\"MALEXPORT_EXPORT_TRIES\", 3))\nmalexport/exporter/mal_session.py:31:MALEXPORT_REDIRECT_URI = os.environ.get(\"MALEXPORT_REDIRECT_URI\", \"http://localhost\")\nmalexport/paths.py:24:mal_id_cache_dir = os.environ.get(\"MAL_ID_CACHE_DIR\", os.path.join(cache_dir, \"mal-id-cache\"))\nmalexport/paths.py:29:    os.environ.get(\"MALEXPORT_ZIP_BACKUPS\", os.path.join(local_directory, \"malexport_zips\"))\nmalexport/parse/common.py:30:CUTOFF_DATE = int(os.environ.get(\"MALEXPORT_CUTOFF_DATE\", date.today().year + 5))\n```\n\nTo show debug logs set `export MALEXPORT_LOGS=10` (uses [logging levels](https://docs.python.org/3/library/logging.html#logging-levels)).\n\nIf you use 2FA you can set the `MALEXPORT_2FA` variable, like `MALEXPORT_2FA=1 malexport update ...` when running this, that adds a prompt to wait for you to login before continuing\n\n### parse\n\nI generally don't interface with the CLI interface here and instead use the `my.mal.export` in [HPI](https://github.com/seanbreckenridge/HPI). That handles configuring accounts/locating my data on disk\n\nThe `parse` subcommand includes corresponding commands which take the saved data, clean it up a bit into easier to manipulate representations. Those each have python functions which can be imported from `malexport.parse`, or called from the CLI to produce JSON.\n\nThe most useful is probably `combine`, which combines the `xml`, `api-lists`, `history` and `lists` data.\n\nOtherwise, this acts on the data files (Reminder that data by default is stored in `~/.local/share/malexport`):\n\n`$ malexport parse xml ./animelist.xml | jq '.entries[106]'`\n\n```json\n{\n  \"anime_id\": 31646,\n  \"title\": \"3-gatsu no Lion\",\n  \"media_type\": \"TV\",\n  \"episodes\": 22,\n  \"my_id\": 0,\n  \"watched_episodes\": 22,\n  \"start_date\": \"2020-07-01\",\n  \"finish_date\": \"2020-08-09\",\n  \"rated\": null,\n  \"score\": 9,\n  \"storage\": null,\n  \"storage_value\": 0,\n  \"status\": \"Completed\",\n  \"comments\": \"\",\n  \"times_watched\": 0,\n  \"rewatch_value\": null,\n  \"priority\": \"LOW\",\n  \"tags\": \"\",\n  \"rewatching\": false,\n  \"rewatching_ep\": 0,\n  \"discuss\": true,\n  \"sns\": \"default\",\n  \"update_on_import\": false\n}\n```\n\n`parse list` converts some of the status int enumerations (status/airing status) into the corresponding string values, and parses date strings like '04-09-20' to '09-04-2020':\n\n`malexport parse list ./animelist.json | jq '.[0]'`:\n\n```json\n{\n  \"status\": \"On Hold\",\n  \"score\": 6,\n  \"tags\": \"Slice of Life\",\n  \"rewatching\": false,\n  \"watched_episodes\": 8,\n  \"title\": \"Shiroi Suna no Aquatope\",\n  \"episodes\": 24,\n  \"airing_status\": \"Currently Airing\",\n  \"id\": 46093,\n  \"studios\": [\n    {\n      \"id\": 132,\n      \"name\": \"P.A. Works\"\n    }\n  ],\n  \"licensors\": [],\n  \"season\": {\n    \"year\": 2021,\n    \"season\": \"Summer\"\n  },\n  \"has_episode_video\": true,\n  \"has_promotion_video\": true,\n  \"has_video\": true,\n  \"video_url\": \"/anime/46093/Shiroi_Suna_no_Aquatope/video\",\n  \"url\": \"/anime/46093/Shiroi_Suna_no_Aquatope\",\n  \"image_path\": \"https://cdn.myanimelist.net/r/96x136/images/anime/1932/114952.jpg?s=12d30d08dd16eb006e02f73d9dc14a8f\",\n  \"is_added_to_list\": false,\n  \"media_type\": \"TV\",\n  \"rating\": \"PG-13\",\n  \"start_date\": \"2021-07-10\",\n  \"finish_date\": null,\n  \"air_start_date\": \"2021-07-09\",\n  \"air_end_date\": null,\n  \"days\": 53,\n  \"storage\": \"\",\n  \"priority\": \"Low\"\n}\n```\n\nIf you want exact dates, I'd recommend using the `xml` export, as there's some estimation that has to done for the `list` export since the dates aren't absolute (e.g. `04-09-20` could be `2020` or `1920`)\n\n`malexport parse forum -u malUsername` extracts posts made by your user to JSON\n\n`malexport parse history -u malUsername | jq '.[] | select(.title == \"Akira\")'`\n\n```json\n{\n  \"mal_id\": 47,\n  \"list_type\": \"anime\",\n  \"title\": \"Akira\",\n  \"entries\": [\n    {\n      \"at\": \"2016-02-02 21:47:00\",\n      \"number\": 1\n    }\n  ]\n}\n```\n\n'number' in this case refers to the chapter or episode number\n\n---\n\nAs some random examples, using this from the python, or the CLI:\n\n_Which season do I have the most completed from?_\n\n```python\n>>> Counter([a.season for a in malexport.parse.parse_list(\"animelist.json\", malexport.parse.ListType.ANIME) if a.score is not None and a.status == \"Completed\" if a.season is not None]).most_common(1)\n[(Season(year=2016, season='Spring'), 73)]\n```\n\nOr, you can use [`jq`](https://github.com/stedolan/jq) to mangle it into whatever you want. Heres a mess of pipes to create a graph of your `Completed` ratings, using [`termgraph`](https://github.com/mkaz/termgraph):\n\n```\n$ malexport parse list ./animelist.json | jq '.[] | select(.status == \"Completed\") | .score' | grep -vx 0 | sort | uniq -c | awk '{ print $2 \" \" $1}' | termgraph | sort -n\n1 : \u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587 158.00\n2 : \u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587 652.00\n3 : \u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587 847.00\n4 : \u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587 791.00\n5 : \u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587 562.00\n6 : \u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587 384.00\n7 : \u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587\u2587 263.00\n8 : \u2587\u2587\u2587\u2587\u2587\u2587 103.00\n9 : \u2587\u2587 47.00\n10: \u258f 5.00\n```\n\n### recover_deleted\n\nThis includes a command to recover deleted MAL entries (in other words, a MAL moderator completely deleted the entry from the site, which automatically removes it from your list) which you previously had on your list, by recovering deleted items from zipfiles.\n\nThis requires [`hpi`](https://pypi.org/project/HPI/) to automatically unzip files, install with `python3 -m pip install malexport[recover]` or directly with `pip install hpi`\n\n`malexport recover-deleted backup`, saves to `~/.local/share/malexport_zips` (can overwrite default location with `MALEXPORT_ZIP_BACKUPS`)\n\nTo figure out which entries are deleted, this uses [mal-id-cache](https://github.com/seanbreckenridge/mal-id-cache). To update the local cache of IDs, run:\n\n```bash\n$ malexport recover-deleted approved-update\nUpdated mal-id-cache to commit 9c0cbdeac567671c0970c79ee99531edc2d89b0b\n$ malexport recover-deleted approved-ids-stats\nApproved Anime: 23930\nApproved Manga: 62212\n```\n\nThen, you can run `malexport recover-deleted recover` to find deleted entries:\n\n```\nmalexport recover-deleted recover\n```\n\nFor example, mine look like:\n\n```bash\npython3 -m malexport recover-deleted recover -F \\\n    | jq 'values[].anime | .[].XMLData | \"\\(.anime_id) \\(.title) \\(.score)/10 \\(.start_date)\"' -r\n```\n\n```\n6852 Ahiru no Otegara 3/10 2016-09-08\n42142 X-Men Openings 4/10 2020-06-04\n37879 Benghuai 3: Reburn 4/10 2018-09-14\n38411 Eiga Daisuki Pompo-san 2/10 2018-09-27\n29293 Isu 6/10 2016-09-10\n38426 Koutetsujou no Kabaneri: Ran - Hajimaru Michiato 4/10 2019-02-13\n10584 Mononoke Dance 4/10 2017-01-15\n29949 Nami 1/10 2016-08-12\n13675 Taisei Kensetsu: Bosporus Kaikyou Tunnel 4/10 2016-09-02\n38065 Taisei Kensetsu: Singapore 4/10 2018-07-31\n21441 Taisei Kensetsu: Sri Lanka Kousokudouro 3/10 2016-10-24\n25883 Taisei Kensetsu: Vietnam Noi Bai Kuukou 4/10 2016-09-02\n33234 Kaibutsu-kun: Suna Majin wo Yattsukero no Maki / Kaibutsu-kun to Haniwa Kaishin no Maki 4/10 2018-09-27\n30245 Kamaishi no \"Kiseki\": Inochi wo Mamoru Tokubetsu Jugyou 2/10 2016-12-21\n23399 Minami no Shima no Dera-chan 3/10 2016-04-26\n```\n\nNone of those IDs exist anymore on MAL, so these backups are the only way to get metadata or my history/data for them\n\nI backup my list once every 3 days, and have a corresponding [`bleanser`](https://github.com/seanbreckenridge/bleanser/blob/master/src/bleanser_sean/modules/mal_zips.py) (backup cleanser) file to remove redundant backups (ones that don't introduce 'new deleted' entries).\n\n### manual-history\n\nThe `manual-history` command lets you locally save episode/chapter (history) data for anime or manga.\n\nSince its not trivial to mark a single episode as watched on MAL; you have to:\n\n- go to the anime page\n- update the entry to the episode you watched\n- then, reset the episode count to completed\n- then, go to advanced edit, click the 'History' button and remove the latest entry created by you resetting the episode count\n\nSometimes I just rewatch particular episodes and not an entire show, this lets me do that and save that to a file.\n\nThat requires `autotui` and `pyfzf-iter`, which can be installed with `pip install malexport[manual]`\n\nThere is a `parse` command to parse the manual history file, and its combined into the `history` data when using `combine`\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "backs up info from your MAL account",
    "version": "0.1.5",
    "project_urls": {
        "Homepage": "https://github.com/seanbreckenridge/malexport"
    },
    "split_keywords": [
        "anime",
        " data",
        " webscraping"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "37f2761773a5dc079436fe91518d70da35ae61fcb57c6d3d6d1f0eb395179cad",
                "md5": "8c3639b854327e6c3802fcba9d72e28f",
                "sha256": "286502608a96ad5b1a56d4a83d91f752c15cca4febde25bcc9f866f6839fe6ad"
            },
            "downloads": -1,
            "filename": "malexport-0.1.5-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "8c3639b854327e6c3802fcba9d72e28f",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 55385,
            "upload_time": "2024-07-03T05:48:13",
            "upload_time_iso_8601": "2024-07-03T05:48:13.763440Z",
            "url": "https://files.pythonhosted.org/packages/37/f2/761773a5dc079436fe91518d70da35ae61fcb57c6d3d6d1f0eb395179cad/malexport-0.1.5-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "db62be5c99806fe37594d2483832dc278088db17dafaf99d48e44e1911c4593a",
                "md5": "b7d095100feb9df697227c2e5b7a3d2e",
                "sha256": "69269336bcc4709b64e758bb0d98632b15372633b49b8819a40b5cdf5f68cbe7"
            },
            "downloads": -1,
            "filename": "malexport-0.1.5.tar.gz",
            "has_sig": false,
            "md5_digest": "b7d095100feb9df697227c2e5b7a3d2e",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 49463,
            "upload_time": "2024-07-03T05:48:15",
            "upload_time_iso_8601": "2024-07-03T05:48:15.625863Z",
            "url": "https://files.pythonhosted.org/packages/db/62/be5c99806fe37594d2483832dc278088db17dafaf99d48e44e1911c4593a/malexport-0.1.5.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-07-03 05:48:15",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "seanbreckenridge",
    "github_project": "malexport",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "malexport"
}
        
Elapsed time: 0.26950s