echr-extractor

Name	echr-extractor
Version	1.0.43
Summary	Library for extracting ECHR data
upload_time	2023-10-04 15:31:25
docs_url	None
author	LawTech Lab
license	MIT
keywords	echr, extractor, european, convention, human, rights, european convention, human rights, european convention on human rights
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

## echr extractor

This library contains functions to get ECHR data.

## Version

Python 3.9

## Contributors

<!-- readme: contributors,gijsvd -start -->
<table>
<tr>
    <td align="center">
        <a href="https://github.com/brodriguesdemiranda">
            <img src="https://avatars.githubusercontent.com/u/35369949?v=4" width="100;" alt="brodriguesdemiranda"/>
            <br />
            <sub><b>Benjamin Rodrigues de Miranda</b></sub>
        </a>
    </td>
    <td align="center">
        <a href="https://github.com/ChloeCro">
            <img src="https://avatars.githubusercontent.com/u/99276050?v=4" width="100;" alt="ChloeCro"/>
            <br />
            <sub><b>Chloe Crombach</b></sub>
        </a>
    </td>
    <td align="center">
        <a href="https://github.com/Cloud956">
            <img src="https://avatars.githubusercontent.com/u/24865274?v=4" width="100;" alt="Cloud956"/>
            <br />
            <sub><b>Piotr Lewandowski</b></sub>
        </a>
    </td>
    <td align="center">
        <a href="https://github.com/pranavnbapat">
            <img src="https://avatars.githubusercontent.com/u/7271334?v=4" width="100;" alt="pranavnbapat"/>
            <br />
            <sub><b>Pranav Bapat</b></sub>
        </a>
    </td>
    <td align="center">
        <a href="https://github.com/running-machin">
            <img src="https://avatars.githubusercontent.com/u/60750154?v=4" width="100;" alt="running-machin"/>
            <br />
            <sub><b>running-machin</b></sub>
        </a>
    </td>
    <td align="center">
        <a href="https://github.com/shashankmc">
            <img src="https://avatars.githubusercontent.com/u/3445114?v=4" width="100;" alt="shashankmc"/>
            <br />
            <sub><b>shashankmc</b></sub>
        </a>
    </td>
    <td align="center">
        <a href="https://github.com/gijsvd">
            <img src="https://avatars.githubusercontent.com/u/31765316?v=4" width="100;" alt="gijsvd"/>
            <br />
            <sub><b>gijsvd</b></sub>
        </a>
    </td>
</tr>
</table>
<!-- readme: contributors,gijsvd -end -->

## How to install?

<code>pip install echr-extractor</code>

## What are the functions?

<ol>
    <li><code>get_echr</code></li>
    Gets all available metadata for ECHR cases from the HUDOC database.
    The result can be saved to a file or returned in memory.
<br>
    <li><code>get_echr_extra</code></li>
    Gets all available metadata for ECHR cases from the HUDOC database and, in addition,
    downloads the full text of each case. The result can be saved to files or returned in memory.
<br>
    <li><code>get_nodes_edges</code></li>
    Gets all available nodes and edges for the ECHR cases in the given HUDOC metadata.
</ol>

## What are the parameters?

<ol>
    <li><code>get_echr</code></li> 
    <ul>
        <li><strong>start_id: int, optional, default: 0</strong></li>
        The id of the first case to be downloaded.
         <li><strong>end_id: int, optional, default: The maximum number of cases available</strong></li>
        The id of the last case to be downloaded.
        <li><strong>count: int, optional, default: None </strong></li>
        The number of cases per language to be downloaded, starting from the start_id. 
        <br><strong>!NOTICE!</strong><br>
        If count is provided, the end_id will be set to start_id+count, overwriting any given end_id value.
        <li><strong>start_date: date, optional, default None</strong></li>
        The start publication date (yyyy-mm-dd)
        <li><strong>end_date: date, optional, default current date</strong></li>
        The end publication date (yyyy-mm-dd)
        <li><strong>verbose: boolean, optional, default False</strong></li>
        This option allows for additional printing, showing live progress of the extraction process.
        <li><strong>fields: list of strings, optional, default all available fields</strong></li>
        Limits the metadata to be downloaded to the given fields. Each field becomes a separate
        column in the CSV file or DataFrame object. The full list of fields is given in the Appendix.
        <li><strong>save_file: ['y', 'n'], optional, default 'y'</strong></li>
        Save the metadata as a CSV file in the data folder ('y'), or return it as a Pandas DataFrame object in memory ('n').
        <li><strong>link: string, optional, default None</strong></li>
        Allows the user to download the results of a search on the HUDOC website. Since HUDOC does not provide proper API documentation,
        this method attempts to recreate an API call from the observed relationship between the browser link and the API call.
        It might encounter errors, as not all possible behaviors were tested.
        If this argument is provided, all other arguments except 'fields' are ignored. Further information on proper usage is in the Appendix.
        <li><strong>query_payload: string, optional, default None</strong></li>
        Allows the user to download the results of a search on the HUDOC website. If this argument is provided, it takes priority over the 'link' parameter.
        This method is much more robust than using the 'link' parameter. It requires the user to open the Network tab of their browser;
        full information on proper usage is in the Appendix.
        <li><strong>language: list of strings, optional, default ['ENG']</strong></li>
        The language of the metadata to be downloaded, chosen from the available languages.
        <br><strong>!NOTICE!</strong><br>
        If link or query_payload is provided, the language argument is not used, because the language is already part of the link or query.
</ul>
    <li><code>get_echr_extra</code></li>
    <ul> 
        <li><strong>start_id: int, optional, default: 0</strong></li>
        The id of the first case to be downloaded.
        <li><strong>end_id: int, optional, default: The maximum number of cases available</strong></li>
        The id of the last case to be downloaded.
        <li><strong>count: int, optional, default: None </strong></li>
        The number of cases per language given as input to be downloaded, starting from the start_id. 
        <br><strong>!NOTICE!</strong><br>
        If count is provided, the end_id will be set to start_id+count, overwriting any given end_id value.
        <li><strong>start_date: date, optional, default None</strong></li>
        The start publication date (yyyy-mm-dd)
        <li><strong>end_date: date, optional, default current date</strong></li>
        The end publication date (yyyy-mm-dd)
        <li><strong>verbose: boolean, optional, default False</strong></li>
        This option allows for additional printing, showing live progress of the extraction process.
        <li><strong>skip_missing_dates: boolean, optional, default False</strong></li>
        When enabled, cases without a judgement date are skipped during extraction.
        <li><strong>fields: list of strings, optional, default all available fields</strong></li>
        Limits the metadata to be downloaded to the given fields. Each field becomes a separate
        column in the CSV file or DataFrame object. The full list of fields is given in the Appendix.
        <li><strong>save_file: ['y', 'n'], optional, default 'y'</strong></li>
        Save the metadata as a CSV file in the data folder and the full text as a JSON file ('y'),
        or return a Pandas DataFrame object and a list of dictionaries in memory ('n').
        <li><strong>language: list of strings, optional, default ['ENG']</strong></li>
        The language of the metadata to be downloaded, chosen from the available languages.
        <br><strong>!NOTICE!</strong><br>
        If link or query_payload is provided, the language argument is not used, because the language is already part of the link or query.
        <li><strong>link: string, optional, default None</strong></li>
        Allows the user to download the results of a search on the HUDOC website. Since HUDOC does not provide proper API documentation,
        this method attempts to recreate an API call from the observed relationship between the browser link and the API call.
        It might encounter errors, as not all possible behaviors were tested.
        If this argument is provided, all other arguments except 'fields' are ignored. Further information on proper usage is in the Appendix.
        <li><strong>query_payload: string, optional, default None</strong></li>
        Allows the user to download the results of a search on the HUDOC website. If this argument is provided, it takes priority over the 'link' parameter.
        This method is much more robust than using the 'link' parameter. It requires the user to open the Network tab of their browser;
        full information on proper usage is in the Appendix.
        <li><strong>threads: int, optional, default: 10</strong></li>
        The full text download is a parallelizable process.
        This parameter determines the number of threads to be used in the download.
    </ul>
    <li><code>get_nodes_edges</code></li>
    <ul>
        <li><strong>metadata_path</strong></li>
        The path to the metadata file to read.
        <li><strong>df</strong></li>
        Alternative to metadata_path: the user can provide a Pandas DataFrame object instead. If both are given, df is ignored.
        <li><strong>save_file: ['y', 'n'], optional, default 'y'</strong></li>
        Save the nodes and edges of the cases in the metadata as CSV files in the data folder, or return them as Pandas DataFrame objects in memory.
    </ul>
</ol>
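
Two of the rules above are easy to misread: `count` overrides `end_id`, and `query_payload` takes priority over `link`. The following is a minimal sketch of those documented rules only. It is not the library's code, and the helper names `resolve_id_range` and `pick_search_mode` are hypothetical, introduced purely for illustration.

```python
from typing import Optional, Tuple


def resolve_id_range(start_id: int = 0,
                     end_id: Optional[int] = None,
                     count: Optional[int] = None) -> Tuple[int, Optional[int]]:
    """Hypothetical helper mirroring the documented rule: count overrides end_id."""
    if count is not None:
        # Documented behaviour: end_id is set to start_id + count.
        return start_id, start_id + count
    # end_id=None stands for "the maximum number of cases available".
    return start_id, end_id


def pick_search_mode(link: Optional[str] = None,
                     query_payload: Optional[str] = None) -> str:
    """Hypothetical helper: report which input would drive the download."""
    if query_payload is not None:
        return "query_payload"  # documented to take priority over link
    if link is not None:
        return "link"           # other arguments except fields are ignored
    return "id_range"           # start_id, end_id, count, dates and language apply


print(resolve_id_range(start_id=1000, end_id=5000, count=2000))  # (1000, 3000)
print(pick_search_mode(link="https://hudoc.echr.coe.int/eng#%20",
                       query_payload="..."))                     # query_payload
```

In other words, passing both `end_id` and `count` is harmless but redundant, and passing both `link` and `query_payload` means only the payload is used.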

## Examples

```
import echr_extractor as echr

# Examples that save the results to files:

df, json = echr.get_echr_extra(count=100, save_file='y', threads=10)
df = echr.get_echr(start_id=1, save_file='y', skip_missing_dates=True)

# Examples that return the results in memory:

df, json = echr.get_echr_extra(start_id=20, end_id=3000, save_file='n')

df = echr.get_echr(start_id=1000, count=2000, save_file='n', verbose=True)

nodes, edges = echr.get_nodes_edges(metadata_path='data/echr_metadata.csv', save_file='n')
```
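
The `threads` argument of `get_echr_extra` controls how many full-text downloads run in parallel. This README does not say how the library implements that internally, so the sketch below only illustrates the general idea with the standard library's `concurrent.futures`. The function `fetch_full_text` is a stand-in that performs no network access, and the item ids are made-up placeholders.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List


def fetch_full_text(item_id: str) -> Dict[str, str]:
    """Stand-in for one full-text download; a real version would query HUDOC."""
    return {"item_id": item_id, "full_text": f"text of {item_id}"}


def download_all(item_ids: List[str], threads: int = 10) -> List[Dict[str, str]]:
    """Fetch every item using a pool of `threads` worker threads."""
    with ThreadPoolExecutor(max_workers=threads) as pool:
        # map() returns results in input order even though calls run concurrently.
        return list(pool.map(fetch_full_text, item_ids))


results = download_all(["id-1", "id-2", "id-3"], threads=2)
print(len(results))  # 3
```

Downloads are I/O-bound, which is why threads help here; a higher `threads` value increases parallelism but also the load placed on the remote server.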

## License
[![License: Apache 2.0](https://img.shields.io/github/license/maastrichtlawtech/extraction_libraries)](https://opensource.org/licenses/Apache-2.0)

Previously under the [MIT License](https://opensource.org/licenses/MIT), as of 28/10/2022 this work is licensed under the [Apache License, Version 2.0](https://opensource.org/licenses/Apache-2.0).

```
Apache License, Version 2.0

Copyright (c) 2022 Maastricht Law & Tech Lab

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
```

## Appendix

To use the 'link' parameter of the extraction methods, go to

https://hudoc.echr.coe.int/eng#%20

There, use the Advanced Search tools of HUDOC to search for specific cases.
Afterwards, copy the link of the current page and pass it to the extraction methods.
Note that the link only updates after the 'search' button of the Advanced Search is clicked.

Known issues with the 'link' method:

- Using the " character in a search will cause the extraction to fail. It only works if that character is in the
Text section, where it is essential for proper use of the search. In all other search fields, please do not use the " character.
If it is essential for your work, please raise an issue on GitHub, and we can try to manually fix another field.

The full list of fields is as follows:

```
fields = ['itemid','applicability','application','appno','article','conclusion','decisiondate','docname',
'documentcollectionid','documentcollectionid2','doctype','doctypebranch','ecli','externalsources','extractedappno',
'importance','introductiondate','isplaceholder','issue','judgementdate','kpdate','kpdateAsText','kpthesaurus',
'languageisocode','meetingnumber','originatingbody','publishedby','Rank','referencedate','reportdate','representedby',
'resolutiondate','resolutionnumber','respondent','respondentOrderEng','rulesofcourt','separateopinion','scl',
'sharepointid','typedescription','nonviolation','violation']
```

These fields can take different values; for more information, head to https://hudoc.echr.coe.int.
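
Because the field names are case-sensitive and easy to misspell (for example `judgementdate` and `Rank`), it can help to check a `fields` argument against the list above before starting a long download. The sketch below uses a hypothetical helper, `unknown_fields`, that is not part of the library. How the library itself reacts to an unknown field name is not documented here.

```python
from typing import Iterable, List

# The field names listed in this Appendix.
AVAILABLE_FIELDS = {
    'itemid', 'applicability', 'application', 'appno', 'article', 'conclusion',
    'decisiondate', 'docname', 'documentcollectionid', 'documentcollectionid2',
    'doctype', 'doctypebranch', 'ecli', 'externalsources', 'extractedappno',
    'importance', 'introductiondate', 'isplaceholder', 'issue', 'judgementdate',
    'kpdate', 'kpdateAsText', 'kpthesaurus', 'languageisocode', 'meetingnumber',
    'originatingbody', 'publishedby', 'Rank', 'referencedate', 'reportdate',
    'representedby', 'resolutiondate', 'resolutionnumber', 'respondent',
    'respondentOrderEng', 'rulesofcourt', 'separateopinion', 'scl',
    'sharepointid', 'typedescription', 'nonviolation', 'violation',
}


def unknown_fields(requested: Iterable[str]) -> List[str]:
    """Return the requested names that are not in the documented list, in input order."""
    return [name for name in requested if name not in AVAILABLE_FIELDS]


wanted = ['itemid', 'docname', 'judgmentdate']  # note the misspelling
print(unknown_fields(wanted))  # ['judgmentdate']
```

An empty result means every requested name appears in the documented list and can be passed as `fields=wanted`.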

### Query_payload Parameter
This section explains, step by step, how to use the 'query_payload' parameter.

1. Go to the [HUDOC website](https://hudoc.echr.coe.int/eng#%20). 
2. Input your search parameters.
3. Right-click on the website and inspect the website elements. <br>
![guide](https://raw.githubusercontent.com/maastrichtlawtech/extraction_libraries/echr/echr/inspect.png)
4. After the tab on the right side has opened, enter the network section, which records requests made by the website. <br>
![guide2](https://raw.githubusercontent.com/maastrichtlawtech/extraction_libraries/echr/echr/network.png)
5. Here you might see some requests already present. If that is the case, press the record button twice to clear the history.
Otherwise, continue to step 6.<br>
![guide3](https://raw.githubusercontent.com/maastrichtlawtech/extraction_libraries/echr/echr/record.png)
6. Once you are recording new requests and the history has been cleared, click the search button on the website to execute your search.
![guide4](https://raw.githubusercontent.com/maastrichtlawtech/extraction_libraries/echr/echr/search.png)
7. Now in the network tab, you should see new request records appear. Click the one on the top to inspect it.
![guide5](https://raw.githubusercontent.com/maastrichtlawtech/extraction_libraries/echr/echr/grab_api.png)
8. A tab with request information should appear. Enter the Payload section.
![guide6](https://raw.githubusercontent.com/maastrichtlawtech/extraction_libraries/echr/echr/enter_payload.png)
9. The query payload should be shown here. Copy its value and pass it as the 'query_payload' argument in your code. Wrap the value in single quotation marks ( ' ), because the query payload might itself contain double quotation marks ( " ).
![guide7](https://raw.githubusercontent.com/maastrichtlawtech/extraction_libraries/echr/echr/success.png)
10. Now you know how to use the query_payload parameter!
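
The quoting advice in step 9 can be sketched as follows. The payload text below is an invented placeholder, not a real HUDOC query; always paste the value copied from the Payload section. The helper `build_kwargs` is hypothetical and only groups the arguments.

```python
# Placeholder payload: single quotes outside, so double quotes inside need no escaping.
query_payload = 'example AND field="value"'


def build_kwargs(payload: str) -> dict:
    """Hypothetical helper: collect the arguments for an in-memory download."""
    return {"query_payload": payload, "save_file": "n"}


kwargs = build_kwargs(query_payload)
print(kwargs["query_payload"])

# With the package installed, the call would then look like:
#   import echr_extractor as echr
#   df = echr.get_echr(**kwargs)
```

If a copied payload ever contains both single and double quotation marks, a triple-quoted Python string (`'''...'''`) avoids manual escaping.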

Raw data

{
    "_id": null,
    "home_page": "",
    "name": "echr-extractor",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "echr,extractor,european,convention,human,rights,european convention,human rights,european convention on human rights",
    "author": "LawTech Lab",
    "author_email": "a.gade@student.maastrichtuniversity.nl",
    "download_url": "https://files.pythonhosted.org/packages/07/2c/4bd625f53260e606be93f20d5e9123c9ff3ca8ec12a839687a1386998d9c/echr_extractor-1.0.43.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Library for extracting ECHR data",
    "version": "1.0.43",
    "project_urls": {
        "Bug Tracker": "https://github.com/maastrichtlawtech/extraction_libraries",
        "Build Source": "https://github.com/maastrichtlawtech/extraction_libraries"
    },
    "split_keywords": [
        "echr",
        "extractor",
        "european",
        "convention",
        "human",
        "rights",
        "european convention",
        "human rights",
        "european convention on human rights"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "072c4bd625f53260e606be93f20d5e9123c9ff3ca8ec12a839687a1386998d9c",
                "md5": "fdee4eff1ad2165b2098046035882ffb",
                "sha256": "d07fb02f8f588bc9f8510bb5e7e7ff4b5c0b70540403045fdb86eee03239cc90"
            },
            "downloads": -1,
            "filename": "echr_extractor-1.0.43.tar.gz",
            "has_sig": false,
            "md5_digest": "fdee4eff1ad2165b2098046035882ffb",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 18235,
            "upload_time": "2023-10-04T15:31:25",
            "upload_time_iso_8601": "2023-10-04T15:31:25.955008Z",
            "url": "https://files.pythonhosted.org/packages/07/2c/4bd625f53260e606be93f20d5e9123c9ff3ca8ec12a839687a1386998d9c/echr_extractor-1.0.43.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-10-04 15:31:25",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "maastrichtlawtech",
    "github_project": "extraction_libraries",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "echr-extractor"
}
