parser-cian


Name: parser-cian
Version: 0.1.6
Home page: https://github.com/EZsmail/cian_parser
Summary: Parser for easy data retrieval from cian.ru
Upload time: 2023-11-11 22:01:43
Author: Anatoly Gvozdev
License: MIT
Keywords: python, parser, requests, cian, realestate, beautifulsoup, dataset
Requirements: No requirements were recorded.

# Cian Parser #

## What is this? ##
This is a parser that makes it easy to collect data from the website cian.ru.

## Quick Guide ##
This module is built on Selenium-Stealth and also uses BeautifulSoup and asyncio.

Data you can get:
1) Name of the apartment
2) City district in which the housing estate is located
3) Price of the apartment
4) Travel time to the subway
5) How to get to the subway
6) Nearest subway station
7) Price per square meter
8) Total area
9) Living area
10) Floor
11) Number of floors in the building
12) Year of completion of the building
13) Whether the building has been handed over (completed) or not
14) Interior finish
15) Parking
16) Ceiling height
17) Builder rating

----------


### Usage ###


Using the library is straightforward.

First, import everything from the library (use the `from ... import *` construct).

Examples of all operations:

Create an instance of the `Cian_Parser` class. `PATH` is the path where the output file is saved, `URL` is the site URL, `BOOST` (True or False) relates to also collecting the seller rating, which makes parsing several times slower, and `COUNT_PAGE` is the number of pages of apartments to retrieve:

    parser = Cian_Parser(PATH, URL, BOOST, COUNT_PAGE)
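Because the four arguments are positional, it can help to check them before a browser session starts. The sketch below is only an illustration: `ParserArgs` is a hypothetical helper that is not part of parser-cian, the validation rules are assumptions, and the file name and URL are placeholder values.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class ParserArgs:
    """Hypothetical helper (not part of parser-cian) holding the four arguments."""

    path: str        # PATH: where the output file is saved
    url: str         # URL: site URL to parse
    boost: bool      # BOOST: True or False, see the description above
    count_page: int  # COUNT_PAGE: number of pages of apartments to retrieve

    def __post_init__(self):
        # Assumed sanity checks; the library itself may validate differently.
        if urlparse(self.url).scheme not in ("http", "https"):
            raise ValueError(f"URL must start with http:// or https://, got {self.url!r}")
        if not isinstance(self.boost, bool):
            raise TypeError("BOOST must be True or False")
        if self.count_page < 1:
            raise ValueError("COUNT_PAGE must be at least 1")

    def as_tuple(self):
        # Same order as Cian_Parser(PATH, URL, BOOST, COUNT_PAGE).
        return (self.path, self.url, self.boost, self.count_page)


# Placeholder values chosen for illustration only.
args = ParserArgs("flats.csv", "https://www.cian.ru/", False, 3)
# With the library imported, the instance would then be created as:
# parser = Cian_Parser(*args.as_tuple())
```

Keeping the values in one object makes the positional order explicit and lets mistakes such as a page count of zero fail early.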

Get the data for all apartments in CSV format with the `start_parsing()` method:

    parser.start_parsing()
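Once the CSV file has been written, it can be loaded with the standard library. This is a minimal sketch that assumes the file is UTF-8 encoded, comma-delimited, and starts with a header row; none of this is confirmed by the README, and `load_flats` is a hypothetical helper name.

```python
import csv


def load_flats(csv_path, delimiter=","):
    """Read the saved CSV into a list of dicts keyed by the header row.

    Assumes a header row, UTF-8 encoding and the given delimiter.
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle, delimiter=delimiter))
```

Calling `load_flats(PATH)` with the same path that was passed to `Cian_Parser` returns one dictionary per apartment. If the columns look merged into one, try a different `delimiter`, for example `";"`.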

If you want to write your own parsing logic, use the other modules described below:

Create an instance of the `Pagination` class (`parser` is a parser instance of the `Flats_Url` class, `next_button_selector` is the XPath of the pagination buttons):

    pagination = Pagination(parser, next_button_selector)

Check whether there is a next page using the `HasNextPage()` function:

    await pagination.HasNextPage()

Go to the next page using the `GoToTheNextPage()` function:

    await pagination.GoToTheNextPage()
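The two calls above would typically be combined in a loop. The sketch below assumes that `HasNextPage()` returns a truthy value when another page exists, which the README does not state explicitly. `walk_pages` and `FakePagination` are hypothetical names; the fake class only mimics the two method names so the loop can be tried without a browser.

```python
import asyncio


async def walk_pages(pagination, max_pages):
    """Advance through result pages and return how many pages were visited."""
    visited = 1  # the page that is already open counts as the first one
    while visited < max_pages and await pagination.HasNextPage():
        await pagination.GoToTheNextPage()
        visited += 1
    return visited


class FakePagination:
    """Hypothetical stand-in with the same two method names, for a dry run."""

    def __init__(self, total_pages):
        self.current = 1
        self.total_pages = total_pages

    async def HasNextPage(self):
        return self.current < self.total_pages

    async def GoToTheNextPage(self):
        self.current += 1


# Dry run: five pages available, but stop after three.
print(asyncio.run(walk_pages(FakePagination(total_pages=5), max_pages=3)))
```

With a real `Pagination` instance, the per-page scraping logic would go inside the loop, before the call to `GoToTheNextPage()`.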



## Developer ##
My site: [link](https://ezsmail.github.io/Profile/)

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/EZsmail/cian_parser",
    "name": "parser-cian",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "python parser requests cian realestate beautifulsoup dataset",
    "author": "Anatoly Gvozdev",
    "author_email": "tolagvgvgvgv@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/e3/2e/d52a0e4b5283b60a982def88cc0dc472bda7a1a08531b04529d49a6a1951/parser_cian-0.1.6.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Parser for easy data retrieval from cian.ru",
    "version": "0.1.6",
    "project_urls": {
        "Homepage": "https://github.com/EZsmail/cian_parser"
    },
    "split_keywords": [
        "python",
        "parser",
        "requests",
        "cian",
        "realestate",
        "beautifulsoup",
        "dataset"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e32ed52a0e4b5283b60a982def88cc0dc472bda7a1a08531b04529d49a6a1951",
                "md5": "ba20e17058ef47339002e18e9ac63e08",
                "sha256": "ef8ef6d2f4eb8e413fe5e5dfc475d370fd5870782e86f4a14fa22a07e6148884"
            },
            "downloads": -1,
            "filename": "parser_cian-0.1.6.tar.gz",
            "has_sig": false,
            "md5_digest": "ba20e17058ef47339002e18e9ac63e08",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 7071,
            "upload_time": "2023-11-11T22:01:43",
            "upload_time_iso_8601": "2023-11-11T22:01:43.236677Z",
            "url": "https://files.pythonhosted.org/packages/e3/2e/d52a0e4b5283b60a982def88cc0dc472bda7a1a08531b04529d49a6a1951/parser_cian-0.1.6.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-11-11 22:01:43",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "EZsmail",
    "github_project": "cian_parser",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "parser-cian"
}
        