# Cian Parser #
## What is this? ##
This is a parser with which you can easily get data from a website cian.ru.
## Quick Guide ##
This module is based on Selenium-Stealth, using BeautifulSoup as well as Asyncio
Data you can get:
1) Name of apartment
2) The city district in which the housing estate is located
3) Price of the apartment
4) Time to the subway
5) How to get to the subway
6) Nearest subway
7) Price per square meter
8) Total square footage
9) Living Space
10) Floor
11) Number of stories in the house
12) Year of delivery of the house
13) Surrendered or not
14) Finishing
15) Parking
16) Ceiling Heights
17) Builder Rating
----------
### Using ###
Using the library is as simple and convenient as possible:
Let's import it first:
First, import everything from the library (use the `from `...` import *` construct).
Examples of all operations:
Сreate an instance of a class `Cian_Parser` (PATH - file save path, URL - site url, BOOST (True or False) - you can also receive a seller rating, but the speed is reduced several times, COUNT_PAGE - how many pages of apartments do you want to receive):
parser = Cian_Parser(PATH, URL, BOOST, COUNT_PAGE)
Receive all data of all apartments in CSV format using the `start_parsing()` function:
parser.start_parsing()
If you want to create your own parser logic, then use the description of the other modules:
Сreate an instance of a class `Pagination` (parser - parser instance from the Flats_Url class, next_button_selector - XPATH pagination buttons)
pagination = Pagination(parser, next_button_selector)
Checking for next page using the `HasNextPage()` function:
await pagination.HasNextPage()
Go to next page using the `GoToTheNextPage()` function:
await pagination.GoToTheNextPage()
## Developer ##
My site: [link](https://ezsmail.github.io/Profile/)
Raw data
{
"_id": null,
"home_page": "https://github.com/EZsmail/cian_parser",
"name": "parser-cian",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "python parser requests cian realestate beautifulsoup dataset",
"author": "Anatoly Gvozdev",
"author_email": "tolagvgvgvgv@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/e3/2e/d52a0e4b5283b60a982def88cc0dc472bda7a1a08531b04529d49a6a1951/parser_cian-0.1.6.tar.gz",
"platform": null,
"description": "# Cian Parser #\n\n## What is this? ##\nThis is a parser with which you can easily get data from a website cian.ru.\n\n## Quick Guide ##\nThis module is based on Selenium-Stealth, using BeautifulSoup as well as Asyncio\n\nData you can get:\n1) Name of apartment\n2) The city district in which the housing estate is located\n3) Price of the apartment \n4) Time to the subway\n5) How to get to the subway \n6) Nearest subway \n7) Price per square meter \n8) Total square footage \n9) Living Space \n10) Floor \n11) Number of stories in the house \n12) Year of delivery of the house \n13) Surrendered or not \n14) Finishing \n15) Parking \n16) Ceiling Heights \n17) Builder Rating \n\n----------\n\n\n### Using ###\n\n\nUsing the library is as simple and convenient as possible:\n\nLet's import it first:\nFirst, import everything from the library (use the `from `...` import *` construct).\n\nExamples of all operations:\n\n\u0421reate an instance of a class `Cian_Parser` (PATH - file save path, URL - site url, BOOST (True or False) - you can also receive a seller rating, but the speed is reduced several times, COUNT_PAGE - how many pages of apartments do you want to receive):\n\n parser = Cian_Parser(PATH, URL, BOOST, COUNT_PAGE)\n\nReceive all data of all apartments in CSV format using the `start_parsing()` function:\n\n parser.start_parsing()\n\nIf you want to create your own parser logic, then use the description of the other modules:\n\n\u0421reate an instance of a class `Pagination` (parser - parser instance from the Flats_Url class, next_button_selector - XPATH pagination buttons)\n\n pagination = Pagination(parser, next_button_selector)\n\nChecking for next page using the `HasNextPage()` function:\n\n await pagination.HasNextPage()\n\nGo to next page using the `GoToTheNextPage()` function:\n\n await pagination.GoToTheNextPage()\n\n\n\n## Developer ##\nMy site: [link](https://ezsmail.github.io/Profile/)\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Parser for easy data retrieval from cian.ru",
"version": "0.1.6",
"project_urls": {
"Homepage": "https://github.com/EZsmail/cian_parser"
},
"split_keywords": [
"python",
"parser",
"requests",
"cian",
"realestate",
"beautifulsoup",
"dataset"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "e32ed52a0e4b5283b60a982def88cc0dc472bda7a1a08531b04529d49a6a1951",
"md5": "ba20e17058ef47339002e18e9ac63e08",
"sha256": "ef8ef6d2f4eb8e413fe5e5dfc475d370fd5870782e86f4a14fa22a07e6148884"
},
"downloads": -1,
"filename": "parser_cian-0.1.6.tar.gz",
"has_sig": false,
"md5_digest": "ba20e17058ef47339002e18e9ac63e08",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 7071,
"upload_time": "2023-11-11T22:01:43",
"upload_time_iso_8601": "2023-11-11T22:01:43.236677Z",
"url": "https://files.pythonhosted.org/packages/e3/2e/d52a0e4b5283b60a982def88cc0dc472bda7a1a08531b04529d49a6a1951/parser_cian-0.1.6.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-11-11 22:01:43",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "EZsmail",
"github_project": "cian_parser",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "parser-cian"
}