# Process & Manipulate HTML via Python API
<!--- banner links to [Temporary License](https://purchase.aspose.com/temporary-license) -->
[![banner](https://products.aspose.com/html/images/aspose_html-for-python-banner-TL.png)](https://purchase.aspose.com/temporary-license)
[Product Page](https://products.aspose.com/html/python-net/) | [Docs](https://docs.aspose.com/html/python-net/) | [Demos](https://products.aspose.app/html/applications) | [API Reference](https://reference.aspose.com/html/python-net/) | [Examples](https://github.com/aspose-html/) | [Blog](https://blog.aspose.com/category/html/) | [Search](https://search.aspose.com/) | [Free Support](https://forum.aspose.com/c/html/29)
[Aspose.HTML for Python via .NET](https://products.aspose.com/html/python-net/) is a powerful API for Python that provides headless browser functionality, allowing you to work with HTML documents. With this API, you can easily create new HTML documents or open existing ones from different sources. Once you have the document, you can perform various manipulation operations, such as removing and replacing HTML nodes, rendering, and converting HTML to other popular formats, etc.
## HTML API Features
The following are some popular features of Aspose.HTML for Python via .NET:
- Convert HTML documents to a wide range of formats, including PDF, XPS, DOCX, and different image formats (PNG, JPEG, BMP, TIFF, and GIF).
- Load and convert SVG and EPUB to XPS, PDF, and images.
- Navigate through documents using either NodeIterator or TreeWalker.
- Set and control the timeout for the rendering process.
- Use MutationObserver to monitor DOM modifications.
- Populate HTML documents with external data sources such as XML and JSON.
- Support for both single (PDF, XPS) and multiple (image formats) output file streams.
- Extract CSS styling information from HTML documents.
- Configure a Sandbox environment that is independent of the execution machine, ensuring a secure and isolated environment for running and testing.
## Read & Write Web Formats
- Web: HTML, XHTML, MHTML
- Other: SVG, MD (Markdown)
## Save HTML As
- Fixed Layout: PDF, XPS
- Images: TIFF, JPEG, PNG, BMP, GIF
## Read Formats
- EPUB
## Platform Independence
Aspose.HTML for Python via .NET can be used to develop applications for a vast range of operating systems, such as Windows, where Python 3.5 or later is installed. You can build both 32-bit and 64-bit Python applications.
## Get Started
Are you ready to give Aspose.HTML for Python via .NET a try?
Simply run ```pip install aspose-html-net``` from the Console to fetch the package.
If you already have Aspose.HTML for Python via .NET and want to upgrade the version, please run ```pip install --upgrade aspose-html-net``` to get the latest version.
You can run the following snippets in your environment to see how Aspose.HTML works, or check out the [GitHub Repository](https://github.com/aspose-html/) or [Aspose.HTML for Python via .NET Documentation](https://docs.aspose.com/html/python-net/) for other common use cases.
## Using Python to Convert HTML to Markdown (MD)
Aspose.HTML for Python via .NET allows you to convert HTML to PDF, XPS, Markdown, MHTML, PNG, JPEG, and other file formats. The following snippet demonstrates the conversion from HTML to GIT-based Markdown (MD) Format:
```python
import os
from aspose.html import *
from aspose.html.converters import *
from aspose.html.saving import *
# Setup directories
output_folder = "output/"
input_folder = "data/"
if not os.path.exists(output_folder):
os.makedirs(output_folder)
# Prepare an HTML code and save it to the file
code = "<h1>Header 1</h1>" \
"<h2>Header 2</h2>" \
"<p>Hello World!!</p>"
document_path = os.path.join(input_folder, 'document.html')
output_path = os.path.join(output_folder, 'output.md')
with open(document_path, 'w', encoding="utf-8") as f:
f.write(code)
f.close()
# Call convert_html method to convert HTML to Markdown.
Converter.convert_html(document_path, MarkdownSaveOptions.git, output_path)
```
[Product Page](https://products.aspose.com/html/python-net/) | [Docs](https://docs.aspose.com/html/python-net/) | [Demos](https://products.aspose.app/html/applications) | [API Reference](https://reference.aspose.com/html/python-net/) | [Examples](https://github.com/aspose-html/) | [Blog](https://blog.aspose.com/category/html/) | [Search](https://search.aspose.com/) | [Free Support](https://forum.aspose.com/c/html/29) | [Temporary License](https://purchase.aspose.com/temporary-license)
Raw data
{
"_id": null,
"home_page": "https://products.aspose.com/html/python-net/",
"name": "aspose-html-net",
"maintainer": null,
"docs_url": null,
"requires_python": "<3.12,>=3.9",
"maintainer_email": null,
"keywords": "Aspose, Aspose.HTML, Aspose.Total, Python, Component, Conholdate, Conholdate.Total, convert, converter, vectorization, vectorizer, vectorize, encoding, base64, css, DOM, dom-manipulation, image, JPEG, Library, Microsoft, mvc, net-standard, netcore, netstandard, to PDF, PNG, rotate, security, Standard, XPath, XPS, SVG, Windows, Vector Graphics, line, Bezier Curves, Cubic, Glyphs, Paths, Text, Scaling, Rotation, Skewing, Fill, Dots, Bitmap, HTML, HTML-to-Image, HTML-to-PDF, HTML-to-XPS, HTML-to-DOCX, HTML-to-MHTML, HTML-to-Markdown, HTML-to-MD, EPUB-to-Image, EPUB-to-PDF, EPUB-to-XPS, Markdown-to-HTML, MD-to-HTML, HTML-Template, HTML-Converter, HTML-Renderer, SVG-Renderer, MHTML-Renderer, EPUB-Renderer, HTML5, HTML5-Canvas, HTML-DOM, HTML-Form-Editor, HTML, XHTML, MHTML, EPUB, SVG, MD, Markdown, DOCX, PDF, XPS, TIFF, JPEG, JPG, PNG, BMP, GIF, WEBP, eBook-API, Headless-Browser, Document-Manipulation, Web-Scraping, Conversion, Crawler, Hypertext, Markup, XPath, Spider, CSS, CSS3, Converter, DOM, Convert, Azure, parse, render, image, graphics, to HTML-code, XHTML-to-PDF, MHTML-to-PDF, HTML-to-GIF, MHTML-to-JPG",
"author": "Aspose",
"author_email": null,
"download_url": null,
"platform": "linux_x86_64",
"description": "# Process & Manipulate HTML via Python API\n\n<!--- banner links to [Temporary License](https://purchase.aspose.com/temporary-license) -->\n[![banner](https://products.aspose.com/html/images/aspose_html-for-python-banner-TL.png)](https://purchase.aspose.com/temporary-license)\n\n[Product Page](https://products.aspose.com/html/python-net/) | [Docs](https://docs.aspose.com/html/python-net/) | [Demos](https://products.aspose.app/html/applications) | [API Reference](https://reference.aspose.com/html/python-net/) | [Examples](https://github.com/aspose-html/) | [Blog](https://blog.aspose.com/category/html/) | [Search](https://search.aspose.com/) | [Free Support](https://forum.aspose.com/c/html/29) \n\n[Aspose.HTML for Python via .NET](https://products.aspose.com/html/python-net/) is a powerful API for Python that provides headless browser functionality, allowing you to work with HTML documents. With this API, you can easily create new HTML documents or open existing ones from different sources. Once you have the document, you can perform various manipulation operations, such as removing and replacing HTML nodes, rendering, and converting HTML to other popular formats, etc.\n\n## HTML API Features\n\nThe following are some popular features of Aspose.HTML for Python via .NET:\n\n- Convert HTML documents to a wide range of formats, including PDF, XPS, DOCX, and different image formats (PNG, JPEG, BMP, TIFF, and GIF).\n- Load and convert SVG and EPUB to XPS, PDF, and images.\n- Navigate through documents using either NodeIterator or TreeWalker.\n- Set and control the timeout for the rendering process.\n- Use MutationObserver to monitor DOM modifications.\n- Populate HTML documents with external data sources such as XML and JSON.\n- Support for both single (PDF, XPS) and multiple (image formats) output file streams.\n- Extract CSS styling information from HTML documents.\n- Configure a Sandbox environment that is independent of the execution machine, ensuring a secure and isolated environment for running and testing.\n\n## Read & Write Web Formats\n\n- Web: HTML, XHTML, MHTML\n- Other: SVG, MD (Markdown)\n\n## Save HTML As\n\n- Fixed Layout: PDF, XPS\n- Images: TIFF, JPEG, PNG, BMP, GIF\n\n## Read Formats\n\n- EPUB\n\n## Platform Independence\n\nAspose.HTML for Python via .NET can be used to develop applications for a vast range of operating systems, such as Windows, where Python 3.5 or later is installed. You can build both 32-bit and 64-bit Python applications.\n\n## Get Started\n\nAre you ready to give Aspose.HTML for Python via .NET a try? \n\nSimply run ```pip install aspose-html-net``` from the Console to fetch the package.\nIf you already have Aspose.HTML for Python via .NET and want to upgrade the version, please run ```pip install --upgrade aspose-html-net``` to get the latest version.\n\nYou can run the following snippets in your environment to see how Aspose.HTML works, or check out the [GitHub Repository](https://github.com/aspose-html/) or [Aspose.HTML for Python via .NET Documentation](https://docs.aspose.com/html/python-net/) for other common use cases.\n\n## Using Python to Convert HTML to Markdown (MD)\n\nAspose.HTML for Python via .NET allows you to convert HTML to PDF, XPS, Markdown, MHTML, PNG, JPEG, and other file formats. The following snippet demonstrates the conversion from HTML to GIT-based Markdown (MD) Format:\n\n```python\nimport os\nfrom aspose.html import *\nfrom aspose.html.converters import *\nfrom aspose.html.saving import *\n \n# Setup directories\noutput_folder = \"output/\"\ninput_folder = \"data/\"\nif not os.path.exists(output_folder):\n os.makedirs(output_folder)\n \n# Prepare an HTML code and save it to the file\ncode = \"<h1>Header 1</h1>\" \\\n \"<h2>Header 2</h2>\" \\\n \"<p>Hello World!!</p>\"\ndocument_path = os.path.join(input_folder, 'document.html')\noutput_path = os.path.join(output_folder, 'output.md')\nwith open(document_path, 'w', encoding=\"utf-8\") as f:\n f.write(code)\n f.close()\n # Call convert_html method to convert HTML to Markdown.\n Converter.convert_html(document_path, MarkdownSaveOptions.git, output_path)\n```\n\n[Product Page](https://products.aspose.com/html/python-net/) | [Docs](https://docs.aspose.com/html/python-net/) | [Demos](https://products.aspose.app/html/applications) | [API Reference](https://reference.aspose.com/html/python-net/) | [Examples](https://github.com/aspose-html/) | [Blog](https://blog.aspose.com/category/html/) | [Search](https://search.aspose.com/) | [Free Support](https://forum.aspose.com/c/html/29) | [Temporary License](https://purchase.aspose.com/temporary-license)\n\n",
"bugtrack_url": null,
"license": null,
"summary": "Aspose.HTML for Python via .NET is a powerful API for Python that provides a headless browser functionality, allowing you to work with HTML documents in a variety of ways. With this API, you can easily create new HTML documents or open existing ones from different sources. Once you have the document, you can perform various manipulation operations, such as removing and replacing HTML nodes.",
"version": "24.10.0",
"project_urls": {
"API Reference": "https://reference.aspose.com/html/python-net/",
"Blog": "https://blog.aspose.com/category/html/",
"Demos": "https://products.aspose.app/html/applications",
"Docs": "https://docs.aspose.com/html/python-net/",
"Examples": "https://github.com/aspose-html/",
"Free Support": "https://forum.aspose.com/c/html/29",
"Homepage": "https://products.aspose.com/html/python-net/",
"Release Notes": "https://releases.aspose.com/html/python-net/release-notes/2024/aspose-html-for-python-via-dotnet-24-10-release-notes/",
"Temporary License": "https://purchase.aspose.com/temporary-license"
},
"split_keywords": [
"aspose",
" aspose.html",
" aspose.total",
" python",
" component",
" conholdate",
" conholdate.total",
" convert",
" converter",
" vectorization",
" vectorizer",
" vectorize",
" encoding",
" base64",
" css",
" dom",
" dom-manipulation",
" image",
" jpeg",
" library",
" microsoft",
" mvc",
" net-standard",
" netcore",
" netstandard",
" to pdf",
" png",
" rotate",
" security",
" standard",
" xpath",
" xps",
" svg",
" windows",
" vector graphics",
" line",
" bezier curves",
" cubic",
" glyphs",
" paths",
" text",
" scaling",
" rotation",
" skewing",
" fill",
" dots",
" bitmap",
" html",
" html-to-image",
" html-to-pdf",
" html-to-xps",
" html-to-docx",
" html-to-mhtml",
" html-to-markdown",
" html-to-md",
" epub-to-image",
" epub-to-pdf",
" epub-to-xps",
" markdown-to-html",
" md-to-html",
" html-template",
" html-converter",
" html-renderer",
" svg-renderer",
" mhtml-renderer",
" epub-renderer",
" html5",
" html5-canvas",
" html-dom",
" html-form-editor",
" html",
" xhtml",
" mhtml",
" epub",
" svg",
" md",
" markdown",
" docx",
" pdf",
" xps",
" tiff",
" jpeg",
" jpg",
" png",
" bmp",
" gif",
" webp",
" ebook-api",
" headless-browser",
" document-manipulation",
" web-scraping",
" conversion",
" crawler",
" hypertext",
" markup",
" xpath",
" spider",
" css",
" css3",
" converter",
" dom",
" convert",
" azure",
" parse",
" render",
" image",
" graphics",
" to html-code",
" xhtml-to-pdf",
" mhtml-to-pdf",
" html-to-gif",
" mhtml-to-jpg"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "d99728f71ba8a345f64cddd11af77973b72ccb6fd9492cef453fc6e06157ba48",
"md5": "bfa5274227e02e620e27a94de414c570",
"sha256": "b51461409cc6891ee2a2aa26d3d66ad2bd933c2faa99aa2d561f0d043643bed8"
},
"downloads": -1,
"filename": "aspose_html_net-24.10.0-py3-none-manylinux1_x86_64.whl",
"has_sig": false,
"md5_digest": "bfa5274227e02e620e27a94de414c570",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<3.12,>=3.9",
"size": 45987703,
"upload_time": "2024-10-31T12:09:37",
"upload_time_iso_8601": "2024-10-31T12:09:37.345876Z",
"url": "https://files.pythonhosted.org/packages/d9/97/28f71ba8a345f64cddd11af77973b72ccb6fd9492cef453fc6e06157ba48/aspose_html_net-24.10.0-py3-none-manylinux1_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "c41b69eda39cefb6951edda1456d991f055fa5877eb5e195f34d05349f17eef1",
"md5": "8e4d72f9c3059cc19a9cf30b48d648fc",
"sha256": "25d8429ab9f1f9ff373d33db0f3862e1e5d3468a870137374f88d7cabb0c6a73"
},
"downloads": -1,
"filename": "aspose_html_net-24.10.0-py3-none-win32.whl",
"has_sig": false,
"md5_digest": "8e4d72f9c3059cc19a9cf30b48d648fc",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<3.12,>=3.9",
"size": 36295381,
"upload_time": "2024-10-31T12:09:50",
"upload_time_iso_8601": "2024-10-31T12:09:50.886857Z",
"url": "https://files.pythonhosted.org/packages/c4/1b/69eda39cefb6951edda1456d991f055fa5877eb5e195f34d05349f17eef1/aspose_html_net-24.10.0-py3-none-win32.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "4a6a72aabcfec3f1143db708c1d5699b2a7645129f648eab8fff9a16fa17b9ca",
"md5": "2de962bd63c80162a6a043eb95cf7b8e",
"sha256": "5525030cbe4459a3d01ee44f361cb5e92435e6a767b80b610f0dfe06780c931c"
},
"downloads": -1,
"filename": "aspose_html_net-24.10.0-py3-none-win_amd64.whl",
"has_sig": false,
"md5_digest": "2de962bd63c80162a6a043eb95cf7b8e",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<3.12,>=3.9",
"size": 40419385,
"upload_time": "2024-10-31T12:10:01",
"upload_time_iso_8601": "2024-10-31T12:10:01.116750Z",
"url": "https://files.pythonhosted.org/packages/4a/6a/72aabcfec3f1143db708c1d5699b2a7645129f648eab8fff9a16fa17b9ca/aspose_html_net-24.10.0-py3-none-win_amd64.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-10-31 12:09:37",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "aspose-html-net"
}