article-scraper-mcp


Namearticle-scraper-mcp JSON
Version 1.0.0 PyPI version JSON
download
home_pageNone
SummaryMCP server that fetches article data (title, text, author, date) from a URL using newspaper3k
upload_time2025-08-20 21:21:53
maintainerNone
docs_urlNone
authorNone
requires_python>=3.11
licenseMIT
keywords article-extraction mcp model-context-protocol news newspaper3k scraper
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Article Scraper MCP

A Model Context Protocol (MCP) server that fetches article data from URLs using newspaper3k.

## Features

- Extract article title, text, author, and publication date
- Robust error handling and URL validation
- Structured data output
- Built with FastMCP for easy integration

## Installation

Install directly from PyPI:

```bash
uvx article-scraper-mcp
```

## Usage

Add to your MCP client configuration:

```json
{
  "mcpServers": {
    "article-scraper": {
      "command": "uvx",
      "args": ["article-scraper-mcp"]
    }
  }
}
```

## API

### `fetch_article(url: str) -> dict[str, Any]`

Fetches and parses a news article from the given URL.

**Parameters:**
- `url`: The URL of the news article to fetch

**Returns:**
A dictionary containing:
- `title`: Article title
- `text`: Article content text
- `author`: Author name(s) (may be None)
- `date`: Publication date in ISO format (may be None)

**Raises:**
- `ValueError`: If URL is invalid or article cannot be parsed
- `requests.RequestException`: If HTTP request fails

## Requirements

- Python 3.11+
- newspaper3k
- requests
- loguru
- mcp[cli]

## License

MIT

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "article-scraper-mcp",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.11",
    "maintainer_email": "Dmitrii K <dmitriik@protonmail.com>",
    "keywords": "article-extraction, mcp, model-context-protocol, news, newspaper3k, scraper",
    "author": null,
    "author_email": "Dmitrii K <dmitriik@protonmail.com>",
    "download_url": "https://files.pythonhosted.org/packages/30/f1/d87ac63cc6e0b433a5d5c062cef230cf07f55b637d1ad6eaa1b9d0c384aa/article_scraper_mcp-1.0.0.tar.gz",
    "platform": null,
    "description": "# Article Scraper MCP\n\nA Model Context Protocol (MCP) server that fetches article data from URLs using newspaper3k.\n\n## Features\n\n- Extract article title, text, author, and publication date\n- Robust error handling and URL validation\n- Structured data output\n- Built with FastMCP for easy integration\n\n## Installation\n\nInstall directly from PyPI:\n\n```bash\nuvx article-scraper-mcp\n```\n\n## Usage\n\nAdd to your MCP client configuration:\n\n```json\n{\n  \"mcpServers\": {\n    \"article-scraper\": {\n      \"command\": \"uvx\",\n      \"args\": [\"article-scraper-mcp\"]\n    }\n  }\n}\n```\n\n## API\n\n### `fetch_article(url: str) -> dict[str, Any]`\n\nFetches and parses a news article from the given URL.\n\n**Parameters:**\n- `url`: The URL of the news article to fetch\n\n**Returns:**\nA dictionary containing:\n- `title`: Article title\n- `text`: Article content text\n- `author`: Author name(s) (may be None)\n- `date`: Publication date in ISO format (may be None)\n\n**Raises:**\n- `ValueError`: If URL is invalid or article cannot be parsed\n- `requests.RequestException`: If HTTP request fails\n\n## Requirements\n\n- Python 3.11+\n- newspaper3k\n- requests\n- loguru\n- mcp[cli]\n\n## License\n\nMIT\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "MCP server that fetches article data (title, text, author, date) from a URL using newspaper3k",
    "version": "1.0.0",
    "project_urls": {
        "Changelog": "https://github.com/dmitriiweb/article-scraper-mcp/blob/main/CHANGELOG.md",
        "Documentation": "https://github.com/dmitriiweb/article-scraper-mcp#readme",
        "Homepage": "https://github.com/dmitriiweb/article-scraper-mcp",
        "Issues": "https://github.com/dmitriiweb/article-scraper-mcp/issues",
        "Repository": "https://github.com/dmitriiweb/article-scraper-mcp"
    },
    "split_keywords": [
        "article-extraction",
        " mcp",
        " model-context-protocol",
        " news",
        " newspaper3k",
        " scraper"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "4c327b43817cf157701438f0815339c755182690a3f05a9be1ccb9ae98b6ac64",
                "md5": "d7ba52be1cf3632eadbdb9bf2be07015",
                "sha256": "8a5eb3fb01b6a8eaf8aa24604ed34de973768012a1aab9227f20542b73e5d87b"
            },
            "downloads": -1,
            "filename": "article_scraper_mcp-1.0.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "d7ba52be1cf3632eadbdb9bf2be07015",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.11",
            "size": 5046,
            "upload_time": "2025-08-20T21:21:52",
            "upload_time_iso_8601": "2025-08-20T21:21:52.422276Z",
            "url": "https://files.pythonhosted.org/packages/4c/32/7b43817cf157701438f0815339c755182690a3f05a9be1ccb9ae98b6ac64/article_scraper_mcp-1.0.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "30f1d87ac63cc6e0b433a5d5c062cef230cf07f55b637d1ad6eaa1b9d0c384aa",
                "md5": "2da083cc93e0beb282dd5bfed9260d7e",
                "sha256": "8472faf8b417bb922458b1fb0b68f5c488b476ab932092227656c38a744d37ef"
            },
            "downloads": -1,
            "filename": "article_scraper_mcp-1.0.0.tar.gz",
            "has_sig": false,
            "md5_digest": "2da083cc93e0beb282dd5bfed9260d7e",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.11",
            "size": 5723,
            "upload_time": "2025-08-20T21:21:53",
            "upload_time_iso_8601": "2025-08-20T21:21:53.877634Z",
            "url": "https://files.pythonhosted.org/packages/30/f1/d87ac63cc6e0b433a5d5c062cef230cf07f55b637d1ad6eaa1b9d0c384aa/article_scraper_mcp-1.0.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-08-20 21:21:53",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "dmitriiweb",
    "github_project": "article-scraper-mcp",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "article-scraper-mcp"
}
        
Elapsed time: 1.23725s