Name | mcp-server-multi-fetch |
Version | 0.8.1 |
home_page | None |
Summary | A Model Context Protocol server providing tools to fetch and convert web content for usage by LLMs |
upload_time | 2025-08-30 02:10:38 |
maintainer | None |
docs_url | None |
author | Alex Yang |
requires_python | >=3.10 |
license | MIT |
keywords | automation, http, llm, mcp |
VCS | |
bugtrack_url | |
requirements | No requirements were recorded. |
# Multi Fetch MCP Server
This project is based on the [Fetch MCP Server](https://github.com/modelcontextprotocol/servers/tree/main/src/fetch) by Anthropic.
This fork replaces direct HTTP fetching with the Firecrawl Python SDK. Set up your Firecrawl API key to enable web scraping via Firecrawl.
A Model Context Protocol server that provides web content fetching capabilities. This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.
The fetch tool will truncate the response, but by using the `start_index` argument, you can specify where to start the content extraction. This lets models read a webpage in chunks, until they find the information they need.
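The chunked reading described above can be sketched as follows. This is a minimal illustration, not the server's code: `fetch_chunk` is a hypothetical stand-in that slices an in-memory string the way `start_index` and `max_length` are described, so no network access or Firecrawl key is needed.

```python
def fetch_chunk(content: str, start_index: int = 0, max_length: int = 50000) -> str:
    """Hypothetical stand-in for the `fetch` tool: return one window of a page."""
    return content[start_index:start_index + max_length]


def read_in_chunks(content: str, max_length: int = 50000):
    """Yield successive chunks by advancing `start_index` until nothing is left."""
    start_index = 0
    while True:
        chunk = fetch_chunk(content, start_index, max_length)
        if not chunk:
            break
        yield chunk
        # The next request starts where this chunk ended.
        start_index += len(chunk)


page = "x" * 120  # pretend this is the markdown of a fetched page
chunks = list(read_in_chunks(page, max_length=50))
print([len(c) for c in chunks])
```

A model would typically stop requesting chunks as soon as it finds the information it needs, rather than reading to the end as this loop does.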
### Available Tools
- `fetch` - Fetches a URL from the internet and extracts its contents as markdown.
    - `url` (string, required): URL to fetch
    - `max_length` (integer, optional): Maximum number of characters to return (default: 50000)
    - `start_index` (integer, optional): Start content from this character index (default: 0)
    - `raw` (boolean, optional): Get raw content without markdown conversion (default: false)
- `fetch_multi` - Fetches multiple URLs concurrently and returns an array of results. Input is an array of objects, each with:
    - `url` (string, required): URL to fetch
    - `max_length` (integer, optional): Maximum number of characters to return (default: 50000)
    - `start_index` (integer, optional): Start content from this character index (default: 0)
    - `raw` (boolean, optional): Get raw content without markdown conversion (default: false)
- `search` - Searches the web using the Firecrawl search API and scrapes results in markdown and link formats by default.
    - `query` (string, required): Search query string
    - `limit` (integer, optional): Maximum number of results to return (default: 10)
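To make the `fetch_multi` input shape concrete, the sketch below fills in the documented defaults for a list of request objects. `FetchRequest` and `normalize_requests` are hypothetical names used only for illustration; how the server itself validates its input is not shown in this README.

```python
from dataclasses import dataclass


@dataclass
class FetchRequest:
    """One entry of a `fetch_multi` input array, with the documented defaults."""
    url: str
    max_length: int = 50000
    start_index: int = 0
    raw: bool = False


def normalize_requests(items: list[dict]) -> list[FetchRequest]:
    """Turn plain dicts into FetchRequest objects, rejecting entries without a URL."""
    requests = []
    for item in items:
        if "url" not in item:
            raise ValueError("each request needs a 'url'")
        requests.append(FetchRequest(**item))
    return requests


batch = normalize_requests([
    {"url": "https://example.com"},
    {"url": "https://example.org", "max_length": 1000, "raw": True},
])
print(batch)
```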
### Prompts
- **fetch**
    - Fetch a URL and extract its contents as markdown
    - Arguments:
        - `url` (string, required): URL to fetch
- **search**
    - Search the web using the Firecrawl search API
    - Arguments:
        - `query` (string, required): Search query string
        - `limit` (integer, optional): Maximum number of results to return (default: 10)
## Installation
Install the Firecrawl SDK and configure your API key before running the server:
```bash
# Install the MCP server and Firecrawl SDK
pip install mcp-server-multi-fetch firecrawl-py
# Set your Firecrawl API key (required)
export FIRECRAWL_API_KEY="fc-YOUR_API_KEY"
# Optionally, override the Firecrawl API endpoint via env or CLI
export FIRECRAWL_API_URL="https://api.firecrawl.dev"
# or
mcp-server-multi-fetch --api-url https://api.firecrawl.dev
```
Optionally, install Node.js. When it is available, the fetch server uses a different, more robust HTML simplifier.
### Using uv (recommended)
When using [`uv`](https://docs.astral.sh/uv/), no specific installation is needed. We will
use [`uvx`](https://docs.astral.sh/uv/guides/tools/) to run *mcp-server-multi-fetch* directly.
## Configuration
### Configure for Claude.app
Add to your Claude settings:
<details>
<summary>Using uvx</summary>
```json
"mcpServers": {
  "fetch": {
    "command": "uvx",
    "args": ["mcp-server-multi-fetch"]
  }
}
```
</details>
### Customization - robots.txt
By default, the server obeys a website's robots.txt file when the request comes from the model (via a tool), but not when
the request is user-initiated (via a prompt). To disable this check, add the argument `--ignore-robots-txt` to the
`args` list in the configuration.
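The decision described above can be sketched with Python's standard `urllib.robotparser`. This illustrates the rule only and is not necessarily how the server checks robots.txt: `may_fetch` and its parameters are hypothetical, and the robots.txt content is passed in as lines so the example runs offline.

```python
from urllib.robotparser import RobotFileParser

# User-agent string the README lists for model-initiated requests.
AUTONOMOUS_UA = (
    "ModelContextProtocol/1.0 "
    "(Autonomous; +https://github.com/modelcontextprotocol/servers)"
)


def may_fetch(url: str, robots_lines: list[str], *, from_tool: bool,
              ignore_robots_txt: bool = False) -> bool:
    """Apply robots.txt only to tool (model) requests, unless the check is disabled."""
    if ignore_robots_txt or not from_tool:
        return True
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(AUTONOMOUS_UA, url)


rules = ["User-agent: *", "Disallow: /private/"]
print(may_fetch("https://example.com/private/a", rules, from_tool=True))
print(may_fetch("https://example.com/private/a", rules, from_tool=False))
```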
### Customization - User-agent
By default, depending on whether the request came from the model (via a tool) or was user-initiated (via a prompt), the
server will use either the user-agent
```
ModelContextProtocol/1.0 (Autonomous; +https://github.com/modelcontextprotocol/servers)
```
or
```
ModelContextProtocol/1.0 (User-Specified; +https://github.com/modelcontextprotocol/servers)
```
This can be customized by adding the argument `--user-agent=YourUserAgent` to the `args` list in the configuration.
### Customization - Proxy
The server supports HTTP(S) and SOCKS5 proxies via the `--proxy-url` argument. For example:
```bash
# HTTP proxy
mcp-server-multi-fetch --proxy-url http://192.168.1.1:8080
# SOCKS5 proxy
mcp-server-multi-fetch --proxy-url socks5://192.168.1.1:8080
```
Proxy handling is provided by the Firecrawl Python SDK rather than by the server itself.
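Before passing a value to `--proxy-url`, you may want to check that it uses one of the supported schemes. The helper below is a hypothetical pre-flight check written with the standard library; it is not part of the server or of the Firecrawl SDK.

```python
from urllib.parse import urlparse

# Schemes the README lists as supported: HTTP(S) and SOCKS5.
SUPPORTED_SCHEMES = {"http", "https", "socks5"}


def validate_proxy_url(proxy_url: str) -> str:
    """Return the URL unchanged if it has a supported scheme and a host."""
    parsed = urlparse(proxy_url)
    if parsed.scheme not in SUPPORTED_SCHEMES:
        raise ValueError(f"unsupported proxy scheme: {parsed.scheme!r}")
    if not parsed.hostname:
        raise ValueError("proxy URL is missing a host")
    return proxy_url


print(validate_proxy_url("socks5://192.168.1.1:8080"))
```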
### Customization - Firecrawl API URL
The SDK endpoint can be overridden without environment variables using `--api-url`:
```bash
mcp-server-multi-fetch --api-url https://api.firecrawl.dev
```
## Debugging
You can use the MCP inspector to debug the server. For uvx installations:
```
npx @modelcontextprotocol/inspector uvx mcp-server-multi-fetch
```
Or if you've installed the package in a specific directory or are developing on it:
```
cd path/to/servers/src/fetch
npx @modelcontextprotocol/inspector uv run mcp-server-multi-fetch
```
## License
mcp-server-multi-fetch is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.
Raw data
{
"_id": null,
"home_page": null,
"name": "mcp-server-multi-fetch",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.10",
"maintainer_email": "Alex Yang <i@alexyang.me>",
"keywords": "automation, http, llm, mcp",
"author": "Alex Yang",
"author_email": null,
"download_url": "https://files.pythonhosted.org/packages/ce/bc/a35468734a5c94c1b8b1a19cfaecc433ecbfc4cbca9a0638bf0877e56bc6/mcp_server_multi_fetch-0.8.1.tar.gz",
"platform": null,
"bugtrack_url": null,
"license": "MIT",
"summary": "A Model Context Protocol server providing tools to fetch and convert web content for usage by LLMs",
"version": "0.8.1",
"project_urls": null,
"split_keywords": [
"automation",
" http",
" llm",
" mcp"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "e188d0e783209764ed75ca0aede7609b47598a930ae779319ac54e9df052ef9c",
"md5": "18729f8354ae7065086634659e0cc5ad",
"sha256": "a8f1c4672d2be7b94368fdcf8abc1227cc2d88611dc7d9ac5272abeeda37d64d"
},
"downloads": -1,
"filename": "mcp_server_multi_fetch-0.8.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "18729f8354ae7065086634659e0cc5ad",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.10",
"size": 9968,
"upload_time": "2025-08-30T02:10:37",
"upload_time_iso_8601": "2025-08-30T02:10:37.248303Z",
"url": "https://files.pythonhosted.org/packages/e1/88/d0e783209764ed75ca0aede7609b47598a930ae779319ac54e9df052ef9c/mcp_server_multi_fetch-0.8.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "cebca35468734a5c94c1b8b1a19cfaecc433ecbfc4cbca9a0638bf0877e56bc6",
"md5": "834315a6ff4077f1d459a2e07eef61b1",
"sha256": "9ae5f89883c897a062b289631897fbd450e18063f67ea4920aaf4eed44e8f376"
},
"downloads": -1,
"filename": "mcp_server_multi_fetch-0.8.1.tar.gz",
"has_sig": false,
"md5_digest": "834315a6ff4077f1d459a2e07eef61b1",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.10",
"size": 95357,
"upload_time": "2025-08-30T02:10:38",
"upload_time_iso_8601": "2025-08-30T02:10:38.815711Z",
"url": "https://files.pythonhosted.org/packages/ce/bc/a35468734a5c94c1b8b1a19cfaecc433ecbfc4cbca9a0638bf0877e56bc6/mcp_server_multi_fetch-0.8.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-08-30 02:10:38",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "mcp-server-multi-fetch"
}