mcp-web-fetch-langgpt

Name	mcp-web-fetch-langgpt JSON
Version	0.6.3 JSON
	download
home_page	None
Summary	A Model Context Protocol server providing tools to fetch and convert web content for usage by LLMs (robots.txt checking removed)
upload_time	2025-07-13 01:16:30
maintainer	langgpt
docs_url	None
author	langgpt
requires_python	>=3.10
license	MIT
keywords	automation http llm mcp
VCS
bugtrack_url
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

            # MCP Web Fetch

A Model Context Protocol server that provides web content fetching capabilities **with robots.txt checking removed**. This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.

This is a modified version of the original mcp-server-fetch that removes all robots.txt checking, allowing unrestricted access to web content.

> [!CAUTION]
> This server can access local/internal IP addresses and may represent a security risk. Exercise caution when using this MCP server to ensure this does not expose any sensitive data. Additionally, this version ignores robots.txt restrictions which may violate some websites' access policies.

The fetch tool will truncate the response, but by using the `start_index` argument, you can specify where to start the content extraction. This lets models read a webpage in chunks, until they find the information they need.

## Available Tools

- `fetch` - Fetches a URL from the internet and extracts its contents as markdown.
    - `url` (string, required): URL to fetch
    - `max_length` (integer, optional): Maximum number of characters to return (default: 5000)
    - `start_index` (integer, optional): Start content from this character index (default: 0)
    - `raw` (boolean, optional): Get raw content without markdown conversion (default: false)

## Available Prompts

- **fetch**
  - Fetch a URL and extract its contents as markdown
  - Arguments:
    - `url` (string, required): URL to fetch

## Installation and Usage

### Local Development Setup

1. **Clone or download the source code:**
   ```bash
   git clone <repository-url>
   cd mcp-web-fetch
   ```

2. **Install dependencies using uv:**
   ```bash
   uv sync
   ```

3. **Test the server:**
   ```bash
   uv run python -m mcp_server_fetch --help
   ```

### Using with Claude Desktop (Local Source)

1. **Create Claude Desktop configuration:**
   ```json
   {
     "mcpServers": {
       "mcp-web-fetch": {
         "command": "uv",
         "args": [
           "run",
           "--directory",
           "/path/to/your/mcp-web-fetch",
           "python",
           "-m",
           "mcp_server_fetch"
         ]
       }
     }
   }
   ```

2. **Add configuration to Claude Desktop:**
   - macOS: `~/Library/Application Support/Claude/claude_desktop_config.json`
   - Windows: `%APPDATA%/Claude/claude_desktop_config.json`
   - Linux: `~/.config/Claude/claude_desktop_config.json`

3. **Restart Claude Desktop** to load the new server.

### Using with VS Code (Local Source)

Add to your VS Code settings or `.vscode/mcp.json`:

```json
{
  "mcp": {
    "servers": {
      "mcp-web-fetch": {
        "command": "uv",
        "args": [
          "run",
          "--directory",
          "/path/to/your/mcp-web-fetch",
          "python",
          "-m",
          "mcp_server_fetch"
        ]
      }
    }
  }
}
```

### Installation via Package Manager

#### Using uv (recommended)

When using [`uv`](https://docs.astral.sh/uv/) no specific installation is needed. We will
use [`uvx`](https://docs.astral.sh/uv/guides/tools/) to directly run *mcp-web-fetch*:

```bash
uvx mcp-web-fetch
```

#### Using pip

```bash
pip install mcp-web-fetch
```

After installation, run it as:
```bash
python -m mcp_server_fetch
```

### Package Manager Configuration

#### Claude Desktop with uvx

```json
{
  "mcpServers": {
    "mcp-web-fetch": {
      "command": "uvx",
      "args": ["mcp-web-fetch"]
    }
  }
}
```

#### VS Code with uvx

```json
{
  "mcp": {
    "servers": {
      "mcp-web-fetch": {
        "command": "uvx",
        "args": ["mcp-web-fetch"]
      }
    }
  }
}
```

## Development

### Setting up Development Environment

1. **Install development dependencies:**
   ```bash
   uv sync --dev
   ```

2. **Run linting and type checking:**
   ```bash
   uv run ruff check
   uv run pyright
   ```

3. **Build the package:**
   ```bash
   uv build
   ```

### Testing

Test the server locally:
```bash
uv run python -m mcp_server_fetch
```

Use the MCP inspector for debugging:
```bash
npx @modelcontextprotocol/inspector uv run python -m mcp_server_fetch
```

### Making Changes

1. Edit the source code in `src/mcp_server_fetch/`
2. Test your changes with `uv run python -m mcp_server_fetch`
3. Update version in `pyproject.toml` if needed
4. Run tests and linting

## Publishing

### Publishing to PyPI

1. **Build the package:**
   ```bash
   uv build
   ```

2. **Publish to PyPI:**
   ```bash
   uv publish
   ```
   
   Or using twine:
   ```bash
   pip install twine
   twine upload dist/*
   ```

### Publishing to GitHub

1. **Initialize git repository (if not already done):**
   ```bash
   git init
   git branch -m main
   ```

2. **Add and commit files:**
   ```bash
   git add .
   git commit -m "Initial commit: MCP Web Fetch server without robots.txt checking"
   ```

3. **Create GitHub repository and push:**
   ```bash
   # Create repository on GitHub first, then:
   git remote add origin https://github.com/langgpt/mcp-web-fetch.git
   git push -u origin main
   ```

4. **Create a release on GitHub:**
   - Go to your repository on GitHub
   - Click "Releases" → "Create a new release"
   - Tag version: `v0.6.3`
   - Release title: `v0.6.3 - MCP Web Fetch`
   - Describe your changes
   - Publish release

### Building Docker Image

```bash
docker build -t mcp-web-fetch .
docker tag mcp-web-fetch langgpt/mcp-web-fetch:latest
docker push langgpt/mcp-web-fetch:latest
```

## Customization

### robots.txt

**This version has robots.txt checking completely removed.** All web requests will proceed regardless of robots.txt restrictions.

### User-agent

By default, depending on if the request came from the model (via a tool), or was user initiated (via a prompt), the
server will use either the user-agent:
```
ModelContextProtocol/1.0 (Autonomous; +https://github.com/modelcontextprotocol/servers)
```
or:
```
ModelContextProtocol/1.0 (User-Specified; +https://github.com/modelcontextprotocol/servers)
```

This can be customized by adding the argument `--user-agent=YourUserAgent` to the `args` list in the configuration.

### Proxy

The server can be configured to use a proxy by using the `--proxy-url` argument.

## Debugging

You can use the MCP inspector to debug the server:

For local development:
```bash
npx @modelcontextprotocol/inspector uv run python -m mcp_server_fetch
```

For uvx installations:
```bash
npx @modelcontextprotocol/inspector uvx mcp-web-fetch
```

## Contributing

This is a modified version of the original mcp-server-fetch. For contributing to the original project, see:
https://github.com/modelcontextprotocol/servers

For this modified version, please submit issues and pull requests to this repository.

## License

mcp-web-fetch is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "mcp-web-fetch-langgpt",
    "maintainer": "langgpt",
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": null,
    "keywords": "automation, http, llm, mcp",
    "author": "langgpt",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/b4/ba/d9e00421874af1e5fd0d487ddf8f7069e17ea070bbc680754e179a03612c/mcp_web_fetch_langgpt-0.6.3.tar.gz",
    "platform": null,
    "description": "# MCP Web Fetch\n\nA Model Context Protocol server that provides web content fetching capabilities **with robots.txt checking removed**. This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.\n\nThis is a modified version of the original mcp-server-fetch that removes all robots.txt checking, allowing unrestricted access to web content.\n\n> [!CAUTION]\n> This server can access local/internal IP addresses and may represent a security risk. Exercise caution when using this MCP server to ensure this does not expose any sensitive data. Additionally, this version ignores robots.txt restrictions which may violate some websites' access policies.\n\nThe fetch tool will truncate the response, but by using the `start_index` argument, you can specify where to start the content extraction. This lets models read a webpage in chunks, until they find the information they need.\n\n## Available Tools\n\n- `fetch` - Fetches a URL from the internet and extracts its contents as markdown.\n    - `url` (string, required): URL to fetch\n    - `max_length` (integer, optional): Maximum number of characters to return (default: 5000)\n    - `start_index` (integer, optional): Start content from this character index (default: 0)\n    - `raw` (boolean, optional): Get raw content without markdown conversion (default: false)\n\n## Available Prompts\n\n- **fetch**\n  - Fetch a URL and extract its contents as markdown\n  - Arguments:\n    - `url` (string, required): URL to fetch\n\n## Installation and Usage\n\n### Local Development Setup\n\n1. **Clone or download the source code:**\n   ```bash\n   git clone <repository-url>\n   cd mcp-web-fetch\n   ```\n\n2. **Install dependencies using uv:**\n   ```bash\n   uv sync\n   ```\n\n3. **Test the server:**\n   ```bash\n   uv run python -m mcp_server_fetch --help\n   ```\n\n### Using with Claude Desktop (Local Source)\n\n1. **Create Claude Desktop configuration:**\n   ```json\n   {\n     \"mcpServers\": {\n       \"mcp-web-fetch\": {\n         \"command\": \"uv\",\n         \"args\": [\n           \"run\",\n           \"--directory\",\n           \"/path/to/your/mcp-web-fetch\",\n           \"python\",\n           \"-m\",\n           \"mcp_server_fetch\"\n         ]\n       }\n     }\n   }\n   ```\n\n2. **Add configuration to Claude Desktop:**\n   - macOS: `~/Library/Application Support/Claude/claude_desktop_config.json`\n   - Windows: `%APPDATA%/Claude/claude_desktop_config.json`\n   - Linux: `~/.config/Claude/claude_desktop_config.json`\n\n3. **Restart Claude Desktop** to load the new server.\n\n### Using with VS Code (Local Source)\n\nAdd to your VS Code settings or `.vscode/mcp.json`:\n\n```json\n{\n  \"mcp\": {\n    \"servers\": {\n      \"mcp-web-fetch\": {\n        \"command\": \"uv\",\n        \"args\": [\n          \"run\",\n          \"--directory\",\n          \"/path/to/your/mcp-web-fetch\",\n          \"python\",\n          \"-m\",\n          \"mcp_server_fetch\"\n        ]\n      }\n    }\n  }\n}\n```\n\n### Installation via Package Manager\n\n#### Using uv (recommended)\n\nWhen using [`uv`](https://docs.astral.sh/uv/) no specific installation is needed. We will\nuse [`uvx`](https://docs.astral.sh/uv/guides/tools/) to directly run *mcp-web-fetch*:\n\n```bash\nuvx mcp-web-fetch\n```\n\n#### Using pip\n\n```bash\npip install mcp-web-fetch\n```\n\nAfter installation, run it as:\n```bash\npython -m mcp_server_fetch\n```\n\n### Package Manager Configuration\n\n#### Claude Desktop with uvx\n\n```json\n{\n  \"mcpServers\": {\n    \"mcp-web-fetch\": {\n      \"command\": \"uvx\",\n      \"args\": [\"mcp-web-fetch\"]\n    }\n  }\n}\n```\n\n#### VS Code with uvx\n\n```json\n{\n  \"mcp\": {\n    \"servers\": {\n      \"mcp-web-fetch\": {\n        \"command\": \"uvx\",\n        \"args\": [\"mcp-web-fetch\"]\n      }\n    }\n  }\n}\n```\n\n## Development\n\n### Setting up Development Environment\n\n1. **Install development dependencies:**\n   ```bash\n   uv sync --dev\n   ```\n\n2. **Run linting and type checking:**\n   ```bash\n   uv run ruff check\n   uv run pyright\n   ```\n\n3. **Build the package:**\n   ```bash\n   uv build\n   ```\n\n### Testing\n\nTest the server locally:\n```bash\nuv run python -m mcp_server_fetch\n```\n\nUse the MCP inspector for debugging:\n```bash\nnpx @modelcontextprotocol/inspector uv run python -m mcp_server_fetch\n```\n\n### Making Changes\n\n1. Edit the source code in `src/mcp_server_fetch/`\n2. Test your changes with `uv run python -m mcp_server_fetch`\n3. Update version in `pyproject.toml` if needed\n4. Run tests and linting\n\n## Publishing\n\n### Publishing to PyPI\n\n1. **Build the package:**\n   ```bash\n   uv build\n   ```\n\n2. **Publish to PyPI:**\n   ```bash\n   uv publish\n   ```\n   \n   Or using twine:\n   ```bash\n   pip install twine\n   twine upload dist/*\n   ```\n\n### Publishing to GitHub\n\n1. **Initialize git repository (if not already done):**\n   ```bash\n   git init\n   git branch -m main\n   ```\n\n2. **Add and commit files:**\n   ```bash\n   git add .\n   git commit -m \"Initial commit: MCP Web Fetch server without robots.txt checking\"\n   ```\n\n3. **Create GitHub repository and push:**\n   ```bash\n   # Create repository on GitHub first, then:\n   git remote add origin https://github.com/langgpt/mcp-web-fetch.git\n   git push -u origin main\n   ```\n\n4. **Create a release on GitHub:**\n   - Go to your repository on GitHub\n   - Click \"Releases\" \u2192 \"Create a new release\"\n   - Tag version: `v0.6.3`\n   - Release title: `v0.6.3 - MCP Web Fetch`\n   - Describe your changes\n   - Publish release\n\n### Building Docker Image\n\n```bash\ndocker build -t mcp-web-fetch .\ndocker tag mcp-web-fetch langgpt/mcp-web-fetch:latest\ndocker push langgpt/mcp-web-fetch:latest\n```\n\n## Customization\n\n### robots.txt\n\n**This version has robots.txt checking completely removed.** All web requests will proceed regardless of robots.txt restrictions.\n\n### User-agent\n\nBy default, depending on if the request came from the model (via a tool), or was user initiated (via a prompt), the\nserver will use either the user-agent:\n```\nModelContextProtocol/1.0 (Autonomous; +https://github.com/modelcontextprotocol/servers)\n```\nor:\n```\nModelContextProtocol/1.0 (User-Specified; +https://github.com/modelcontextprotocol/servers)\n```\n\nThis can be customized by adding the argument `--user-agent=YourUserAgent` to the `args` list in the configuration.\n\n### Proxy\n\nThe server can be configured to use a proxy by using the `--proxy-url` argument.\n\n## Debugging\n\nYou can use the MCP inspector to debug the server:\n\nFor local development:\n```bash\nnpx @modelcontextprotocol/inspector uv run python -m mcp_server_fetch\n```\n\nFor uvx installations:\n```bash\nnpx @modelcontextprotocol/inspector uvx mcp-web-fetch\n```\n\n## Contributing\n\nThis is a modified version of the original mcp-server-fetch. For contributing to the original project, see:\nhttps://github.com/modelcontextprotocol/servers\n\nFor this modified version, please submit issues and pull requests to this repository.\n\n## License\n\nmcp-web-fetch is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "A Model Context Protocol server providing tools to fetch and convert web content for usage by LLMs (robots.txt checking removed)",
    "version": "0.6.3",
    "project_urls": null,
    "split_keywords": [
        "automation",
        " http",
        " llm",
        " mcp"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "420d6bdbd6047d81d1f8d4791f52392605cc5678df9db9bff8b75dd99e9ce325",
                "md5": "688841e102b486c2bf240c3f77c59842",
                "sha256": "ef2cf3aa3ae8db2d4d8715c78c717bcfbeffbfc870caddfb2900affe98c2457c"
            },
            "downloads": -1,
            "filename": "mcp_web_fetch_langgpt-0.6.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "688841e102b486c2bf240c3f77c59842",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 8208,
            "upload_time": "2025-07-13T01:16:28",
            "upload_time_iso_8601": "2025-07-13T01:16:28.995593Z",
            "url": "https://files.pythonhosted.org/packages/42/0d/6bdbd6047d81d1f8d4791f52392605cc5678df9db9bff8b75dd99e9ce325/mcp_web_fetch_langgpt-0.6.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "b4bad9e00421874af1e5fd0d487ddf8f7069e17ea070bbc680754e179a03612c",
                "md5": "91b16ac632d049ea550c5ad061aaa730",
                "sha256": "45064b31b7bd8a888a9ea9441ee6e8dbec838a7f8c84a09fd5dff2672cdaf80a"
            },
            "downloads": -1,
            "filename": "mcp_web_fetch_langgpt-0.6.3.tar.gz",
            "has_sig": false,
            "md5_digest": "91b16ac632d049ea550c5ad061aaa730",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 41765,
            "upload_time": "2025-07-13T01:16:30",
            "upload_time_iso_8601": "2025-07-13T01:16:30.709692Z",
            "url": "https://files.pythonhosted.org/packages/b4/ba/d9e00421874af1e5fd0d487ddf8f7069e17ea070bbc680754e179a03612c/mcp_web_fetch_langgpt-0.6.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-07-13 01:16:30",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "mcp-web-fetch-langgpt"
}

langgpt