xinfer

Name: xinfer
Version: 0.3.2
Summary: Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm, ultralytics, vllm, ollama and your custom model.
Upload time: 2024-11-20 15:48:48
Requires Python: >=3.10
License: Apache Software License 2.0
Keywords: xinfer
[python_badge]: https://img.shields.io/badge/Python-3.10+-brightgreen?style=for-the-badge&logo=python&logoColor=white
[pypi_badge]: https://img.shields.io/pypi/v/xinfer.svg?style=for-the-badge&logo=pypi&logoColor=white&label=PyPI&color=blue
[downloads_badge]: https://img.shields.io/pepy/dt/xinfer.svg?style=for-the-badge&logo=pypi&logoColor=white&label=Downloads&color=purple
[license_badge]: https://img.shields.io/badge/License-Apache%202.0-green.svg?style=for-the-badge&logo=apache&logoColor=white
[transformers_badge]: https://img.shields.io/github/stars/huggingface/transformers?style=for-the-badge&logo=huggingface&label=Transformers%20⭐&color=yellow
[timm_badge]: https://img.shields.io/github/stars/huggingface/pytorch-image-models?style=for-the-badge&logo=pytorch&label=TIMM%20⭐&color=limegreen
[ultralytics_badge]: https://img.shields.io/github/stars/ultralytics/ultralytics?style=for-the-badge&logo=udacity&label=Ultralytics%20⭐&color=red
[vllm_badge]: https://img.shields.io/github/stars/vllm-project/vllm?style=for-the-badge&logo=v&label=vLLM%20⭐&color=purple
[ollama_badge]: https://img.shields.io/github/stars/ollama/ollama?style=for-the-badge&logo=ollama&label=Ollama%20⭐&color=darkgreen
[colab_badge]: https://img.shields.io/badge/Open%20In-Colab-blue?style=for-the-badge&logo=google-colab
[kaggle_badge]: https://img.shields.io/badge/Open%20In-Kaggle-blue?style=for-the-badge&logo=kaggle
[back_to_top_badge]: https://img.shields.io/badge/Back_to_Top-↑-blue?style=for-the-badge
[image_classification_badge]: https://img.shields.io/badge/Image%20Classification-6366f1?style=for-the-badge
[object_detection_badge]: https://img.shields.io/badge/Object%20Detection-8b5cf6?style=for-the-badge
[image_captioning_badge]: https://img.shields.io/badge/Image%20Captioning-a855f7?style=for-the-badge
[vqa_badge]: https://img.shields.io/badge/Visual%20QA-d946ef?style=for-the-badge
[os_badge]: https://img.shields.io/badge/Tested%20on-Linux%20%7C%20macOS%20%7C%20Windows-indigo?style=for-the-badge&logo=iterm2&logoColor=white&color=indigo
[pose_estimation_badge]: https://img.shields.io/badge/Pose%20Estimation-ec4899?style=for-the-badge
[instance_segmentation_badge]: https://img.shields.io/badge/Instance%20Segmentation-f43f5e?style=for-the-badge


![Python][python_badge]
[![PyPI version][pypi_badge]](https://pypi.org/project/xinfer/)
[![Downloads][downloads_badge]](https://pypi.org/project/xinfer/)
![License][license_badge]
![OS Support][os_badge]


<div align="center">
    <img src="https://raw.githubusercontent.com/dnth/x.infer/refs/heads/main/assets/xinfer.jpg" alt="x.infer" width="500"/>
    <img src="https://raw.githubusercontent.com/dnth/x.infer/refs/heads/main/assets/code_typing.gif" alt="x.infer" width="500"/>
    <br />
    <br />
    <a href="https://dnth.github.io/x.infer" target="_blank" rel="noopener noreferrer"><strong>Explore the docs »</strong></a>
    <br />
    <a href="#-quickstart" target="_blank" rel="noopener noreferrer">Quickstart</a>
    ·
    <a href="https://github.com/dnth/x.infer/issues/new?assignees=&labels=Feature+Request&projects=&template=feature_request.md" target="_blank" rel="noopener noreferrer">Feature Request</a>
    ·
    <a href="https://github.com/dnth/x.infer/issues/new?assignees=&labels=bug&projects=&template=bug_report.md" target="_blank" rel="noopener noreferrer">Report Bug</a>
    ·
    <a href="https://github.com/dnth/x.infer/discussions" target="_blank" rel="noopener noreferrer">Discussions</a>
    ·
    <a href="https://dicksonneoh.com/" target="_blank" rel="noopener noreferrer">About</a>
</div>

<div align="center">
    <br />
    <table>
        <tr>
            <td align="center">
                <a href="#-key-features">🌟 Features</a>
            </td>
            <td align="center">
                <a href="#-why-xinfer">🤔 Why x.infer?</a>
            </td>
            <td align="center">
                <a href="#-quickstart">🚀 Quickstart</a>
            </td>
            <td align="center">
                <a href="#-installation">📦 Installation</a>
            </td>
        </tr>
        <tr>
            <td align="center">
                <a href="#%EF%B8%8F-usage">🛠️ Usage</a>
            </td>
            <td align="center">
                <a href="#-supported-models">🤖 Models</a>
            </td>
            <td align="center">
                <a href="#-contributing">🤝 Contributing</a>
            </td>
            <td align="center">
                <a href="#%EF%B8%8F-disclaimer">⚠️ Disclaimer</a>
            </td>
        </tr>
    </table>
</div>

## 🌟 Key Features
<div align="center">
  <img src="https://raw.githubusercontent.com/dnth/x.infer/refs/heads/main/assets/flowchart.gif" alt="x.infer" width="850"/>
</div>


✅ Run inference with 1000+ models in 3 lines of code. \
✅ List and search models interactively. \
✅ Launch a Gradio interface to interact with a model. \
✅ Serve model as a REST API endpoint with Ray Serve and FastAPI. \
✅ OpenAI chat completions API compatible. \
✅ Customize and add your own models with minimal code changes.

Tasks supported:

![Image Classification][image_classification_badge]
![Object Detection][object_detection_badge]
![Image Captioning][image_captioning_badge]
![Visual QA][vqa_badge]
![Pose Estimation][pose_estimation_badge]
![Instance Segmentation][instance_segmentation_badge]

## 🤔 Why x.infer?
So, a new computer vision model just dropped last night. It's called `GPT-54o-mini-vision-pro-max-xxxl`. It's a super cool model, open-source, open-weights, open-data, all the good stuff.

You're excited. You want to try it out. 

But it's written in a new framework, `TyPorch`, that you know nothing about.
You don't want to spend a weekend learning `TyPorch` just to find out the model isn't what you expected.

This is where x.infer comes in. 

x.infer is a simple wrapper that allows you to run inference with any computer vision model in just a few lines of code. All in Python.

Out of the box, x.infer supports the following frameworks:

[![Transformers][transformers_badge]](https://github.com/huggingface/transformers)
[![TIMM][timm_badge]](https://github.com/huggingface/pytorch-image-models)
[![Ultralytics][ultralytics_badge]](https://github.com/ultralytics/ultralytics)
[![vLLM][vllm_badge]](https://github.com/vllm-project/vllm)
[![Ollama][ollama_badge]](https://github.com/ollama/ollama)

Combined, x.infer supports 1000+ models across all of the above frameworks.



Run any supported model in just a few lines of code:

```python
import xinfer

model = xinfer.create_model("vikhyatk/moondream2")
model.infer(image, prompt)         # Run single inference
model.infer_batch(images, prompts) # Run batch inference
model.launch_gradio()              # Launch Gradio interface
```

Have a custom model? Create a class that implements the `BaseXInferModel` interface and register it with x.infer. See [Add Your Own Model](#add-your-own-model) for more details.

## 🚀 Quickstart

Here's a quick example demonstrating how to use x.infer with a Transformers model:

[![Open In Colab][colab_badge]](https://colab.research.google.com/github/dnth/x.infer/blob/main/nbs/quickstart.ipynb)
[![Open In Kaggle][kaggle_badge]](https://kaggle.com/kernels/welcome?src=https://github.com/dnth/x.infer/blob/main/nbs/quickstart.ipynb)

```python
import xinfer

model = xinfer.create_model("vikhyatk/moondream2")

image = "https://raw.githubusercontent.com/dnth/x.infer/main/assets/demo/00aa2580828a9009.jpg"
prompt = "Describe this image."

model.infer(image, prompt)

>>> 'A parade with a marching band and a flag-bearing figure passes through a town, with spectators lining the street and a church steeple visible in the background.'
```

## 📦 Installation
> [!IMPORTANT]
> You must have [PyTorch](https://pytorch.org/get-started/locally/) installed to use x.infer.

To install the barebones x.infer (without any optional dependencies), run:
```bash
pip install xinfer
```
Each supported framework is available as an optional dependency. Install one or more of the following:

```bash
pip install "xinfer[transformers]"
pip install "xinfer[ultralytics]"
pip install "xinfer[timm]"
pip install "xinfer[vllm]"
pip install "xinfer[ollama]"
```

To install all optional dependencies, run:
```bash
pip install "xinfer[all]"
```

To install from a local directory, run:
```bash
git clone https://github.com/dnth/x.infer.git
cd x.infer
pip install -e .
```

## 🛠️ Usage

### List Models

```python
xinfer.list_models()
```

```
                                    Available Models                                      
┏━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━┓
┃ Implementation ┃ Model ID                                              ┃ Input --> Output     ┃
┡━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━┩
│ timm           │ timm/eva02_large_patch14_448.mim_m38m_ft_in22k_in1k   │ image --> categories │
│ timm           │ timm/eva02_large_patch14_448.mim_m38m_ft_in1k         │ image --> categories │
│ timm           │ timm/eva02_large_patch14_448.mim_in22k_ft_in22k_in1k  │ image --> categories │
│ timm           │ timm/eva02_large_patch14_448.mim_in22k_ft_in1k        │ image --> categories │
│ timm           │ timm/eva02_base_patch14_448.mim_in22k_ft_in22k_in1k   │ image --> categories │
│ timm           │ timm/eva02_base_patch14_448.mim_in22k_ft_in1k         │ image --> categories │
│ timm           │ timm/eva02_small_patch14_336.mim_in22k_ft_in1k        │ image --> categories │
│ timm           │ timm/eva02_tiny_patch14_336.mim_in22k_ft_in1k         │ image --> categories │
│ transformers   │ Salesforce/blip2-opt-6.7b-coco                        │ image-text --> text  │
│ transformers   │ Salesforce/blip2-flan-t5-xxl                          │ image-text --> text  │
│ transformers   │ Salesforce/blip2-opt-6.7b                             │ image-text --> text  │
│ transformers   │ Salesforce/blip2-opt-2.7b                             │ image-text --> text  │
│ transformers   │ fancyfeast/llama-joycaption-alpha-two-hf-llava        │ image-text --> text  │
│ transformers   │ vikhyatk/moondream2                                   │ image-text --> text  │
│ transformers   │ sashakunitsyn/vlrm-blip2-opt-2.7b                     │ image-text --> text  │
│ ultralytics    │ ultralytics/yolov8x                                   │ image --> boxes      │
│ ultralytics    │ ultralytics/yolov8m                                   │ image --> boxes      │
│ ultralytics    │ ultralytics/yolov8l                                   │ image --> boxes      │
│ ultralytics    │ ultralytics/yolov8s                                   │ image --> boxes      │
│ ultralytics    │ ultralytics/yolov8n                                   │ image --> boxes      │
│ ultralytics    │ ultralytics/yolov8n-seg                               │ image --> masks      │
│ ultralytics    │ ultralytics/yolov8n-pose                              │ image --> poses      │
│ ...            │ ...                                                   │ ...                  │
│ ...            │ ...                                                   │ ...                  │
└────────────────┴───────────────────────────────────────────────────────┴──────────────────────┘
```

If you're running in a Jupyter Notebook environment, you can specify `interactive=True` to list and search supported models interactively.
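For example, in a notebook cell:

```python
import xinfer

# Renders the model table as an interactive, searchable widget (Jupyter only)
xinfer.list_models(interactive=True)
```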



https://github.com/user-attachments/assets/d51cf707-2001-478c-881a-ae27f690d1bc



### Gradio Interface

For all supported models, you can launch a Gradio interface to interact with the model. This is useful for quickly testing the model and visualizing the results.

Once the model is created, you can launch the Gradio interface with the following line of code:

```python
model.launch_gradio()
```


https://github.com/user-attachments/assets/25ce31f3-c9e2-4934-b341-000a6d1b7dc4


If you'd like to launch a Gradio interface with all models available in a dropdown, you can use the following line of code:

```python
xinfer.launch_gradio_demo()
```


https://github.com/user-attachments/assets/bd46f55a-573f-45b9-910f-e22bee27fd3d



See [Gradio Demo](./nbs/gradio_demo.ipynb) for more details.

### Serve Model
If you're happy with your model, you can serve it with x.infer. 

```python
xinfer.serve_model("vikhyatk/moondream2")
```

This will start a FastAPI server at `http://localhost:8000` powered by [Ray Serve](https://docs.ray.io/en/latest/serve/index.html), allowing you to interact with your model through a REST API.



https://github.com/user-attachments/assets/cd3925f8-ffcb-4890-8a34-13ee5f6884f1




You can also specify deployment options such as the number of replicas, GPU requirements, and the host/port.

```python
xinfer.serve_model(
    "vikhyatk/moondream2",
    device="cuda",
    dtype="float16",
    host="0.0.0.0",
    port=8000,
    deployment_kwargs={
        "num_replicas": 1, 
        "ray_actor_options": {"num_gpus": 1}
    }
)
```
### FastAPI Endpoint
You can now query the endpoint with an image and prompt.

```bash
curl -X 'POST' \
  'http://127.0.0.1:8000/infer' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "image": "https://raw.githubusercontent.com/dnth/x.infer/main/assets/demo/00aa2580828a9009.jpg",
  "infer_kwargs": {"text": "Caption this image"}
}'
```

Or in Python:

```python
import requests

url = "http://127.0.0.1:8000/infer"
headers = {
    "accept": "application/json",
    "Content-Type": "application/json"
}
payload = {
    "image": "https://raw.githubusercontent.com/dnth/x.infer/main/assets/demo/00aa2580828a9009.jpg",
    "infer_kwargs": {
        "text": "Caption this image"
    }
}

response = requests.post(url, headers=headers, json=payload)
print(response.json())
```

### OpenAI Chat Completions API
The x.infer endpoint is also compatible with the OpenAI chat completions API format.

You'll have to install the `openai` package to use this feature.

```bash
pip install openai
```

```python
from openai import OpenAI

client = OpenAI(
    api_key="dummy",
    base_url="http://127.0.0.1:8000/v1"
)

response = client.chat.completions.create(
    model="vikhyatk/moondream2",
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "image_url",
                    "image_url": "https://raw.githubusercontent.com/dnth/x.infer/main/assets/demo/00aa2580828a9009.jpg"
                },
                {
                    "type": "text",
                    "text": "Caption this image"
                }
            ]
        }
    ]
)

print(response.choices[0].message.content)
```


### Add Your Own Model

+ **Step 1:** Create a new model class that implements the `BaseXInferModel` interface.

+ **Step 2:** Implement the required abstract methods `load_model`, `infer`, and `infer_batch`.

+ **Step 3:** Decorate your class with the `register_model` decorator, specifying the model ID, implementation, and input/output.

For example:
```python
@register_model("my-model", "custom", ModelInputOutput.IMAGE_TEXT_TO_TEXT)
class MyModel(BaseXInferModel):
    def load_model(self):
        # Load your model here
        pass

    def infer(self, image, prompt):
        # Run single inference 
        pass

    def infer_batch(self, images, prompts):
        # Run batch inference here
        pass
```
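Once registered, your model can be created and used like any built-in one. A minimal sketch, reusing the `"my-model"` ID from the decorator above and the demo image from the quickstart:

```python
import xinfer

# The registered ID now works with the standard factory
model = xinfer.create_model("my-model")

image = "https://raw.githubusercontent.com/dnth/x.infer/main/assets/demo/00aa2580828a9009.jpg"
model.infer(image, "Caption this image")
```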

See an example implementation of the Molmo model [here](https://github.com/dnth/x.infer/blob/main/xinfer/vllm/molmo.py).







## 🤖 Supported Models


<details>
<summary>Transformers</summary>

    <table>
        <thead>
            <tr>
                <th>Model</th>
                <th>Usage</th>
            </tr>
        </thead>
        <tbody>
            <tr>
                <td><a href="https://huggingface.co/collections/Salesforce/blip2-models-65242f91b4c4b4a32e5cb652">BLIP2 Series</a></td>
                <td><pre lang="python"><code>xinfer.create_model("Salesforce/blip2-opt-2.7b")</code></pre></td>
            </tr>
            <tr>
                <td><a href="https://github.com/vikhyat/moondream">Moondream2</a></td>
                <td><pre lang="python"><code>xinfer.create_model("vikhyatk/moondream2")</code></pre></td>
            </tr>
            <tr>
                <td><a href="https://huggingface.co/sashakunitsyn/vlrm-blip2-opt-2.7b">VLRM-BLIP2</a></td>
                <td><pre lang="python"><code>xinfer.create_model("sashakunitsyn/vlrm-blip2-opt-2.7b")</code></pre></td>
            </tr>
            <tr>
                <td><a href="https://github.com/fpgaminer/joycaption">JoyCaption</a></td>
                <td><pre lang="python"><code>xinfer.create_model("fancyfeast/llama-joycaption-alpha-two-hf-llava")</code></pre></td>
            </tr>
            <tr>
                <td><a href="https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct">Llama-3.2 Vision Series</a></td>
                <td><pre lang="python"><code>xinfer.create_model("meta-llama/Llama-3.2-11B-Vision-Instruct")</code></pre></td>
            </tr>
            <tr>
                <td><a href="https://huggingface.co/microsoft/Florence-2-base-ft">Florence-2 Series</a></td>
                <td><pre lang="python"><code>xinfer.create_model("microsoft/Florence-2-base-ft")</code></pre></td>
            </tr>
            <tr>
                <td><a href="https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct">Qwen2-VL Series</a></td>
                <td><pre lang="python"><code>xinfer.create_model("Qwen/Qwen2-VL-2B-Instruct")</code></pre></td>
            </tr>
        </tbody>
    </table>



You can also load any [AutoModelForVision2Seq model](https://huggingface.co/docs/transformers/main/en/model_doc/auto#transformers.AutoModelForVision2Seq) 
from Transformers by using the `Vision2SeqModel` class.

```python
from xinfer.transformers import Vision2SeqModel

model = Vision2SeqModel("facebook/chameleon-7b")
model = xinfer.create_model(model)
```
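The wrapped model then exposes the usual x.infer interface. A quick sketch, reusing the demo image from the quickstart:

```python
image = "https://raw.githubusercontent.com/dnth/x.infer/main/assets/demo/00aa2580828a9009.jpg"
model.infer(image, "Describe this image.")
```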

</details>

<details>
<summary>TIMM</summary>

All models from [TIMM](https://github.com/huggingface/pytorch-image-models) fine-tuned for ImageNet 1k are supported.

For example, to load a `resnet18.a1_in1k` model:
```python
xinfer.create_model("timm/resnet18.a1_in1k")
```

You can also load any model (or a custom timm model) by using the `TimmModel` class.

```python
from xinfer.timm import TimmModel

model = TimmModel("resnet18")
model = xinfer.create_model(model)
```
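The wrapped classifier follows the `image --> categories` signature shown by `list_models()`. A sketch, assuming classification models take an image without a prompt:

```python
image = "https://raw.githubusercontent.com/dnth/x.infer/main/assets/demo/00aa2580828a9009.jpg"
model.infer(image)  # assumption: image-only input for timm classifiers
```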

</details>

<details>
<summary>Ultralytics</summary>

<table>
    <thead>
        <tr>
            <th>Model</th>
            <th>Usage</th>
        </tr>
    </thead>
    <tbody>
        <tr>
            <td><a href="https://github.com/ultralytics/ultralytics">YOLOv8 Detection Series</a></td>
            <td><pre lang="python"><code>xinfer.create_model("ultralytics/yolov8n")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://github.com/ultralytics/ultralytics">YOLOv10 Detection Series</a></td>
            <td><pre lang="python"><code>xinfer.create_model("ultralytics/yolov10x")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://github.com/ultralytics/ultralytics">YOLOv11 Detection Series</a></td>
            <td><pre lang="python"><code>xinfer.create_model("ultralytics/yolov11s")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://github.com/ultralytics/ultralytics">YOLOv8 Classification Series</a></td>
            <td><pre lang="python"><code>xinfer.create_model("ultralytics/yolov8n-cls")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://github.com/ultralytics/ultralytics">YOLOv11 Classification Series</a></td>
            <td><pre lang="python"><code>xinfer.create_model("ultralytics/yolov11s-cls")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://github.com/ultralytics/ultralytics">YOLOv8 Segmentation Series</a></td>
            <td><pre lang="python"><code>xinfer.create_model("ultralytics/yolov8n-seg")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://github.com/ultralytics/ultralytics">YOLOv8 Pose Series</a></td>
            <td><pre lang="python"><code>xinfer.create_model("ultralytics/yolov8n-pose")</code></pre></td>
        </tr>
    </tbody>
</table>


You can also load any model from Ultralytics by using the `UltralyticsModel` class.

```python
from xinfer.ultralytics import UltralyticsModel

model = UltralyticsModel("yolov5n6u")
model = xinfer.create_model(model)
```

</details>

<details>
<summary>vLLM</summary>

<table>
    <thead>
        <tr>
            <th>Model</th>
            <th>Usage</th>
        </tr>
    </thead>
    <tbody>
        <tr>
            <td><a href="https://huggingface.co/allenai/Molmo-72B-0924">Molmo-72B</a></td>
            <td><pre lang="python"><code>xinfer.create_model("vllm/allenai/Molmo-72B-0924")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://huggingface.co/allenai/Molmo-7B-D-0924">Molmo-7B-D</a></td>
            <td><pre lang="python"><code>xinfer.create_model("vllm/allenai/Molmo-7B-D-0924")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://huggingface.co/allenai/Molmo-7B-O-0924">Molmo-7B-O</a></td>
            <td><pre lang="python"><code>xinfer.create_model("vllm/allenai/Molmo-7B-O-0924")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://huggingface.co/microsoft/Phi-3.5-vision-instruct">Phi-3.5-vision-instruct</a></td>
            <td><pre lang="python"><code>xinfer.create_model("vllm/microsoft/Phi-3.5-vision-instruct")</code></pre></td>
        </tr>
        <tr>
            <td><a href="https://huggingface.co/microsoft/Phi-3-vision-128k-instruct">Phi-3-vision-128k-instruct</a></td>
            <td><pre lang="python"><code>xinfer.create_model("vllm/microsoft/Phi-3-vision-128k-instruct")</code></pre></td>
        </tr>
    </tbody>
</table>

</details>

<details>
<summary>Ollama</summary>

To use Ollama models, you'll need to install Ollama on your machine. See the [Ollama Installation Guide](https://ollama.com/download) for more details.

<table>
    <thead>
        <tr>
            <th>Model</th>
            <th>Usage</th>
        </tr>
    </thead>
    <tbody>
        <tr>
            <td><a href="https://github.com/ollama/ollama">LLaVA Phi3</a></td>
            <td><pre lang="python"><code>xinfer.create_model("ollama/llava-phi3")</code></pre></td>
        </tr>
    </tbody>
</table>
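
Inference then follows the same pattern as the other backends. A minimal sketch, assuming the Ollama server is running locally and the model has been pulled (e.g. `ollama pull llava-phi3`):

```python
import xinfer

model = xinfer.create_model("ollama/llava-phi3")

image = "https://raw.githubusercontent.com/dnth/x.infer/main/assets/demo/00aa2580828a9009.jpg"
model.infer(image, "Caption this image")
```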

</details>



## 🤝 Contributing

If you'd like to contribute, here are some ways you can help:

1. **Add new models:** Implement new model classes following the steps in the [Add Your Own Model](#add-your-own-model) section.

2. **Improve documentation:** Help us enhance our documentation, including this README, inline code comments, and the [official docs](https://dnth.github.io/x.infer).

3. **Report bugs:** If you find a bug, please [open an issue](https://github.com/dnth/x.infer/issues/new?assignees=&labels=bug&projects=&template=bug_report.md) with a clear description and steps to reproduce.

4. **Suggest enhancements:** Have ideas for new features? [Open a feature request](https://github.com/dnth/x.infer/issues/new?assignees=&labels=Feature+Request&projects=&template=feature_request.md).

5. **Financial support:** Please consider sponsoring the project to support continued development.

Please also see the code of conduct [here](./CODE_OF_CONDUCT.md).
Thank you for helping make x.infer better!

## ⚠️ Disclaimer

x.infer is not affiliated with any of the libraries it supports. It is a simple wrapper that allows you to run inference with any of the supported models.

Although x.infer is Apache 2.0 licensed, the models it supports may have their own licenses. Please check the individual model repositories for more details. 

<div align="center">
    <img src="https://raw.githubusercontent.com/dnth/x.infer/refs/heads/main/assets/github_banner.png" alt="x.infer" width="600"/>
    <br />
    <br />
    <a href="https://dnth.github.io/x.infer" target="_blank" rel="noopener noreferrer"><strong>Explore the docs »</strong></a>
    <br />
    <a href="#quickstart" target="_blank" rel="noopener noreferrer">Quickstart</a>
    ·
    <a href="https://github.com/dnth/x.infer/issues/new?assignees=&labels=Feature+Request&projects=&template=feature_request.md" target="_blank" rel="noopener noreferrer">Feature Request</a>
    ·
    <a href="https://github.com/dnth/x.infer/issues/new?assignees=&labels=bug&projects=&template=bug_report.md" target="_blank" rel="noopener noreferrer">Report Bug</a>
    ·
    <a href="https://github.com/dnth/x.infer/discussions" target="_blank" rel="noopener noreferrer">Discussions</a>
    ·
    <a href="https://dicksonneoh.com/" target="_blank" rel="noopener noreferrer">About</a>
</div>



<div align="right">
    <br />
    <a href="#top"><img src="https://img.shields.io/badge/Back_to_Top-↑-blue?style=for-the-badge" alt="Back to Top" /></a>
</div>











            
