Name | Version | Summary | date |
mime-files-reader |
0.2.1 |
A tool to process various file types (images, PDFs, audio) using Google Generative AI |
2025-07-21 00:01:59 |
attachments |
0.21.0 |
The Python funnel for LLM context - turn any file into model-ready text + images, in one line. |
2025-07-14 03:31:59 |
roboml |
0.3.1 |
Machine learning models optimized for robotics experimentation and deployment |
2025-07-10 08:31:52 |
maestro |
1.0.0 |
Streamline the fine-tuning process for vision-language models like PaliGemma 2, Florence-2, and Qwen2.5-VL. |
2025-02-05 08:42:32 |
jetson-examples |
0.2.4 |
Running Gen AI models and applications on NVIDIA Jetson devices with one-line command |
2025-02-05 03:23:23 |
odysee |
1.0.2 |
High-performance quantum-inspired multimodal memory system with adaptive routing and distributed processing capabilities |
2025-01-30 17:53:05 |
exordium |
1.4.1 |
Collection of utility tools and deep learning methods. |
2025-01-10 09:27:44 |
nexusml |
0.1.0b0 |
A multimodal AutoML platform for classification and regression tasks |
2024-12-05 17:32:05 |
blinklinmult |
1.0.1 |
BlinkLinMulT: Transformer-based Eye Blink Detection. |
2024-11-26 10:40:45 |
symile |
0.1.0 |
Symile |
2024-11-05 19:25:20 |
personalitylinmult |
1.0.0 |
PersonalityLinMulT: Transformer-based Big Five Automatic Personality Perception. |
2024-11-05 14:08:23 |
stark-qa |
0.1.3 |
Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases |
2024-10-24 20:01:23 |
shira-audio |
0.1.29 |
audio search/retrieval library |
2024-10-21 14:02:02 |
lavis-gml |
1.0.2.post5 |
LAVIS - A One-stop Library for Language-Vision Intelligence |
2024-10-17 22:22:03 |
multimodal-transformers |
0.4.0 |
Multimodal Extension Library for PyTorch HuggingFace Transformers |
2024-09-24 19:14:50 |
cornac |
2.2.2 |
A Comparative Framework for Multimodal Recommender Systems |
2024-08-15 06:52:45 |
ofen |
0.0.1 |
Making transformers production ready |
2024-08-13 00:19:53 |
ammico-lavis |
1.0.2.3 |
LAVIS - A One-stop Library for Language-Vision Intelligence |
2024-06-12 09:34:56 |
mexca |
1.0.4 |
Emotion expression capture from multiple modalities. |
2024-05-01 12:27:38 |
gradio-awsbr-mmchatbot |
0.0.4 |
This component enables multi-modal input for the Anthropic Claude v3 suite of models available from Amazon Bedrock |
2024-04-16 02:58:17 |