<div align="center">
# `NTLoss` - a regression-like loss for LLMs
[Paper](https://arxiv.org/abs/2411.02083) · [Project Page](https://tum-ai.github.io/number-token-loss/) · [Demo](https://huggingface.co/spaces/jannisborn/NumberTokenLoss) · [CI](https://github.com/AI4SD/number-token-loss/actions/workflows/ci.yaml) · [License](LICENSE) · [PyPI](https://badge.fury.io/py/ntloss) · [Downloads](https://pepy.tech/project/ntloss)
*`ntloss` is the PyPI package of the "Number Token Loss", a regression-like loss for language models that improves LLM performance on math tasks. It follows [Regress, Don't Guess (ICML 2025)](https://arxiv.org/abs/2411.02083).*
</div>
---
## 📖 Overview
This repo maintains the code for the `ntloss` [PyPI package](https://pypi.org/project/ntloss/); a short conceptual sketch of the loss follows the links below.
- 📄 **Paper source code**: [Regress, Don't Guess – A Regression-like Loss on Number Tokens for Language Models](https://github.com/tum-ai/number-token-loss)
- 📄 **Paper**: [Regress, Don't Guess – A Regression-like Loss on Number Tokens for Language Models](https://arxiv.org/abs/2411.02083)
- 🌐 **Project Page**: [https://tum-ai.github.io/number-token-loss/](https://tum-ai.github.io/number-token-loss/)
- 🎮 **Interactive Demo**: [https://huggingface.co/spaces/jannisborn/NumberTokenLoss](https://huggingface.co/spaces/jannisborn/NumberTokenLoss)
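
For intuition, here is a minimal conceptual sketch of the idea behind NTL (the NTL-MSE variant described in the paper): instead of only rewarding the exact number token via cross entropy, the loss compares the *expected numeric value* under the predicted distribution with the ground-truth value, so putting mass on "5" when the label is "4" is penalised far less than putting mass on "9". The function and variable names below are purely illustrative and are not the package's internal API; in training, this term is added on top of the usual cross-entropy loss.

```py
import torch
import torch.nn.functional as F

def ntl_mse_sketch(logits, number_token_ids, number_values, target_value):
    """Conceptual NTL-MSE for one position whose ground-truth token is a number.

    logits:            (vocab_size,) raw scores for a single sequence position
    number_token_ids:  (num_number_tokens,) vocab ids of the number tokens
    number_values:     (num_number_tokens,) numeric value of each number token
    target_value:      float, numeric value of the ground-truth token
    """
    # Probability mass on each number token (softmax over the full vocabulary)
    probs = F.softmax(logits, dim=-1)[number_token_ids]
    # Expected predicted number = dot product of probabilities and token values
    expected_value = (probs * number_values).sum()
    # Regression-style penalty: grows with the numeric distance to the label
    return (expected_value - target_value) ** 2

# Toy "vocabulary" of the digits 0..9: mass split between 4 and 5, label is 4
logits = torch.full((10,), -5.0)
logits[4], logits[5] = 2.0, 1.5
loss = ntl_mse_sketch(logits, torch.arange(10), torch.arange(10, dtype=torch.float), 4.0)
print(loss)  # small, because the expected value is close to 4
```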
## 🏃♂️ Quick Start
Simply install `ntloss` into your existing project:
```sh
uv add ntloss
pip install ntloss # if you are oldschool
```
Use it like this:
```py
from ntloss import NTLoss as NTL

# `tokenizer` is the tokenizer of the language model you are training
ntl = NTL(tokenizer=tokenizer)
loss = ntl(logits, labels)
```
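
A slightly more complete sketch of how NTL is typically combined with the standard cross-entropy loss during training. The tokenizer checkpoint, the weighting factor `lam`, and the tensor shapes (`logits` of shape `(batch, seq_len, vocab_size)`, `labels` as token ids of shape `(batch, seq_len)`) are assumptions for illustration, not requirements of the package:

```py
import torch
import torch.nn.functional as F
from transformers import AutoTokenizer
from ntloss import NTLoss

tokenizer = AutoTokenizer.from_pretrained("t5-small")  # any tokenizer; t5-small is just an example
ntl = NTLoss(tokenizer=tokenizer)

# Dummy tensors standing in for a model forward pass
batch, seq_len, vocab_size = 2, 16, len(tokenizer)
logits = torch.randn(batch, seq_len, vocab_size, requires_grad=True)
labels = torch.randint(0, vocab_size, (batch, seq_len))

ce_loss = F.cross_entropy(logits.view(-1, vocab_size), labels.view(-1))
nt_loss = ntl(logits, labels)

lam = 0.3  # assumed weighting factor; tune for your task
loss = ce_loss + lam * nt_loss
loss.backward()
```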
NOTE: `ntloss` is currently in an alpha, pre-release phase. Feedback & PRs are very welcome.
## 📝 Citation
If you use `ntloss`, please cite our paper:
```bibtex
@inproceedings{zausinger2025regress,
  title     = {Regress, Don't Guess – A Regression-like Loss on Number Tokens for Language Models},
  author    = {Jonas Zausinger and Lars Pennig and Anamarija Kozina and Sean Sdahl
               and Julian Sikora and Adrian Dendorfer and Timofey Kuznetsov
               and Mohamad Hagog and Nina Wiedemann and Kacper Chlodny
               and Vincent Limbach and Anna Ketteler and Thorben Prein
               and Vishwa Mohan Singh and Michael Danziger and Jannis Born},
  booktitle = {Proc. of the 42nd International Conference on Machine Learning (ICML)},
  year      = {2025},
  url       = {https://tum-ai.github.io/number-token-loss/}
}
```
## 📄 License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.