pptagent


Namepptagent JSON
Version 0.2.14 PyPI version JSON
download
home_pageNone
SummaryPPTAgent, a tool for utilizing LLMs to generate PowerPoint presentations from documents.
upload_time2025-10-29 09:19:23
maintainerNone
docs_urlNone
authorNone
requires_python>=3.11
licenseNone
keywords presentation powerpoint ppt llm ai document-processing
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            <div align="right">
  <details>
    <summary >🌐 Language</summary>
    <div>
      <div align="center">
        <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=en">English</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=zh-CN">简体中文</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=zh-TW">繁體中文</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=ja">日本語</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=ko">한국어</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=hi">हिन्दी</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=th">ไทย</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=fr">Français</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=de">Deutsch</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=es">Español</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=it">Italiano</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=ru">Русский</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=pt">Português</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=nl">Nederlands</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=pl">Polski</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=ar">العربية</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=fa">فارسی</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=tr">Türkçe</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=vi">Tiếng Việt</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=id">Bahasa Indonesia</a>
        | <a href="https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=as">অসমীয়া</
      </div>
    </div>
  </details>
</div>

# PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides
<p align="center">
  📄 <a href="https://arxiv.org/abs/2501.03936" target="_blank">Paper</a> &nbsp; | &nbsp;
  🤗 <a href="#open-source-" target="_blank">OpenSource</a> &nbsp; | &nbsp;
  📝 <a href="./DOC.md" target="_blank">Documentation</a> &nbsp; | &nbsp;
  <a href="https://deepwiki.com/icip-cas/PPTAgent" target="_blank"><img src="https://deepwiki.com/icon.png" alt="Ask DeepWiki"> DeepWiki</a> &nbsp; | &nbsp;
  🙏 <a href="#citation-" target="_blank">Citation</a>
</p>

We present PPTAgent, an innovative system that automatically generates presentations from documents. Drawing inspiration from human presentation creation methods, our system employs a two-step process to ensure excellence in overall quality. Additionally, we introduce **PPTEval**, a comprehensive evaluation framework that assesses presentations across multiple dimensions.

> [!TIP]
> 🚀 Get started quickly with our pre-built Docker image - [See Docker instructions](DOC.md/#docker-)

## 📅 News

- [2025/09]: 🛠️ We support MCP server now, see [MCP Server](./DOC.md#mcp-server-) for details
- [2025/09]: 🚀 Released v2 with major improvements - see [release notes](https://github.com/icip-cas/PPTAgent/releases/tag/v0.2.0) for details
- [2025/08]: 🎉 Paper accepted to **EMNLP 2025**!
- [2025/05]: ✨ Released v1 with core functionality and 🌟 breakthrough: reached 1,000 stars on GitHub! - see [release notes](https://github.com/icip-cas/PPTAgent/releases/tag/v0.1.0) for details
- [2025/01]: 🔓 Open-sourced the codebase, with experimental code archived at [experiment release](https://github.com/icip-cas/PPTAgent/releases/tag/experiment)

## Open Source 🤗
We have released our model and data at [HuggingFace](https://huggingface.co/collections/ICIP/pptagent-68b80af43b4f4e0cb14d0bb2).

## Demo Video 🎥

https://github.com/user-attachments/assets/c3935a98-4d2b-4c46-9b36-e7c598d14863

## Distinctive Features ✨

- **Dynamic Content Generation**: Creates slides with seamlessly integrated text and images
- **Smart Reference Learning**: Leverages existing presentations without requiring manual annotation
- **Comprehensive Quality Assessment**: Evaluates presentations through multiple quality metrics

## Case Study 💡

- #### [Iphone 16 Pro](https://www.apple.com/iphone-16-pro/)

<div style="display: flex; flex-wrap: wrap; gap: 10px;">

  <img src="resource/iphone16pro/0001.jpg" alt="图片1" width="200"/>

  <img src="resource/iphone16pro/0002.jpg" alt="图片2" width="200"/>

  <img src="resource/iphone16pro/0003.jpg" alt="图片3" width="200"/>

  <img src="resource/iphone16pro/0004.jpg" alt="图片4" width="200"/>

  <img src="resource/iphone16pro/0005.jpg" alt="图片5" width="200"/>

  <img src="resource/iphone16pro/0006.jpg" alt="图片6" width="200"/>

  <img src="resource/iphone16pro/0007.jpg" alt="图片7" width="200"/>

</div>

- #### [Build Effective Agents](https://www.anthropic.com/research/building-effective-agents)

<div style="display: flex; flex-wrap: wrap; gap: 10px;">

  <img src="resource/build_effective_agents/0001.jpg" alt="图片1" width="200"/>

  <img src="resource/build_effective_agents/0002.jpg" alt="图片2" width="200"/>

  <img src="resource/build_effective_agents/0003.jpg" alt="图片3" width="200"/>

  <img src="resource/build_effective_agents/0004.jpg" alt="图片4" width="200"/>

  <img src="resource/build_effective_agents/0005.jpg" alt="图片5" width="200"/>

  <img src="resource/build_effective_agents/0006.jpg" alt="图片6" width="200"/>

  <img src="resource/build_effective_agents/0007.jpg" alt="图片7" width="200"/>

  <img src="resource/build_effective_agents/0008.jpg" alt="图片8" width="200"/>

<img src="resource/build_effective_agents/0009.jpg" alt="图片9" width="200"/>

<img src="resource/build_effective_agents/0010.jpg" alt="图片10" width="200"/>

</div>

## PPTAgent 🤖

PPTAgent follows a two-phase approach:
1. **Analysis Phase**: Extracts and learns from patterns in reference presentations
2. **Generation Phase**: Develops structured outlines and produces visually cohesive slides

Our system's workflow is illustrated below:


![PPTAgent Workflow](resource/fig2.jpg)

## PPTEval ⚖️

PPTEval evaluates presentations across three dimensions:
- **Content**: Check the accuracy and relevance of the slides.
- **Design**: Assesses the visual appeal and consistency.
- **Coherence**: Ensures the logical flow of ideas.

The workflow of PPTEval is shown below:
<p align="center">
<img src="resource/fig3.jpg" alt="PPTEval Workflow" style="width:70%;"/>
</p>


## Citation 🙏

If you find this project helpful, please use the following to cite it:
```bibtex
@article{zheng2025pptagent,
  title={PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides},
  author={Zheng, Hao and Guan, Xinyan and Kong, Hao and Zheng, Jia and Zhou, Weixiang and Lin, Hongyu and Lu, Yaojie and He, Ben and Han, Xianpei and Sun, Le},
  journal={arXiv preprint arXiv:2501.03936},
  year={2025}
}
```

[![Star History Chart](https://api.star-history.com/svg?repos=icip-cas/PPTAgent&type=Date)](https://star-history.com/#icip-cas/PPTAgent&Date)

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "pptagent",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.11",
    "maintainer_email": null,
    "keywords": "presentation, powerpoint, ppt, llm, ai, document-processing",
    "author": null,
    "author_email": "Hao Zheng <wszh712811@gmail.com>",
    "download_url": "https://files.pythonhosted.org/packages/f0/2a/ace69eace520cffdda88dcd436f6d3e85c46952fbd5dd122941afa68e349/pptagent-0.2.14.tar.gz",
    "platform": null,
    "description": "<div align=\"right\">\n  <details>\n    <summary >\ud83c\udf10 Language</summary>\n    <div>\n      <div align=\"center\">\n        <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=en\">English</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=zh-CN\">\u7b80\u4f53\u4e2d\u6587</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=zh-TW\">\u7e41\u9ad4\u4e2d\u6587</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=ja\">\u65e5\u672c\u8a9e</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=ko\">\ud55c\uad6d\uc5b4</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=hi\">\u0939\u093f\u0928\u094d\u0926\u0940</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=th\">\u0e44\u0e17\u0e22</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=fr\">Fran\u00e7ais</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=de\">Deutsch</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=es\">Espa\u00f1ol</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=it\">Italiano</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=ru\">\u0420\u0443\u0441\u0441\u043a\u0438\u0439</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=pt\">Portugu\u00eas</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=nl\">Nederlands</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=pl\">Polski</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=ar\">\u0627\u0644\u0639\u0631\u0628\u064a\u0629</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=fa\">\u0641\u0627\u0631\u0633\u06cc</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=tr\">T\u00fcrk\u00e7e</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=vi\">Ti\u1ebfng Vi\u1ec7t</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=id\">Bahasa Indonesia</a>\n        | <a href=\"https://openaitx.github.io/view.html?user=icip-cas&project=PPTAgent&lang=as\">\u0985\u09b8\u09ae\u09c0\u09af\u09bc\u09be</\n      </div>\n    </div>\n  </details>\n</div>\n\n# PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides\n<p align=\"center\">\n  \ud83d\udcc4 <a href=\"https://arxiv.org/abs/2501.03936\" target=\"_blank\">Paper</a> &nbsp; | &nbsp;\n  \ud83e\udd17 <a href=\"#open-source-\" target=\"_blank\">OpenSource</a> &nbsp; | &nbsp;\n  \ud83d\udcdd <a href=\"./DOC.md\" target=\"_blank\">Documentation</a> &nbsp; | &nbsp;\n  <a href=\"https://deepwiki.com/icip-cas/PPTAgent\" target=\"_blank\"><img src=\"https://deepwiki.com/icon.png\" alt=\"Ask DeepWiki\"> DeepWiki</a> &nbsp; | &nbsp;\n  \ud83d\ude4f <a href=\"#citation-\" target=\"_blank\">Citation</a>\n</p>\n\nWe present PPTAgent, an innovative system that automatically generates presentations from documents. Drawing inspiration from human presentation creation methods, our system employs a two-step process to ensure excellence in overall quality. Additionally, we introduce **PPTEval**, a comprehensive evaluation framework that assesses presentations across multiple dimensions.\n\n> [!TIP]\n> \ud83d\ude80 Get started quickly with our pre-built Docker image - [See Docker instructions](DOC.md/#docker-)\n\n## \ud83d\udcc5 News\n\n- [2025/09]: \ud83d\udee0\ufe0f We support MCP server now, see [MCP Server](./DOC.md#mcp-server-) for details\n- [2025/09]: \ud83d\ude80 Released v2 with major improvements - see [release notes](https://github.com/icip-cas/PPTAgent/releases/tag/v0.2.0) for details\n- [2025/08]: \ud83c\udf89 Paper accepted to **EMNLP 2025**!\n- [2025/05]: \u2728 Released v1 with core functionality and \ud83c\udf1f breakthrough: reached 1,000 stars on GitHub! - see [release notes](https://github.com/icip-cas/PPTAgent/releases/tag/v0.1.0) for details\n- [2025/01]: \ud83d\udd13 Open-sourced the codebase, with experimental code archived at [experiment release](https://github.com/icip-cas/PPTAgent/releases/tag/experiment)\n\n## Open Source \ud83e\udd17\nWe have released our model and data at [HuggingFace](https://huggingface.co/collections/ICIP/pptagent-68b80af43b4f4e0cb14d0bb2).\n\n## Demo Video \ud83c\udfa5\n\nhttps://github.com/user-attachments/assets/c3935a98-4d2b-4c46-9b36-e7c598d14863\n\n## Distinctive Features \u2728\n\n- **Dynamic Content Generation**: Creates slides with seamlessly integrated text and images\n- **Smart Reference Learning**: Leverages existing presentations without requiring manual annotation\n- **Comprehensive Quality Assessment**: Evaluates presentations through multiple quality metrics\n\n## Case Study \ud83d\udca1\n\n- #### [Iphone 16 Pro](https://www.apple.com/iphone-16-pro/)\n\n<div style=\"display: flex; flex-wrap: wrap; gap: 10px;\">\n\n  <img src=\"resource/iphone16pro/0001.jpg\" alt=\"\u56fe\u72471\" width=\"200\"/>\n\n  <img src=\"resource/iphone16pro/0002.jpg\" alt=\"\u56fe\u72472\" width=\"200\"/>\n\n  <img src=\"resource/iphone16pro/0003.jpg\" alt=\"\u56fe\u72473\" width=\"200\"/>\n\n  <img src=\"resource/iphone16pro/0004.jpg\" alt=\"\u56fe\u72474\" width=\"200\"/>\n\n  <img src=\"resource/iphone16pro/0005.jpg\" alt=\"\u56fe\u72475\" width=\"200\"/>\n\n  <img src=\"resource/iphone16pro/0006.jpg\" alt=\"\u56fe\u72476\" width=\"200\"/>\n\n  <img src=\"resource/iphone16pro/0007.jpg\" alt=\"\u56fe\u72477\" width=\"200\"/>\n\n</div>\n\n- #### [Build Effective Agents](https://www.anthropic.com/research/building-effective-agents)\n\n<div style=\"display: flex; flex-wrap: wrap; gap: 10px;\">\n\n  <img src=\"resource/build_effective_agents/0001.jpg\" alt=\"\u56fe\u72471\" width=\"200\"/>\n\n  <img src=\"resource/build_effective_agents/0002.jpg\" alt=\"\u56fe\u72472\" width=\"200\"/>\n\n  <img src=\"resource/build_effective_agents/0003.jpg\" alt=\"\u56fe\u72473\" width=\"200\"/>\n\n  <img src=\"resource/build_effective_agents/0004.jpg\" alt=\"\u56fe\u72474\" width=\"200\"/>\n\n  <img src=\"resource/build_effective_agents/0005.jpg\" alt=\"\u56fe\u72475\" width=\"200\"/>\n\n  <img src=\"resource/build_effective_agents/0006.jpg\" alt=\"\u56fe\u72476\" width=\"200\"/>\n\n  <img src=\"resource/build_effective_agents/0007.jpg\" alt=\"\u56fe\u72477\" width=\"200\"/>\n\n  <img src=\"resource/build_effective_agents/0008.jpg\" alt=\"\u56fe\u72478\" width=\"200\"/>\n\n<img src=\"resource/build_effective_agents/0009.jpg\" alt=\"\u56fe\u72479\" width=\"200\"/>\n\n<img src=\"resource/build_effective_agents/0010.jpg\" alt=\"\u56fe\u724710\" width=\"200\"/>\n\n</div>\n\n## PPTAgent \ud83e\udd16\n\nPPTAgent follows a two-phase approach:\n1. **Analysis Phase**: Extracts and learns from patterns in reference presentations\n2. **Generation Phase**: Develops structured outlines and produces visually cohesive slides\n\nOur system's workflow is illustrated below:\n\n\n![PPTAgent Workflow](resource/fig2.jpg)\n\n## PPTEval \u2696\ufe0f\n\nPPTEval evaluates presentations across three dimensions:\n- **Content**: Check the accuracy and relevance of the slides.\n- **Design**: Assesses the visual appeal and consistency.\n- **Coherence**: Ensures the logical flow of ideas.\n\nThe workflow of PPTEval is shown below:\n<p align=\"center\">\n<img src=\"resource/fig3.jpg\" alt=\"PPTEval Workflow\" style=\"width:70%;\"/>\n</p>\n\n\n## Citation \ud83d\ude4f\n\nIf you find this project helpful, please use the following to cite it:\n```bibtex\n@article{zheng2025pptagent,\n  title={PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides},\n  author={Zheng, Hao and Guan, Xinyan and Kong, Hao and Zheng, Jia and Zhou, Weixiang and Lin, Hongyu and Lu, Yaojie and He, Ben and Han, Xianpei and Sun, Le},\n  journal={arXiv preprint arXiv:2501.03936},\n  year={2025}\n}\n```\n\n[![Star History Chart](https://api.star-history.com/svg?repos=icip-cas/PPTAgent&type=Date)](https://star-history.com/#icip-cas/PPTAgent&Date)\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "PPTAgent, a tool for utilizing LLMs to generate PowerPoint presentations from documents.",
    "version": "0.2.14",
    "project_urls": {
        "Bug Tracker": "https://github.com/icip-cas/PPTAgent/issues",
        "Homepage": "https://github.com/icip-cas/PPTAgent"
    },
    "split_keywords": [
        "presentation",
        " powerpoint",
        " ppt",
        " llm",
        " ai",
        " document-processing"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "431d3317e9757cdb29e046b5fdc08ef07c747c21f9d54857209b3156533022c5",
                "md5": "90f735b85fa81e1b869b6aed83221679",
                "sha256": "eb6acf59af3280837ba3664726af5e719787c00ebac0b74c24499fb3eb251d1c"
            },
            "downloads": -1,
            "filename": "pptagent-0.2.14-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "90f735b85fa81e1b869b6aed83221679",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.11",
            "size": 4748974,
            "upload_time": "2025-10-29T09:19:20",
            "upload_time_iso_8601": "2025-10-29T09:19:20.043848Z",
            "url": "https://files.pythonhosted.org/packages/43/1d/3317e9757cdb29e046b5fdc08ef07c747c21f9d54857209b3156533022c5/pptagent-0.2.14-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "f02aace69eace520cffdda88dcd436f6d3e85c46952fbd5dd122941afa68e349",
                "md5": "c0b2f9e1360eee81ab13ce98ddfaec11",
                "sha256": "941977fada9c6e6b62c144a2e79529f0ed896e9c6e17cb70f2804e10153a5e60"
            },
            "downloads": -1,
            "filename": "pptagent-0.2.14.tar.gz",
            "has_sig": false,
            "md5_digest": "c0b2f9e1360eee81ab13ce98ddfaec11",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.11",
            "size": 4735229,
            "upload_time": "2025-10-29T09:19:23",
            "upload_time_iso_8601": "2025-10-29T09:19:23.832184Z",
            "url": "https://files.pythonhosted.org/packages/f0/2a/ace69eace520cffdda88dcd436f6d3e85c46952fbd5dd122941afa68e349/pptagent-0.2.14.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-10-29 09:19:23",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "icip-cas",
    "github_project": "PPTAgent",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "pptagent"
}
        
Elapsed time: 4.58150s