lmqg

Name	lmqg JSON
Version	0.1.1 JSON
	download
home_page	https://github.com/asahi417/lm-question-generation
Summary	Language Model for Question Generation.
upload_time	2023-06-18 21:20:53
maintainer
docs_url	None
author	Asahi Ushio
requires_python	>=3.6
license	MIT License
keywords	language model question-answering question-generation
VCS
bugtrack_url
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

            [![license](https://img.shields.io/badge/License-MIT-brightgreen.svg)](https://github.com/asahi417/lmqg/blob/master/LICENSE)
[![PyPI version](https://badge.fury.io/py/lmqg.svg)](https://badge.fury.io/py/lmqg)
[![PyPI pyversions](https://img.shields.io/pypi/pyversions/lmqg.svg)](https://pypi.python.org/pypi/lmqg/)
[![PyPI status](https://img.shields.io/pypi/status/lmqg.svg)](https://pypi.python.org/pypi/lmqg/)

# Question and Answer Generation with Language Models

<p align="center">
  <img src="https://raw.githubusercontent.com/asahi417/lm-question-generation/master/assets/qag.png" width="400">
  <br><em> Figure 1: Three distinct QAG approaches. </em>
</p>

The `lmqg` is a python library for question and answer generation (QAG) with language models (LMs). Here, we consider 
paragraph-level QAG, where user will provide a context (paragraph or document), and the model will generate a list of 
question and answer pairs on the context. With `lmqg`, you can do following things:
- [***Generation in One Line of Code:***](https://github.com/asahi417/lm-question-generation#generate-question--answer) Generate questions and answers in *8* languages (en/fr/ja/ko/ru/it/es/de).
- [***Model Training/Evaluation:***](https://github.com/asahi417/lm-question-generation#model-development) Train & evaluate your own QG/QAG models.
- [***QAG & QG Model Hosting:***](https://github.com/asahi417/lm-question-generation#rest-api-with-huggingface-inference-api) Host your QAG models on a web application or a restAPI server.
- [***AutoQG:***](https://github.com/asahi417/lm-question-generation/tree/master#autoqg) Online web service to generate questions and answers with our models.
 
***Update May 2023:*** Two papers got accepted by ACL 2023 ([QAG at finding](https://arxiv.org/abs/2305.17002), [LMQG at system demonstration](https://arxiv.org/abs/2305.17416)). \
***Update Oct 2022:*** Our [QG paper](https://aclanthology.org/2022.emnlp-main.42/) got accepted by EMNLP main 2022.

### A bit more about QAG models 📝
Our QAG models can be grouped into three types: **Pipeline**, **Multitask**, 
and **End2end** (see Figure 1). The **Pipeline** consists of question generation (QG) and answer extraction (AE) models independently,
where AE will parse all the sentences in the context to extract answers, and QG will generate questions on the answers.
The **Multitask** follows same architecture as the **Pipeline**, but the QG and AE models are shared model fine-tuned jointly.
Finally, **End2end** model will generate a list of question and answer pairs in an end-to-end manner.
In practice, **Pipeline** and **Multitask** generate more question and answer pairs, while **End2end** generates less but a few times faster, 
and the quality of the generated question and answer pairs depend on language.
All types are available in the *8* diverse languages (en/fr/ja/ko/ru/it/es/de) via `lmqg`, and the models are all shared on HuggingFace (see the [model card](https://github.com/asahi417/lm-question-generation/blob/master/MODEL_CARD.md)).
To know more about QAG, please check [our ACL 2023 paper](https://arxiv.org/abs/2305.17002) that describes the QAG models and reports a complete performance comparison of each QAG models in every language.

### Is QAG different from Question Generation (QG)? 🤔

<p align="center">
  <img src="https://raw.githubusercontent.com/asahi417/lm-question-generation/master/assets/example.png" width="700">
  <br><em> Figure 2: An example of QAG (a) and QG (b). </em>
</p>

All the functionalities support question generation as well. Our QG model assumes user to specify an answer in addition to a context,
and the QG model will generate a question that is answerable by the answer given the context (see Figure 2 for a comparison of QAG and QG).
To know more about QG, please check [our EMNLP 2022 paper](https://aclanthology.org/2022.emnlp-main.42/) that describes the QG models more in detail.


## Get Started 🚀

Let's install `lmqg` via pip first.
```shell
pip install lmqg
```

## Generate Question & Answer
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/13izkdp2l7G2oeh_fwL7xJdR_67HMK_hQ?usp=sharing)

The main functionality of `lmqg` is to generate question and answer pairs on a given context with a handy api.
The available models for each QAG class can be found at [model card](https://github.com/asahi417/lm-question-generation/blob/master/MODEL_CARD.md#qag).

- ***QAG with End2end or Multitask Models:*** The end2end QAG models are fine-tuned to generate a list of QA pairs with a single inference, 
  so it is the fastest class among our QAG models. Meanwhile, multitask QAG models breakdown the QAG task into QG and AE, where they 
  parse each sentence to get the answer with AE, and generate question on each answer with QG. Multitask QAG potentially generate more QA pairs than end2end QAG, but 
  inference takes a few times more than end2end models. Both models can be used as following 
  
```python
from pprint import pprint
from lmqg import TransformersQG

# initialize model
model = TransformersQG('lmqg/t5-base-squad-qag') # or TransformersQG(model='lmqg/t5-base-squad-qg-ae') 
# paragraph to generate pairs of question and answer
context = "William Turner was an English painter who specialised in watercolour landscapes. He is often known " \
          "as William Turner of Oxford or just Turner of Oxford to distinguish him from his contemporary, " \
          "J. M. W. Turner. Many of Turner's paintings depicted the countryside around Oxford. One of his " \
          "best known pictures is a view of the city of Oxford from Hinksey Hill."
# model prediction
question_answer = model.generate_qa(context)
# the output is a list of tuple (question, answer)
pprint(question_answer)
[
    ('Who was an English painter who specialised in watercolour landscapes?', 'William Turner'),
    ('What is William Turner often known as?', 'William Turner of Oxford or just Turner of Oxford'),
    ("What did many of Turner's paintings depict?", 'the countryside around Oxford'),
    ("What is one of Turner's best known pictures?", 'a view of the city of Oxford from Hinksey Hill')
]
```

- ***QAG with Pipeline Models:*** The pipeline QAG is similar to multitask QAG, but the QG and AE models are independently fine-tuned, unlike 
  multitask QAG that fine-tunes QG and AE jointly. Pipeline QAG can improve the performance in some cases, but it is as heavy as multitask QAG with 
  more storage consuming due to the two models loaded. The pipeline QAG can be used as following. The `model` and `model_ae` are the QG and AE models respectively.
  
```python
from pprint import pprint
from lmqg import TransformersQG

# initialize model
model = TransformersQG(model='lmqg/t5-base-squad-qg', model_ae='lmqg/t5-base-squad-ae')  
# paragraph to generate pairs of question and answer
context = "William Turner was an English painter who specialised in watercolour landscapes. He is often known " \
          "as William Turner of Oxford or just Turner of Oxford to distinguish him from his contemporary, " \
          "J. M. W. Turner. Many of Turner's paintings depicted the countryside around Oxford. One of his " \
          "best known pictures is a view of the city of Oxford from Hinksey Hill."
# model prediction
question_answer = model.generate_qa(context)
# the output is a list of tuple (question, answer)
pprint(question_answer)
[
    ('Who was an English painter who specialised in watercolour landscapes?', 'William Turner'),
    ('What is another name for William Turner?', 'William Turner of Oxford'),
    ("What did many of William Turner's paintings depict around Oxford?", 'the countryside'),
    ('From what hill is a view of the city of Oxford taken?', 'Hinksey Hill.')
]
```

- ***QG only:*** The QG model can be used as following. The `model` is the QG model. See the [QG-Bench](https://github.com/asahi417/lm-question-generation/blob/master/QG_BENCH.md), 
a multilingual QG benchmark, for the list of available QG models.
  
```python
from pprint import pprint
from lmqg import TransformersQG

# initialize model
model = TransformersQG(model='lmqg/t5-base-squad-qg')

# a list of paragraph
context = [
    "William Turner was an English painter who specialised in watercolour landscapes",
    "William Turner was an English painter who specialised in watercolour landscapes"
]
# a list of answer (same size as the context)
answer = [
    "William Turner",
    "English"
]
# model prediction
question = model.generate_q(list_context=context, list_answer=answer)
pprint(question)
[
  'Who was an English painter who specialised in watercolour landscapes?',
  'What nationality was William Turner?'
]
``` 

- ***AE only:*** The QG model can be used as following. The `model` is the QG model.

```python
from pprint import pprint
from lmqg import TransformersQG

# initialize model
model = TransformersQG(model='lmqg/t5-base-squad-ae')
# model prediction
answer = model.generate_a("William Turner was an English painter who specialised in watercolour landscapes")
pprint(answer)
['William Turner']
```

## AutoQG

<p align="center">
  <img src="https://raw.githubusercontent.com/asahi417/lm-question-generation/master/assets/autoqg.gif" width="500">
</p>

***AutoQG ([https://autoqg.net](https://autoqg.net/))*** is a free web application hosting our QAG models.

## Model Development
The `lmqg` also provides a command line interface to fine-tune and evaluate QG, AE, and QAG models.

### Model Training
<p align="center">
  <img src="https://raw.githubusercontent.com/asahi417/lm-question-generation/master/assets/grid_search.png" width="650">
</p>

To fine-tune QG (or AE, QAG) model, we employ a two-stage hyper-parameter optimization, described as above diagram.
Following command is to run the fine-tuning with parameter optimization.
```shell
lmqg-train-search -c "tmp_ckpt" -d "lmqg/qg_squad" -m "t5-small" -b 64 --epoch-partial 5 -e 15 --language "en" --n-max-config 1 \
  -g 2 4 --lr 1e-04 5e-04 1e-03 --label-smoothing 0 0.15
```
Check `lmqg-train-search -h` to display all the options.

Fine-tuning models in python follows below.  
```python
from lmqg import GridSearcher
trainer = GridSearcher(
    checkpoint_dir='tmp_ckpt',
    dataset_path='lmqg/qg_squad',
    model='t5-small',
    epoch=15,
    epoch_partial=5,
    batch=64,
    n_max_config=5,
    gradient_accumulation_steps=[2, 4], 
    lr=[1e-04, 5e-04, 1e-03],
    label_smoothing=[0, 0.15]
)
trainer.run()
```


### Model Evaluation
The evaluation tool reports `BLEU4`, `ROUGE-L`, `METEOR`, `BERTScore`, and `MoverScore` following [QG-Bench](https://github.com/asahi417/lm-question-generation/blob/master/QG_BENCH.md).
From command line, run following command 
```shell
lmqg-eval -m "lmqg/t5-large-squad-qg" -e "./eval_metrics" -d "lmqg/qg_squad" -l "en"
```
where `-m` is a model alias on huggingface or path to local checkpoint, `-e` is the directly to export the metric file, `-d` is the dataset to evaluate, and `-l` is the language of the test set.
Instead of running model prediction, you can provide a prediction file instead to avoid computing it each time.
```shell
lmqg-eval --hyp-test '{your prediction file}' -e "./eval_metrics" -d "lmqg/qg_squad" -l "en"
```
The prediction file should be a text file of model generation in each line in the order of `test` split in the target dataset
([sample](https://huggingface.co/lmqg/t5-large-squad/raw/main/eval/samples.validation.hyp.paragraph_sentence.question.lmqg_qg_squad.default.txt)).
Check `lmqg-eval -h` to display all the options.


## Rest API with huggingface inference API

Finally, `lmqg` provides a rest API which hosts the model inference through huggingface inference API. You need huggingface API token to run your own API and install dependencies as below.
```shell
pip install lmqg[api]
```
Swagger UI is available at [`http://127.0.0.1:8088/docs`](http://127.0.0.1:8088/docs), when you run the app locally (replace the address by your server address).

### Build
- Build/Run Local (command line):
```shell
export API_TOKEN={Your Huggingface API Token}
uvicorn app:app --reload --port 8088
uvicorn app:app --host 0.0.0.0 --port 8088
```

- Build/Run Local (docker):
```shell
docker build -t lmqg/app:latest . --build-arg api_token={Your Huggingface API Token}
docker run -p 8080:8080 lmqg/app:latest
```

### API Description
You must pass the huggingface api token via the environmental variable `API_TOKEN`.
The main endpoint is `question_generation`, which has following parameters,

| Parameter        | Description                                                                         |
| ---------------- | ----------------------------------------------------------------------------------- |
| **input_text**   | input text, a paragraph or a sentence to generate question |
| **language**     | language |
| **qg_model**     | question generation model |
| **answer_model** | answer extraction model |

and return a list of dictionaries with `question` and `answer`. 
```shell
{
  "qa": [
    {"question": "Who founded Nintendo Karuta?", "answer": "Fusajiro Yamauchi"},
    {"question": "When did Nintendo distribute its first video game console, the Color TV-Game?", "answer": "1977"}
  ]
}
```

## Citation
Please cite following paper if you use any resource and see the code to reproduce the model if needed.

- [***Generative Language Models for Paragraph-Level Question Generation, EMNLP 2022 Main***](https://aclanthology.org/2022.emnlp-main.42/): The QG models ([code to reproduce experiments](https://github.com/asahi417/lm-question-generation/tree/master/misc/2022_emnlp_qg)).
```
@inproceedings{ushio-etal-2022-generative,
    title = "{G}enerative {L}anguage {M}odels for {P}aragraph-{L}evel {Q}uestion {G}eneration",
    author = "Ushio, Asahi  and
        Alva-Manchego, Fernando  and
        Camacho-Collados, Jose",
    booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing",
    month = dec,
    year = "2022",
    address = "Abu Dhabi, U.A.E.",
    publisher = "Association for Computational Linguistics",
}
```

- [***An Empirical Comparison of LM-based Question and Answer Generation Methods, ACL 2023, Finding***](https://arxiv.org/abs/2305.17002): The QAG models ([code to reproduce experiments](https://github.com/asahi417/lm-question-generation/tree/master/misc/2023_acl_qag)).
```
@inproceedings{ushio-etal-2023-an-empirical,
    title = "An Empirical Comparison of LM-based Question and Answer Generation Methods",
    author = "Ushio, Asahi  and
        Alva-Manchego, Fernando  and
        Camacho-Collados, Jose",
    booktitle = "Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics: Findings",
    month = Jul,
    year = "2023",
    address = "Toronto, Canada",
    publisher = "Association for Computational Linguistics",
}
```

- [***A Practical Toolkit for Multilingual Question and Answer Generation, ACL 2023, System Demonstration***](https://arxiv.org/abs/2305.17416): The library and demo ([code to reproduce experiments](https://github.com/asahi417/lm-question-generation/tree/master/misc/2023_acl_qag)).

```
@inproceedings{ushio-etal-2023-a-practical-toolkit,
    title = "A Practical Toolkit for Multilingual Question and Answer Generation",
    author = "Ushio, Asahi  and
        Alva-Manchego, Fernando  and
        Camacho-Collados, Jose",
    booktitle = "Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics: System Demonstrations",
    month = Jul,
    year = "2023",
    address = "Toronto, Canada",
    publisher = "Association for Computational Linguistics",
}
```

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/asahi417/lm-question-generation",
    "name": "lmqg",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.6",
    "maintainer_email": "",
    "keywords": "language model,question-answering,question-generation",
    "author": "Asahi Ushio",
    "author_email": "asahi1992ushio@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/04/05/4ad68439adeed14e1d7937fc4d2110fec739609ea943c6a78a3e16098968/lmqg-0.1.1.tar.gz",
    "platform": null,
    "description": "[![license](https://img.shields.io/badge/License-MIT-brightgreen.svg)](https://github.com/asahi417/lmqg/blob/master/LICENSE)\n[![PyPI version](https://badge.fury.io/py/lmqg.svg)](https://badge.fury.io/py/lmqg)\n[![PyPI pyversions](https://img.shields.io/pypi/pyversions/lmqg.svg)](https://pypi.python.org/pypi/lmqg/)\n[![PyPI status](https://img.shields.io/pypi/status/lmqg.svg)](https://pypi.python.org/pypi/lmqg/)\n\n# Question and Answer Generation with Language Models\n\n<p align=\"center\">\n  <img src=\"https://raw.githubusercontent.com/asahi417/lm-question-generation/master/assets/qag.png\" width=\"400\">\n  <br><em> Figure 1: Three distinct QAG approaches. </em>\n</p>\n\nThe `lmqg` is a python library for question and answer generation (QAG) with language models (LMs). Here, we consider \nparagraph-level QAG, where user will provide a context (paragraph or document), and the model will generate a list of \nquestion and answer pairs on the context. With `lmqg`, you can do following things:\n- [***Generation in One Line of Code:***](https://github.com/asahi417/lm-question-generation#generate-question--answer) Generate questions and answers in *8* languages (en/fr/ja/ko/ru/it/es/de).\n- [***Model Training/Evaluation:***](https://github.com/asahi417/lm-question-generation#model-development) Train & evaluate your own QG/QAG models.\n- [***QAG & QG Model Hosting:***](https://github.com/asahi417/lm-question-generation#rest-api-with-huggingface-inference-api) Host your QAG models on a web application or a restAPI server.\n- [***AutoQG:***](https://github.com/asahi417/lm-question-generation/tree/master#autoqg) Online web service to generate questions and answers with our models.\n \n***Update May 2023:*** Two papers got accepted by ACL 2023 ([QAG at finding](https://arxiv.org/abs/2305.17002), [LMQG at system demonstration](https://arxiv.org/abs/2305.17416)). \\\n***Update Oct 2022:*** Our [QG paper](https://aclanthology.org/2022.emnlp-main.42/) got accepted by EMNLP main 2022.\n\n### A bit more about QAG models \ud83d\udcdd\nOur QAG models can be grouped into three types: **Pipeline**, **Multitask**, \nand **End2end** (see Figure 1). The **Pipeline** consists of question generation (QG) and answer extraction (AE) models independently,\nwhere AE will parse all the sentences in the context to extract answers, and QG will generate questions on the answers.\nThe **Multitask** follows same architecture as the **Pipeline**, but the QG and AE models are shared model fine-tuned jointly.\nFinally, **End2end** model will generate a list of question and answer pairs in an end-to-end manner.\nIn practice, **Pipeline** and **Multitask** generate more question and answer pairs, while **End2end** generates less but a few times faster, \nand the quality of the generated question and answer pairs depend on language.\nAll types are available in the *8* diverse languages (en/fr/ja/ko/ru/it/es/de) via `lmqg`, and the models are all shared on HuggingFace (see the [model card](https://github.com/asahi417/lm-question-generation/blob/master/MODEL_CARD.md)).\nTo know more about QAG, please check [our ACL 2023 paper](https://arxiv.org/abs/2305.17002) that describes the QAG models and reports a complete performance comparison of each QAG models in every language.\n\n### Is QAG different from Question Generation (QG)? \ud83e\udd14\n\n<p align=\"center\">\n  <img src=\"https://raw.githubusercontent.com/asahi417/lm-question-generation/master/assets/example.png\" width=\"700\">\n  <br><em> Figure 2: An example of QAG (a) and QG (b). </em>\n</p>\n\nAll the functionalities support question generation as well. Our QG model assumes user to specify an answer in addition to a context,\nand the QG model will generate a question that is answerable by the answer given the context (see Figure 2 for a comparison of QAG and QG).\nTo know more about QG, please check [our EMNLP 2022 paper](https://aclanthology.org/2022.emnlp-main.42/) that describes the QG models more in detail.\n\n\n## Get Started \ud83d\ude80\n\nLet's install `lmqg` via pip first.\n```shell\npip install lmqg\n```\n\n## Generate Question & Answer\n[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/13izkdp2l7G2oeh_fwL7xJdR_67HMK_hQ?usp=sharing)\n\nThe main functionality of `lmqg` is to generate question and answer pairs on a given context with a handy api.\nThe available models for each QAG class can be found at [model card](https://github.com/asahi417/lm-question-generation/blob/master/MODEL_CARD.md#qag).\n\n- ***QAG with End2end or Multitask Models:*** The end2end QAG models are fine-tuned to generate a list of QA pairs with a single inference, \n  so it is the fastest class among our QAG models. Meanwhile, multitask QAG models breakdown the QAG task into QG and AE, where they \n  parse each sentence to get the answer with AE, and generate question on each answer with QG. Multitask QAG potentially generate more QA pairs than end2end QAG, but \n  inference takes a few times more than end2end models. Both models can be used as following \n  \n```python\nfrom pprint import pprint\nfrom lmqg import TransformersQG\n\n# initialize model\nmodel = TransformersQG('lmqg/t5-base-squad-qag') # or TransformersQG(model='lmqg/t5-base-squad-qg-ae') \n# paragraph to generate pairs of question and answer\ncontext = \"William Turner was an English painter who specialised in watercolour landscapes. He is often known \" \\\n          \"as William Turner of Oxford or just Turner of Oxford to distinguish him from his contemporary, \" \\\n          \"J. M. W. Turner. Many of Turner's paintings depicted the countryside around Oxford. One of his \" \\\n          \"best known pictures is a view of the city of Oxford from Hinksey Hill.\"\n# model prediction\nquestion_answer = model.generate_qa(context)\n# the output is a list of tuple (question, answer)\npprint(question_answer)\n[\n    ('Who was an English painter who specialised in watercolour landscapes?', 'William Turner'),\n    ('What is William Turner often known as?', 'William Turner of Oxford or just Turner of Oxford'),\n    (\"What did many of Turner's paintings depict?\", 'the countryside around Oxford'),\n    (\"What is one of Turner's best known pictures?\", 'a view of the city of Oxford from Hinksey Hill')\n]\n```\n\n- ***QAG with Pipeline Models:*** The pipeline QAG is similar to multitask QAG, but the QG and AE models are independently fine-tuned, unlike \n  multitask QAG that fine-tunes QG and AE jointly. Pipeline QAG can improve the performance in some cases, but it is as heavy as multitask QAG with \n  more storage consuming due to the two models loaded. The pipeline QAG can be used as following. The `model` and `model_ae` are the QG and AE models respectively.\n  \n```python\nfrom pprint import pprint\nfrom lmqg import TransformersQG\n\n# initialize model\nmodel = TransformersQG(model='lmqg/t5-base-squad-qg', model_ae='lmqg/t5-base-squad-ae')  \n# paragraph to generate pairs of question and answer\ncontext = \"William Turner was an English painter who specialised in watercolour landscapes. He is often known \" \\\n          \"as William Turner of Oxford or just Turner of Oxford to distinguish him from his contemporary, \" \\\n          \"J. M. W. Turner. Many of Turner's paintings depicted the countryside around Oxford. One of his \" \\\n          \"best known pictures is a view of the city of Oxford from Hinksey Hill.\"\n# model prediction\nquestion_answer = model.generate_qa(context)\n# the output is a list of tuple (question, answer)\npprint(question_answer)\n[\n    ('Who was an English painter who specialised in watercolour landscapes?', 'William Turner'),\n    ('What is another name for William Turner?', 'William Turner of Oxford'),\n    (\"What did many of William Turner's paintings depict around Oxford?\", 'the countryside'),\n    ('From what hill is a view of the city of Oxford taken?', 'Hinksey Hill.')\n]\n```\n\n- ***QG only:*** The QG model can be used as following. The `model` is the QG model. See the [QG-Bench](https://github.com/asahi417/lm-question-generation/blob/master/QG_BENCH.md), \na multilingual QG benchmark, for the list of available QG models.\n  \n```python\nfrom pprint import pprint\nfrom lmqg import TransformersQG\n\n# initialize model\nmodel = TransformersQG(model='lmqg/t5-base-squad-qg')\n\n# a list of paragraph\ncontext = [\n    \"William Turner was an English painter who specialised in watercolour landscapes\",\n    \"William Turner was an English painter who specialised in watercolour landscapes\"\n]\n# a list of answer (same size as the context)\nanswer = [\n    \"William Turner\",\n    \"English\"\n]\n# model prediction\nquestion = model.generate_q(list_context=context, list_answer=answer)\npprint(question)\n[\n  'Who was an English painter who specialised in watercolour landscapes?',\n  'What nationality was William Turner?'\n]\n``` \n\n- ***AE only:*** The QG model can be used as following. The `model` is the QG model.\n\n```python\nfrom pprint import pprint\nfrom lmqg import TransformersQG\n\n# initialize model\nmodel = TransformersQG(model='lmqg/t5-base-squad-ae')\n# model prediction\nanswer = model.generate_a(\"William Turner was an English painter who specialised in watercolour landscapes\")\npprint(answer)\n['William Turner']\n```\n\n## AutoQG\n\n<p align=\"center\">\n  <img src=\"https://raw.githubusercontent.com/asahi417/lm-question-generation/master/assets/autoqg.gif\" width=\"500\">\n</p>\n\n***AutoQG ([https://autoqg.net](https://autoqg.net/))*** is a free web application hosting our QAG models.\n\n## Model Development\nThe `lmqg` also provides a command line interface to fine-tune and evaluate QG, AE, and QAG models.\n\n### Model Training\n<p align=\"center\">\n  <img src=\"https://raw.githubusercontent.com/asahi417/lm-question-generation/master/assets/grid_search.png\" width=\"650\">\n</p>\n\nTo fine-tune QG (or AE, QAG) model, we employ a two-stage hyper-parameter optimization, described as above diagram.\nFollowing command is to run the fine-tuning with parameter optimization.\n```shell\nlmqg-train-search -c \"tmp_ckpt\" -d \"lmqg/qg_squad\" -m \"t5-small\" -b 64 --epoch-partial 5 -e 15 --language \"en\" --n-max-config 1 \\\n  -g 2 4 --lr 1e-04 5e-04 1e-03 --label-smoothing 0 0.15\n```\nCheck `lmqg-train-search -h` to display all the options.\n\nFine-tuning models in python follows below.  \n```python\nfrom lmqg import GridSearcher\ntrainer = GridSearcher(\n    checkpoint_dir='tmp_ckpt',\n    dataset_path='lmqg/qg_squad',\n    model='t5-small',\n    epoch=15,\n    epoch_partial=5,\n    batch=64,\n    n_max_config=5,\n    gradient_accumulation_steps=[2, 4], \n    lr=[1e-04, 5e-04, 1e-03],\n    label_smoothing=[0, 0.15]\n)\ntrainer.run()\n```\n\n\n### Model Evaluation\nThe evaluation tool reports `BLEU4`, `ROUGE-L`, `METEOR`, `BERTScore`, and `MoverScore` following [QG-Bench](https://github.com/asahi417/lm-question-generation/blob/master/QG_BENCH.md).\nFrom command line, run following command \n```shell\nlmqg-eval -m \"lmqg/t5-large-squad-qg\" -e \"./eval_metrics\" -d \"lmqg/qg_squad\" -l \"en\"\n```\nwhere `-m` is a model alias on huggingface or path to local checkpoint, `-e` is the directly to export the metric file, `-d` is the dataset to evaluate, and `-l` is the language of the test set.\nInstead of running model prediction, you can provide a prediction file instead to avoid computing it each time.\n```shell\nlmqg-eval --hyp-test '{your prediction file}' -e \"./eval_metrics\" -d \"lmqg/qg_squad\" -l \"en\"\n```\nThe prediction file should be a text file of model generation in each line in the order of `test` split in the target dataset\n([sample](https://huggingface.co/lmqg/t5-large-squad/raw/main/eval/samples.validation.hyp.paragraph_sentence.question.lmqg_qg_squad.default.txt)).\nCheck `lmqg-eval -h` to display all the options.\n\n\n## Rest API with huggingface inference API\n\nFinally, `lmqg` provides a rest API which hosts the model inference through huggingface inference API. You need huggingface API token to run your own API and install dependencies as below.\n```shell\npip install lmqg[api]\n```\nSwagger UI is available at [`http://127.0.0.1:8088/docs`](http://127.0.0.1:8088/docs), when you run the app locally (replace the address by your server address).\n\n### Build\n- Build/Run Local (command line):\n```shell\nexport API_TOKEN={Your Huggingface API Token}\nuvicorn app:app --reload --port 8088\nuvicorn app:app --host 0.0.0.0 --port 8088\n```\n\n- Build/Run Local (docker):\n```shell\ndocker build -t lmqg/app:latest . --build-arg api_token={Your Huggingface API Token}\ndocker run -p 8080:8080 lmqg/app:latest\n```\n\n### API Description\nYou must pass the huggingface api token via the environmental variable `API_TOKEN`.\nThe main endpoint is `question_generation`, which has following parameters,\n\n| Parameter        | Description                                                                         |\n| ---------------- | ----------------------------------------------------------------------------------- |\n| **input_text**   | input text, a paragraph or a sentence to generate question |\n| **language**     | language |\n| **qg_model**     | question generation model |\n| **answer_model** | answer extraction model |\n\nand return a list of dictionaries with `question` and `answer`. \n```shell\n{\n  \"qa\": [\n    {\"question\": \"Who founded Nintendo Karuta?\", \"answer\": \"Fusajiro Yamauchi\"},\n    {\"question\": \"When did Nintendo distribute its first video game console, the Color TV-Game?\", \"answer\": \"1977\"}\n  ]\n}\n```\n\n## Citation\nPlease cite following paper if you use any resource and see the code to reproduce the model if needed.\n\n- [***Generative Language Models for Paragraph-Level Question Generation, EMNLP 2022 Main***](https://aclanthology.org/2022.emnlp-main.42/): The QG models ([code to reproduce experiments](https://github.com/asahi417/lm-question-generation/tree/master/misc/2022_emnlp_qg)).\n```\n@inproceedings{ushio-etal-2022-generative,\n    title = \"{G}enerative {L}anguage {M}odels for {P}aragraph-{L}evel {Q}uestion {G}eneration\",\n    author = \"Ushio, Asahi  and\n        Alva-Manchego, Fernando  and\n        Camacho-Collados, Jose\",\n    booktitle = \"Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing\",\n    month = dec,\n    year = \"2022\",\n    address = \"Abu Dhabi, U.A.E.\",\n    publisher = \"Association for Computational Linguistics\",\n}\n```\n\n- [***An Empirical Comparison of LM-based Question and Answer Generation Methods, ACL 2023, Finding***](https://arxiv.org/abs/2305.17002): The QAG models ([code to reproduce experiments](https://github.com/asahi417/lm-question-generation/tree/master/misc/2023_acl_qag)).\n```\n@inproceedings{ushio-etal-2023-an-empirical,\n    title = \"An Empirical Comparison of LM-based Question and Answer Generation Methods\",\n    author = \"Ushio, Asahi  and\n        Alva-Manchego, Fernando  and\n        Camacho-Collados, Jose\",\n    booktitle = \"Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics: Findings\",\n    month = Jul,\n    year = \"2023\",\n    address = \"Toronto, Canada\",\n    publisher = \"Association for Computational Linguistics\",\n}\n```\n\n- [***A Practical Toolkit for Multilingual Question and Answer Generation, ACL 2023, System Demonstration***](https://arxiv.org/abs/2305.17416): The library and demo ([code to reproduce experiments](https://github.com/asahi417/lm-question-generation/tree/master/misc/2023_acl_qag)).\n\n```\n@inproceedings{ushio-etal-2023-a-practical-toolkit,\n    title = \"A Practical Toolkit for Multilingual Question and Answer Generation\",\n    author = \"Ushio, Asahi  and\n        Alva-Manchego, Fernando  and\n        Camacho-Collados, Jose\",\n    booktitle = \"Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics: System Demonstrations\",\n    month = Jul,\n    year = \"2023\",\n    address = \"Toronto, Canada\",\n    publisher = \"Association for Computational Linguistics\",\n}\n```\n\n\n",
    "bugtrack_url": null,
    "license": "MIT License",
    "summary": "Language Model for Question Generation.",
    "version": "0.1.1",
    "project_urls": {
        "Download": "https://github.com/asahi417/lm-question-generation/archive/v0.1.1.tar.gz",
        "Homepage": "https://github.com/asahi417/lm-question-generation"
    },
    "split_keywords": [
        "language model",
        "question-answering",
        "question-generation"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "04054ad68439adeed14e1d7937fc4d2110fec739609ea943c6a78a3e16098968",
                "md5": "b9e6f5a2b44f0fc7876b562ac882a828",
                "sha256": "72d30861d799fd6f42b18595216021aebd10361f7532ef141958ef34f4f4fa43"
            },
            "downloads": -1,
            "filename": "lmqg-0.1.1.tar.gz",
            "has_sig": false,
            "md5_digest": "b9e6f5a2b44f0fc7876b562ac882a828",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.6",
            "size": 100147,
            "upload_time": "2023-06-18T21:20:53",
            "upload_time_iso_8601": "2023-06-18T21:20:53.595042Z",
            "url": "https://files.pythonhosted.org/packages/04/05/4ad68439adeed14e1d7937fc4d2110fec739609ea943c6a78a3e16098968/lmqg-0.1.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-06-18 21:20:53",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "asahi417",
    "github_project": "lm-question-generation",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "lmqg"
}

Asahi Ushio