Name | spacy-vb-en-core-web-sm JSON |
Version |
3.8.0
JSON |
| download |
home_page | https://explosion.ai |
Summary | Reupload of Spacy model to the PyPi index. English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer. |
upload_time | 2024-10-01 12:39:16 |
maintainer | None |
docs_url | None |
author | Explosion |
requires_python | >=3.6 |
license | None |
keywords |
|
VCS |
|
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
### Details: https://spacy.io/models/en#en_core_web_sm
English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
| Feature | Description |
| --- | --- |
| **Name** | `en_core_web_sm` |
| **Version** | `3.8.0` |
| **spaCy** | `>=3.7.5,<3.9.0` |
| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
| **Sources** | [OntoNotes 5](https://catalog.ldc.upenn.edu/LDC2013T19) (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)<br />[ClearNLP Constituent-to-Dependency Conversion](https://github.com/clir/clearnlp-guidelines/blob/master/md/components/dependency_conversion.md) (Emory University)<br />[WordNet 3.0](https://wordnet.princeton.edu/) (Princeton University) |
| **License** | `MIT` |
| **Author** | [Explosion](https://explosion.ai) |
### Label Scheme
<details>
<summary>View label scheme (113 labels for 3 components)</summary>
| Component | Labels |
| --- | --- |
| **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, `_SP`, ```` |
| **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |
| **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
</details>
### Accuracy
| Type | Score |
| --- | --- |
| `TOKEN_ACC` | 99.86 |
| `TOKEN_P` | 99.57 |
| `TOKEN_R` | 99.58 |
| `TOKEN_F` | 99.57 |
| `TAG_ACC` | 97.29 |
| `SENTS_P` | 92.01 |
| `SENTS_R` | 89.39 |
| `SENTS_F` | 90.68 |
| `DEP_UAS` | 91.77 |
| `DEP_LAS` | 89.92 |
| `ENTS_P` | 84.30 |
| `ENTS_R` | 84.36 |
| `ENTS_F` | 84.33 |
Raw data
{
"_id": null,
"home_page": "https://explosion.ai",
"name": "spacy-vb-en-core-web-sm",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.6",
"maintainer_email": null,
"keywords": null,
"author": "Explosion",
"author_email": "contact@explosion.ai",
"download_url": null,
"platform": null,
"description": "### Details: https://spacy.io/models/en#en_core_web_sm\r\n\r\nEnglish pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.\r\n\r\n| Feature | Description |\r\n| --- | --- |\r\n| **Name** | `en_core_web_sm` |\r\n| **Version** | `3.8.0` |\r\n| **spaCy** | `>=3.7.5,<3.9.0` |\r\n| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |\r\n| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |\r\n| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |\r\n| **Sources** | [OntoNotes 5](https://catalog.ldc.upenn.edu/LDC2013T19) (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)<br />[ClearNLP Constituent-to-Dependency Conversion](https://github.com/clir/clearnlp-guidelines/blob/master/md/components/dependency_conversion.md) (Emory University)<br />[WordNet 3.0](https://wordnet.princeton.edu/) (Princeton University) |\r\n| **License** | `MIT` |\r\n| **Author** | [Explosion](https://explosion.ai) |\r\n\r\n### Label Scheme\r\n\r\n<details>\r\n\r\n<summary>View label scheme (113 labels for 3 components)</summary>\r\n\r\n| Component | Labels |\r\n| --- | --- |\r\n| **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, `_SP`, ```` |\r\n| **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |\r\n| **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |\r\n\r\n</details>\r\n\r\n### Accuracy\r\n\r\n| Type | Score |\r\n| --- | --- |\r\n| `TOKEN_ACC` | 99.86 |\r\n| `TOKEN_P` | 99.57 |\r\n| `TOKEN_R` | 99.58 |\r\n| `TOKEN_F` | 99.57 |\r\n| `TAG_ACC` | 97.29 |\r\n| `SENTS_P` | 92.01 |\r\n| `SENTS_R` | 89.39 |\r\n| `SENTS_F` | 90.68 |\r\n| `DEP_UAS` | 91.77 |\r\n| `DEP_LAS` | 89.92 |\r\n| `ENTS_P` | 84.30 |\r\n| `ENTS_R` | 84.36 |\r\n| `ENTS_F` | 84.33 |\r\n",
"bugtrack_url": null,
"license": null,
"summary": "Reupload of Spacy model to the PyPi index. English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
"version": "3.8.0",
"project_urls": {
"Homepage": "https://explosion.ai"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "7436fe054d33f55cf849aaaba0edc45a04b7fc7b29b8f536d39d4b4cdf34ac3e",
"md5": "54e3e2f12d5b87813fde2654ce57dba5",
"sha256": "f45b3de50d602339194aa9d79b335f71fb1b44fb12943ca086e703892ab58ea2"
},
"downloads": -1,
"filename": "spacy_vb_en_core_web_sm-3.8.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "54e3e2f12d5b87813fde2654ce57dba5",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.6",
"size": 2513,
"upload_time": "2024-10-01T12:39:16",
"upload_time_iso_8601": "2024-10-01T12:39:16.954903Z",
"url": "https://files.pythonhosted.org/packages/74/36/fe054d33f55cf849aaaba0edc45a04b7fc7b29b8f536d39d4b4cdf34ac3e/spacy_vb_en_core_web_sm-3.8.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-10-01 12:39:16",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "spacy-vb-en-core-web-sm"
}