spacy-vb-en-core-web-sm


Namespacy-vb-en-core-web-sm JSON
Version 3.8.0 PyPI version JSON
download
home_pagehttps://explosion.ai
SummaryReupload of Spacy model to the PyPi index. English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
upload_time2024-10-01 12:39:16
maintainerNone
docs_urlNone
authorExplosion
requires_python>=3.6
licenseNone
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            ### Details: https://spacy.io/models/en#en_core_web_sm

English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.

| Feature | Description |
| --- | --- |
| **Name** | `en_core_web_sm` |
| **Version** | `3.8.0` |
| **spaCy** | `>=3.7.5,<3.9.0` |
| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
| **Sources** | [OntoNotes 5](https://catalog.ldc.upenn.edu/LDC2013T19) (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)<br />[ClearNLP Constituent-to-Dependency Conversion](https://github.com/clir/clearnlp-guidelines/blob/master/md/components/dependency_conversion.md) (Emory University)<br />[WordNet 3.0](https://wordnet.princeton.edu/) (Princeton University) |
| **License** | `MIT` |
| **Author** | [Explosion](https://explosion.ai) |

### Label Scheme

<details>

<summary>View label scheme (113 labels for 3 components)</summary>

| Component | Labels |
| --- | --- |
| **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, `_SP`, ```` |
| **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |
| **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |

</details>

### Accuracy

| Type | Score |
| --- | --- |
| `TOKEN_ACC` | 99.86 |
| `TOKEN_P` | 99.57 |
| `TOKEN_R` | 99.58 |
| `TOKEN_F` | 99.57 |
| `TAG_ACC` | 97.29 |
| `SENTS_P` | 92.01 |
| `SENTS_R` | 89.39 |
| `SENTS_F` | 90.68 |
| `DEP_UAS` | 91.77 |
| `DEP_LAS` | 89.92 |
| `ENTS_P` | 84.30 |
| `ENTS_R` | 84.36 |
| `ENTS_F` | 84.33 |

            

Raw data

            {
    "_id": null,
    "home_page": "https://explosion.ai",
    "name": "spacy-vb-en-core-web-sm",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.6",
    "maintainer_email": null,
    "keywords": null,
    "author": "Explosion",
    "author_email": "contact@explosion.ai",
    "download_url": null,
    "platform": null,
    "description": "### Details: https://spacy.io/models/en#en_core_web_sm\r\n\r\nEnglish pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.\r\n\r\n| Feature | Description |\r\n| --- | --- |\r\n| **Name** | `en_core_web_sm` |\r\n| **Version** | `3.8.0` |\r\n| **spaCy** | `>=3.7.5,<3.9.0` |\r\n| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |\r\n| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |\r\n| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |\r\n| **Sources** | [OntoNotes 5](https://catalog.ldc.upenn.edu/LDC2013T19) (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)<br />[ClearNLP Constituent-to-Dependency Conversion](https://github.com/clir/clearnlp-guidelines/blob/master/md/components/dependency_conversion.md) (Emory University)<br />[WordNet 3.0](https://wordnet.princeton.edu/) (Princeton University) |\r\n| **License** | `MIT` |\r\n| **Author** | [Explosion](https://explosion.ai) |\r\n\r\n### Label Scheme\r\n\r\n<details>\r\n\r\n<summary>View label scheme (113 labels for 3 components)</summary>\r\n\r\n| Component | Labels |\r\n| --- | --- |\r\n| **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, `_SP`, ```` |\r\n| **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |\r\n| **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |\r\n\r\n</details>\r\n\r\n### Accuracy\r\n\r\n| Type | Score |\r\n| --- | --- |\r\n| `TOKEN_ACC` | 99.86 |\r\n| `TOKEN_P` | 99.57 |\r\n| `TOKEN_R` | 99.58 |\r\n| `TOKEN_F` | 99.57 |\r\n| `TAG_ACC` | 97.29 |\r\n| `SENTS_P` | 92.01 |\r\n| `SENTS_R` | 89.39 |\r\n| `SENTS_F` | 90.68 |\r\n| `DEP_UAS` | 91.77 |\r\n| `DEP_LAS` | 89.92 |\r\n| `ENTS_P` | 84.30 |\r\n| `ENTS_R` | 84.36 |\r\n| `ENTS_F` | 84.33 |\r\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Reupload of Spacy model to the PyPi index. English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
    "version": "3.8.0",
    "project_urls": {
        "Homepage": "https://explosion.ai"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "7436fe054d33f55cf849aaaba0edc45a04b7fc7b29b8f536d39d4b4cdf34ac3e",
                "md5": "54e3e2f12d5b87813fde2654ce57dba5",
                "sha256": "f45b3de50d602339194aa9d79b335f71fb1b44fb12943ca086e703892ab58ea2"
            },
            "downloads": -1,
            "filename": "spacy_vb_en_core_web_sm-3.8.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "54e3e2f12d5b87813fde2654ce57dba5",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.6",
            "size": 2513,
            "upload_time": "2024-10-01T12:39:16",
            "upload_time_iso_8601": "2024-10-01T12:39:16.954903Z",
            "url": "https://files.pythonhosted.org/packages/74/36/fe054d33f55cf849aaaba0edc45a04b7fc7b29b8f536d39d4b4cdf34ac3e/spacy_vb_en_core_web_sm-3.8.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-10-01 12:39:16",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "spacy-vb-en-core-web-sm"
}
        
Elapsed time: 0.35859s