pmidcite


| Field | Value |
|---|---|
| Name | pmidcite |
| Version | 0.1.2 |
| Summary | Turbocharge a PubMed literature search using citation data from the NIH |
| Upload time | 2025-07-25 02:15:39 |
| Requires Python | >=3.9 |
| Home page | None |
| Maintainer | None |
| Docs URL | None |
| Author | None |
| License | None |
| Keywords | library, pubmed, ncbi, command-line-tool, literature-mining, citations, citation-network, google-search, google-scholar |
| Requirements | No requirements were recorded. |
| Travis-CI | No Travis. |
| Coveralls test coverage | No coveralls. |

# PubMed ID (PMID) Cite

[![Latest PyPI version](https://img.shields.io/pypi/v/pmidcite.svg)](https://pypi.org/project/pmidcite/)
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.5172712.svg)](https://doi.org/10.5281/zenodo.5172712)

<p align="center"><img src="https://github.com/dvklopfenstein/pmidcite/raw/main/docs/images/pmidcite_citedby.png" alt="pmidcite summary" width="500"/></p>

Turbocharge a [**PubMed**](https://pubmed.ncbi.nlm.nih.gov) literature search with the command, `icite`, rather than clicking and clicking and clicking on [**Google Scholar**](/doc/images/README_twitter.md) "*Cited by N*" links.

This open-source project is part of [**a peer-reviewed**](https://pubmed.ncbi.nlm.nih.gov/33031632) [**commentary**](https://onlinelibrary.wiley.com/doi/10.1002/jrsm.1456) that was invited by the editors of [***Research Synthesis Methods***](https://onlinelibrary.wiley.com/journal/17592887).
Please [**Cite**](#how-to-cite) and star on GitHub
if you use *pmidcite* in your research or literature search.    

Contact: dvklopfenstein@protonmail.com     

# PubMed and NIH Citation data
PubMed contains peer-reviewed research papers
in biomedicine, biochemistry, chemistry, behavioral science, and other life sciences.    
[**Citation data**](https://icite.od.nih.gov) is downloaded
from the [**National Institutes of Health (NIH)**](https://www.nih.gov/)
each time `icite` is run and includes:
* Citation counts of all papers and clinical papers
* Performance of a paper among its peer papers
* Existence of MeSH terms for the human, animal, and molecular/cellular categories

# Table of Contents
* ***Quickstart on the*** [***command line***](https://github.com/dvklopfenstein/pmidcite/blob/main/README.md#command-line-interface-cli)
  * [**1) Download citation counts and data for a research paper**](https://github.com/dvklopfenstein/pmidcite#1-download-citation-counts-and-data-for-a-research-paper)
  * [**2) Forward citation search**](https://github.com/dvklopfenstein/pmidcite#2-forward-citation-search): following a paper's *Cited by* links or *Forward snowballing*
  * [**3) Backward citation search**](https://github.com/dvklopfenstein/pmidcite#3-backward-citation-search): following the links to a paper's references or *Backward snowballing*
  * [**4) Summarize a group of citations**](https://github.com/dvklopfenstein/pmidcite#4-summarize-a-group-of-citations)
  * [**5) Download citations for all papers returned from a PubMed search**](https://github.com/dvklopfenstein/pmidcite/blob/main/README.md#5-download-citations-for-all-papers-returned-from-a-pubmed-search)
* ***Examples in Jupyter notebooks using the *pmidcite* Python library***
  * [**1) Download NIH-OCC citation data**](https://github.com/dvklopfenstein/pmidcite/blob/main/notebooks/NIHOCC_data_download_always.ipynb)
  * [**2) Download missing or load existing NIH-OCC citation data**](https://github.com/dvklopfenstein/pmidcite/blob/main/notebooks/NIHOCC_data_download_or_import.ipynb)
  * [**3) Print a paper's citation and reference data**](https://github.com/dvklopfenstein/pmidcite/blob/main/notebooks/print_paper_all_refs_cites.ipynb)
  * [**4) Sort NIH iCite entries**](https://github.com/dvklopfenstein/pmidcite/blob/main/notebooks/print_paper_sort_cites.ipynb)
  * [**5) Query PubMed**](https://github.com/dvklopfenstein/pmidcite/blob/main/notebooks/query_pubmed.ipynb)
* ***Installation & citation***:
  * [**Installation**](#installation)
  * [**Setup**](#setup)
  * [**Google Scholar vs. PubMed**](https://github.com/dvklopfenstein/pmidcite/blob/main/README.md#pubmed-vs-google-scholar)
    * [**What is in PubMed?**](https://github.com/dvklopfenstein/pmidcite/blob/main/README.md#what-is-in-pubmed--take-a-quick-tour)
  * [**How to Cite *pmidcite***](#how-to-cite)
  * [**Contributing**](#contributing)
* [***References***](#references)

## 1) Download citation counts and data for a research paper
```$ icite -H 26032263```    
* This paper (PMID 26032263) has `25` citations, `10` references, and `4` authors.    
* This paper is performing well (`74`th percentile in column `%`) compared to its [peers](https://icite.od.nih.gov/user_guide?page_id=ug_overview).    
    
![Starting usage](https://github.com/dvklopfenstein/pmidcite/raw/main/docs/images/pmidcite0.png)
### NIH percentile
This paper is performing well (`74`th percentile) compared to its [peers](https://icite.od.nih.gov/user_guide?page_id=ug_overview) (column `%`).     

The NIH percentile grouping (column `G`) highlights
the better-performing papers in groups `2`, `3`, and `4` by
sorting the citing papers by group first, then by publication year.

The sort places the lower-performing papers in groups `0` and `1` at the back.

New papers appear at the beginning of a sorted list,
regardless of how many citations they have,
to help researchers find the latest discoveries.

Grouping papers by NIH percentile is a novel feature created by [dvklopfenstein](https://github.com/dvklopfenstein) for this project.
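
The ordering described above can be sketched in a few lines of Python. This is only an illustration, not the project's implementation: the record fields, the `GROUP_ORDER` mapping, and the PMIDs are hypothetical, and placing newer years first within a group is an assumption.

```python
# Hypothetical ranking: newest papers ("i") first, then groups 4 down to 0.
GROUP_ORDER = {"i": 0, "4": 1, "3": 2, "2": 3, "1": 4, "0": 5}


def sort_by_group_then_year(papers):
    """Sort by NIH group first, then by publication year (newer first, an assumption)."""
    return sorted(papers, key=lambda p: (GROUP_ORDER[p["group"]], -p["year"]))


# Made-up records for illustration only; these are not real PMIDs.
papers = [
    {"pmid": "111", "year": 2019, "group": "1"},
    {"pmid": "222", "year": 2021, "group": "3"},
    {"pmid": "333", "year": 2024, "group": "i"},
    {"pmid": "444", "year": 2022, "group": "3"},
    {"pmid": "555", "year": 2020, "group": "4"},
]

for paper in sort_by_group_then_year(papers):
    print(paper["group"], paper["year"], paper["pmid"])
```

The lower-performing group `1` paper ends up last, and the new paper in group `i` comes first no matter how few citations it has.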

## 2) Forward citation search
<p align="left"><img src="https://github.com/dvklopfenstein/pmidcite/raw/main/docs/images/pmidcite_citedby_cit.png" alt="pmidcite summary" width="300"/></p>

Also known as following a paper's *Cited by* links or *Forward snowballing*    

```$ icite -H; icite 26032263 --load_citations | sort -k6 -r```    
or    
```$ icite -H; icite 26032263 -c | sort -k6 -r```    


## 3) Backward citation search
Also known as following links to a paper's references or *Backward snowballing*    
<p align="left"><img src="https://github.com/dvklopfenstein/pmidcite/raw/main/docs/images/pmidcite_citedby_ref.png" alt="pmidcite summary" width="300"/></p>

```$ icite -H; icite 26032263 --load_references | sort -k6 -r```    
or    
```$ icite -H; icite 26032263 -r | sort -k6 -r```     
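
To run the forward and backward searches from a script, a small helper can assemble the argument list. This is a sketch: the function name `icite_command` and the `FLAGS` mapping are hypothetical, and the only parts of the CLI it relies on are the `--load_citations` and `--load_references` flags shown above.

```python
import shlex

# Flags copied from the examples above; nothing else about the CLI is assumed.
FLAGS = {"forward": "--load_citations", "backward": "--load_references"}


def icite_command(pmid, direction):
    """Return the argument list for a forward or backward citation search."""
    if not str(pmid).isdigit():
        raise ValueError(f"PMID must be all digits: {pmid!r}")
    return ["icite", str(pmid), FLAGS[direction]]


for direction in ("forward", "backward"):
    print(shlex.join(icite_command(26032263, direction)))
```

Once *pmidcite* is installed, the returned list can be passed to `subprocess.run(..., capture_output=True, text=True)` to collect the output for further processing.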

## 4) Summarize a group of citations
Create a file containing numerous PMIDs annotated with icite info
```
$ icite 30022098 -c -o goatools_cites.txt
  WROTE: goatools_cites.txt
```

Count the number of lines in the file
```
$ wc -l goatools_cites.txt
468 goatools_cites.txt
```

Summarize the papers in "goatools_cites.txt"
```
$ sumpaps goatools_cites.txt
i=026.9% 4=003.0% 3=018.9% 2=028.8% 1=015.9% 0=006.5%   6 years:2018-2024   465 papers goatools_cites.txt
```
* The output is on one line so that many files containing sets of PMIDs can be compared
* The groups are listed from newest (`i`) to top-performing (`4`), great (`3`), very good (`2`), and overlooked (`1` and `0`)
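
Because each summary fits on one line, it is easy to parse when comparing several files. The sketch below assumes the line layout matches the single example shown above; the function name `parse_sumpaps_line` and the regular expressions are illustrative and not part of *pmidcite*.

```python
import re


def parse_sumpaps_line(line):
    """Extract group percentages, year range, paper count, and filename from one line."""
    groups = {key: float(pct) for key, pct in re.findall(r"(\S)=(\d+\.\d+)%", line)}
    years = re.search(r"years:(\d{4})-(\d{4})", line)
    count = re.search(r"(\d+) papers (\S+)", line)
    return {
        "groups": groups,
        "years": (int(years.group(1)), int(years.group(2))),
        "papers": int(count.group(1)),
        "filename": count.group(2),
    }


# The summary line is copied from the example above.
LINE = ("i=026.9% 4=003.0% 3=018.9% 2=028.8% 1=015.9% 0=006.5%   "
        "6 years:2018-2024   465 papers goatools_cites.txt")

summary = parse_sumpaps_line(LINE)
print(summary["filename"], summary["papers"], summary["groups"])
```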


## 5) Download citations for all papers returned from a PubMed search
1. [Do a search in PubMed](#1-do-a-search-in-pubmed)
2. [Save all results into a file containing all PMIDs found by the search](#2-save-all-results-into-a-list-of-pmids)
3. [Download the list of PMIDs](#3-download-the-list-of-pmids)
4. [Run icite to analyze all the PMIDs](#4-run-icite-to-analyze-all-the-pmids)

### 1. Do a search in [PubMed](https://pubmed.ncbi.nlm.nih.gov/)
<p align="center"><img src="https://github.com/dvklopfenstein/pmidcite/raw/main/doc/images/pubmed_HIV_AND_Me_srch.png" alt="pmidcite summary" width="800"/></p>   

### 2. Save all results into a list of PMIDs
<p align="center"><img src="https://github.com/dvklopfenstein/pmidcite/raw/main/doc/images/pubmed_HIV_AND_Me_save.png" alt="pmidcite summary" width="800"/></p>   

### 3. Download the list of PMIDs
<p align="center"><img src="https://github.com/dvklopfenstein/pmidcite/raw/main/doc/images/pubmed_HIV_AND_Me_dnld.png" alt="pmidcite summary" width="800"/></p>   

### 4. Run icite to analyze all the PMIDs
```
$ icite -i pmid-HIVANDDNAm-set.txt -o pmid-HIVANDDNAm-icite.txt
$ grep TOP pmid-HIVANDDNAm-icite.txt | sort -k6
```
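
Before passing a downloaded PMID list to `icite -i`, it can help to check the file. The sketch below assumes the file holds one PMID per line, which is an assumption about the PubMed download rather than something stated here; the helper name `read_pmids` and the demo file are hypothetical.

```python
import tempfile
from pathlib import Path


def read_pmids(path):
    """Return unique PMIDs in file order, skipping blank and non-numeric lines."""
    seen = {}
    for line in Path(path).read_text().splitlines():
        token = line.strip()
        if token.isdigit():
            seen.setdefault(token, None)  # dict keys keep first-seen order
    return list(seen)


# Demo on a throwaway file; a real file would come from the PubMed download step.
with tempfile.TemporaryDirectory() as tmpdir:
    demo = Path(tmpdir) / "pmids.txt"
    demo.write_text("26032263\n30022098\n\n26032263\nnot-a-pmid\n")
    print(read_pmids(demo))
```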


## Command Line Interface (CLI)

A Command-Line Interface (CLI) can be preferable 
to a Graphical User Interface (GUI) because: 
* processing can be automated from a script
* time-consuming mouse clicking is reduced
* more data can be seen at once on a text screen
than in a browser, giving the researcher 
a better overall impression of the full set of information [[1]](#how-to-cite)

Researchers who use Linux or Mac already work from the command line.
Researchers who use Windows can get that Linux-like command line feeling
while still running native Windows programs by
downloading Cygwin from https://www.cygwin.com/ [[1]](#how-to-cite).


# PubMed vs Google Scholar
<p align="center">
<img src="https://github.com/dvklopfenstein/pmidcite/raw/main/docs/images/Search_Features_GS_v_PubMed.png" alt="Google Scholar vs PubMed" width="600"/>
</p>

In 2013, Boeker et al. [[6](#references)]
recommended that a scientific search interface contain five integrated search criteria.
PubMed implements all five, while Google Scholar did not in 2013 and still does not today.

Google's highly popular implementation of the forward citation search through its ubiquitous "Cited by N" links
is a "Better" experience than PubMed's "forward citation search" implementation.

But if your research is in the health sciences and
you are amenable to working from the [command line](#command-line-interface-cli),
you can use PubMed in your browser plus
citation data downloaded from the NIH on the command line using *pmidcite*.
The NIH's citation data includes a paper's ranking within its co-citation network.


## What is in [PubMed](https://pubmed.ncbi.nlm.nih.gov)?  Take a [**quick tour**](https://www.nlm.nih.gov/pubs/techbull/ma20/brief/ma20_pubmed_essentials.html)
<img src="https://github.com/dvklopfenstein/pmidcite/raw/main/docs/images/pubmed_content_2020_01_10.png" alt="PubMed Contents" width="850"/>

PubMed is a search interface and toolset used to access over 30.5 million article records from databases such as:
* **MEDLINE**: a highly selective database started in the 1960s
* **PubMed Central (PMC)**: an open-access database for full-text papers that are free of cost
* Additional content such as books and articles published before the 1960s


## Installation
To install from [**PyPI**](https://pypi.org/project/pmidcite/)    
```$ pip3 install pmidcite```

To install locally
```
$ git clone https://github.com/dvklopfenstein/pmidcite.git
$ cd ./pmidcite
$ pip3 install .
```
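
After installing, a quick check from Python can confirm that the package is visible and that the interpreter meets the `>=3.9` requirement listed in the package metadata. This is a sketch using only the standard library; the helper name `installed_version` is hypothetical.

```python
import sys
from importlib import metadata


def installed_version(package):
    """Return the installed version string, or None if the package is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


print("Python >= 3.9:", sys.version_info >= (3, 9))
print("pmidcite version:", installed_version("pmidcite"))
```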

## Setup
Save your literature search in a GitHub repo.

### 1. Add a [pmidcite init file](doc/example_cfg/.pmidciterc)
Add a .pmidciterc init file to a non-git managed directory, such as home (~)
```
$ icite --generate-rcfile | tee ~/.pmidciterc
[pmidcite]
email = myname@email.edu
# To download PubMed search results, get an NCBI API key here:
# https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities
apikey = MY_LONG_HEX_NCBI_API_KEY
tool = my_scripts
```
```
$ export PMIDCITECONF=~/.pmidciterc
```
Do not put `.pmidciterc` under version control (for example, in a GitHub repo) because it
contains your personal email and your private NCBI API key.
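
The generated file looks like an INI file, so a short script can flag values that still hold the template placeholders. This sketch assumes the file can be read by Python's `configparser`, which is not confirmed here; the function name `check_rc_text` and the `PLACEHOLDERS` set are hypothetical.

```python
import configparser

# Template values copied from the generated file shown above.
PLACEHOLDERS = {"myname@email.edu", "MY_LONG_HEX_NCBI_API_KEY"}
KEYS = ("email", "apikey", "tool")

TEMPLATE = """\
[pmidcite]
email = myname@email.edu
# To download PubMed search results, get an NCBI API key here:
# https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities
apikey = MY_LONG_HEX_NCBI_API_KEY
tool = my_scripts
"""


def check_rc_text(text):
    """Return the [pmidcite] keys that are missing or still hold template values."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    if not parser.has_section("pmidcite"):
        return list(KEYS)
    section = parser["pmidcite"]
    return [key for key in KEYS
            if not section.get(key) or section.get(key) in PLACEHOLDERS]


print("Still to edit:", check_rc_text(TEMPLATE))
```

To check a real file, read the path from the `PMIDCITECONF` environment variable (or `~/.pmidciterc`) and pass its text to `check_rc_text`.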


### 2. NCBI E-Utils API key
To download PubMed abstracts and PubMed search results using NCBI's E-Utils,
get an NCBI API key using these instructions:    
https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities

Set the `apikey` value in the config file: `~/.pmidciterc`

# Contributing
See the [**contributing guide**](/docs/CONTRIBUTING.md) for detailed instructions on how to get started contributing to the **pmidcite** project.

# Contact
email: dvklopfenstein@protonmail.com    
https://orcid.org/0000-0003-0161-7603

## How to Cite
_If you use **pmidcite** in your research or literature search, please cite paper 1 (pmidcite) and paper 3 (NIH citation data)._      

_Please also consider reading and citing Gusenbauer's response (paper 2) about improving search for all during the information avalanche of these times:_

1. **The *pmidcite* paper:**    
[**Commentary to Gusenbauer and Haddaway 2020: Evaluating Retrieval Qualities of PubMed and Google Scholar**](http://dx.doi.org/10.1002/jrsm.1456)    
Klopfenstein DV and Dampier W    
2020 | _Research Synthesis Methods_ | PMID: [33031632](https://pubmed.ncbi.nlm.nih.gov/33031632/) | DOI: [10.1002/jrsm.1456](http://dx.doi.org/10.1002/jrsm.1456) | [pdf](/doc/paper/JRSM_1456_iCite_main.pdf)

2. **Gusenbauer's response to the *pmidcite* paper:**    
[**What every Researcher should know about Searching – Clarified Concepts, Search Advice, and an Agenda to improve Finding in Academia**](https://onlinelibrary.wiley.com/doi/10.1002/jrsm.1457)    
Gusenbauer M and Haddaway N    
2020 | _Research Synthesis Methods_ | PMID: [33031639](https://pubmed.ncbi.nlm.nih.gov/33031639/) | DOI: [10.1002/jrsm.1457](https://onlinelibrary.wiley.com/doi/10.1002/jrsm.1457) | [pdf](/doc/paper/jrsm.1457.pdf)

3. **The NIH citation data used by *pmidcite* -- Scientific Influence, Translation, and Citation counts:**     
[**The NIH Open Citation Collection: A public access, broad coverage resource**](https://pubmed.ncbi.nlm.nih.gov/31600197/)    
Hutchins BI ... Santangelo GM    
2019 | _PLoS Biology_ | PMID: [31600197](https://pubmed.ncbi.nlm.nih.gov/31600197) | DOI: [10.1371/journal.pbio.3000385](https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3000385)    

## References

_Please consider reading and citing the paper [4] which inspired the creation of **pmidcite** [1] and the authors' response to our paper [2]_:

4. [**Which Academic Search Systems are Suitable for Systematic Reviews or Meta-Analyses? Evaluating Retrieval Qualities of Google Scholar, PubMed and 26 other Resources**](https://pubmed.ncbi.nlm.nih.gov/31614060/)    
Gusenbauer M and Haddaway N    
2019 | _Research Synthesis Methods_ | PMID: [31614060](https://pubmed.ncbi.nlm.nih.gov/31614060) | DOI: [10.1002/jrsm.1378](https://onlinelibrary.wiley.com/doi/full/10.1002/jrsm.1378)

_Mentioned in this README are also these outstanding contributions_:

5. [**Relative Citation Ratio (RCR): A New Metric That Uses Citation Rates to Measure Influence at the Article Level**](https://pubmed.ncbi.nlm.nih.gov/27599104/)    
Hutchins BI, Yuan X, Anderson JM, and Santangelo GM    
2016 | _PLoS Biology_ | PMID: [27599104](https://pubmed.ncbi.nlm.nih.gov/27599104) | DOI: [10.1371/journal.pbio.1002541](https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1002541)

6. [**Google Scholar as replacement for systematic literature searches: good relative recall and precision are not enough**](https://pubmed.ncbi.nlm.nih.gov/24160679/)    
Boeker M et al.    
2013 | _BMC Medical Research Methodology_ | PMID: [24160679](https://pubmed.ncbi.nlm.nih.gov/24160679) | DOI: [10.1186/1471-2288-13-131](https://bmcmedresmethodol.biomedcentral.com/articles/10.1186/1471-2288-13-131)

7. [**Best Match: New relevance search for PubMed**](https://pubmed.ncbi.nlm.nih.gov/30153250/)    
Fiorini N ... Lu Z    
2018 | _PLoS Biology_ | PMID: [30153250](https://pubmed.ncbi.nlm.nih.gov/30153250) | DOI: [10.1371/journal.pbio.2005343](https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.2005343)    

## [PDFs](/doc/paper/JRSM_1456_iCite_main.pdf)
  * [**PMIDCITE Manuscript**](/doc/paper/JRSM_1456_iCite_main.pdf) with the original text box formatting
    * **Supplemental Material**
      * [S1. *pmidcite* information](/doc/paper/JRSM_1456_iCite_supp1_CitedByN.pdf)
      * [S2. Contents of PubMed](/doc/paper/JRSM_1456_iCite_supp2_PMcontents.pdf)
      * [S3. Screen shots Google Scholar taken Jan 2020](/doc/paper/JRSM_1456_iCite_supp3_GS.pdf)
  * [**Gusenbauer's Response**](/doc/paper/jrsm.1457.pdf)

Copyright (C) 2019-present [pmidcite](https://dvklopfenstein.github.io/pmidcite/), DV Klopfenstein, PhD. All rights reserved.

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "pmidcite",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.9",
    "maintainer_email": "\"DV Klopfenstein, PhD\" <dvklopfenstein@protonmail.com>",
    "keywords": "library, pubmed, ncbi, command-line-tool, literature-mining, citations, Citation-network, google-search, google-scholar",
    "author": null,
    "author_email": "\"DV Klopfenstein, PhD\" <dvklopfenstein@protonmail.com>",
    "download_url": "https://files.pythonhosted.org/packages/bb/89/89fafe0c3b97780c1bee34f14b99769b976ea6f89897d04813e102902c65/pmidcite-0.1.2.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": null,
    "summary": "Turbocharge a PubMed literature search using citation data from the NIH",
    "version": "0.1.2",
    "project_urls": {
        "Changelog": "https://github.com/dvklopfenstein/pmidcite/blob/main/CHANGELOG.rst",
        "Code": "https://github.com/dvklopfenstein/pmidcite",
        "Documentation": "https://dvklopfenstein.readthedocs.io",
        "Homepage": "https://github.com/dvklopfenstein/pmidcite",
        "Issue tracker": "https://github.com/dvklopfenstein/pmidcite/issues"
    },
    "split_keywords": [
        "library",
        " pubmed",
        " ncbi",
        " command-line-tool",
        " literature-mining",
        " citations",
        " citation-network",
        " google-search",
        " google-scholar"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "207647204b73b5cfe916cc0cfc63af4d10d956fb8797403fc40fc3c8e571c669",
                "md5": "3c8bef0893cddbf4e2a1f73d94e17f91",
                "sha256": "fc05cce15f076753464e91cf6a46a6b046d86e1a3d01695b379e500c6f9252a9"
            },
            "downloads": -1,
            "filename": "pmidcite-0.1.2-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "3c8bef0893cddbf4e2a1f73d94e17f91",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.9",
            "size": 2601105,
            "upload_time": "2025-07-25T02:15:47",
            "upload_time_iso_8601": "2025-07-25T02:15:47.510361Z",
            "url": "https://files.pythonhosted.org/packages/20/76/47204b73b5cfe916cc0cfc63af4d10d956fb8797403fc40fc3c8e571c669/pmidcite-0.1.2-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "bb8989fafe0c3b97780c1bee34f14b99769b976ea6f89897d04813e102902c65",
                "md5": "e75a07befd06f0c853c53aefa77a404c",
                "sha256": "3537fd8ca47f64d1db66029c1f72d15f26c409fc3d28062a722fb68a85669ba1"
            },
            "downloads": -1,
            "filename": "pmidcite-0.1.2.tar.gz",
            "has_sig": false,
            "md5_digest": "e75a07befd06f0c853c53aefa77a404c",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.9",
            "size": 10041664,
            "upload_time": "2025-07-25T02:15:39",
            "upload_time_iso_8601": "2025-07-25T02:15:39.842638Z",
            "url": "https://files.pythonhosted.org/packages/bb/89/89fafe0c3b97780c1bee34f14b99769b976ea6f89897d04813e102902c65/pmidcite-0.1.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-07-25 02:15:39",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "dvklopfenstein",
    "github_project": "pmidcite",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "pmidcite"
}
        