collective.documentviewer


Namecollective.documentviewer JSON
Version 6.0.3 PyPI version JSON
download
home_pagehttps://github.com/collective/collective.documentviewer
SummaryDocument cloud's document viewer integration into plone.
upload_time2023-05-17 21:58:56
maintainer
docs_urlNone
authorNathan Van Gheem
requires_python>=2.7,!=3.0.*,!=3.1.*,!=3.2.*,!=3.3.*,!=3.4.*,!=3.5.*
licenseGPL
keywords plone documentviewer pdf ocr doc viewer
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI
coveralls test coverage No coveralls.
            =========================
collective.documentviewer
=========================

``collective.documentviewer`` integrates `DocumentCloud`_ viewer and PDF processing
into `Plone`_.


Examples
========

You can be seen in action the functionality that implements this add-on
at the following sites:

- Example viewer: https://www.documentcloud.org/documents/3219331-FOIA-Request-17-OIG-015.html


Features
========

- Very nice document viewer.

- OCR.

- Searchable on OCR text.

- Works with many different document types.

- `collective.celery`_ integration.

- Lots of configuration options.

- PDF Album view for display groups of PDFs.


Works with
----------

Besides displaying PDFs, it will also display:

- Word.

- Excel.

- Powerpoint.

- HTML.

- RTF.


Translations
============

This product has been translated into

- German.

- Spanish.

- Basque.

- French.

- Italian.

- Dutch.

- Simplified Chinese.

You can contribute for any message missing or other new languages, join us at
`Plone Collective Team <https://www.transifex.com/plone/plone-collective/>`_
into *Transifex.net* service with all world Plone translators community.


Installation
============


Install requirements
--------------------

- GraphicsMagick.

- ghostscript (version 9.0 preferred).

- Poppler

- tesseract (optional)

- qpdf

- OpenOffice or LibreOffice (optional, for doc, excel, ppt, etc. types)

- md5 or md5sum command line tool.


Installation on Cent OS/Red hat
-------------------------------

Special instructions for CentOS have been contributed by Eric Tyrer.
You can access them via `the git hub repo file location <https://github.com/collective/collective.documentviewer/blob/master/CENTOS-INSTALL.rst>`_.


Installation on Debian
----------------------

Special instructions for Debian have been contributed by Leonardo J. Caballero G.
You can access them via the `git hub repo file location <https://github.com/collective/collective.documentviewer/blob/master/DEBIAN-INSTALL.rst>`_.


Issues installation
-------------------

If on a Linux/Ubuntu/Debian machine you run into an error like::

    /var/lib/gems/1.9.1/gems/docsplit-0.7.2/lib/docsplit/image_extractor.rb:51:in `exists?': can't convert nil into String (TypeError)
    from /var/lib/gems/1.9.1/gems/docsplit-0.7.2/lib/docsplit/image_extractor.rb:51:in `ensure in convert'

This is because the ruby docsplit library is having an issue with the temp
folder accesses, and removal of temp files.   Just run the following command::

    sudo chmod 1777 /tmp && sudo chmod 1777 /var/tmp

And retry the conversion of your document


Development
===========

Normal flow: ::

    git clone git@github.com:collective/collective.documentviewer.git

    cd collective.documentviewer

    virtualenv .

    bin/pip install -r requirements.txt

    bin/buildout


Async Integration
-----------------

It it highly recommended to install and configure `collective.celery`_
in combination with this package. Doing so will manage all PDF
conversions processes asynchronously so the user isn't delayed
so much when saving files.


Settings
--------

The product can be configured via a control panel item
`Document Viewer Settings`.

Some interesting configuration options:

Storage Type
    If you want to be able to serve you files via Amazon Cloud,
    this will allow you to store the data in flat files that
    can be synced to another server.
Storage Location
    Where are the server to store the files.
OCR
    Use ``tesseract`` to scan the document for text. This process can be
    slow so if your PDFs do not need to be OCR'd, you may disable.
Auto Select Layout
    For PDF files added to the site, automatically select the
    document viewer display.
Auto Convert
    When PDF files are added and modified, automatically convert.
Auto layout file types
    Types that should automatically be converted to document viewer.


Dexterity support
-----------------

If you want to use it with your own Dexterity content type. You need to edit
the ``FTI`` in ``ZMI/portal_types/yourtype`` to add "documentviewer" in
the available view methods like this: ::

    <property name="view_methods" purge="False">
      <element value="documentviewer"/>
    </property>

Also you need to set the primary field in the schema, for example: ::

    <field name="myfile" marshal:primary="true"
           type="plone.namedfile.field.NamedBlobFile">


File storage integration
------------------------

If you choose to use basic file storage instead of ZODB blob storage,
there are a few things you'll want to keep in mind.

1) Use `Nginx`_ to then serve the file system files. This might require
   you install a local Nginx just for serving file storage on the
   Plone server. You can get creative with how your file storage
   is used though.

2) Since in Plone's delete operation, it can be interrupted and the deletion
   of a file on the OS system system can not be done within a transaction,
   no files are ever deleted. However, there is an action you can
   put in a `cron`_ task to clean up your file storage directory. Just call the
   url `http://zeoinstace/plone/@@dvcleanup-filestorage`.


Upgrading from page turner
--------------------------

If you currently have page turner installed, this project will supercede 
it. Your page turner views will work but no future files added to the site
will be converted to page turner.

To convert existing view, on every page turner enabled file, there will
be a button `Document Viewer Convert` that you can click to manually
convert page turner to document viewer.

To convert all existing views, go to ``portal_setup`` in the ZMI, upgrades,
select ``collective.documentviewer``, click to show old upgrades and there
should be an `upgrade-all` step to run.


Tests status
============

This add-on is tested using Travis CI. The current status of the add-on is:

.. image:: https://travis-ci.org/collective/collective.documentviewer.svg?branch=master
   :alt: Travis CI badge
   :target: https://travis-ci.org/collective/collective.documentviewer

.. image:: http://img.shields.io/pypi/v/collective.documentviewer.svg
   :alt: PyPI badge
   :target: https://pypi.org/project/collective.documentviewer


Contribute
==========

Have an idea? Found a bug? Let us know by `opening a ticket`_.

- Issue Tracker: https://github.com/collective/collective.documentviewer/issues
- Source Code: https://github.com/collective/collective.documentviewer
- Documentation: https://www.documentcloud.org/


Authors
=======

This product was developed by Wildcard Corp. team.

.. image:: https://raw.githubusercontent.com/collective/collective.documentviewer/i18n_improvements/docs/_static/wildcardcorp_logo.png
   :height: 111px
   :width: 330px
   :alt: Produced by wildcardcorp.com
   :align: right


License
=======

The project is licensed under the GPLv2.

.. _DocumentCloud: https://www.documentcloud.org/
.. _Plone: https://plone.org/
.. _collective.celery: https://pypi.org/project/collective.celery/
.. _Nginx: https://nginx.org/
.. _cron: https://crontab.guru/
.. _`opening a ticket`: https://github.com/collective/collective.documentviewer/issues

Changelog
=========

6.0.3 (2023-05-17)
------------------

- Allow usage of extra parameters with qpdf using `DOCUMENTVIEWER_QPDF_PARAMETERS` environment variable
  [mpeeters]


6.0.2 (2023-04-27)
------------------

- escape querystring search value [kleier]


6.0.1 (2021-02-03)
------------------

- Improved uninstall.  [pbauer]


6.0.0 (2020-05-12)
------------------

- Added Transifex.net service integration to manage the translation process.
  [macagua]

- Updated Spanish translation.
  [macagua]

- Updated the i18n support.
  [macagua]

- Fix the download link for the document. #78
  [b4oshany]

- Replaced ``docsplit``.  Instead call the various packages directly.
  See `pull request #79 <https://github.com/collective/collective.documentviewer/pull/79>`_.
  [alphaomega325]

- Python 3, Plone 5.2 compatible
  [vangheem]


5.0.4 (2019-01-09)
------------------

- use `plone.api.portal.get` instead of `getToolByName`
  [vangheem]


5.0.3 (2018-12-11)
------------------

- Handle possible error with no celery installed
  [vangheem]

5.0.2 (2018-12-11)
------------------

- Fix to work with latest collective.celery
  [vangheem]

- Fix issue breaking zoom on the 1st page of PDFs
  [obct537]

- Add function and browser view (``convert_all_unconverted``) to convert all files, which haven't been converted yet.
  [thet]

- Do not break if no global request is set. Fixes #71
  [ale-rt]

- Fix redundant condition
  [ale-rt]

- Handle plone.app.contenttypes file indexing.
  [thet]

- Add a custom migrator for plone.app.contenttypes and avoid converting while migrating to plone.app.contettypes.
  [thet]

- Added support for libreoffice under Nixos, which uses a different folder name
  for its conversion directories
  [pysailor]


5.0.1 (2017-07-16)
------------------

- Fix issue breaking zoom on the 1st page of PDFs
  [obct537]

5.0.0 (2016-09-02)
------------------

- log exception when pulling OCR text instead of completely erroring
  [vangheem]

5.0.0a1 (2016-08-05)
--------------------

- plone 5/dexterity only now
  [vangheem]

4.1.0 (2017-05-15)
------------------

- Persist converted pdf and publish it as pdf/dump.pdf and @@pdf.
  [jaroel]

4.0.14 (2016-08-01)
-------------------

- Added italian translation
  [keul]

- Fixed JavaScript issue on Chrome: expected global variable ``sidebar`` was not global
  [keul]

- handle conflict errors in async processes better
  [vangheem]

4.0.13 (2016-07-22)
-------------------

- hide doc viewer from screen readers since they won't be able to make
  sense of <img> tags for pdf pages.
  [vangheem]

4.0.12 (2016-07-22)
-------------------

- Handle errors when searching
  [vangheem]


4.0.11 (2016-07-12)
-------------------

- handle no blob found on content
  [vangheem]

4.0.10 (2016-06-30)
-------------------

- fix release

4.0.9 (2016-06-30)
------------------

- Handle file deleted to clean up files
  [vangheem]

- fix not being able to hide sidebar
  [vangheem]


4.0.8 (2016-06-21)
------------------

- use checkboxes for auto layout
  [vangheem]

4.0.7 (2016-06-20)
------------------

- do not convert Image types
  [vangheem]

- be able to completely hide contributor
  [vangheem]


4.0.6 (2016-01-27)
------------------

- make it so viewer can be responsive
  [vangheem]

4.0.5 (2016-01-25)
------------------

- reindex object after lead image is set so icon can be retrieved
  [vangheem]


4.0.4 (2016-01-25)
------------------

- fix celery conversation showing that it is still
  converting
  [vangheem]

4.0.3 (2015-09-30)
------------------

- fix import of namedfile, restores older plone compatibility
  [vangheem]

4.0.2 (2015-09-30)
------------------

- fix support for archetypes
  [vangheem]

4.0.1 (2015-09-28)
------------------

- add lead image support
  [vangheem]

- be able to use collective.celery for queuing tasks
  [vangheem]

- fix async monitor registration
  [pilz]


4.0.0 (2015-09-09)
------------------

- fix Plone 5 compatibility
  [vangheem]

- upgrade jquery.imgareaselect to latest
  [vangheem]

- upgrade document viewer to latest
  [vangheem]

- do not support upgrading from wildcard.pdfpal and wc.pageturner anymore. Use 3.x series
  [vangheem]

3.0.3 (2015-07-29)
------------------

- set response header on javascript variable file. Prevents js errors on chrome.
  [vangheem]

3.0.2 (2014-05-31)
------------------

- fix bug where it wouldn't work with collective.geo.*
  [vangheem]


3.0.1 (2014-05-08)
------------------

- add german translation
  [jhb]


3.0a1 (2013-09-03)
------------------

- Add Dexterity compatibility. To enable it on your content type, you have
  to define a primary field and add documentviewer in the available view
  methods, see documentation.
  [vincentfretin]

- Fix: users that can modify can now view info messages
  and 'annotations'/'sections' feature.
  [thomasdesvenain]

- Show contributor fullname if possible.
  Contributor and organization are in a span.
  [thomasdesvenain]

- Avoid replacing non-ascii characters by (?) during OCR process
  for non english languages.
  [thomasdesvenain]

- Plain text indexation is fixed for non converted contents.
  [thomasdesvenain]

- When a new release of the document is currently generated,
  user is notified by a status message.
  [thomasdesvenain]


2.2.2b3 (2013-05-31)
--------------------

- i18n fixes + french translations
  [thomasdesvenain]

- support to pass a document language to tesseract/docsplit based on a
  configurable adapter implementing IOCRLanguage
  [ajung]


2.2.2b2 (2013-05-31)
--------------------

- fix bug when using blob storage and text indexing is disabled
  [gbastien]


2.2.2b1 (2013-05-31)
--------------------

- only use defaultFactory when supported. For older versions of zope.schema
  [vangheem]


2.2.2a1 (2013-05-31)
--------------------

- added french translations
  [gbastien]

- added enable_indexation parameter in global and local settings
  Fixes : https://github.com/collective/collective.documentviewer/issues/21
  [gbastien]

- make local settings coherent regarding global settings
  Fixes : https://github.com/collective/collective.documentviewer/issues/22
  [gbastien]


2.2.1 (2013-03-12)
------------------

- fix use with latest libreoffice and docsplit.
  Fixes: https://github.com/collective/collective.documentviewer/issues/11

- do not require docsplit to be installed on the
  plone instance in order to display the viewer.
  In case the document was converted on another
  client.
  [vangheem]


2.2 (2013-02-06)
----------------

- fix z-index on viewer
  [damilgra]


2.2b2 (2013-01-10)
------------------

- fix getSite imports for plone 4.3


2.2b1 (2013-01-06)
------------------

- switch to using OFS.interfaces.IFolder for folder view
  [vangheem]

- while pdf is converting, show existing if available.
  [vangheem]

- move convert button to actions
  [vangheem]


2.2a2 (2012-10-01)
------------------

- another subsite fix
  [vangheem]


2.2a1 (2012-xx-xx)
------------------

- test for Plone 4.2 compatibility.
  [hvelarde]

- work with subsites

2.1b2 (2012-06-22)
------------------

- better handling of moving folders around

2.1b1 (2012-06-22)
------------------

- be able to obfuscate file paths for file storage

2.0.4 (2012-06-21)
------------------

- fix cleaning file location

- fix potential tranversal error for file resources


2.0.3 (2012-06-13)
------------------

- check for quota set before finding existing jobs.

2.0.2 (2012-06-12)
------------------

- include contentmenu zcml dependency

- upgrade conversion will now try and fix error'd conversions


2.0.1 (2012-05-15)
------------------

- fixing batching on group view

2.0.1b1 (2012-05-14)
--------------------

- add support for new formats: star office, ps, photoshop, visio, palm

2.0b1 (2012-05-11)
------------------

- add ability to add annotations and sections


1.5.1 (2012-04-30)
------------------

- fix security on file resources


1.5.0 (2012-04-29)
------------------

- no changes

1.5.0b1 (2012-04-27)
--------------------

- be able to move jobs to front of queue

- use portal_catalog instead of uid_catalog so security
  checks apply to resource urls.


1.4.2 (2012-04-24)
------------------

- no changes, first final release


1.4.1b3 (2012-04-23)
--------------------

- create local catalog and index before syncing db to prevent
  conflict errors.

- add redirect timeout to conversion info page


1.4.1b2 (2012-04-23)
--------------------

- make sure to close open file descriptors

- Change "Original Document (PDF)" to "Original Document"

- emit event after conversion

- only show queue link if manager

- convert button should work for files that do not have
  layout selected yet

- use communicate instead of wait with popen in case
  output is large. Prevents deadlocks.


1.4.1b1 (2012-04-23)
--------------------

- do not assume pdfpal is used along with pageturner
  on data conversion.

- better command runner

- track errors better and display them in interface if something
  happened during conversion

- new file storage structure to prevent too many files from
  being in one directory


1.4b1 (2012-04-21)
------------------

- fix full screen button when text or pages selected.

- be able to customize batch size


1.4a2 (2012-04-20)
------------------

- make sure to not use files with spaces


1.4a1 (2012-04-20)
------------------

- be able to detect if pdf already has text in it
  and do not OCR it if it does.


1.3b2 (2012-04-20)
------------------

- use jQuery instead of $()


1.3b1 (2012-04-20)
------------------

- default OCR to being off since it's pretty slow

- better logging when looking for binary files

- be able to override width of viewer


1.3a3 (2012-04-20)
------------------

- fix uninstall
  [vangheem]


1.3a2 (2012-04-19)
------------------

- fix async bug if it wasn't installed
  [vangheem]


1.3a1 (2012-04-19)
------------------

- make sure to initialize catalog after db sync
  for large PDFs.
  [vangheem]

- better integrate with pdfpal and pageturner so
  it's easy to upgrade from those products.
  [vangheem]


1.2a2 (2012-04-19)
------------------

- fix setting custom quota for async queue
  [vangheem]

- fix group view clear button
  [vangheem]

- add support for alternative md5sum binary
  [vangheem]


1.2a1 (2012-04-19)
------------------

- fix full screen page bug
  [vangheem]

- better async integration with quota setting
  [vangheem]

- View async queue for conversions
  [vangheem]

- index ocr data in portal catalog
  [vangheem]

- better pdf group view with search
  [vangheem]

- handle large files better
  [vangheem]

- check if file has already been converted by storing
  hash of the file to check against.
  [vangheem]

- be able to remove document viewer conversion tasks
  [vangheem]

- add ability to cleanup file storage files for deleted
  plone File objects.
  [vangheem]


1.1a1 (2012-04-18)
------------------

- add pdf folder album view
  [vangheem]

- fix async integration
  [vangheem]


1.0a2 (2012-04-17)
------------------

- add control panel icon
  [vangheem]

- fix uninstall procedure
  [vangheem]

- changing image type does not cause existing ones to fail.
  [vangheem]


1.0a1 (2012-04-17)
------------------

- Initial release

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/collective/collective.documentviewer",
    "name": "collective.documentviewer",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=2.7,!=3.0.*,!=3.1.*,!=3.2.*,!=3.3.*,!=3.4.*,!=3.5.*",
    "maintainer_email": "",
    "keywords": "plone documentviewer pdf ocr doc viewer",
    "author": "Nathan Van Gheem",
    "author_email": "vangheem@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/3f/9f/344f1bb922cc9b93cf3d37856763306d62361c5d575a7719eee26f69823b/collective.documentviewer-6.0.3.tar.gz",
    "platform": null,
    "description": "=========================\ncollective.documentviewer\n=========================\n\n``collective.documentviewer`` integrates `DocumentCloud`_ viewer and PDF processing\ninto `Plone`_.\n\n\nExamples\n========\n\nYou can be seen in action the functionality that implements this add-on\nat the following sites:\n\n- Example viewer: https://www.documentcloud.org/documents/3219331-FOIA-Request-17-OIG-015.html\n\n\nFeatures\n========\n\n- Very nice document viewer.\n\n- OCR.\n\n- Searchable on OCR text.\n\n- Works with many different document types.\n\n- `collective.celery`_ integration.\n\n- Lots of configuration options.\n\n- PDF Album view for display groups of PDFs.\n\n\nWorks with\n----------\n\nBesides displaying PDFs, it will also display:\n\n- Word.\n\n- Excel.\n\n- Powerpoint.\n\n- HTML.\n\n- RTF.\n\n\nTranslations\n============\n\nThis product has been translated into\n\n- German.\n\n- Spanish.\n\n- Basque.\n\n- French.\n\n- Italian.\n\n- Dutch.\n\n- Simplified Chinese.\n\nYou can contribute for any message missing or other new languages, join us at\n`Plone Collective Team <https://www.transifex.com/plone/plone-collective/>`_\ninto *Transifex.net* service with all world Plone translators community.\n\n\nInstallation\n============\n\n\nInstall requirements\n--------------------\n\n- GraphicsMagick.\n\n- ghostscript (version 9.0 preferred).\n\n- Poppler\n\n- tesseract (optional)\n\n- qpdf\n\n- OpenOffice or LibreOffice (optional, for doc, excel, ppt, etc. types)\n\n- md5 or md5sum command line tool.\n\n\nInstallation on Cent OS/Red hat\n-------------------------------\n\nSpecial instructions for CentOS have been contributed by Eric Tyrer.\nYou can access them via `the git hub repo file location <https://github.com/collective/collective.documentviewer/blob/master/CENTOS-INSTALL.rst>`_.\n\n\nInstallation on Debian\n----------------------\n\nSpecial instructions for Debian have been contributed by Leonardo J. Caballero G.\nYou can access them via the `git hub repo file location <https://github.com/collective/collective.documentviewer/blob/master/DEBIAN-INSTALL.rst>`_.\n\n\nIssues installation\n-------------------\n\nIf on a Linux/Ubuntu/Debian machine you run into an error like::\n\n    /var/lib/gems/1.9.1/gems/docsplit-0.7.2/lib/docsplit/image_extractor.rb:51:in `exists?': can't convert nil into String (TypeError)\n    from /var/lib/gems/1.9.1/gems/docsplit-0.7.2/lib/docsplit/image_extractor.rb:51:in `ensure in convert'\n\nThis is because the ruby docsplit library is having an issue with the temp\nfolder accesses, and removal of temp files.   Just run the following command::\n\n    sudo chmod 1777 /tmp && sudo chmod 1777 /var/tmp\n\nAnd retry the conversion of your document\n\n\nDevelopment\n===========\n\nNormal flow: ::\n\n    git clone git@github.com:collective/collective.documentviewer.git\n\n    cd collective.documentviewer\n\n    virtualenv .\n\n    bin/pip install -r requirements.txt\n\n    bin/buildout\n\n\nAsync Integration\n-----------------\n\nIt it highly recommended to install and configure `collective.celery`_\nin combination with this package. Doing so will manage all PDF\nconversions processes asynchronously so the user isn't delayed\nso much when saving files.\n\n\nSettings\n--------\n\nThe product can be configured via a control panel item\n`Document Viewer Settings`.\n\nSome interesting configuration options:\n\nStorage Type\n    If you want to be able to serve you files via Amazon Cloud,\n    this will allow you to store the data in flat files that\n    can be synced to another server.\nStorage Location\n    Where are the server to store the files.\nOCR\n    Use ``tesseract`` to scan the document for text. This process can be\n    slow so if your PDFs do not need to be OCR'd, you may disable.\nAuto Select Layout\n    For PDF files added to the site, automatically select the\n    document viewer display.\nAuto Convert\n    When PDF files are added and modified, automatically convert.\nAuto layout file types\n    Types that should automatically be converted to document viewer.\n\n\nDexterity support\n-----------------\n\nIf you want to use it with your own Dexterity content type. You need to edit\nthe ``FTI`` in ``ZMI/portal_types/yourtype`` to add \"documentviewer\" in\nthe available view methods like this: ::\n\n    <property name=\"view_methods\" purge=\"False\">\n      <element value=\"documentviewer\"/>\n    </property>\n\nAlso you need to set the primary field in the schema, for example: ::\n\n    <field name=\"myfile\" marshal:primary=\"true\"\n           type=\"plone.namedfile.field.NamedBlobFile\">\n\n\nFile storage integration\n------------------------\n\nIf you choose to use basic file storage instead of ZODB blob storage,\nthere are a few things you'll want to keep in mind.\n\n1) Use `Nginx`_ to then serve the file system files. This might require\n   you install a local Nginx just for serving file storage on the\n   Plone server. You can get creative with how your file storage\n   is used though.\n\n2) Since in Plone's delete operation, it can be interrupted and the deletion\n   of a file on the OS system system can not be done within a transaction,\n   no files are ever deleted. However, there is an action you can\n   put in a `cron`_ task to clean up your file storage directory. Just call the\n   url `http://zeoinstace/plone/@@dvcleanup-filestorage`.\n\n\nUpgrading from page turner\n--------------------------\n\nIf you currently have page turner installed, this project will supercede \nit. Your page turner views will work but no future files added to the site\nwill be converted to page turner.\n\nTo convert existing view, on every page turner enabled file, there will\nbe a button `Document Viewer Convert` that you can click to manually\nconvert page turner to document viewer.\n\nTo convert all existing views, go to ``portal_setup`` in the ZMI, upgrades,\nselect ``collective.documentviewer``, click to show old upgrades and there\nshould be an `upgrade-all` step to run.\n\n\nTests status\n============\n\nThis add-on is tested using Travis CI. The current status of the add-on is:\n\n.. image:: https://travis-ci.org/collective/collective.documentviewer.svg?branch=master\n   :alt: Travis CI badge\n   :target: https://travis-ci.org/collective/collective.documentviewer\n\n.. image:: http://img.shields.io/pypi/v/collective.documentviewer.svg\n   :alt: PyPI badge\n   :target: https://pypi.org/project/collective.documentviewer\n\n\nContribute\n==========\n\nHave an idea? Found a bug? Let us know by `opening a ticket`_.\n\n- Issue Tracker: https://github.com/collective/collective.documentviewer/issues\n- Source Code: https://github.com/collective/collective.documentviewer\n- Documentation: https://www.documentcloud.org/\n\n\nAuthors\n=======\n\nThis product was developed by Wildcard Corp. team.\n\n.. image:: https://raw.githubusercontent.com/collective/collective.documentviewer/i18n_improvements/docs/_static/wildcardcorp_logo.png\n   :height: 111px\n   :width: 330px\n   :alt: Produced by wildcardcorp.com\n   :align: right\n\n\nLicense\n=======\n\nThe project is licensed under the GPLv2.\n\n.. _DocumentCloud: https://www.documentcloud.org/\n.. _Plone: https://plone.org/\n.. _collective.celery: https://pypi.org/project/collective.celery/\n.. _Nginx: https://nginx.org/\n.. _cron: https://crontab.guru/\n.. _`opening a ticket`: https://github.com/collective/collective.documentviewer/issues\n\nChangelog\n=========\n\n6.0.3 (2023-05-17)\n------------------\n\n- Allow usage of extra parameters with qpdf using `DOCUMENTVIEWER_QPDF_PARAMETERS` environment variable\n  [mpeeters]\n\n\n6.0.2 (2023-04-27)\n------------------\n\n- escape querystring search value [kleier]\n\n\n6.0.1 (2021-02-03)\n------------------\n\n- Improved uninstall.  [pbauer]\n\n\n6.0.0 (2020-05-12)\n------------------\n\n- Added Transifex.net service integration to manage the translation process.\n  [macagua]\n\n- Updated Spanish translation.\n  [macagua]\n\n- Updated the i18n support.\n  [macagua]\n\n- Fix the download link for the document. #78\n  [b4oshany]\n\n- Replaced ``docsplit``.  Instead call the various packages directly.\n  See `pull request #79 <https://github.com/collective/collective.documentviewer/pull/79>`_.\n  [alphaomega325]\n\n- Python 3, Plone 5.2 compatible\n  [vangheem]\n\n\n5.0.4 (2019-01-09)\n------------------\n\n- use `plone.api.portal.get` instead of `getToolByName`\n  [vangheem]\n\n\n5.0.3 (2018-12-11)\n------------------\n\n- Handle possible error with no celery installed\n  [vangheem]\n\n5.0.2 (2018-12-11)\n------------------\n\n- Fix to work with latest collective.celery\n  [vangheem]\n\n- Fix issue breaking zoom on the 1st page of PDFs\n  [obct537]\n\n- Add function and browser view (``convert_all_unconverted``) to convert all files, which haven't been converted yet.\n  [thet]\n\n- Do not break if no global request is set. Fixes #71\n  [ale-rt]\n\n- Fix redundant condition\n  [ale-rt]\n\n- Handle plone.app.contenttypes file indexing.\n  [thet]\n\n- Add a custom migrator for plone.app.contenttypes and avoid converting while migrating to plone.app.contettypes.\n  [thet]\n\n- Added support for libreoffice under Nixos, which uses a different folder name\n  for its conversion directories\n  [pysailor]\n\n\n5.0.1 (2017-07-16)\n------------------\n\n- Fix issue breaking zoom on the 1st page of PDFs\n  [obct537]\n\n5.0.0 (2016-09-02)\n------------------\n\n- log exception when pulling OCR text instead of completely erroring\n  [vangheem]\n\n5.0.0a1 (2016-08-05)\n--------------------\n\n- plone 5/dexterity only now\n  [vangheem]\n\n4.1.0 (2017-05-15)\n------------------\n\n- Persist converted pdf and publish it as pdf/dump.pdf and @@pdf.\n  [jaroel]\n\n4.0.14 (2016-08-01)\n-------------------\n\n- Added italian translation\n  [keul]\n\n- Fixed JavaScript issue on Chrome: expected global variable ``sidebar`` was not global\n  [keul]\n\n- handle conflict errors in async processes better\n  [vangheem]\n\n4.0.13 (2016-07-22)\n-------------------\n\n- hide doc viewer from screen readers since they won't be able to make\n  sense of <img> tags for pdf pages.\n  [vangheem]\n\n4.0.12 (2016-07-22)\n-------------------\n\n- Handle errors when searching\n  [vangheem]\n\n\n4.0.11 (2016-07-12)\n-------------------\n\n- handle no blob found on content\n  [vangheem]\n\n4.0.10 (2016-06-30)\n-------------------\n\n- fix release\n\n4.0.9 (2016-06-30)\n------------------\n\n- Handle file deleted to clean up files\n  [vangheem]\n\n- fix not being able to hide sidebar\n  [vangheem]\n\n\n4.0.8 (2016-06-21)\n------------------\n\n- use checkboxes for auto layout\n  [vangheem]\n\n4.0.7 (2016-06-20)\n------------------\n\n- do not convert Image types\n  [vangheem]\n\n- be able to completely hide contributor\n  [vangheem]\n\n\n4.0.6 (2016-01-27)\n------------------\n\n- make it so viewer can be responsive\n  [vangheem]\n\n4.0.5 (2016-01-25)\n------------------\n\n- reindex object after lead image is set so icon can be retrieved\n  [vangheem]\n\n\n4.0.4 (2016-01-25)\n------------------\n\n- fix celery conversation showing that it is still\n  converting\n  [vangheem]\n\n4.0.3 (2015-09-30)\n------------------\n\n- fix import of namedfile, restores older plone compatibility\n  [vangheem]\n\n4.0.2 (2015-09-30)\n------------------\n\n- fix support for archetypes\n  [vangheem]\n\n4.0.1 (2015-09-28)\n------------------\n\n- add lead image support\n  [vangheem]\n\n- be able to use collective.celery for queuing tasks\n  [vangheem]\n\n- fix async monitor registration\n  [pilz]\n\n\n4.0.0 (2015-09-09)\n------------------\n\n- fix Plone 5 compatibility\n  [vangheem]\n\n- upgrade jquery.imgareaselect to latest\n  [vangheem]\n\n- upgrade document viewer to latest\n  [vangheem]\n\n- do not support upgrading from wildcard.pdfpal and wc.pageturner anymore. Use 3.x series\n  [vangheem]\n\n3.0.3 (2015-07-29)\n------------------\n\n- set response header on javascript variable file. Prevents js errors on chrome.\n  [vangheem]\n\n3.0.2 (2014-05-31)\n------------------\n\n- fix bug where it wouldn't work with collective.geo.*\n  [vangheem]\n\n\n3.0.1 (2014-05-08)\n------------------\n\n- add german translation\n  [jhb]\n\n\n3.0a1 (2013-09-03)\n------------------\n\n- Add Dexterity compatibility. To enable it on your content type, you have\n  to define a primary field and add documentviewer in the available view\n  methods, see documentation.\n  [vincentfretin]\n\n- Fix: users that can modify can now view info messages\n  and 'annotations'/'sections' feature.\n  [thomasdesvenain]\n\n- Show contributor fullname if possible.\n  Contributor and organization are in a span.\n  [thomasdesvenain]\n\n- Avoid replacing non-ascii characters by (?) during OCR process\n  for non english languages.\n  [thomasdesvenain]\n\n- Plain text indexation is fixed for non converted contents.\n  [thomasdesvenain]\n\n- When a new release of the document is currently generated,\n  user is notified by a status message.\n  [thomasdesvenain]\n\n\n2.2.2b3 (2013-05-31)\n--------------------\n\n- i18n fixes + french translations\n  [thomasdesvenain]\n\n- support to pass a document language to tesseract/docsplit based on a\n  configurable adapter implementing IOCRLanguage\n  [ajung]\n\n\n2.2.2b2 (2013-05-31)\n--------------------\n\n- fix bug when using blob storage and text indexing is disabled\n  [gbastien]\n\n\n2.2.2b1 (2013-05-31)\n--------------------\n\n- only use defaultFactory when supported. For older versions of zope.schema\n  [vangheem]\n\n\n2.2.2a1 (2013-05-31)\n--------------------\n\n- added french translations\n  [gbastien]\n\n- added enable_indexation parameter in global and local settings\n  Fixes : https://github.com/collective/collective.documentviewer/issues/21\n  [gbastien]\n\n- make local settings coherent regarding global settings\n  Fixes : https://github.com/collective/collective.documentviewer/issues/22\n  [gbastien]\n\n\n2.2.1 (2013-03-12)\n------------------\n\n- fix use with latest libreoffice and docsplit.\n  Fixes: https://github.com/collective/collective.documentviewer/issues/11\n\n- do not require docsplit to be installed on the\n  plone instance in order to display the viewer.\n  In case the document was converted on another\n  client.\n  [vangheem]\n\n\n2.2 (2013-02-06)\n----------------\n\n- fix z-index on viewer\n  [damilgra]\n\n\n2.2b2 (2013-01-10)\n------------------\n\n- fix getSite imports for plone 4.3\n\n\n2.2b1 (2013-01-06)\n------------------\n\n- switch to using OFS.interfaces.IFolder for folder view\n  [vangheem]\n\n- while pdf is converting, show existing if available.\n  [vangheem]\n\n- move convert button to actions\n  [vangheem]\n\n\n2.2a2 (2012-10-01)\n------------------\n\n- another subsite fix\n  [vangheem]\n\n\n2.2a1 (2012-xx-xx)\n------------------\n\n- test for Plone 4.2 compatibility.\n  [hvelarde]\n\n- work with subsites\n\n2.1b2 (2012-06-22)\n------------------\n\n- better handling of moving folders around\n\n2.1b1 (2012-06-22)\n------------------\n\n- be able to obfuscate file paths for file storage\n\n2.0.4 (2012-06-21)\n------------------\n\n- fix cleaning file location\n\n- fix potential tranversal error for file resources\n\n\n2.0.3 (2012-06-13)\n------------------\n\n- check for quota set before finding existing jobs.\n\n2.0.2 (2012-06-12)\n------------------\n\n- include contentmenu zcml dependency\n\n- upgrade conversion will now try and fix error'd conversions\n\n\n2.0.1 (2012-05-15)\n------------------\n\n- fixing batching on group view\n\n2.0.1b1 (2012-05-14)\n--------------------\n\n- add support for new formats: star office, ps, photoshop, visio, palm\n\n2.0b1 (2012-05-11)\n------------------\n\n- add ability to add annotations and sections\n\n\n1.5.1 (2012-04-30)\n------------------\n\n- fix security on file resources\n\n\n1.5.0 (2012-04-29)\n------------------\n\n- no changes\n\n1.5.0b1 (2012-04-27)\n--------------------\n\n- be able to move jobs to front of queue\n\n- use portal_catalog instead of uid_catalog so security\n  checks apply to resource urls.\n\n\n1.4.2 (2012-04-24)\n------------------\n\n- no changes, first final release\n\n\n1.4.1b3 (2012-04-23)\n--------------------\n\n- create local catalog and index before syncing db to prevent\n  conflict errors.\n\n- add redirect timeout to conversion info page\n\n\n1.4.1b2 (2012-04-23)\n--------------------\n\n- make sure to close open file descriptors\n\n- Change \"Original Document (PDF)\" to \"Original Document\"\n\n- emit event after conversion\n\n- only show queue link if manager\n\n- convert button should work for files that do not have\n  layout selected yet\n\n- use communicate instead of wait with popen in case\n  output is large. Prevents deadlocks.\n\n\n1.4.1b1 (2012-04-23)\n--------------------\n\n- do not assume pdfpal is used along with pageturner\n  on data conversion.\n\n- better command runner\n\n- track errors better and display them in interface if something\n  happened during conversion\n\n- new file storage structure to prevent too many files from\n  being in one directory\n\n\n1.4b1 (2012-04-21)\n------------------\n\n- fix full screen button when text or pages selected.\n\n- be able to customize batch size\n\n\n1.4a2 (2012-04-20)\n------------------\n\n- make sure to not use files with spaces\n\n\n1.4a1 (2012-04-20)\n------------------\n\n- be able to detect if pdf already has text in it\n  and do not OCR it if it does.\n\n\n1.3b2 (2012-04-20)\n------------------\n\n- use jQuery instead of $()\n\n\n1.3b1 (2012-04-20)\n------------------\n\n- default OCR to being off since it's pretty slow\n\n- better logging when looking for binary files\n\n- be able to override width of viewer\n\n\n1.3a3 (2012-04-20)\n------------------\n\n- fix uninstall\n  [vangheem]\n\n\n1.3a2 (2012-04-19)\n------------------\n\n- fix async bug if it wasn't installed\n  [vangheem]\n\n\n1.3a1 (2012-04-19)\n------------------\n\n- make sure to initialize catalog after db sync\n  for large PDFs.\n  [vangheem]\n\n- better integrate with pdfpal and pageturner so\n  it's easy to upgrade from those products.\n  [vangheem]\n\n\n1.2a2 (2012-04-19)\n------------------\n\n- fix setting custom quota for async queue\n  [vangheem]\n\n- fix group view clear button\n  [vangheem]\n\n- add support for alternative md5sum binary\n  [vangheem]\n\n\n1.2a1 (2012-04-19)\n------------------\n\n- fix full screen page bug\n  [vangheem]\n\n- better async integration with quota setting\n  [vangheem]\n\n- View async queue for conversions\n  [vangheem]\n\n- index ocr data in portal catalog\n  [vangheem]\n\n- better pdf group view with search\n  [vangheem]\n\n- handle large files better\n  [vangheem]\n\n- check if file has already been converted by storing\n  hash of the file to check against.\n  [vangheem]\n\n- be able to remove document viewer conversion tasks\n  [vangheem]\n\n- add ability to cleanup file storage files for deleted\n  plone File objects.\n  [vangheem]\n\n\n1.1a1 (2012-04-18)\n------------------\n\n- add pdf folder album view\n  [vangheem]\n\n- fix async integration\n  [vangheem]\n\n\n1.0a2 (2012-04-17)\n------------------\n\n- add control panel icon\n  [vangheem]\n\n- fix uninstall procedure\n  [vangheem]\n\n- changing image type does not cause existing ones to fail.\n  [vangheem]\n\n\n1.0a1 (2012-04-17)\n------------------\n\n- Initial release\n",
    "bugtrack_url": null,
    "license": "GPL",
    "summary": "Document cloud's document viewer integration into plone.",
    "version": "6.0.3",
    "project_urls": {
        "Homepage": "https://github.com/collective/collective.documentviewer"
    },
    "split_keywords": [
        "plone",
        "documentviewer",
        "pdf",
        "ocr",
        "doc",
        "viewer"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "54e950deecd48a96e24f69195f550d8932d12f25037c4b27b663546264679e63",
                "md5": "571bc6bbccccac84539ea33620d90d66",
                "sha256": "5ef186c0b9d4d060fbf7d6aee45ce199439ddce50794f939a0cba3689f38f7f7"
            },
            "downloads": -1,
            "filename": "collective.documentviewer-6.0.3-py2.py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "571bc6bbccccac84539ea33620d90d66",
            "packagetype": "bdist_wheel",
            "python_version": "py2.py3",
            "requires_python": ">=2.7,!=3.0.*,!=3.1.*,!=3.2.*,!=3.3.*,!=3.4.*,!=3.5.*",
            "size": 470253,
            "upload_time": "2023-05-17T21:58:53",
            "upload_time_iso_8601": "2023-05-17T21:58:53.477221Z",
            "url": "https://files.pythonhosted.org/packages/54/e9/50deecd48a96e24f69195f550d8932d12f25037c4b27b663546264679e63/collective.documentviewer-6.0.3-py2.py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "3f9f344f1bb922cc9b93cf3d37856763306d62361c5d575a7719eee26f69823b",
                "md5": "6d4a927be4a11ade7073d5f683b7163c",
                "sha256": "1b29a085cb01b3cae3df53342fc2a2a5fb16730d014f52bde07a94c5943eb7d2"
            },
            "downloads": -1,
            "filename": "collective.documentviewer-6.0.3.tar.gz",
            "has_sig": false,
            "md5_digest": "6d4a927be4a11ade7073d5f683b7163c",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=2.7,!=3.0.*,!=3.1.*,!=3.2.*,!=3.3.*,!=3.4.*,!=3.5.*",
            "size": 476321,
            "upload_time": "2023-05-17T21:58:56",
            "upload_time_iso_8601": "2023-05-17T21:58:56.651983Z",
            "url": "https://files.pythonhosted.org/packages/3f/9f/344f1bb922cc9b93cf3d37856763306d62361c5d575a7719eee26f69823b/collective.documentviewer-6.0.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-05-17 21:58:56",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "collective",
    "github_project": "collective.documentviewer",
    "travis_ci": true,
    "coveralls": false,
    "github_actions": false,
    "requirements": [],
    "lcname": "collective.documentviewer"
}
        
Elapsed time: 0.53645s