aiodataloader-ng
================

:Name: aiodataloader-ng
:Version: 0.2.1
:Home page: https://github.com/lablup/aiodataloader-ng
:Summary: Asyncio DataLoader implementation for Python
:Upload time: 2022-01-22 07:23:19
:Author: Syrus Akbary, Joongi Kim and other contributors
:License: MIT
:Keywords: concurrent, future, deferred, aiodataloader
Asyncio DataLoader
==================

DataLoader is a generic utility to be used as part of your application's
data fetching layer to provide a simplified and consistent API over
various remote data sources such as databases or web services via
batching and caching.

|Build Status| |Coverage Status|

A port of the "Loader" API originally developed by @schrockn at
Facebook in 2010 as a simplifying force to coalesce the sundry key-value
store back-end APIs which existed at the time. At Facebook, "Loader"
became one of the implementation details of the "Ent" framework, a
privacy-aware data entity loading and caching layer within web server
product code. This ultimately became the underpinning for Facebook's
GraphQL server implementation and type definitions.

DataLoader is a simplified version of this original idea implemented in
Python for AsyncIO services. DataLoader is often used when implementing
a `graphene <https://github.com/graphql-python/graphene>`__ service,
though it is also broadly useful in other situations.

DataLoader is provided so that it may be useful not just to build
GraphQL services with AsyncIO but also as a publicly available reference
implementation of this concept in the hopes that it can be ported to
other languages. If you port DataLoader to another language, please open
an issue to include a link from this repository.

Getting Started
---------------

First, install DataLoader using pip.

.. code:: sh

    pip install aiodataloader-ng

To get started, create a ``DataLoader``. Each ``DataLoader`` instance
represents a unique cache. Typically instances are created per request
when used within a web server like
`Sanic <https://sanic.readthedocs.io/en/latest/>`__ if different users
can see different things.

    Note: DataLoader assumes an AsyncIO environment with ``async``/``await``
    support, which is available in Python 3.5+.

Batching
--------

Batching is not an advanced feature; it is DataLoader's primary feature.
Create loaders by providing a batch loading function.

.. code:: python

    from aiodataloader import DataLoader

    class UserLoader(DataLoader):
        async def batch_load_fn(self, keys):
            return await my_batch_get_users(keys)

    user_loader = UserLoader()

A batch loading function accepts an Iterable of keys and returns a
Future which resolves to a List of values\ `\* <#batch-function>`__.

Then load individual values from the loader. DataLoader will coalesce
all individual loads which occur within a single frame of execution (a
single tick of the event loop) and then call your batch function with
all requested keys.

.. code:: python

    user1_future = user_loader.load(1)
    user2_future = user_loader.load(2)

    user1 = await user1_future
    user2 = await user2_future

    user1_invitedby = user_loader.load(user1.invited_by_id)
    user2_invitedby = user_loader.load(user2.invited_by_id)

    print("User 1 was invited by", await user1_invitedby)
    print("User 2 was invited by", await user2_invitedby)

A naive application may have issued four round-trips to a backend for
the required information, but with DataLoader this application will make
at most two.

DataLoader allows you to decouple unrelated parts of your application
without sacrificing the performance of batch data-loading. While the
loader presents an API that loads individual values, all concurrent
requests will be coalesced and presented to your batch loading function.
This allows your application to safely distribute data-fetching
requirements throughout your code while keeping outgoing data requests
to a minimum.
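
For example, a minimal sketch (reusing the ``UserLoader`` defined above)
showing that two loads issued in the same tick of the event loop reach
the batch function as a single call:

.. code:: python

    import asyncio

    async def main():
        user_loader = UserLoader()
        # Both loads are scheduled within the same tick of the event loop,
        # so batch_load_fn receives [1, 2] in a single invocation.
        user1, user2 = await asyncio.gather(
            user_loader.load(1),
            user_loader.load(2),
        )

    asyncio.run(main())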

Batch Function
~~~~~~~~~~~~~~

A batch loading function accepts a List of keys and returns a Future
which resolves to a List of values. There are a few constraints that
must be upheld:

-  The List of values must be the same length as the List of keys.
-  Each index in the List of values must correspond to the same index in
   the List of keys.

For example, if your batch function was provided the List of keys:
``[ 2, 9, 6, 1 ]``, and loading from a back-end service returned the
values:

.. code:: python

    { 'id': 9, 'name': 'Chicago' }
    { 'id': 1, 'name': 'New York' }
    { 'id': 2, 'name': 'San Francisco' }

Our back-end service returned results in a different order than we
requested, likely because it was more efficient for it to do so. Also,
it omitted a result for key ``6``, which we can interpret as no value
existing for that key.

To uphold the constraints of the batch function, it must return a List
of values of the same length as the List of keys, re-ordered to ensure
each index aligns with the original keys ``[ 2, 9, 6, 1 ]``:

.. code:: python

    [
      { 'id': 2, 'name': 'San Francisco' },
      { 'id': 9, 'name': 'Chicago' },
      None,
      { 'id': 1, 'name': 'New York' }
    ]
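
A minimal sketch of such a re-ordering batch function, assuming a
hypothetical ``fetch_cities`` back-end call that may return rows in any
order and omit missing keys:

.. code:: python

    async def batch_load_cities(keys):
        rows = await fetch_cities(keys)  # hypothetical back-end call
        row_by_id = {row['id']: row for row in rows}
        # Align values with the requested key order; None marks a miss.
        return [row_by_id.get(key) for key in keys]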

Caching
-------

DataLoader provides a memoization cache for all loads which occur in a
single request to your application. After ``.load()`` is called once
with a given key, the resulting value is cached to eliminate redundant
loads.

In addition to relieving pressure on your data storage, caching results
per-request also creates fewer objects which may relieve memory pressure
on your application:

.. code:: python

    user_future1 = user_loader.load(1)
    user_future2 = user_loader.load(1)

    assert user_future1 is user_future2  # the same Future object is returned

Caching per-Request
~~~~~~~~~~~~~~~~~~~

DataLoader caching *does not* replace Redis, Memcache, or any other
shared application-level cache. DataLoader is first and foremost a data
loading mechanism, and its cache only serves the purpose of not
repeatedly loading the same data in the context of a single request to
your application. To do this, it maintains a simple in-memory
memoization cache (more accurately: ``.load()`` is a memoized function).

Avoid letting multiple requests from different users share the same
DataLoader instance, which could result in cached data incorrectly
appearing in each request. Typically, DataLoader instances are created
when a request begins, and are not used once the request ends.

For example, when used with
`Sanic <https://sanic.readthedocs.io/en/latest/>`__:

.. code:: python

    def create_loaders(auth_token):
        # Construct fresh per-request loaders; auth_token could
        # parameterize them.
        return {
            'users': UserLoader(),
        }


    app = Sanic(__name__)

    @app.route("/")
    async def test(request):
        auth_token = authenticate_user(request)
        loaders = create_loaders(auth_token)
        return render_page(request, loaders)

Clearing Cache
~~~~~~~~~~~~~~

In certain uncommon cases, clearing the request cache may be necessary.

The most common case in which clearing the loader's cache is necessary
is after a mutation or update within the same request, when a cached
value could be stale and future loads should not use it.

Here's a simple example using SQL UPDATE to illustrate.

.. code:: python

    # Request begins...
    user_loader = ...

    # And a value happens to be loaded (and cached).
    user4 = await user_loader.load(4)

    # A mutation occurs, invalidating what might be in cache.
    await sql_run('UPDATE users SET username="zuck" WHERE id=4')
    user_loader.clear(4)

    # Later the value is loaded again, so the mutated data appears.
    user4 = await user_loader.load(4)

    # Request completes.

Caching Exceptions
~~~~~~~~~~~~~~~~~~

If a batch load fails (that is, the batch function raises an exception
or returns a failed Future), then the requested values will not be
cached. However, if a batch function returns an ``Exception`` instance
for an individual value, that ``Exception`` will be cached to avoid
frequently loading the same ``Exception``.
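
A minimal sketch of this pattern, assuming a hypothetical
``fetch_users`` back-end call:

.. code:: python

    async def batch_load_users(keys):
        rows = await fetch_users(keys)  # hypothetical back-end call
        user_by_id = {user.id: user for user in rows}
        # Return (do not raise) an Exception instance for per-key
        # failures, so the failure is cached per key instead of
        # failing the whole batch.
        return [
            user_by_id[key] if key in user_by_id else KeyError(key)
            for key in keys
        ]

    user_loader = DataLoader(batch_load_users)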

In some circumstances you may wish to clear the cache for these
individual exceptions:

.. code:: python

    try:
        await user_loader.load(1)
    except Exception:
        user_loader.clear(1)
        raise

Disabling Cache
~~~~~~~~~~~~~~~

In certain uncommon cases, a DataLoader which *does not* cache may be
desirable. Calling ``DataLoader(batch_fn, cache=False)`` will ensure
that every call to ``.load()`` produces a *new* Future, and requested
keys will not be saved in memory.

However, when the memoization cache is disabled, your batch function
will receive a list of keys which may contain duplicates! Each key
corresponds to one call to ``.load()``, and your batch loader should
provide a value for each instance of the requested key.

For example:

.. code:: python

    class MyLoader(DataLoader):
        cache = False
        async def batch_load_fn(self, keys):
            print(keys)
            return keys

    my_loader = MyLoader()

    my_loader.load('A')
    my_loader.load('B')
    my_loader.load('A')

    # > [ 'A', 'B', 'A' ]

More complex cache behavior can be achieved by calling ``.clear()`` or
``.clear_all()`` rather than disabling the cache completely. For
example, this DataLoader will provide unique keys to a batch function
due to the memoization cache being enabled, but will immediately clear
its cache when the batch function is called so later requests will load
new values.

.. code:: python

    class MyLoader(DataLoader):
        async def batch_load_fn(self, keys):
            self.clear_all()
            return keys

API
---

class DataLoader
~~~~~~~~~~~~~~~~

DataLoader creates a public API for loading data from a particular data
back-end with unique keys such as the ``id`` column of a SQL table or
document name in a MongoDB database, given a batch loading function.

Each ``DataLoader`` instance contains a unique memoized cache. Use
caution when used in long-lived applications or those which serve many
users with different access permissions and consider creating a new
instance per web request.

``DataLoader(batch_load_fn, **options)``
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Create a new ``DataLoader`` given a batch loading function and options.

-  *batch\_load\_fn*: An async function (coroutine) which accepts a
   List of keys and returns a Future which resolves to a List of
   values.

-  *options*:

-  *batch*: Default ``True``. Set to ``False`` to disable batching,
   instead immediately invoking ``batch_load_fn`` with a single load
   key.

-  *max\_batch\_size*: Default: unlimited. Limits the number of items
   that get passed in to the ``batch_load_fn``.

-  *cache*: Default ``True``. Set to ``False`` to disable memoization
   caching, instead creating a new Future and passing the key to the
   ``batch_load_fn`` again for every load of the same key.

-  *cache\_key\_fn*: A function to produce a cache key for a given load
   key. Defaults to the identity function (``lambda key: key``). Useful
   when Python objects are keys and two similarly shaped objects should
   be considered equivalent.

-  *cache\_map*: An instance of
   `dict <https://docs.python.org/3/tutorial/datastructures.html#dictionaries>`__
   (or an object with a similar API) to be used as the underlying cache
   for this loader. Default ``{}``.
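
A minimal constructor sketch, where ``my_batch_fn`` is a placeholder
batch loading coroutine:

.. code:: python

    async def my_batch_fn(keys):
        # Placeholder: a real implementation would hit a backend.
        return keys

    loader = DataLoader(my_batch_fn, max_batch_size=100)  # cap batch size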

``load(key)``
^^^^^^^^^^^^^

Loads a key, returning a ``Future`` for the value represented by that
key.

-  *key*: A key value to load.

``load_many(keys)``
^^^^^^^^^^^^^^^^^^^

Loads multiple keys, returning a Future which resolves to a list of
values:

.. code:: python

    a, b = await my_loader.load_many(['a', 'b'])

This is equivalent to the more verbose:

.. code:: python

    from asyncio import gather
    a, b = await gather(
        my_loader.load('a'),
        my_loader.load('b')
    )

-  *keys*: A list of key values to load.

``clear(key)``
^^^^^^^^^^^^^^

Clears the value at ``key`` from the cache, if it exists. Returns itself
for method chaining.

-  *key*: A key value to clear.

``clear_all()``
^^^^^^^^^^^^^^^

Clears the entire cache. To be used when some event results in unknown
invalidations across this particular ``DataLoader``. Returns itself for
method chaining.

``prime(key, value)``
^^^^^^^^^^^^^^^^^^^^^

Primes the cache with the provided key and value. If the key already
exists, no change is made. (To forcefully prime the cache, clear the key
first with ``loader.clear(key).prime(key, value)``.) Returns itself for
method chaining.
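
For example, a brief sketch where ``cached_user`` and ``fresh_user``
are placeholder values:

.. code:: python

    # Seed the cache so a later load(4) resolves without a backend call.
    user_loader.prime(4, cached_user)

    # prime() is a no-op for keys already present; clear first to overwrite.
    user_loader.clear(4).prime(4, fresh_user)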

Using with GraphQL
------------------

DataLoader pairs nicely with
`GraphQL <https://github.com/graphql-python/graphene>`__. GraphQL fields
are designed to be stand-alone functions. Without a caching or batching
mechanism, it's easy for a naive GraphQL server to issue new database
requests each time a field is resolved.

Consider the following GraphQL request:

::

    {
      me {
        name
        bestFriend {
          name
        }
        friends(first: 5) {
          name
          bestFriend {
            name
          }
        }
      }
    }

Naively, if ``me``, ``bestFriend`` and ``friends`` each need to request
the backend, there could be at most 13 database requests!

When using DataLoader, we could define the ``User`` type using the
`SQLite <examples/SQL.md>`__ example with clearer code and at most 4
database requests, and possibly fewer if there are cache hits.

.. code:: python

    class User(graphene.ObjectType):
        name = graphene.String()
        best_friend = graphene.Field(lambda: User)
        friends = graphene.List(lambda: User)

        def resolve_best_friend(self, args, context, info):
            return user_loader.load(self.best_friend_id)

        def resolve_friends(self, args, context, info):
            return user_loader.load_many(self.friend_ids)

Common Patterns
---------------

Creating a new DataLoader per request.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

In many applications, a web server using DataLoader serves requests to
many different users with different access permissions. It may be
dangerous to use one cache across many users, so creating a new
DataLoader per request is encouraged:

.. code:: python

    def create_loaders(auth_token):
        return {
            'users': DataLoader(lambda ids: gen_users(auth_token, ids)),
            'cdn_urls': DataLoader(lambda raw_urls: gen_cdn_urls(auth_token, raw_urls)),
            'stories': DataLoader(lambda keys: gen_stories(auth_token, keys)),
        }

    # When handling an incoming web request:
    loaders = create_loaders(request.query.auth_token)

    # Then, within application logic:
    user = await loaders['users'].load(4)
    pic = await loaders['cdn_urls'].load(user.raw_pic_url)

Creating an object where each key is a ``DataLoader`` is one common
pattern which provides a single value to pass around to code which
needs to perform data loading, such as part of the ``root_value`` in a
GraphQL request.

Loading by alternative keys.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Occasionally, some kind of value can be accessed in multiple ways. For
example, perhaps a "User" type can be loaded not only by an "id" but
also by a "username" value. If the same user is loaded by both keys,
then it may be useful to fill both caches when a user is loaded from
either source:

.. code:: python

    async def user_by_id_batch_fn(ids):
        users = await gen_users_by_id(ids)
        for user in users:
            username_loader.prime(user.username, user)
        return users

    user_by_id_loader = DataLoader(user_by_id_batch_fn)

    async def username_batch_fn(names):
        users = await gen_usernames(names)
        for user in users:
            user_by_id_loader.prime(user.id, user)
        return users

    username_loader = DataLoader(username_batch_fn)

Custom Caches
-------------

DataLoader can optionally be provided a custom dict instance to use as
its memoization cache. More specifically, any object that implements the
methods ``get()``, ``set()``, ``delete()`` and ``clear()`` can be
provided. This allows for custom dicts which implement various `cache
algorithms <https://en.wikipedia.org/wiki/Cache_algorithms>`__ to be
provided. By default, DataLoader uses the standard
`dict <https://docs.python.org/3/tutorial/datastructures.html#dictionaries>`__
which simply grows until the DataLoader is released. The default is
appropriate when requests to your application are short-lived.
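
A minimal sketch of a custom cache following the interface described
above, evicting the oldest entry once a size bound is reached
(``my_batch_fn`` is a placeholder batch loading coroutine):

.. code:: python

    from collections import OrderedDict

    class BoundedCache:
        # A dict-like cache exposing get/set/delete/clear, bounded in size.
        def __init__(self, max_size=256):
            self._data = OrderedDict()
            self._max_size = max_size

        def get(self, key, default=None):
            return self._data.get(key, default)

        def set(self, key, value):
            self._data[key] = value
            if len(self._data) > self._max_size:
                self._data.popitem(last=False)  # evict the oldest entry

        def delete(self, key):
            self._data.pop(key, None)

        def clear(self):
            self._data.clear()

    loader = DataLoader(my_batch_fn, cache_map=BoundedCache())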

Video Source Code Walkthrough
-----------------------------

**DataLoader Source Code Walkthrough (YouTube):**

.. |Build Status| image:: https://travis-ci.org/lablup/aiodataloader-ng.svg
   :target: https://travis-ci.org/lablup/aiodataloader-ng
.. |Coverage Status| image:: https://coveralls.io/repos/lablup/aiodataloader-ng/badge.svg?branch=master&service=github
   :target: https://coveralls.io/github/lablup/aiodataloader-ng?branch=master



            
