Making 1 Million Requests With Python-Aiohttp
Making 1 Million Requests With Python-Aiohttp
io
In this post I’d like to test limits of python aiohttp and check its performance in terms of requests
per minute. Everyone knows that asynchronous code performs better when applied to network
operations, but it’s still interesting to check this assumption and understand how exactly it is
better and why it’s is better. I’m going to check it by trying to make 1 million requests with
aiohttp client. How many requests per minute will aiohttp make? What kind of exceptions and
crashes can you expect when you try to make such volume of requests with very primitive
scripts? What are main gotchas that you need to think about when trying to make such volume of
requests?
Hello asyncio/aiohttp
Async programming is not easy. It’s not easy because using callbacks and thinking in terms of
events and event handlers requires more effort than usual synchronous programming. But it is
also difficult because asyncio is still relatively new and there are few blog posts, tutorials about
it. Official docs are very terse and contain only basic examples. There are some Stack Overflow
questions but not that many only 410 as of time of writing (compare with 2 585 questions tagged
“twisted”) There are couple of nice blog posts and articles about asyncio over there such as this,
that, that or perhaps even this or this.
To make it easier let’s start with the basics - simple HTTP hello world - just making GET and
fetching one single HTTP response.
import requests
def hello():
    return requests.get("http://httpbin.org/get")
print(hello())
#!/usr/local/bin/python3.11.2
import asyncio
from aiohttp import ClientSession
async def hello(url: str):
    async with ClientSession() as session:
        async with session.get(url) as response:
            response = await response.read()
            print(response)
asyncio.run(hello("http://httpbin.org/headers"))
hmm looks like I had to write lots of code for such a basic task… There is “async def” and
“async with” and two “awaits” here. It seems really confusing at first sight, let’s try to explain it
then.
You make your function asynchronous by using async keyword before function definition and
using await keyword. There are actually two asynchronous operations that our hello() function
performs. First it fetches response asynchronously, then it reads response body in asynchronous
manner.
After you open client session you can use it to make requests. This is where another
asynchronous operation starts, downloading request. Just as in case of client sessions responses
must be closed explicitly, and context manager’s with statement ensures it will be closed
properly in all circumstances.
It all does sound bit difficult but it’s not that complex and looks logical if you spend some time
trying to understand it.
This is really quick and easy, async will not be that easy, so you should always consider if
something more complex is actually necessary for your needs. If your app works nice with
synchronous code maybe there is no need to bother with async code? If you do need to bother
with async code here’s how you do that. Our hello() async function stays the same but we need
to wrap it in asyncio TaskGroup object and pass whole lists of Future objects as tasks to be
executed in the loop.
import asyncio
from aiohttp import ClientSession
asyncio.run(main())
Now let’s say we want to collect all responses in one list and do some postprocessing on them.
At the moment we’re not keeping response body anywhere, we just print it, let’s keep response
in the list, and print all responses at the end as JSON.
To collect several responses we will use asyncio Queue. Result of each download will be stored
inside queue, at the end of processing results will be printed as JSON.
import asyncio
from aiohttp import ClientSession
import json
print(json.dumps(results))
asyncio.run(main())
Common gotchas
Now let’s simulate real process of learning and let’s make mistake in above script and try to
debug it, this should be really helpful for demonstration purposes.
This code is broken, but it’s not that easy to figure out why if you dont know much about
asyncio. Even if you know Python well but you dont know asyncio or aiohttp well you’ll be in
trouble to figure out what happens.
What happens here? You expected to get response objects after all processing is done, but here
you actually get bunch of generators, why is that?
It happens because as I’ve mentioned earlier response.read() is async operation, this means
that it does not return result immediately, it just returns generator. This generator still needs to be
called and executed, and this does not happen by default, yield from in Python 3.4 and await
in Python 3.5 were added exactly for this purpose: to actually iterate over generator function. Fix
to above error is just adding await before response.read().
Sync vs Async
Finally time for some fun. Let’s check if async is really worth the hassle. What’s the difference
in efficiency between asynchronous client and blocking client? How many requests per minute
can I send with my async client?
With this questions in mind I set up simple (async) aiohttp server. My server is going to read full
html text of Frankenstein by Marry Shelley. It will add random delays between responses. Some
responses will have zero delay, and some will have maximum of 3 seconds delay. This should
resemble real applications, few apps respond to all requests with same latency, usually latency
differs from response to response.
#!/usr/local/bin/python3.5
import asyncio
from datetime import datetime
from aiohttp import web
import random
# set seed to ensure async and sync client get same distribution of delay
values
# and tests are fair
random.seed(1)
app = web.Application()
app.add_routes([web.get("/{name}", hello)])
web.run_app(app, port=8000)
import requests
r = 100
url = "http://localhost:8000/{}"
for i in range(r):
    res = requests.get(url.format(i))
    delay = res.headers.get("DELAY")
    d = res.headers.get("DATE")
    print("{}:{} delay {}".format(d, res.url, delay))
My async code looks just like above code samples above. How long will async client take?
It is interesting that it took exactly as long as longest delay from my server. If you look into
messages printed by client script you can see how great async HTTP client is. Some responses
had 0 delay but others got 3 seconds delay. In synchronous client they would be blocking and
waiting, your machine would simply stay idle for this time. Async client does not waste time,
when something is delayed it simply does something else, issues other requests or processes all
other responses. You can see this clearly in logs, first there are responses with 0 delay, then after
they arrrived you can see responses with 1 seconds delay, and so on until most delayed responses
arrive.
I’m going to start with sending 1k async requests. I’m curious how many requests my client can
handle.
So 1k requests take 7 seconds, pretty nice! How about 10k? Trying to make 10k requests
unfortunately fails…
    # Create client session that will ensure we dont open new connection
    # per each request.
    async with ClientSession() as session:
        for i in range(r):
            # pass Semaphore and session to every GET request
            task = asyncio.ensure_future(bound_fetch(sem, url.format(i),
session))
            tasks.append(task)
          responses = asyncio.gather(*tasks)
          await responses
number = 10000
asyncio.run(run(number))
At this point I can process 10k urls. It takes 23 seconds and returns some exceptions but overall
it’s pretty nice!
How about 100 000? This really makes my computer work hard but suprisingly it works ok.
Server turns out to be suprisingly stable although you can see that ram usage gets pretty high at
this point, cpu usage is around 100% all the time. What I find interesting is that my server takes
significantly less cpu than client. Here’s snapshot of linux ps output.
USER       PID %CPU %MEM            VSZ   RSS TTY           STAT START       TIME COMMAND
pawel     2447 56.3 1.0          216124 64976 pts/9         Sl+ 21:26        1:27
/usr/local/bin/python3.5         ./test_server.py
pawel     2527 101 3.5           674732 212076 pts/0        Rl+    21:26     2:30
/usr/local/bin/python3.5         ./bench.py
Finally I’m going to try 1 million requests. I really hope my laptop is not going to explode when
testing that.
Epilogue
You can see that asynchronous HTTP clients can be pretty powerful. Performing 1 million
requests from async client is not difficult, and the client performs really well in comparison to
synchronous code.
I wonder how it compares to other languages and async frameworks? Perhaps in some future
post I could compare Twisted Treq with aiohttp. There is also question how many concurrent
requests can be issued by async libraries in other languages. E.g. what would be results of
benchmarks for some Java async frameworks? Or C++ frameworks? Or some Rust HTTP
clients?
EDITS (24/04/2016)
EDITS (10/09/2016)
Earlier version of this post contained problematic usage of ClientSession that caused client to
crash. You can find this older version of article here. For more details about this issue see this
GitHub ticket.
EDITS (08/11/2016)
EDITS (14/04/2023)
Updated code to use more modern asyncio APIs (TaskGroups, asyncio.run() etc)