Paginated queries

Pagination in a web API is the mechanism for returning a partial answer to a query with many results, alongside an easy way for the client to retrieve the next batch of results. Implementing pagination for any query that may return a large number of results is considered best practice for web APIs. Most clients will only need the first batch or two of results, batching results reduces latency, and shorter replies are easier to manage and create less memory pressure on both the server and client.

There are numerous ways to manage pagination (see this blog post for a good introductory overview). Safir provides a generic implementation of keyset pagination with client-side cursors, which is often a reasonable choice.

Elements of a paginated query

A paginated query is a fairly complex operation that usually goes through multiple layers of data types.

First, your application will construct a SQL query that returns the full, unpaginated data set. This may use other query parameters or information from the client to filter the results in ways unrelated to pagination. This SQL query should not order or apply a LIMIT to the results; the order and limit will be applied when the paginated query is done.

Second, your application optionally may provide a limit on the number of results to return, a cursor indicating where in the overall list of results to pick up from, or both. If neither a limit nor a cursor is provided, the query is not paginated; you can still use the facilities discussed here for simplicity (and to avoid needing special cases for the non-paginated case), but pagination will not be done.

Third, your application passes the SQL query and any limit or cursor, along with a database session, into PaginatedQueryRunner to perform the query. This will apply the sort order and any restrictions from the limit or cursor and then execute the query in that session. It will return a PaginatedList, which holds the results along with pagination information.

Finally, the application will return, via the API handler, the list of entries included in the PaginatedList along with information about how to obtain the next or previous group of entries and, optionally, the total number of records. This pagination information is generally returned in HTTP headers, although if you wish to return it in a data structure wrapper around the results, you can do that instead.

Defining the cursor

To use the generic paginated query support, first you must define the cursor that will be used for the queries. The cursor class defines the following information needed for paginated queries:

  1. How to construct a cursor to get all entries before or after a given entry. Forward cursors must include the entry provided as a basis for the cursor, and reverse cursors must exclude it. The Pydantic model of the entries is a type parameter to the cursor.

  2. How to serialize to and deserialize from a string so that the cursor can be returned to an API client and sent back to retrieve the next batch of results.

  3. The sort order the cursor represents. A cursor class represents one and only one sort order, since keyset cursors rely on the sort order not changing. (Your application can use multiple cursors and thus support multiple sort orders, however.)

  4. How to apply the cursor to limit a SQL statement.

  5. How to invert a cursor (change it from going down the full list of results to going up the full list of results for previous links).

In the general case, your application must define the cursor by creating a subclass of PaginationCursor and implementing its abstract methods.

In the very common case that the API results are sorted first by some timestamp in descending order (most recent first) and then by an auto-increment unique key (most recently inserted row first), Safir provides DatetimeIdCursor, which is a generic cursor implementation that implements that ordering and keyset pagination policy. In this case, you need only subclass DatetimeIdCursor and provide the SQLAlchemy ORM model columns that correspond to the timestamp and the unique key.

For example, if you are requesting paginated results from a table whose ORM model is named Job, whose timestamp field is Job.creation_time, and whose unique key is Job.id, and using a Pydantic model named JobModel with the same field names, you can use the following cursor:

from safir.database import DatetimeIdCursor
from sqlalchemy.orm import InstrumentedAttribute


class JobCursor(DatetimeIdCursor[JobModel]):
    @staticmethod
    def id_column() -> InstrumentedAttribute:
        return Job.id

    @staticmethod
    def time_column() -> InstrumentedAttribute:
        return Job.creation_time

    @classmethod
    def from_entry(cls, entry: JobModel, *, reverse: bool = False) -> Self:
        return cls(id=entry.id, time=entry.creation_time, reverse=reverse)

(These are essentially class properties, but due to limitations in Python abstract data types and property decorators, they’re implemented as static methods.)

In this case, DatetimeIdCursor will handle all of the other details for you, including serialization and deserialization. The type parameter to DatetimeIdCursor must be the Pydantic model that the resulting paginated list will contain.

Performing paginated queries

Parse the cursor and limit

Handlers for routes that return paginated results should take an optional pagination cursor as a query parameter. This will be used by the client to move forward and backwards through the results.

The parameter declaration should generally look something like the following:

@router.get("/query", response_class=Model)
async def query(
    *,
    cursor: Annotated[
        str | None,
        Query(
            title="Pagination cursor",
            description="Cursor to navigate paginated results",
        ),
    ] = None,
    limit: Annotated[
        int,
        Query(
            title="Row limit",
            description="Maximum number of entries to return",
            examples=[100],
            ge=1,
            le=100,
        ),
    ] = 100,
    request: Request,
    response: Response,
) -> list[Model]:
    parsed_cursor = None
    if cursor:
        parsed_cursor = ModelCursor.from_str(cursor)
    ...

Unfortunately, due to limitations in FastAPI, you cannot annotate the cursor parameter with a validator that returns the appropriate object. You must instead parse the cursor in the body of the handler. from_str should raise InvalidCursorError on parse failure, which will be automatically converted into an HTTP 422 response if you use the error handler described in Handling HTTP errors with FastAPI.

By default, the exception raised by DatetimeIdCursor assumes the cursor is coming from a query parameter named cursor. If this is not true for your application and you are using a cursor derived from that class, you should either catch the exception, modify location and field_path as appropriate for your application, and then re-raise it or override the __str__ method to throw an exception with different metadata.

Note the limit parameter in the above code example, which should also be used on any paginated route. This sets the size of each block of results. As shown here, you will generally want to set some upper limit on how large the limit can be and set a default limit if none was provided. This ensures that clients cannot retrieve the full list of results with one query.

If the clients are sufficiently trusted or if you’re certain the application can handle returning the full list of objects without creating resource problems, you can allow limit to be omitted and default it to None. The paginated query support in Safir will treat that as an unlimited query and will return all of the available results. In this case, you should change the type to int | None and remove the le constraint on the parameter.

Create the runner

The first step of performing a paginated query is to create a PaginatedQueryRunner object. Its constructor takes as arguments the type of the Pydantic model that will hold each returned object and the type of the cursor that will be used for pagination.

runner = PaginatedQueryRunner(Job, JobCursor)

Construct the query

Then, define the SQL query as a SQLAlchemy Select statement. You can do this in two ways: either a query that returns a single SQLAlchemy ORM model, or a query for a list of specific columns. Other combinations are not supported.

For example:

stmt = select(Job).where(Job.username == "someuser")

Or, an example of selecting specific columns:

stmt = select(Job.id, Job.timestamp, Job.description)

Ensure that all of the attributes required to create a cursor are included in the query and in the Pydantic model.

In either case, the data returned by the query must be sufficient to construct the Pydantic model passed as the first argument to the PaginatedQueryRunner constructor. The query result will be passed into the model_validate method of that model. Among other things, this means that all necessary attributes must be present and the model must be able to handle any data conversion required.

If the model includes any timestamps, the model validation must be able to convert them from the time format stored in the database (see Handling datetimes in database tables) to an appropriate Python datetime. The easiest way to do this is to declare those fields as having the safir.pydantic.UtcDatetime type. See Normalizing datetime fields for more information.

This query will be run twice for any request that contains a cursor or a limit, once with those restrictions to get the data and again without restrictions to count the total number of rows. The count query therefore should be fast or you may see performance issues with paginated queries. In practice, this means the query should use indexed columns or only query small tables.

Run the query

Finally, you can run the query. There are two ways to do this depending on how the query is structured.

If the SQL query returns a single ORM model for each result row, use query_object:

results = await runner.query_object(
    session, stmt, cursor=cursor, limit=limit
)

If the SQL query returns a tuple of individually selected attributes that correspond to the fields of the result model (the first parameter to the PaginatedQueryRunner constructor), use query_row:

results = await runner.query_row(session, stmt, cursor=cursor, limit=limit)

Either way, the results will be a PaginatedList wrapping a list of Pydantic models of the appropriate type.

If you want to also return the total number of entries, run a separate COUNT query:

count = await runner.query_count(session, stmt)

This returns the total number of matching rows without regard to cursor or limit. Best practice is to return this information in the response so that clients can estimate the total number of result pages, but this query will only be fast if it can be satisfied from the table indices or the table is small, so it is not run by default.

Returning paginated results

Using the HTTP headers

HTTP provides the Link header (RFC 8288) to declare relationships between multiple web responses. Using a Link header with relation types first, next, and prev is a standard way of providing the client with pagination information.

The Safir PaginatedList type provides a method, link_header, which returns the contents of an HTTP Link header for a given paginated result. It takes as its argument the base URL for the query (usually the current URL of a route handler). This is the recommended way to return pagination information alongside a result.

Here is a very simplified example of a route handler that sets this header:

@router.get("/query", response_class=Model)
async def query(
    *,
    cursor: Annotated[
        str | None,
        Query(
            title="Pagination cursor",
            description="Cursor to navigate paginated results",
        ),
    ] = None,
    limit: Annotated[
        int,
        Query(
            title="Row limit",
            description="Maximum number of entries to return",
            examples=[100],
            ge=1,
            le=100,
        ),
    ] = 100,
    request: Request,
    response: Response,
) -> list[Model]:
    parsed_cursor = None
    if cursor:
        parsed_cursor = ModelCursor.from_str(cursor)
    runner = PaginatedQueryRunner(Model, ModelCursor)
    stmt = build_query(...)
    results = await runner.query_object(
        session, stmt, cursor=parsed_cursor, limit=limit
    )
    if cursor or limit:
        response.headers["Link"] = results.link_header(request.url)
        count = await runner.query_count(session, stmt)
        response.headers["X-Total-Count"] = str(count)
    return results.entries

Here, perform_query is a wrapper around PaginatedQueryRunner that constructs and runs the query. A real route handler would have more query parameters and more documentation.

Including result counts

The example above also sets a non-standard X-Total-Count header containing the total count of entries returned by the underlying query without pagination. query_count will return this information. There is no standard way to return this information to the client, but X-Total-Count is a widely-used informal standard.

If you will always want to include the count, use CountedPaginatedQueryRunner instead. Its query_object and query_row methods will return a CountedPaginatedList, which contains a count attribute holding the count. This is equivalent to calling query_object or query_object followed by query_count, but the encapsulation into a data structure makes it easier to pass the results between components of the service.

Here’s the same code above but using that approach:


@router.get("/query", response_class=Model)
async def query(
    *,
    cursor: Annotated[
        str | None,
        Query(
            title="Pagination cursor",
            description="Cursor to navigate paginated results",
        ),
    ] = None,
    limit: Annotated[
        int,
        Query(
            title="Row limit",
            description="Maximum number of entries to return",
            examples=[100],
            ge=1,
            le=100,
        ),
    ] = 100,
    request: Request,
    response: Response,
) -> list[Model]:
    parsed_cursor = None
    if cursor:
        parsed_cursor = ModelCursor.from_str(cursor)
    runner = CountedPaginatedQueryRunner(Model, ModelCursor)
    stmt = build_query(...)
    results = await runner.query_object(
        session, stmt, cursor=parsed_cursor, limit=limit
    )
    if cursor or limit:
        response.headers["Link"] = results.link_header(request.url)
        response.headers["X-Total-Count"] = str(results.count)
    return results.entries

Parsing paginated query responses

Safir provides PaginationLinkData to parse the contents of an RFC 8288 Link header and extract pagination links from it. This may be useful in clients of paginated query results, including tests of services that use the above approach to paginated queries.

from safir.database import PaginationLinkData


r = client.get("/some/url", query={"limit": 100})
links = PaginationLinkData.from_header(r.headers["Link"])
next_url = links.next_url
prev_url = links.prev_url
first_url = links.first_url

Currently, only the first, next, and previous URLs are extracted from the Link header. If any of these URLs are not present, the corresponding attribute of PaginationLinkData will be None.