Utilities for Pydantic models#
Several validation and configuration problems arise frequently with Pydantic models. Safir offers some utility functions to assist in solving them.
Normalizing datetime fields#
Pydantic supports several input formats for datetime
fields, but the resulting datetime
object may be timezone-naive.
Best practice for Python code is to only use timezone-aware datetime
objects in the UTC time zone.
Pydantic provides a utility function, normalize_datetime
, that can be used as a validator for a datetime
model field.
It ensures that any input is converted to UTC and is always timezone-aware.
Here’s an example of how to use it:
class Info(BaseModel):
last_used: Optional[datetime] = Field(
None,
title="Last used",
description="When last used in seconds since epoch",
example=1614986130,
)
_normalize_last_used = validator(
"last_used", allow_reuse=True, pre=True
)(normalize_datetime)
Multiple attributes can be listed as the initial arguments of validator
if there are multiple fields that need to be checked.
Accepting camel-case attributes#
Python prefers snake_case
for all object attributes, but some external sources of data (Kubernetes custom resources, YAML configuration files generated from Helm configuration) require or prefer camelCase
.
Thankfully, Pydantic supports converting from camel-case to snake-case on input using what Pydantic calls an “alias generator.”
Safir provides to_camel_case
, which can be used as that alias generator.
To use it, add a configuration block to any Pydantic model that has snake-case attributes but needs to accept them in camel-case form:
class Model(BaseModel):
some_field: str
class Config:
alias_generator = to_camel_case
allow_population_by_field_name = True
By default, only the generated aliases (so, in this case, only the camel-case form of the attribute, someField
) are supported.
The additional setting allow_population_by_field_name
, tells Pydantic to allow either some_field
or someField
in the input.
As a convenience, you can instead inherit from CamelCaseModel
, which is a derived class of BaseModel
with those settings added.
This is somewhat less obvious when reading the classes and thus less self-documenting, but is less tedious if you have numerous models that need to support camel-case.
CamelCaseModel
also overrides dict
and json
to change the default of by_alias
to True
so that this model exports in camel-case by default.
Requiring exactly one of a list of attributes#
Occasionally, you will have reason to write a model with several attributes, where one and only one of those attributes may be set. For example:
class Model(BaseModel):
docker: Optional[DockerConfig] = None
ghcr: Optional[GHCRConfig] = None
The intent here is that only one of those two configurations will be present: either Docker or GitHub Container Registry. However, Pydantic has no native way to express that, and the above model will accept input where neither or both of those attributes are set.
Safir provides a function, validate_exactly_one_of
, designed for this case.
It takes a list of fields, of which exactly one must be set, and builds a validation function that checks this property of the model.
So, in the above example, the full class would be:
class Model(BaseModel):
docker: Optional[DockerConfig] = None
ghcr: Optional[GHCRConfig] = None
_validate_type = validator("ghcr", always=True, allow_reuse=True)(
validate_exactly_one_of("docker", "ghcr")
)
Note the syntax, which is a little odd since it is calling a decorator on the results of a function builder.
The argument to validator
must always be the last of the possible attributes that may be set, ensuring that any other attributes have been seen when the validator runs.
always=True
must be set to ensure the validator runs regardless of which attribute is set.
allow_reuse=True
must be set due to limitations in Pydantic.