Many Meanings of Message Validation


Many information systems consist of a front-end user interface where users enter inputs and a back-end that processes the input data. This concept can be extended to information systems that send and receive messages. Regardless of whether the data come from user input or from other systems, those messages MUST be validated before going any further. There is always a chance that compromised messages are sent to a system, which is a real threat to it. If the system can't prevent those corrupted messages from being processed, it may go down entirely and cost the organisation business opportunities.

How can we implement message validation in a system? More specifically, what does "validation" even mean? Throughout this post, I'm going to discuss several viewpoints on "message validation".

Message Body/Payload Validation

Let's talk about the message payload (or body). Assume there is an online pizza ordering system. As I love pineapple toppings, I'll place an order for a large Hawaiian pan pizza, as well as a bottle of sparkling water. Here are my order items:

Hawaiian Pizza, Large, 1
Sparkling Water, Medium, 1

Hawaiian Pizza

My order is serialised as a JSON object and transmitted to the system. Here's a rough JSON request object. I intentionally omitted the payment and delivery details as they are not necessary for this discussion.

https://gist.github.com/justinyoo/c5cd857042083f4c84bff28e4a7899e9?file=order.json

This JSON object represents a message and is sent to the system for processing. Before processing, the system MUST validate the payload. For example:

orderId: This field MUST be numeric.
itemId: This field MUST be a string with the format of category-subcategory-size.
amount: This field MUST be less than or equal to 100.

Therefore, message payload validation MUST define a rule for each field to check. If any check fails, the system SHOULD reject the message or hand it over to an exception handling process.
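To make the three rules above concrete, here is a minimal, language-neutral sketch in Python using only the standard library. The field names `orderId`, `itemId` and `amount` come from the rules above; the `items` array, the exact regular expression for "category-subcategory-size" and the sample item IDs are assumptions for illustration, not the actual shape of the request object.

```python
import re

# Assumed pattern for "category-subcategory-size", e.g. "pizza-hawaiian-large".
ITEM_ID_PATTERN = re.compile(r"^[a-z]+-[a-z]+-[a-z]+$")
MAX_AMOUNT = 100


def is_number(value):
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool)


def validate_order(order):
    """Return a list of validation errors; an empty list means the payload is valid."""
    errors = []

    if not is_number(order.get("orderId")):
        errors.append("orderId must be numeric")

    # "items" is a hypothetical field name holding the order lines.
    for index, item in enumerate(order.get("items", [])):
        item_id = item.get("itemId")
        if not isinstance(item_id, str) or not ITEM_ID_PATTERN.match(item_id):
            errors.append(f"items[{index}].itemId must look like category-subcategory-size")

        amount = item.get("amount")
        if not is_number(amount) or amount > MAX_AMOUNT:
            errors.append(f"items[{index}].amount must be a number no greater than {MAX_AMOUNT}")

    return errors


order = {
    "orderId": 1001,
    "items": [
        {"itemId": "pizza-hawaiian-large", "amount": 1},
        {"itemId": "drink-sparkling-medium", "amount": 1},
    ],
}
print(validate_order(order))  # prints []
```

Collecting all errors instead of stopping at the first one lets the system report every broken rule when it rejects the message.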

If you build a .NET-based application, there are a bunch of open-source libraries for data validation. FluentValidation is one of the most popular ones. Here's a sample code snippet using FluentValidation to check the message payload.

https://gist.github.com/justinyoo/c5cd857042083f4c84bff28e4a7899e9?file=fluent-validation.cs

You may have noticed that this sample code defines the OrderItem class first, which means we assume that we already know the message structure. What if a message comes in with a format we can't understand? What should we do in this case?

Message Structure Validation

Now, we're about to validate the message structure. Structure validation consists of two parts. One is to check the payload structure against a data contract or schema, and the other is to check the interface against a service contract, that is, whether the message arrives at an agreed endpoint or not.

Handshake on Both Parties

Both the sending and the receiving party MUST use a shared data contract or schema for communication. In other words, if the sender uses one format and the receiver expects another, the message will be ignored or left unprocessed. In addition to this, the sender MUST send messages through the agreed interface, including endpoint, protocol and method. Otherwise, the receiver cannot process the messages.

Which approaches are popular for message structure validation, then?
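As a rough illustration of this handshake, the Python sketch below checks the agreed interface first and the shared data contract second. The `CONTRACT` dictionary, the `/orders` endpoint and the field list are all hypothetical; real systems would express the contract in WSDL or a similar specification rather than in code.

```python
# Hypothetical contract that both parties have agreed on.
CONTRACT = {
    "endpoint": "/orders",
    "method": "POST",
    "fields": {"orderId": int, "items": list},
}


def conforms_to_contract(method, path, body, contract=CONTRACT):
    """Check the service contract (interface) first, then the data contract (structure)."""
    if method != contract["method"] or path != contract["endpoint"]:
        return False, "message did not arrive through the agreed interface"

    for name, expected_type in contract["fields"].items():
        if name not in body:
            return False, f"missing field: {name}"
        if not isinstance(body[name], expected_type):
            return False, f"field {name} has an unexpected type"

    return True, "ok"


print(conforms_to_contract("POST", "/orders", {"orderId": 1001, "items": []}))
print(conforms_to_contract("POST", "/orders", {"order_number": "1001"}))
```

The second call fails because the sender used a field name the receiver does not expect, which is exactly the mismatch a shared contract is meant to surface.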

WSDL

XML Web Services over SOAP utilise WSDL. According to the WSDL spec, a WSDL document defines both the service contract (interface) and the data contract (types). Here is a very simplified WSDL document based on the WSDL 2.0 spec.

https://gist.github.com/justinyoo/c5cd857042083f4c84bff28e4a7899e9?file=wsdl.xml

As WSDL defines the service contract and the data contract like above, both parties sending and receiving messages MUST conform to the contract to communicate with each other. Messages outside the contract are not accepted. Based on this contract, we can also easily generate an SDK; dotnet-svcutil is a good example of a tool for this.

Open API

Unlike legacy systems that mainly use XML Web Services with SOAP, newer systems have widely adopted REST APIs (RESTful Web APIs, to be precise) for message transmission. Open API is nowadays a de-facto standard for defining such services. Similar to WSDL, the Open API spec version 3.0.2 defines the service contract through Paths and the data contract through Schema. For SDK generation, AutoRest is a great tool.

https://gist.github.com/justinyoo/c5cd857042083f4c84bff28e4a7899e9?file=openapi.yaml

Now, either WSDL or Open API lets systems communicate with each other by validating message structures.

At this point, we might face another issue. Systems that communicate through either WSDL or Open API need to do so synchronously. Of course, the receiving party can internally process the message in an async way, but at the very least the receiving end MUST synchronously return a response saying that the message has been accepted. For example, HTTP status codes like 201 (Created) or 202 (Accepted) SHOULD be returned.

In other words, as both systems depend on each other, if either side is temporarily unavailable, messages cannot be handled, and there is no way to validate the message structure in that situation. On top of that, once the contract is established between systems, changing it is hard: accommodating a contract change becomes really expensive.
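The synchronous acknowledgement described above can be sketched in a few lines of Python. This is only an illustration under assumptions: `receive` is a hypothetical handler, the `orderId` check stands in for real structure validation, and an in-process queue stands in for whatever the receiver uses to defer work.

```python
import queue

# Work accepted now and processed asynchronously later.
pending = queue.Queue()


def receive(message):
    """Acknowledge synchronously; defer the actual processing."""
    if not isinstance(message, dict) or "orderId" not in message:
        return 400  # Bad Request: rejected before any processing
    pending.put(message)
    return 202  # Accepted: the message will be processed later
```

Even in this tiny form, the sender only gets an answer if the receiver is up and running at the moment of the call, which is the coupling discussed above.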

Schema Registry

Many attempts and patterns have been introduced to reduce the dependency between systems during message transmission. The Publisher/Subscriber Pattern is one of those patterns, and it's a great way of exchanging messages on the cloud. Instead of sending and receiving messages between systems in real time, a message broker is placed in the middle. The publisher (message sender) sends messages to the broker, and the subscriber (message receiver) picks up the messages from the broker. Both publisher and subscriber become completely decoupled and work asynchronously.

The message broker can even work with multiple publishers and subscribers at the same time. It also accepts all messages without validating them, as long as they meet the minimum requirements that the broker expects. In other words, message validation is solely the responsibility of publishers and subscribers. Let's have a look at the diagram below. It's an over-simplified architecture implementing the pub/sub pattern using Azure Logic Apps and Service Bus.

Diagram Implementing Pub/Sub Pattern with Azure Logic Apps and Service Bus

The blue arrows indicate the direction of message flow.

A message coming from the source system passes through the publisher Logic App and is stored in Service Bus.
The subscriber Logic App picks up the message and transfers it to the target system.
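The flow above can be imitated with a toy in-memory broker in Python. The `Broker` class is a hypothetical stand-in, not the Service Bus API; it only illustrates that a broker stores whatever it receives without looking at the message structure.

```python
from collections import deque


class Broker:
    """Toy in-memory stand-in for a message broker."""

    def __init__(self):
        self._messages = deque()

    def publish(self, message):
        # The broker stores whatever it is given; it does not inspect the structure.
        self._messages.append(message)

    def receive(self):
        # Returns None when there is nothing to pick up.
        return self._messages.popleft() if self._messages else None


broker = Broker()
broker.publish({"orderId": 1001})         # the structure a subscriber might expect
broker.publish({"order_number": "1001"})  # a different structure, accepted all the same

first = broker.receive()
second = broker.receive()
```

Both messages reach the subscriber in the order they were published, regardless of whether the subscriber can understand them.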

The pattern itself works perfectly fine. However, there are a few questions:

Are we really sure that the messages from the publisher Logic App have the same structure that the subscriber Logic App would expect?
Is there a systematic way to validate message structure between publisher and subscriber?

With this architecture alone, we can't answer either question.

Therefore, the ecosystem around event brokers like Apache Kafka has introduced a Schema Registry to address this concern. The Schema Registry runs separately, outside the Kafka cluster. When an event producer sends events, it checks the registry to validate them against the schema. The same thing happens on the event consumer side: when the consumer picks up messages from the broker, it validates them against the schema from the registry before further processing.

With a similar approach, we can implement a Schema Registry for Azure Service Bus. Let's have a look at the diagram below. It's basically the same pattern as above, but it adds an Azure Storage instance as the schema registry and Azure Functions to perform validation.
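Here is a minimal Python sketch of that idea, where both sides validate against a schema looked up from a shared registry. Everything in it is an assumption for illustration: `SchemaRegistry` is an in-memory toy rather than a real registry client, the "schema" is just a mapping of field names to types, and a plain list plays the role of the broker topic.

```python
class SchemaRegistry:
    """Toy in-memory registry; a real one runs as a separate service."""

    def __init__(self):
        self._schemas = {}

    def register(self, name, version, schema):
        self._schemas[(name, version)] = schema

    def get(self, name, version):
        return self._schemas[(name, version)]


def is_valid(message, schema):
    """Check that every field in the schema is present with the expected type."""
    return all(
        field in message and isinstance(message[field], expected)
        for field, expected in schema.items()
    )


registry = SchemaRegistry()
registry.register("order", 1, {"orderId": int, "items": list})


def produce(topic, message):
    # The producer validates before the message ever reaches the broker.
    if not is_valid(message, registry.get("order", 1)):
        raise ValueError("message does not match the registered schema")
    topic.append(message)


def consume(topic):
    # The consumer validates again before further processing.
    message = topic.pop(0)
    if not is_valid(message, registry.get("order", 1)):
        raise ValueError("message does not match the registered schema")
    return message
```

Because producer and consumer only share the registry entry, neither needs to know anything about the other system.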

Diagram Implementing Pub/Sub Pattern with Azure Logic Apps and Service Bus, and Azure Storage and Function App

The blue arrows show the main message flow, as in the previous diagram. On top of them, there are orange and green ones.

Orange arrows send the message payload from either publisher or subscriber Logic App.
Green arrows pick up the message schema from the schema registry (Blob Storage).
Azure Function App validates the message payload against the schema.

If we use the schema registry for Azure Service Bus, the overall system architecture will have several improvements:

There are no more dependencies left between the systems on the publisher and subscriber sides. That means a change in one system won't affect the other at all.
This decoupling also removes the dependency on schema version changes. The systems themselves keep working as they are; only the Logic Apps workflows need to change.
Logic Apps don't implement the validation logic internally but delegate schema validation to the Function App.
No more service contract is required. Only schema validation is required.

So far, we have discussed several perspectives on message validation. On top of checking the validity of the message payload, the message schema MUST be validated as well. With WSDL or Open API, the contract takes care of message schema validation; for event-/message-driven architecture, we now use a schema registry.

These perspectives are not new at all. Rather, they should be considered whenever we design a system. In the next post, let's implement a schema registry for Azure Service Bus, using Azure Storage.