Deserialization with System.Text.Json

Working with JSON is as common as working with a language’s primitive types. .NET has always had basic built-in support for JSON with things like DataContractJsonSerializer, but it didn’t have the functionality, flexibility, or performance necessary to be considered a first-class citizen. The release of .NET Core 3 shifted that narrative with the inclusion of System.Text.Json.

This post explores the different ways that you can read JSON with System.Text.Json. It’s the second post in the series, with a few more articles in the works:

Part 1: Why System.Text.Json exists, major differences with Newtonsoft, and how to find help with the new library.
Part 2 [this post]: Reading JSON documents.
Part 3: Writing JSON documents.
Part 4 [coming soon]: Model Binding in ASP.NET Core.
Part 5 [coming soon]: Considerations for using System.Text.Json in a production-grade project.

Overview

System.Text.Json provides three different ways to read JSON. Each approach exposes the data in a different way, and the one you choose depends on what you’re trying to do:

JsonSerializer: The “general-purpose” API, meant to deserialize JSON into POCOs. It’s similar to Newtonsoft’s DeserializeObject, with some additional overloads for reading streams and raw bytes more efficiently.
JsonDocument: The “advanced” API that breaks down a JSON document into its constituent parts, and exposes it through a document object model.
Utf8JsonReader: The “full control” API that lets you decide what to do with each JSON token all the while keeping memory usage down to a minimum.

Let’s get a better idea of when to use each of these APIs in real-world scenarios.

JsonSerializer

It’s fairly common to have some JSON that you want to deserialize to an object. Just like with Json.NET, you can pass a string of JSON to the Deserialize method and get back a newly-instantiated POCO that represents the data. There’s nothing fancy about that. But if you’re reading data from a file or over the network, you’re more likely to be working with a stream or an array of bytes than with a string.

That’s where the other overloads of the Deserialize method come in handy:

public static ValueTask<TValue> DeserializeAsync<TValue>(Stream utf8Json, JsonSerializerOptions options = null, CancellationToken cancellationToken = default);

public static TValue Deserialize<TValue>(ReadOnlySpan<byte> utf8Json, JsonSerializerOptions options = null);

The DeserializeAsync method is useful any time you’re reading a stream that contains JSON. One such place is the function trigger for an HTTP-based Azure Function. By default, a new HTTP trigger function comes pre-loaded with code similar to the following:

string requestBody = await new StreamReader(req.Body).ReadToEndAsync();
var data = JsonConvert.DeserializeObject<SomeObject>(requestBody);

The above code uses Json.NET, but you get the idea. The stream is read completely into a string, which is then deserialized to a POCO. The same thing can be accomplished with System.Text.Json’s DeserializeAsync method in a single statement:

var data = await JsonSerializer.DeserializeAsync<SomeObject>(req.Body);

It’s much neater to deserialize the request body this way, and it avoids the unnecessary string allocation, since the serializer consumes the stream for you.
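
To try this outside of an Azure Function, here’s a minimal sketch that does the same thing with an in-memory stream. The BlogPost class, the StreamExample class, and the sample JSON are made up for illustration; a MemoryStream simply stands in for req.Body or a FileStream.

```csharp
using System.IO;
using System.Text;
using System.Text.Json;
using System.Threading.Tasks;

// Hypothetical POCO, used only for this illustration.
public class BlogPost
{
    public string Topic { get; set; }
    public int Part { get; set; }
}

public static class StreamExample
{
    // Deserializes UTF-8 JSON straight from the stream,
    // without building an intermediate string first.
    public static async Task<BlogPost> ReadPostAsync(Stream utf8Json)
    {
        return await JsonSerializer.DeserializeAsync<BlogPost>(utf8Json);
    }

    // Stand-in for a request body or a file stream.
    public static Stream SampleStream()
    {
        byte[] bytes = Encoding.UTF8.GetBytes("{\"Topic\":\"Reading JSON\",\"Part\":2}");
        return new MemoryStream(bytes);
    }
}
```

Calling await StreamExample.ReadPostAsync(StreamExample.SampleStream()) should give you a BlogPost with Topic and Part filled in. Note that the stream has to contain UTF-8 encoded JSON.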

The Deserialize method can also take a ReadOnlySpan of bytes as input. Much like the stream example above, you previously had to decode the bytes into a string before deserializing them. Instead, if you’ve already got the data loaded in memory as UTF-8 bytes, this overload saves you a few allocations and parses the JSON directly into a POCO.
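
As a rough sketch of that overload, assuming a hypothetical BlogPost class and a made-up SpanExample helper, it could look like this. A byte[] converts implicitly to ReadOnlySpan<byte>, so you can pass an array directly.

```csharp
using System;
using System.Text;
using System.Text.Json;

// Hypothetical POCO, used only for this illustration.
public class BlogPost
{
    public string Topic { get; set; }
    public int Part { get; set; }
}

public static class SpanExample
{
    // Parses UTF-8 bytes directly; no string is created for the whole payload.
    public static BlogPost ReadPost(ReadOnlySpan<byte> utf8Json)
    {
        return JsonSerializer.Deserialize<BlogPost>(utf8Json);
    }

    // Stand-in for bytes you already have in memory (a file, a message, etc.).
    public static byte[] SampleBytes()
    {
        return Encoding.UTF8.GetBytes("{\"Topic\":\"Reading JSON\",\"Part\":2}");
    }
}
```

Usage would be something like SpanExample.ReadPost(SpanExample.SampleBytes()). If the JSON only occupies part of a larger buffer, you can slice it first with AsSpan(start, length).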

It’s also worth noting that some of the default options for System.Text.Json are different from Json.NET. System.Text.Json adheres to RFC 8259, so if you’re ever left wondering why a setting is different from Newtonsoft, that’s probably why.
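
One difference you’re likely to run into: comments and trailing commas aren’t part of RFC 8259, so System.Text.Json rejects them by default. Here’s a small sketch of how you might opt back in. The BlogPost class, the OptionsExample class, and the sample JSON are assumptions made for the example.

```csharp
using System.Text.Json;

// Hypothetical POCO, used only for this illustration.
public class BlogPost
{
    public string Topic { get; set; }
    public int Part { get; set; }
}

public static class OptionsExample
{
    // Not strictly valid JSON: it has a comment and a trailing comma.
    public const string LooseJson =
        "{\"Topic\":\"Reading JSON\", /* draft */ \"Part\":2,}";

    // With default options the serializer throws a JsonException.
    public static bool DefaultsReject()
    {
        try
        {
            JsonSerializer.Deserialize<BlogPost>(LooseJson);
            return false;
        }
        catch (JsonException)
        {
            return true;
        }
    }

    // Opting in to the more lenient behaviour.
    public static BlogPost ReadLeniently()
    {
        var options = new JsonSerializerOptions
        {
            AllowTrailingCommas = true,
            ReadCommentHandling = JsonCommentHandling.Skip
        };
        return JsonSerializer.Deserialize<BlogPost>(LooseJson, options);
    }
}
```

If you deserialize in a hot path, it’s worth creating the JsonSerializerOptions instance once and reusing it rather than building a new one per call.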

You should use JsonSerializer when you:

Have a POCO that matches the JSON data, or it’s easy to create one.
Need to use most of the properties of the JSON in your application code.

JsonDocument

JsonDocument.Parse deserializes JSON to its constituent parts and makes it accessible through an object model that can represent any valid JSON. The object model gives you the power to read arbitrary parts of the JSON document, without forcing you to define a POCO. In that sense, it’s similar to the JObject type in Newtonsoft, but with a much nicer API.

A JsonDocument is composed of a single property called RootElement, of type JsonElement. Think of a JsonElement as being any JSON value, object, or array.

Here’s a diagram that shows the relationship between JsonDocument, JsonProperty, and JsonElement:

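JsonElement has Get methods for primitive types, such as GetString, GetInt32, GetDouble, and GetBoolean. They take no arguments and read the value of the element they’re called on, so you first navigate to the property and then read it: document.RootElement.GetProperty("Topic").GetString(), or document.RootElement.GetProperty("Part").GetInt32().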

It also has a GetProperty method to retrieve a block of JSON within the document. For example, you would write document.RootElement.GetProperty("Stats") if you wanted to get a JsonElement that includes all the properties in that block of JSON.
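
Here’s a minimal sketch of both ideas together. The JSON, including its "Stats" block, and the ElementExample class are invented for illustration. GetProperty throws a KeyNotFoundException when the property doesn’t exist, so the second method uses TryGetProperty for values that may be missing.

```csharp
using System.Text.Json;

public static class ElementExample
{
    // Hypothetical document; the "Stats" block is made up for this example.
    public const string Json =
        "{\"Topic\":\"Reading JSON\",\"Part\":2,\"Stats\":{\"Views\":21,\"Words\":1500}}";

    // Navigates into the nested "Stats" object and reads a number from it.
    public static int ReadViews()
    {
        using (JsonDocument document = JsonDocument.Parse(Json))
        {
            JsonElement stats = document.RootElement.GetProperty("Stats");
            return stats.GetProperty("Views").GetInt32();
        }
    }

    // Returns null when the property is missing or isn't a JSON string,
    // instead of throwing.
    public static string ReadOptionalString(string name)
    {
        using (JsonDocument document = JsonDocument.Parse(Json))
        {
            bool found = document.RootElement.TryGetProperty(name, out JsonElement value);
            return found && value.ValueKind == JsonValueKind.String
                ? value.GetString()
                : null;
        }
    }
}
```

The typed getters also have Try counterparts (TryGetInt32, TryGetGuid, and so on) for when you aren’t sure a value fits the type you expect.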

There are two ways to go about getting to the data that interests you. The first, if you know what you’re looking for, is to access the element directly through the DOM. You could for example do the following to get a known property:

// {"Topic":"Json Serialization Part 1","Part":1,"Author":"Marc","Co-Author":"Helen","Keywords":["json","netcore","parsing"]}

// JsonDocument rents pooled memory, so dispose it when you're done.
using var blogPost = JsonDocument.Parse(stringifiedJson);
var topic = blogPost.RootElement.GetProperty("Topic").GetString();

That works great for random access to a property that you know how to find. But what if you’re looking for a property that could be anywhere in the document? Or you need to read a particular property from each object in a JSON array? That’s where EnumerateObject and EnumerateArray come in. They can be used together to walk through any JsonDocument:

// {"Topic":"Json Serialization Part 1","Part":1,"Author":"Marc","Co-Author":"Helen","Keywords":["json","netcore","parsing"]}
// Requires: using System.Linq;
using var blogPost = JsonDocument.Parse(stringifiedJson);

// Find all authors, yields a list with "Marc", "Helen"
var authors = blogPost.RootElement.EnumerateObject()
                   .Where(it => it.Name.Contains("Author") && it.Value.ValueKind == JsonValueKind.String)
                   .Select(it => it.Value.GetString())
                   .ToList();

// Find all keywords, yields a list with "json", "netcore", "parsing"
var keywords = blogPost.RootElement.EnumerateObject()
                  .Where(it => it.Value.ValueKind == JsonValueKind.Array && it.Name == "Keywords")
                  .SelectMany(it => it.Value.EnumerateArray().Select(that => that.GetString()))
                  .ToList();
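
The queries above only look at the top level of the document. For a property that could be anywhere, one option is to combine EnumerateObject and EnumerateArray recursively. This is a sketch rather than a definitive implementation, and the JsonWalker class and its method names are my own invention:

```csharp
using System.Collections.Generic;
using System.Text.Json;

public static class JsonWalker
{
    // Yields the value of every property called `name`, at any depth.
    public static IEnumerable<JsonElement> FindAll(JsonElement element, string name)
    {
        switch (element.ValueKind)
        {
            case JsonValueKind.Object:
                foreach (JsonProperty property in element.EnumerateObject())
                {
                    if (property.NameEquals(name))
                        yield return property.Value;

                    // Keep looking inside the property's value.
                    foreach (JsonElement match in FindAll(property.Value, name))
                        yield return match;
                }
                break;

            case JsonValueKind.Array:
                foreach (JsonElement item in element.EnumerateArray())
                    foreach (JsonElement match in FindAll(item, name))
                        yield return match;
                break;
        }
    }

    // Collects the matching string values before the document is disposed.
    public static List<string> FindStrings(string json, string name)
    {
        var results = new List<string>();
        using (JsonDocument document = JsonDocument.Parse(json))
        {
            foreach (JsonElement match in FindAll(document.RootElement, name))
            {
                if (match.ValueKind == JsonValueKind.String)
                    results.Add(match.GetString());
            }
        }
        return results;
    }
}
```

FindStrings copies the values out into a list because a JsonElement is only usable while its JsonDocument is alive. The same EnumerateObject loop is also how you’d read an object whose property names you don’t know ahead of time: each JsonProperty gives you both the Name and the Value.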

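Serialization and deserialization are both expensive operations. The JsonDocument API is designed to keep allocations down to a minimum, reducing the impact it has on your application. It does this by renting memory from a pool, which is why JsonDocument is disposable: dispose it once you’re done so that the memory goes back to the pool.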

You should use JsonDocument and its related types when:

The JSON would be too complex to represent in a POCO.
You need access to only a few specific parts of the JSON data.
You don’t know the format of the JSON or the JSON could have multiple formats.

Utf8JsonReader

Utf8JsonReader is lower level than both the JsonSerializer and JsonDocument APIs. It operates on individual JSON tokens so that you can decide what to do with each token. It’s designed to customize the deserialization process and keep allocations to a minimum, allowing you to read very large documents that wouldn’t be feasible with other deserialization means. You could use it, for example, to:

Find the value of a particular property hidden deep within the JSON.
Filter for JSON tokens that match some criteria.
Count the number of tokens that match some criteria.
Deserialize only the values you need from a large JSON.
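Read a large file piece by piece, by handing the reader buffers of bytes that you’ve read from a stream (it can’t consume a Stream directly).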

Utf8JsonReader is for what I would consider edge cases — it’s not something you’re likely to use on a daily basis. For that reason, I won’t show any examples of how to use it here, but you can refer to the linked articles above for more details on its API.

You should consider using Utf8JsonReader when:

You need full control of how and what you’re going to deserialize.
You have a really large JSON document that can’t feasibly be read any other way.
You have to do some special processing of the JSON document, like counting certain tokens.

Summary

We saw a few different ways to parse JSON data with System.Text.Json. The method you choose depends on what you’re trying to accomplish and can be summarized as below:

API	Good for
JsonSerializer	Small to medium size JSON that’s deserializable to a POCO; making all properties and values accessible to your application code.
JsonDocument	Complex JSON documents; reading only specific parts of the JSON; walking through an unknown JSON format.
Utf8JsonReader	Reading extremely large JSON data sets; customizing the deserialization process to handle special scenarios; controlling deserialization behaviour.

Now that we can read JSON data any way that we like, it’s time to figure out how to write JSON for others to consume. Look for that post around mid-October.

13 comments

Miroslav Vanický says:

December 4, 2020 at 6:03 am

Hi,

thank You for this nice summary. I have one question to it. You write that Utf8JsonReader can be used to “Reading a large file from a stream”. This is the way I would like to use it, but if I see correctly there is no way how to do it now.
As I saw, some have written wrappers around Utf8JsonReader to achieve it, but pure Utf8JsonReader is not able to work with streams.

1. marc says:
  
  December 5, 2020 at 3:19 pm
  
  Hi Miroslav,
  
  You’re correct, Utf8JsonReader can’t read the stream directly. Instead, you need to supply it with a span of bytes that you’ve read from a stream. You can see an example here of how they take the span of bytes: https://docs.microsoft.com/en-us/dotnet/standard/serialization/write-custom-serializer-deserializer#use-utf8jsonreader
  
  Hope that helps,
  Marc
  
Adam R says:

December 23, 2020 at 10:48 am

Utf8JsonReader doesn’t allow you read large files in a manageable way, the first operation in the example you have linked is File.ReadAllBytes. I’ve come from using the Jackson JSON library in java, which has a synchronous parse option to read in a chunk at a time or indicate the next token is unavailable which also makes it suitable for network streams and that is what is missing here.

1. marc says:
  
  December 24, 2020 at 11:16 am
  
  Hi Adam, thanks for the comment. You’re the 2nd person to mention I’ve worded it in a way that indicates you can read the stream from Utf8JsonReader. I’m going to update the article to make it a bit clearer. Cheers, Marc.
  
Antonio Santana says:

February 11, 2021 at 2:27 pm

Awesome, congratulations. The explanation helped a lot.

ccsolviach says:

April 21, 2021 at 11:15 am

Hi Marc,
interesting read, thanks for posting it.

what about: https://stackoverflow.com/questions/67198596/how-to-validate-json-using-system-text-json-before-deserialization

any ideas?

best,

Chris

1. marc says:
  
  April 22, 2021 at 7:08 pm
  
  Hi Chris,
  
  There isn’t anything built in to System.Text.Json to do that yet, as far as I know. Depending on how complex your need is, I would probably do something with the JsonElement.TryGetxyz methods: https://docs.microsoft.com/en-us/dotnet/api/system.text.json.jsonelement.trygetguid?view=net-5.0 and build my own very simple validator. If it’s to validate many properties on complex objects, then I’d look around if anyone has built something similar that can work alongside System.Text.Json, but unfortunately I don’t know of any.
  
  Hope that helps,
  Marc
  
jdege says:

July 25, 2021 at 11:29 am

I’m trying to serialize and deserialize a POCO in System.Text.Json, and it simply doesn’t work.

When I deserialize to a type that includes an “object” property, it’s being deserialized as a ValueKind, which is worthless.

spj_uk says:

January 14, 2022 at 8:52 am

Thanks for this. What about if the JsonProperty “stats” had dynamic elements, e.g. in one it was wordcount and views and in another pages, chapters

e.g.
"stats":
{
"views": 21,
"pages": 500
}

How can you retrieve elements where you may not know the name, for example?
