Q: What's the difference between Avro JSON encoding and regular JSON?

Avro defines a JSON encoding in its own spec that's not the same as the JSON you'd write by hand. The biggest difference is unions: a union value ["null", "string"] containing the string "hello" is encoded as {"string": "hello"} in Avro JSON, but as just "hello" in normal JSON. This tool produces the normal-JSON view, which is what most engineers actually want to read.

Q: How do union types like ["null", "string"] appear in JSON?

In normal JSON they appear as either null or a value of the non-null type. So a field of type ["null", "string"] with value "hello" is just "hello", and with no value is null. In Avro's own JSON encoding the same value would be {"string": "hello"} or null. The tool uses the normal JSON form.

Q: What about logical types like date, timestamp, and decimal?

Logical types layer a semantic meaning on top of a primitive: date is an int (days since 1970-01-01), timestamp-millis is a long (milliseconds since epoch), decimal is bytes with a fixed precision and scale, and uuid is a string. In a JSON view these are typically rendered as their human-readable form: "2026-05-20", "2026-05-20T10:30:00Z", "123.45", "550e8400-e29b-41d4-a716-446655440000". The tool follows this convention.

Q: How do I convert binary Avro data to JSON?

Use one of: (1) avro-tools tojson --pretty schema.avsc .read(...).toString() in the Java SDK, avro.io.DatumReader in Python, or avro.Decoder in Go. Remember to strip the Confluent magic-byte + schema-ID header (the first 5 bytes) if you're consuming raw Kafka messages.

Question 1

What does this tool do?

Accepted Answer

It takes an Apache Avro schema written in the standard JSON-schema form (with "type": "record", "fields": [...], etc.) and generates a sample JSON document that matches the schema. It's useful for previewing what an Avro-encoded Kafka message or HDFS record will look like once decoded into plain JSON.

Question 2

Does this tool decode binary Avro records?

Accepted Answer

No — this tool only works on the schema text. To decode an actual binary Avro file you need avro-tools tojson schema.avsc < data.avro, the Confluent CLI (confluent kafka topic consume --value-format avro), or an Avro library in your language. For Kafka specifically, the first 5 bytes of every Avro-encoded message are a magic byte (0x00) plus a 4-byte schema ID — you must strip those before passing the rest to a raw Avro decoder.

Question 3

What's the difference between Avro JSON encoding and regular JSON?

Accepted Answer

Avro defines a JSON encoding in its own spec that's not the same as the JSON you'd write by hand. The biggest difference is unions: a union value ["null", "string"] containing the string "hello" is encoded as {"string": "hello"} in Avro JSON, but as just "hello" in normal JSON. This tool produces the normal-JSON view, which is what most engineers actually want to read.

Question 4

How do union types like ["null", "string"] appear in JSON?

Accepted Answer

In normal JSON they appear as either null or a value of the non-null type. So a field of type ["null", "string"] with value "hello" is just "hello", and with no value is null. In Avro's own JSON encoding the same value would be {"string": "hello"} or null. The tool uses the normal JSON form.

Question 5

What about logical types like date, timestamp, and decimal?

Accepted Answer

Logical types layer a semantic meaning on top of a primitive: date is an int (days since 1970-01-01), timestamp-millis is a long (milliseconds since epoch), decimal is bytes with a fixed precision and scale, and uuid is a string. In a JSON view these are typically rendered as their human-readable form: "2026-05-20", "2026-05-20T10:30:00Z", "123.45", "550e8400-e29b-41d4-a716-446655440000". The tool follows this convention.

Question 6

How do I convert binary Avro data to JSON?

Accepted Answer

Use one of: (1) avro-tools tojson --pretty schema.avsc < data.avro; (2) java -jar avro-tools.jar tojson data.avro if the file is a container with embedded schema; (3) the Confluent CLI: confluent kafka topic consume --value-format avro topic-name; (4) at runtime, DatumReader<GenericRecord>.read(...).toString() in the Java SDK, avro.io.DatumReader in Python, or avro.Decoder in Go. Remember to strip the Confluent magic-byte + schema-ID header (the first 5 bytes) if you're consuming raw Kafka messages.

Question 7

Where do I get the schema for a Kafka message?

Accepted Answer

If your cluster uses the Confluent Schema Registry, the schema ID is encoded in the first 5 bytes of every message. Fetch it with curl http://schema-registry:8081/schemas/ids/ or via the Confluent CLI: confluent schema-registry schema describe --subject -value. AWS Glue and Azure Event Hubs Schema Registry have equivalent APIs.

Avro type	JSON representation
`null`	`null`
`boolean`	`true` / `false`
`int`, `long`	JSON number (long can lose precision if `>2^53`)
`float`, `double`	JSON number
`bytes`, `fixed`	base64 string
`string`	string
`record`	JSON object with named fields
`array`	JSON array
`map`	JSON object with string keys
`enum`	the symbol name as a string
`union ["null", "T"]` (nullable)	`null` or a value of type `T`
`logical: date`	string `"YYYY-MM-DD"` (or a day count in strict Avro JSON encoding)
`logical: timestamp-millis`	string `"2026-05-20T10:30:00Z"` (or millis-since-epoch)
`logical: decimal`	string `"123.45"`
`logical: uuid`	string `"550e8400-e29b-41d4-a716-446655440000"`

TymBits

Title here

Avro to JSON Converter

Avro Schema Input

JSON Output

What this tool does

What is Apache Avro?

When you encounter Avro schemas

Avro’s type system → JSON

Common pitfalls

FAQs

Avro to JSON Converter

Avro Schema Input

JSON Output

What this tool does#

What is Apache Avro?#

When you encounter Avro schemas#

Avro’s type system → JSON#

Common pitfalls#