Kit

Codecs

Encode and decode anything

Introduction

Kit includes a powerful serialisation system called Codecs. Whether you're working with account data, instruction arguments, or custom binary layouts, Codecs give you the tools to transform structured data into bytes — and back again.

Codecs are composable, type-safe, and environment-agnostic. They are designed to provide a flexible and consistent foundation for handling binary data across the Solana stack.

Installation

Codecs are included within the @solana/kit library but you may also install them using their standalone package.

npm install @solana/codecs

Note that the @solana/codecs package itself is composed of several smaller packages, each providing a different set of codec helpers. Here's the list of all packages containing codecs, should you need to install them individually:

PackageDescription
@solana/kitIncludes @solana/codecs.
@solana/codecsIncludes all codecs packages below.
@solana/codecs-coreCore types and utilities for building codecs.
@solana/codecs-numbersCodecs for numbers of various sizes and characteristics.
@solana/codecs-stringsCodecs for strings of various encodings and size strategies.
@solana/codecs-data-structuresCodecs for a variety of data structures such as objects, enums, arrays, maps, etc.
@solana/optionsCodecs for Rust-like Options in JavaScript.

What is a Codec?

A Codec is an object that knows how to encode a any type into a Uint8Array and how to decode a Uint8Array back into that value.

No matter which serialization strategy we use, Codecs abstract away its implementation and offer a simple encode and decode interface. They are also highly composable, allowing us to build complex data structures from simple building blocks.

Here's a quick example that encodes and decodes a simple Person type.

// Use composable codecs to build complex data structures.
type  = { : string; : number };
const  = (): <> =>
  ([
    ["name", ((), ())],
    ["age", ()],
  ]);
 
// Use your own codecs to encode and decode data.
const  = ();
const  = .({ : "John", : 42 });
const  = .();

Composing codecs

The easiest way to create your own codecs is to compose the various codecs at your disposal.

For instance, consider the following codecs available:

  • getStructCodec: Creates a codec for objects with named fields.
  • getU32Codec: Creates a codec for unsigned 32-bit integers.
  • getUtf8Codec: Creates a codec for UTF-8 strings.
  • addCodecSizePrefix: Creates a codec that prefixes the encoded data with its length.
  • getBooleanCodec: Creates a codec for booleans using a single byte.

By combining them together we can create a custom codec for the following Person type.

type  = {
  : string;
  : number;
  : boolean;
};
 
const  = (): <> =>
  ([
    ["name", ((), ())],
    ["age", ()],
    ["verified", ()],
  ]);

This function returns a Codec object which contains both an encode and decode function that can be used to convert a Person type to and from a Uint8Array.

const  = ();
const  = .({ : "John", : 42, : true });
const  = .();

There is a significant library of composable codecs at your disposal, enabling you to compose complex types. Check out the available codecs section for more information. If you need a custom codec that cannot be composed from existing ones, you can always create your own as we will see in the "Creating custom codecs" section below.

Separate encoders and decoders

Whilst Codecs can both encode and decode, it is possible to only focus on encoding or decoding data, enabling the unused logic to be tree-shaken. For instance, here's our previous example using Encoders only to encode a Person type.

const  = (): <> =>
  ([
    ["name", ((), ())],
    ["age", ()],
    ["verified", ()],
  ]);
 
const  = ().({ : "John", : 42, : true });

The same can be done for decoding the Person type by using Decoders like so.

const  = (): <> =>
  ([
    ["name", ((), ())],
    ["age", ()],
    ["verified", ()],
  ]);
 
const  = ().();

Combining encoders and decoders

Separating Codecs into Encoders and Decoders is particularly good practice for library maintainers as it allows their users to tree-shake any of the encoders and/or decoders they don't need. However, we may still want to offer a codec helper for users who need both for convenience.

That's why this library offers a combineCodec helper that creates a Codec instance from a matching Encoder and Decoder.

const  = (): <> => ((), ());

This means library maintainers can offer Encoders, Decoders and Codecs for all their types whilst staying efficient and tree-shakeable. In summary, we recommend the following pattern when creating codecs for library types.

type MyType = /* ... */;
const getMyTypeEncoder = (): Encoder<MyType> => { /* ... */ };
const getMyTypeDecoder = (): Decoder<MyType> => { /* ... */ };
const getMyTypeCodec = (): Codec<MyType> => combineCodec(
  getMyTypeEncoder(),
  getMyTypeDecoder()
);

Different From and To types

When creating codecs, the encoded type is allowed to be looser than the decoded type. A good example of that is the u64 number codec:

const : <number | bigint, bigint> = ();

As you can see, the first type parameter is looser since it accepts numbers or big integers, whereas the second type parameter only accepts big integers. That's because when encoding a u64 number, you may provide either a bigint or a number for convenience. However, when you decode a u64 number, you will always get a bigint because not all u64 values can fit in a JavaScript number type.

const  = .(42);
const  = .(); // BigInt(42)

This relationship between the type we encode “From” and decode “To” can be generalized in TypeScript as To extends From.

Here's another example using an object with default values. You can read more about the transformCodec helper below.

type  = { : string; : number };
type  = { : string; ?: number };
 
const  = (): <> =>
  (
    ([
      ["name", ((), ())],
      ["age", ()],
    ]),
    () => ({ ..., : . ?? 42 }),
  );
 
const  = (): <> =>
  ([
    ["name", ((), ())],
    ["age", ()],
  ]);
 
const  = (): <, > => ((), ());

Fixed-size and variable-size codecs

It is also worth noting that Codecs can either be of fixed size or variable size.

FixedSizeCodecs have a fixedSize number attribute that tells us exactly how big their encoded data is in bytes.

const  = ();
 satisfies <number>;
.; // 4 bytes.

On the other hand, VariableSizeCodecs do not know the size of their encoded data in advance. Instead, they will grab that information either from the provided encoded data or from the value to encode. For the former, we can simply access the length of the Uint8Array. For the latter, it provides a getSizeFromValue that tells us the encoded byte size of the provided value.

const  = ((), ());
 satisfies <string>;
.("hello world"); // 4 + 11 bytes.

Also note that, if the VariableSizeCodec is bounded by a maximum size, it can be provided as a maxSize number attribute.

The following type guards are available to identify and/or assert the size of codecs: isFixedSize, isVariableSize, assertIsFixedSize and assertIsVariableSize.

Finally, note that the same is true for Encoders and Decoders.

  • A FixedSizeEncoder has a fixedSize number attribute.
  • A VariableSizeEncoder has a getSizeFromValue function and an optional maxSize number attribute.
  • A FixedSizeDecoder has a fixedSize number attribute.
  • A VariableSizeDecoder has an optional maxSize number attribute.

Creating custom codecs

If composing codecs isn't enough for you, you may implement your own codec logic by using the createCodec function. This function requires an object with a read and a write function telling us how to read from and write to an existing byte array.

The read function accepts the bytes to decode from and the offset at each we should start reading. It returns an array with two items:

  • The first item should be the decoded value.
  • The second item should be the next offset to read from.
({
  (, ) {
    const  = [];
    return [,  + 1];
  },
  ,
  ,
});

Reciprocally, the write function accepts the value to encode, the array of bytes to write the encoded value to and the offset at which it should be written. It should encode the given value, insert it in the byte array, and provide the next offset to write to as the return value.

({
  (: number, , ) {
    .([], );
    return  + 1;
  },
  ,
  ,
});

Additionally, we must specify the size of the codec. If we are defining a FixedSizeCodec, we must simply provide the fixedSize number attribute. For VariableSizeCodecs, we must provide the getSizeFromValue function as described in the previous section.

// FixedSizeCodec.
({
  : 1,
  ,
  ,
});
 
// VariableSizeCodec.
({
  : (: string) => .,
  ,
  ,
});

Here's a concrete example of a custom codec that encodes any unsigned integer in a single byte. Since a single byte can only store integers from 0 to 255, if any other integer is provided it will take its modulo 256 to ensure it fits in a single byte. Because it always requires a single byte, that codec is a FixedSizeCodec of size 1.

import {  } from "@solana/kit";
 
const  = () =>
  <number>({
    : 1,
    (, ) {
      const  = [];
      return [,  + 1];
    },
    (, , ) {
      .([ % 256], );
      return  + 1;
    },
  });

Note that, it is also possible to create custom encoders and decoders separately by using the createEncoder and createDecoder functions respectively and then use the combineCodec function on them just like we were doing with composed codecs.

This approach is recommended to library maintainers as it allows their users to tree-shake any of the encoders and/or decoders they don't need.

Here's our previous modulo u8 example but split into separate Encoder, Decoder and Codec instances.

import { , ,  } from "@solana/kit";
 
const  = () =>
  <number>({
    : 1,
    (, , ) {
      .([ % 256], );
      return  + 1;
    },
  });
 
const  = () =>
  <number>({
    : 1,
    (, ) {
      const  = [];
      return [,  + 1];
    },
  });
 
const  = () => ((), ());

Here's another example returning a VariableSizeCodec. This one transforms a simple string composed of characters from a to z to a buffer of numbers from 1 to 26 where 0 bytes are spaces.

import { , ,  } from "@solana/kit";
 
const  = " abcdefghijklmnopqrstuvwxyz";
 
const  = () =>
  <string>({
    : () => .,
    (, , ) {
      const  = [...].(() => .());
      .(, );
      return  + .;
    },
  });
 
const  = () =>
  <string>({
    (, ) {
      const  = [....()].(() => .()).("");
      return [, .];
    },
  });
 
const  = () => ((), ());

Available codecs

Core utilities

Numbers

Strings

Data structures

Core utilities listing

addCodecSentinel

One way of delimiting the size of a codec is to use sentinels. The addCodecSentinel function allows us to add a sentinel to the end of the encoded data and to read until that sentinel is found when decoding. It accepts any codec and a Uint8Array sentinel responsible for delimiting the encoded data.

const  = ((), new ([255, 255]));
.("hello");
// 0x68656c6c6fffff
//   |        └-- Our sentinel.
//   └-- Our encoded string.

Note that the sentinel must not be present in the encoded data and must be present in the decoded data for this to work. If this is not the case, dedicated errors will be thrown.

const  = new ([108, 108]); // 'll'
const  = ((), );
 
.("hello"); // Throws: sentinel is in encoded data.
.(new ([1, 2, 3])); // Throws: sentinel missing in decoded data.

Separate addEncoderSentinel and addDecoderSentinel functions are also available.

const  = ((), ).("hello");
const  = ((), ).();

addCodecSizePrefix

The addCodecSizePrefix function allows us to store the byte size of any codec as a number prefix, enabling us to contain variable-size codecs to their actual size.

When encoding, the size of the encoded data is stored before the encoded data itself. When decoding, the size is read first to know how many bytes to read next.

For example, say we want to represent a variable-size base-58 string using a u32 size prefix. Here's how we can use the addCodecSizePrefix function to achieve that.

const  = () => ((), ());
 
().("hello world");
// 0x0b00000068656c6c6f20776f726c64
//   |       └-- Our encoded base-58 string.
//   └-- Our encoded u32 size prefix.

You may also use the addEncoderSizePrefix and addDecoderSizePrefix functions to separate your codec logic like so:

const  = () => ((), ());
const  = () => ((), ());
const  = () => ((), ());

containsBytes

Checks if a Uint8Array contains another Uint8Array at a given offset.

(new ([1, 2, 3, 4]), new ([2, 3]), 1); // true
(new ([1, 2, 3, 4]), new ([2, 3]), 2); // false

fixBytes

Pads or truncates a Uint8Array so it has the specified length.

(new ([1, 2]), 4); // Uint8Array([1, 2, 0, 0])
(new ([1, 2, 3, 4]), 2); // Uint8Array([1, 2])

fixCodecSize

The fixCodecSize function allows us to bind the size of a given codec to the given fixed size.

For instance, say we wanted to represent a base-58 string that uses exactly 32 bytes when decoded. Here's how we can use the fixCodecSize helper to achieve that.

const  = () => ((), 32);

You may also use the fixEncoderSize and fixDecoderSize functions to separate your codec logic like so:

const  = () => ((), 32);
const  = () => ((), 32);
const  = () => ((), ());

mergeBytes

Concatenates an array of Uint8Arrays into a single Uint8Array.

const  = new ([0x01, 0x02]);
const  = new ([]);
const  = new ([0x03, 0x04]);
const  = ([, , ]);
//    ^ [0x01, 0x02, 0x03, 0x04]

offsetCodec

The offsetCodec function is a powerful codec primitive that allows us to move the offset of a given codec forward or backwards. It accepts one or two functions that takes the current offset and returns a new offset.

To understand how this works, let's take the following biggerU32Codec example which encodes a u32 number inside an 8-byte buffer by using the resizeCodec helper.

const  = ((), () =>  + 4);
.(0xffffffff);
// 0xffffffff00000000
//   |       └-- Empty buffer space caused by the resizeCodec function.
//   └-- Our encoded u32 number.

Now, let's say we want to move the offset of that codec 2 bytes forward so that the encoded number sits in the middle of the buffer. To achieve, this we can use the offsetCodec helper and provide a preOffset function that moves the "pre-offset" of the codec 2 bytes forward.

const  = (, {
  : ({  }) =>  + 2,
});
.(0xffffffff);
// 0x0000ffffffff0000
//       └-- Our encoded u32 number is now in the middle of the buffer.

We refer to this offset as the "pre-offset" because, once the inner codec is encoded or decoded, an additional offset will be returned which we refer to as the "post-offset". That "post-offset" is important as, unless we are reaching the end of our codec, it will be used by any further codecs to continue encoding or decoding data.

By default, that "post-offset" is simply the addition of the "pre-offset" and the size of the encoded or decoded inner data.

const  = (, {
  : ({  }) =>  + 2,
});
.(0xffffffff);
// 0x0000ffffffff0000
//   |   |       └-- Post-offset.
//   |   └-- New pre-offset: The original pre-offset + 2.
//   └-- Pre-offset: The original pre-offset before we adjusted it.

However, you may also provide a postOffset function to adjust the "post-offset". For instance, let's push the "post-offset" 2 bytes forward as well such that any further codecs will start doing their job at the end of our 8-byte u32 number.

const  = (, {
  : ({  }) =>  + 2,
  : ({  }) =>  + 2,
});
.(0xffffffff);
// 0x0000ffffffff0000
//   |   |       |   └-- New post-offset: The original post-offset + 2.
//   |   |       └-- Post-offset: The original post-offset before we adjusted it.
//   |   └-- New pre-offset: The original pre-offset + 2.
//   └-- Pre-offset: The original pre-offset before we adjusted it.

Both the preOffset and postOffset functions offer the following attributes:

  • bytes: The entire byte array being encoded or decoded.
  • preOffset: The original and unaltered pre-offset.
  • wrapBytes: A helper function that wraps the given offset around the byte array length. E.g. wrapBytes(-1) will refer to the last byte of the byte array.

Additionally, the post-offset function also provides the following attributes:

  • newPreOffset: The new pre-offset after the pre-offset function has been applied.
  • postOffset: The original and unaltered post-offset.

Note that you may also decide to ignore these attributes to achieve absolute offsets. However, relative offsets are usually recommended as they won't break your codecs when composed with other codecs.

const  = (, {
  : () => 2,
  : () => 8,
});
.(0xffffffff);
// 0x0000ffffffff0000

Also note that any negative offset or offset that exceeds the size of the byte array will throw a SolanaError of code SOLANA_ERROR__CODECS__OFFSET_OUT_OF_RANGE.

const  = (, { : () => -4 });
.(0xffffffff);
// throws new SolanaError(SOLANA_ERROR__CODECS__OFFSET_OUT_OF_RANGE)

To avoid this, you may use the wrapBytes function to wrap the offset around the byte array length. For instance, here's how we can use the wrapBytes function to move the pre-offset 4 bytes from the end of the byte array.

const  = (, {
  : ({  }) => (-4),
});
.(0xffffffff);
// 0x00000000ffffffff

As you can see, the offsetCodec helper allows you to jump all over the place with your codecs. This non-linear approach to encoding and decoding data allows you to achieve complex serialization strategies that would otherwise be impossible.

The offsetEncoder and offsetDecoder functions can also be used to split your codec logic into tree-shakeable functions.

const  = () => (, { : ({  }) =>  + 2 });
const  = () => (, { : ({  }) =>  + 2 });
const  = () => ((), ());

padBytes

Pads a Uint8Array with zeroes (to the right) to the specified length.

(new ([1, 2]), 4); // Uint8Array([1, 2, 0, 0])
(new ([1, 2, 3, 4]), 2); // Uint8Array([1, 2, 3, 4])

padLeftCodec

The padLeftCodec helper can be used to add padding to the left of a given codec. It accepts an offset number that tells us how big the padding should be.

const  = ((), 4);
.(0xffff);
// 0x00000000ffff
//   |       └-- Our encoded u16 number.
//   └-- Our 4-byte padding.

Note that the padLeftCodec function is a simple wrapper around the offsetCodec and resizeCodec functions. For more complex padding strategies, you may want to use the offsetCodec and resizeCodec functions directly instead.

Encoder-only and decoder-only helpers are available for these padding functions.

const  = () => ((), 6);
const  = () => ((), 6);
const  = () => ((), ());

padRightCodec

The padRightCodec helper can be used to add padding to the right of a given codec. It accepts an offset number that tells us how big the padding should be.

const  = ((), 4);
.(0xffff);
// 0xffff00000000
//   |   └-- Our 4-byte padding.
//   └-- Our encoded u16 number.

Note that the padRightCodec function is a simple wrapper around the offsetCodec and resizeCodec functions. For more complex padding strategies, you may want to use the offsetCodec and resizeCodec functions directly instead.

Encoder-only and decoder-only helpers are available for these padding functions.

const  = () => ((), 6);
const  = () => ((), 6);
const  = () => ((), ());

resizeCodec

The resizeCodec helper re-defines the size of a given codec by accepting a function that takes the current size of the codec and returns a new size. This works for both fixed-size and variable-size codecs.

// Fixed-size codec.
const  = () => ((), () =>  + 4);
().(42);
// 0x2a00000000000000
//   |       └-- Empty buffer space caused by the resizeCodec function.
//   └-- Our encoded u32 number.
 
// Variable-size codec.
const  = () => ((), () =>  + 4);
().("ABC");
// 0x41424300000000
//   |     └-- Empty buffer space caused by the resizeCodec function.
//   └-- Our encoded string.

Note that the resizeCodec function doesn't change any encoded or decoded bytes, it merely tells the encode and decode functions how big the Uint8Array should be before delegating to their respective write and read functions. In fact, this is completely bypassed when using the write and read functions directly. For instance:

const  = () => ((), () =>  + 4);
 
// Using the encode function.
().(42);
// 0x2a00000000000000
 
// Using the lower-level write function.
const  = new (4);
().(42, , 0);
// 0x2a000000

So when would it make sense to use the resizeCodec function? This function is particularly useful when combined with the offsetCodec function. Whilst offsetCodec may help us push the offset forward — e.g. to skip some padding — it won't change the size of the encoded data which means the last bytes will be truncated by how much we pushed the offset forward. The resizeCodec function can be used to fix that. For instance, here's how we can use the resizeCodec and the offsetCodec functions together to create a struct codec that includes some padding.

const  = ([
  ["name", ((), 8)],
  // There is a 4-byte padding between name and age.
  [
    "age",
    (
      ((), () =>  + 4),
      { : ({  }) =>  + 4 },
    ),
  ],
]);
 
.({ : "Alice", : 42 });
// 0x416c696365000000000000002a000000
//   |               |       └-- Our encoded u32 (42).
//   |               └-- The 4-bytes of padding we are skipping.
//   └-- Our 8-byte encoded string ("Alice").

Note that this can be achieved using the padLeftCodec helper which is implemented that way.

The resizeEncoder and resizeDecoder functions can also be used to split your codec logic into tree-shakeable functions.

const  = () => ((), () =>  + 4);
const  = () => ((), () =>  + 4);
const  = () => ((), ());

reverseCodec

The reverseCodec helper reverses the bytes of the provided FixedSizeCodec.

const  = () => (());

Note that number codecs can already do that for you via their endian option.

const  = () => ({ : . });

The reverseEncoder and reverseDecoder functions can also be used to achieve that.

const  = () => (());
const  = () => (());
const  = () => ((), ());

transformCodec

It is possible to transform a Codec<T> to a Codec<U> by providing two mapping functions: one that goes from T to U and one that does the opposite.

For instance, here's how you would map a u32 integer into a string representation of that number.

const  = () =>
  (
    (),
    (: string): number => (),
    (: number): string => .(),
  );
 
().("42"); // new Uint8Array([42])
().(new ([42])); // "42"

If a Codec has different From and To types, say Codec<OldFrom, OldTo>, and we want to map it to Codec<NewFrom, NewTo>, we must provide functions that map from NewFrom to OldFrom and from OldTo to NewTo.

To illustrate that, let's take our previous getStringU32Codec example but make it use a getU64Codec codec instead as it returns a Codec<number | bigint, bigint>. Additionally, let's make it so our getStringU64Codec function returns a Codec<number | string, string> so that it also accepts numbers when encoding values. Here's what our mapping functions look like:

const  = () =>
  (
    (),
    (: number | string): number | bigint =>
      typeof  === "string" ? () : ,
    (: bigint): string => .(),
  );

Note that the second function that maps the decoded type is optional. That means, you can omit it to simply update or loosen the type to encode whilst keeping the decoded type the same.

This is particularly useful to provide default values to object structures. For instance, here's how we can map a Person codec to give a default value to its age attribute.

type  = { : string; : number };
type  = { : string; ?: number };
const  = (): <, > =>
  ((), (: ):  => ({ ..., : . ?? 42 }));

Similar helpers exist to map Encoder and Decoder instances allowing you to separate your codec logic into tree-shakeable functions. Here's our getStringU32Codec written that way.

const  = () =>
  ((), (: string): number => ());
const  = () => ((), (: number): string => .());
const  = () => ((), ());

Numbers listing

getI8Codec

Encodes and decodes signed 8-bit integers. It supports values from -127 (-2^7) to 128 (2^7 - 1).

Values can be provided as either number or bigint, but the decoded value is always a number.

const  = ();
const  = .(-42); // 0xd6
const  = .(); // -42

getI8Encoder and getI8Decoder functions are also available.

getI16Codec

Encodes and decodes signed 16-bit integers. It supports values from -32,768 (-2^15) to 32,767 (2^15 - 1).

Values can be provided as either number or bigint, but the decoded value is always a number. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(-42); // 0xd6ff
const  = .(); // -42
 
// Big-endian.
const  = ({ : . });
const  = .(-42); // 0xffd6
const  = .(); // -42

getI16Encoder and getI16Decoder functions are also available.

getI32Codec

Encodes and decodes signed 32-bit integers. It supports values from -2,147,483,648 (-2^31) to 2,147,483,647 (2^31 - 1).

Values can be provided as either number or bigint, but the decoded value is always a number. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(-42); // 0xd6ffffff
const  = .(); // -42
 
// Big-endian.
const  = ({ : . });
const  = .(-42); // 0xffffffd6
const  = .(); // -42

getI32Encoder and getI32Decoder functions are also available.

getI64Codec

Encodes and decodes signed 64-bit integers. It supports values from -2^63 to 2^63 - 1.

Values can be provided as either number or bigint, but the decoded value is always a bigint. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(-42); // 0xd6ffffffffffffff
const  = .(); // BigInt(-42)
 
// Big-endian.
const  = ({ : . });
const  = .(-42); // 0xffffffffffffffd6
const  = .(); // BigInt(-42)

getI64Encoder and getI64Decoder functions are also available.

getI128Codec

Encodes and decodes signed 128-bit integers. It supports values from -2^127 to 2^127 - 1.

Values can be provided as either number or bigint, but the decoded value is always a bigint. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(-42); // 0xd6ffffffffffffffffffffffffffffff
const  = .(); // BigInt(-42)
 
// Big-endian.
const  = ({ : . });
const  = .(-42); // 0xffffffffffffffffffffffffffffffd6
const  = .(); // BigInt(-42)

getI128Encoder and getI128Decoder functions are also available.

getF32Codec

Encodes and decodes 32-bit floating-point numbers. Due to the IEEE 754 floating-point representation, some precision loss may occur.

Values can be provided as either number or bigint, but the decoded value is always a number. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(-1.5); // 0x0000c0bf
const  = .(); // -1.5
 
// Big-endian.
const  = ({ : . });
const  = .(-1.5); // 0xbfc00000
const  = .(); // -1.5

getF32Encoder and getF32Decoder functions are also available.

getF64Codec

Encodes and decodes 64-bit floating-point numbers. Due to the IEEE 754 floating-point representation, some precision loss may occur.

Values can be provided as either number or bigint, but the decoded value is always a number. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(-1.5); // 0x000000000000f8bf
const  = .(); // -1.5
 
// Big-endian.
const  = ({ : . });
const  = .(-1.5); // 0xbff8000000000000
const  = .(); // -1.5

getF64Encoder and getF64Decoder functions are also available.

getShortU16Codec

Encodes and decodes unsigned integer using 1 to 3 bytes based on the encoded value. It supports values from 0 to 4,194,303 (2^22 - 1).

The larger the value, the more bytes it uses.

  • If the value is <= 0x7f (127), it is stored in a single byte and the first bit is set to 0 to indicate the end of the value.
  • Otherwise, the first bit is set to 1 to indicate that the value continues in the next byte, which follows the same pattern.
  • This process repeats until the value is fully encoded in up to 3 bytes. The third and last byte, if needed, uses all 8 bits to store the remaining value.

In other words, the encoding scheme follows this structure:

0XXXXXXX                   <- Values 0 to 127 (1 byte)
1XXXXXXX 0XXXXXXX          <- Values 128 to 16,383 (2 bytes)
1XXXXXXX 1XXXXXXX XXXXXXXX <- Values 16,384 to 4,194,303 (3 bytes)

Values can be provided as either number or bigint, but the decoded value is always a number.

const  = ();
const  = .(42); // 0x2a
const  = .(128); // 0x8001
const  = .(16384); // 0x808001
 
.(); // 42
.(); // 128
.(); // 16384

getShortU16Encoder and getShortU16Decoder functions are also available.

getU8Codec

Encodes and decodes unsigned 8-bit integers. It supports values from 0 to 255 (2^8 - 1).

Values can be provided as either number or bigint, but the decoded value is always a number.

const  = ();
const  = .(42); // 0x2a
const  = .(); // 42

getU8Encoder and getU8Decoder functions are also available.

getU16Codec

Encodes and decodes unsigned 16-bit integers. It supports values from 0 to 65,535 (2^16 - 1).

Values can be provided as either number or bigint, but the decoded value is always a number. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(42); // 0x2a00
const  = .(); // 42
 
// Big-endian.
const  = ({ : . });
const  = .(42); // 0x002a
const  = .(); // 42

getU16Encoder and getU16Decoder functions are also available.

getU32Codec

Encodes and decodes unsigned 32-bit integers. It supports values from 0 to 4,294,967,295 (2^32 - 1).

Values can be provided as either number or bigint, but the decoded value is always a number. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(42); // 0x2a000000
const  = .(); // 42
 
// Big-endian.
const  = ({ : . });
const  = .(42); // 0x0000002a
const  = .(); // 42

getU32Encoder and getU32Decoder functions are also available.

getU64Codec

Encodes and decodes unsigned 64-bit integers. It supports values from 0 to 2^64 - 1.

Values can be provided as either number or bigint, but the decoded value is always a bigint. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(42); // 0x2a00000000000000
const  = .(); // 42
 
// Big-endian.
const  = ({ : . });
const  = .(42); // 0x000000000000002a
const  = .(); // 42

getU64Encoder and getU64Decoder functions are also available.

getU128Codec

Encodes and decodes unsigned 128-bit integers. It supports values from 0 to 2^128 - 1.

Values can be provided as either number or bigint, but the decoded value is always a bigint. Endianness can be specified using the endian option. The default is Endian.Little.

// Little-endian.
const  = ();
const  = .(42); // 0x2a000000000000000000000000000000
const  = .(); // 42
 
// Big-endian.
const  = ({ : . });
const  = .(42); // 0x0000000000000000000000000000002a
const  = .(); // 42

getU128Encoder and getU128Decoder functions are also available.

Strings listing

getBase10Codec

Encodes and decodes Base 10 strings.

const  = ();
const  = .("1024"); // 0x0400
const  = .(); // "1024"

This codec does not enforce a size boundary. It will encode and decode all bytes necessary to represent the string. To add size constraints to your codec, you may use utility functions such as fixCodecSize, addCodecSizePrefix or addCodecSentinel.

((), 4).("1024");
// 0x04000000 (padded to 4 bytes)
 
((), ()).("1024");
// 0x020000000400
//   |       └-- The 2 bytes of content.
//   └-- 4-byte prefix telling us to read 2 bytes.
 
((), new ([0xff, 0xff])).("1024");
// 0x0400ffff
//   |   └-- The sentinel signaling the end of the content.
//   └-- The 2 bytes of content.

getBase10Encoder and getBase10Decoder functions are also available.

getBase16Codec

Encodes and decodes Base 16 strings.

const  = ();
const  = .("deadface"); // 0xdeadface
const  = .(); // "deadface"

This codec does not enforce a size boundary. It will encode and decode all bytes necessary to represent the string. To add size constraints to your codec, you may use utility functions such as fixCodecSize, addCodecSizePrefix or addCodecSentinel.

((), 2).("deadface");
// 0xdead (truncated to 2 bytes)
 
((), ()).("deadface");
// 0x04000000deadface
//   |       └-- The 4 bytes of content.
//   └-- 4-byte prefix telling us to read 4 bytes.
 
((), new ([0xff, 0xff])).("deadface");
// 0xdeadfaceffff
//   |       └-- The sentinel signaling the end of the content.
//   └-- The 4 bytes of content.

getBase16Encoder and getBase16Decoder functions are also available.

getBase58Codec

Encodes and decodes Base 58 strings.

const  = ();
const  = .("heLLo"); // 0x1b6a3070
const  = .(); // "heLLo"

This codec does not enforce a size boundary. It will encode and decode all bytes necessary to represent the string. To add size constraints to your codec, you may use utility functions such as fixCodecSize, addCodecSizePrefix or addCodecSentinel.

((), 2).("heLLo");
// 0x1b6a (truncated to 2 bytes)
 
((), ()).("heLLo");
// 0x040000001b6a3070
//   |       └-- The 4 bytes of content.
//   └-- 4-byte prefix telling us to read 4 bytes.
 
((), new ([0xff, 0xff])).("heLLo");
// 0x1b6a3070ffff
//   |       └-- The sentinel signaling the end of the content.
//   └-- The 4 bytes of content.

getBase58Encoder and getBase58Decoder functions are also available.

getBase64Codec

Encodes and decodes Base 64 strings.

const  = ();
const  = .("hello+world"); // 0x85e965a3ec28ae57
const  = .(); // "hello+world"

This codec does not enforce a size boundary. It will encode and decode all bytes necessary to represent the string. To add size constraints to your codec, you may use utility functions such as fixCodecSize, addCodecSizePrefix or addCodecSentinel.

((), 4).("hello+world");
// 0x85e965a3 (truncated to 4 bytes)
 
((), ()).("hello+world");
// 0x0400000085e965a3ec28ae57
//   |       └-- The 8 bytes of content.
//   └-- 4-byte prefix telling us to read 8 bytes.
 
((), new ([0xff, 0xff])).("hello+world");
// 0x85e965a3ec28ae57ffff
//   |               └-- The sentinel signaling the end of the content.
//   └-- The 8 bytes of content.

getBase64Encoder and getBase64Decoder functions are also available.

getBaseXCodec

The getBaseXCodec accepts a custom alphabet of X characters and creates a base-X codec using that alphabet. It does so by iteratively dividing by X and handling leading zeros.

const  = ("0ehlo");
const  = .("hello"); // 0x05bd
const  = .(); // "hello"

This codec does not enforce a size boundary. It will encode and decode all bytes necessary to represent the string. To add size constraints to your codec, you may use utility functions such as fixCodecSize, addCodecSizePrefix or addCodecSentinel.

const  = ("0ehlo");
 
(, 4).("hello");
// 0x05bd0000 (padded to 4 bytes)
 
(, ()).("hello");
// 0x0200000005bd
//   |       └-- The 2 bytes of content.
//   └-- 4-byte prefix telling us to read 2 bytes.
 
(, new ([0xff, 0xff])).("hello");
// 0x05bdffff
//   |   └-- The sentinel signaling the end of the content.
//   └-- The 2 bytes of content.

getBaseXEncoder and getBaseXDecoder functions are also available.

getBaseXResliceCodec

The getBaseXResliceCodec accepts a custom alphabet of X characters and creates a base-X codec using that alphabet.

It does so by re-slicing bytes into custom chunks of bits that are then mapped to the provided alphabet. The number of bits per chunk is also provided as the second argument and should typically be set to log2(alphabet.length).

This is typically used to create codecs whose alphabet's length is a power of 2 such as base-16 or base-64.

const  = ("elho", 2);
const  = .("hellolol"); // 0x4aee
const  = .(); // "hellolol"

This codec does not enforce a size boundary. It will encode and decode all bytes necessary to represent the string. To add size constraints to your codec, you may use utility functions such as fixCodecSize, addCodecSizePrefix or addCodecSentinel.

const  = ("elho", 2);
 
(, 4).("hellolol");
// 0x4aee0000 (padded to 4 bytes)
 
(, ()).("hellolol");
// 0x020000004aee
//   |       └-- The 2 bytes of content.
//   └-- 4-byte prefix telling us to read 2 bytes.
 
(, new ([0xff, 0xff])).("hellolol");
// 0x4aeeffff
//   |   └-- The sentinel signaling the end of the content.
//   └-- The 2 bytes of content.

getBaseXResliceEncoder and getBaseXResliceDecoder functions are also available.

getUtf8Codec

Encodes and decodes UTF-8 strings.

const  = ();
const  = .("hello"); // 0x68656c6c6f
const  = .(); // "hello"

This codec does not enforce a size boundary. It will encode and decode all bytes necessary to represent the string. To add size constraints to your codec, you may use utility functions such as fixCodecSize, addCodecSizePrefix or addCodecSentinel.

((), 4).("hello");
// 0x68656c6c (truncated to 4 bytes)
 
((), ()).("hello");
// 0x0500000068656c6c6f
//   |       └-- The 5 bytes of content.
//   └-- 4-byte prefix telling us to read 5 bytes.
 
((), new ([0xff, 0xff])).("hello");
// 0x68656c6c6fffff
//   |         └-- The sentinel signaling the end of the content.
//   └-- The 5 bytes of content.

getUtf8Encoder and getUtf8Decoder functions are also available.

Data structures listing

getArrayCodec

The getArrayCodec function accepts any codec of type T and returns a codec of type Array<T>.

const  = (());
const  = .([1, 2, 3]); // 0x03000000010203
const  = .(); // [1, 2, 3]

By default, the size of the array is stored as a u32 prefix before encoding the items.

(()).([1, 2, 3]);
// 0x03000000010203
//   |       └-- 3 items of 1 byte each.
//   └-- 4-byte prefix telling us to read 3 items.

However, you may use the size option to configure this behaviour. It can be one of the following three strategies:

  • Codec<number>: When a number codec is provided, that codec will be used to encode and decode the size prefix.
  • number: When a number is provided, the codec will expect a fixed number of items in the array. An error will be thrown when trying to encode an array of a different length.
  • "remainder": When the string "remainder" is passed as a size, the codec will use the remainder of the bytes to encode/decode its items. This means the size is not stored or known in advance but simply inferred from the rest of the buffer. For instance, if we have an array of u16 numbers and 10 bytes remaining, we know there are 5 items in this array.
((), { : () }).([1, 2, 3]);
// 0x0300010203
//   |   └-- 3 items of 1 byte each.
//   └-- 2-byte prefix telling us to read 3 items.
 
((), { : 3 }).([1, 2, 3]);
// 0x010203
//   └-- 3 items of 1 byte each. There must always be 3 items in the array.
 
((), { : "remainder" }).([1, 2, 3]);
// 0x010203
//   └-- 3 items of 1 byte each. The size is inferred from the remainder of the bytes.

getArrayEncoder and getArrayDecoder functions are also available.

getBitArrayCodec

The getBitArrayCodec function returns a codec that encodes and decodes an array of booleans such that each boolean is represented by a single bit. It requires the size of the codec in bytes and an optional backward flag that can be used to reverse the order of the bits.

const  = [true, false, true, false, true, false, true, false];
 
(1).();
// 0xaa or 0b10101010
 
(1, { : true }).();
// 0x55 or 0b01010101

getBitArrayEncoder and getBitArrayDecoder functions are also available.

getBooleanCodec

The getBooleanCodec function returns a Codec<boolean> that stores the boolean as 0 or 1 using a u8 number by default.

const  = ();
const  = .(true); // 0x01
const  = .(); // true

You may configure that behaviour by providing an explicit number codec as the size option of the getBooleanCodec function. That number codec will then be used to encode and decode the values 0 and 1 accordingly.

({ : () }).(false); // 0x0000
({ : () }).(true); // 0x0100
 
({ : () }).(false); // 0x00000000
({ : () }).(true); // 0x01000000

getBooleanEncoder and getBooleanDecoder functions are also available.

getBytesCodec

The getBytesCodec function returns a Codec<Uint8Array> meaning it converts Uint8Arrays to and from… Uint8Arrays! Whilst this might seem a bit useless, it can be useful when composed into other codecs. For example, you could use it in a struct codec to say that a particular field should be left unserialised.

const  = ();
const  = .(new ([42])); // 0x2a
const  = .(); // 0x2a

The getBytesCodec function will encode and decode Uint8Arrays using as many bytes as necessary. If you'd like to restrict the number of bytes used by this codec, you may combine it with utilities such as fixCodecSize, addCodecSizePrefix or addCodecSentinel.

const  = new ([42, 43]); // 0x2a2b
 
((), 4).();
// 0x2a2b0000 (padded to 4 bytes)
 
((), ()).();
// 0x020000002a2b
//   |       └-- The 2 bytes of content.
//   └-- 4-byte prefix telling us to read 2 bytes.
 
((), new ([0xff, 0xff])).();
// 0x2a2bffff
//   |   └-- The sentinel signaling the end of the content.
//   └-- The 2 bytes of content.

getBytesEncoder and getBytesDecoder functions are also available.

getConstantCodec

The getConstantCodec function accepts any Uint8Array and returns a Codec<void>. When encoding, it will set the provided Uint8Array as-is. When decoding, it will assert that the next bytes contain the provided Uint8Array and move the offset forward.

const  = (new ([1, 2, 3]));
 
.(); // 0x010203
.(new ([1, 2, 3])); // undefined
.(new ([1, 2, 4])); // Throws an error.

getConstantEncoder and getConstantDecoder functions are also available.

getDiscriminatedUnionCodec

In Rust, enums are powerful data types whose variants can be one of the following:

  • An empty variant — e.g. enum Message { Quit }.
  • A tuple variant — e.g. enum Message { Write(String) }.
  • A struct variant — e.g. enum Message { Move { x: i32, y: i32 } }.

Whilst we do not have such powerful enums in JavaScript, we can emulate them in TypeScript using a union of objects such that each object is differentiated by a specific field. We call this a discriminated union.

We use a special field named __kind to distinguish between the different variants of a discriminated union. Additionally, since all variants are objects, we can use a fields property to wrap the array of tuple variants. Here is an example.

type  =
  | { : "quit" } // Empty variant.
  | { : "write"; : [string] } // Tuple variant.
  | { : "move"; : number; : number }; // Struct variant.

The getDiscriminatedUnionCodec function helps us encode and decode these discriminated unions.

It requires the discriminator and codec of each variant as a first argument. Similarly to the getStructCodec, these are defined as an array of variant tuples where the first item is the discriminator of the variant and the second item is its codec. Since empty variants do not have data to encode, they simply use the getUnitCodec which does nothing.

Here is how we can create a discriminated union codec for our previous example.

const  = ([
  // Empty variant.
  ["quit", ()],
 
  // Tuple variant.
  ["write", ([["fields", ([((), ())])]])],
 
  // Struct variant.
  [
    "move",
    ([
      ["x", ()],
      ["y", ()],
    ]),
  ],
]);

And here's how we can use such a codec to encode discriminated unions. Notice that by default, they use a u8 number prefix to distinguish between the different types of variants.

.({ : "quit" });
// 0x00
//   └-- 1-byte discriminator (Index 0 — the "quit" variant).
 
.({ : "write", : ["Hi"] });
// 0x01020000004869
//   | |       └-- utf8 string content ("Hi").
//   | └-- u32 string prefix (2 characters).
//   └-- 1-byte discriminator (Index 1 — the "write" variant).
 
.({ : "move", : 5, : 6 });
// 0x020500000006000000
//   | |       └-- Field y (6).
//   | └-- Field x (5).
//   └-- 1-byte discriminator (Index 2 — the "move" variant).

However, you may provide a number codec as the size option of the getDiscriminatedUnionCodec function to customise that behaviour.

const  = (
  [
    ["quit", ],
    ["write", ],
    ["move", ],
  ],
  { : () },
);
 
.({ : "quit" });
// 0x00000000
//   └------┘ 4-byte discriminator (Index 0).
 
.({ : "write", : ["Hi"] });
// 0x01000000020000004869
//   └------┘ 4-byte discriminator (Index 1).
 
.({ : "move", : 5, : 6 });
// 0x020000000500000006000000
//   └------┘ 4-byte discriminator (Index 2).

You may also customize the discriminator property — which defaults to __kind — by providing the desired property name as the discriminator option like so:

const  = (
  [
    ["quit", ],
    ["write", ],
    ["move", ],
  ],
  { : "message" },
);
 
.({ : "quit" });
.({ : "write", : ["Hi"] });
.({ : "move", : 5, : 6 });

Note that, the discriminator value of a variant may be any scalar value — such as number, bigint, boolean, a JavaScript enum, etc. For instance, the following is also valid:

enum  {
  ,
  ,
  ,
}
const  = ([
  [., ],
  [., ],
  [., ],
]);
 
.({ : . });
.({ : ., : ["Hi"] });
.({ : ., : 5, : 6 });

getDiscriminatedUnionEncoder and getDiscriminatedUnionDecoder functions are also available.

getEnumCodec

The getEnumCodec function accepts a JavaScript enum constructor and returns a codec for encoding and decoding values of that enum.

enum  {
  ,
  ,
}
 
const  = ();
const  = .(.); // 0x00
const  = .(); // Direction.Left

When encoding an enum, you may either provide the value of the enum variant — e.g. Direction.Left — or its key — e.g. 'Left'.

().(.); // 0x00
().(.); // 0x01
().("Left"); // 0x00
().("Right"); // 0x01

By default, a u8 number is being used to store the enum value. However, a number codec may be passed as the size option to configure that behaviour.

const  = (, { : () });
.(.); // 0x00000000
.(.); // 0x01000000

This function also works with lexical enums — e.g. enum Direction { Left = '←' } — explicit numerical enums — e.g. enum Speed { Left = 50 } — and hybrid enums with a mix of both.

enum  {
  ,
   = 5,
  ,
   = "nine",
}
 
().(.); // 0x00
().(.); // 0x01
().(.); // 0x02
().(.); // 0x03
().("One"); // 0x00
().("Five"); // 0x01
().("Six"); // 0x02
().("Nine"); // 0x03

Notice how, by default, the index of the enum variant is used to encode the value of the enum. For instance, in the example above, Numbers.Five is encoded as 0x01 even though its value is 5. This is also true for lexical enums.

However, when dealing with numerical enums that have explicit values, you may use the useValuesAsDiscriminators option to encode the value of the enum variant instead of its index.

enum  {
  ,
   = 5,
  ,
   = 9,
}
 
const  = (, { : true });
.(.); // 0x00
.(.); // 0x05
.(.); // 0x06
.(.); // 0x09
.("One"); // 0x00
.("Five"); // 0x05
.("Six"); // 0x06
.("Nine"); // 0x09

Note that when using the useValuesAsDiscriminators option on an enum that contains a lexical value, an error will be thrown.

enum  {
  ,
   = "two",
}
(, { : true }); // Throws an error.

getEnumEncoder and getEnumDecoder functions are also available.

getHiddenPrefixCodec

The getHiddenPrefixCodec function allow us to prepend a list of hidden Codec<void> to a given codec.

When encoding, the hidden codecs will be encoded before the main codec and the offset will be moved accordingly. When decoding, the hidden codecs will be decoded but only the result of the main codec will be returned. This is particularly helpful when creating data structures that include constant values that should not be included in the final type.

const  = ((), [
  (new ([1, 2, 3])),
  (new ([4, 5, 6])),
]);
 
.(42);
// 0x0102030405062a00
//   |     |     └-- Our main u16 codec (value = 42).
//   |     └-- Our second hidden prefix codec.
//   └-- Our first hidden prefix codec.
 
.(new ([1, 2, 3, 4, 5, 6, 42, 0])); // 42

getHiddenPrefixEncoder and getHiddenPrefixDecoder functions are also available.

getHiddenSuffixCodec

The getHiddenSuffixCodec function allow us to append a list of hidden Codec<void> to a given codec.

When encoding, the hidden codecs will be encoded after the main codec and the offset will be moved accordingly. When decoding, the hidden codecs will be decoded but only the result of the main codec will be returned. This is particularly helpful when creating data structures that include constant values that should not be included in the final type.

const  = ((), [
  (new ([1, 2, 3])),
  (new ([4, 5, 6])),
]);
 
.(42);
// 0x2a00010203040506
//   |   |     └-- Our second hidden suffix codec.
//   |   └-- Our first hidden suffix codec.
//   └-- Our main u16 codec (value = 42).
 
.(new ([42, 0, 1, 2, 3, 4, 5, 6])); // 42

getHiddenSuffixEncoder and getHiddenSuffixDecoder functions are also available.

getLiteralUnionCodec

The getLiteralUnionCodec function accepts an array of literal values — such as string, number, boolean, etc. — and returns a codec that encodes and decodes such values by using their index in the array. It uses TypeScript unions to represent all the possible values.

const  = (["left", "right", "up", "down"]);
 satisfies <"left" | "right" | "up" | "down">;
 
const  = .("left"); // 0x00
const  = .(); // 'left'

It uses a u8 number by default to store the index of the value. However, you may provide a number codec as the size option of the getLiteralUnionCodec function to customise that behaviour.

const  = (["left", "right", "up", "down"], {
  : (),
});
 
.("left"); // 0x00000000
.("right"); // 0x01000000
.("up"); // 0x02000000
.("down"); // 0x03000000

getLiteralUnionEncoder and getLiteralUnionDecoder functions are also available.

getMapCodec

The getMapCodec function accepts two codecs of type K and V and returns a codec of type Map<K, V>.

const  = ((), 8);
const  = ();
const  = (, );
 
const  = .(new ([["alice", 42]])); // 0x01000000616c6963650000002a
const  = .(); // new Map([["alice", 42]])

Each entry (key/value pair) is encoded one after the other with the key first and the value next. By default, the size of the map is stored as a u32 prefix before encoding the entries.

const  = (((), 8), ());
const  = new <string, number>();
.("alice", 42);
.("bob", 5);
 
.();
// 0x02000000616c6963650000002a626f62000000000005
//   |       |               | |               └-- 2nd entry value (5).
//   |       |               | └-- 2nd entry key ("bob").
//   |       |               └-- 1st entry value (42).
//   |       └-- 1st entry key ("alice").
//   └-- 4-byte prefix telling us to read 2 map entries.

However, you may use the size option to configure this behaviour. It can be one of the following three strategies:

  • Codec<number>: When a number codec is provided, that codec will be used to encode and decode the size prefix.
  • number: When a number is provided, the codec will expect a fixed number of entries in the map. An error will be thrown when trying to encode a map of a different length.
  • "remainder": When the string "remainder" is passed as a size, the codec will use the remainder of the bytes to encode/decode its entries. This means the size is not stored or known in advance but simply inferred from the rest of the buffer. For instance, if we have a map of u16 numbers and 10 bytes remaining, we know there are 5 entries in this map.
const  = ((), 8);
const  = ();
 
const  = new <string, number>();
.("alice", 42);
.("bob", 5);
 
(, , { : () }).();
// 0x0200616c6963650000002a626f62000000000005
//   |   |                 └-- Second entry.
//   |   └-- First entry.
//   └-- 2-byte prefix telling us to read 2 entries.
 
(, , { : 3 }).();
// 0x616c6963650000002a626f62000000000005
//   |                 └-- Second entry.
//   └-- First entry.
// There must always be 2 entries in the map.
 
(, , { : "remainder" }).();
// 0x616c6963650000002a626f62000000000005
//   |                 └-- Second entry.
//   └-- First entry.
// The size is inferred from the remainder of the bytes.

getMapEncoder and getMapDecoder functions are also available.

getNullableCodec

The getNullableCodec function accepts a codec of type T and returns a codec of type T | null. It stores whether or not the item exists as a boolean prefix using a u8 by default.

const  = ((), ());
 
().("Hi");
// 0x01020000004869
//   | |       └-- utf8 string content ("Hi").
//   | └-- u32 string prefix (2 characters).
//   └-- 1-byte prefix (true — The item exists).
 
().(null);
// 0x00
//   └-- 1-byte prefix (false — The item is null).

You may provide a number codec as the prefix option of the getNullableCodec function to configure how to store the boolean prefix.

const  = (, {
  : (),
});
 
.("Hi");
// 0x01000000020000004869
//   └------┘ 4-byte prefix (true).
 
.(null);
// 0x00000000
//   └------┘ 4-byte prefix (false).

Additionally, if the item is a FixedSizeCodec, you may set the noneValue option to "zeroes" to also make the returned nullable codec a FixedSizeCodec. To do so, it will pad null values with zeroes to match the length of existing values.

const  = (
  ((), 8), // Only works with fixed-size items.
  { : "zeroes" },
);
 
.("Hi");
// 0x014869000000000000
//   | └-- 8-byte utf8 string content ("Hi").
//   └-- 1-byte prefix (true — The item exists).
 
.(null);
// 0x000000000000000000
//   | └-- 8-byte of padding to make a fixed-size codec.
//   └-- 1-byte prefix (false — The item is null).

The noneValue option can also be set to an explicit byte array to use as the padding for null values. Note that, in this case, the returned codec will not be a FixedSizeCodec as the byte array representing null values may be of any length.

const  = ((), {
  : new ([255]), // 0xff means null.
});
 
.("Hi");
// 0x014869
//   | └-- 2-byte utf8 string content ("Hi").
//   └-- 1-byte prefix (true — The item exists).
 
.(null);
// 0x00ff
//   | └-- 1-byte representing null (0xff).
//   └-- 1-byte prefix (false — The item is null).

The prefix option of the getNullableCodec function can also be set to null, meaning no prefix will be used to determine whether the item exists. In this case, the codec will rely on the noneValue option to determine whether the item is null.

const  = ((), {
  : "zeroes", // 0x0000 means null.
  : null,
});
.(42); // 0x2a00
.(null); // 0x0000
 
const  = ((), {
  : new ([255]), // 0xff means null.
  : null,
});
.(42); // 0x2a00
.(null); // 0xff

Note that if prefix is set to null and no noneValue is provided, the codec assumes that the item exists if and only if some remaining bytes are available to decode. This could be useful to describe data structures that may or may not have additional data to the end of the buffer.

const codec = getNullableCodec(getU16Codec(), { prefix: null });
codec.encode(42); // 0x2a00
codec.encode(null); // Encodes nothing.
codec.decode(new Uint8Array([42, 0])); // 42
codec.decode(new Uint8Array([])); // null

To recap, here are all the possible configurations of the getNullableCodec function, using a u16 codec as an example.

encode(42) / encode(null)No noneValue (default)noneValue: "zeroes"Custom noneValue (0xff)
u8 prefix (default)0x012a00 / 0x000x012a00 / 0x0000000x012a00 / 0x00ff
Custom prefix (u16)0x01002a00 / 0x00000x01002a00 / 0x000000000x01002a00 / 0x0000ff
No prefix0x2a00 / 0x0x2a00 / 0x00000x2a00 / 0xff

Note that you might be interested in the Rust-like alternative version of nullable codecs, available as the getOptionCodec function.

getNullableEncoder and getNullableDecoder functions are also available.

getOptionCodec

The getOptionCodec function accepts a codec of type T and returns a codec of type Option<T> — as defined in the @solana/options package. Note that, when encoding, T or null may also be provided directly as input and will be interpreted as Some(T) or None respectively. However, when decoding, the output will always be an Option<T> type.

It stores whether or not the item exists as a boolean prefix using a u8 by default.

const  = ((), ());
 
().("Hi");
().(("Hi"));
// 0x01020000004869
//   | |       └-- utf8 string content ("Hi").
//   | └-- u32 string prefix (2 characters).
//   └-- 1-byte prefix (Some).
 
().(null);
().(());
// 0x00
//   └-- 1-byte prefix (None).

You may provide a number codec as the prefix option of the getOptionCodec function to configure how to store the boolean prefix.

const  = (, {
  : (),
});
 
.(("Hi"));
// 0x01000000020000004869
//   └------┘ 4-byte prefix (Some).
 
.(());
// 0x00000000
//   └------┘ 4-byte prefix (None).

Additionally, if the item is a FixedSizeCodec, you may set the noneValue option to "zeroes" to also make the returned Option codec a FixedSizeCodec. To do so, it will pad None values with zeroes to match the length of existing values.

const  = (
  ((), 8), // Only works with fixed-size items.
  { : "zeroes" },
);
 
.(("Hi"));
// 0x014869000000000000
//   | └-- 8-byte utf8 string content ("Hi").
//   └-- 1-byte prefix (Some).
 
.(());
// 0x000000000000000000
//   | └-- 8-byte of padding to make a fixed-size codec.
//   └-- 1-byte prefix (None).

The noneValue option can also be set to an explicit byte array to use as the padding for None values. Note that, in this case, the returned codec will not be a FixedSizeCodec as the byte array representing None values may be of any length.

const  = ((), {
  : new ([255]), // 0xff means None.
});
 
.(("Hi"));
// 0x014869
//   | └-- 2-byte utf8 string content ("Hi").
//   └-- 1-byte prefix (Some).
 
.(());
// 0x00ff
//   | └-- 1-byte representing None (0xff).
//   └-- 1-byte prefix (None).

The prefix option of the getOptionCodec function can also be set to null, meaning no prefix will be used to determine whether the item exists. In this case, the codec will rely on the noneValue option to determine whether the item is None.

const  = ((), {
  : "zeroes", // 0x0000 means None.
  : null,
});
.((42)); // 0x2a00
.(()); // 0x0000
 
const  = ((), {
  : new ([255]), // 0xff means None.
  : null,
});
.((42)); // 0x2a00
.(()); // 0xff

Note that if prefix is set to null and no noneValue is provided, the codec assume that the item exists if and only if some remaining bytes are available to decode. This could be useful to describe data structures that may or may not have additional data to the end of the buffer.

const  = ((), { : null });
.((42)); // 0x2a00
.(()); // Encodes nothing.
.(new ([42, 0])); // some(42)
.(new ([])); // none()

To recap, here are all the possible configurations of the getOptionCodec function, using a u16 codec as an example.

encode(some(42)) / encode(none())No noneValue (default)noneValue: "zeroes"Custom noneValue (0xff)
u8 prefix (default)0x012a00 / 0x000x012a00 / 0x0000000x012a00 / 0x00ff
Custom prefix (u16)0x01002a00 / 0x00000x01002a00 / 0x000000000x01002a00 / 0x0000ff
No prefix0x2a00 / 0x0x2a00 / 0x00000x2a00 / 0xff

getOptionEncoder and getOptionDecoder functions are also available.

getSetCodec

The getSetCodec function accepts any codec of type T and returns a codec of type Set<T>.

const  = (());
const  = .(new ([1, 2, 3])); // 0x03000000010203
const  = .(); // new Set([1, 2, 3])

By default, the size of the set is stored as a u32 prefix before encoding the items.

(()).(new ([1, 2, 3]));
// 0x03000000010203
//   |       └-- 3 items of 1 byte each.
//   └-- 4-byte prefix telling us to read 3 items.

However, you may use the size option to configure this behaviour. It can be one of the following three strategies:

  • Codec<number>: When a number codec is provided, that codec will be used to encode and decode the size prefix.
  • number: When a number is provided, the codec will expect a fixed number of items in the set. An error will be thrown when trying to encode a set of a different length.
  • "remainder": When the string "remainder" is passed as a size, the codec will use the remainder of the bytes to encode/decode its items. This means the size is not stored or known in advance but simply inferred from the rest of the buffer. For instance, if we have a set of u16 numbers and 10 bytes remaining, we know there are 5 items in this set.
((), { : () }).(new ([1, 2, 3]));
// 0x0300010203
//   |   └-- 3 items of 1 byte each.
//   └-- 2-byte prefix telling us to read 3 items.
 
((), { : 3 }).(new ([1, 2, 3]));
// 0x010203
//   └-- 3 items of 1 byte each. There must always be 3 items in the set.
 
((), { : "remainder" }).(new ([1, 2, 3]));
// 0x010203
//   └-- 3 items of 1 byte each. The size is inferred from the remainder of the bytes.

getSetEncoder and getSetDecoder functions are also available.

getStructCodec

The getStructCodec function accepts any number of field codecs and returns a codec for an object containing all these fields. Each provided field is an array such that the first item is the name of the field and the second item is the codec used to encode and decode that field type.

type  = { : string; : number };
const : <> = ([
  ["name", ((), ())],
  ["age", ()],
]);
 
const  = .({ : "alice", : 42 });
// 0x05000000616c6963652a
//   |                 └-- Age field.
//   └-- Name field.
 
const  = .();
// { name: "alice", age: 42 }

getStructEncoder and getStructDecoder functions are also available.

getTupleCodec

The getTupleCodec function accepts any number of codecs — T, U, V, etc. — and returns a tuple codec of type [T, U, V, …] such that each item is in the order of the provided codecs.

const  = ([((), ()), (), ()]);
 
const  = .(["alice", 42, 123]);
// 0x05000000616c6963652a7b00000000000000
//   |                 | └-- 3rd item (123).
//   |                 └-- 2nd item (42).
//   └-- 1st item ("alice").
 
const  = .();
// ["alice", 42, 123]

getTupleEncoder and getTupleDecoder functions are also available.

getUnionCodec

The getUnionCodec is a lower-lever codec helper that can be used to encode/decode any TypeScript union.

It accepts the following arguments:

  • An array of codecs, each defining a variant of the union.
  • A getIndexFromValue function which, given a value of the union, returns the index of the codec that should be used to encode that value.
  • A getIndexFromBytes function which, given the byte array to decode at a given offset, returns the index of the codec that should be used to decode the next bytes.
const : <number | boolean> = (
  [(), ()],
  () => (typeof  === "number" ? 0 : 1),
  (, ) => (.(). > 1 ? 0 : 1),
);
 
.(42); // 0x2a00
.(true); // 0x01

getUnionEncoder and getUnionDecoder functions are also available.

getUnitCodec

The getUnitCodec function returns a Codec<void> that encodes undefined into an empty Uint8Array and returns undefined without consuming any bytes when decoding. This is more of a low-level codec that can be used internally by other codecs. For instance, this is how getDiscriminatedUnionCodec describes the codecs of empty variants.

().(); // Empty Uint8Array
().(); // undefined

getUnitEncoder and getUnitDecoder functions are also available.

On this page