SNIP 23: Mixed-case checksum address encoding

ClementWalter · October 24, 2024, 1:34pm

I’m proposing to enforce the address representation to be 32 bytes long, checksumed as in the Ethereum ecosystem.

The checksum mechanism can be the same.

See also ERCs/ERCS/erc-55.md at master · ethereum/ERCs · GitHub

Proposed implementation

from starkware.starknet.public.abi import starknet_keccak

def checksum_encode(addr: int): # Takes a 32-byte binary address as input
    addr_bytes = int.to_bytes(addr, 32, "big")
    hex_addr = addr_bytes.hex()
    checksummed_buffer = ""

    # Treat the hex address as ascii/utf-8 for hashing
    hashed_address = f"{starknet_keccak(addr_bytes):064x}"

    # Iterate over each character in the hex address
    for nibble_index, character in enumerate(hex_addr):

        if character in "0123456789":
            # We can't upper-case the decimal digits
            checksummed_buffer += character
        elif character in "abcdef":
            # Check if the corresponding hex digit (nibble) in the hash is 8 or higher
            hashed_address_nibble = int(hashed_address[nibble_index], 16)
            if hashed_address_nibble > 7:
                checksummed_buffer += character.upper()
            else:
                checksummed_buffer += character
        else:
            raise eth_utils.ValidationError(
                f"Unrecognized hex character {character!r} at position {nibble_index}"
            )

    return "0x" + checksummed_buffer

checksum_encode(0x49d36570d4e46f48e99674bd3fcc84644ddd6b96f7c741b1562b82f9e004dc7)
# 0x049D36570D4e46f48e99674bd3fcc84644DdD6b96F7C741B1562B82f9e004dC7

checksum_encode(0xbd3fcc84644ddd6b96f7c741b1562b82f9e004dc7)
# 0x00000000000000000000000Bd3fcc84644dDd6B96F7C741b1562B82f9e004dc7

estheroche · October 24, 2024, 3:19pm

Awesome the checksum mechanism will be amazing since its same Ethereum

milan · October 25, 2024, 12:31pm

100% in favor of padding addresses.

It’s a huge PITA because every tool treats it differently (SDKs, Apibara, block explorers, etc.) and while 0x0f00 and 0xf00 are numerically the same, since they pretty much have to be stored as strings for interop, this introduces hard to spot bugs.

ClementWalter · October 25, 2024, 4:45pm

See the full proposal to be merged™ soon™

robertkodra · October 31, 2024, 11:38am

I’m also in favour of padding addresses for the sake of consistency as it’s always different with every dev tool.

estheroche · November 1, 2024, 1:08am

padding is a neccessary tool for dev, especially when it comes to issues of slot

haroldtrk · November 3, 2024, 11:05am

from starkware.starknet.public.abi import starknet_keccak
from typing import Union

class AddressValidationError(Exception):
“”“Custom exception for address validation errors.”“”
pass

def checksum_encode(addr: Union[int, str]) → str:
“”"
Generate a checksummed address following ERC-55 style convention for Starknet addresses.

Args:
    addr (Union[int, str]): Address as integer or hex string
    
Returns:
    str: Checksummed address string
    
Raises:
    AddressValidationError: If address is invalid
"""
try:
    # Convert string input to int if needed
    if isinstance(addr, str):
        addr = int(addr.replace("0x", ""), 16)
        
    # Validate address size
    if addr < 0:
        raise AddressValidationError("Address cannot be negative")
    if addr.bit_length() > 256:
        raise AddressValidationError("Address exceeds 32 bytes")
        
    # Convert to 32-byte representation
    addr_bytes = int.to_bytes(addr, 32, "big")
    hex_addr = addr_bytes.hex()
    
    # Calculate hash using starknet_keccak
    hashed_address = f"{starknet_keccak(addr_bytes):064x}"
    
    checksummed_buffer = []
    
    # Process each character
    for nibble_index, character in enumerate(hex_addr):
        if character in "0123456789":
            checksummed_buffer.append(character)
        elif character in "abcdef":
            # Check corresponding hash nibble
            hash_nibble = int(hashed_address[nibble_index], 16)
            checksummed_buffer.append(
                character.upper() if hash_nibble > 7 else character
            )
        else:
            raise AddressValidationError(
                f"Invalid hex character '{character}' at position {nibble_index}"
            )
            
    return "0x" + "".join(checksummed_buffer)
    
except ValueError as e:
    raise AddressValidationError(f"Invalid address format: {str(e)}")

def validate_checksum(address: str) → bool:
“”"
Validate if an address string has the correct checksum.

Args:
    address (str): Address string to validate
    
Returns:
    bool: True if checksum is valid, False otherwise
"""
try:
    if not address.startswith("0x"):
        return False
        
    addr_int = int(address.replace("0x", ""), 16)
    return checksum_encode(addr_int) == address
except (AddressValidationError, ValueError):
    return False

eiki · November 5, 2024, 12:52pm

Could we get buy-in from SDKs, as well as Wallets?
We’d like an answer and plan to include (and possible alerts and warnings on non-backward compatibility) from:

Braavos
Argent
Keplr
starknet-react
starknetJS
get-starknet
ledger
CEXes
starknet-rs
starkscan
voyager

etc.
whoever makes sense

0xjanek · December 9, 2024, 11:16pm

What’s wrong with the current checksum approach?

github.com

starknet-io/starknet.js/blob/develop/src/utils/address.ts#L66


      
          * @param {BigNumberish} address
          * @returns {string} Hex string : 0x followed by 64 characters. Mix of uppercase and lowercase
          * @example
          * ```typescript
          * const address = "0x90591d9fa3efc87067d95a643f8455e0b8190eb8cb7bfd39e4fb7571fdf";
          * const result = getChecksumAddress(address);
          * // result = "0x0000090591D9fA3EfC87067d95a643f8455E0b8190eb8Cb7bFd39e4fb7571fDF"
          * ```
          */
          // from https://github.com/ethers-io/ethers.js/blob/fc1e006575d59792fa97b4efb9ea2f8cca1944cf/packages/address/src.ts/index.ts#L12
          export function getChecksumAddress(address: BigNumberish): string {
           const chars = removeHexPrefix(validateAndParseAddress(address)).toLowerCase().split('');
           const hex = removeHexPrefix(keccakBn(address));
           const hashed = hexToBytes(hex.padStart(64, '0'));
          
           for (let i = 0; i < chars.length; i += 2) {
             if (hashed[i >> 1] >> 4 >= 8) {
               chars[i] = chars[i].toUpperCase();
             }
             if ((hashed[i >> 1] & 0x0f) >= 8) {
               chars[i + 1] = chars[i + 1].toUpperCase();

ClementWalter · December 10, 2024, 8:24am

most probably very similar as in the SNIP I’ve taken the EIP implementation and the snippet from starknet js quotes ethers.js

This is not a standard in Starknet though, which is the main purpose of this SNIP

0xjanek · December 10, 2024, 1:37pm

Okay, if it’s just about formalizing, can we make sure the SNIP matches the implementation?
Migrating to a new implementation would be painful for everyone

OdedSW · December 24, 2024, 11:04am

Hey @ClementWalter, I want to try and push this SNIP forward. Have you taken a look at the BigNumberish type in Starknet.js and check if the current SNIP matches the implementation?

ClementWalter · December 24, 2024, 11:55am

Not really yet. Let me summarize everything after Christmas, including feedback regarding starknet js existing implementation, and move forward beginning of Jan

ClementWalter · December 27, 2024, 10:21am

So summarizing the inputs from @haroldtrk and @0xjanek, here is a python implementation matching the already existing starknet-js one.

The key difference between the two implementation (proposed and refactored python VS starknet-js) is that starknet-js is hashing the minimal length input address (so a message of possibly less than 32 bytes) while I was proposing to hash the input address interpreted as a 32 bytes long message.

See

# Calculate hash using starknet_keccak
addr_bytes = int.to_bytes(addr, math.ceil(addr.bit_length() / 8), "big")
hashed_address = f"{starknet_keccak(addr_bytes):064x}"

instead of

# Calculate hash using starknet_keccak
addr_bytes = int.to_bytes(addr, 32, "big")
hashed_address = f"{starknet_keccak(addr_bytes):064x}"

I don’t see any issue adopting the existing starknet-js implementation, and consequently the final proposition consistent with the existing starknet-js is

import math
from typing import Union

from starkware.starknet.public.abi import starknet_keccak


class AddressValidationError(Exception):
    """Custom exception for address validation errors."""
    pass

def checksum_encode(addr: Union[int, str]) -> str:
    """
    Generate a checksummed address following ERC-55 style convention for Starknet addresses.
    Args:
        addr (Union[int, str]): Address as integer or hex string

    Returns:
        str: Checksummed address string

    Raises:
        AddressValidationError: If address is invalid
    """
    try:
        # Convert string input to int if needed
        if isinstance(addr, str):
            addr = int(addr.replace("0x", ""), 16)

        # Validate address size
        if addr < 0:
            raise AddressValidationError("Address cannot be negative")
        if addr.bit_length() > 256:
            raise AddressValidationError("Address exceeds 32 bytes")

        # Convert to 32-byte representation
        hex_addr = int.to_bytes(addr, 32, "big").hex()

        # Calculate hash using starknet_keccak
        addr_bytes = int.to_bytes(addr, math.ceil(addr.bit_length() / 8), "big")
        hashed_address = f"{starknet_keccak(addr_bytes):064x}"

        checksummed_buffer = []

        # Process each character
        for nibble_index, character in enumerate(hex_addr):
            if character in "0123456789":
                checksummed_buffer.append(character)
            elif character in "abcdef":
                # Check corresponding hash nibble
                hash_nibble = int(hashed_address[nibble_index], 16)
                checksummed_buffer.append(
                    character.upper() if hash_nibble > 7 else character
                )
            else:
                raise AddressValidationError(
                    f"Invalid hex character '{character}' at position {nibble_index}"
                )

        return "0x" + "".join(checksummed_buffer)

    except ValueError as e:
        raise AddressValidationError(f"Invalid address format: {str(e)}")

def validate_checksum(address: str) -> bool:
    """
    Validate if an address string has the correct checksum.

    Args:
        address (str): Address string to validate

    Returns:
        bool: True if checksum is valid, False otherwise
    """
    try:
        if not address.startswith("0x"):
            return False

        addr_int = int(address.replace("0x", ""), 16)
        return checksum_encode(addr_int) == address
    except (AddressValidationError, ValueError):
        return False

mubarak23 · January 6, 2025, 1:16pm

Encoding addresses in a typo-resistant format reduces the likelihood of funds being lost or misrouted due to human errors in address entry. This is a practical improvement that enhances the user experience and trust in the ecosystem.

Personally i have had issues with typos, thankfully it is not a huge fund

Immanuelolivia1 · January 8, 2025, 1:08am

The checksum mechanism will be fantastic, and fully supports padding addresses.

Topic		Replies	Views
QRCode StarkNet Transaction Encoding SNIPs	2	668	April 12, 2024
[SNIP] Off-chain signatures (à la EIP712) SNIPs	14	2904	December 6, 2023
Starknet Standard Interface Detection SNIPs	31	4384	August 29, 2023
Critical Issue with Contract Address Handling on Starknet 🙏 Help and Support	4	224	February 16, 2025
EIP712 authorization in Cairo 2 🙏 Help and Support cairo	1	62	August 9, 2024

SNIP 23: Mixed-case checksum address encoding

Related topics