
Schema & File Structure

R2 File Structure

We're hosting indexed.xyz on Cloudflare's R2. R2 exposes an S3-compatible API, so you can download these files with the Goldsky CLI (the quickest way to get started), the AWS CLI, or rclone.

The prefix structure for data in R2 is:

s3://indexed-xyz-wnam/<chain>/(decoded|raw)/logs/v2.0.0/dt=<yyyy-MM-dd>

Right now these chains are supported. If you'd like to see other chains here, send us an email and we'll consider adding them.

You'll probably want the decoded files, as that's what this document describes.

The data is partitioned by day, using the dt=<yyyy-MM-dd> part of the prefix. Most tools let you leave that part off and download all data recursively, but to limit download time and local storage you may want to start with a smaller subset, such as a few days.
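To pull only a few days, it can help to generate the per-day prefixes up front and hand them to whichever tool you use. The following is a minimal sketch that follows the prefix template above; the function names are hypothetical, and the chain name "ethereum" is an assumption taken from the example file name further down this page.

```python
from datetime import date, timedelta

BUCKET = "s3://indexed-xyz-wnam"
VERSION = "v2.0.0"  # version segment from the prefix template above


def day_prefix(chain: str, kind: str, day: date) -> str:
    """Build the prefix for one chain, one kind (decoded or raw), and one day."""
    if kind not in ("decoded", "raw"):
        raise ValueError(f"kind must be 'decoded' or 'raw', got {kind!r}")
    return f"{BUCKET}/{chain}/{kind}/logs/{VERSION}/dt={day.isoformat()}"


def prefixes_between(chain: str, kind: str, start: date, end: date) -> list:
    """Return one prefix per day from start to end, inclusive."""
    span = (end - start).days
    return [day_prefix(chain, kind, start + timedelta(days=i)) for i in range(span + 1)]


# "ethereum" is an assumed chain name; substitute a supported chain.
week = prefixes_between("ethereum", "decoded", date(2021, 1, 1), date(2021, 1, 7))
for prefix in week:
    print(prefix)
```

Each printed prefix can then be passed as the source path to the download tool of your choice.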

Decoded Logs

The objects in R2 are Parquet files.

The Parquet file schema we're using is:

column_name        column_type
block_timestamp    BIGINT
address            VARCHAR
event_signature    VARCHAR
event_params       VARCHAR[]
block_number       BIGINT
block_hash         VARCHAR
log_index          BIGINT
transaction_hash   VARCHAR
transaction_index  BIGINT
data               VARCHAR
topics             VARCHAR
id                 VARCHAR
dt                 VARCHAR

Here’s an example from one of the files, queried using DuckDB:

SELECT * FROM 'ethereum_decoded_logs_v1.2.0_partition_key=00_dt=2021_part-3acd096e-279f-42aa-9c5b-b03397cf6f7f-24-134.parquet' LIMIT 1;
block_time         1609514571
address            0x056fd409e1d7a124bd7017459dfea2f387b6d5cd
event_signature    Transfer(address,address,uint256)
event_params       [0xd24400ae8BfEBb18cA49Be86258a3C749cf46853, 0xe2983860a018FB295ec08C3B63A487874E4d1469, 25000]
block_number       11569225
block_hash         0x8911437fe4d3dad8e887ab58dc4107595cb2541bc3d1fbb27ba6da9dfb12bd50
log_index          4
transaction_hash   0xc9e3cb4357516616abddc10f5e63795694e5dbd7bcfa88b757a9aab076263066
transaction_index  11
data               0x00000000000000000000000000000000000000000000000000000000000061a8
topics             0xddf252ad1be2c89b69c2b068fc378daa952ba7f163c4a11628f55a4df523b3ef,0x000000000000000000000000d24400ae8bfebb18ca49be86258a3c749cf46853,0x000000000000000000000000e2983860a018fb295ec08c3b63a487874e4d1469
id                 decoded_log_0x8911437fe4d3dad8e887ab58dc4107595cb2541bc3d1fbb27ba6da9dfb12bd50_4
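To show how the decoded and raw fields of that row relate, here is a small Python sketch that works on values copied from the example. It rests on a few assumptions that the page does not state outright: that block_time is a Unix timestamp in seconds, that topics is a single comma-separated string (as it appears in the example), and that the event follows the standard ERC-20 Transfer layout, with sender and recipient in topics 1 and 2 and the amount in data. The helper topic_to_address is hypothetical.

```python
from datetime import datetime, timezone

# Values copied from the example row above.
row = {
    "block_time": 1609514571,
    "event_params": [
        "0xd24400ae8BfEBb18cA49Be86258a3C749cf46853",
        "0xe2983860a018FB295ec08C3B63A487874E4d1469",
        "25000",
    ],
    "data": "0x00000000000000000000000000000000000000000000000000000000000061a8",
    "topics": (
        "0xddf252ad1be2c89b69c2b068fc378daa952ba7f163c4a11628f55a4df523b3ef,"
        "0x000000000000000000000000d24400ae8bfebb18ca49be86258a3c749cf46853,"
        "0x000000000000000000000000e2983860a018fb295ec08c3b63a487874e4d1469"
    ),
}


def topic_to_address(topic: str) -> str:
    """Take the last 20 bytes (40 hex characters) of a 32-byte topic as an address."""
    return "0x" + topic[-40:]


# Assumption: block_time is seconds since the Unix epoch.
mined_at = datetime.fromtimestamp(row["block_time"], tz=timezone.utc)

# The raw data field holds the uint256 amount as a hex string.
amount = int(row["data"], 16)

# Assumption: topics are joined with commas inside one VARCHAR.
topics = row["topics"].split(",")
sender = topic_to_address(topics[1])
recipient = topic_to_address(topics[2])

print(mined_at.date().isoformat(), sender, recipient, amount)
```

Under those assumptions, the amount decoded from data matches the third entry of event_params, and the two topic addresses match the first two entries once case is ignored.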

Some caveats to keep in mind:

  • We're still validating this dataset.
  • Although the prefix key we use is based on an all-lowercase address and an MD5 digest, addresses inside the Parquet files are not consistently cased across fields. In the example above, event_params holds mixed-case addresses while address and topics are lowercase.
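Because of the inconsistent casing noted above, it is safer to normalize addresses before comparing or joining on them. A minimal sketch, assuming 20-byte hex addresses with a 0x prefix; normalize_address is a hypothetical helper, not part of any indexed.xyz tooling.

```python
import string


def normalize_address(address: str) -> str:
    """Return the address lowercased, or raise ValueError if it is malformed."""
    candidate = address.strip().lower()
    body = candidate[2:]
    is_hex = all(ch in string.hexdigits for ch in body)
    if not candidate.startswith("0x") or len(body) != 40 or not is_hex:
        raise ValueError(f"not a 20-byte hex address: {address!r}")
    return candidate


# The mixed-case value comes from event_params in the example row.
param_sender = "0xd24400ae8BfEBb18cA49Be86258a3C749cf46853"

# A watchlist kept in normalized form, built from an upper-cased spelling.
watchlist = {normalize_address("0xD24400AE8BFEBB18CA49BE86258A3C749CF46853")}

print(normalize_address(param_sender) in watchlist)
```

Applying the same normalization on both sides of a comparison avoids missed matches between fields such as address and event_params.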

Raw Blocks

column_name        column_type
id                 VARCHAR
number             BIGINT
hash               VARCHAR
parent_hash        VARCHAR
nonce              VARCHAR
sha3_uncles        VARCHAR
logs_bloom         VARCHAR
transactions_root  VARCHAR
state_root         VARCHAR
receipts_root      VARCHAR
miner              VARCHAR
difficulty         DOUBLE
total_difficulty   DOUBLE
size               BIGINT
extra_data         VARCHAR
gas_limit          BIGINT
gas_used           BIGINT
timestamp          BIGINT
transaction_count  BIGINT
base_fee_per_gas   BIGINT
dt                 VARCHAR

Raw Transactions

column_name               column_type
id                        VARCHAR
hash                      VARCHAR
nonce                     BIGINT
block_hash                VARCHAR
block_number              BIGINT
transaction_index         BIGINT
from_address              VARCHAR
to_address                VARCHAR
value                     VARCHAR
gas                       VARCHAR
gas_price                 VARCHAR
input                     VARCHAR
max_fee_per_gas           VARCHAR
max_priority_fee_per_gas  VARCHAR
transaction_type          BIGINT
block_timestamp           BIGINT
dt                        VARCHAR

Raw Logs

column_name        column_type
block_timestamp    BIGINT
block_number       BIGINT
block_hash         VARCHAR
transaction_hash   VARCHAR
transaction_index  BIGINT
log_index          BIGINT
address            VARCHAR
data               VARCHAR
topics             VARCHAR
id                 VARCHAR
dt                 VARCHAR

Arweave Raw Blocks

column_name     column_type
id              VARCHAR
height          INTEGER
mined_at        TIMESTAMP
created_at      TIMESTAMP
previous_block  VARCHAR
mined_at_utc    BIGINT

Arweave Raw Transactions

column_name    column_type
id             VARCHAR
owner          VARCHAR
tags           VARCHAR
target         VARCHAR
quantity       VARCHAR
reward         VARCHAR
signature      VARCHAR
last_tx        VARCHAR
data_size      BIGINT
content_type   VARCHAR
format         INTEGER
created_at     TIMESTAMP
deleted_at     TIMESTAMP
height         INTEGER
owner_address  VARCHAR
data_root      VARCHAR
parent         VARCHAR

Sui Raw Checkpoints

column_name                          column_type
id                                   VARCHAR
checkpoint_digest                    VARCHAR
sequence_number                      BIGINT
epoch                                BIGINT
timestamp_ms                         BIGINT
previous_checkpoint_digest           VARCHAR
end_of_epoch                         BOOLEAN
total_gas_cost                       DOUBLE
computation_cost                     DOUBLE
storage_cost                         DOUBLE
storage_rebate                       DOUBLE
non_refundable_storage_fee           DOUBLE
total_transaction_blocks             DOUBLE
total_transactions                   DOUBLE
total_successful_transaction_blocks  DOUBLE
total_successful_transactions        DOUBLE
network_total_transaction            DOUBLE
validator_signature                  VARCHAR

Sui Raw Events

column_name         column_type
id                  VARCHAR
transaction_digest  VARCHAR
event_index         BIGINT
checkpoint          BIGINT
epoch               BIGINT
timestamp_ms        BIGINT
sender              VARCHAR
package             VARCHAR
module              VARCHAR
event_type          VARCHAR
bcs                 VARCHAR

Sui Raw Move Calls

column_name         column_type
id                  VARCHAR
transaction_digest  VARCHAR
checkpoint          BIGINT
epoch               BIGINT
timestamp_ms        BIGINT
package             VARCHAR
module              VARCHAR
function_           VARCHAR

Sui Raw Packages

column_name          column_type
package_id           VARCHAR
checkpoint           BIGINT
epoch                BIGINT
timestamp_ms         BIGINT
bcs                  VARCHAR
transaction_digest   VARCHAR
package_version      BIGINT
original_package_id  VARCHAR
id                   VARCHAR

Sui Raw Transaction Objects

column_name         column_type
id                  VARCHAR
object_id           VARCHAR
version             BIGINT
transaction_digest  VARCHAR
checkpoint          BIGINT
epoch               BIGINT
timestamp_ms        BIGINT
input_kind          VARCHAR
object_status       VARCHAR

Sui Raw Transactions

column_name                 column_type
id                          VARCHAR
transaction_digest          VARCHAR
checkpoint                  BIGINT
epoch                       BIGINT
timestamp_ms                BIGINT
sender                      VARCHAR
transaction_kind            VARCHAR
is_system_txn               BOOLEAN
is_sponsored_tx             BOOLEAN
transaction_count           BIGINT
execution_success           BOOLEAN
input                       BIGINT
shared_input                BIGINT
gas_coins                   BIGINT
created                     BIGINT
mutated                     BIGINT
deleted                     BIGINT
transfers                   BIGINT
split_coins                 BIGINT
merge_coins                 BIGINT
publish                     BIGINT
upgrade                     BIGINT
others                      BIGINT
move_calls                  BIGINT
packages                    VARCHAR
gas_owner                   VARCHAR
gas_object_id               VARCHAR
gas_object_sequence         BIGINT
gas_object_digest           VARCHAR
gas_budget                  DOUBLE
total_gas_cost              DOUBLE
computation_cost            DOUBLE
storage_cost                DOUBLE
storage_rebate              DOUBLE
non_refundable_storage_fee  DOUBLE
gas_price                   DOUBLE
raw_transaction             VARCHAR
has_zklogin_sig             BOOLEAN
has_upgraded_multisig       BOOLEAN
transaction_json            VARCHAR
effects_json                VARCHAR