主页EIPs
EIPsEIP-7480
EIP-7480

EOF - Data section access instructions

Instructions to read data section of EOF container
DraftStandards Track: Core
创建时间: 2023-08-11
关联 EIP: EIP-3540, EIP-3670
Andrei Maiboroda (@gumb0), Alex Beregszaszi (@axic), Paweł Bylica (@chfast)
社区讨论原文链接编辑
1 分钟了解

相关视频
欢迎补充好内容
去提交
正文

Abstract

Four new instructions are introduced, that allow to read EOF container's data section: DATALOAD loads 32-byte word to stack, DATALOADN loads 32-byte word to stack where the word is addressed by a static immediate argument, DATASIZE loads data section size and DATACOPY copies a segment of data section to memory.

Motivation

Clear separation between code and data is one of the main features of EOF1. Data section may contain anything, e.g. compiler's metadata, but to make it useful for smart contracts, EVM has to have instructions that allow to read from data section. Previously existing instructions for bytecode inspection (CODECOPY, CODESIZE etc.) are deprecated in EOF1 and cannot be used for this purpose.

The DATALOAD, DATASIZE, DATACOPY instruction pattern follows the design of existing instructions for reading other kinds of data (i.e. returndata and calldata).

DATALOADN is an optimized version of DATALOAD, where data offset to read is set at compilation time, and therefore need not be validated at run-time, which makes the instruction cheaper.

Specification

We introduce four new instructions on the same block number EIP-3540 is activated on:

1 DATALOAD (0xe8) 2.DATALOADN (0xe9) 3.DATASIZE (0xea) 4.DATACOPY (0xeb)

If the code is legacy bytecode, all of these instructions result in an exceptional halt. (Note: This means no change to behaviour.)

If the code is valid EOF1, the following execution rules apply:

DATALOAD

  1. Pops one value, offset, from the stack.
  2. Reads [offset:offset+32] segment from the data section and pushes it as 32-byte value to the stack.
  3. If offset + 32 is greater than the data section size, bytes after the end of data section are set to 0.
  4. Deducts 4 gas.

DATALOADN

  1. Has one immediate argument,offset, encoded as a 16-bit unsigned big-endian value.
  2. Pops nothing from the stack.
  3. Reads [offset:offset+32] segment from the data section and pushes it as 32-byte value to the stack.
  4. Deducts 3 gas.

[offset:offset+32] is guaranteed to be within data bounds by code validation.

DATASIZE

  1. Pops nothing from the stack.
  2. Pushes the size of the data section of the active container to the stack.
  3. Deducts 2 gas.

DATACOPY

  1. Pops three values from the stack: mem_offset, offset, size.
  2. Performs memory expansion to mem_offset + size and deducts memory expansion cost.
  3. Deducts 3 * ((size + 31) // 32) gas for copying.
  4. Reads [offset:offset+size] segment from the data section and writes it to memory starting at offset mem_offset.
  5. If offset + size is greater than data section size, 0 bytes will be copied for bytes after the end of the data section.

Code Validation

We extend code section validation rules (as defined in EIP-3670).

  1. Code section is invalid in case an immediate argument offset of any DATALOADN is such that offset + 32 is greater than data section size.
  2. RJUMP, RJUMPI and RJUMPV immediate argument value (jump destination relative offset) validation: code section is invalid in case offset points to one of two bytes directly following DATALOADN instruction.

Rationale

Zero-padding on out of bounds access

Existing instructions for reading other kinds of data implicitly pad with zeroes on out of bounds access, with the only exception of return data copying.

It is benefitial to avoid exceptional failures, because compilers can employ optimizations like removing a code that copies data, but never accesses this copy afterwards, but such optimization is possible only if instruction never has other side effects like exceptional abort.

Backwards Compatibility

This change poses no risk to backwards compatibility, as it is introduced only for EOF1 contracts, for which deploying undefined instructions is not allowed, therefore there are no existing contracts using these instructions. The new instructions are not introduced for legacy bytecode (code which is not EOF formatted).

Security Considerations

TBA

Copyright and related rights waived via CC0.

扩展阅读
欢迎补充好内容
去提交
相关项目
欢迎补充好内容
去提交

不想错过最新的 EIP 动态?

订阅 EIPs Fun 周刊以跟进相关更新,建⽴你与 EIP 之间的连接 ,更好地建设以太坊。

详情
聚集 EIP 贡献者, 扩展以太坊
资源
GitHub
支持社区