Data Format

Each benchmark sample is an operator directory with five files following a strict contract.

data/<source>/<operator_name>/
├── reference.py     # class Model(nn.Module)
├── input.py         # _make_inputs(**kwargs) -> dict[str, Tensor]
├── shapes.json      # shape spec (dict keyed by id)
├── metadata.json    # id / dtype / origin (not visible to agent)
└── roofline.json    # W / Q / SOL_time (not visible to agent)

Visibility Layers

File Role Agent Visible In Release
reference.py PyTorch reference (class Model)
input.py Input constructor (_make_inputs)
shapes.json Shape specifications (init + input kwargs)
metadata.json Operator identity + upstream provenance
roofline.json Roofline estimates (W / Q / SOL_time)