eval_model.py in public repo is not directly runnable and blocks baseline SR reproduction #2

@siwon7

Description

The public repository does not currently allow straightforward reproduction of the live success-rate (SR) baseline given in the README. The evaluation script (eval_model.py) contains bugs that prevent it from running as-is, inconsistent metric naming, and undocumented simulator/version coupling.
