Skip to content

LLM IFEval Dataset Implementation #1060

@farook-edev

Description

@farook-edev

Things left to do on this dataset are as follows:

  • Finish code to convert data to .tfrecord
  • Finish code to handle sending and receiving data.
  • Review Instruction validation code (possibly against existing implementation).
  • Test Accuracy and Performance metrics.
  • Potentially move accuracy calculation away from output processing (similar suggestion in TinyMMLU)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions