Pinned

  1. LMLM Public

    Python 34 1

  2. phantom-wiki Public

    Python package for generating datasets to evaluate reasoning and retrieval of large language models

    Python 25 4

  3. phantom-reasoning Public

    Code for the paper "Learning from Synthetic Data Improves Multi-hop Reasoning"

    Python 8

  4. banditeval Public

    The official code release for "On Speeding Up Language Model Evaluation"

    Python 7 1

Repositories

Showing 6 of 6 repositories
