Back

Gemmabench

View Repository
Hugging FaceLightevallm-evaluation-harnessGemmaLLMs

Gemmabench

Introduction

GemmaBench is a benchmarking tool designed to evaluate the performance of Gemma and other Hugging Face language models using standard benchmarking tasks. It integrates seamlessly with the LightEval framework and supports multiple backends for flexible benchmarking.

Features

Installation

Prerequisites:

  1. Clone the repository:
git clone https://github.com/Eyepatch0/gemmabench.git
cd gemmabench
  1. Create a virtual environment (optional but recommended):

    python -m venv venv
    venv\Scripts\activate # Activate it (Windows)
    source venv/bin/activate # Activate it (Linux/macOS)
  2. Install the required packages:

    pip install --upgrade pip
    pip install -r requirements.txt
  3. Create a .env file in the root directory and add your Hugging Face token:

    HUGGINGFACE_TOKEN=your_huggingface_token

Usage:

To benchmark a model, run the following command:

python run_benchmark.py

The script will guide you through:

Backend Options:

Future Work:

Contact:

You can reach out to me on LinkedIn or GitHub for any questions or feedback. I am always open to suggestions and improvements.