initial notebook draft wip #60

zainhas · 2025-11-17T18:38:33Z

Note

Adds a new LLM judge optimization notebook and a DPO training dataset.

Evals:
- Add notebook Evals/Optimizing_LLM_Judges.ipynb for LLM judge optimization experiments.
- Add DPO training dataset Evals/judge_dpo_data/rewardbench2_dpo_train.jsonl.

Written by Cursor Bugbot for commit 0e33a16. This will update automatically on new commits.

VProv · 2025-11-21T16:52:19Z

The cell that starts with

# pip install together
import json
import os
from together import Together

client = Together()

# Create dataset comparing two model responses
compare_data = [
    {
        "prompt": "Explain photosynthesis",
        "response_a": "Photosynthesis is how plants make food using sunlight.",
        "response_b": "Photosynthesis is the process by which plants convert light energy into chemical energy, using chlorophyll to transform CO2 and water into glucose and oxygen."
    },
]

seems to be redundant
