Evaluation runs

Run              Model        Benchmark      Status       Score     Time
truthful-1.2b    LFM2-1.2B    TruthfulQA     Complete     60%       Yesterday
hellaswag-3b     LFM-3B       HellaSwag      Failed       -         Yesterday
mmlu-moe         LFM-40B-MoE  MMLU           Complete     88%       Yesterday
hellaswag-a      LFM2-350M    HellaSwag      Complete     49%       1h ago
mmlu-regress     LFM-7B       MMLU           Complete     79%       1h ago
gsm8k-350m       LFM2-350M    GSM8K          Complete     31%       2d ago
humaneval-1.2b   LFM2-1.2B    HumanEval      Complete     46%       2d ago
arc-moe          LFM-40B-MoE  ARC-Challenge  Complete     81%       2d ago
gsm8k-cot        LFM-40B-MoE  GSM8K          In progress  Running…  2h ago
humaneval-b      LFM-3B       HumanEval      Complete     41%       2h ago
nightly-mmlu     LFM2-1.2B    MMLU           Complete     71%       2m ago
truthful-3b      LFM-3B       TruthfulQA     In progress  Running…  3d ago
hellaswag-7b     LFM-7B       HellaSwag      Complete     72%       3d ago
arc-sweep        LFM2-1.2B    ARC-Challenge  Complete     67%       3h ago
truthful-a       LFM-7B       TruthfulQA     Failed       -         3h ago
hellaswag-b      LFM-40B-MoE  HellaSwag      Complete     83%       4h ago
nightly-gsm8k    LFM2-1.2B    GSM8K          In progress  Running…  4m ago
mmlu-350m        LFM2-350M    MMLU           Complete     53%       5h ago
gsm8k-3b         LFM-3B       GSM8K          Complete     44%       6h ago
humaneval-moe    LFM-40B-MoE  HumanEval      In progress  Running…  7h ago
arc-7b           LFM-7B       ARC-Challenge  Complete     74%       9h ago
humaneval-sweep  LFM-7B       HumanEval      Complete     58%       18m ago
arc-baseline     LFM-3B       ARC-Challenge  Failed       -         32m ago
truthful-probe   LFM-40B-MoE  TruthfulQA     Complete     64%       51m ago