We provide a multi-step flow to run lm-evaluation-harness metrics offline for ONNX models. Offline mode is used to evaluate model generations on specific hardware (i.e., NPUs). The offline mode is ...