This article guides you through sideloading the models required for the Procyon AI Text Generation Benchmark. Note that an active internet connection is still required to run the test.

Models for workload version 1.0.191

Models for GPU, ONNX Runtime

Model                               Download Link
Phi-3.5-mini-instruct-onnx-int4     benchmarks.ul.com/downloads/onnxdml_Phi-3.5-mini-instruct-onnx-int4.zip
Mistral-7B-Instruct-v0.2-onnx-int4  benchmarks.ul.com/downloads/onnxdml_mistral-7b-instruct-v0.2-ONNX.zip
llama-3_1-8b-instruct-onnx-int4     benchmarks.ul.com/downloads/onnxdml_llama-3_1-8b-instruct-onnx-int4.zip
Llama-2-13b-chat-hf-onnx-int4       benchmarks.ul.com/downloads/onnxdml_Llama-2-13b-chat-hf-onnx-int4.zip

Models for Intel GPUs, OpenVINO 2026.1 Runtime

Model                             Download Link
Phi-3.5-mini-instruct-ov-int4     benchmarks.ul.com/downloads/openvino_2026_1_0_Phi-3.5-mini-instruct-ov-int4.zip
Mistral-7B-Instruct-v0.2-ov-int4  benchmarks.ul.com/downloads/openvino_2026_1_0_Mistral-7B-Instruct-v0.2-ov-int4.zip
llama-3_1-8b-instruct-ov-int4     benchmarks.ul.com/downloads/openvino_2026_1_0_llama-3_1-8b-instruct-ov-int4.zip
Llama-2-13b-chat-hf-ov-int4       benchmarks.ul.com/downloads/openvino_2026_1_0_Llama-2-13b-chat-hf-ov-int4.zip

Models for AMD NPUs, ONNX VitisAI Runtime

Model                                         Download Link
Phi-3.5-mini-instruct-int4-hybrid_prefill     benchmarks.ul.com/downloads/npu_onnxvaip_phi-3.5-mini-instruct-int4-hybrid_prefill.zip
Mistral-7B-Instruct-v0.2-int4-npuonly         benchmarks.ul.com/downloads/npu_onnxvaip_Mistral-7B-Instruct-v0.2-int4_npuonly.zip
Mistral-7B-Instruct-v0.2-int4-hybrid_prefill  benchmarks.ul.com/downloads/npu_onnxvaip_Mistral-7B-Instruct-v0.2-int4-hybrid_prefill.zip
llama-3_1-8b-instruct-int4-npuonly            benchmarks.ul.com/downloads/npu_onnxvaip_Llama-3.1-8B-Instruct-int4_npuonly.zip
llama-3_1-8b-instruct-int4-hybrid_prefill     benchmarks.ul.com/downloads/npu_onnxvaip_Llama-3.1-8B-Instruct-int4-hybrid_prefill.zip

Models for Intel NPUs, OpenVINO 2026.1 Runtime

Model                             Download Link
Phi-3.5-mini-instruct-ov-int4     benchmarks.ul.com/downloads/npu_openvino_2026_1_0_Phi-3.5-mini-instruct-ov-int4.zip
Mistral-7B-Instruct-v0.2-ov-int4  benchmarks.ul.com/downloads/npu_openvino_2026_1_0_Mistral-7B-Instruct-v0.2-ov-int4.zip
llama-3_1-8b-instruct-ov-int4     benchmarks.ul.com/downloads/npu_openvino_2026_1_0_llama-3_1-8b-instruct-ov-int4.zip
Llama-2-13b-chat-hf-ov-int4       benchmarks.ul.com/downloads/npu_openvino_2026_1_0_Llama-2-13b-chat-hf-ov-int4.zip

Models for Qualcomm NPUs, Genie Runtime

Model                             Download Link
Phi-3.5-mini-instruct             benchmarks.ul.com/downloads/npu_qualgenie_2_43_1_Phi-3.5-mini-instruct.zip
llama-3_1-8b-instruct             benchmarks.ul.com/downloads/npu_qualgenie_2_43_1_Llama-3.1-8B-Instruct.zip
Phi-3.5-mini-instruct (X2 Elite)  benchmarks.ul.com/downloads/npu_qualgenie_2_43_1_Phi-3.5-mini-instruct_xe2.zip
llama-3_1-8b-instruct (X2 Elite)  benchmarks.ul.com/downloads/npu_qualgenie_2_43_1_Llama-3.1-8B-Instruct_xe2.zip
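
The archive names for workload version 1.0.191 start with a prefix that matches the runtime section they are listed under. As a rough illustration only, the sketch below maps those prefixes back to the section headings, which can help when sorting a folder of downloaded archives. The names `RUNTIME_BY_PREFIX` and `runtime_for_archive` are hypothetical and not part of Procyon.

```python
# Archive name prefixes as they appear in the tables for workload version
# 1.0.191, mapped to the section heading each archive is listed under.
RUNTIME_BY_PREFIX = {
    "onnxdml_": "GPU, ONNX Runtime",
    "openvino_2026_1_0_": "Intel GPUs, OpenVINO 2026.1 Runtime",
    "npu_onnxvaip_": "AMD NPUs, ONNX VitisAI Runtime",
    "npu_openvino_2026_1_0_": "Intel NPUs, OpenVINO 2026.1 Runtime",
    "npu_qualgenie_2_43_1_": "Qualcomm NPUs, Genie Runtime",
}


def runtime_for_archive(archive_name):
    """Return the runtime section for an archive name, or None if unknown."""
    for prefix, runtime in RUNTIME_BY_PREFIX.items():
        if archive_name.startswith(prefix):
            return runtime
    # Archives with other prefixes are not covered by this mapping.
    return None
```

Archives whose names do not start with one of these prefixes, such as those for older workload versions, return `None` with this mapping.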

Installing the Models

You can download individual models or all of them, depending on your configuration and benchmarking needs.
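
Downloading a selection of archives can also be scripted. The following is a minimal sketch, not an official tool. It assumes the links listed in this article are served over HTTPS (the tables do not show a scheme). The function names `archive_url` and `download_archives` and the `downloads` target folder are hypothetical.

```python
import urllib.request
from pathlib import Path

# Host and path as listed in the tables; the "https://" scheme is an assumption.
BASE_URL = "https://benchmarks.ul.com/downloads/"


def archive_url(archive_name: str) -> str:
    """Build the download URL for an archive file name taken from the tables."""
    if not archive_name.endswith(".zip"):
        raise ValueError(f"expected a .zip archive name, got {archive_name!r}")
    return BASE_URL + archive_name


def download_archives(archive_names, target_dir="downloads"):
    """Download each archive into target_dir, skipping files that already exist."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    saved = []
    for name in archive_names:
        destination = target / name
        if not destination.exists():
            urllib.request.urlretrieve(archive_url(name), destination)
        saved.append(destination)
    return saved
```

For example, `download_archives(["onnxdml_Phi-3.5-mini-instruct-onnx-int4.zip"])` would fetch only the Phi-3.5 archive for the ONNX Runtime GPU configuration.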


1. By default, the benchmark is installed in:

%ProgramData%\UL\Procyon\chops\dlc\ai-textgeneration-benchmark\

2. Create a subfolder named "models" if it does not already exist:

%ProgramData%\UL\Procyon\chops\dlc\ai-textgeneration-benchmark\models

3. Unzip the downloaded models and copy the extracted folders into the models folder. Note that the names of the extracted folders should not contain the prefix "zip_".
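
The steps above can be sketched in a short script. This is an illustration under stated assumptions, not an official installer: the helper names `install_models` and `folders_with_zip_prefix` are hypothetical, the `C:\ProgramData` fallback is an assumption for systems where the `ProgramData` environment variable is not set, and the script only reports folders that start with "zip_" rather than renaming them.

```python
import os
import zipfile
from pathlib import Path

# Default benchmark location from step 1. %ProgramData% is resolved at runtime;
# the C:\ProgramData fallback is an assumption.
DEFAULT_BENCHMARK_DIR = (
    Path(os.environ.get("ProgramData", r"C:\ProgramData"))
    / "UL" / "Procyon" / "chops" / "dlc" / "ai-textgeneration-benchmark"
)


def install_models(archives, benchmark_dir=DEFAULT_BENCHMARK_DIR):
    """Extract each downloaded archive into <benchmark_dir>/models."""
    models_dir = Path(benchmark_dir) / "models"
    # Step 2: create the "models" subfolder if it does not exist.
    models_dir.mkdir(parents=True, exist_ok=True)
    for archive in archives:
        # Step 3: unzip the archive directly into the models folder.
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(models_dir)
    return models_dir


def folders_with_zip_prefix(models_dir):
    """Return the names of top-level folders that still start with "zip_"."""
    return sorted(
        entry.name
        for entry in Path(models_dir).iterdir()
        if entry.is_dir() and entry.name.startswith("zip_")
    )
```

After calling `install_models` with the paths of the downloaded archives, an empty list from `folders_with_zip_prefix` indicates that no extracted folder carries the "zip_" prefix.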


Legacy models

Models for OpenVINO NPU 2025.4 Runtime Update (Workload 1.0.152)

Model                             Download Link
Phi-3.5-mini-instruct-ov-int4     benchmarks.ul.com/downloads/npu_openvino_2025_4_0_Phi-3.5-mini-instruct-ov-int4.zip
Mistral-7B-Instruct-v0.2-ov-int4  benchmarks.ul.com/downloads/npu_openvino_2025_4_0_Mistral-7B-Instruct-v0.2-ov-int4.zip
llama-3_1-8b-instruct-ov-int4     benchmarks.ul.com/downloads/npu_openvino_2025_4_0_llama-3_1-8b-instruct-ov-int4.zip
Llama-2-13b-chat-hf-ov-int4       benchmarks.ul.com/downloads/npu_openvino_2025_4_0_Llama-2-13b-chat-hf-ov-int4.zip

Models for Qualcomm Genie NPU Runtime (Workload 1.0.152)

Model                  Download Link
Phi-3.5-mini-instruct  benchmarks.ul.com/downloads/npu_qualgenie_Phi-3.5-mini-instruct.zip
llama-3_1-8b-instruct  benchmarks.ul.com/downloads/npu_qualgenie_Llama-3.1-8B-Instruct.zip

Models for OpenVINO GPU 2025.2 Runtime Update (Workload 1.0.96 & 1.0.152)

Model                             Download Link
Phi-3.5-mini-instruct-ov-int4     benchmarks.ul.com/downloads/openvino_2025_2_Phi-3.5-mini-instruct-ov-int4.zip
Mistral-7B-Instruct-v0.2-ov-int4  benchmarks.ul.com/downloads/openvino_2025_2_Mistral-7B-Instruct-v0.2-ov-int4.zip
llama-3_1-8b-instruct-ov-int4     benchmarks.ul.com/downloads/openvino_2025_2_llama-3_1-8b-instruct-ov-int4.zip
Llama-2-13b-chat-hf-ov-int4       benchmarks.ul.com/downloads/openvino_2025_2_Llama-2-13b-chat-hf-ov-int4.zip

Models for OpenVINO GPU 2025.0 Runtime Update (Workload 1.0.82)

Model                             Download Link
Phi-3.5-mini-instruct-ov-int4     benchmarks.ul.com/downloads/openvino_2025_0_Phi-3.5-mini-instruct-ov-int4.zip
Mistral-7B-Instruct-v0.2-ov-int4  benchmarks.ul.com/downloads/openvino_2025_0_Mistral-7B-Instruct-v0.2-ov-int4.zip
llama-3_1-8b-instruct-ov-int4     benchmarks.ul.com/downloads/openvino_2025_0_llama-3_1-8b-instruct-ov-int4.zip
Llama-2-13b-chat-hf-ov-int4       benchmarks.ul.com/downloads/openvino_2025_0_Llama-2-13b-chat-hf-ov-int4.zip

Models for OpenVINO Runtime (launch version, Workload 1.0.73)

Model                             Download Link
Phi-3.5-mini-instruct-ov-int4     benchmarks.ul.com/downloads/openvino_Phi-3.5-mini-instruct-ov-int4.zip
Mistral-7B-Instruct-v0.2-ov-int4  benchmarks.ul.com/downloads/openvino_Mistral-7B-Instruct-v0.2-ov-int4.zip
llama-3_1-8b-instruct-ov-int4     benchmarks.ul.com/downloads/openvino_llama-3_1-8b-instruct-ov-int4.zip
Llama-2-13b-chat-hf-ov-int4       benchmarks.ul.com/downloads/openvino_Llama-2-13b-chat-hf-ov-int4.zip