
Installation#

Detailed installation instructions for La Perf across different platforms.


System Requirements#

Minimum Requirements#

  • Python: 3.12 or higher
  • RAM: 8 GB (embeddings), 16 GB (LLM), 18 GB (VLM)
  • Disk Space: ~100 GB free for models and datasets
  • OS: Linux, macOS, or Windows

Recommended#

  • GPU: NVIDIA (CUDA), AMD (ROCm), or Apple Silicon (MPS)
  • RAM: 24 GB+ for comfortable multitasking
  • SSD: Fast storage for quicker dataset loading
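
To sanity-check these basics before installing, standard shell commands are enough (output format varies by OS):

# Check the Python version on your PATH
python3 --version

# Check free disk space on the current filesystem
df -h .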

Installing uv#

La Perf uses uv as its package manager.

macOS / Linux:

curl -LsSf https://astral.sh/uv/install.sh | sh

Windows (PowerShell):

powershell -c "irm https://astral.sh/uv/install.ps1 | iex"

Any platform (via pip):

pip install uv

Verify installation:

uv --version

Why uv?

La Perf uses uv for fast, reliable dependency management. It's significantly faster than pip and handles environment isolation automatically.
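
For reference, these are the uv subcommands used throughout this guide (all part of uv's standard CLI):

# Install the project's locked dependencies into a managed virtual environment
uv sync

# Run a command inside that environment without activating it manually
uv run python main.py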


Installing La Perf#

1. Clone the repository#

git clone https://github.com/bogdanminko/laperf.git
cd laperf

2. Install dependencies#

For benchmarking only#

uv sync

For development#

uv sync --group quality --group dev

This installs additional tools:

  • ruff - Fast Python linter
  • mypy - Type checker
  • bandit - Security scanner
  • pre-commit - Git hooks
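
If you want the Git hooks active in your clone, enable them with pre-commit's standard setup command (the exact hooks run depend on the repository's .pre-commit-config.yaml):

uv run pre-commit install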

3. Verify installation#

uv run python -c "import torch; print(torch.__version__)"

LM Studio Setup#

For LLM/VLM benchmarks, install LM Studio:

1. Download LM Studio#

Visit lmstudio.ai and download for your platform.

2. Load a model#

The easiest way to find and download models is through the LM Studio UI.

Load an LLM

Search for gpt-oss-20b in the available models and pick the build for your platform:

  • MLX (Apple Silicon): mlx-community/gpt-oss-20b-MXFP4-Q8
  • GGUF (all other platforms): lmstudio-community/gpt-oss-20b-GGUF

Load a VLM

Search for Qwen3-VL-8B-Instruct in the available models and pick the build for your platform:

  • MLX (Apple Silicon): lmstudio-community/Qwen3-VL-8B-Instruct-MLX-4bit
  • GGUF (all other platforms): lmstudio-community/Qwen3-VL-8B-Instruct-GGUF-Q4_K_M

3. Start the server#

  1. Click "Developer" tab
  2. Click "Start Server"
  3. Verify it's running on http://localhost:1234
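
To confirm the server is reachable, you can query LM Studio's OpenAI-compatible API (default port shown; adjust it if you changed the server settings):

curl http://localhost:1234/v1/models

This should return a JSON list of the models you have loaded.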

Ollama Setup#

For LLM/VLM benchmarks, you can also use Ollama:

1. Install Ollama#

macOS (Homebrew):

brew install ollama

Linux:

curl -fsSL https://ollama.com/install.sh | sh

Windows:

Download the installer from ollama.com.

2. Pull a model#

Pull the LLM:

ollama pull gpt-oss:20b

Pull the VLM:

ollama pull qwen3-vl:8b
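
To verify the pulls succeeded, list the models Ollama has locally:

ollama list

Both models should appear in the output. By default, Ollama serves its API on http://localhost:11434.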


Verifying Your Setup#

Run a quick test to ensure everything works:

Using make:

make bench

Using uv:

uv run python main.py

This will:

  1. Auto-detect your hardware (CUDA / MPS / CPU)
  2. Run all available benchmarks (all are pre-selected — you can toggle individual ones in the TUI using Space)
  3. Save the results to results/report_{your_device}.json
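
Once a run finishes, you can inspect the report with standard tools (the exact file name depends on your detected device):

# List generated reports
ls results/

# Pretty-print a report (substitute your actual file name)
python -m json.tool results/report_<your_device>.json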

Hardware Detection

La Perf automatically detects your GPU and optimizes accordingly. No manual configuration needed!
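
As an illustration of the idea (not La Perf's actual code), device detection with PyTorch's public API looks roughly like this:

uv run python - <<'PY'
import torch

# Prefer CUDA, then Apple's Metal backend (MPS), then fall back to CPU
if torch.cuda.is_available():
    device = "cuda"
elif torch.backends.mps.is_available():
    device = "mps"
else:
    device = "cpu"
print(f"Detected device: {device}")
PY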


Troubleshooting#

uv command not found#

After installing uv, restart your terminal or run:

source ~/.bashrc  # or ~/.zshrc on macOS
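
If the command is still missing, the installer typically places uv in ~/.local/bin (the location can vary by installer version); add it to PATH manually:

export PATH="$HOME/.local/bin:$PATH"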

Python version mismatch#

Ensure you're using Python 3.12+:

uv run python --version
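
If the version is too old, uv can install and pin a suitable interpreter for the project (uv python install and uv python pin are standard uv subcommands):

uv python install 3.12
uv python pin 3.12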

CUDA not detected#
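
If the benchmark falls back to CPU on an NVIDIA machine, first confirm the driver sees the GPU, then check whether your PyTorch build has CUDA support:

# Driver-level check
nvidia-smi

# PyTorch-level check
uv run python -c "import torch; print(torch.cuda.is_available(), torch.version.cuda)"

If this prints False, you likely have a CPU-only PyTorch build or a driver/toolkit mismatch; reinstalling PyTorch with CUDA support usually resolves it.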


Next Steps#