OpenCV CUDA build process running on Jetson Orin, installing GPU-accelerated OpenCV from source
jetsonopencvcudajetpackedgeaicomputer vision

OpenCV with CUDA on Jetson: CUDA_ARCH_BIN, cmake flags, JetPack 5 and 6

Aaron Angulo · · Updated

Installing OpenCV with CUDA support on Jetson is not as simple as a package manager install, the prebuilt packages either lack CUDA support or link against the wrong CUDA version. Building from source with the right CMake flags gets you CUDA-accelerated OpenCV, but the build process has several known pitfalls that cost hours if you hit them blind.

Key Insights

  • apt install python3-opencv gives you a CPU-only build, it was compiled without CUDA flags and will not use the GPU for any operation, even on Jetson
  • CUDA architecture (CUDA_ARCH_BIN) must match your Jetson module, Orin is 8.7, Xavier is 7.2, Nano is 5.3; wrong value = silent fallback to CPU
  • Build time ranges from 20 minutes (Orin AGX) to 2 hours (Nano), add swap space before building on lower-end modules or the compiler will OOM and abort
  • JetPack 5 and 6 have the same cmake flags, the difference is in the pre-installed CUDA version (11.4 vs 12.x) and the available CUDA architectures
  • Uninstall the apt package before building, conflicts will cause import errors even if the build succeeds

Why the apt package doesn’t use CUDA

This is the most common Jetson computer vision confusion we encounter. Run this on a fresh Jetson with the apt-installed OpenCV:

import cv2
print(cv2.cuda.getCudaEnabledDeviceCount())  # Returns 0
print(cv2.getBuildInformation())              # CUDA: NO

The NVIDIA apt repository ships OpenCV built without CUDA because the correct CUDA_ARCH_BIN value is hardware-specific. Jetson Orin uses compute capability 8.7; Xavier uses 7.2; Nano uses 5.3. A single generic apt package can’t encode all of these, so NVIDIA ships the CPU fallback and expects you to build from source.

The result is that every Jetson project doing computer vision needs a from-source OpenCV build at some point. Here’s the exact sequence that works.

Step 1: Remove the apt package and add swap

sudo apt remove python3-opencv libopencv-dev libopencv-contrib-dev -y
sudo apt autoremove -y

Add swap space before building. Without it, the compiler will OOM on modules with less than 16GB RAM:

sudo fallocate -l 8G /swapfile
sudo chmod 600 /swapfile
sudo mkswap /swapfile
sudo swapon /swapfile

Verify swap is active:

free -h

Step 2: Install build dependencies

sudo apt update
sudo apt install -y \
    build-essential cmake git pkg-config \
    libjpeg-dev libpng-dev libtiff-dev \
    libavcodec-dev libavformat-dev libswscale-dev \
    libgtk2.0-dev libcanberra-gtk* \
    python3-dev python3-numpy \
    libgstreamer1.0-dev libgstreamer-plugins-base1.0-dev \
    libv4l-dev v4l-utils

Step 3: Clone OpenCV and opencv_contrib

cd ~
git clone --depth 1 --branch 4.9.0 https://github.com/opencv/opencv.git
git clone --depth 1 --branch 4.9.0 https://github.com/opencv/opencv_contrib.git

Use matching version tags for both repos. Mismatched versions will fail at compile time.

Step 4: Build with CUDA flags

Find your CUDA architecture. Check your module first:

cat /etc/nv_tegra_release | head -1

Then use the right value:

Jetson moduleCUDA_ARCH_BIN
AGX Orin, Orin NX, Orin Nano8.7
AGX Xavier, Xavier NX7.2
TX2, TX2 NX6.2
Nano (original 2019)5.3
cd ~/opencv
mkdir build && cd build

cmake -D CMAKE_BUILD_TYPE=RELEASE \
      -D CMAKE_INSTALL_PREFIX=/usr/local \
      -D OPENCV_EXTRA_MODULES_PATH=~/opencv_contrib/modules \
      -D WITH_CUDA=ON \
      -D CUDA_ARCH_BIN=8.7 \
      -D WITH_CUDNN=ON \
      -D OPENCV_DNN_CUDA=ON \
      -D WITH_GSTREAMER=ON \
      -D WITH_LIBV4L=ON \
      -D BUILD_opencv_python3=ON \
      -D BUILD_TESTS=OFF \
      -D BUILD_PERF_TESTS=OFF \
      -D BUILD_EXAMPLES=OFF \
      -D INSTALL_PYTHON_EXAMPLES=OFF \
      ..

Replace 8.7 with your module’s value from the table above.

Review the cmake output before building. Look for:

--   NVIDIA CUDA:                   YES (ver 12.2, CUFFT CUBLAS FAST_MATH)
--     NVIDIA GPU arch:             87
--   cuDNN:                         YES
--   GStreamer:                     YES

If CUDA shows NO, the CUDA toolkit path isn’t set correctly. On Jetson, CUDA is at /usr/local/cuda, verify with ls /usr/local/cuda/bin/nvcc.

Step 5: Compile and install

make -j$(nproc)
sudo make install
sudo ldconfig

On Jetson Nano this will take 90–120 minutes. On Orin AGX, closer to 25 minutes.

Step 6: Verify

import cv2

# Check CUDA is available
print("CUDA devices:", cv2.cuda.getCudaEnabledDeviceCount())

# Check build info for CUDA section
info = cv2.getBuildInformation()
cuda_start = info.find("NVIDIA CUDA")
print(info[cuda_start:cuda_start+200])

# Quick functional test
img = cv2.imread("test.jpg")
gpu_img = cv2.cuda_GpuMat()
gpu_img.upload(img)
gpu_gray = cv2.cuda.cvtColor(gpu_img, cv2.COLOR_BGR2GRAY)
result = gpu_gray.download()
print("GPU round-trip OK, shape:", result.shape)

getCudaEnabledDeviceCount() returning 1 confirms CUDA is active. If it returns 0 after a successful build, you have a Python path conflict, the old apt package is still being imported. Verify with python3 -c "import cv2; print(cv2.__file__)" to confirm the right .so is loading.

JetPack 5 vs JetPack 6 differences

The cmake flags are identical between JetPack 5 and 6. What changes:

JetPack 5.xJetPack 6.x
CUDA version11.412.x
cuDNN version8.x9.x
Python default3.83.10
Supported modulesOrin, XavierOrin only

For JetPack 5 on Xavier, the CUDA_ARCH_BIN is 7.2. For JetPack 6 on Orin, it’s 8.7. Everything else in the build sequence is the same.

Troubleshooting common build failures

cc1plus: fatal error: Killed during compilation

The compiler process was OOM-killed. This always means insufficient memory. Fix: add 8GB swap before starting the build (fallocate -l 8G /swapfile && chmod 600 /swapfile && mkswap /swapfile && swapon /swapfile). If the OOM continues with swap, reduce parallelism to make -j4 or make -j2. On Jetson Nano, 8GB swap + make -j2 is the only reliable combination.

getCudaEnabledDeviceCount() returns 0 after a successful build

Python is loading the apt-installed OpenCV, not the one you built. The symptom: python3 -c "import cv2; print(cv2.__file__)" shows a path under /usr/lib/python3/dist-packages/. Fix: sudo apt remove python3-opencv libopencv-dev and reopen your terminal. The newly built .so at /usr/local/lib/python3.x/dist-packages/ then takes precedence.

cmake reports NVIDIA CUDA: NO

The CUDA toolkit isn’t on the path. On Jetson, CUDA lives at /usr/local/cuda. Verify:

ls /usr/local/cuda/bin/nvcc
export PATH=/usr/local/cuda/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH

Add those exports to ~/.bashrc so they persist across sessions before re-running cmake.

WITH_CUDNN=ON but cmake shows cuDNN: NO

The cuDNN development headers are missing. Fix:

sudo apt install libcudnn8-dev  # JetPack 5
# or
sudo apt install libcudnn9-dev  # JetPack 6

Then verify /usr/include/cudnn.h exists and re-run cmake.

ImportError: libopencv_core.so.4.9: cannot open shared object file

The shared library path isn’t updated. Fix:

sudo ldconfig

If that doesn’t resolve it, add /usr/local/lib to /etc/ld.so.conf.d/opencv.conf and run sudo ldconfig again.

Building OpenCV with CUDA in Docker on Jetson

For reproducible deployments, building inside a container avoids polluting the host and makes the build portable across Jetson units.

Use an NVIDIA L4T base image that already includes CUDA:

FROM nvcr.io/nvidia/l4t-ml:r36.2.0-py3

RUN apt-get update && apt-get install -y \
    build-essential cmake git \
    libjpeg-dev libpng-dev \
    python3-dev python3-numpy \
    libgstreamer1.0-dev \
    && rm -rf /var/lib/apt/lists/*

RUN git clone --depth 1 --branch 4.9.0 https://github.com/opencv/opencv.git /opencv && \
    git clone --depth 1 --branch 4.9.0 https://github.com/opencv/opencv_contrib.git /opencv_contrib

RUN mkdir /opencv/build && cd /opencv/build && cmake \
    -D CMAKE_BUILD_TYPE=RELEASE \
    -D CMAKE_INSTALL_PREFIX=/usr/local \
    -D OPENCV_EXTRA_MODULES_PATH=/opencv_contrib/modules \
    -D WITH_CUDA=ON \
    -D CUDA_ARCH_BIN=8.7 \
    -D WITH_CUDNN=ON \
    -D OPENCV_DNN_CUDA=ON \
    -D WITH_GSTREAMER=ON \
    -D BUILD_opencv_python3=ON \
    -D BUILD_TESTS=OFF \
    -D BUILD_EXAMPLES=OFF \
    .. && make -j$(nproc) && make install && ldconfig

Run the container with GPU access:

docker build -t opencv-cuda-jetson .
docker run --runtime=nvidia --rm opencv-cuda-jetson \
    python3 -c "import cv2; print(cv2.cuda.getCudaEnabledDeviceCount())"

The --runtime=nvidia flag is required to expose the GPU inside the container. Without it, the CUDA device count will be 0.

If you’re also working out which JetPack version to target for your project, the JetPack versions and L4T compatibility table has a full breakdown. For production-grade inference pipelines on Jetson, the Jetson AI deployment service covers TensorRT optimization, model conversion, and throughput testing.

NVIDIA Jetson Expert Support

Stuck on a Jetson bring-up?

We've debugged this failure mode before. BSP, device tree, camera pipelines, OTA, most blockers clear in the first session. No long retainers. No guessing.

Frequently Asked Questions

Why does apt install python3-opencv not include CUDA on Jetson?

The apt package is compiled without CUDA support to keep it generic and dependency-free. It runs entirely on the CPU. NVIDIA doesn't ship a CUDA-enabled OpenCV via apt because the correct CUDA architecture flags (CUDA_ARCH_BIN) depend on the specific Jetson module. You have to build from source with those flags set to your hardware.

How do I verify that OpenCV is using CUDA on Jetson?

In Python: import cv2; print(cv2.cuda.getCudaEnabledDeviceCount()). If this returns 1 or more, CUDA is available. Also check cv2.getBuildInformation() and look for the CUDA section, it will show Enabled: YES with the device count and CUDA architecture.

How long does it take to build OpenCV from source on Jetson?

On Jetson Orin AGX with 12 cores: 20–30 minutes. On Jetson Xavier NX (6 cores): 45–60 minutes. On Jetson Nano (4 cores, 4GB): 90–120 minutes and you need swap space. Set make -j$(nproc) and add at least 4GB of swap before building on lower-end modules.

What CUDA_ARCH_BIN value should I use for my Jetson?

Jetson AGX Orin / Orin NX / Orin Nano: 8.7. Jetson AGX Xavier / Xavier NX: 7.2. Jetson TX2: 6.2. Jetson Nano (original): 5.3. Using the wrong value means CUDA code compiles for the wrong architecture and may not run or may run slowly.

Do I need to uninstall the apt OpenCV before building from source?

Yes, if you have python3-opencv installed. The apt package installs to the same paths as the source build and will conflict. Remove it first: sudo apt remove python3-opencv libopencv-dev. Then build from source. If you need both the system package for other tools and the CUDA build for your project, use a virtual environment and install the built .whl into it.

My build succeeded but getCudaEnabledDeviceCount() still returns 0. What's wrong?

Python is importing the old apt OpenCV, not the one you just built. Run python3 -c "import cv2; print(cv2.__file__)", if it shows a path like /usr/lib/python3/dist-packages/cv2.so, the apt package is still present and taking precedence. Remove it with sudo apt remove python3-opencv, then confirm the path shows /usr/local/lib/python3.x/dist-packages/ after removal.

The compiler gets killed halfway through the build. How do I fix OOM errors?

The Jetson is running out of memory during compilation. cc1plus: fatal error: Killed is the symptom. Fix: add at least 8GB of swap (fallocate -l 8G /swapfile), and if the OOM persists, reduce parallelism to make -j4 instead of -j$(nproc). On Jetson Nano (4GB), 8GB swap + make -j2 is the reliable combination. The extra compile time is worth it over a build that aborts at 80%.

Can I build OpenCV with CUDA inside a Docker container on Jetson?

Yes. Use an NVIDIA L4T base image (nvcr.io/nvidia/l4t-base or l4t-ml) that already has CUDA installed. Add --runtime=nvidia when running the container so the GPU is accessible. The cmake flags are the same as the host build. One caveat: the built .so will be tied to the CUDA version in the container, make sure the base image CUDA version matches the JetPack on your host.

Which OpenCV version should I build on Jetson?

4.9.0 is the current stable release and works on both JetPack 5 and JetPack 6. Avoid OpenCV 4.10 if you use the DNN module with cuDNN, it changed the cuDNN API and is not compatible with cuDNN 8.x (JetPack 5). If you are on JetPack 6 with cuDNN 9.x, 4.10 is safe.

Should I enable WITH_CUDNN in the cmake flags?

Yes, if you use OpenCV's DNN module for inference (cv2.dnn.readNet and related functions). WITH_CUDNN=ON + OPENCV_DNN_CUDA=ON enables GPU acceleration for DNN forward passes, which can be 5–10x faster than CPU on Orin. If you only use OpenCV for image processing (resize, warp, threshold) and not DNN inference, the cuDNN flags have no effect but are harmless to include.

Aarón Angulo, Co-Founder & CEO at ProventusNova

Written by

Aarón Angulo

Co-Founder & CEO · ProventusNova

Obsessed with client outcomes. Aarón ensures every engagement delivers real results, on time, on scope, no exceptions.

Connect on LinkedIn