docs.unity3d.com
Search Results for

    Show / Hide Table of Contents

    Create an engine to run a model

    To run a model, you need to create a worker. A worker is the engine that breaks the model down into runnable tasks. It schedules the tasks to run on a backend, such as the graphics processing unit (GPU) or central processing unit (CPU).

    Create a Worker

    Use new Worker(...) to create a worker. You must specify a backend type, which tells Sentis where to run the worker and a runtime model.

    For example, the following code creates a worker that runs on the GPU with Sentis compute shaders.

    using UnityEngine;
    using Unity.InferenceEngine;
    
    public class CreateWorker : MonoBehaviour
    {
        ModelAsset modelAsset;
        Model runtimeModel;
        Worker worker;
    
        void Start()
        {
            runtimeModel = ModelLoader.Load(modelAsset);
            worker = new Worker(runtimeModel, BackendType.GPUCompute);
        }
    }
    

    Backend types

    Sentis provides CPU and GPU backend types. To understand how Sentis runs operations with the different backends, refer to How Sentis runs a model.

    If a backend type doesn't support a Sentis layer in a model, the worker will assert. For more information, refer to Supported ONNX operators and Supported LiteRT operators.

    BackendType Usage
    BackendType.CPU - Faster than GPU for small models or when inputs/outputs are on the CPU.
    - On WebGL, Burst compiles to WebAssembly, which may result in slower performance. For more information, refer to Getting started with WebGL development.
    BackendType.GPUCompute - Generally the fastest backend for most models.
    - Avoids expensive data transfer when outputs remain on the GPU.
    - Uses DirectML for inference acceleration when running on DirectX12-supported platforms. For more information, refer to Supported ONNX operators.
    BackendType.GPUPixel - Use only on platforms that lack compute shader support.
    - Check platform support using SystemInfo.supportsComputeShaders.

    The speed of model performance depends on the platform's support for multithreading in Burst, its full support for compute shaders, and the resource usage of the game or application.

    To understand a model's performance, it’s important to Profile a model.

    Additional resources

    • Create a runtime model
    • How Sentis runs a model
    • Supported ONNX operators
    • Supported LiteRT operators
    • Run a model
    In This Article
    Back to top
    Copyright © 2025 Unity Technologies — Trademarks and terms of use
    • Legal
    • Privacy Policy
    • Cookie Policy
    • Do Not Sell or Share My Personal Information
    • Your Privacy Choices (Cookie Settings)