Note

Sentis is now called Inference Engine. The documentation has moved to https://docs.unity3d.com/Packages/com.unity.ai.inference@latest. Refer to the new location for the latest updates and guidance. Make sure to update your bookmarks and references accordingly.

Understand the Sentis workflow

To use Sentis to run a neural network in Unity, follow these steps:

Use the Unity.Sentis namespace.
Load a neural network model file.
Create input for the model.
Create an inference engine (a worker).
Run the model with the input to compute a result (inference).
Get the result.

Tip

Use the Workflow example to understand the workflow applied to a simple example.

Use the Unity.Sentis namespace

To use the Unity.Sentis namespace, add the following to the top of your script:

using Unity.Sentis;

Load a model

Sentis can import model files in Open Neural Network Exchange (ONNX) format. To load a model, follow these steps:

Export a model to ONNX format from a machine learning framework or download an ONNX model from the Internet.
Add the model file to the Assets folder of the Project window.
Create a runtime model in your script:

ModelAsset modelAsset = Resources.Load("model-file-in-assets-folder") as ModelAsset;
var runtimeModel = ModelLoader.Load(modelAsset);

You can also add a public ModelAsset modelAsset as a public variable in GameObjects. In this case specify the model manually.

Refer to Import a model file for more information.

Create input for the model

Use the Tensor API to create a tensor with data for the model. You can convert an array or a texture to a tensor. For example:

// Convert a texture to a tensor
Texture2D inputTexture = Resources.Load("image-file") as Texture2D;
Tensor<float> inputTensor = TextureConverter.ToTensor(inputTexture);
// Convert an array to a tensor
int[] array = new int[] {1,2,3,4};
Tensor<int> inputTensor = new Tensor<int>(new TensorShape(4), array);

Refer to Create input for a model for more information.

Create an inference engine (a worker)

In Sentis, a worker is the inference engine. You create a worker to break down the model into executable tasks, run the tasks on the GPU or CPU, and retrieve the result.

For example, the following creates a worker that runs on the GPU using Sentis compute shaders:

Worker worker = new Worker(runtimeModel, BackendType.GPUCompute);

Refer to Create an engine for more information.

Schedule the model

To run the model, use the Schedule method of the worker object with the input tensor.

worker.Schedule(inputTensor);

Sentis schedules the model layers on the given backend. Execution is asynchronous, so after this is called, tensor operations may still be pending.

Refer to Run a model for more information.

Get the output

You can use methods such as PeekOutput to get the output data from the model. For example:

Tensor<float> outputTensor = worker.PeekOutput() as Tensor<float>;

Refer to Get output from a model for more information.