Tutorial 2: AIPUBuilder Overview

The AIPUBuilder is the AI compiler to build from the NN model to AIPU binary executable file. _images/aipubuilder_overview.jpg

It includes the following components:

Parser: support Pytorch Tensorflow Tensorflow-lite ONNX models
GSim: hign level graph optimizer
OPT: quantization tool
- support auto quantization
- accuracy evaluation
- quantization
GBuilder:
- Graph compiler
- support multi-target
- memory allocation & optimization

Parser

_images/parser.jpg Parser has multiple front-end to parse different framework models.

GSIM is a IR to IR tool, supports float IR, quant IR or mixed IR.

GSIM also will act as intermediate stage between Parser/OPT/GBuilder.

_images/gsim.jpg

Support:

OPT is our quantization tool.

GraphBuilder(GBuilder) is the core NN compiler to build from the NN IR to AIPU binary code.

_images/gbuilder.jpg

Graph Lowering: transforms high-level IR into machine-specific libraries or code.
Graph Optimization:
- fuse block of ops to reduce overhead of itermediate data movement.
- auto-select the best library implementation for each operation.
Operator Plugin: integrate built-in high-performance TEC/AIFF libraries.
SRAM optimizer: memory allocation optimizer. A specific allocator to using internal/external SRAM.
Code Generator: code generator and linker produce the final executable code.