Course Outline

Introduction to Cambricon and MLU Architecture

  • Overview of Cambricon’s AI chip portfolio
  • MLU architecture and instruction pipeline
  • Supported model types and use cases

Installing the Development Toolchain

  • Installing BANGPy and Neuware SDK
  • Environment setup for Python and C++
  • Model compatibility and preprocessing

Model Development with BANGPy

  • Tensor structure and shape management
  • Computation graph construction
  • Custom operation support in BANGPy

Deploying with Neuware Runtime

  • Converting and loading models
  • Execution and inference control
  • Edge and data center deployment practices

Performance Optimization

  • Memory mapping and layer tuning
  • Execution tracing and profiling
  • Common bottlenecks and fixes

Integrating MLU into Applications

  • Using Neuware APIs for application integration
  • Streaming and multi-model support
  • Hybrid CPU-MLU inference scenarios

End-to-End Project and Use Case

  • Lab: Deploying a vision or NLP model
  • Edge inference with BANGPy integration
  • Testing accuracy and throughput

Summary and Next Steps

Requirements

  • An understanding of machine learning model structures
  • Experience with Python and/or C++
  • Familiarity with model deployment and acceleration concepts

Audience

  • Embedded AI developers
  • ML engineers deploying to edge or datacenter
  • Developers working with Chinese AI infrastructure
 21 Hours

Custom Corporate Training

Training solutions designed exclusively for businesses.

  • Customized Content: We adapt the syllabus and practical exercises to the real goals and needs of your project.
  • Flexible Schedule: Dates and times adapted to your team's agenda.
  • Format: Online (live), In-company (at your offices), or Hybrid.
Investment

Price per private group, online live training, starting from 4800 € + VAT*

Contact us for an exact quote and to hear our latest promotions

Upcoming Courses

Related Categories