paddlepaddle/paddle-lite

★ 7,255C++Audience · developerComplexity · 4/5Setup · moderate

Mindmap

mindmap
  root((paddle-lite))
    What it does
      On-device AI inference
      Model optimization
      Edge deployment
    Optimization Techniques
      Quantization
      Operator fusion
      Model conversion
    Supported Hardware
      Android phones
      iOS devices
      ARM embedded boards
      AI accelerator chips
    Developer APIs
      C++ library
      Java bindings
      Python bindings

mindmap root((paddle-lite)) What it does On-device AI inference Model optimization Edge deployment Optimization Techniques Quantization Operator fusion Model conversion Supported Hardware Android phones iOS devices ARM embedded boards AI accelerator chips Developer APIs C++ library Java bindings Python bindings

Click or tap to explore — scroll the page freely

Things people build with this

USE CASE 1

Run an image classification or object detection model on an Android phone without a server

USE CASE 2

Deploy a PaddlePaddle or converted TensorFlow model to an ARM embedded Linux board

USE CASE 3

Shrink and speed up a deep learning model using quantization for resource-constrained edge devices

Tech stack

C++JavaPythonAndroidiOS

Getting it running

Difficulty · moderate Time to first run · 1h+

Requires downloading pre-built platform libraries or compiling from source for the target device architecture.

License not mentioned in the explanation.

In plain English

Paddle Lite is an inference engine from Baidu's PaddlePaddle team designed to run AI models on phones, embedded devices, and other edge hardware where resources like memory and processing power are limited. Inference means taking a trained AI model and using it to make predictions on new data, which is a different and lighter task than training the model in the first place. The tool takes a model trained with the PaddlePaddle deep learning framework and optimizes it for a specific target device. This optimization step applies techniques like quantization, which reduces the numerical precision of the model's weights to shrink file size and speed up computation, and operator fusion, which combines multiple processing steps into single efficient operations. Models from other frameworks such as TensorFlow and PyTorch can be converted to the required format using a companion conversion tool before being passed through this same pipeline. Once optimized, the model can run on a very wide range of hardware. The list of supported targets includes Android and iOS phones, standard ARM-based embedded Linux boards, and specialized AI accelerator chips from manufacturers including Qualcomm, Huawei, MediaTek, Cambricon, and Intel. This breadth is one of Paddle Lite's distinguishing characteristics: a single development workflow can target many different devices without rewriting the application code. Developer APIs are available in C++, Java, and Python. Pre-built libraries for common platforms are available for direct download, which avoids the need to compile from source. Demo Android apps covering tasks like image classification, object detection, face landmark detection, and face mask detection are available as downloadable APK files for quick testing. The README is written primarily in Chinese, with an English version available in a separate file.

Copy-paste prompts

Prompt 1

I have a PaddlePaddle image classification model. Walk me through optimizing it with Paddle Lite for Android deployment, including quantization settings and generating the .nb model file.

Prompt 2

Write a Java Android Activity that loads a Paddle Lite .nb model file and runs inference on a bitmap image from the camera, returning the top classification result.

Prompt 3

How do I convert a PyTorch model to run with Paddle Lite? List all conversion tools and steps required, including any format intermediate steps.

Prompt 4

What quantization options does Paddle Lite support, INT8, FP16, FP32, and how do they trade off accuracy versus inference speed on a Qualcomm mobile chip?

Open on GitHub → Explain another repo

← paddlepaddle on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.