{ "cells": [ { "cell_type": "markdown", "id": "58d0e877-51fa-431a-9c76-dd0fe1177861", "metadata": {}, "source": [ "## PP-LiteSeg Introduction\n", "\n", "Real-world applications have high demands for semantic segmentation methods. Although semantic segmentation has made remarkable leap-forwards with deeplearning, the precision and performance of the semantic model is not satisfactory.\n", "\n", "So, we propose PP-LiteSeg, a novel lightweight model for the real-time semantic segmentation task. Specifically, we present a Flexible and Lightweight Decoder (FLD) to reduce computation overhead of previous decoder. To strengthen feature representations, we propose a Unified Attention Fusion Module (UAFM), which takes advantage of spatial and channel attention to produce a weight and then fuses the input features with the weight. Moreover, a Simple Pyramid Pooling Module (SPPM) is proposed to aggregate global context with low computation cost.\n", "\n", "On the Cityscapes test set, PP-LiteSeg achieves 72.0% mIoU/273.6 FPS and 77.5% mIoU/102.6 FPS on NVIDIA GTX 1080Ti. PP-LiteSeg achieves a superior tradeoff between accuracy and speed compared to other methods.\n", "\n", "PP-LiteSeg model is officially produced by PaddlePaddle and is a SOTA model proposed by PaddleSeg. More information about PaddleSeg can be found here https://github.com/PaddlePaddle/PaddleSeg." ] }, { "cell_type": "markdown", "id": "55360c7a-3191-40bf-99c5-64c1c3d89967", "metadata": {}, "source": [ "## 2. Model Effects and Application Scenarios\n", "\n", "### 2.1 Real-Time Semantic Segmentation Tasks:\n", "\n", "#### 2.1.1 Datasets:\n", "\n", "The dataset is mainly Cityscapes, which is divided into training set and test set.\n", "\n", "#### 2.1.2 Model Effects:\n", "\n", "The segmentation effect of PP-LiteSeg on the image is:\n", "\n", "Original image:\n", "