GitHub - wjiang42/raspberry-pi-opencv

raspberry-pi-opencv

Abstract

The final project of my independent study is a portable object detection system designed to track objects using AI. From a Raspberry Pi 4 connected to a camera, the program detects a specific object and points the camera toward it, tracking it. Such a program has a variety of applications, such as moving cinematography, but a real world application will require extensive improvement to be effective.

Overview

The project consists broadly of physical components and the AI program itself. The Raspberry Pi 4 is connected to a Raspberry Pi Camera mounted on an Arducam rotating platform, which can pitch and yaw. The program reads camera footage, an AI object detection model detects the specified object - in the final code, a person - and rotates the camera toward it. All code is written in Python and the Raspberry Pi runs on Linux.

When the program was written, a custom model had not yet been trained. As such, an existing model trained on a variety of objects and instructed to react to a specific class of object was used instead. A variety of different models were used, but the most successful model was yolov3-tiny, a YOLO objection model, which was trained on the COCO dataset and can identify 80 classes of objects. yolov3-tiny was selected as, even among YOLO models, it is remarkably efficient (at a detriment to accuracy). Since the program was designed to run on a Raspberry Pi 4, efficiency was highly desirable.

The code necessary to control the servos of the pan-tilt platform was extracted from Arducam's servo demo program, and the program reads camera data and for the object detection model using OpenCV.

Analysis

While the program succeeds in its objective, it is flawed. Despite the speed YOLO offers, the model achieved a frame rate of only 5-7 fps, even after the input was downscaled to 128x128. The program also suffered input lag upwards of 2 seconds. This is almost certainly due to the relatively low computational power of Raspberry Pis, as the object detection model performed well on my laptop. The low resolution, poor frame rate, inaccuracy inherent to YOLO and outrageous delay mean that while the project works, it leaves much to be desired. Future improvements include installing an AI accelerator or using more powerful hardware and a custom model to mitigate the resource-intensive nature of object detection.

Components

Raspberry Pi 4
Raspberry Pi Camera V2
Arducam Pan-Tilt Camera Platform

Software

Google Colab
Gedit
VSCode

Raspberry Pi 4 with Arducam Pan-Tilt Platform

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
LICENSE		LICENSE
README.md		README.md
coco.names		coco.names
ssd_object_detection.py		ssd_object_detection.py
yolov3-tiny.cfg		yolov3-tiny.cfg
yolov3-tiny.weights		yolov3-tiny.weights

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

raspberry-pi-opencv

Abstract

Overview

Analysis

Components

Software

About

Releases

Packages

Languages

License

wjiang42/raspberry-pi-opencv

Folders and files

Latest commit

History

Repository files navigation

raspberry-pi-opencv

Abstract

Overview

Analysis

Components

Software

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages