卷积神经网络预测实例_卷积神经网络预测代码资源-CSDN文库

共9个文件

xml：3个

py：2个

prototxt：1个

SSD

PYTHON

5星 · 超过95%的资源需积分: 48 4 浏览量 2018-11-07 17:02:12 上传评论 10 收藏 20.53MB RAR 举报

卷积神经网络（CNN，Convolutional Neural Network）是一种深度学习模型，特别适用于图像处理任务，如图像分类、目标检测和图像识别等。在本实例中，我们将探讨如何使用Python结合OpenCV3.4实现一个基于SSD（Single Shot MultiBox Detector）的目标检测系统。 SSD是一种高效的一体化目标检测算法，它能够在单个前向传播过程中同时完成多个对象的定位和分类，从而避免了多阶段检测器的复杂性。这个模型的特点在于其设计了一种新颖的损失函数，使得训练过程能够同时优化边界框回归和类别预测。在Python中实现SSD，我们需要依赖一些关键库，包括TensorFlow、Keras或PyTorch作为深度学习框架，以及OpenCV用于图像预处理和结果可视化。OpenCV是一个强大的计算机视觉库，包含多种图像处理和计算机视觉功能，例如图像读取、显示、变换、对象检测等。我们需要准备数据集，包括标注好的图像和对应的边界框信息。这些数据通常以XML或JSON格式存储，我们需要用特定的脚本将它们转换为适合训练的格式。数据预处理是深度学习中的重要步骤，包括图片缩放、归一化、增强等操作，以提高模型的泛化能力。接下来，我们构建SSD模型。该模型由多个卷积层和池化层组成，其中包括一些专门用于预测不同尺度和比例的物体的“特征层”。在每个特征层上，SSD会预测一系列固定的“锚点”（Anchor Boxes），这些锚点对应于不同大小和形状的物体，覆盖了可能的物体尺寸范围。训练SSD模型时，我们使用损失函数来衡量预测结果与真实标注的差异。这个损失函数通常包括两部分：分类损失和定位损失。分类损失用于评估模型对每个锚点是否包含物体的判断，定位损失则评估预测的边界框位置和大小与真实值的差距。在模型训练完成后，我们可以将其部署到测试集上进行验证，并生成检测效果。OpenCV的`cv2.DetectMultiScale`函数可以用来显示检测到的物体，通过绘制边界框和标注类别，我们可以直观地看到模型的性能。在提供的"Detect"文件中，可能包含了训练模型所需的代码、数据集、预训练模型权重等资源。通过运行这些文件，你可以体验到SSD在目标检测任务上的实际应用。理解并实践这个实例，不仅能帮助你掌握SSD的工作原理，还能提升你在深度学习和计算机视觉领域的技能。

资源推荐

资源详情

资源评论

收起资源包目录

Detect.rar （9个子文件）

Detect

real_time_object_detection.py 5KB

MobileNetSSD_deploy_0.prototxt 29KB

.idea

misc.xml 288B

workspace.xml 8KB

personDetect.iml 398B

inspectionProfiles

modules.xml 276B

yoloT.py 4KB

MobileNetSSD_deploy_0.caffemodel 22.08MB

timg.jpg 56KB

# USAGE # python real_time_object_detection.py --prototxt MobileNetSSD_deploy.prototxt.txt --model MobileNetSSD_deploy.caffemodel # import the necessary packages from imutils.video import VideoStream from imutils.video import FPS import numpy as np import argparse import imutils import time import cv2 # construct the argument parse and parse the arguments ap = argparse.ArgumentParser() ap.add_argument("-p", "--prototxt",default="MobileNetSSD_deploy_0.prototxt", help="path to Caffe 'deploy' prototxt file") ap.add_argument("-m", "--model",default="MobileNetSSD_deploy_0.caffemodel", help="path to Caffe pre-trained model") ap.add_argument("-c", "--confidence", type=float, default=0.2, help="minimum probability to filter weak detections") args = vars(ap.parse_args()) # initialize the list of class labels MobileNet SSD was trained to # detect, then generate a set of bounding box colors for each class CLASSES = ["background", "aeroplane", "bicycle", "bird", "boat", "bottle", "bus", "car", "cat", "chair", "cow", "diningtable", "dog", "horse", "motorbike", "person", "pottedplant", "sheep", "sofa", "train", "tvmonitor"] COLORS = np.random.uniform(0, 255, size=(len(CLASSES), 3)) # load our serialized model from disk print("[INFO] loading model...") net2 = cv2.dnn.readNetFromCaffe(args["prototxt"], args["model"]) # net2=cv2.dnn.readNetFromCaffe("VGG_SSD_300.prototxt","VGG_SSD_300.caffemodel") # net2=cv2.dnn.readNetFromTensorflow("face.pb") # initialize the video stream, allow the cammera sensor to warmup, # and initialize the FPS counter print("[INFO] starting video stream...") #vs = VideoStream(src=0).start() # vs =cv2.VideoCapture('C:\\Users\\voidking\\Desktop\\real-time-object-detection\\test_video.flv') # vs =cv2.VideoCapture('./test_video.flv') # vs =cv2.VideoCapture("video1.mp4") vs =cv2.VideoCapture('timg.jpg') time.sleep(2.0) fps = FPS().start() # loop over the frames from the video stream while True: # grab the frame from the threaded video stream and resize it # to have a maximum width of 400 pixels #frame = vs.read() #frame = imutils.resize(frame, width=400) # grab the frame from the threaded video file stream (grabbed,frame) = vs.read() # if the frame was not grabbed, then we have reached the end # of the stream if not grabbed: break frame = imutils.resize(frame, width=800) # grab the frame dimensions and convert it to a blob (h, w) = frame.shape[:2] blob = cv2.dnn.blobFromImage(cv2.resize(frame, (300, 300)), 0.007843, (300, 300), 127.5) # pass the blob through the network and obtain the detections and # predictions net2.setInput(blob) detections = net2.forward() # print(np.max(detections[0])) # print(detections) # loop over the detections for i in np.arange(0, detections.shape[2]): # extract the confidence (i.e., probability) associated with # the prediction confidence = detections[0, 0, i, 2] # filter out weak detections by ensuring the `confidence` is # greater than the minimum confidence idx = int(detections[0, 0, i, 1]) label = "{}: {:.2f}%".format(CLASSES[idx], confidence * 100) if confidence > args["confidence"]: if True: #if CLASSES[idx]=="person": # extract the index of the class label from the # `detections`, then compute the (x, y)-coordinates of # the bounding box for the object # idx = int(detections[0, 0, i, 1]) box = detections[0, 0, i, 3:7] * np.array([w, h, w, h]) (startX, startY, endX, endY) = box.astype("int") # draw the prediction on the frame cv2.rectangle(frame, (startX, startY), (endX, endY), COLORS[idx], 2) y = startY - 15 if startY - 15 > 15 else startY + 15 pix_person_height = endY - startY print ('pix_person_height = ', pix_person_height) print ('distance = ' , 174724 / pix_person_height) cv2.putText(frame, label, (startX, y), cv2.FONT_HERSHEY_SIMPLEX, 0.5, COLORS[idx], 2) # # extract the index of the class label from the # # `detections`, then compute the (x, y)-coordinates of # # the bounding box for the object # idx = int(detections[0, 0, i, 1]) # box = detections[0, 0, i, 3:7] * np.array([w, h, w, h]) # (startX, startY, endX, endY) = box.astype("int") # # # draw the prediction on the frame # # cv2.rectangle(frame, (startX, startY), (endX, endY), # COLORS[idx], 2) # y = startY - 15 if startY - 15 > 15 else startY + 15 # cv2.putText(frame, label, (startX, y), # cv2.FONT_HERSHEY_SIMPLEX, 0.5, COLORS[idx], 2) # # show the output frame cv2.imshow("Frame", frame) key = cv2.waitKey(1) & 0xFF # if the `q` key was pressed, break from the loop if key == ord("q"): break # update the FPS counter fps.update() # stop the timer and display FPS information fps.stop() print("[INFO] elapsed time: {:.2f}".format(fps.elapsed())) print("[INFO] approx. FPS: {:.2f}".format(fps.fps())) # do a bit of cleanup #销毁窗口 #cv2.destroyAllWindows()

评论收藏

内容反馈