Computer Vision - Programmer Think - where programmers share thinking

Computer Vision

Basic image operation and processing [Chapter 1 of computer vision]

I PIL:Python image processing library PIL(Python Imaging Library), which provides general image processing functions, has a large number of useful basic image operations, such as image scaling, clipping, rotation, color conversion and so on. The following will give specific examples of some PIL basic image operations. 1. Read an image and dis ...

Posted by TNIDBMNG on Thu, 10 Mar 2022 15:14:50 +0100

Tiff image processing 2_Python

Image data type conversion, image stretching and band judgment Through the pit... I won't mention it. Before the training of in-depth learning, when the data sets are almost completed, that is, downloading, labeling, cutting and screening are all finished. Sent to the network for training. Due to the inconsistency of data types, the prediction ...

Posted by f8ball on Wed, 09 Mar 2022 14:35:12 +0100

Full convolution network (FCN) practice: using FCN to realize semantic segmentation

Full convolution network (FCN) practice: using FCN to realize semantic segmentation FCN classifies images at pixel level, which solves the problem of semantic segmentation. Unlike the classical CNN, which uses the full connection layer to obtain a fixed length feature vector for classification after the convolution layer (full connection layer ...

Posted by purinkle on Wed, 09 Mar 2022 00:03:14 +0100

yolov5 training data set (ultralytics version)

0 preparation Environment configuration, reference hereDownload an official yolov5s PT model, placed in yolov5 root directory, download link here perhaps herePrepare the dataset in the format shown below. Each picture has its corresponding xml format annotation file. yolov5 ├── models ├── runs ├── utils ├── data │ ├── project01 │ │ ├── ...

Posted by chinstroker on Tue, 08 Mar 2022 16:59:09 +0100

[Python + + opencv + Python] license plate extraction, segmentation and recognition

If you want to complete the project, I will put the github link at the end of the article: You can see that the final identification license plate number is min G99999. In fact, I accidentally thought of doing a small project with c + + in the winter of the previous year, and then realized the license plate extraction and segmentation with c+ ...

Posted by ricta on Tue, 08 Mar 2022 05:50:40 +0100

Camera calibration for binocular vision

Binocular vision is based on geometric mathematics, and mathematical derivation is boring. Therefore, I will not introduce the mathematical principles here, but briefly describe the process of binocular vision. Binocular vision mainly includes five parts: camera calibration, image distortion correction, camera correction, image ...

Posted by ConnorSBB on Mon, 07 Mar 2022 23:07:11 +0100

Intelligent route planning of MyCobot manipulator head

I've been struggling whether to talk about the head movement first or the spatial coordinate system first. Later, I decided to talk about the head movement first, so that we can first feel how the manipulator is positioned in space, and then we will introduce in detail the spatial coordinates of the manipulator in the next section. Spatial coor ...

Posted by Labbat on Mon, 07 Mar 2022 12:46:58 +0100

UNet + + learning notes (backbone network + code)

paper 1 Abstract The article puts forward that UNet mainly has the following two defects: ① the optimal depth of the network is unknown, which requires a large number of experiments and integration of networks with different depths, resulting in low efficiency; ② skip connection introduces unnecessary restrictions, that is, it restricts fea ...

Posted by amazinggrace1983 on Sun, 06 Mar 2022 08:39:21 +0100

Customize the dataset and teach you to make your own VOC data reader

Link from AI Studio project https://aistudio.baidu.com/aistudio/projectdetail/2432755 Project background I recently found that there was no good example when I read the resnet50+FPN version of Faster-RCNN when I was preloading and loading data, when I read the class loaded by paddlepaddle's official data. So I did this today, how to make a d ...

Posted by Brad on Sat, 05 Mar 2022 09:42:50 +0100

[speech recognition] dial up speech recognition based on matlab GUI [including Matlab source code 1753]

1, Access code Get code method 1: The complete code has been uploaded to my resources: [speech recognition] dial up speech recognition based on matlab GUI [including Matlab source code 1753] Get code method 2: By subscribing to the payment column of zijishenguang blog, private bloggers can obtain this code with payment vouchers. remarks: If ...

Posted by TheTitans on Sat, 05 Mar 2022 07:16:47 +0100

Hot Topics