Blind spot limitation of autoregressive model pixel CNN and how to repair it

In the first two articles, we have introduced the autoregressive model pixelcnn and how to deal with multidimensional input data. In this article, we will focus on one of the biggest limitations of pixelcnn (i.e. blind spots) and how to improve it to repair it.In the first two articles, we introduced the concept of generating model PixelCNN and ...

Posted by thewooleymammoth on Tue, 28 Dec 2021 08:18:28 +0100

Summary of work training intelligent trash can 1

Summary 1 I was very lucky to participate in the 2021 engineering training provincial competition, which was the first provincial competition since my university. Although I didn't enter the national competition, I was quite satisfied with my achievements. The following are some problems we encountered during the preparation and our own so ...

Posted by rdoylelmt on Mon, 27 Dec 2021 14:12:56 +0100

YOLOV5 feature map visualization

Recently, when writing a paper, I encountered the problem of feature graph visualization, so I sorted the methods to solve this problem into notes. 1: Why visualization? It is often said that the essence of CNN is to extract features, but we don't know what features it extracts, which regions really play a role in recognition, or what the net ...

Posted by TwistedLogix on Mon, 27 Dec 2021 12:28:23 +0100

Yolo v5 model training

About Yolo 5's model training Author: fat orange preface With the continuous development of artificial intelligence, the technology of model training is becoming more and more important. Many people have opened the chapter of learning model training. Now I will introduce the basic content of model training. preparation data setlabelI ...

Posted by mmonaco on Mon, 27 Dec 2021 07:46:56 +0100

Deep learning of FTP on Linux

Introduction to FTP protocol: FTP protocol is a file transfer protocol, which acts on the application layer. Its function is to realize the file transfer between the server and the client (C/S architecture). The transmission uses the plaintext transmission of TCP connection (command channel and data flow channel). FTP protocol needs two TCP ha ...

Posted by amithn12 on Sun, 26 Dec 2021 19:03:45 +0100

Automatic generation of Chinese Tibetan poems based on LSTM

Like RNN neurons, LSTM neurons can maintain memory in their pipeline to allow solving sequence and time problems without disappearing gradient problems affecting their performance. Using the ancient poetry data set, the LSTM neural network model is used to train and automatically generate Chinese Tibetan head poetry. import pandas as pd i ...

Posted by axo on Sun, 26 Dec 2021 13:35:37 +0100

Reading notes Deep Learning for Computer Vision with Python - Volume III, Chapter 4 training AlexNet on ImageNet

Volume III Chapter IV training Alex net on ImageNet In the previous chapter, we discussed ImageNet dataset in detail; Specifically, the directory structure of the dataset and the supporting meta files used provide class labels for each image. We define two sets of files:         1. A configuration file ...

Posted by selliott on Sun, 26 Dec 2021 11:36:14 +0100

Yolov5 detection based on Region of Interest

Introduction of preface It must be easy for everyone to encounter such problems when doing yolo target detection. In fact, I don't want to detect the items in a part of the picture. It doesn't matter Just like this corner, I only want to detect the target of the front channel. This corner should be handed over to another camera for recogn ...

Posted by davidprogramer on Sun, 26 Dec 2021 03:08:55 +0100

Deep learning image classification of small data set based on KERAS in R language

Original link: http://tecdat.cn/?p=6714Original source: Tuo end data tribal official accountIt is common that image classification models must be trained with very little data, which may be encountered in practice if you conduct computer vision in a professional environment. "Few" samples can represent anywhere from hundreds to tens o ...

Posted by utherwun on Sat, 25 Dec 2021 10:29:59 +0100

12_ pytorch_ Autoencoder (self coding, unsupervised learning)

Pytorch autoencoder (self coding, unsupervised learning) 1, Compression and decompression There is a neural network. What it is doing is receiving a picture, then coding it, and finally restoring it from the coded picture Assuming that the neural network is like this, corresponding to the picture just above, we can see that the picture is ...

Posted by rudibr on Fri, 24 Dec 2021 10:46:24 +0100