A practical, detailed introduction to NLP text classification
Contents
Preface
I. Data loading
1. Load packages
2. Read data
II. Text processing
1. Remove useless characters
2. Text segmentation
3. Remove stop words
4. Remove low-frequency words
5. Divide training set and test set
III. Convert text into vector form
1. Convert text into TF-IDF vectors
2. Convert text into word2vec vectors
3. Conv ...
Posted by adamwhiles on Fri, 28 Jan 2022 02:58:21 +0100
44_Pandas: converting categorical variables into dummy variables (get_dummies)
To convert categorical variables (categorical data, qualitative data) into dummy variables in pandas, use the pandas.get_dummies() function.
Categories stored as strings, such as gender, can be converted into 0 for men and 1 for women, and multi-class features can be convert ...
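As a minimal sketch of the conversion described above, the following uses pandas.get_dummies() on a small made-up table. The column names and values are hypothetical and are not taken from the original post.

```python
import pandas as pd

# Hypothetical toy data; column names and values are illustrative only.
df = pd.DataFrame({
    "gender": ["male", "female", "female"],
    "city": ["Tokyo", "Osaka", "Tokyo"],
})

# One indicator column per category value; dtype=int yields 0/1 instead of booleans.
dummies = pd.get_dummies(df, columns=["gender", "city"], dtype=int)

# For a two-class feature, a single indicator column is enough.
# Taking the "female" column gives 0 for men and 1 for women.
female_flag = pd.get_dummies(df["gender"], dtype=int)["female"]

print(dummies)
print(female_flag.tolist())
```

The multi-class column ("city" here) becomes one 0/1 column per distinct value, which is the usual form expected by models that cannot consume strings directly.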
Posted by dodgeqwe on Thu, 27 Jan 2022 18:42:21 +0100
Detailed explanation of label smoothing, with PyTorch and TensorFlow implementations
Definition: Label smoothing, like L1, L2 and dropout, is a regularization method in machine learning. It is usually used for classification problems. Its purpose is to prevent the model from predicting labels too confidently during training, which helps with poor generalization. Background: For the classification proble ...
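The excerpt is cut off before any formula appears, so the following is only a sketch of the commonly used uniform variant, in which a one-hot target y over K classes becomes (1 - epsilon) * y + epsilon / K. It uses NumPy rather than PyTorch or TensorFlow to stay framework-neutral; the function names and the value epsilon=0.1 are assumptions for illustration.

```python
import numpy as np

def smooth_labels(labels, num_classes, epsilon=0.1):
    """Turn integer class labels into smoothed target distributions.

    Uniform variant (an assumption, not necessarily the article's):
    (1 - epsilon) * one_hot + epsilon / num_classes
    """
    one_hot = np.eye(num_classes)[labels]
    return one_hot * (1.0 - epsilon) + epsilon / num_classes

def cross_entropy(probs, targets):
    """Mean cross-entropy between predicted probabilities and target distributions."""
    return float(-np.sum(targets * np.log(probs), axis=1).mean())

labels = np.array([0, 2])
hard = np.eye(3)[labels]
smoothed = smooth_labels(labels, num_classes=3, epsilon=0.1)

# A very confident (and correct) prediction for both samples.
confident = np.array([[0.98, 0.01, 0.01],
                      [0.01, 0.01, 0.98]])

loss_hard = cross_entropy(confident, hard)
loss_smooth = cross_entropy(confident, smoothed)
print(smoothed)
print(loss_hard, loss_smooth)
```

With smoothed targets, the same overconfident prediction incurs a larger loss than with hard one-hot targets, which is the mechanism that discourages extreme confidence. In practice, both frameworks expose this directly, for example through the label_smoothing argument of torch.nn.CrossEntropyLoss and of tf.keras.losses.CategoricalCrossentropy.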
Posted by runfastrick on Thu, 27 Jan 2022 14:28:53 +0100
Packaging a .py file into a Windows .exe with the Python packaging tool PyInstaller: process, pitfall notes, and a practical example
Contents
Packaging a .py file into a Windows .exe with the Python packaging tool PyInstaller: process, pitfall notes, and a practical example
Installation
Model building code
Packaging and pitfall notes
Installation
...
Posted by korion on Thu, 27 Jan 2022 02:03:49 +0100
Summary of neural network activation functions (2022.01)
Do not reprint! Activation functions generally have the following properties:
Nonlinearity: When the activation function is nonlinear, a two-layer neural network can approximate essentially all functions. However, if the activation function is the identity function, that is, f(x)=x, this property does not hold. Moreover, if the MLP use ...
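To illustrate why the identity activation f(x)=x breaks this property, here is a small NumPy sketch under assumed, randomly generated weights (the shapes and seed are arbitrary). It checks that two stacked layers with the identity activation equal one linear layer, while the same layers with ReLU do not.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical weights for a 3 -> 4 -> 2 network; shapes are arbitrary.
W1, b1 = rng.normal(size=(4, 3)), rng.normal(size=4)
W2, b2 = rng.normal(size=(2, 4)), rng.normal(size=2)
x = rng.normal(size=(5, 3))  # batch of 5 inputs

def identity(z):
    return z

def relu(z):
    return np.maximum(z, 0.0)

def two_layer(inputs, activation):
    """Forward pass: hidden layer with the given activation, then a linear output layer."""
    hidden = activation(inputs @ W1.T + b1)
    return hidden @ W2.T + b2

# Collapse both layers into one linear map: W = W2 W1, b = W2 b1 + b2.
W = W2 @ W1
b = W2 @ b1 + b2
single_layer = x @ W.T + b

identity_matches = np.allclose(two_layer(x, identity), single_layer)
relu_matches = np.allclose(two_layer(x, relu), single_layer)
print(identity_matches, relu_matches)
```

Because the identity network is always reducible to a single affine map, adding layers adds no expressive power; a nonlinear activation such as ReLU is what prevents this collapse.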
Posted by fatal on Wed, 26 Jan 2022 20:57:31 +0100
Python deep learning: theoretical foundations of machine learning, including the calculation of information entropy (reading notes)
In today's article, we formally turn to the theoretical basis of deep learning: machine learning.
I. Categories of machine learning
1. Classification by discipline: statistics, artificial intelligence, information theory, control theory. 2. Classification by learning mode: inductive learning, explanatory learning and feed ...
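The title mentions calculating information entropy, but the excerpt ends before that part. As a hedged sketch, the standard Shannon entropy of a discrete distribution, H(X) = -sum(p_i * log2(p_i)), can be computed from label frequencies as follows; the function name and sample data are hypothetical and may differ from what the article uses.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy in bits of the empirical distribution of `labels`."""
    counts = Counter(labels)
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

balanced = entropy(["yes", "yes", "no", "no"])    # two equally likely outcomes
pure = entropy(["yes", "yes", "yes", "yes"])      # a single outcome, no uncertainty
skewed = entropy(["yes", "yes", "yes", "no"])     # somewhere in between

print(balanced, pure, skewed)
```

Entropy is highest when the outcomes are equally likely and drops to zero when only one outcome occurs, which is why it is used to measure the impurity of a set of labels.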
Posted by ferdi on Tue, 25 Jan 2022 20:51:16 +0100
Customer segmentation in R with a self-organizing map (SOM) neural network
Original link: http://tecdat.cn/?p=18726 Original source: the Tecdat official account. The self-organizing map (SOM) neural network is an unsupervised data visualization technique that can be used to visualize high-dimensional data sets in a low-dimensional (usually 2-dimensional) representation. In this article, we studied how to u ...
Posted by nystateofmind27 on Tue, 25 Jan 2022 12:52:28 +0100
[Source code analysis] Deep learning distributed training framework Horovod: starting on Spark
[Source code analysis] Deep learning distributed training framework Horovod (9): starting on Spark
0x00 Summary
Horovod is an easy-to-use, high-performance distributed training framework released by Uber in 2017, which has been widely used in the industry.
This series walks you through Horovod by analyzing its source code. These a ...
Posted by updwebmaster on Tue, 25 Jan 2022 09:59:13 +0100
Chapter III: Model building and evaluation
Objective: predict passenger survival using the Titanic data set.
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from IPython.display import Image
# Use the SimHei font so that Chinese labels display correctly
plt.rcParams['font.sans-serif'] = ['SimHei']
# Render minus signs on the axes correctly when a non-default font is set
plt.rcParams['axes.unicode_minus'] = False ...
Posted by sidsel on Mon, 24 Jan 2022 20:30:06 +0100
Introduction to machine learning for programmers: common techniques in the training process
Introduction to machine learning for programmers (IV): common techniques in the training process
This article focuses on some common techniques for training machine learning models with PyTorch. Mastering them lets you get twice the result with half the effort.
Most of the code will be based on the last example in the previous article, that is ...
Posted by PDXDesigner on Mon, 24 Jan 2022 07:22:40 +0100