R language uses self-organizing mapping neural network (SOM) to segment customers
Original link: http://tecdat.cn/?p=18726 Original source: Tuo end data tribal official account_ Self organization_ Mapped neural network (SOM) is an unsupervised data visualization technology, which can be used to visualize high-dimensional data sets in low-dimensional (usually 2-dimensional) representation. In this article, we studied how to u ...
Posted by nystateofmind27 on Tue, 25 Jan 2022 12:52:28 +0100
Three common methods of t-test and examples used in Python
T-test is a statistical technique that can tell people how significant the difference between two groups of data is. It is achieved by comparing the semaphore (measured by the difference between samples or population averages) with the amount of noise (or change) in these samples. There are many useful articles that will tell you what t-tests a ...
Posted by Tremor on Sun, 23 Jan 2022 19:14:50 +0100
Feature Engineering - Data Preprocessing
1 exploratory data analysis
Data were used: Pima Indian diabetes prediction dataset. Link: https://pan.baidu.com/s/17M7UfnqGYTkpLmCXUOyTkw Extraction code: y4fz
import pandas as pd
pima_column_names = ['times_pregnant', 'plasma_glucose_concentration',
'diastolic_blood_pressure', 'triceps_thickness',
' ...
Posted by Dilbert137 on Sat, 22 Jan 2022 21:23:37 +0100
R language principal component analysis PCA spectral decomposition, singular value decomposition SVD prediction analysis of athlete performance data and dimensionality reduction visualization
Original link: http://tecdat.cn/?p=25067 This paper describes how to} perform principal component analysis (PCA) using R. You will learn how to} use PCA_ Forecast_ New individual and variable coordinates. We will also provide_ PCA results_ The theory behind it.There are two general methods to perform PCA in R:_ Spectral decomposition_ , check t ...
Posted by merebel on Sat, 22 Jan 2022 16:20:18 +0100
[entry 14] installing Huawei openGauss database on Windows -- compilation of openEuler virtual machine based on x86 architecture and JDBC connection of openGauss
1. Hype-V virtual machine installation openEuler
There are many virtual machine platforms, such as vmware, Hype-V, VirtualBox, etc. Considering the compatibility with wsl2, Hype-V is selected here to install the x86 architecture openEuler
Turn on the hype-v virtual machine Download the x86 architecture openEuler image Open the openEuler of ...
Posted by Nicoza on Fri, 21 Jan 2022 08:26:32 +0100
Linux jq and vim (notes)
1, jq
zcat client_log_20211226/*.gz | head -n 10000 | jq .event | sort | uniq -c
jq is a lightweight json processing command. json data can be sliced, filtered, mapped and transformed jq . Format and output json data
Common options
-c compact output json data-s reads all inputs into an array-r outputs the original string instead of a ...
Posted by AjBaz100 on Fri, 21 Jan 2022 08:03:28 +0100
Example analysis of simple crawler
Example analysis of simple crawler
With the exchange rate crawler demonstration written before
The website has been revised and this crawler code is invalid But let's put it out and give you an introduction to reptiles I am a novice, do not spray
import requests
from bs4 import BeautifulSoup
link = "https://www.msn. Cn / zh CN / money / curr ...
Posted by Lukeidiot on Thu, 20 Jan 2022 22:31:56 +0100
[algorithm competition learning] data analysis talent competition 1: visual analysis of user emotion
Competition background
Based on the analysis of network public opinion, the competition questions require players to conduct data analysis and visualization of brand issues according to users' comments. Through this competition question, we can guide the commonly used data visualization charts and data analysis methods to conduct exploratory d ...
Posted by neuro4848 on Thu, 20 Jan 2022 05:22:43 +0100
Clustering method in protocol reverse engineering for industrial protocols
1, Abstract
In this paper, we propose a method to analyze the private protocol structure, which can be used in industrial protocols. The method consists of six modules: traffic collection, message extraction, message size clustering, message similarity clustering, field extraction and session analysis. We use Schneider Modicon M580 to coll ...
Posted by ItsWesYo on Wed, 19 Jan 2022 17:45:51 +0100
[case] clustering algorithm
KMEANS clustering
https://www.cnblogs.com/pinard/p/6164214.html
1. Briefly describe the principle and workflow of K-means algorithm
K sample points are randomly selected as the initial centroid Calculate the distance from other samples to K centroids respectively, and divide each sample into the nearest cluster For the new cluster, calcul ...
Posted by Deviants on Wed, 19 Jan 2022 14:44:04 +0100