Exploration of multiple linear regression

The data is still state Take x77 data set as an example to explore the relationship between a state's crime rate and other factors, including population, illiteracy rate, average income and frost days (the average days when the temperature is below freezing). Label explanation: Murder crime rate, Population, Illiteracy illiteracy rate, Inc ...

Posted by superhoops on Sat, 05 Feb 2022 10:12:24 +0100

Share 2 scripts for Python processing Excel

1, Write in front It comes from the study / work needs of two readers. Coincidentally, the needs of these two readers are about batch merging sheet s (information needed for retrieval). The data style used in this paper can be disclosed with the consent of the reader. The data in the file is generated by the Rand function in Excel to ensur ...

Posted by astribuncio on Wed, 02 Feb 2022 17:00:12 +0100

Headmaster Wang's flirting with his sister can't be hammered? No matter how rich you are, licking a dog is just licking a dog! Python grabs microblog comments and eats melons!

Hello, everyone. I'm spicy~ As the title shows, we must have been bombed by President Wang's melon these two days. There have been several rounds of hot searches on the microblog. I also eat with relish. It's rare to see President Wang eat flat in front of girls. In addition, I chatted with a friend about the problems encountered in the collec ...

Posted by ugriffin on Tue, 01 Feb 2022 14:29:42 +0100

Sample sampling processing

1. Background In the recommendation system, the common targets are ctr and cvr. The data of these two targets are tilted seriously. If the sample is not sampled, the model is easy to learn bias, resulting in unstable online effect and poor generalization ability. 2. Sample sampling and processing 2.1 sample purification Usually, the pulled ...

Posted by krabople on Tue, 01 Feb 2022 14:01:31 +0100

LightGBM parameter adjustment: use OpenBox open source black box optimization system

brief introduction This paper will introduce how to use OpenBox open source black box optimization system to optimize the super parameters of LightGBM model. OpenBox is an open source system designed for black box optimization (project address: https://github.com/thomas-young-2013/open-box ), based on Bayesian optimization, the black bo ...

Posted by daprezjer on Tue, 01 Feb 2022 12:06:46 +0100

How to eliminate Multicollinearity

introduce Machine learning is a method to solve problems that cannot be clearly coded, such as classification problems. The machine learning model will learn a pattern from the data, so we can use it to determine which class the data belongs to. But there's a problem. How does this model work? Some people cannot accept a model with good perfo ...

Posted by wolfan on Sun, 30 Jan 2022 00:45:57 +0100

10 basic Python usages that are easy to be ignored but should be mastered

I've been writing code all my life, but I've never mastered the essence of coding. In most cases, I use Visual Basic because I am most comfortable with VB. At the same time, I have a little knowledge of other languages (R, C, JavaScript, Applescript, Hypertext and BASIC learned in 1979). A few years ago, I decided to use Python only to improve ...

Posted by maxime on Sat, 29 Jan 2022 17:57:01 +0100

44_Pandas converts classified variables into dummy variables (get_dummies)

44_Pandas converts classified variables into dummy variables (get_dummies) To convert categorical variables (categorical data, qualitative data) into virtual variables in pandas, use pandas get_ Dummies() function. Data such as gender classified by string can be converted into 0 for men and 1 for women, and multi class features can be convert ...

Posted by dodgeqwe on Thu, 27 Jan 2022 18:42:21 +0100

Matplotlib data visualization foundation pandas statistical analysis foundation

1. Analyze the relationship between the characteristics of population data from 1996 to 2015. The population data has a total of six characteristics, namely, year-end population, male population, female population, urban population, rural population and year. Looking at the changes of various characteristics over time, we can analyze the propo ...

Posted by don_s on Thu, 27 Jan 2022 03:53:18 +0100

Python packaging tool Pyintealler packaging py file is windows exe file process and stepping on pit record + practical example

Python packaging tool Pyintealler packaging py file is windows exe file process and stepping on pit record + practical example catalogue Python packaging tool Pyintealler packaging py file is windows exe file process and stepping on pit record + practical example install Model building code Packing and pit stepping records install ...

Posted by korion on Thu, 27 Jan 2022 02:03:49 +0100