Python data analysis - drawing-2-Seaborn advanced drawing-4-classification diagram
1, Classified scatter diagram
1.stripplot
Function: seaborn.stripplot
Common parameters:
x,y,hueReceive the variable name in data to represent the selected drawing variable, hue pass in the classification variable to classify the color.dataReceive DataFrame, array, list and series to represent the data set used for drawing.order,order_hueRe ...
Posted by Gamerz on Sat, 04 Dec 2021 22:34:19 -0800
Experiment 8 project case - e-commerce data analysis
Level 1: Statistics of user churn
Task description
This task: according to the user behavior data, write MapReduce program to count the loss of users.
Relevant knowledge
This training is an intermediate difficulty MapReduce programming exercise, which simulates the statistical analysis of e-commerce data in real scenes. Therefore, it is ...
Posted by kane007 on Sat, 04 Dec 2021 19:59:57 -0800
Python process and thread nanny teaching, a machine with multiple hands
How important are process threads? At the beginning of learning Python, you may not feel it, because the code you write can be executed from top to bottom, but in fact, it is very elementary. In order to make full use of the computer configuration to speed up the program progress, we often use multi process and multi thread when actually develo ...
Posted by jonathandg on Mon, 29 Nov 2021 03:07:19 -0800
Python modular programming (higher order functions)
Recursive function
Recursive function is to define a function, and then call this function within the function There must be an end in the recursive function, or it will be called one by one until more layers are called and the stack overflows A recursive function is an entry layer by layer and a return layer by layer
Preliminary underst ...
Posted by aeonsky on Sat, 27 Nov 2021 22:31:34 -0800
Linux three swordsmen: efficient data analysis
1. What is the Linux three swordsman
First of all, we want to know what the Linux three swordsmen are?
The first tool is grep, which performs pattern pattern in each file or matching line, that is, searches the content according to the regular expression and prints it out
The second tool is awk, which is abbreviated by three authors (aho, Kern ...
Posted by Apollo_Ares on Sat, 27 Nov 2021 21:14:55 -0800
Data analysis of hands-on learning -- establishment and evaluation of model
1. Model construction
1.1 get modeling data
#Read raw data
train = pd.read_csv('train.csv')
#Read cleaned data set
data = pd.read_csv('clear_data.csv')
1.2 select appropriate model
Before model selection, we need to know whether the data set is finally supervised learning or unsupervised learning
Machine learning is mainly divided into ...
Posted by boon4376 on Thu, 25 Nov 2021 10:27:37 -0800
Introduction to pandas (conclusion) -- Mo Xiaotian with little talent and learning
Originally, I was going to go directly to the actual combat. Later, I thought, I'd better finish the necessary things first. Otherwise, many things in the actual combat will be confused and it will be troublesome to talk recursively.
This article is the last content explanation of pandas. The three connected together should be enough to deal w ...
Posted by Tonka1979 on Wed, 24 Nov 2021 14:35:49 -0800
[data analysis and visualization] key points of data drawing 3 - spaghetti map
Key points of data mapping 3 - spaghetti map
Broken line diagrams with too many lines usually become unreadable. This kind of diagram is generally called spaghetti diagram. Therefore, this kind of chart can hardly provide information about the data.
Drawing example
Let's take the evolution of female baby names in the United States from 1 ...
Posted by webguy262 on Tue, 23 Nov 2021 22:19:03 -0800
Using the price level to correct the box office data, is Changjin Lake still the box office champion?
Zero, write in front
First of all, it is purely entertainment, and there are many loose places!!
Mankiw's macroeconomics mentioned an example when talking about the CPI (consumer price index), that is, avatar ranked first at the box office with us $761 million at that time, but after considering inflation, a ...
Posted by ixalmida on Tue, 23 Nov 2021 12:23:54 -0800
Introduction to KMmeans clustering learning:
1, Introduction to KMeans algorithm:
K in the name of KMeans algorithm represents the number of categories, and Means represents the mean value of samples in each category. Therefore, KMeans algorithm is also called k-Means algorithm. KMeans algorithm takes distance as the measure of similarity between samples, and assigns samples with similar ...
Posted by narimanam on Mon, 22 Nov 2021 21:35:54 -0800