Python data analysis - plotting 2 - advanced Seaborn plotting 4 - categorical plots

1. Categorical scatter plots 1. stripplot Function: seaborn.stripplot Common parameters: x, y, hue: receive variable names from data that select the variables to plot; hue takes the categorical variable used to color the points. data: receives a DataFrame, array, list, or Series as the dataset to plot. order, hue_order: Re ...

Posted by Gamerz on Sat, 04 Dec 2021 22:34:19 -0800
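
As a rough illustration of the stripplot parameters named in the excerpt above, the sketch below calls seaborn.stripplot with x, y, hue, data, order, and hue_order. The DataFrame, its column names ("group", "value", "kind"), and the chosen orderings are invented for this example and are not taken from the original post.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical toy data, made up purely for illustration.
df = pd.DataFrame({
    "group": ["a", "a", "b", "b", "c", "c"],
    "value": [1.0, 1.5, 2.0, 2.4, 3.1, 2.9],
    "kind": ["x", "y", "x", "y", "x", "y"],
})

fig, ax = plt.subplots()
sns.stripplot(
    data=df,                 # dataset used for drawing
    x="group",               # categorical variable on the x axis
    y="value",               # numeric variable on the y axis
    hue="kind",              # categorical variable that colors the points
    order=["c", "b", "a"],   # order of the x categories
    hue_order=["y", "x"],    # order of the hue levels
    ax=ax,
)
```

Passing ax explicitly keeps the figure under the caller's control; calling fig.savefig(...) afterwards would write the plot to disk.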

Experiment 8 project case - e-commerce data analysis

Level 1: User churn statistics. Task description: using the user behavior data, write a MapReduce program to count churned users. Background: this is an intermediate-difficulty MapReduce programming exercise that simulates statistical analysis of e-commerce data in a real-world setting. Therefore, it is ...

Posted by kane007 on Sat, 04 Dec 2021 19:59:57 -0800

Python processes and threads, a hand-holding tutorial: one machine, many hands

How important are processes and threads? When you first start learning Python you may not notice, because the code you write simply runs from top to bottom, but that is really quite elementary. To make full use of the machine's resources and speed up a program, we often use multiple processes and multiple threads in actual develo ...

Posted by jonathandg on Mon, 29 Nov 2021 03:07:19 -0800

Python modular programming (higher-order functions)

Recursive functions: a recursive function is a function that calls itself inside its own body. A recursive function must have a terminating condition; otherwise it keeps calling itself until the calls nest too deeply and the stack overflows. A recursive call goes in layer by layer and returns layer by layer. Preliminary underst ...

Posted by aeonsky on Sat, 27 Nov 2021 22:31:34 -0800
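
The recursion excerpt above can be made concrete with a small sketch. The function names factorial and trace are hypothetical, chosen for this illustration; they are not from the original post. factorial shows the required terminating condition, and trace records how calls enter layer by layer and return layer by layer.

```python
def factorial(n: int) -> int:
    """Return n! recursively; n <= 1 is the terminating condition."""
    if n < 0:
        raise ValueError("n must be non-negative")
    if n <= 1:  # base case: stops the chain of calls
        return 1
    return n * factorial(n - 1)  # recursive case: the function calls itself


def trace(depth: int, limit: int, log: list) -> list:
    """Record the order in which nested calls are entered and returned from."""
    log.append(f"enter {depth}")
    if depth < limit:  # recurse only until the limit is reached
        trace(depth + 1, limit, log)
    log.append(f"return {depth}")
    return log


result = factorial(5)
order = trace(1, 3, [])
```

Without the base case, CPython stops the runaway calls by raising RecursionError once its recursion limit is reached.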

The three swordsmen of Linux: efficient data analysis

1. What are the three swordsmen of Linux? First of all, we want to know what the three swordsmen of Linux are. The first tool is grep, which searches each file for lines matching a pattern, that is, it searches the content by regular expression and prints the matching lines. The second tool is awk, whose name is formed from the initials of its three authors (Aho, Kern ...

Posted by Apollo_Ares on Sat, 27 Nov 2021 21:14:55 -0800

Hands-on data analysis -- building and evaluating models

1. Model construction 1.1 Get the modeling data # Read the raw data train = pd.read_csv('train.csv') # Read the cleaned data set data = pd.read_csv('clear_data.csv') 1.2 Select an appropriate model Before selecting a model, we need to know whether the task on this data set is supervised or unsupervised learning. Machine learning is mainly divided into ...

Posted by boon4376 on Thu, 25 Nov 2021 10:27:37 -0800

Introduction to pandas (conclusion) -- Mo Xiaotian with little talent and learning

Originally, I planned to go straight to hands-on practice. Then I thought I had better finish the essentials first; otherwise many things would be confusing in practice, and it would be tedious to keep going back to explain them. This article is the last installment explaining pandas. The three articles together should be enough to deal w ...

Posted by Tonka1979 on Wed, 24 Nov 2021 14:35:49 -0800

[Data analysis and visualization] Key points of data plotting 3 - spaghetti plots

Key points of data plotting 3 - spaghetti plots. Line charts with too many lines usually become unreadable. This kind of chart is generally called a spaghetti plot, and it can hardly convey any information about the data. Plotting example: let's take the evolution of female baby names in the United States from 1 ...

Posted by webguy262 on Tue, 23 Nov 2021 22:19:03 -0800

After adjusting box office data for the price level, is Changjin Lake still the box office champion?

0. Preface. First of all, this is purely for fun, and it is not rigorous in many places! Mankiw's Macroeconomics gives an example when discussing the CPI (consumer price index): Avatar ranked first at the box office at the time with US$761 million, but after considering inflation, a ...

Posted by ixalmida on Tue, 23 Nov 2021 12:23:54 -0800

Introduction to KMeans clustering:

1. Introduction to the KMeans algorithm: the K in the name KMeans stands for the number of categories, and Means stands for the mean of the samples in each category, so the KMeans algorithm is also called the k-means algorithm. KMeans uses distance as the measure of similarity between samples, and assigns samples with similar ...

Posted by narimanam on Mon, 22 Nov 2021 21:35:54 -0800
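
To make the KMeans excerpt above concrete, here is a minimal pure-Python sketch of the usual assign-then-average iteration, using Euclidean distance as the similarity measure. It is not the original post's code: the function name kmeans, the choice of the first k points as starting centroids, and the sample points are all assumptions made for this illustration.

```python
import math


def kmeans(points, k, iterations=20):
    """Cluster points into k groups by alternating assignment and averaging."""
    # Assumption: use the first k points as initial centroids (naive, deterministic).
    centroids = [tuple(p) for p in points[:k]]
    labels = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster with the nearest centroid.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its assigned points.
        new_centroids = []
        for j in range(k):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:
                new_centroids.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members))
                )
            else:
                new_centroids.append(centroids[j])  # keep an empty cluster's centroid
        if new_centroids == centroids:  # no change means the iteration has converged
            break
        centroids = new_centroids
    return centroids, labels


# Hypothetical sample: two well-separated groups of 2-D points.
points = [(0, 0), (10, 10), (0, 1), (1, 0), (10, 11), (11, 10)]
centroids, labels = kmeans(points, 2)
```

Here K is the number of clusters requested and each centroid is the mean of its members, which matches the two halves of the algorithm's name described in the excerpt.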