The Flink real-time computing platform has been built since half a year. Elasticsearch has been used in some storage layers and contacted Flink from scratch. In the past half a year, many pits have been encountered, which has changed from traditional development to big data development. Elasticsearch contains a variety of fuses to prev ...
Posted by banzaimonkey on Mon, 06 Dec 2021 15:50:31 -0800
1) ID number
2) Diagnosis (M = malignant, B = benign)
Calculate 10 real value characteristics of each nucleus:
a) Radius (average distance from center to perimeter)
b) Texture (standard deviation of gray value)
e) Smoothness (local variation of radius length)
f) Compactness (perimeter ^ ...
Posted by ajlisowski on Sun, 05 Dec 2021 02:35:38 -0800
Data parameters OrderNumber: customer nickname LineNumber: purchase order. For example, the first three lines respectively represent three goods purchased by the same customer Model: trade name
Application of intelligent algorithm recommendation of association rules based on shopping basket.
Three basic ...
Posted by Skara on Sat, 04 Dec 2021 22:41:39 -0800
Level 1: Statistics of user churn
This task: according to the user behavior data, write MapReduce program to count the loss of users.
This training is an intermediate difficulty MapReduce programming exercise, which simulates the statistical analysis of e-commerce data in real scenes. Therefore, it is ...
Posted by kane007 on Sat, 04 Dec 2021 19:59:57 -0800
1, Installation package download
Extraction code: q8mb
Note: the above installation package is a zip package, which contains all the packages for installing CDH6.1. After downloading, unzip it
2, Upload the installation package to the master node
Note: the installation directory here sh ...
Detailed explanation of data warehouse and data mart: ODS, DW, DWD, DWM, DWS, ADS
Project requirements and architecture design of shangsilicon Valley data warehouse
Shang Silicon Valley data warehouse: 2 data warehouse layering + dimension modeling
Construction of 3 data warehouses in the actual combat of shangsilicon Valley ...
Posted by mogster on Fri, 03 Dec 2021 05:27:35 -0800
It is said that clickhouse is a columnar storage database used in olap scenarios with a large amount of data. Fortunately, it can also be used in actual scenarios. Let's talk about the simple experience of using this article.
1. Overall description
Not much about architecture, column storage, large amount of data and high performance. See off ...
Posted by arlabbafi on Fri, 03 Dec 2021 00:26:30 -0800
Shell command line interpreter, Linux scripting language
Common system variables: $HOME $PWD $SHELL $USERStrict space rules
There must be no spaces on either side of the equal signThere are spaces in the variable. You can wrap it with "" or ()There must be a space between expr operatorsThere should b ...
Posted by bumbar on Wed, 01 Dec 2021 13:51:37 -0800
1, AsyncDispatcher overview
As an asynchronous event scheduler in yarn, AsyncDispatcher is a component of scheduling events based on blocking queue in RM. It dispatches events in a specific single thread and sends the dispatched events to the corresponding EventHandler event processor registered in AsyncDispatcher for processi ...
Posted by cyberrate on Wed, 01 Dec 2021 07:10:26 -0800
@Warehouse notesDetailed explanation of data warehouse and data mart: ODS, DW, DWD, DWM, DWS, ADSProject requirements and architecture design of shangsilicon Valley digital warehouse practice 1Shang Silicon Valley digital warehouse actual combat 2 digital warehouse layered + dimensional modelingConstruction of 3 digital warehouses in shangsilic ...