Configure Linux Cluster time synchronization (ntp)
Environment: there are three CentOS 7 virtual machines, namely Hadoop 102, Hadoop 103 and Hadoop 104. Configure the ntp service (central server) on Hadoop 102; Hadoop 103 and Hadoop 104 each pull their time from Hadoop 102.
1. Check whether ntp is installed (check and install on all 3 machines, ...
Posted by shaoen01 on Thu, 30 Jan 2020 08:16:00 -0800
HBase: from introduction to Java client development
Article directory
Introduction to HBase
HBase introduction
HBase
HBase architecture
RegionServer cluster structure
HBase logical storage structure
HBase physical storage structure
HBase installation
Preparation
HBase installation steps
HBase startup
HBase verification
Backup master
HBase ...
Posted by sager29 on Thu, 30 Jan 2020 01:18:29 -0800
HBase1.1.x is deployed on Hadoop 2.6.0 (partial distribution of three vm virtual machines)
download
hbase.apache official website
Configure environment variables in /etc/profile
#hbase
export HBASE_HOME=/usr/local/soft/hbase-1.1.5
export PATH=$PATH:$HBASE_HOME/bin
Configure $HBASE_HOME/conf/h ...
Posted by kobayashi_one on Wed, 29 Jan 2020 02:19:51 -0800
Hadoop learning Chapter 2 HDFS
Chapter II HDFS
HDFS overview
HDFS definition
Advantages and disadvantages of HDFS
HDFS architecture
HDFS file block size (interview focus)
Common command practice
HDFS client operation (development focus)
HDFS client environment preparation
HDFS file download
HDFS ...
Posted by goa103 on Mon, 27 Jan 2020 03:14:04 -0800
Migrating 4.5 billion records to clickhouse, followed by daily data volumes on the order of 10 million
Related Document Address
flume Reference Address
waterdrop Reference Address
clickhouse Reference Address
kafka Reference Address
Environment: the logs are on one server and the clickhouse cluster is on another.
Approach
Previously, when m ...
Posted by gszauer on Mon, 20 Jan 2020 18:52:36 -0800
Quickly read very large tables as data frames
I have a very large table (30 million rows) that I want to load as a data frame in R. read.table() has a lot of convenient features, but there seems to be a lot of logic in its implementation that slows things down. In my case, I am assuming that I know the column types in advance, that the table doesn't contain any column ...
Posted by Johnlbuk on Fri, 17 Jan 2020 07:09:11 -0800
The trap of Broadcast Join in SparkSql 2.x (hint does not work)
Problem description
A hint is used to specify the broadcast table, but the specified broadcast is not performed.
Preparation
hive> select * from test.tmp_demo_small;
OK
tmp_demo_small.pas_phone tmp_demo_small.age
156 20
157 22
158 15
hive> analyze table test.tmp_demo_small compute statis ...
Posted by cbullock on Fri, 17 Jan 2020 06:02:22 -0800
How does Elasticsearch handle data that has an association?
The three normal forms of relational databases
What is a normal form? A normal form is a rule for data modeling.
First normal form: ensure that each column is atomic.
All fields in a database table are indivisible atomic values.
Second normal form: ensure that each column in the table is related to the ...
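The atomicity rule above (every field holds one indivisible value) can be illustrated with a minimal sketch. The rows, the `user_id` and `phones` field names, and the `to_atomic_rows` helper are all hypothetical, made up for illustration rather than taken from the original post.

```python
# Hypothetical rows: "phones" packs several values into one field,
# so the column is not atomic.
raw_rows = [
    {"user_id": 1, "phones": "156,157"},
    {"user_id": 2, "phones": "158"},
]


def to_atomic_rows(rows):
    """Split the multi-valued "phones" field so each row holds one value."""
    atomic = []
    for row in rows:
        for phone in row["phones"].split(","):
            atomic.append({"user_id": row["user_id"], "phone": phone.strip()})
    return atomic


atomic_rows = to_atomic_rows(raw_rows)
for row in atomic_rows:
    print(row)
```

After the split, each row carries exactly one phone value, so a query can filter or join on `phone` without parsing a string.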
Posted by brmcdani on Tue, 14 Jan 2020 16:43:11 -0800
Hadoop Part 2: MapReduce
MapReduce (3)
Project address: https://github.com/KingBobTitan/hadoop.git
MR's Shuffle explanation and Join implementation
First, review
1. MapReduce's history monitoring service: JobHistoryServer
Function: used to monitor the information of all MapReduce programs running on YARN
Configure log ...
Posted by nick1 on Tue, 14 Jan 2020 02:21:13 -0800
The construction of zookeeper single machine and cluster environment
1. Preface
2019 went by pretty fast. As my first post of 2020, I want to give a simple summary of the previous year: seriously, 2019 was pretty miserable. Yes, that one sentence is the whole summary. After all, a sense of ceremony still matters.
Well, enough of that. I hope 2020 turns out better. Save more money ...
Posted by randydg on Tue, 07 Jan 2020 02:51:34 -0800