Configure Linux Cluster time synchronization (ntp)

Environmental Science:Now there are three virtual machines of CentOS 7, namely, Hadoop 102, Hadoop 103 and Hadoop 104. Now configure ntp service (central server) on Hadoop 102. Hadoop 103 and 104 pull the time on Hadoop 102 respectively 1. Check whether ntp is installed (check and install all 3 sets, ...

Posted by shaoen01 on Thu, 30 Jan 2020 08:16:00 -0800

HBase? From introduction to Java client development

Article directory Introduction to HBase HBase introduction HBase HBase architecture RegionServer cluster structure HBase logical storage structure HBase physical storage structure HBase installation preparation in advance HBase installation steps Hbase boot HBase verification Backup master HBase ...

Posted by sager29 on Thu, 30 Jan 2020 01:18:29 -0800

HBase1.1.x is deployed on Hadoop 2.6.0 (partial distribution of three vm virtual machines)

HBase1.1.x is deployed on Hadoop 2.6.0 (partial distribution of three vm virtual machines) download hbase.apache official website Configure environment variable / etc/profile #hbase export HBASE_HOME=/usr/local/soft/hbase-1.1.5 export PATH=$PATH:$HBASE_HOME/bin Configure $HBase? Home / conf / h ...

Posted by kobayashi_one on Wed, 29 Jan 2020 02:19:51 -0800

Hadoop learning Chapter 2 HDFS

Hadoop learning Chapter 2 HDFS Chapter II HDFS HDFS overview HDFS definition Advantages and disadvantages of HDFS HDFS architecture HDFS file block size (interview focus) Common command practice HDFS client operation (development focus) HDFS client environment preparation HDFS file download HDFS ...

Posted by goa103 on Mon, 27 Jan 2020 03:14:04 -0800

4.5 billion data migration records follow-up-to-day data volume levels of 10 million to clickhouse

4.5 billion data migration records follow-up-to-day data volume levels of 10 million to clickhouse Related Document Address flume Reference Address waterdrop Reference Address clickhouse Reference Address kafka Reference Address The environment log is on one server and the clickhouse cluster is on the other. thinking Previously, when m ...

Posted by gszauer on Mon, 20 Jan 2020 18:52:36 -0800

Quickly read very large tables as data frames

I have a very large table (30 million rows), and I want to load it as a data frame in R. read.table() has many convenient functions, but it seems that there are many logic in the implementation that will slow things down. As far as I'm concerned, I'm assuming that I know the type of column in advance, that the table doesn't contain any column ...

Posted by Johnlbuk on Fri, 17 Jan 2020 07:09:11 -0800

The trap of Broadcast Join in SparkSql 2.x (hint does not work)

Problem description Use hint to specify the broadcast table, but cannot perform the specified broadcast; preparation in advance hive> select * from test.tmp_demo_small; OK tmp_demo_small.pas_phone tmp_demo_small.age 156 20 157 22 158 15 hive> analyze table test.tmp_demo_small compute statis ...

Posted by cbullock on Fri, 17 Jan 2020 06:02:22 -0800

How does Elasticsearch handle data that has an association?

Three main paradigms of relational databases What is a paradigm? A paradigm is the rule of data modeling. First paradigm: Ensure that each column maintains atomicity. All fields in a database table are indivisible atomic values. Second paradigm: Ensure that each column in the table is related to the ...

Posted by brmcdani on Tue, 14 Jan 2020 16:43:11 -0800

Hadoop Part 2: mapreedce

Mapreedce (3) Project address: https://github.com/KingBobTitan/hadoop.git MR's Shuffle explanation and Join implementation First, review 1. MapReduce's history monitoring service: JobHistoryServer Function: used to monitor the information of all MapReduce programs running on YARN Configure log ...

Posted by nick1 on Tue, 14 Jan 2020 02:21:13 -0800

The construction of zookeeper single machine and cluster environment

1. Preface 19 years passed pretty fast. As the first one of 20 years, I want to make a simple summary here for the previous year: seriously, 19 years passed pretty miserable. Yes, the conclusion is this sentence. After all, the sense of ceremony must still exist. Well, let's not talk about it. I hope it's better after 20 years. Save more money ...

Posted by randydg on Tue, 07 Jan 2020 02:51:34 -0800