Basic Kafka Shell Commands (Including Creating, Deleting, and Modifying Topics)

Create a Kafka topic; list all topics; view information for a specified topic; produce data to a topic from the console; consume topic data from the console; view the maximum (or minimum) offset of a partition in a topic; increase the number of partitions of a topic; delete a topic (use with caution: this deletes only the metadata in ZooKeeper, and the message files must be deleted m ...

Posted by kaisaj on Mon, 27 May 2019 12:14:22 -0700

More Hue and Oozie Pitfalls: Workflow and Coordinator Finally Running

Earlier posts summarized some pitfalls of sqoop1, Oozie, and HBase under Hue. With the project deadline today, the Oozie workflow had to be finished and scheduled to run on time. 1. The pitfalls of importing from and exporting to MySQL with Sqoop were covered earlier. Later, it was found that, apart from CDH (5.15), sqoop1 was not automatically configured. ...

Posted by Ree on Sat, 18 May 2019 22:13:03 -0700

UDF, UDTF, and UDAF in Hive

Create a new table, apachelog, in Hive: CREATE TABLE apachelog ( host STRING, identity STRING, user STRING, time STRING, request STRING, status STRING, size STRING, referer STRING, agent STRING) ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.RegexSerDe' WITH SERDEPROPERTIES ( "input.regex" = "([^ ] ...

Posted by DJH on Fri, 17 May 2019 11:36:59 -0700

Compiling Hive from Source to Support UDF Functions

Table of contents: 1. Download the source code; 2. Compile with UDF support; 2.1 Upload and unzip; 2.2 Add the UDF function class; 2.3 Register the function; 2.4 Compile Hive; 3. Deploy and install; 4. Test the UDF. Tip: To reduce the chance of Maven compilation errors on the server, you can choose to open the so ...

Posted by alant on Thu, 09 May 2019 08:26:40 -0700

Using LZO Compression with Splitting Support in Hadoop

1. Introduction. Install LZO: LZO does not ship with Linux systems, so you need to download and install the packages. At least three packages are required: lzo, lzop, and hadoop-gpl-packaging. Add an index: the main function of hadoop-gpl-packaging is to create an index for LZO-compressed files. Otherwise, whether the compressed file is la ...

Posted by oaf357 on Tue, 23 Apr 2019 17:00:34 -0700

Converting the Hive Database Storage Format to ORC

Hive storage formats. TextFile: Hive's default storage format. Storage: row-based. High disk overhead and data-parsing overhead. Compressed text files cannot be merged or split by Hive. SequenceFile: a binary file format that serializes data as key-value pairs. Storage: row-based. Splittable and compressible. Block compression is generally chosen ...

Posted by WendyLady on Sun, 21 Apr 2019 18:30:35 -0700

Strict ID Card Number Validation with a Hive UDF

At work, the ID numbers in a Hive table needed strict validation. Because the last character is a check digit, a simple regular expression cannot do the job, so a UDF is used to implement it. I have only implemented the functionality without in-depth optimization; comments are welcome, under the guidance of h ...

Posted by carlg on Wed, 17 Apr 2019 20:06:33 -0700
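
The check-digit idea in the entry above can be sketched as follows. This is only an illustration under stated assumptions, not the post's UDF: the excerpt does not say which ID format is meant, so the sketch assumes the 18-character Chinese resident ID number with its ISO 7064 MOD 11-2 check character. It is written in Python rather than as a Hive UDF, and the names `expected_check_char` and `is_valid_id` are hypothetical.

```python
# Assumption: 18-character resident ID numbers, where the first 17 characters
# are digits and the last is a MOD 11-2 check character (0-9 or X).
WEIGHTS = (7, 9, 10, 5, 8, 4, 2, 1, 6, 3, 7, 9, 10, 5, 8, 4, 2)
CHECK_CHARS = "10X98765432"  # indexed by (weighted sum mod 11)


def expected_check_char(first17: str) -> str:
    """Compute the check character for the first 17 digits."""
    total = sum(int(digit) * weight for digit, weight in zip(first17, WEIGHTS))
    return CHECK_CHARS[total % 11]


def is_valid_id(id_number: str) -> bool:
    """Return True if the length, digits, and check character are consistent.

    Region codes and the embedded birth date are not validated here.
    """
    if len(id_number) != 18:
        return False
    body, check = id_number[:17], id_number[17].upper()
    if not (body.isascii() and body.isdigit()):
        return False
    return check == expected_check_char(body)
```

A pure regular expression can confirm the shape (17 digits plus a digit or X) but cannot compute the weighted sum, which is why the check has to live in a function.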

Common SQL Statements in Hive

Databases. Create a database: hive> CREATE DATABASE financials; hive> CREATE DATABASE IF NOT EXISTS financials; Show existing databases: hive> SHOW DATABASES; default financials hive> CREATE DATABASE human_resources; hive> SHOW DATABASES; default financials human_resources Query databases by condition: hive> SHOW DATABASE ...

Posted by johnsonzhang on Mon, 15 Apr 2019 14:42:32 -0700

Installation and Use of Log Collection Framework Flume

1. Introduction to Flume. 1.1. Overview of Flume. Flume is a distributed, reliable, and highly available system for collecting, transmitting, and aggregating massive volumes of logs. Flume can collect files, socket data packets, and other forms of source data. It can also output the collected d ...

Posted by MichaelR on Thu, 04 Apr 2019 20:12:30 -0700

Variables, Properties and Common Commands of Hive Command Line Interface

Text after "$" is a Linux shell command, and text after "hive>" is a command in the CLI. CLI options: use --help to see the parameters of hive: $ hive --help --service cli usage: hive -d,--define <key=value> Variable subsitution to apply to hive commands. ...

Posted by Visualant on Wed, 27 Mar 2019 20:54:28 -0700