Basic Kafka Shell Commands (Including Adding, Deleting and Modifying Topics)
Create a Kafka topic
List all topics
View information about a specified topic
Produce data to a topic from the console
Consume topic data from the console
View the maximum (minimum) offset of a partition in a topic
Increase the number of topic partitions
Delete a topic (use with caution): this only deletes the metadata in ZooKeeper; the message files must be deleted m ...
Posted by kaisaj on Mon, 27 May 2019 12:14:22 -0700
More Oozie pitfalls under Hue: the workflow and coordinator finally run
Earlier posts summarized some pitfalls of Sqoop1, Oozie and HBase under Hue. With the project deadline today, the Oozie workflow and its scheduled execution had to be working on time.
1. The pitfalls of Sqoop imports from and exports to MySQL were covered in the earlier posts. Later, it was found that besides cdh(5.15), sqoop1 was not automatically configured. ...
Posted by Ree on Sat, 18 May 2019 22:13:03 -0700
UDF, UDTF and UDAF in Hive
Create a new table "apache_log" in Hive
CREATE TABLE apachelog (
host STRING,
identity STRING,
user STRING,
time STRING,
request STRING,
status STRING,
size STRING,
referer STRING,
agent STRING)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
"input.regex" = "([^ ] ...
Posted by DJH on Fri, 17 May 2019 11:36:59 -0700
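The RegexSerDe table in the entry above maps one capture group of a regular expression to each of the nine columns. A minimal Python sketch of that idea follows. The pattern is a hypothetical one written for the Apache combined log format; it is not the truncated "input.regex" value from the post, and the sample line is made up for illustration.

```python
import re

# Hypothetical pattern for the Apache combined log format.
# It is NOT the elided "input.regex" from the entry above.
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) (?P<identity>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] "(?P<request>[^"]*)" '
    r'(?P<status>\d{3}|-) (?P<size>\d+|-)'
    r'(?: "(?P<referer>[^"]*)" "(?P<agent>[^"]*)")?'
)


def parse_line(line):
    """Return a dict with one string per column, or None if the line does not match."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None


# Illustrative sample line (not taken from the post).
sample = (
    '127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] '
    '"GET /apache_pb.gif HTTP/1.0" 200 2326 '
    '"http://www.example.com/start.html" "Mozilla/4.08 [en] (Win98; I ;Nav)"'
)
row = parse_line(sample)
print(row["host"], row["status"], row["size"])
```

As in the table definition, every field is kept as a string; converting status or size to numbers would be a separate step.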
Compiling the Hive source code to support UDF functions
Table of Contents
1. Download the source code
2. Compile support for UDF
2.1 Upload and Unzip
2.2 Add UDF Function Class
2.3 Register the Function
2.4 Compile Hive
3. Deployment and installation
4. Testing UDF
Tip: To reduce the chance of Maven compilation errors on the server, you can choose to open the so ...
Posted by alant on Thu, 09 May 2019 08:26:40 -0700
Using LZO compression with split support in Hadoop
1. Introduction:
Install LZO:
LZO does not ship with Linux systems, so you need to download and install it. At least three packages are required here: lzo, lzop and hadoop-gpl-packaging.
Add index:
The main function of hadoop-gpl-packaging is to create an index for compressed LZO files. Otherwise, whether the compressed file is la ...
Posted by oaf357 on Tue, 23 Apr 2019 17:00:34 -0700
Converting the Hive database storage format to ORC
Hive storage format
textfile
Hive's default storage format
Storage: row-based
High disk overhead and high data parsing overhead
Hive cannot merge or split compressed text files
SequenceFile
A binary file in which data is serialized as key-value pairs
Storage: row-based
Splittable and compressible
Block compression is generally chosen
...
Posted by WendyLady on Sun, 21 Apr 2019 18:30:35 -0700
Strict Validation of Identity Card Numbers with a Hive UDF
At work, the ID numbers in a Hive table needed strict validation. Because the last character is a check digit, a simple regular expression cannot do the job, so a UDF is used to implement the check.
I have only implemented the function and have not optimized it in depth; comments are welcome, under the guidance of h ...
Posted by carlg on Wed, 17 Apr 2019 20:06:33 -0700
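The check-digit rule mentioned in the identity card entry above can be sketched in a few lines. This is a minimal Python illustration, not the post's UDF (which is not shown in the excerpt). It assumes the post refers to the 18-character Chinese resident identity number, whose last character follows the ISO 7064 MOD 11-2 scheme; the function names are hypothetical, and the sketch does not validate the region code or the birth date.

```python
# Weights and check characters of the ISO 7064 MOD 11-2 scheme,
# assumed here to be the rule the post's UDF implements.
WEIGHTS = (7, 9, 10, 5, 8, 4, 2, 1, 6, 3, 7, 9, 10, 5, 8, 4, 2)
CHECK_CHARS = "10X98765432"


def expected_check_char(first17):
    """Compute the check character from the first 17 digits."""
    total = sum(int(digit) * weight for digit, weight in zip(first17, WEIGHTS))
    return CHECK_CHARS[total % 11]


def is_valid_id(id_number):
    """Return True if the 18-character number has a matching check character."""
    if len(id_number) != 18:
        return False
    body, check = id_number[:17], id_number[17]
    # Only plain ASCII digits are accepted in the first 17 positions.
    if not (body.isascii() and body.isdigit()):
        return False
    return expected_check_char(body) == check.upper()


# A sample number commonly used in documentation of the standard.
print(is_valid_id("11010519491231002X"))
```

Because the check character depends on a weighted sum over all 17 preceding digits, a regular expression can verify the shape of the number but not this last position, which is why a function is needed.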
Common SQL statements in Hive
Databases
Create a database
hive> CREATE DATABASE financials;
hive> CREATE DATABASE IF NOT EXISTS financials;
Display existing databases
hive> SHOW DATABASES;
default
financials
hive> CREATE DATABASE human_resources;
hive> SHOW DATABASES;
default
financials
human_resources
Filter databases by a condition
hive> SHOW DATABASE ...
Posted by johnsonzhang on Mon, 15 Apr 2019 14:42:32 -0700
Installation and Use of Log Collection Framework Flume
1. Introduction to Flume
1.1. Overview of Flume
Flume is a distributed, reliable and highly available system for collecting, transmitting and aggregating massive volumes of log data.
Flume can collect files, socket data packets and other forms of source data.
It can also output the collected d ...
Posted by MichaelR on Thu, 04 Apr 2019 20:12:30 -0700
Variables, Properties and Common Commands of Hive Command Line Interface
Commands after "$" are Linux shell commands; commands after "hive>" are entered in the Hive CLI.
CLI options
Use --help to see the options of hive
$ hive --help --service cli
usage: hive
-d,--define <key=value> Variable subsitution to apply to hive
commands. ...
Posted by Visualant on Wed, 27 Mar 2019 20:54:28 -0700