hive sql 50 exercises

Database and table building --Build database create database test; --Build table create table student(s_id string,s_name string,s_birth string,s_sex string) row format delimited fields terminated by '\t'; create table course(c_id string,c_name string,t_id string) row format delimited fields terminated by '\t'; create table teacher(t_id string ...

Posted by sujata_ghosh on Sat, 09 Oct 2021 01:56:40 -0700

Solving the problem of Chinese garbled code in CDH hive table building comments

1. Problem description After joining the new company, the author found that hive itself did not support Chinese annotations and was garbled when he was preparing to build the data warehouse, because the current big data started late and the data warehouse was still in the primary construction stage. The cluster was built with CDH 5.10.0. show ...

Posted by quimbley on Thu, 07 Oct 2021 21:01:11 -0700

Summary of common HIVE functions

Summary of common HIVE functions There are many hive functions. In the past, it was always used and checked. Today, I will summarize the commonly used functions for easy reference in the future. This paper mainly involves the following aspects: 1. Hive function introduction and built-in function view See Hive official documents for more i ...

Posted by jamesm87 on Thu, 07 Oct 2021 00:35:22 -0700

Hive query operation

query 1. Data preparation 1.1 create table // Create department table create table if not exists dept ( deptno int, dname string, loc int ) row format delimited fields terminated by '\t'; // Create employee table create table if not exists emp ( empno int, ename string, job string, mgr int, hiredate string, ...

Posted by sean.mccoy on Wed, 06 Oct 2021 06:41:53 -0700

E-commerce data warehouse of data warehouse -- 2. Business data collection platform

1, Introduction to e-commerce business 1.1 e-commerce business process The business process of e-commerce can be illustrated by taking the browsing footprint of an ordinary user as an example. Users click the home page of e-commerce to start browsing. They may find their favorite products through classified query or full-text search. Thes ...

Posted by BANDYCANDY on Mon, 04 Oct 2021 15:39:02 -0700

Basic knowledge and use of impala

Chapter 1 basic concepts of Impala 1.1 what is Impala Cloudera provides interactive SQL query function with high performance and low latency for HDFS and HBase data. Based on Hive, it uses memory computing, takes into account data warehouse, and has the advantages of real-time, batch processing, multi concurrency and so on. It is the prefer ...

Posted by yaba on Sun, 19 Sep 2021 06:13:40 -0700

Production Optimization Practice of hive3. X on spark 3.0

1 data tilt Most tasks are completed quickly, and only one or a few tasks are executed slowly or even fail. This phenomenon is data skew.The data skew is divided into single table query with GroupBy field and two table (or multi table) Join query. 1.1 single table data skew optimization 1.1.1 the map side performs aggregation - GroupBy opera ...

Posted by ben2k8 on Thu, 16 Sep 2021 14:35:16 -0700

Presto Distributed SQL Query Engine

Introduction to Presto Presto is a distributed SQL query engine developed by Facebook for efficient and real-time data analysis.Presto can connect Hive, Mysql, Kafka and other data sources. The most common way to connect Hive data sources is through Presto, which can solve the problem that Hive's MapReduce query takes too long.Presto is a ...

Posted by wing328 on Thu, 16 Sep 2021 12:43:56 -0700

[hard Hive] Hive Foundation (19): Hive function user defined function / user defined UDF function / user defined UDTF function

Welcome to the blog home page: Wechat search: Import_ Bigdata, hard core original author in big data field_ import_bigdata_ CSDN blog Welcome to like, collect, leave messages, and exchange messages!This article was originally written by [Wang Zhiwu] and started on CSDN blog!This article is the first CSDN forum. It is strictly prohibited to ...

Posted by Garcia on Fri, 10 Sep 2021 17:28:17 -0700

[hard hive] hive advanced (15): optimize (15) Explain view execution plan

Welcome to the blog home page: Wechat search: Import_ Bigdata, hard core original author in big data field_ import_bigdata_ CSDN blog Welcome to like, collect, leave messages, and exchange messages!This article was originally written by [Wang Zhiwu] and started on CSDN blog!This article is the first CSDN forum. It is strictly prohibited to repr ...

Posted by Tobeon on Fri, 10 Sep 2021 17:10:49 -0700