HBase quickly import huge amount of data - Bulk Loading
Advantages:
Writing a large amount of data into HBase at one time through the normal write path is slow and puts heavy pressure on Region resources. A more efficient and convenient method is "Bulk Loading", that is, using the HFileOutputFormat class provided by HBase. It uses the principle that HBase data information is ...
Posted by rharter on Sun, 05 Dec 2021 20:42:35 -0800
2021SC@SDUSC hbase code analysis HFile analysis
2021SC@SDUSC hbase source code analysis (x) HFile analysis (2)
HFileBlock
An HFileBlock consists of two main parts: BlockHeader and BlockData.
BlockHeader stores Block-related metadata, and BlockData stores the actual data.
The core field in Block metadata is the BlockType ...
Posted by Dave96 on Mon, 29 Nov 2021 21:21:59 -0800
2021SC@SDUSC HBase project code analysis - snapshot
1, Brief description
Snapshots are a feature supported by many storage systems and database systems. A snapshot is an image of a whole file system or a directory at a certain point in time.
2, Basic principle
The simplest and crudest way to mirror data files is to copy them while holding a lock (the reason for locking is that the data obtai ...
Posted by ibechane on Wed, 24 Nov 2021 00:20:54 -0800
2021SC@SDUSC Code analysis of HBase project -- compressed coding of HBase
1, Brief description
Encoding plus compression can cut the disk space that data occupies by a multiple and save considerable storage costs. At the same time, compression can usually improve system throughput, so that the system can do more work. For space saving at the storage level, HBase provides two schemes: 1. Key-based encoding. I ...
Posted by Wolphie on Thu, 18 Nov 2021 19:37:18 -0800
Hbase specific operation (illustrated and super complete ~ ~ ~)
Purpose: (1) Understand the role of HBase in the Hadoop architecture. (2) Become proficient with common HBase Shell commands. Objectives: (1) Be familiar with HBase-related operations, and master creating tables, modifying tables, querying tables, deleting tables, etc. (2) You can create a table by yourself, be familiar ...
Posted by djBuilder on Wed, 17 Nov 2021 08:45:22 -0800
docker builds hbase environment
HBase is part of the Hadoop ecosystem. Building an HBase environment normally requires installing Hadoop first, then ZooKeeper, and then HBase. HBase can now be installed directly through Docker, with no separate Hadoop installation required in the container.
The installation is simple. Directly pull the image and run it.
docker run -d --name hbase -p 2181 ...
Posted by Procode on Mon, 08 Nov 2021 08:11:30 -0800
2021SC@SDUSC Hbase project code analysis - flush
In the fourth article, we explored how cacheflush initializes. Now let's look at how cacheflush handles flush requests.
Through the analysis in the previous article, we know that there are two queues and collections that store flush requests ...
Posted by archbeta on Thu, 28 Oct 2021 06:35:10 -0700
docker deploys Hadoop and HBase environment (Centos7 system)
1, Install docker
1. Download offline package
Index of linux/static/stable/x86_64/
2. Decompression
tar -xzvf docker-18.06.3-ce.tgz
(The ce version is the free Community Edition; look up the difference between Docker with and without ce for details.)
3. Copy the extracted folder to the /usr/local directory
cp docker-18.06.3-ce /usr/local ...
Posted by jenni on Tue, 26 Oct 2021 02:36:01 -0700
2021SC@SDUSC Hbase project overview
1, HBase overview
What is HBase
HBase is a database system built on HDFS that provides high reliability, high performance, column-oriented storage, scalability, and real-time reads and writes. It is mainly used to store unstructured and semi-structured loose data. HBase uses Hadoop HDFS as its file storage system, Hadoop MapReduc ...
Posted by gman-03 on Sun, 24 Oct 2021 20:13:50 -0700
Start learning big data again - Hbase - day 56 Phoenix
Phoenix overview
HBase is suitable for storing large volumes of NoSQL data with few requirements for relational operations. Because of limitations in HBase's design, the native API cannot directly perform operations such as condition judgment a ...
Posted by Johnm on Mon, 20 Sep 2021 07:15:47 -0700