HBase quickly import huge amount of data - Bulk Loading

advantage: If we store a large amount of HBase data at one time, the processing speed is slow, and the Region resources are particularly occupied, a more efficient and convenient method is to use the "Bulk Loading" method, that is, the HFileOutputFormat class provided by HBase. It uses the principle that hbase data information is ...

Posted by rharter on Sun, 05 Dec 2021 20:42:35 -0800

2021SC@SDUSC hbase code analysis HFile analysis

2021SC@SDUSC hbase source code analysis (x) HFile analysis (2) 2021SC@SDUSC 2021SC@SDUSC 2021SC@SDUSC 2021SC@SDUSC HFileBlock HFileBlock mainly includes two parts: BlockHeader and BlockData. BlockHeader mainly stores Block related metadata, and BlockData is used to store specific data. The core field in Block metadata is the BlockType ...

Posted by Dave96 on Mon, 29 Nov 2021 21:21:59 -0800

2021SC@SDUSC HBase project code analysis - snapshot

2021SC@SDUSC 1, Brief description Snapshot is a function supported by many storage systems and database systems. A snapshot is the image of a whole file system or a directory at a certain time. 2, Basic principle The simplest and crudest way to realize data file mirroring is to lock the copy (the reason for locking is that the data obtai ...

Posted by ibechane on Wed, 24 Nov 2021 00:20:54 -0800

2021SC@SDUSC Code analysis of HBase project -- compressed coding of HBase

2021SC@SDUSC 1, Brief description Coding + compression can double the disk space of data and save considerable storage costs. At the same time, shrinking can usually improve the system throughput, so that the system can do more work In terms of space saving processing at the storage level, HBase provides two schemes: 1. Key based coding. I ...

Posted by Wolphie on Thu, 18 Nov 2021 19:37:18 -0800

Hbase specific operation (illustrated and super complete ~ ~ ~)

Purpose: (1) Understand the role of HBase in Hadoop architecture. (2) Proficient in using HBase to operate common Shell commands. Objectives: (1) Be familiar with hbase related operations, and master the operations of creating tables, modifying tables, looking up tables, deleting tables, etc. (2) You can create a table by yourself, be familiar ...

Posted by djBuilder on Wed, 17 Nov 2021 08:45:22 -0800

docker builds hbase environment

    hbase is a member of hadoop ecology. To build hbase first, you need to install hadoop, then zookeeper, and then hbase. Now hbase can be installed directly through docker, and hadoop is not required in the container.     The installation is simple. Directly pull the image and run it. docker run -d --name hbase -p 2181 ...

Posted by Procode on Mon, 08 Nov 2021 08:11:30 -0800

2021SC@SDUSC Hbase project code analysis - flush

2021SC@SDUSC                  In the fourth article, we explored how cacheflush initializes. Now let's look at how cacheflush handles flush requests.          Through the analysis in the previous article, we know that there are two queues and collections that store flush requests ...

Posted by archbeta on Thu, 28 Oct 2021 06:35:10 -0700

docker deploys Hadoop and HBase environment (Centos7 system)

1, Install docker 1. Download offline package Index of linux/static/stable/x86_64/ 2. Decompression tar -xzvf docker-18.06.3-ce.tgz (ce version means community free version, please specify   The difference between docker with ce and without ce) 3. Copy the extracted folder to the / usr/local directory cp docker-18.06.3-ce /usr/local ...

Posted by jenni on Tue, 26 Oct 2021 02:36:01 -0700

2021SC@SDUSC Hbase project overview

2021SC@SDUSC 1, HBase overview What is HBase HBase is a database system built on HDFS, which provides high reliability, high performance, column storage, scalability and real-time reading and writing. It is mainly used to store unstructured and semi-structured loose data. HBase uses hadoop HDFS as its file storage system, Hadoop MapReduc ...

Posted by gman-03 on Sun, 24 Oct 2021 20:13:50 -0700

Start learning big data again - Hbase - day 56 Phoenix

Start learning big data again - Hbase - day 56 Phoenix Phoenix overview    Hbase is suitable for storing a large number of NOSQL data with low requirements for relational operations. Due to the limitations of Hbase design, it is not possible to directly use the native API to perform the operations such as condition judgment a ...

Posted by Johnm on Mon, 20 Sep 2021 07:15:47 -0700