GPDB feature practice

date: 2020-01-11 15:51:39. Some time ago, my tutor asked me to learn about the Greenplum database. I installed it and used it for a while, but it felt no different from other databases, so I left it at that. Now I am going over the features of GPDB and trying them out. In fact, I really need to understand t ...

Posted by vinoth on Tue, 14 Jan 2020 19:06:09 -0800

Installation and use of the Greenplum plug-in tablefunc (row/column conversion)

Note: this plug-in has only been demonstrated in a test environment. If you use it in production, please assess the risk yourself. Environment list: Host 1: intranet: 125.10.1.166. System configuration list: IP HOSTNAME CPU MEM DISK RELEASE 1 ...

Posted by beermaker74 on Sun, 15 Dec 2019 12:44:00 -0800

Hand-written database connection pool, with the GP connection JAR package address

Hand-written database connection and test. Recently, I had to connect to a GP database (Greenplum SQL) and establish varying numbers of connections. In fact, when you need a database connection, you can write the interface directly with Spring Data JPA, which is one approach. So, while using it, I wrote a demo to t ...

Posted by bealers on Sun, 15 Dec 2019 10:22:26 -0800

Deepgreen versus Greenplum TPC-H performance tests (using VitesseData scripts)

Two days ago I published a comparison article based on the [Texas test script], "Deepgreen vs. Greenplum TPC-H Performance Testing (Using Texas Scripts)". Because the test data set was small, the two databases tied at about 1 second in several rounds, but Deepgreen beat Greenplum in most of the tests, and some are even as fas ...

Posted by Rebelrebellious on Sat, 22 Jun 2019 13:27:02 -0700

Steps to add mirrors to Greenplum

Summary: a newly installed Greenplum cluster has only primary nodes and no mirrors, so high availability is not guaranteed and you need to add mirror nodes to the cluster. Note: when the data volume is large, adding mirrors puts heavy pressure on the disks, because data is synchronized continuously in the background, which online syste ...

Posted by jumpenjuhosaphat on Sat, 30 Mar 2019 09:18:28 -0700

PostgreSQL aggregate expressions: FILTER, ORDER, WITHIN GROUP usage

Tags: PostgreSQL, aggregation, filter, order, within group. Background: PostgreSQL has powerful analytical features, such as multi-dimensional analysis, four kinds of aggregates, window queries, recursive queries, and so on. For how to use the four major types of aggregates, please refer to <PostgreSQL ag ...

Posted by thirdeye on Mon, 11 Feb 2019 17:09:19 -0800

Compile source code and install Greenplum database

Contents: Summary; Download source packages; Install some common dependencies; Install C++11; Start compilation; Install following the normal Greenplum steps; Some details. Summary: Greenplum's binary and RPM packages have limited download channels. Sometimes it may not be easy to download the correspo ...

Posted by nazariah on Sat, 02 Feb 2019 16:21:16 -0800

Spark2 workflow scheduling with Hue-integrated Oozie workflows

I. Environment preparation: CDH 5.15.0, Spark 2.3.0, Hue 3.9.0. Note: because a CDH cluster is used, the default Spark version is 1.6.0, and Spark 2.3.0 is installed through a parcel package, so there are two Spark versions in the cluster. Hue integrates Spark 1.6. It is necessary to upload the jar package and o ...

Posted by Xorandnotor on Thu, 24 Jan 2019 10:45:13 -0800

How to resolve metadata inconsistency between the Greenplum master node and segment nodes

Greenplum is a distributed database, and its metadata often (really, often) runs into errors; a typical one is inconsistency between the metadata on the master node and on the segment nodes. This article introduces a solution using a specific error as an example. Symptom: when checking the database with gpcheckcat -p 5432 databasename, output similar to the follow ...

Posted by teejayuu on Wed, 02 Jan 2019 09:39:08 -0800

Greenplum insert performance (single-step, batch, COPY), and recommended parallel import with gpfdist and Aliyun OSS external tables

Tags: PostgreSQL, Greenplum, HybridDB for PostgreSQL, insert, copy, external tables, oss, gpfdist. Background: Greenplum is an MPP database product. Its strength is excellent OLAP performance: it supports multi-node parallel computing and real-time analysis of PB-scale data. In addition to analytical capabilities, data write ...

Posted by stargate03 on Sat, 22 Dec 2018 14:18:06 -0800