Bitmaps and Bloom filters and hash segmentation

Share today about bitmaps and Bloom filters and hash segmentation. Bitmap simply means that an integer exists or does not exist in a bit bit. It is suitable for simple search in large data to determine whether an integer exists or not. In fact, the principle of bitmap is consistent with the direct fixing method of hash table. The fixing value ...

Posted by coffeecup on Fri, 05 Jul 2019 13:00:26 -0700

Incremental Synchronization Strategy Based on Big Data Development Suite

Today we'll discuss how to use the big data development suite for incremental synchronization. We divide data that need synchronization into data that will change (personnel table, for example, personnel status will change) and data that will not change (usually log data) according to whether the data will change after writing. For these two sc ...

Posted by hey_suburbia on Mon, 01 Jul 2019 13:53:44 -0700

How MaxCompute Implements Cross-Project Authorization

In actual enterprise usage scenarios, data, functions, computing resources in different projects may need to be used together. Then how to achieve this cross-project use, this paper uses practical examples to illustrate. Get ready Project space: grant_from, where authorized data, functions, and resources are in the project Project space: grant ...

Posted by Satabi2 on Thu, 27 Jun 2019 12:57:28 -0700

HBase Coprocessor Details

1. Brief description When using HBase, if you have billions of rows or millions of columns of data, the ability to return a large amount of data in a query is subject to the bandwidth of the network, and even if the network conditions allow, the client's computing may not be able to meet the requirements.In this case, Coprocessors arise.It all ...

Posted by richard-elland on Sun, 23 Jun 2019 09:53:31 -0700

Flink Zero Foundation Practical Course: How to Calculate Real-time Hot Commodities

In the last introductory tutorial, we were able to quickly build a basic Flink program. This article will take you step by step to implement a more sophisticated Flink application: real-time hot commodities. Before starting this article, we recommend that you practice the previous article first, because this article will follow the my-flink-pr ...

Posted by ozone on Thu, 20 Jun 2019 13:17:10 -0700

Several ways of mixing python with C

Python has been in the limelight these years, occupying a lot of positions in many fields. The Web, big data, artificial intelligence, operation and maintenance all have its own image, even the graphical interface is doing very well, and even when the word full-stack came out, it seems to be to describe it. Although Python has GIL problems that ...

Posted by Xager on Tue, 21 May 2019 12:06:04 -0700

[Three Asynchronous HTTP Programming] 2. Streaming HTTP Response

Standard Response and Content-Length Header Since HTTP 1.1, in order to process multiple HTTP requests and responses in a link, the server must return the appropriate Content-Length value along with the response. By default, you do not need to return the Content-Length header for simple requests, such as: def index = Action { Ok("Hello ...

Posted by johlwiler on Sun, 19 May 2019 16:00:43 -0700

Common operations of Elastic Search: query and aggregation

[TOC] 0 description Based on es 5.4 and es 5.6, this paper lists the queries frequently used in personal work (only Java API is used in work). If you need to see the complete, you can refer to the official documents.https://www.elastic.co/guide/en/elasticsearch/reference/5.4/search.html. 1 query First use a quick introduction to introduce, a ...

Posted by cheshil on Sat, 18 May 2019 03:00:15 -0700

Configure Oracle 19c DataGuard Step By Step Guide

This paper contains the following two knowledge points. The environmental information is shown in the following figure: Deploy Data Guard service; Configuration and use of Data Guard Broker.I. Deployment of Physical Standby 1.1 Main Library Preparations 1.1.1 Enable force logging SQL> ALTER DATABASE FORCE LOGGING; 1.1.2 Create standby data ...

Posted by alin19 on Fri, 17 May 2019 08:49:27 -0700

oracle starts with three steps

There are three processes to start oracle, nomount, mount, open I. nomount stage During the nomount phase, you can see that the instance has started.The oracle process creates a shared memory pool based on the parameter file. SQL> startup nomount; ORACLE instance started. Total System Global Area 1653518336 bytes Fixed Size 22138 ...

Posted by JellyFish on Thu, 16 May 2019 16:56:20 -0700