Bitmaps and Bloom filters and hash segmentation
Share today about bitmaps and Bloom filters and hash segmentation.
Bitmap simply means that an integer exists or does not exist in a bit bit. It is suitable for simple search in large data to determine whether an integer exists or not.
In fact, the principle of bitmap is consistent with the direct fixing method of hash table. The fixing value ...
Posted by coffeecup on Fri, 05 Jul 2019 13:00:26 -0700
Incremental Synchronization Strategy Based on Big Data Development Suite
Today we'll discuss how to use the big data development suite for incremental synchronization.
We divide data that need synchronization into data that will change (personnel table, for example, personnel status will change) and data that will not change (usually log data) according to whether the data will change after writing. For these two sc ...
Posted by hey_suburbia on Mon, 01 Jul 2019 13:53:44 -0700
How MaxCompute Implements Cross-Project Authorization
In actual enterprise usage scenarios, data, functions, computing resources in different projects may need to be used together. Then how to achieve this cross-project use, this paper uses practical examples to illustrate.
Get ready
Project space: grant_from, where authorized data, functions, and resources are in the project
Project space: grant ...
Posted by Satabi2 on Thu, 27 Jun 2019 12:57:28 -0700
HBase Coprocessor Details
1. Brief description
When using HBase, if you have billions of rows or millions of columns of data, the ability to return a large amount of data in a query is subject to the bandwidth of the network, and even if the network conditions allow, the client's computing may not be able to meet the requirements.In this case, Coprocessors arise.It all ...
Posted by richard-elland on Sun, 23 Jun 2019 09:53:31 -0700
Flink Zero Foundation Practical Course: How to Calculate Real-time Hot Commodities
In the last introductory tutorial, we were able to quickly build a basic Flink program. This article will take you step by step to implement a more sophisticated Flink application: real-time hot commodities. Before starting this article, we recommend that you practice the previous article first, because this article will follow the my-flink-pr ...
Posted by ozone on Thu, 20 Jun 2019 13:17:10 -0700
Several ways of mixing python with C
Python has been in the limelight these years, occupying a lot of positions in many fields. The Web, big data, artificial intelligence, operation and maintenance all have its own image, even the graphical interface is doing very well, and even when the word full-stack came out, it seems to be to describe it.
Although Python has GIL problems that ...
Posted by Xager on Tue, 21 May 2019 12:06:04 -0700
[Three Asynchronous HTTP Programming] 2. Streaming HTTP Response
Standard Response and Content-Length Header
Since HTTP 1.1, in order to process multiple HTTP requests and responses in a link, the server must return the appropriate Content-Length value along with the response.
By default, you do not need to return the Content-Length header for simple requests, such as:
def index = Action {
Ok("Hello ...
Posted by johlwiler on Sun, 19 May 2019 16:00:43 -0700
Common operations of Elastic Search: query and aggregation
[TOC]
0 description
Based on es 5.4 and es 5.6, this paper lists the queries frequently used in personal work (only Java API is used in work). If you need to see the complete, you can refer to the official documents.https://www.elastic.co/guide/en/elasticsearch/reference/5.4/search.html.
1 query
First use a quick introduction to introduce, a ...
Posted by cheshil on Sat, 18 May 2019 03:00:15 -0700
Configure Oracle 19c DataGuard Step By Step Guide
This paper contains the following two knowledge points. The environmental information is shown in the following figure:
Deploy Data Guard service;
Configuration and use of Data Guard Broker.I. Deployment of Physical Standby
1.1 Main Library Preparations
1.1.1 Enable force logging
SQL> ALTER DATABASE FORCE LOGGING;
1.1.2 Create standby data ...
Posted by alin19 on Fri, 17 May 2019 08:49:27 -0700
oracle starts with three steps
There are three processes to start oracle, nomount, mount, open
I. nomount stage
During the nomount phase, you can see that the instance has started.The oracle process creates a shared memory pool based on the parameter file.
SQL> startup nomount;
ORACLE instance started.
Total System Global Area 1653518336 bytes
Fixed Size 22138 ...
Posted by JellyFish on Thu, 16 May 2019 16:56:20 -0700