Flink sink Elasticsearch prevents task interruption
preface
The Flink real-time computing platform has been built since half a year. Elasticsearch has been used in some storage layers and contacted Flink from scratch. In the past half a year, many pits have been encountered, which has changed from traditional development to big data development. Elasticsearch contains a variety of fuses to prev ...
Posted by banzaimonkey on Mon, 06 Dec 2021 15:50:31 -0800
Flink practical tutorial - Advanced: dimension table Association
Author: Tencent cloud flow computing Oceanus team
Introduction to flow computing Oceanus
Stream computing Oceanus is a powerful tool for real-time analysis of big data product ecosystem. It is an enterprise level real-time big data analysis platform based on Apache Flink with the characteristics of one-stop development, seamless connection, s ...
Posted by jassh on Sat, 04 Dec 2021 13:49:59 -0800
Small pit record of flink performance monitoring based on Prometheus
background
The flink cluster in the company has been running for a long time and has not been fully monitored. Recently, it plans to do this. Through online research, the deployment mode adopted by the company is per job mode. Finally, the job index is pushed to the push gateway of the intermediate gateway based on prometheus, Then prometheus ...
Posted by jtgraphic on Fri, 26 Nov 2021 16:43:59 -0800
Flink of big data
preface
stay Flink of big data (Part I) In this paper, we introduce the characteristics, architecture, two-stage submission and data flow of Flink. This paper introduces the unique operator of Flink and the case of implementing WordCount with Flink
1, split and select operators
The split operator splits a DataStream into two or more ...
Posted by hoogeebear on Mon, 22 Nov 2021 07:22:57 -0800
Principle analysis of Apache Flink CDC batch stream fusion technology
This article is reproduced from the official account of "good future technology". The use of Flink CDC 2 is introduced in the case of Flink SQL, and the core design of CDC is interpreted. The main contents are as follows:caseCore designCode explanationIn August, Flink CDC released version 2.0.0. Compared with version 1.0, it supports ...
Posted by wxflint on Wed, 10 Nov 2021 23:25:41 -0800
The third layer of Flink execution plan - JobGraph
Flink execution plan level 1 - StreamTransformation The structure of list + linked list is constructed;
Flink execution plan layer 2 - StreamGraph Transform the first layer into a graph structure;
Next, it's time to convert StreamGraph to JobGraph. StreamGraph inherits the abstract class StreamingPlan and implements the getJobGraph method:
/**
...
Posted by tetecko81sk on Wed, 10 Nov 2021 09:49:48 -0800
The second layer of Flink execution plan - StreamGraph
1, Localstreaminenvironment
Localstreaminenvironment is a subclass of streaminexecutionenvironment. It runs programs locally, multithreaded, and in the JVM that instantiates localstreaminenvironment.
It generates an embedded Flink cluster in the background and executes programs on the cluster.
When instantiating this environment, it uses the de ...
Posted by capbiker on Tue, 09 Nov 2021 00:10:46 -0800
Day79_ Flink (V) FlinkSQL and CEP
Syllabus Course content learning effect Master the goal FlinkSQL FlinkTable master FlinkSQL master FlinkCEP FlinkCEP master Task performance optimization operator chain master slot sharing master Flink asynchronous IO master Checkpoint optimization master
1, Table & SQL
(1) Overview
Table API is a ...
Posted by stereo on Mon, 08 Nov 2021 11:18:42 -0800
Processfunction API (underlying API)
The transformation operator we learned before cannot access the timestamp information and watermark information of the event. This is extremely important in some application scenarios. For example, map conversion operators such as MapFunction cannot access the timestamp or the event time of the current event.
Based on this, the DataStream API ...
Posted by byronbailey on Mon, 01 Nov 2021 22:39:01 -0700
CDH6.3.2 integration with Apache Flink1.12.0
edition: Centos7.6 JDK1.8 Scala2.11 Python2.7 Git1.8.3.1 Apache Maven3.6.3 CDH6.3.2 Apache Flink1.12.0
The above software needs to be installed in advance!!!
1, Compile Flink
1 download the flink source code
git clone https://github.com/apache/flink.git
git checkout release-1.12.0
2. Add maven image
Add the following mirrors in the m ...
Posted by mustatin on Mon, 01 Nov 2021 04:52:28 -0700