Flink Sql With 1.14 Queries - Overview

Inquiry # The SELECT statement and VALUES statement are TableEnvironment This method takes the result of the SELECT statement (or VALUES statement) as a Table ATable can be used for Subsequent SQL and Table API queries,Convert to DataStream or Write TableSink . SQL and Table API queries can be seamlessly mixed, optimized as a whole and conve ...

Posted by thientanchuong on Wed, 23 Feb 2022 15:22:09 +0100

Business Data Diversion for Flink Real-Time Projects

Python WeChat Subscription Applet Course Video https://edu.csdn.net/course/detail/36074 Python Actual Quantitative Transaction Finance System https://edu.csdn.net/course/detail/35475 In the previous article, we have obtained the output stream of business data, which is the output stream of dim layer dimension data and the output stream of dw ...

Posted by apervizi on Mon, 21 Feb 2022 18:40:24 +0100

Understanding its core concepts from hudi persistence files

[overview] This is the first article in the hudi series, which first deepens the understanding of the concept from the core concept and the stored file format, and then gradually shares the use (spark/flink into hudi, hudi synchronous hive, etc.) and principles (compression mechanism, index, clustering, etc.) [what is a d ...

Posted by QbertsBrother on Sun, 20 Feb 2022 05:10:02 +0100

Flnk full development cycle

Abstract: Apache Flink, as the most popular computing engine for batch unification, is widely used in real-time ETL, event processing, data analysis, CEP, real-time machine learning and other fields. Beginning with Flink 1.9, the Apache Flink community began to provide support for the Python language on the basis of the original Java, Scala, SQ ...

Posted by Rulkster on Fri, 18 Feb 2022 23:34:26 +0100

Flink keying state AggregatingState development example

1, Keying status description Referring to the description on the official website, several keying states are introduced as follows: ValueState: save a value that can be updated and retrieved (as mentioned above, each value corresponds to the key of the current input data, so each key received by the operator may correspond to a value). This v ...

Posted by kumar_ldh on Thu, 17 Feb 2022 17:11:52 +0100

Flink streaming API

1 Environment 1.1 getExecutionEnvironment Create an execution environment that represents the context of the current executing program. If the program is called independently, this method returns to the local execution environment; If the program is called from the command-line client to submit to the cluster, this method returns the execut ...

Posted by kristolklp on Wed, 09 Feb 2022 16:49:27 +0100

Introduction to Maxwell of Flink real-time project

1. Introduction to Maxwell Maxwell is a MySQL real-time capture software written in Java, which is open source by Zendesk in the United States. Real time reading MySQL binary log Binlog and generates JSON format messages, which are sent to Kafka, Kinesis RabbitMQ, Redis, Google Cloud Pub/Sub, file or other platform applications. Official we ...

Posted by kraen123 on Mon, 07 Feb 2022 19:54:35 +0100

[Flink] Flink computing resource management

1. General Reprint: Flink source code reading notes (6) - Computing Resource Management In Flink, computing resources are allocated with Slot as the basic unit. This paper will analyze the management mechanism of computing resources in Flink. 2. Basic concept of task slot In the previous article, we learned about the startup process of Fl ...

Posted by sssphp on Thu, 03 Feb 2022 07:53:26 +0100

[Flink] FlinkSQL metadata verification

1. General Reprint: FlinkSQL metadata validation Flink1. After 9, the CatalogManager was introduced to manage the Catalog and CatalogBaseTable. When executing DDL statements, the table information was encapsulated as CatalogBaseTable and stored in the CatalogManager. At the same time, the Schema interface of calculate is extended, so that c ...

Posted by sticks464 on Thu, 03 Feb 2022 07:29:54 +0100

[Flink] reading notes of Flink source code (19) - Implementation of flow table Join in Flink SQL

1. General Reprint: Reading notes of Flink source code (19) - Implementation of flow table Join in Flink SQL In the process of data analysis using SQL, association query is often used. In traditional OLTP and OLAP fields, the data set of association query is bounded, so it can rely on caching bounded data set for query. However, in Streamin ...

Posted by rune_sm on Thu, 03 Feb 2022 00:38:27 +0100