Installation and deployment of Flink

Installation and deployment of Flink First, open the terminal and execute the command under / home/xxx / to obtain the installation package: wget https://labfile.oss.aliyuncs.com/courses/3423/flink-1.10.0-bin-scala_2.11.tar tar -xvf flink-1.10.0-bin-scala_2.11.tar Unzip it to this directory. If you have studied Spark, you should feel that t ...

Posted by GESmithPhoto on Mon, 31 Jan 2022 13:56:59 +0100

Spark BigData Program: big data real-time stream processing log

Spark BigData Program: big data real-time stream processing log 1, Project content Write python scripts to continuously generate user behavior logs of learning websites.Start Flume to collect the generated logs.Start Kafka to receive the log received by Flume.Use Spark Streaming to consume Kafka's user logs.Spark Streaming cleans the data ...

Posted by iriedodge on Mon, 31 Jan 2022 13:43:09 +0100

Big data Hive in 2021: teach you how to understand Hive database and table operation (learn to count in seconds)

The most detailed Hive article series in the whole network, it is strongly recommended to collect and pay attention! Later, the updated articles will list the catalogue of historical articles to help you review the key points of knowledge. catalogue Series of historical articles preface Hive database and table operations 1, Database op ...

Posted by bbaker on Mon, 31 Jan 2022 04:41:33 +0100

hadoop2.6.5 Mapper class source code analysis

Mapper class // // Source code recreated from a .class file by IntelliJ IDEA // (powered by Fernflower decompiler) // package org.apache.hadoop.mapreduce; import java.io.IOException; import org.apache.hadoop.classification.InterfaceAudience.Public; import org.apache.hadoop.classification.InterfaceStability.Stable; @Public @Stable public cla ...

Posted by warydig on Mon, 31 Jan 2022 03:11:39 +0100

Advanced tutorial of Pandas: rearrangement and rotation of Dataframe

brief introduction Using Pandas's pivot method, DF can be rotated and transformed. This paper will explain the secret of pivot in detail. Using Pivot pivot is used to reorganize the DF, and use the specified index, columns and values to reconstruct the existing DF. Take a Pivot example: Through pivot change, the new DF uses the value ...

Posted by Angry Coder on Mon, 31 Jan 2022 02:35:06 +0100

UDF is used in Hive and Impala

11.1 introduction to experimental environment The cluster environment is running normallyHive and Impala services are installed in the clusterOperating system: redhat6 fiveCDH and CM versions are 5.11.1EC2 user with sudo permission is used for operation 11.2 UDF function development - using Intellij tools Use Intellij tool to develop Hive's ...

Posted by Design on Sun, 30 Jan 2022 23:08:38 +0100

es actual combat - word frequency statistics using IK word segmentation

Introduction: use IK word splitter to segment words and generate word cloud.This paper mainly introduces how to make word frequency statistics through IK word segmentation. Use the word splitter to count the word frequency of the article. The main purpose is to realize the word cloud function as shown in the figure below, and you can find the k ...

Posted by novice4eva on Sun, 30 Jan 2022 23:03:14 +0100

Some word segmentation query problems of MatchPhraseQueryBuilder and MatchQueryBuilder in es query

When query time word segmentation and storage time word segmentation are not set in es Library 1, Index with default settings Picture: 1. When querying es using MatchQueryBuilder @Test public void searchTest() throws IOException{ // SearchResponse response = service.search("name", "shoes-0-num", 0, 30 ...

Posted by ZephyrWest on Sun, 30 Jan 2022 21:46:06 +0100

es actual combat - word frequency statistics using IK word segmentation

Introduction: use IK word splitter to segment words and generate word cloud. This paper mainly introduces how to make word frequency statistics through IK word segmentation. Use the word splitter to count the word frequency of the article. The main purpose is to realize the word cloud function as shown in the figure below, and you can find the ...

Posted by guayaquil on Sun, 30 Jan 2022 19:57:04 +0100

Logstash: import RSS feed data

In the actual use process, we sometimes want to import and search RSS data. In many real micro services, a lot of data is provided in the form of RSS feed, such as our common comment websites. So is there any way to import these data into Elasticsearch and search? The answer is to use the information provided by Logstash RSS input plugin . In ...

Posted by holowugz on Sun, 30 Jan 2022 18:46:39 +0100