Installation and deployment of Flink
First, open a terminal and run the following commands under /home/xxx/ to download the installation package:
wget https://labfile.oss.aliyuncs.com/courses/3423/flink-1.10.0-bin-scala_2.11.tar
tar -xvf flink-1.10.0-bin-scala_2.11.tar
Unzip it to this directory. If you have studied Spark, you should feel that t ...
Posted by GESmithPhoto on Mon, 31 Jan 2022 13:56:59 +0100
Spark BigData Program: big data real-time stream processing log
1, Project content
Write Python scripts to continuously generate user behavior logs for a learning website.
Start Flume to collect the generated logs.
Start Kafka to receive the logs collected by Flume.
Use Spark Streaming to consume the user logs from Kafka.
Spark Streaming cleans the data ...
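The first step, a Python script that keeps producing user behavior logs, could be sketched roughly as follows. The log format, the URL paths, the status codes and the function names are all assumptions made up for illustration; the excerpt does not show the project's actual script.

```python
import random
import time
from datetime import datetime

# Hypothetical course URLs and HTTP status codes (not from the original project).
PATHS = ["/class/112.html", "/class/128.html", "/learn/821", "/course/list"]
STATUS_CODES = [200, 404, 500]


def make_log_line(rng, now):
    """Build one tab-separated log line: ip, timestamp, request, status."""
    ip = ".".join(str(rng.randint(1, 254)) for _ in range(4))
    path = rng.choice(PATHS)
    status = rng.choice(STATUS_CODES)
    return f"{ip}\t{now:%Y-%m-%d %H:%M:%S}\tGET {path} HTTP/1.1\t{status}"


def generate(count, rng=None, delay=0.0):
    """Return `count` log lines, optionally sleeping `delay` seconds between them."""
    rng = rng or random.Random()
    lines = []
    for _ in range(count):
        lines.append(make_log_line(rng, datetime.now()))
        if delay:
            time.sleep(delay)
    return lines


if __name__ == "__main__":
    # Print a small batch; a real generator would append to a file watched by Flume.
    for line in generate(5):
        print(line)
```

Running the script on a schedule, or in a loop with a non-zero `delay`, gives a steady stream of lines for a collector to pick up.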
Posted by iriedodge on Mon, 31 Jan 2022 13:43:09 +0100
Big data Hive in 2021: teach you how to understand Hive database and table operation (learn to count in seconds)
This is the most detailed Hive article series on the web; bookmarking and following it is strongly recommended!
Later articles will include a catalogue of earlier posts in the series to help you review the key points.
catalogue
Series of historical articles
preface
Hive database and table operations
1, Database op ...
Posted by bbaker on Mon, 31 Jan 2022 04:41:33 +0100
hadoop2.6.5 Mapper class source code analysis
Mapper class
//
// Source code recreated from a .class file by IntelliJ IDEA
// (powered by Fernflower decompiler)
//
package org.apache.hadoop.mapreduce;
import java.io.IOException;
import org.apache.hadoop.classification.InterfaceAudience.Public;
import org.apache.hadoop.classification.InterfaceStability.Stable;
@Public
@Stable
public cla ...
Posted by warydig on Mon, 31 Jan 2022 03:11:39 +0100
Advanced tutorial of Pandas: rearrangement and rotation of Dataframe
brief introduction
With the pandas pivot method, a DataFrame (DF) can be reshaped and rotated. This article explains how pivot works in detail.
Using Pivot
pivot reorganizes a DF: it uses the specified index, columns and values to build a new DF from the existing one.
Take a Pivot example:
Through pivot change, the new DF uses the value ...
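As a minimal sketch of the index, columns and values arguments described above, the following uses a small invented table (the column names and data are hypothetical and are not the article's own example):

```python
import pandas as pd

# Hypothetical long-format data, invented for illustration.
df = pd.DataFrame({
    "date": ["2022-01-01", "2022-01-01", "2022-01-02", "2022-01-02"],
    "city": ["A", "B", "A", "B"],
    "temp": [10, 20, 11, 21],
})

# One row per date, one column per city, cells filled from "temp".
wide = df.pivot(index="date", columns="city", values="temp")
print(wide)
```

Each distinct value of `city` becomes a column of the new DF, and each distinct `date` becomes a row label.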
Posted by Angry Coder on Mon, 31 Jan 2022 02:35:06 +0100
UDF is used in Hive and Impala
11.1 introduction to experimental environment
The cluster environment is running normally.
Hive and Impala services are installed in the cluster.
Operating system: RedHat 6.5.
CDH and CM versions are 5.11.1.
An EC2 user with sudo permission is used for the operations.
11.2 UDF function development - using Intellij tools
Use Intellij tool to develop Hive's ...
Posted by Design on Sun, 30 Jan 2022 23:08:38 +0100
ES in practice: word frequency statistics using IK word segmentation
Introduction: use the IK analyzer to segment text and generate a word cloud. This article mainly introduces how to compute word frequency statistics with IK word segmentation. The analyzer is used to count the word frequencies of an article. The main purpose is to realize the word cloud function as shown in the figure below, and you can find the k ...
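The counting step behind a word cloud can be sketched independently of Elasticsearch. The example below assumes the text has already been segmented into tokens (for instance by IK); the token list, the stopword set and the `top_terms` helper are hypothetical, and the segmentation itself is not shown.

```python
from collections import Counter


def top_terms(tokens, k, stopwords=frozenset()):
    """Return the k most frequent tokens, ignoring empty strings and stopwords."""
    counts = Counter(t for t in tokens if t and t not in stopwords)
    return counts.most_common(k)


# Hypothetical tokens, standing in for the output of a word segmenter.
tokens = ["search", "index", "search", "the", "cloud", "index", "search"]
print(top_terms(tokens, 2, stopwords={"the"}))
```

The resulting (term, count) pairs are the kind of input a word cloud renderer typically expects.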
Posted by novice4eva on Sun, 30 Jan 2022 23:03:14 +0100
Some word segmentation query problems of MatchPhraseQueryBuilder and MatchQueryBuilder in es query
When neither a query-time analyzer nor an index-time analyzer is configured for the ES index
1, Index with default settings
1. When querying es using MatchQueryBuilder
@Test
public void searchTest() throws IOException{
// SearchResponse response = service.search("name", "shoes-0-num", 0, 30 ...
Posted by ZephyrWest on Sun, 30 Jan 2022 21:46:06 +0100
ES in practice: word frequency statistics using IK word segmentation
Introduction: use IK word splitter to segment words and generate word cloud.
This paper mainly introduces how to make word frequency statistics through IK word segmentation. Use the word splitter to count the word frequency of the article. The main purpose is to realize the word cloud function as shown in the figure below, and you can find the ...
Posted by guayaquil on Sun, 30 Jan 2022 19:57:04 +0100
Logstash: import RSS feed data
In practice, we sometimes want to import and search RSS data. Many real-world microservices provide a lot of their data as RSS feeds, for example the comment websites we commonly use. So is there a way to import this data into Elasticsearch and search it? The answer is to use the Logstash RSS input plugin. In ...
Posted by holowugz on Sun, 30 Jan 2022 18:46:39 +0100