Hive: each user's current consecutive check-in days and longest consecutive check-in streak in history
Requirements
The table temp_user_login records whether each user signed in on each day (note: this table contains the attendance records of all users on all working days). It has three fields: date, user_id (the user ID), and if_login (0 = not checked in, 1 = checked in).
Question 1: count each user's number of consecutive sign-in days up to now (the output table only inc ...
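The counting logic described above can be sketched outside Hive. The following is a minimal Python illustration, not the original post's Hive query: the function name login_streaks and the sample rows are assumptions made for this example, and it relies on the stated fact that every user has one row per working day, so adjacent rows in date order are adjacent days.

```python
from collections import defaultdict


def login_streaks(rows):
    """Hypothetical helper: map user_id -> (current_streak, longest_streak).

    rows is an iterable of (date, user_id, if_login) tuples, where date is an
    ISO-formatted string and if_login is 1 (checked in) or 0 (not checked in).
    """
    by_user = defaultdict(list)
    for date, user_id, if_login in rows:
        by_user[user_id].append((date, if_login))

    result = {}
    for user_id, records in by_user.items():
        current = longest = 0
        # ISO date strings sort chronologically, so sorted() orders by day.
        for _, if_login in sorted(records):
            # A check-in extends the running streak; a miss resets it to zero.
            current = current + 1 if if_login == 1 else 0
            longest = max(longest, current)
        # After the loop, current is the streak ending on the latest day.
        result[user_id] = (current, longest)
    return result


# Made-up sample data for illustration only.
sample_rows = [
    ("2022-03-01", "u1", 1), ("2022-03-02", "u1", 1),
    ("2022-03-03", "u1", 0), ("2022-03-04", "u1", 1),
    ("2022-03-01", "u2", 0), ("2022-03-02", "u2", 1),
    ("2022-03-03", "u2", 1), ("2022-03-04", "u2", 1),
]
streaks = login_streaks(sample_rows)
print(streaks)
```

With this sample, u1 has a current streak of 1 and a longest streak of 2, while u2 has 3 for both.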
Posted by borris_uk on Thu, 10 Mar 2022 05:54:34 +0100
Flink tutorial (26) - Flink multilingual development
01 Introduction
In the previous blog post, we learned about Flink's advanced features. Interested readers can refer to the following:
Flink tutorial (01) - Flink knowledge map
Flink tutorial (02) - getting started with Flink
Flink tutorial (03) - Flink environment construction
Flink tutorial (04) - getting started with Flink
Flink tutorial (05) - ...
Posted by funguse on Wed, 09 Mar 2022 02:43:43 +0100
Flink tutorial (20) - Flink advanced features (dual stream Join)
01 Introduction
In the previous blog post, we learned about Flink's BroadcastState. Interested readers can refer to the following:
Flink tutorial (01) - Flink knowledge map
Flink tutorial (02) - getting started with Flink
Flink tutorial (03) - Flink environment construction
Flink tutorial (04) - getting started with Flink
Flink tutorial (05) - si ...
Posted by andychamberlainuk on Tue, 08 Mar 2022 16:29:36 +0100
Hive installation and SQL tutorial
Install brew
The following script installs Homebrew from a mirror hosted in China:
/bin/zsh -c "$(curl -fsSL https://gitee.com/cunkai/HomebrewCN/raw/master/Homebrew.sh)"
Install Hive
brew install hive
Executing SQL statements in Hive
Hive can translate SQL statements directly into MapReduce jobs (the link below is a very good reference)
https://geek-docs.com/hive/ ...
Posted by Teen0724 on Mon, 07 Mar 2022 23:39:51 +0100
Tez CDH5.16.4 compilation and installation
Environment preparation
CentOS 7
apache-maven-3.6.3
hadoop-2.6.0-cdh5.16.2
protobuf-2.5.0 Download: https://github.com/protocolbuffers/protobuf/releases?after=v3.0.0-alpha-4.1
apache-tez-0.9.2-src.tar.gz Download: https://dlcdn.apache.org/tez/0.9.2/
Note: if you compile on Windows, you need to install Git. In addition, protobu ...
Posted by ju8ular1 on Mon, 07 Mar 2022 04:56:29 +0100
[Python] the most detailed basic tutorial on Python in the whole network (very detailed, sorted out)
Identifiers
In Python, identifiers can include English letters (case sensitive), digits, and underscores (_), but they cannot start with a digit.
An identifier that starts with a single underscore, such as _foo, denotes a class attribute that should not be accessed directly: it should be accessed through the interface the class provides, and it is not imported by "from xxx import *". An identifier that starts with a double underscore, such as __foo, represents a private membe ...
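These naming rules can be checked directly in an interpreter. The sketch below is illustrative only: the helper is_valid_identifier and the class Account are hypothetical names that do not come from the original tutorial, while str.isidentifier and keyword.iskeyword are standard-library calls.

```python
import keyword


def is_valid_identifier(name: str) -> bool:
    """Hypothetical helper: True if name is a legal, non-reserved identifier."""
    return name.isidentifier() and not keyword.iskeyword(name)


class Account:
    """Hypothetical class used only to illustrate the underscore conventions."""

    def __init__(self):
        self._balance = 100  # single underscore: internal by convention only
        self.__pin = 1234    # double underscore: stored as _Account__pin

    def balance(self):
        # Read the internal attribute through the interface the class provides.
        return self._balance


account = Account()
print(is_valid_identifier("user_id"))   # True: letters and an underscore
print(is_valid_identifier("2nd_user"))  # False: starts with a digit
print(is_valid_identifier("class"))     # False: reserved keyword
print(account.balance())                # 100
print(hasattr(account, "__pin"))        # False: the name was mangled
print(account._Account__pin)            # 1234: the mangled name is reachable
```

The single underscore is a convention that Python does not enforce, whereas the double underscore triggers name mangling, which makes accidental access harder but does not make the attribute truly inaccessible.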
Posted by skyagh on Fri, 04 Mar 2022 22:42:00 +0100
OLAP engine: cross data source analysis based on Presto component
1, Presto overview
1. Introduction to Presto
Presto is an open source distributed SQL query engine suited to interactive analytical queries. It handles data volumes ranging from gigabytes to petabytes. Although Presto can parse SQL, it is not a database in the standard sense.
Presto supports online data query, including Hiv ...
Posted by daredevil14 on Fri, 04 Mar 2022 16:21:36 +0100
Hive tutorial (06) - Hive SerDe serialization and deserialization
01 Introduction
In the previous tutorials, you gained a preliminary understanding of Hive's data model, data types, and operation commands. Interested readers can refer to:
Hive tutorial (01) - getting to know Hive
Hive tutorial (02) - Hive installation
Hive tutorial (03) - Hive data model
Hive tutorial (04) - Hive data types
Hive tutorial (05) ...
Posted by Jackanape on Tue, 22 Feb 2022 04:24:36 +0100
Deploying MongoDB and integrating it with Hive on CDH
This article draws on the blogs of two experts, with some modifications. It is for reference only.
1, MongoDB deployment
1.1 software version
CDH6.2.1 MongoDB3.4.24 CentOS7
1.2 download
Follow the link https://www.mongodb.com/try/download/community to the official website and download the rpm files. We need to download four packages: ...
Posted by shneoh on Sun, 20 Feb 2022 06:55:58 +0100
Understanding Hudi's core concepts from its persisted files
[Overview]
This is the first article in the Hudi series. It starts by building an understanding of Hudi's core concepts through its stored file formats; the series will then gradually cover usage (writing to Hudi from Spark/Flink, syncing Hudi to Hive, etc.) and internals (compaction mechanism, indexing, clustering, etc.).
[what is a d ...
Posted by QbertsBrother on Sun, 20 Feb 2022 05:10:02 +0100