The number of consecutive check-in days of hive users and the maximum number of consecutive check-in days in history

demand Table temp_user_login, mark whether the user signs in every day (Note: this table contains the attendance records of all users on all working days), including three fields: date user id user_id if_login 0 not checked in 1 checked in Question 1: count the number of consecutive sign in days of each user up to now (the output table only inc ...

Posted by borris_uk on Thu, 10 Mar 2022 05:54:34 +0100

Flink tutorial (26) - Flink multilingual development

01 introduction In the previous blog, we learned the advanced features of Flink. Interested students can refer to the following: Flink tutorial (01) - Flink knowledge mapFlink tutorial (02) - getting started with FlinkFlink tutorial (03) - Flink environment constructionFlink tutorial (04) - getting started with FlinkFlink tutorial (05) - ...

Posted by funguse on Wed, 09 Mar 2022 02:43:43 +0100

Flink tutorial (20) - Flink advanced features (dual stream Join)

01 introduction In the previous blog, we learned about Flink's BroadcastState. Interested students can refer to the following: Flink tutorial (01) - Flink knowledge mapFlink tutorial (02) - getting started with FlinkFlink tutorial (03) - Flink environment constructionFlink tutorial (04) - getting started with FlinkFlink tutorial (05) - si ...

Posted by andychamberlainuk on Tue, 08 Mar 2022 16:29:36 +0100

HIVE installation and SQL tutorial

Install brew The following script is used for domestic brew installation: /bin/zsh -c "$(curl -fsSL https://gitee.com/cunkai/HomebrewCN/raw/master/Homebrew.sh)" Install hive brew install hive Hide execute sql statement hive can directly convert sql statements into mapreduce code (the link below is very good) https://geek-docs.com/hive/ ...

Posted by Teen0724 on Mon, 07 Mar 2022 23:39:51 +0100

Tez CDH5.16.4 compilation and installation

Environmental preparation CentOS 7 apache-maven-3.6.3 hadoop-2.6.0-cdh5.16.2 protobuf-2.5.0 Download: https://github.com/protocolbuffers/protobuf/releases?after=v3.0.0-alpha-4.1 apache-tez-0.9.2-src.tar.gz Download: https://dlcdn.apache.org/tez/0.9.2/ Note: if you use Windows environment to compile, you need to install git. In addition, protobu ...

Posted by ju8ular1 on Mon, 07 Mar 2022 04:56:29 +0100

[Python] the most detailed basic tutorial on Python in the whole network (very detailed, sorted out)

identifier In Python, all identifiers can include English (case sensitive), numbers, and underscores (), But it cannot start with a number. Start with a single underscore_ The class to be imported cannot be accessed directly through the class provided by XXX, but the class provided by foo. Double underlined__ foo represents the private membe ...

Posted by skyagh on Fri, 04 Mar 2022 22:42:00 +0100

OLAP engine: cross data source analysis based on Presto component

1, Presto overview 1. Introduction to Presto Presto is an open source distributed SQL query engine, which is suitable for interactive analysis and query. The data volume supports GB to PB bytes. Although Presto has the ability to parse SQL, it does not belong to the standard database category. Presto supports online data query, including Hiv ...

Posted by daredevil14 on Fri, 04 Mar 2022 16:21:36 +0100

Hive tutorial (06) - Hive SerDe serialization and deserialization

01 introduction In the previous tutorial, you have a preliminary understanding of Hive's data model, data types and operation commands. Interested students can refer to: Hive tutorial (01) - getting to know hiveHive tutorial (02) - hive installationHive tutorial (03) - hive data modelHive tutorial (04) - hive data typesHive tutorial (05) ...

Posted by Jackanape on Tue, 22 Feb 2022 04:24:36 +0100

MongoDB deploys and interfaces with Hive of CDH

This article, referring to the blogs of two big men, has made some modifications for reference only. 1, MongoDB deployment 1.1 software version CDH6.2.1 MongoDB3.4.24 CentOS7 1.2 download Click the link https://www.mongodb.com/try/download/community , go to the official website to download the rpm file: We need to download four package s: ...

Posted by shneoh on Sun, 20 Feb 2022 06:55:58 +0100

Understanding its core concepts from hudi persistence files

[overview] This is the first article in the hudi series, which first deepens the understanding of the concept from the core concept and the stored file format, and then gradually shares the use (spark/flink into hudi, hudi synchronous hive, etc.) and principles (compression mechanism, index, clustering, etc.) [what is a d ...

Posted by QbertsBrother on Sun, 20 Feb 2022 05:10:02 +0100