Hudi of data Lake (10): use Spark to query the data in Hudi

catalogue 0. Links to related articles 1. Environmental preparation 1.1. Build server environment 1.2. Building Maven projects and writing data 2. Maven dependence 3. Core code 3.1. Direct query 3.2. Condition query 0. Links to related articles Summary of articles on basic knowledge points of big data 1. Environmental preparation ...

Posted by Jessup on Fri, 25 Feb 2022 06:11:53 +0100

Hudi of data Lake: Hudi quick experience

catalogue 0. Links to related articles 1. Compile Hudi source code 1.1. Maven installation 1.2. Download and compile hudi 2. Install HDFS 3. Install Spark 4. Run hudi program in spark shell It mainly introduces the integrated use of Apache native Hudi, HDFS, Spark, etc 0. Links to related articles Summary of articles on basic know ...

Posted by mrman23 on Mon, 21 Feb 2022 06:05:15 +0100

Play Hudi Docker Demo based on Ubuntu -- Spark write and query

brief introduction Last article Playing Hudi Docker Demo based on Ubuntu (2) -- writing test data to Kafka Describes how to write test data to fkaka cluster. This article describes how to use Spark to consume Kafka data and write the data to HDFS. Hudi is introduced into Spark in the form of Jar package. Types of Hudi tables and queries Tabl ...

Posted by dilum on Fri, 11 Feb 2022 17:01:45 +0100