Hive compression and storage
Hive compression and storage
1, Hadoop compression configuration
1.1 MR supported compression coding
Compression coding
In order to support a variety of compression / decompression algorithms, Hadoop introduces an encoder / decoder, as shown in the following table:
encoder
Comparison of compression performance
performance comparison ...
Posted by sparrrow on Thu, 30 Dec 2021 11:08:41 +0100
PB Enterprise E-commerce Offline Warehouse Project Reality-HW
1. Using zipper table to implement DIM layer business dimension table in core transaction analysis and rollback of zipper table
ODS Layer Data Preparation
Because we need to construct our own data, we use the existing ods_trade_shops creates a new table:
create database hw;
drop table if exists hw.ods_trade_shops;
create table hw.ods_trade_s ...
Posted by ndorfnz on Wed, 22 Dec 2021 22:09:10 +0100
Daily exercise - "calculating the value of Boolean expression" LeetCode Plus member exclusive question [detailed analysis] Hive / MySQL
Hello, I'm Lao Wu. You can also call me classmate Wu. Younger friends can also call me senior brother Wu. Welcome to the world of data analysis with me and learn together!
Interested friends can pay attention to me Data analysis column , there are many high-quality articles to share with you. In addition, you are also welcome to pay attention ...
Posted by iimrii on Tue, 21 Dec 2021 23:18:07 +0100
hive database and table command summary and 5 data import and export operations summary
1. hive basic data type
2. Data type
Instance: create tables with different data types
create table test(
name string,
friends array<string>,
children map<string, int>,
address struct<street:string, city:string>
)
row format delimited fields terminated by ','
collection items terminated by '_'
ma ...
Posted by maineyak on Sat, 18 Dec 2021 13:51:37 +0100
Senior big data Development Engineer - Hive learning notes
Hive improved chapter
Use of Hive
Hive's bucket table
1. Principle of drum dividing table
Bucket splitting is a more fine-grained partition relative to partition. Hive table or partition table can further divide bucketsDivide the bucket, take the hash value of the whole data content according to a column, and determine which bucket th ...
Posted by luisluis on Wed, 08 Dec 2021 08:35:11 +0100
SQOOP installation and use
SQOOP installation and use
SQOOP installation
1. Upload and unzip
tar -zxvf sqoop-1.4.7.bin__hadoop-2.6.0.tar.gz -C /usr/local/soft/
2. Modify folder name
mv sqoop-1.4.7.bin__hadoop-2.6.0/ sqoop-1.4.7
3. Modify profile
# Switch to the sqoop profile directory
cd /usr/local/soft/sqoop-1.4.7/conf
# Copy profile and rename
cp sqoop-env ...
Posted by MaxBodine on Mon, 06 Dec 2021 23:04:46 +0100
Hive UDF < user defined functions > getting started
1, Introduction
Hive has three types of UDFs: (normal) UDF, user-defined aggregate function (UDAF), and user-defined table generating function (UDTF).
UDF: the operation acts on a single data row and produces a data row as output. Most functions, such as mathematical and string functions, fall into this category.UDAF: accepts multiple input d ...
Posted by ublapach on Fri, 03 Dec 2021 05:59:54 +0100
FlinkCDC+Hudi+Hive big data real-time basic combat into the lake
catalogue
The new architecture is integrated with the lake warehouse
1, Version Description
2, Compile and package Hudi version 0.10.0
1. Use git to clone the latest master on github
2. Compilation and packaging
3, Create a flick project
1. Main contents of POM document
2.checkpoint
3.flinkcdc code
4.hudi code (refer to the official ...
Posted by WebbDawg on Fri, 03 Dec 2021 03:42:39 +0100
Hive code analysis report: semantic analysis ⑤
2021SC@SDUSC
catalogue
summary
Supplementary description doPhase1()
getMetaData(QB, ReadEntity) analysis
summary
In the last article, I analyzed doPhase1() Function, which is the initial stage of semantic analysis. The final goal of the program is to load the ast data into QB. The main idea of doPhase1 in this stage is to recursively tra ...
Posted by clewis4343 on Wed, 01 Dec 2021 21:52:13 +0100
Hive installation configuration
1, Hive installation
1. First, use xftp to transfer apache-hive-3.1.2-bin.tar.gz to the virtual machine, unzip it to / usr/local and change the file permissions.
sudo tar -zxvf ./apache-hive-3.1.2-bin.tar.gz -C /usr/local # Unzip into / usr/local
cd /usr/local/
sudo mv apache-hive-3.1.2-bin hive # Change the folder name to hive
sudo c ...
Posted by jolly on Wed, 24 Nov 2021 18:37:14 +0100