III-3. Interaction between HBase and Hive

3.1 Comparison between HBase and Hive. [Hive] 1. Data warehouse: in essence, Hive maintains a bijective (one-to-one) mapping in MySQL for the files already stored in HDFS, which makes it convenient to manage queries with HQL. 2. Used for data analysis and cleaning: Hive is s ...

Posted by abitshort on Sun, 28 Nov 2021 11:10:16 +0100

Sqoop principle and basic application

1. Introduction to Sqoop (1) Introduction: Sqoop is an Apache tool for "transferring data between Hadoop and relational database servers". Import data: import data from MySQL and Oracle into Hadoop storage systems such as HDFS, Hive, and HBase. Export data: export data from the Hadoop file system to relation ...

Posted by bouncer on Fri, 26 Nov 2021 14:36:34 +0100

Hive installation configuration

1. Hive installation
1. First, use Xftp to transfer apache-hive-3.1.2-bin.tar.gz to the virtual machine, extract it to /usr/local, and change the file permissions.
sudo tar -zxvf ./apache-hive-3.1.2-bin.tar.gz -C /usr/local   # Extract into /usr/local
cd /usr/local/
sudo mv apache-hive-3.1.2-bin hive   # Rename the folder to hive
sudo c ...

Posted by jolly on Wed, 24 Nov 2021 18:37:14 +0100

Hive code analysis report: semantic analysis ④

2021SC@SDUSC. Contents: Overview & Review; Code analysis: method of generating QB; Summary. Overview & Review: In covering how semantic analysis generates the QB from the AST, we have so far focused on reading code, so the content has been scattered and fragmented. Now, after reviewing the flow chart of the Hive compiler, we can see that semantic ...

Posted by julieb on Tue, 23 Nov 2021 17:55:20 +0100

Flume Agent Component Matching

1. Agent Components: The components in an Agent are Source, Channel, and Sink. 1.1 Source: The Source component can handle log data of various types and formats. Common sources in Flume: avro, exec, netcat, spooling directory, taildir. Common categories and descriptions: avro: listens on an Avro port and receives Events from external Avro client streams; exec: Exec source r ...

Posted by sohdubom on Sun, 21 Nov 2021 19:51:24 +0100

Configure a Hadoop 2.7.1 + HBase 1.1.5 + MySQL 8.0.27 + Hive 1.2.1 + Sqoop 1.4.6 experimental environment in Docker

Preface: A recent course experiment requires the environment shown in the title, so the author takes this opportunity to use Docker to build a restorable, container-based environment for the experiment, and also records some errors encountered while building it. 1. Environment description ...

Posted by porco on Sun, 21 Nov 2021 00:36:54 +0100

Redis in-memory database

Redis is a high-performance in-memory key-value database. Redis is completely open source and free, and is released under the BSD license. 1. Architecture: a KV database with a single-process, single-threaded model; it is entirely memory-based and provides data persistence; its data structures are simple and its operations are simple; Using multipl ...

Posted by scarface222 on Thu, 18 Nov 2021 17:04:10 +0100

Setting up Hadoop high-availability mode on CentOS 7.4

Foreword: this is intended only for learning and discussion; the configuration files may differ from those used at work. The software used in this experiment includes: hadoop-2.6.0.tar.gz, jdk-8u161-linux-x64.tar.gz, zookeeper-3.4.5.tar.gz. Prerequisite: all steps required for a fully distributed Hadoop setup have been completed. Plan: A fully distr ...

Posted by prem on Wed, 17 Nov 2021 13:17:25 +0100

HBase configuration for standalone and pseudo-distributed modes, and solutions to common problems

Download HBase: The HBase version used by the author is 2.3.7, which is compatible with hadoop-2.10.x and hadoop-3.x. The following is the mirror download site: Click here to go to the mirror download site for HBase-2.3.7. Download th ...

Posted by Dynamis on Tue, 16 Nov 2021 17:13:12 +0100

MapReduce programming: merging and deduplicating files

Contents: 1. Problem description; 2. Code; 3. Procedure. 1. Problem description: Merge multiple input files, eliminate the duplicate content, and write the deduplicated content to a single file. Main idea: owing to the way the reduce phase works, the input value set will be automatically ...

Posted by PHP-Nut on Tue, 16 Nov 2021 11:53:04 +0100
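
For the merge-and-deduplicate problem in the MapReduce entry above, here is a minimal sketch in plain Python rather than the Hadoop API. It assumes the usual approach: each input line is emitted as a key with an empty value, the shuffle step groups identical keys, and the reducer writes each key once. The function names (map_phase, shuffle, reduce_phase, merge_and_dedup) and the sample records are hypothetical and do not come from the original post.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Emit every non-empty line as a key with an empty value."""
    for line in lines:
        record = line.strip()
        if record:
            yield record, ""


def shuffle(pairs: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Group values by key, imitating what happens between map and reduce."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped: Dict[str, List[str]]) -> List[str]:
    """Emit each key exactly once, in sorted key order, ignoring the values."""
    return sorted(grouped)


def merge_and_dedup(files: Iterable[Iterable[str]]) -> List[str]:
    """Merge several inputs (each an iterable of lines) and drop duplicates."""
    pairs = (pair for lines in files for pair in map_phase(lines))
    return reduce_phase(shuffle(pairs))


if __name__ == "__main__":
    # Hypothetical sample data standing in for two input files.
    file_a = ["20150101 x", "20150102 y", "20150103 x"]
    file_b = ["20150101 y", "20150102 y", "20150103 x"]
    for record in merge_and_dedup([file_a, file_b]):
        print(record)
```

Because identical lines collapse into one key during grouping, the reducer needs no explicit duplicate check; it only has to write each key it receives.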