Installation, configuration and use of HBase

Preface: (1) Produced by Xiao Du (special edition for the Haida Jike big data class); for use by its intended recipients. (2) Pay attention to the directories used in this tutorial and adjust them to your own setup. (3) Most directories with "~" should use the install ...

Posted by irkevin on Wed, 15 Dec 2021 19:59:14 +0100

HBase index (Phoenix secondary index)

1. Introduction to Phoenix: HBase is suited to storing large volumes of NoSQL data that need few relational operations. Because of limitations in HBase's design, the native API cannot directly perform operations such as conditional filtering and aggregation that are common in relational databa ...

Posted by natronp on Fri, 03 Dec 2021 22:04:23 +0100

III-3. Interaction between HBase and Hive

3.1 Comparison between HBase and Hive. [Hive] 1. Data warehouse: Hive essentially keeps, in MySQL, a bijective mapping to the files already stored in HDFS, so that queries can be managed conveniently with HQL. 2. Used for data analysis and cleaning: Hive is s ...

Posted by abitshort on Sun, 28 Nov 2021 11:10:16 +0100

2021SC@SDUSC HBase project code analysis - LruBlockCache

2021SC@SDUSC Contents: 1. Introduction 2. Cache levels 3. Implementation analysis of LruBlockCache 4. Implementation analysis of cache eviction. 1. Introduction: Accessing an HFile on every read is very inefficient, especially for small random reads. In order to improve th ...
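The idea of keeping recently read blocks in memory so that repeated reads skip the HFile can be illustrated with a minimal least-recently-used cache. This is a generic sketch, not HBase's LruBlockCache: the class name `LruBlockCacheSketch`, the `load_block` callback, and the entry-count capacity are all assumptions made for illustration.

```python
from collections import OrderedDict


class LruBlockCacheSketch:
    """Toy LRU cache keyed by block id (hypothetical, not the HBase class)."""

    def __init__(self, capacity):
        self.capacity = capacity      # maximum number of cached blocks
        self.blocks = OrderedDict()   # least recently used entry comes first
        self.hits = 0
        self.misses = 0

    def get(self, block_id, load_block):
        """Return the block, calling load_block(block_id) only on a miss."""
        if block_id in self.blocks:
            # Cache hit: mark the block as most recently used.
            self.blocks.move_to_end(block_id)
            self.hits += 1
            return self.blocks[block_id]
        # Cache miss: read from the slow source (the file, in this analogy).
        self.misses += 1
        data = load_block(block_id)
        self.blocks[block_id] = data
        if len(self.blocks) > self.capacity:
            # Evict the least recently used block.
            self.blocks.popitem(last=False)
        return data
```

With a capacity of 2, reading blocks a, b, a, c in that order leaves a and c cached: the second read of a refreshes it, so b is the block evicted when c arrives.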

Posted by ganlal on Thu, 25 Nov 2021 21:58:48 +0100

Configuring a Hadoop 2.7.1 + HBase 1.1.5 + MySQL 8.0.27 + Hive 1.2.1 + Sqoop 1.4.6 experimental environment in Docker

Preface: A recent course experiment requires the environment shown in the title, so the author takes this opportunity to use Docker to build a container-based experiment environment that can be rolled back, and also records some errors encountered while building it. 1. Environment description ...

Posted by porco on Sun, 21 Nov 2021 00:36:54 +0100

HBase standalone and pseudo-distributed configuration, and solutions to common problems

Download HBase: The HBase version used by the author is 2.3.7, which works with hadoop-2.10.x and hadoop-3.x. The following is the mirror download site: Click here to jump to the mirror download site for HBase-2.3.7. Download th ...

Posted by Dynamis on Tue, 16 Nov 2021 17:13:12 +0100

2021SC@SDUSC HBase project code analysis: Region load balancing in the Region mechanism

2021SC@SDUSC 1. Overview: In a distributed system, load balancing is a very important function. In HBase, a Region is a slice of a table split along the row direction; it is managed by a RegionServer and serves data reads and writes to clients. If there are too many Regions on a RegionServer, the co ...

Posted by xxreenaxx1 on Thu, 11 Nov 2021 00:13:05 +0100

2021SC@SDUSC HBase source code analysis and HLog analysis

2021SC@SDUSC HBase source code analysis (V): HLog analysis. HLog: By default, the data of every write operation (write, update and delete) is first appended to the HLog and then written to the MemStore. In most cases, the HLog will not be read, but if ...
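The "append to the log first, then update the in-memory store" ordering can be sketched in a few lines. This is a toy model, not HBase code: `WriteAheadSketch` and its methods are hypothetical names, a Python list stands in for the HLog, and a dict stands in for the MemStore. The `recover` method is an assumption about how such a log is typically used (replaying it to rebuild lost in-memory state); the excerpt itself is cut off before saying when the HLog is read.

```python
class WriteAheadSketch:
    """Toy write path: every mutation is logged before it is applied."""

    def __init__(self):
        self.log = []        # append-only list standing in for the HLog
        self.memstore = {}   # dict standing in for the MemStore

    def put(self, row, value):
        # Step 1: append the operation to the log.
        self.log.append(("put", row, value))
        # Step 2: apply it to the in-memory store.
        self.memstore[row] = value

    def delete(self, row):
        self.log.append(("delete", row, None))
        # Simplification: the row is dropped outright instead of
        # recording a delete marker.
        self.memstore.pop(row, None)

    def recover(self):
        """Rebuild in-memory state by replaying the log in order (assumed use)."""
        rebuilt = {}
        for op, row, value in self.log:
            if op == "put":
                rebuilt[row] = value
            else:
                rebuilt.pop(row, None)
        return rebuilt
```

Because the log is written before the in-memory store, anything visible in the store can be reconstructed from the log alone, which is why the normal read path never needs to consult it.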

Posted by HockeyDevil07 on Tue, 26 Oct 2021 08:13:55 +0200

Cluster deployment of Apache HBase 1.2.12 under CentOS

1. Environment dependencies. Prerequisites and installation: CentOS 7.x, JDK, Hadoop 2.7.7 cluster, zookeeper-3.4.14 cluster. A three-node ZooKeeper cluster is deployed this time; the corresponding machine hostnames are bigdata02, bigdata03 and bigdata04. The cluster planning is shown in the follo ...

Posted by sdallas411 on Sun, 24 Oct 2021 18:03:25 +0200

2021SC@SDUSC HBase project source code analysis - flush

2021SC@SDUSC Contents: 1. Introduction 2. Source code analysis. 1. Introduction: HBase is a distributed database based on the LSM model (log-structured merge-trees). Compared with the B+ tree used by ordinary Oracle indexes, a major feature of the LSM model is that it strikes a balance between reading and writing, sacrificing the performance of some ...
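A flush in an LSM-style store can be pictured as turning a full in-memory buffer into an immutable sorted run. The sketch below is a generic illustration of that idea under stated assumptions, not HBase's flush implementation: the class `LsmSketch`, the entry-count `flush_threshold`, and the in-memory lists standing in for on-disk files are all hypothetical.

```python
class LsmSketch:
    """Toy LSM store: writes hit memory, flushes produce sorted runs."""

    def __init__(self, flush_threshold):
        self.flush_threshold = flush_threshold  # entries before a flush
        self.memstore = {}                      # mutable in-memory buffer
        self.flushed = []                       # sorted runs, oldest first

    def put(self, key, value):
        # Writes only touch memory, which keeps them cheap.
        self.memstore[key] = value
        if len(self.memstore) >= self.flush_threshold:
            self.flush()

    def flush(self):
        # Freeze the buffer into a sorted, immutable run and start a new one.
        if self.memstore:
            self.flushed.append(sorted(self.memstore.items()))
            self.memstore = {}

    def get(self, key):
        # Reads check memory first, then runs from newest to oldest.
        if key in self.memstore:
            return self.memstore[key]
        for run in reversed(self.flushed):
            for k, v in run:
                if k == key:
                    return v
        return None
```

A read may have to look through several runs before finding a key, while a write never does; that asymmetry is the read/write trade-off an LSM design accepts.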

Posted by Ruski on Fri, 15 Oct 2021 20:35:07 +0200